rismedia.com
robots.txt

Robots Exclusion Standard data for rismedia.com

Resource Scan

Scan Details

Site Domain	rismedia.com
Base Domain	rismedia.com
Scan Status	Ok
Last Scan	2024-09-26T10:55:08+00:00
Next Scan	2024-10-26T10:55:08+00:00

Last Scan

Scanned	2024-09-26T10:55:08+00:00
URL	https://rismedia.com/robots.txt
Redirect	https://www.rismedia.com:443/robots.txt
Redirect Domain	www.rismedia.com
Redirect Base	rismedia.com
Domain IPs	34.102.150.204
Redirect IPs	34.102.150.204
Response IP	34.102.150.204
Found	Yes
Hash	57a6ed11f5a954611218d45dbc2e2878622c31840b1658d01a3167088e98f0e3
SimHash	28709852aa2b

Groups

*

Rule	Path
Disallow	/wp-content/cache/

baiduspider

Rule	Path
Disallow	/

yandex

Rule	Path
Disallow	/

red

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

trovitbot

Rule	Path
Disallow	/

rogerbot

Rule	Path
Disallow	/

seekport

Rule	Path
Disallow	/

*

No rules defined. All paths allowed.
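
The groups above can be checked programmatically. The following is a minimal sketch, not the site's actual file: it rebuilds a robots.txt body from the listed groups only (comments, the empty second `*` group, and the crawl-delay record are left out) and queries it with Python's standard `urllib.robotparser`. The names `GROUPS`, `render`, `build_parser`, and the agent `examplebot` are illustrative assumptions.

```python
from urllib.robotparser import RobotFileParser

# Groups as listed in the scan: user-agent token -> disallowed paths.
GROUPS = {
    "*": ["/wp-content/cache/"],
    "baiduspider": ["/"],
    "yandex": ["/"],
    "red": ["/"],
    "dotbot": ["/"],
    "trovitbot": ["/"],
    "rogerbot": ["/"],
    "seekport": ["/"],
}

# Host taken from the Redirect row of the scan.
BASE = "https://www.rismedia.com"


def render(groups):
    """Render the groups as robots.txt text, one blank line between groups."""
    blocks = []
    for agent, paths in groups.items():
        lines = [f"User-agent: {agent}"]
        lines += [f"Disallow: {path}" for path in paths]
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks) + "\n"


def build_parser(groups=GROUPS):
    """Parse the rendered text with the standard-library parser."""
    parser = RobotFileParser()
    parser.parse(render(groups).splitlines())
    return parser


parser = build_parser()

# A crawler with no group of its own falls back to the "*" group.
print(parser.can_fetch("examplebot", BASE + "/"))
print(parser.can_fetch("examplebot", BASE + "/wp-content/cache/page.html"))

# A crawler named in its own group is blocked from the whole site.
print(parser.can_fetch("baiduspider", BASE + "/"))
```

Under these assumptions, an unlisted crawler may fetch `/` but not paths under `/wp-content/cache/`, while each named crawler is denied everywhere. Python's parser matches group tokens by substring of the requesting agent name, so other parsers may treat a short token such as `red` differently.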

Other Records

Field	Value
crawl-delay	5
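
A crawler that honors this record would wait 5 seconds between requests. The sketch below reads the value with `urllib.robotparser`. The scan lists crawl-delay outside any group, so placing it in a wildcard group here is an assumption, as are the names `ROBOTS_TXT`, `delay_for`, and `examplebot`.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the directive applies to all crawlers via a "*" group.
# The scan report does not say which group, if any, it belongs to.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 5
"""


def delay_for(agent, robots_txt=ROBOTS_TXT):
    """Return the crawl delay in seconds for an agent, or 0.0 if none is set."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    delay = parser.crawl_delay(agent)
    return float(delay) if delay is not None else 0.0


print(delay_for("examplebot"))
```

A polite client could pass the returned value to `time.sleep` between consecutive requests to the same host.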

Comments

BEGIN W3TC ROBOTS
END W3TC ROBOTS
Block Baidu
http://help.baidu.com/question?prod_en=master&class=Baiduspider
Block Yandex
https://yandex.com/support/webmaster/controlling-robot/robots-txt.xml
Block Redbot
https://redbot.org/
Block Dotbot
https://wowrack.org/
Block TrovitBot
https://www.trovit.com/bot.html
Block RogerBot
https://moz.com/help/moz-procedures/crawlers/rogerbot
Block SeekPort
http://seekport.com/
Generic bot rules
