rismedia.com
robots.txt

Robots Exclusion Standard data for rismedia.com

Resource Scan

Scan Details

Site Domain rismedia.com
Base Domain rismedia.com
Scan Status Ok
Last Scan2024-09-26T10:55:08+00:00
Next Scan 2024-10-26T10:55:08+00:00

Last Scan

Scanned2024-09-26T10:55:08+00:00
URL https://rismedia.com/robots.txt
Redirect https://www.rismedia.com:443/robots.txt
Redirect Domain www.rismedia.com
Redirect Base rismedia.com
Domain IPs 34.102.150.204
Redirect IPs 34.102.150.204
Response IP 34.102.150.204
Found Yes
Hash 57a6ed11f5a954611218d45dbc2e2878622c31840b1658d01a3167088e98f0e3
SimHash 28709852aa2b

Groups

*

Rule Path
Disallow /wp-content/cache/

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

red

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

seekport

Rule Path
Disallow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Comments

  • BEGIN W3TC ROBOTS
  • END W3TC ROBOTS
  • Block Baidu
  • http://help.baidu.com/question?prod_en=master&class=Baiduspider
  • Block Yandex
  • https://yandex.com/support/webmaster/controlling-robot/robots-txt.xml
  • Block Redbot
  • https://redbot.org/
  • Block Dotbot
  • https://wowrack.org/
  • Block TrovitBot
  • https://www.trovit.com/bot.html
  • Block RogerBot
  • https://moz.com/help/moz-procedures/crawlers/rogerbot
  • Block SeekPort
  • http://seekport.com/
  • Generic bot rules