schema-root.org
robots.txt

Robots Exclusion Standard data for schema-root.org

Resource Scan

Scan Details

Site Domain schema-root.org
Base Domain schema-root.org
Scan Status Ok
Last Scan 2024-09-07T18:50:54+00:00
Next Scan 2024-10-07T18:50:54+00:00

Last Scan

Scanned 2024-09-07T18:50:54+00:00
URL https://schema-root.org/robots.txt
Domain IPs 66.147.228.100
Response IP 66.147.228.100
Found Yes
Hash 62d909c5cd9b18a6f9c1b094d7c0bb291cd269e0ca12ebb3438ad5a48fd09025
SimHash 6999c2c2d335
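
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The following is a minimal sketch, under that assumption, of how a locally saved copy of the file could be compared against the reported value. The function names robots_digest and matches_report are hypothetical, and whether the scanner hashes the raw response body or a normalized form is not stated, so a mismatch does not by itself prove the file changed.

```python
import hashlib

# Value copied from the Hash field of the report.
REPORTED_HASH = "62d909c5cd9b18a6f9c1b094d7c0bb291cd269e0ca12ebb3438ad5a48fd09025"


def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw bytes.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a local copy of the file with the reported digest."""
    return robots_digest(body) == reported.lower()
```

A caller would read the saved file in binary mode and pass the bytes to matches_report.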

Groups

*

Rule Path
Disallow /~
Disallow /_
Disallow /p/
Allow /_stacks/
Disallow */?*
Allow */?branch_index
Allow /?inc=about.html

Other Records

Field Value
crawl-delay 60
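
The Allow and Disallow rules in the * group above overlap, so the outcome for a given path depends on how a crawler resolves conflicts. The sketch below assumes the longest-match behaviour described in RFC 9309 (the most specific rule wins, Allow wins a tie, * matches any run of characters, a trailing $ anchors the end). The report does not say how any particular crawler evaluates this file, and the names RULES, pattern_to_regex and is_allowed are hypothetical. The crawl-delay record is not evaluated here, since it is not part of RFC 9309.

```python
import re

# Rules of the "*" group, in the order listed in the report.
RULES = [
    ("disallow", "/~"),
    ("disallow", "/_"),
    ("disallow", "/p/"),
    ("allow", "/_stacks/"),
    ("disallow", "*/?*"),
    ("allow", "*/?branch_index"),
    ("allow", "/?inc=about.html"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into an anchored-at-start regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path (including any query string) may be fetched."""
    best_len = -1
    best_allow = True  # No matching rule means the path is allowed.
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            # Pattern length stands in for rule specificity (an approximation).
            length = len(pattern)
            allow = kind == "allow"
            if length > best_len or (length == best_len and allow):
                best_len, best_allow = length, allow
    return best_allow
```

Under these assumptions, is_allowed("/_stacks/site.css") is True because Allow /_stacks/ is longer than Disallow /_, while is_allowed("/page/?x=1") is False because only Disallow */?* matches.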

panscient.com

Rule Path
Disallow /

Comments

  • User-agent: bingbot
  • Disallow: /
  • User-agent: yandex
  • Disallow: /
  • User-Agent: MJ12bot
  • Crawl-Delay: 15