theoreocat.com
robots.txt

Robots Exclusion Standard data for theoreocat.com

Resource Scan

Scan Details

Site Domain theoreocat.com
Base Domain theoreocat.com
Scan Status Ok
Last Scan 2025-11-02T20:36:31+00:00
Next Scan 2025-11-09T20:36:31+00:00

Last Scan

Scanned 2025-11-02T20:36:31+00:00
URL https://theoreocat.com/robots.txt
Redirect https://www.theoreocat.com/robots.txt
Redirect Domain www.theoreocat.com
Redirect Base theoreocat.com
Domain IPs 199.34.228.72
Redirect IPs 199.34.228.72
Response IP 199.34.228.72
Found Yes
Hash ab45e4f762f1a7b93dddc588c4baf682f6928e31f0bbf3f1e6c089bbef8720ed
SimHash 4854dc762f93
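
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the following is only a minimal sketch under the assumption that the value is a SHA-256 digest of the raw response body. The function names sha256_hex and matches_reported_hash are hypothetical helpers, not part of the scanner.

```python
import hashlib

# Value copied from the "Hash" field of this report.
REPORTED_HASH = "ab45e4f762f1a7b93dddc588c4baf682f6928e31f0bbf3f1e6c089bbef8720ed"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body against a reported digest.

    Assumption: the report hashes the unmodified response body with
    SHA-256. If the scanner normalizes the file first, this will not match.
    """
    return sha256_hex(body) == reported.strip().lower()
```

A later fetch of the file could be passed to matches_reported_hash to see whether the content changed since this scan, provided the assumption about the algorithm holds.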

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /https%3A//theoreocat-shop.fourthwall.com/en-cad
Disallow /https%3A//beacons.ai/theoreocat

Other Records

Field Value
sitemap https://www.TheOreoCat.com/sitemap.xml
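
The groups above can be checked with Python's standard urllib.robotparser module. The robots.txt text below is a reconstruction from the listed groups, not the original file: its exact wording, ordering, and any comments are assumptions. The agent name "examplebot" is hypothetical and stands in for any crawler that falls under the * group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report (assumption: the real file
# may differ in ordering, spacing, and comments).
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: dotbot
Crawl-delay: 10

User-agent: *
Disallow: /ajax/
Disallow: /apps/
Disallow: /https%3A//theoreocat-shop.fourthwall.com/en-cad
Disallow: /https%3A//beacons.ai/theoreocat

Sitemap: https://www.TheOreoCat.com/sitemap.xml
"""

BASE = "https://www.theoreocat.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# nerdybot is blocked from the whole site.
nerdybot_home = parser.can_fetch("nerdybot", BASE + "/")

# dotbot has no Disallow rules, only a crawl delay.
dotbot_ajax = parser.can_fetch("dotbot", BASE + "/ajax/")
dotbot_delay = parser.crawl_delay("dotbot")

# Any other agent (hypothetical "examplebot") falls under the * group.
other_ajax = parser.can_fetch("examplebot", BASE + "/ajax/x")
other_about = parser.can_fetch("examplebot", BASE + "/about")

# Sitemap records are independent of the user-agent groups.
sitemaps = parser.site_maps()
```

Because dotbot has its own group, it is not subject to the Disallow rules in the * group; a named group replaces the wildcard group for that agent rather than adding to it.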