historycrunch.com
robots.txt

Robots Exclusion Standard data for historycrunch.com

Resource Scan

Scan Details

Site Domain historycrunch.com
Base Domain historycrunch.com
Scan Status Ok
Last Scan 2024-11-14T01:13:15+00:00
Next Scan 2024-11-21T01:13:15+00:00

Last Scan

Scanned 2024-11-14T01:13:15+00:00
URL https://historycrunch.com/robots.txt
Redirect https://www.historycrunch.com/robots.txt
Redirect Domain www.historycrunch.com
Redirect Base historycrunch.com
Domain IPs 199.34.228.68
Redirect IPs 199.34.228.68
Response IP 199.34.228.68
Found Yes
Hash 751c22c2a3a5ede40be64e730a86c151652041c7fcc08ddfbe4704b5b13fcb50
SimHash aa5cdc442793

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /about.html

Other Records

Field Value
sitemap https://www.historycrunch.com/sitemap.xml
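
Example

The groups above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT text is a reconstruction from the rules listed in this report, not the actual file contents (its exact bytes, ordering, and comments are unknown, so it would not reproduce the hash shown above). The helper name allowed and the user agent "examplebot" are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups listed in this report.
# The real robots.txt may differ in ordering, comments, and whitespace.
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: dotbot
Crawl-delay: 10

User-agent: *
Disallow: /ajax/
Disallow: /apps/
Disallow: /about.html

Sitemap: https://www.historycrunch.com/sitemap.xml
"""

BASE_URL = "https://www.historycrunch.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    # "examplebot" is a hypothetical crawler that falls under the "*" group.
    for agent, path in [
        ("nerdybot", "/"),
        ("dotbot", "/ajax/data"),
        ("examplebot", "/ajax/data"),
        ("examplebot", "/about.html"),
        ("examplebot", "/index.html"),
    ]:
        print(agent, path, allowed(agent, path))
    print("dotbot crawl-delay:", parser.crawl_delay("dotbot"))
    print("sitemaps:", parser.site_maps())
```

In this parser, a crawler that matches a named group (such as dotbot) is evaluated against that group only, so the "*" rules do not apply to it. This is consistent with the report's note that dotbot has no rules defined and all paths are allowed. Other robots.txt parsers may resolve groups differently.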