historycrunch.com
robots.txt

Robots Exclusion Standard data for historycrunch.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	historycrunch.com
Base Domain	historycrunch.com
Scan Status	Ok
Last Scan	2024-11-14T01:13:15+00:00
Next Scan	2024-11-21T01:13:15+00:00

Last Scan

Scanned	2024-11-14T01:13:15+00:00
URL	https://historycrunch.com/robots.txt
Redirect	https://www.historycrunch.com/robots.txt
Redirect Domain	www.historycrunch.com
Redirect Base	historycrunch.com
Domain IPs	199.34.228.68
Redirect IPs	199.34.228.68
Response IP	199.34.228.68
Found	Yes
Hash	751c22c2a3a5ede40be64e730a86c151652041c7fcc08ddfbe4704b5b13fcb50
SimHash	aa5cdc442793

Groups

nerdybot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

dotbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

*

Rule	Path
Disallow	/ajax/
Disallow	/apps/
Disallow	/about.html

Rule

Path

Disallow

/ajax/

Disallow

/apps/

Disallow

/about.html

Back to top

Other Records

Field	Value
sitemap	https://www.historycrunch.com/sitemap.xml

Field

Value

sitemap

https://www.historycrunch.com/sitemap.xml

Back to top

historycrunch.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

nerdybot

dotbot

Other Records

*

Other Records

historycrunch.com
robots.txt