classicalarchives.com
robots.txt

Robots Exclusion Standard data for classicalarchives.com

Resource Scan

Scan Details

Site Domain classicalarchives.com
Base Domain classicalarchives.com
Scan Status Ok
Last Scan2024-11-14T21:42:54+00:00
Next Scan 2024-11-21T21:42:54+00:00

Last Scan

Scanned2024-11-14T21:42:54+00:00
URL https://classicalarchives.com/robots.txt
Redirect https://www.classicalarchives.com/robots.txt
Redirect Domain www.classicalarchives.com
Redirect Base classicalarchives.com
Domain IPs 54.225.144.208
Redirect IPs 184.72.253.232, 2600:1f18:2314:6b00:bf4e:12fc:61a6:df16
Response IP 184.72.253.232
Found Yes
Hash 9d4135abb946ec1a162119b90e1b2f7edee29a58e5d56578e75faa4980292f55
SimHash 69beec65cf93

Groups

*

Rule Path
Disallow /r/
Disallow /m/
Disallow /d/
Disallow /anx/
Disallow /cgi-bin/
Disallow /submit/

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.classicalarchives.com/sitemap.xml