historyarchive.org
robots.txt

Robots Exclusion Standard data for historyarchive.org

Resource Scan

Scan Details

Site Domain historyarchive.org
Base Domain historyarchive.org
Scan Status Ok
Last Scan2026-01-02T14:06:14+00:00
Next Scan 2026-01-09T14:06:14+00:00

Last Scan

Scanned2026-01-02T14:06:14+00:00
URL https://historyarchive.org/robots.txt
Domain IPs 192.64.113.82
Response IP 192.64.113.82
Found Yes
Hash c0a3c655e7eb66180764f59ac1ffce3826a3362359448ec17224c1d8dd5ee765
SimHash 286c2c40cf53

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /m/
Disallow /mobile/

Other Records

Field Value
sitemap https://historyarchive.org/sitemaps/sitemap.xml

Comments

  • Sitemap: https://historyarchive.org/sitemaps/sitemap-images.xml