arsoncole.com
robots.txt

Robots Exclusion Standard data for arsoncole.com

Resource Scan

Scan Details

Site Domain arsoncole.com
Base Domain arsoncole.com
Scan Status Ok
Last Scan2026-01-23T11:36:56+00:00
Next Scan 2026-02-22T11:36:56+00:00

Last Scan

Scanned2026-01-23T11:36:56+00:00
URL https://arsoncole.com/robots.txt
Domain IPs 2a00:41c0:94:231:94::146, 94.231.94.146
Response IP 94.231.94.146
Found Yes
Hash fd89c1d892dc05d3c15cfd56ac6179de3a1b8a9304992965c8c418f242c26916
SimHash 2959188486d3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /admin/
Disallow /temp/

webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea

Rule Path
Disallow /

Other Records

Field Value
sitemap http://ArsonCole.com