santaclausind.org
robots.txt

Robots Exclusion Standard data for santaclausind.org

Resource Scan

Scan Details

Site Domain santaclausind.org
Base Domain santaclausind.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-09T20:03:52+00:00
Next Scan 2025-12-08T20:03:52+00:00

Last Successful Scan

Scanned2024-04-25T19:56:58+00:00
URL https://santaclausind.org/robots.txt
Domain IPs 104.26.0.79, 104.26.1.79, 172.67.70.2, 2606:4700:20::681a:14f, 2606:4700:20::681a:4f, 2606:4700:20::ac43:4602
Response IP 104.26.0.79
Found Yes
Hash 0df58aeb5ed13c35f1dc468e60009692415f44bcbcbd24a68fd682f6e1dd9fab
SimHash 2200de326331

Groups

*

Rule Path
Disallow /csvs/
Disallow /orig_scratch/
Allow /
Allow /images/
Allow /galleria_photos/

Other Records

Field Value
crawl-delay 10