4archive.org
robots.txt

Robots Exclusion Standard data for 4archive.org

Resource Scan

Scan Details

Site Domain 4archive.org
Base Domain 4archive.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-07-08T00:46:59+00:00
Next Scan 2025-10-06T00:46:59+00:00

Last Successful Scan

Scanned2024-02-21T22:24:02+00:00
URL https://4archive.org/robots.txt
Domain IPs 104.21.11.163, 172.67.166.108, 2606:4700:3031::ac43:a66c, 2606:4700:3037::6815:ba3
Response IP 172.67.166.108
Found Yes
Hash 1ed8bf43d18203a554d80c7b6ae059beb752e3d6fdf99e77c7d1f4f896a25986
SimHash 0101cc24cb97

Groups

*

Rule Path
Disallow
Disallow /404.php
Disallow /report.php
Disallow /admin
Disallow /admin/*

Other Records

Field Value
sitemap https://4archive.org/sitemap.xml

Warnings

  • `host` is not a known field.