4archive.org
robots.txt
Robots Exclusion Standard data for 4archive.org
Resource Scan
Scan Details
Site Domain | 4archive.org |
Base Domain | 4archive.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-07-08T00:46:59+00:00 |
Next Scan | 2025-10-06T00:46:59+00:00 |
Last Successful Scan
Scanned | 2024-02-21T22:24:02+00:00 |
URL | https://4archive.org/robots.txt |
Domain IPs | 104.21.11.163, 172.67.166.108, 2606:4700:3031::ac43:a66c, 2606:4700:3037::6815:ba3 |
Response IP | 172.67.166.108 |
Found | Yes |
Hash | 1ed8bf43d18203a554d80c7b6ae059beb752e3d6fdf99e77c7d1f4f896a25986 |
SimHash | 0101cc24cb97 |
Groups
*
Rule | Path |
---|---|
Disallow | |
Disallow | /404.php |
Disallow | /report.php |
Disallow | /admin |
Disallow | /admin/* |
Other Records
Field | Value |
---|---|
sitemap | https://4archive.org/sitemap.xml |
Warnings
- `host` is not a known field.