Robots Exclusion Standard data for file.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | file.org |
Base Domain | file.org |
Scan Status | Ok |
Last Scan | 2025-08-07T00:13:08+00:00 |
Next Scan | 2025-08-14T00:13:08+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-08-07T00:13:08+00:00 |
URL | https://file.org/robots.txt |
Domain IPs | 104.26.0.228, 104.26.1.228, 172.67.69.28, 2606:4700:20::681a:1e4, 2606:4700:20::681a:e4, 2606:4700:20::ac43:451c |
Response IP | 104.26.1.228 |
Found | Yes |
Hash | 589c46045361a0bb4fe7681766e10b02aefb5a56ccd825788e6926a38ce2fc55 |
SimHash | 090d5d35ecb0 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | |
Disallow | /gsearch.html |
Disallow | /updatecheck/ |
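
The rules above can be read as follows: the empty `Disallow` restricts nothing, while the other two block `/gsearch.html` and everything under `/updatecheck/`. The Python sketch below illustrates that reading with a simplified longest-prefix match in the spirit of RFC 9309. The names `RULES` and `is_allowed` are hypothetical, wildcard (`*`) and end-anchor (`$`) patterns are not handled, and a real crawler may evaluate these rules differently.

```python
# Rules copied from the "*" group in the table above, as (rule, path) pairs.
# RULES and is_allowed are illustrative names, not part of any library.
RULES = [
    ("disallow", ""),
    ("disallow", "/gsearch.html"),
    ("disallow", "/updatecheck/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Simplified longest-prefix matching: the matching rule with the
    longest path wins, and "allow" wins a tie. An empty path matches
    nothing, so an empty Disallow restricts nothing. If no rule
    matches, the path is allowed.
    """
    best_len = -1
    allowed = True
    for rule, pattern in rules:
        if not pattern:
            continue  # empty Disallow: no restriction
        if not path.startswith(pattern):
            continue
        longer = len(pattern) > best_len
        tie_allow = len(pattern) == best_len and rule == "allow"
        if longer or tie_allow:
            best_len = len(pattern)
            allowed = rule == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/", "/gsearch.html", "/updatecheck/latest", "/updatecheck"]:
        print(p, is_allowed(p))
```

Note that `/updatecheck` without the trailing slash is not covered by the `/updatecheck/` rule under plain prefix matching.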
Other Records
Field | Value |
---|---|
sitemap | https://file.org/combined-sitemap.xml |