file.org
robots.txt

Robots Exclusion Standard data for file.org

Resource Scan

Scan Details

Site Domain file.org
Base Domain file.org
Scan Status Ok
Last Scan 2025-08-07T00:13:08+00:00
Next Scan 2025-08-14T00:13:08+00:00

Last Scan

Scanned 2025-08-07T00:13:08+00:00
URL https://file.org/robots.txt
Domain IPs 104.26.0.228, 104.26.1.228, 172.67.69.28, 2606:4700:20::681a:1e4, 2606:4700:20::681a:e4, 2606:4700:20::ac43:451c
Response IP 104.26.1.228
Found Yes
Hash 589c46045361a0bb4fe7681766e10b02aefb5a56ccd825788e6926a38ce2fc55
SimHash 090d5d35ecb0

Groups

*

Rule Path
Disallow (empty)
Disallow /gsearch.html
Disallow /updatecheck/

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /
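
To illustrate how the groups above might be applied to a request, here is a minimal sketch of longest-match rule evaluation in Python. It is not the scanner's or any crawler's actual implementation. The `GROUPS` table is transcribed from the listing above, while `select_group`, `is_allowed`, and the `ExampleBot` user agent are hypothetical names introduced for illustration. The sketch assumes plain prefix matching (no `*` or `$` wildcards), treats an empty Disallow as restricting nothing, and matches user-agent tokens by case-insensitive substring, which is a simplification.

```python
# Groups as listed in the scan above: user-agent token -> list of (rule, path).
GROUPS = {
    "*": [
        ("disallow", ""),
        ("disallow", "/gsearch.html"),
        ("disallow", "/updatecheck/"),
    ],
    "mj12bot": [("disallow", "/")],
    "semrushbot": [("disallow", "/")],
}


def select_group(user_agent: str):
    """Return the rules of the first named group whose token occurs in the
    user agent (case-insensitive); fall back to the '*' group."""
    ua = user_agent.lower()
    for token, rules in GROUPS.items():
        if token != "*" and token in ua:
            return rules
    return GROUPS.get("*", [])


def is_allowed(user_agent: str, path: str) -> bool:
    """Evaluate a path with longest-prefix matching (assumption: no wildcards)."""
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for rule, prefix in select_group(user_agent):
        if prefix == "":
            continue  # an empty Disallow restricts nothing
        if path.startswith(prefix):
            longer = len(prefix) > best_len
            tie_prefers_allow = len(prefix) == best_len and rule == "allow"
            if longer or tie_prefers_allow:
                best_len = len(prefix)
                allowed = rule == "allow"
    return allowed


if __name__ == "__main__":
    for ua, path in [
        ("ExampleBot/1.0", "/index.html"),
        ("ExampleBot/1.0", "/gsearch.html"),
        ("ExampleBot/1.0", "/updatecheck/latest"),
        ("MJ12bot/v1.4.8", "/index.html"),
    ]:
        print(ua, path, is_allowed(ua, path))
```

Under these assumptions, a generic crawler is blocked only from /gsearch.html and anything under /updatecheck/, while mj12bot and semrushbot are blocked from the whole site.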

Other Records

Field Value
sitemap https://file.org/combined-sitemap.xml