fileformats.org
robots.txt

Robots Exclusion Standard data for fileformats.org

Resource Scan

Scan Details

Site Domain fileformats.org
Base Domain fileformats.org
Scan Status Ok
Last Scan 2024-11-15T08:00:55+00:00
Next Scan 2024-11-22T08:00:55+00:00

Last Scan

Scanned 2024-11-15T08:00:55+00:00
URL https://fileformats.org/robots.txt
Domain IPs 104.21.64.40, 172.67.175.236, 2606:4700:3030::6815:4028, 2606:4700:3033::ac43:afec
Response IP 104.21.64.40
Found Yes
Hash b8af68f6aef9b4877a36cd2cbea475fb8990f3f02cbca96967829553dc0ff737
SimHash 65039a01c290

Groups

*

Rule Path
Disallow /download/
Disallow /pt/download/
Disallow /es/download/
Disallow /ru/download/
Disallow /ja/download/
Disallow /zh-cn/download/
Disallow /de/download/
Disallow /ko/download/
Disallow /it/download/
Disallow /fr/download/
Disallow /search/
Disallow /pt/search/
Disallow /es/search/
Disallow /ru/search/
Disallow /ja/search/
Disallow /zh-cn/search/
Disallow /de/search/
Disallow /ko/search/
Disallow /it/search/
Disallow /fr/search/
Disallow /search-online/
Disallow /pt/search-online/
Disallow /es/search-online/
Disallow /ru/search-online/
Disallow /ja/search-online/
Disallow /zh-cn/search-online/
Disallow /de/search-online/
Disallow /ko/search-online/
Disallow /it/search-online/
Disallow /fr/search-online/
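
The rules above follow a regular pattern: three sections (download, search, search-online), each disallowed at the site root and under nine language prefixes. As a minimal sketch of how a crawler might apply them, the following rebuilds an equivalent rule set and checks a few URLs with Python's standard urllib.robotparser. The reconstructed robots.txt text, the agent name "ExampleBot", the helper is_allowed, and the sample URLs are assumptions for illustration; the literal file served by the site is not reproduced in this report and may differ in wording or order.

```python
from urllib.robotparser import RobotFileParser

# Language prefixes and sections as they appear in the rule list above.
# The empty prefix stands for the site root.
LANG_PREFIXES = ["", "/pt", "/es", "/ru", "/ja", "/zh-cn", "/de", "/ko", "/it", "/fr"]
SECTIONS = ["download", "search", "search-online"]

# 3 sections x 10 prefixes = 30 Disallow paths.
RULES = [f"{prefix}/{section}/" for section in SECTIONS for prefix in LANG_PREFIXES]

# Assumed reconstruction of the file: one group for all user agents.
ROBOTS_TXT = "User-agent: *\n" + "\n".join(f"Disallow: {path}" for path in RULES) + "\n"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(url, agent="ExampleBot"):
    """Return True if the (hypothetical) agent may fetch the URL under these rules."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for url in (
        "https://fileformats.org/",
        "https://fileformats.org/search/?q=png",
        "https://fileformats.org/zh-cn/download/example",
        "https://fileformats.org/searching",
    ):
        print(url, is_allowed(url))
```

Matching is by path prefix, so a path such as /searching is not covered by the /search/ rule, while anything beneath /search/ is.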