fileformat.info
robots.txt
Robots Exclusion Standard data for fileformat.info
Resource Scan
Scan Details
Site Domain | fileformat.info |
Base Domain | fileformat.info |
Scan Status | Ok |
Last Scan | 2024-11-15T13:57:30+00:00 |
Next Scan | 2024-11-22T13:57:30+00:00 |
Last Scan
Scanned | 2024-11-15T13:57:30+00:00 |
URL | https://fileformat.info/robots.txt |
Redirect | https://www.fileformat.info/robots.txt |
Redirect Domain | www.fileformat.info |
Redirect Base | fileformat.info |
Domain IPs | 104.21.3.2, 172.67.129.246, 2606:4700:3031::ac43:81f6, 2606:4700:3035::6815:302 |
Redirect IPs | 104.21.3.2, 172.67.129.246, 2606:4700:3031::ac43:81f6, 2606:4700:3035::6815:302 |
Response IP | 172.67.129.246 |
Found | Yes |
Hash | 89d5c0c67d7c361290542ddefa2b366e3482cc96e344e477e9a3e95dc06c1232 |
SimHash | 0cf8cae39851 |
Groups
*
Rule | Path |
---|---|
Disallow | /_ |
Disallow | /about/feed |
Disallow | /about/javad |
Disallow | /about/webal |
Disallow | /down |
Disallow | /mirror/news |
Disallow | /other/bookm |
Disallow | /security |
Disallow | /user |
Disallow | /honeypot.txt |
Disallow | /format/unipage/sample/ |
Other Records
Field | Value |
---|---|
sitemap | http://www.fileformat.info/sitemap.xml |
Warnings
- `clean-param` is not a known field.
Comments