alltheweb.com
robots.txt
Robots Exclusion Standard data for alltheweb.com
Resource Scan
Scan Details
Site Domain | alltheweb.com |
Base Domain | alltheweb.com |
Scan Status | Ok |
Last Scan | 2024-10-21T09:14:15+00:00 |
Next Scan | 2024-11-20T09:14:15+00:00 |
Last Scan
Scanned | 2024-10-21T09:14:15+00:00 |
URL | https://alltheweb.com/robots.txt |
Redirect | https://search.yahoo.com/robots.txt |
Redirect Domain | search.yahoo.com |
Redirect Base | yahoo.com |
Domain IPs | 13.248.158.7, 76.223.84.192 |
Redirect IPs | 106.10.218.137, 2406:2000:e4:1404::3000 |
Response IP | 106.10.218.137 |
Found | Yes |
Hash | 2cda20bb212150ee16e2de139e655e7f444ad77cf8b5bc33286c645d21acccb8 |
SimHash | 74057b52c280 |
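
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes are hashed, so the following is only a sketch of how such a fingerprint could be used to detect changes between scans. It assumes SHA-256 over the raw response body. The function names `body_fingerprint` and `has_changed` are hypothetical, and the SimHash field is not reproduced here because its algorithm is not stated either.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt body.

    Assumption: SHA-256 over the unmodified response bytes. The scanner
    that produced this report may hash something else.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return body_fingerprint(body) != previous_hash.lower()
```

A scanner could store the digest from one scan and call `has_changed` on the next fetch to decide whether the rule groups need to be re-parsed.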
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /search |
Disallow | /bin |
Disallow | /language |
Disallow | /yhs |
Disallow | /aol |
Disallow | /reviews |
Disallow | /click |
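
To illustrate how a crawler might evaluate the rules in this group, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from the table above and is an assumption: the file actually served may contain comments, further groups, or a different ordering. The agent name `ExampleBot` and the helper `is_allowed` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table in this report (assumption: the real
# file may differ in comments, ordering, or additional groups).
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
Disallow: /bin
Disallow: /language
Disallow: /yhs
Disallow: /aol
Disallow: /reviews
Disallow: /click
"""

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` may fetch `path` under the rules above.

    "ExampleBot" is a hypothetical agent name; it has no group of its
    own, so it falls back to the `*` group.
    """
    return parser.can_fetch(agent, path)


for path in ("/", "/search?p=test", "/click/abc", "/preferences"):
    print(path, is_allowed(path))
```

The standard-library parser matches rules by path prefix, so `/search` also covers paths such as `/search/images`. Other crawlers may interpret the same rules slightly differently, for example by supporting wildcards.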