engadget.com
robots.txt
Robots Exclusion Standard data for engadget.com
Resource Scan
Scan Details
Site Domain | engadget.com |
Base Domain | engadget.com |
Scan Status | Ok |
Last Scan | 2024-04-30T21:55:45+00:00 |
Next Scan | 2024-05-07T21:55:45+00:00 |
Last Scan
Scanned | 2024-04-30T21:55:45+00:00 |
URL | https://engadget.com/robots.txt |
Redirect | https://www.engadget.com/robots.txt |
Redirect Domain | www.engadget.com |
Redirect Base | engadget.com |
Domain IPs | 13.248.158.7, 76.223.84.192 |
Redirect IPs | 106.10.236.137, 2406:2000:e4:1605::1000 |
Response IP | 106.10.236.137 |
Found | Yes |
Hash | c73c2b0ceeaef3fe11e8e4a829c72bab734245ea2d8bc19bb90ecd7e4714cdd3 |
SimHash | ed011a04c2b0 |
Groups
*
Rule | Path |
---|---|
Disallow | /forward |
Disallow | /traffic |
Disallow | /mm_track |
Disallow | /tag/expire-images* |
Disallow | /_remote |
Disallow | /_td_api |
Disallow | /_td |
Disallow | /_uac/adpage.html |