Robots Exclusion Standard data for newsnow.co.uk
Resource Scan
Scan Details
Site Domain | newsnow.co.uk |
Base Domain | newsnow.co.uk |
Scan Status | Ok |
Last Scan | 2024-11-14T05:04:14+00:00 |
Next Scan | 2024-11-21T05:04:14+00:00 |
Last Scan
Scanned | 2024-11-14T05:04:14+00:00 |
URL | https://newsnow.co.uk/robots.txt |
Redirect | https://www.newsnow.co.uk/robots.txt?utm_source=newsnow&utm_campaign=domains&utm_medium=web&utm_content=newsnow.co.uk |
Redirect Domain | www.newsnow.co.uk |
Redirect Base | newsnow.co.uk |
Domain IPs | 149.6.126.132, 213.146.191.132 |
Redirect IPs | 149.6.126.132, 213.146.191.132 |
Response IP | 149.6.126.132 |
Found | Yes |
Hash | 3752a990be65dfe45e1f93d855beba01a2f76c9a22f54f2c97c148bf3bd2de85 |
SimHash | c2055081c632 |
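
The Hash field above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched robots.txt body. The report does not name the algorithm, so that is an assumption here. Under that assumption, a minimal Python sketch for checking whether a freshly fetched copy still matches the reported value could look like this (`sha256_hex` and `matches_reported_hash` are illustrative names, not part of the scan tool):

```python
import hashlib

# Value copied from the "Hash" row of the scan report.
REPORTED_HASH = "3752a990be65dfe45e1f93d855beba01a2f76c9a22f54f2c97c148bf3bd2de85"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched body against the reported hash.

    Assumption: the report hashes the exact bytes of the response body
    with SHA-256. If the scanner normalizes the content first, or uses a
    different algorithm, this comparison will not match.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch would only indicate that the bytes differ from what was hashed at scan time, or that the assumption about the algorithm is wrong.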
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /h/*?p= |
Disallow | /h/*%26p%3D |
Disallow | /cgi-bin |
Disallow | /livefeed |
Disallow | /A |
Disallow | /share |
Disallow | /cgi/NGoto |
Disallow | /brand-new-look.html |
Disallow | /reg/* |
Disallow | /housead* |
Disallow | /http%3A* |
Disallow | /https%3A* |
Disallow | /ico/1.gif |
Disallow | /pharos.js* |
Disallow | /test-please-ignore/ |
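
Several of the Disallow paths above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. As a rough illustration of how the rules could be evaluated, here is a minimal sketch that treats each rule as a prefix match with `*` standing for any run of characters and a trailing `$` anchoring the end of the path, in the style of RFC 9309. The function names are hypothetical, and the sketch does not normalize percent-encoding, so paths are compared exactly as written.

```python
import re

# Disallow paths copied from the "*" group of the scan report.
DISALLOW_RULES = [
    "/h/*?p=",
    "/h/*%26p%3D",
    "/cgi-bin",
    "/livefeed",
    "/A",
    "/share",
    "/cgi/NGoto",
    "/brand-new-look.html",
    "/reg/*",
    "/housead*",
    "/http%3A*",
    "/https%3A*",
    "/ico/1.gif",
    "/pharos.js*",
    "/test-please-ignore/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the
    end of the path. Everything else is matched literally.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if the path (including any query string) matches a rule.

    re.match anchors at the start of the string, which gives the prefix
    semantics robots.txt rules use. The group has no Allow rules, so no
    longest-match tie-breaking is needed in this sketch.
    """
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

For example, `is_disallowed("/h/Business?p=2")` is true because of the `/h/*?p=` rule, while `is_disallowed("/h/Business")` is false. Matching is case-sensitive, so the `/A` rule blocks `/About` but not `/about`.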