newsnow.co.uk
robots.txt

Robots Exclusion Standard data for newsnow.co.uk

Resource Scan

Scan Details

Site Domain newsnow.co.uk
Base Domain newsnow.co.uk
Scan Status Ok
Last Scan2024-11-14T05:04:14+00:00
Next Scan 2024-11-21T05:04:14+00:00

Last Scan

Scanned2024-11-14T05:04:14+00:00
URL https://newsnow.co.uk/robots.txt
Redirect https://www.newsnow.co.uk/robots.txt?utm_source=newsnow&utm_campaign=domains&utm_medium=web&utm_content=newsnow.co.uk
Redirect Domain www.newsnow.co.uk
Redirect Base newsnow.co.uk
Domain IPs 149.6.126.132, 213.146.191.132
Redirect IPs 149.6.126.132, 213.146.191.132
Response IP 149.6.126.132
Found Yes
Hash 3752a990be65dfe45e1f93d855beba01a2f76c9a22f54f2c97c148bf3bd2de85
SimHash c2055081c632

Groups

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /classifieds/

*

Rule Path
Disallow /h/*?p=
Disallow /h/*%26p%3D
Disallow /cgi-bin
Disallow /livefeed
Disallow /A
Disallow /share
Disallow /cgi/NGoto
Disallow /brand-new-look.html
Disallow /reg/*
Disallow /housead*
Disallow /http%3A*
Disallow /https%3A*
Disallow /ico/1.gif
Disallow /pharos.js*
Disallow /test-please-ignore/

Comments

  • All robots all dirs