newsquest.co.uk
robots.txt

Robots Exclusion Standard data for newsquest.co.uk

Resource Scan

Scan Details

Site Domain newsquest.co.uk
Base Domain newsquest.co.uk
Scan Status Ok
Last Scan2024-10-29T16:55:28+00:00
Next Scan 2024-11-28T16:55:28+00:00

Last Scan

Scanned2024-10-29T16:55:28+00:00
URL https://newsquest.co.uk/robots.txt
Redirect https://www.newsquest.co.uk/robots.txt
Redirect Domain www.newsquest.co.uk
Redirect Base newsquest.co.uk
Domain IPs 93.174.10.10
Redirect IPs 93.174.10.10
Response IP 93.174.10.10
Found Yes
Hash f1d2f795b9552fbded1ed945c12502dd0fc0b3a6f33d184c52d54f8c8ef75d50
SimHash 413099727fb1

Groups

*

Rule Path
Disallow /cpresources/

Comments

  • robots.txt
  • live - don't allow web crawlers to index cpresources/ or vendor/