archant.co.uk
robots.txt

Robots Exclusion Standard data for archant.co.uk

Resource Scan

Scan Details

Site Domain archant.co.uk
Base Domain archant.co.uk
Scan Status Ok
Last Scan2024-10-06T06:01:23+00:00
Next Scan 2024-11-05T06:01:23+00:00

Last Scan

Scanned2024-10-06T06:01:23+00:00
URL https://www.archant.co.uk/robots.txt
Redirect https://www.newsquest.co.uk/robots.txt
Redirect Domain www.newsquest.co.uk
Redirect Base newsquest.co.uk
Domain IPs 13.107.246.59, 2620:1ec:bdf::59
Redirect IPs 93.174.10.10
Response IP 93.174.10.10
Found Yes
Hash f1d2f795b9552fbded1ed945c12502dd0fc0b3a6f33d184c52d54f8c8ef75d50
SimHash 413099727fb1

Groups

*

Rule Path
Disallow /cpresources/

Comments

  • robots.txt
  • live - don't allow web crawlers to index cpresources/ or vendor/