canterbury.co.uk
robots.txt

Robots Exclusion Standard data for canterbury.co.uk

Resource Scan

Scan Details

Site Domain canterbury.co.uk
Base Domain canterbury.co.uk
Scan Status Ok
Last Scan 2024-09-27T00:15:27+00:00
Next Scan 2024-10-04T00:15:27+00:00
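
The two scan timestamps above are exactly seven days apart, which suggests a weekly rescan. The report does not state the interval; it is inferred here from these two values only. A minimal Python sketch of the arithmetic:

```python
from datetime import datetime, timedelta

# Timestamps copied from the scan details above.
last_scan = datetime.fromisoformat("2024-09-27T00:15:27+00:00")
next_scan = datetime.fromisoformat("2024-10-04T00:15:27+00:00")

# Gap between the two scans; an inferred value, not one the report states.
scan_interval = next_scan - last_scan

# Sanity check that the interval is a whole number of days.
assert scan_interval == timedelta(days=scan_interval.days)
```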

Last Scan

Scanned 2024-09-27T00:15:27+00:00
URL https://canterbury.co.uk/robots.txt
Redirect https://www.canterbury.co.uk/robots.txt
Redirect Domain www.canterbury.co.uk
Redirect Base canterbury.co.uk
Domain IPs 51.89.232.38
Redirect IPs 51.89.232.38
Response IP 51.89.232.38
Found Yes
Hash eca7c815dea9e81bd811e9c140a322d3646dc08a4b93110fbccf6e6ef3790dd9
SimHash 330e7a01ce80
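
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm or say exactly which bytes were hashed, so the sketch below is an assumption: it shows how a digest of that shape could be computed from a response body. The helper name `sha256_hex` and the sample body are hypothetical.

```python
import hashlib

# Hash value copied from the report above.
REPORTED_HASH = "eca7c815dea9e81bd811e9c140a322d3646dc08a4b93110fbccf6e6ef3790dd9"

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()

# Hypothetical sample body, NOT the real robots.txt content.
sample_body = b"User-agent: *\nDisallow: /bin/\n"
digest = sha256_hex(sample_body)

# The digest has the same shape as the reported value (64 hex characters);
# it will not match the reported value because the input is only a sample.
same_shape = len(digest) == len(REPORTED_HASH)
```

Comparing a fresh digest against the stored one is a cheap way to detect that a robots.txt file changed between scans, whichever hash function is actually in use.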

Groups

*

Rule Path
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /data/
Disallow /install/
Disallow /masterpages/
Disallow /python/
Disallow /scripts/
Disallow /umbraco/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /xslt/

ninjabot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /
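
The three groups above can be checked with Python's standard `urllib.robotparser`. This is a sketch only: the rules are reconstructed from the listing in this report rather than fetched from the live file, and the crawler name `ExampleBot`, the paths `/news/` and `/umbraco/login`, and the helper `allowed` are hypothetical test values.

```python
from urllib.robotparser import RobotFileParser

# Rules reconstructed from the groups listed above (not the raw file).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /aspnet_client/",
    "Disallow: /bin/",
    "Disallow: /config/",
    "Disallow: /data/",
    "Disallow: /install/",
    "Disallow: /masterpages/",
    "Disallow: /python/",
    "Disallow: /scripts/",
    "Disallow: /umbraco/",
    "Disallow: /umbraco_client/",
    "Disallow: /usercontrols/",
    "Disallow: /xslt/",
    "",
    "User-agent: ninjabot",
    "Disallow: /",
    "",
    "User-agent: petalbot",
    "Disallow: /",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse in-memory lines; no network access

def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, path)

# Hypothetical crawler and paths, used only to exercise each group.
generic_public = allowed("ExampleBot", "/news/")          # falls under "*"
generic_admin = allowed("ExampleBot", "/umbraco/login")   # blocked by /umbraco/
petal_public = allowed("PetalBot", "/news/")              # blocked site-wide
```

With this parser, a crawler that matches a named group (ninjabot or petalbot) is governed by that group alone, and every other crawler falls back to the `*` group.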

Other Records

Field Value
sitemap https://{HTTP_HOST}/sitemap.xml
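
The sitemap value above contains the literal token `{HTTP_HOST}`, which appears to be an unexpanded placeholder rather than a real hostname. A crawler that takes the line at face value would not get a usable URL. The sketch below shows one way a consumer might substitute a host; the function name `expand_sitemap` is hypothetical, and the choice of `www.canterbury.co.uk` (the redirect domain recorded in this report) is an assumption. Whether a sitemap actually exists at the resulting address is not verified here.

```python
# Sitemap value copied from the record above.
SITEMAP_TEMPLATE = "https://{HTTP_HOST}/sitemap.xml"

def expand_sitemap(template: str, host: str) -> str:
    """Replace the {HTTP_HOST} token with a concrete hostname.

    Plain string replacement; how the site itself was meant to expand
    the token is not shown in the report.
    """
    return template.replace("{HTTP_HOST}", host)

# Assumption: use the redirect domain from the scan details.
sitemap_url = expand_sitemap(SITEMAP_TEMPLATE, "www.canterbury.co.uk")
```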

Comments

  • robots.txt
  • Sitemap location
  • Exclude Files From All Robots:
  • End robots.txt file