thewalkingdiary.co.uk
robots.txt

Robots Exclusion Standard data for thewalkingdiary.co.uk

Resource Scan

Scan Details

Site Domain thewalkingdiary.co.uk
Base Domain thewalkingdiary.co.uk
Scan Status Ok
Last Scan2025-10-13T23:00:37+00:00
Next Scan 2025-10-20T23:00:37+00:00

Last Scan

Scanned2025-10-13T23:00:37+00:00
URL https://thewalkingdiary.co.uk/robots.txt
Redirect https://www.thewalkingdiary.co.uk/robots.txt
Redirect Domain www.thewalkingdiary.co.uk
Redirect Base thewalkingdiary.co.uk
Domain IPs 104.21.81.119, 172.67.189.82, 2606:4700:3033::6815:5177, 2606:4700:3037::ac43:bd52
Redirect IPs 104.21.81.119, 172.67.189.82, 2606:4700:3033::6815:5177, 2606:4700:3037::ac43:bd52
Response IP 172.67.189.82
Found Yes
Hash ae1396a682f7b4310eac7d8130f021c63f10ecb4ba60d7343d2820e190850703
SimHash 41601d527f93

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /search
Disallow /*?*
Disallow /?*

Other Records

Field Value
sitemap https://www.thewalkingdiary.co.uk/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.thewalkingdiary.co.uk/
  • live - don't allow web crawlers to index cpresources/ or vendor/