orca.org.uk
robots.txt

Robots Exclusion Standard data for orca.org.uk

Resource Scan

Scan Details

Site Domain orca.org.uk
Base Domain orca.org.uk
Scan Status Ok
Last Scan 2026-01-05T00:43:52+00:00
Next Scan 2026-02-04T00:43:52+00:00

Last Scan

Scanned 2026-01-05T00:43:52+00:00
URL https://orca.org.uk/robots.txt
Domain IPs 104.26.2.26, 104.26.3.26, 172.67.72.165, 2606:4700:20::681a:21a, 2606:4700:20::681a:31a, 2606:4700:20::ac43:48a5
Response IP 104.26.3.26
Found Yes
Hash 37e134ca3554336e742d93907b3ed01ae3a5c630718c6a496172fc2368767349
SimHash 03701d566f92
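
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body. The helper names sha256_hex and matches_report are hypothetical and only illustrate how a fetched copy could be compared against the reported value.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported_hash: str) -> bool:
    """Compare a fetched robots.txt body with the hash shown in the report.

    Assumption: the report hashes the unmodified response body with SHA-256.
    If the scanner normalises the file first, the digests will not match.
    """
    return sha256_hex(body) == reported_hash.strip().lower()
```

A mismatch would not necessarily mean the file changed; it may only mean the assumption about the algorithm or the hashed bytes is wrong.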

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
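
The group above can be checked against sample paths with Python's standard urllib.robotparser. This is a minimal sketch: the rule lines are reconstructed from the table, so the exact layout of the original file is an assumption, and the agent name ExampleBot and the helper is_allowed are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Rules reconstructed from the "*" group in this report. The original
# file's ordering, comments, and blank lines are assumed, not copied.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /cpresources/",
    "Disallow: /vendor/",
    "Disallow: /.env",
    "Disallow: /cache/",
]

BASE_URL = "https://orca.org.uk"


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the reconstructed rules permit `agent` to fetch `path`."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)  # parse in-memory lines; no network request
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    for sample in ["/", "/about", "/vendor/autoload.php", "/.env", "/cache/x"]:
        print(sample, is_allowed(sample))
```

The rules are prefix matches, so /vendor/ blocks everything under that directory while a path such as /about remains fetchable.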

Other Records

Field Value
sitemap https://orca.org.uk/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://orca.org.uk/
  • live - don't allow web crawlers to index cpresources/ or vendor/