croquet.org.uk
robots.txt

Robots Exclusion Standard data for croquet.org.uk

Resource Scan

Scan Details

Site Domain croquet.org.uk
Base Domain croquet.org.uk
Scan Status Ok
Last Scan 2026-02-25T09:40:08+00:00
Next Scan 2026-03-27T09:40:08+00:00

Last Scan

Scanned 2026-02-25T09:40:08+00:00
URL https://croquet.org.uk/robots.txt
Redirect https://www.croquet.org.uk/robots.txt
Redirect Domain www.croquet.org.uk
Redirect Base croquet.org.uk
Domain IPs 104.21.94.77, 172.67.220.233, 2606:4700:3030::ac43:dce9, 2606:4700:3037::6815:5e4d
Redirect IPs 104.21.94.77, 172.67.220.233, 2606:4700:3030::ac43:dce9, 2606:4700:3037::6815:5e4d
Response IP 172.67.220.233
Found Yes
Hash 77823149bed110a6fcd9fa1f5a815afb60068cd9f87c3d07781a51dabf1eb66c
SimHash b007cfa27578

Groups

*

Rule Path
Disallow /history/archives/2001/
Disallow /history/archives/2002/
Disallow /history/archives/2003/
Disallow /history/archives/2004/
Disallow /history/archives/2005/
Disallow /history/archives/2006/
Disallow /history/archives/2007/
Disallow /history/archives/2008/
Disallow /private/
Disallow /dbase/
Disallow /secf/
Disallow /cert2005-2006/
Disallow /cert/
Disallow /_notes/
Disallow /pages/
Disallow /temp/
Disallow /aspnet_client/
Disallow /infra/cache/
Disallow /infra/docs/
Disallow /infra/gazette/
Disallow /W3SVC181/
Disallow /SQLBackups/
Disallow /*games/clubs/details
Disallow /*tournament/caCalendar
Disallow /*tournament/caEvents
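
The last three rules use a `*` wildcard, which simple prefix matching does not handle. The following is a minimal sketch of how a crawler might test a URL path against the rules listed above, assuming the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path, and matching is case-sensitive). The helper names `pattern_to_regex` and `is_disallowed` are illustrative and not part of any library.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/history/archives/2001/", "/history/archives/2002/",
    "/history/archives/2003/", "/history/archives/2004/",
    "/history/archives/2005/", "/history/archives/2006/",
    "/history/archives/2007/", "/history/archives/2008/",
    "/private/", "/dbase/", "/secf/", "/cert2005-2006/", "/cert/",
    "/_notes/", "/pages/", "/temp/", "/aspnet_client/",
    "/infra/cache/", "/infra/docs/", "/infra/gazette/",
    "/W3SVC181/", "/SQLBackups/",
    "/*games/clubs/details",
    "/*tournament/caCalendar",
    "/*tournament/caEvents",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    Assumed semantics: '*' matches any run of characters and a trailing
    '$' anchors the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if the path matches any Disallow pattern.

    re.match anchors at the start of the path, so each pattern behaves
    as a prefix rule unless it ends with '$'.
    """
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for path in ("/private/members.html",
                 "/games/clubs/details?id=3",
                 "/history/archives/2009/"):
        print(path, "->", "blocked" if is_disallowed(path) else "allowed")
```

Because this group contains only Disallow rules, the sketch does not need to resolve Allow/Disallow conflicts; a fuller matcher would have to pick the most specific matching rule.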

Comments

  • robots.txt for www.croquet.org.uk
  • This file restricts webCrawlers (e.g. Google, Altavista...) from certain parts of the site
  • The first line is a required header for some sites.
  • Text following a hash on any line represents a comment
  • All Crawlers
  • Disallow: /scripts/
  • Lines added 2024-08-24 to prevent the server being overloaded