chriscloete.com
robots.txt

Robots Exclusion Standard data for chriscloete.com

Resource Scan

Scan Details

Site Domain chriscloete.com
Base Domain chriscloete.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-06-19T12:34:34+00:00
Next Scan 2025-07-19T12:34:34+00:00

Last Successful Scan

Scanned2025-05-14T12:04:25+00:00
URL https://www.chriscloete.com/robots.txt
Domain IPs 69.22.188.40, 69.22.188.41
Response IP 69.22.188.41
Found Yes
Hash b8a28f1677bd13a5671808ef138ca29adccc36688db4991f3cf50d3e590140ce
SimHash d01eca8aef4c

Groups

*

Rule Path
Disallow /adm/
Disallow /ajax/
Disallow /com/
Disallow /ext/
Disallow /ltr/
Disallow /mem/
Disallow /mu/
Disallow /pp/
Disallow /ezp/
Disallow /cart/
Disallow /c/*/login
Disallow /c/*/signup*
Disallow /fees

mj12bot

Rule Path
Disallow

twitterbot

Rule Path
Disallow

petalbot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

Comments

  • ROBOTS.TXT FOR PHOTOSHELTER.COM
  • Was disallowed because it was overly aggressive
  • access re-enabled on May 30, 2013
  • User-agent: ia_archiver
  • Disallow: /