chriscloete.com
robots.txt

Robots Exclusion Standard data for chriscloete.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	chriscloete.com
Base Domain	chriscloete.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-06-19T12:34:34+00:00
Next Scan	2025-07-19T12:34:34+00:00

Last Successful Scan

Scanned	2025-05-14T12:04:25+00:00
URL	https://www.chriscloete.com/robots.txt
Domain IPs	69.22.188.40, 69.22.188.41
Response IP	69.22.188.41
Found	Yes
Hash	b8a28f1677bd13a5671808ef138ca29adccc36688db4991f3cf50d3e590140ce
SimHash	d01eca8aef4c

Groups

*

Rule	Path
Disallow	/adm/
Disallow	/ajax/
Disallow	/com/
Disallow	/ext/
Disallow	/ltr/
Disallow	/mem/
Disallow	/mu/
Disallow	/pp/
Disallow	/ezp/
Disallow	/cart/
Disallow	/c/*/login
Disallow	/c//signup
Disallow	/fees

Rule

Path

Disallow

/adm/

Disallow

/ajax/

Disallow

/com/

Disallow

/ext/

Disallow

/ltr/

Disallow

/mem/

Disallow

/mu/

Disallow

/pp/

Disallow

/ezp/

Disallow

/cart/

Disallow

/c/*/login

Disallow

/c/*/signup*

Disallow

/fees

mj12bot

Rule	Path
Disallow

Rule

Path

Disallow

twitterbot

Rule	Path
Disallow

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

riddler

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

ROBOTS.TXT FOR PHOTOSHELTER.COM
Was disallowed because it was overly aggressive
access re-enabled on May 30, 2013
User-agent: ia_archiver
Disallow: /

chriscloete.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

mj12bot

twitterbot

petalbot

riddler

baiduspider

ahrefsbot

amazonbot

claudebot

Comments

chriscloete.com
robots.txt