ciarandoyle.com
robots.txt

Robots Exclusion Standard data for ciarandoyle.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ciarandoyle.com
Base Domain	ciarandoyle.com
Scan Status	Ok
Last Scan	2026-01-13T04:12:55+00:00
Next Scan	2026-02-12T04:12:55+00:00

Last Scan

Scanned	2026-01-13T04:12:55+00:00
URL	https://ciarandoyle.com/robots.txt
Domain IPs	104.21.86.7, 172.67.213.184, 2606:4700:3030::6815:5607, 2606:4700:3030::ac43:d5b8
Response IP	104.21.86.7
Found	Yes
Hash	0ba9a350f1a40ea1fdfceeb22d0bfe1280aa757a9f1b1e0d094d6a8913811930
SimHash	103f5870d6e3

Groups

*

Rule	Path
Disallow	/wp-admin/
Disallow	/wp-login.php
Disallow	/cart/
Disallow	/checkout/
Disallow	/my-account/
Disallow	/thank-you/
Disallow	/private/
Disallow	/cgi-bin/
Disallow	/tmp/
Allow	/wp-admin/admin-ajax.php
Allow	/images/

Rule

Path

Disallow

/wp-admin/

Disallow

/wp-login.php

Disallow

/cart/

Disallow

/checkout/

Disallow

/my-account/

Disallow

/thank-you/

Disallow

/private/

Disallow

/cgi-bin/

Disallow

/tmp/

Allow

/wp-admin/admin-ajax.php

Allow

/images/

chatgpt-user

Rule	Path
Allow	/

Rule

Path

Allow

/

oai-searchbot

Rule	Path
Allow	/

Rule

Path

Allow

/

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

semrushbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://ciarandoyle.com/sitemap.xml

Field

Value

sitemap

https://ciarandoyle.com/sitemap.xml

Back to top

Comments

Allow beneficial AI crawlers
Block OpenAI's GPTBot (does not power ChatGPT Search)
Block known aggressive bots
Block Common Crawl (used for model training, not traffic)

Back to top

ciarandoyle.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

chatgpt-user

oai-searchbot

gptbot

ahrefsbot

Other Records

semrushbot

Other Records

ccbot

Other Records

Comments

ciarandoyle.com
robots.txt