cdromance.org
robots.txt

Robots Exclusion Standard data for cdromance.org

Resource Scan

Scan Details

Site Domain cdromance.org
Base Domain cdromance.org
Scan Status Ok
Last Scan 2026-02-24T09:04:30+00:00
Next Scan 2026-03-03T09:04:30+00:00

Last Scan

Scanned 2026-02-24T09:04:30+00:00
URL https://cdromance.org/robots.txt
Domain IPs 104.21.31.170, 172.67.178.176, 2606:4700:3031::ac43:b2b0, 2606:4700:3036::6815:1faa
Response IP 172.67.178.176
Found Yes
Hash e92b601e0b1618f25a0a10b64828767746a4122d61ebcfe26575b7f0d2115554
SimHash 183a08c0c451

Groups

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /wp-content/plugins/cdr-main/public/
Disallow /*?fbclid=
Disallow /*?ref=
Disallow /*ep_filter_language%3D
Disallow /*ep_filter_genre%3D
Disallow /*ep_filter_region%3D
Disallow /*ep_filter_source%3D
Disallow /*ep_filter_
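
The Allow and Disallow rules in the "*" group overlap (for example, /wp-admin/admin-ajax.php sits under /wp-admin/), so the result for a given path depends on how a crawler resolves conflicts. The sketch below assumes the longest-match convention described in RFC 9309: the matching rule with the longest pattern wins, Allow wins a tie, and a path matching no rule is allowed. The names RULES, pattern_to_regex, and is_allowed are illustrative and not part of the scan data, and percent-encoding normalization is left out for brevity.

```python
import re

# Rules copied from the "*" group above as (directive, path pattern) pairs.
RULES = [
    ("allow", "/wp-content/uploads/"),
    ("allow", "/wp-content/themes/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-content/plugins/cdr-main/public/"),
    ("disallow", "/*?fbclid="),
    ("disallow", "/*?ref="),
    ("disallow", "/*ep_filter_language%3D"),
    ("disallow", "/*ep_filter_genre%3D"),
    ("disallow", "/*ep_filter_region%3D"),
    ("disallow", "/*ep_filter_source%3D"),
    ("disallow", "/*ep_filter_"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled.

    Assumption: longest matching pattern wins, and Allow wins ties.
    """
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, is_allowed("/wp-admin/admin-ajax.php") is true because the Allow pattern is longer than "/wp-admin/", while any URL carrying an fbclid, ref, or ep_filter_ query parameter is rejected.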

Comments

  • Block specific AI / extended crawlers
  • Default rules for all other bots
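
The two comments describe a split between named crawler groups and a default group. A minimal sketch of how a crawler might pick its group is shown below, assuming the usual convention of a case-insensitive match on the product token with a fallback to "*". The names BLOCKED_AGENTS, select_group, and may_crawl_root are hypothetical helpers, not part of the scan output.

```python
# User-agent tokens of the groups in this scan whose only rule is "Disallow /".
BLOCKED_AGENTS = {
    "amazonbot",
    "applebot-extended",
    "bytespider",
    "ccbot",
    "claudebot",
    "google-extended",
    "gptbot",
    "meta-externalagent",
}


def select_group(product_token):
    """Return the name of the group that applies to a crawler.

    Assumption: tokens are compared case-insensitively, and a crawler
    with no named group falls back to the "*" group.
    """
    token = product_token.strip().lower()
    return token if token in BLOCKED_AGENTS else "*"


def may_crawl_root(product_token):
    """Return True if the crawler may fetch "/".

    Named groups here disallow everything; the "*" group has no rule
    that matches the bare root path, so it is allowed by default.
    """
    return select_group(product_token) == "*"
```

For example, select_group("GPTBot") resolves to the "gptbot" group, which disallows the whole site, while a crawler such as "bingbot" has no named group and is governed by the default rules.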