chapter.org
robots.txt

Robots Exclusion Standard data for chapter.org

Resource Scan

Scan Details

Site Domain chapter.org
Base Domain chapter.org
Scan Status Ok
Last Scan2025-12-23T23:14:44+00:00
Next Scan 2025-12-30T23:14:44+00:00

Last Scan

Scanned2025-12-23T23:14:44+00:00
URL https://chapter.org/robots.txt
Redirect https://www.chapter.org/robots.txt
Redirect Domain www.chapter.org
Redirect Base chapter.org
Domain IPs 104.21.34.5, 172.67.194.170, 2606:4700:3034::ac43:c2aa, 2606:4700:3036::6815:2205
Redirect IPs 104.21.34.5, 172.67.194.170, 2606:4700:3034::ac43:c2aa, 2606:4700:3036::6815:2205
Response IP 172.67.194.170
Found Yes
Hash 49b04f5c8e93c8cc9175ceb7eec0df4dde2264c0ca1668387d8929053d836001
SimHash 231e1952a777

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.chapter.org/sitemaps-2-sitemap.xml
sitemap https://www.chapter.org/cy/sitemaps-2-sitemap.xml

Comments

  • robots.txt for https://www.chapter.org/
  • live - don't allow web crawlers to index cpresources/ or vendor/