practicingtheway.org
robots.txt

Robots Exclusion Standard data for practicingtheway.org

Resource Scan

Scan Details

Site Domain practicingtheway.org
Base Domain practicingtheway.org
Scan Status Ok
Last Scan2025-06-21T12:14:21+00:00
Next Scan 2025-07-21T12:14:21+00:00

Last Scan

Scanned2025-06-21T12:14:21+00:00
URL https://practicingtheway.org/robots.txt
Redirect https://www.practicingtheway.org/robots.txt
Redirect Domain www.practicingtheway.org
Redirect Base practicingtheway.org
Domain IPs 198.202.211.1
Redirect IPs 198.202.211.1, 2620:cb:2000::1
Response IP 198.202.211.1
Found Yes
Hash 7d1391e3f1b51511c647a2827a2f7b6041078651ce4a997df2dc1d198aed011d
SimHash 58900a4b8252

Groups

meta-externalagent

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

googleother

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

sbintuitionsbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

amazon q

Rule Path
Disallow /

claude-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

yandexgpt

Rule Path
Disallow /

mistralbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

youbot

Rule Path
Disallow /

neevaai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.practicingtheway.org/sitemap.xml