wgbh.org
robots.txt

Robots Exclusion Standard data for wgbh.org

Resource Scan

Scan Details

Site Domain wgbh.org
Base Domain wgbh.org
Scan Status Ok
Last Scan2024-10-30T00:34:34+00:00
Next Scan 2024-11-29T00:34:34+00:00

Last Scan

Scanned2024-10-30T00:34:34+00:00
URL https://wgbh.org/robots.txt
Domain IPs 13.33.88.102, 13.33.88.62, 13.33.88.81, 13.33.88.85
Response IP 13.33.88.102
Found Yes
Hash bab6a4a6a0a3cd7b82e500aa26907e1549103e26b31b95960ee8ba02dd3ebf23
SimHash e8289a70e333

Groups

*

Rule Path
Allow /
Disallow /login?*
Disallow /search?*

Other Records

Field Value
crawl-delay 5

gptbot
chatgpt-user

Rule Path
Disallow /