projects.sfchronicle.com
robots.txt

Robots Exclusion Standard data for projects.sfchronicle.com

Resource Scan

Scan Details

Site Domain projects.sfchronicle.com
Base Domain sfchronicle.com
Scan Status Ok
Last Scan 2024-09-13T16:10:33+00:00
Next Scan 2024-10-13T16:10:33+00:00

Last Scan

Scanned 2024-09-13T16:10:33+00:00
URL https://projects.sfchronicle.com/robots.txt
Domain IPs 151.101.0.200, 151.101.128.200, 151.101.192.200, 151.101.64.200
Response IP 199.232.44.200
Found Yes
Hash e24dbdfca8f974ebe748331f08fd45e732a23b1c9ad9a59c42cf92b6db8519b6
SimHash 446088504a31

Groups

User-agent: *

Rule Path
Allow /tools/podcasts/
Disallow /tools/
Disallow /temp-deploy/
Disallow /feeds/
Disallow /test-proj/
Disallow /app/
Disallow /2018/embeds/
Disallow /2019/embeds/
Disallow /2020/embeds/
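
The rule table above can be checked against sample paths with a short sketch. It assumes the rules appear in the file in the order listed and that the file contains nothing else; the robots.txt text below is reconstructed from the table, not copied from the site, and the sample paths and the `ExampleBot` agent name are hypothetical. Python's standard `urllib.robotparser` applies the first matching rule, which is why the `Allow` line for /tools/podcasts/ takes effect before the broader `Disallow` for /tools/.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table (assumption: same order, no other lines).
ROBOTS_TXT = """\
User-agent: *
Allow: /tools/podcasts/
Disallow: /tools/
Disallow: /temp-deploy/
Disallow: /feeds/
Disallow: /test-proj/
Disallow: /app/
Disallow: /2018/embeds/
Disallow: /2019/embeds/
Disallow: /2020/embeds/
"""

BASE = "https://projects.sfchronicle.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text in memory, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the given agent may fetch BASE + path under the parsed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    # Hypothetical sample paths chosen to exercise each kind of rule.
    for path in ("/tools/podcasts/episode-1",
                 "/tools/other",
                 "/2019/embeds/chart",
                 "/2021/embeds/chart"):
        print(path, is_allowed(parser, path))
```

Crawlers that use longest-match precedence instead of first-match should reach the same result for these rules, since /tools/podcasts/ is also the longer of the two overlapping paths.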