ca.pbslearningmedia.org
robots.txt

Robots Exclusion Standard data for ca.pbslearningmedia.org

Resource Scan

Scan Details

Site Domain ca.pbslearningmedia.org
Base Domain pbslearningmedia.org
Scan Status Ok
Last Scan 2025-11-27T00:22:47+00:00
Next Scan 2025-12-11T00:22:47+00:00

Last Scan

Scanned 2025-11-27T00:22:47+00:00
URL https://ca.pbslearningmedia.org/robots.txt
Domain IPs 2600:9000:20a5:1c00:9:2ed8:25c0:93a1, 2600:9000:20a5:5000:9:2ed8:25c0:93a1, 2600:9000:20a5:5600:9:2ed8:25c0:93a1, 2600:9000:20a5:9200:9:2ed8:25c0:93a1, 2600:9000:20a5:9400:9:2ed8:25c0:93a1, 2600:9000:20a5:c000:9:2ed8:25c0:93a1, 2600:9000:20a5:ca00:9:2ed8:25c0:93a1, 2600:9000:20a5:ee00:9:2ed8:25c0:93a1, 3.169.71.109, 3.169.71.115, 3.169.71.64, 3.169.71.89
Response IP 3.169.71.115
Found Yes
Hash 2894cd1ad3a170747155bd224926d10f4f0dab46ff2e188bb85c254ab41cf456
SimHash 4d3518d4e145

Groups

*

Rule Path
Disallow /asset/
Disallow /student/code/
Disallow /student/signup/
Disallow /tools/storyboard/view/
Disallow /bypass/
Disallow /api/
Disallow /uua/
Disallow /searchStandards/
Disallow /login/
Disallow /profile/
Disallow /admin/

Other Records

Field Value
crawl-delay 5

googlebot

Rule Path
Allow /api/v2/
Disallow /asset/
Disallow /login/
Disallow /admin/

gptbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.pbslearningmedia.org/sitemap.xml
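
Example

The groups listed above can be checked programmatically. The following is a minimal sketch, not the scanner's own method: it rebuilds a robots.txt body from the rules in this report and queries it with Python's standard urllib.robotparser. The group order, the placement of crawl-delay inside the * group, and the agent name ExampleBot are assumptions made for illustration; the live file may be laid out differently.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report. Group order and the placement of
# Crawl-delay inside the "*" group are assumptions, not verified facts.
ROBOTS_TXT = """\
User-agent: *
Disallow: /asset/
Disallow: /student/code/
Disallow: /student/signup/
Disallow: /tools/storyboard/view/
Disallow: /bypass/
Disallow: /api/
Disallow: /uua/
Disallow: /searchStandards/
Disallow: /login/
Disallow: /profile/
Disallow: /admin/
Crawl-delay: 5

User-agent: googlebot
Allow: /api/v2/
Disallow: /asset/
Disallow: /login/
Disallow: /admin/

User-agent: gptbot
Disallow: /

User-agent: bingbot
Disallow: /

Sitemap: https://www.pbslearningmedia.org/sitemap.xml
"""

BASE = "https://ca.pbslearningmedia.org"


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    rp = RobotFileParser()
    rp.parse(text.splitlines())
    return rp


parser = build_parser()

if __name__ == "__main__":
    # "ExampleBot" is a hypothetical agent that falls through to the "*" group.
    checks = [
        ("ExampleBot", "/api/v2/items/"),
        ("Googlebot", "/api/v2/items/"),
        ("Googlebot", "/asset/123/"),
        ("GPTBot", "/"),
        ("bingbot", "/"),
    ]
    for agent, path in checks:
        print(agent, path, parser.can_fetch(agent, BASE + path))
    print("crawl-delay for ExampleBot:", parser.crawl_delay("ExampleBot"))
    print("sitemaps:", parser.site_maps())
```

Note that urllib.robotparser applies the first matching rule within a group, whereas RFC 9309 specifies the most specific (longest) match. For the rules in this report the two approaches agree, because no group contains overlapping Allow and Disallow paths.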