help.pbs.org
robots.txt

Robots Exclusion Standard data for help.pbs.org

Resource Scan

Scan Details

Site Domain help.pbs.org
Base Domain pbs.org
Scan Status Ok
Last Scan2025-06-01T13:24:27+00:00
Next Scan 2025-07-01T13:24:27+00:00

Last Scan

Scanned2025-06-01T13:24:27+00:00
URL https://help.pbs.org/robots.txt
Domain IPs 162.159.140.147, 172.66.0.145
Response IP 172.66.0.145
Found Yes
Hash af2f5e3d36654577df55b75c254079b0ab75a6895c88df1c8dbcf870a5556ff8
SimHash 261d0ded7551

Groups

*

Rule Path
Disallow /support/search
Disallow /support/tickets/
Disallow /support/login
Disallow /support/login-verification
Disallow /login/normal/
Allow /helpdesk/attachments
Disallow /helpdesk/
Disallow /public/tickets/
Disallow /*/hit$

Other Records

Field Value
sitemap https://help.pbs.org/support/sitemap.xml

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /