orgcentral.psu.edu
robots.txt
Robots Exclusion Standard data for orgcentral.psu.edu
Resource Scan
Scan Details
Site Domain | orgcentral.psu.edu |
Base Domain | psu.edu |
Scan Status | Ok |
Last Scan | 2025-06-01T10:28:34+00:00 |
Next Scan | 2025-06-15T10:28:34+00:00 |
Last Scan
Scanned | 2025-06-01T10:28:34+00:00 |
URL | https://orgcentral.psu.edu/robots.txt |
Domain IPs | 13.68.101.62 |
Response IP | 13.68.101.62 |
Found | Yes |
Hash | 4a37406433d85f522be4d05eaeee0262570c29e06aa6a90af162b8193b2c4d99 |
SimHash | 6d145c15e95b |
Groups
*
Rule | Path |
---|---|
Disallow | /notfound |
Disallow | /forbidden |
Disallow | /error |
Disallow | /api/ |
Disallow | /engage/ |
Other Records
Field | Value |
---|---|
sitemap | https://orgcentral.psu.edu/sitemap.xml |