orgcentral.psu.edu
robots.txt

Robots Exclusion Standard data for orgcentral.psu.edu

Resource Scan

Scan Details

Site Domain orgcentral.psu.edu
Base Domain psu.edu
Scan Status Ok
Last Scan2025-06-01T10:28:34+00:00
Next Scan 2025-06-15T10:28:34+00:00

Last Scan

Scanned2025-06-01T10:28:34+00:00
URL https://orgcentral.psu.edu/robots.txt
Domain IPs 13.68.101.62
Response IP 13.68.101.62
Found Yes
Hash 4a37406433d85f522be4d05eaeee0262570c29e06aa6a90af162b8193b2c4d99
SimHash 6d145c15e95b

Groups

*

Rule Path
Disallow /notfound
Disallow /forbidden
Disallow /error
Disallow /api/
Disallow /engage/

Other Records

Field Value
sitemap https://orgcentral.psu.edu/sitemap.xml