www.it.psu.edu
robots.txt
Robots Exclusion Standard data for www.it.psu.edu
Resource Scan
Scan Details
| Site Domain | www.it.psu.edu |
| Base Domain | psu.edu |
| Scan Status | Ok |
| Last Scan | 2025-12-05T22:19:50+00:00 |
| Next Scan | 2026-01-04T22:19:50+00:00 |
Last Scan
| Scanned | 2025-12-05T22:19:50+00:00 |
| URL | https://www.it.psu.edu/robots.txt |
| Domain IPs | 100.24.182.117, 184.72.224.80, 3.91.109.122, 34.199.202.106, 34.227.238.166, 35.172.73.102 |
| Response IP | 100.24.182.117 |
| Found | Yes |
| Hash | 0080fa389a74d63f7fe34ee6a5c4ad9d9de2a678f2481d440bcc21c020241698 |
| SimHash | e0c657c081ab |
Groups
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.it.psu.edu/wp-sitemap.xml |
Warnings
- 6 invalid lines.