cla.purdue.edu
robots.txt
Robots Exclusion Standard data for cla.purdue.edu
Resource Scan
Scan Details
Site Domain | cla.purdue.edu |
Base Domain | purdue.edu |
Scan Status | Ok |
Last Scan | 2025-07-26T06:15:39+00:00 |
Next Scan | 2025-08-25T06:15:39+00:00 |
Last Scan
Scanned | 2025-07-26T06:15:39+00:00 |
URL | https://cla.purdue.edu/robots.txt |
Domain IPs | 128.210.7.106 |
Response IP | 128.210.7.106 |
Found | Yes |
Hash | 399948440dc824269bcca0469c171c37825db133435567fd90a5f260876e9ff1 |
SimHash | 800041ad7070 |
Groups
*
Rule | Path |
---|---|
Disallow | /.ssh/ |
Disallow | /images/ |
Disallow | /styles/ |
Disallow | /scripts/ |
Disallow | /clamonitor/ |
Disallow | /App_Data/ |
Disallow | /aspnet_client/ |
Disallow | /videos/ |
Disallow | /aspx_admins/ |
Disallow | /_template/ |
Disallow | /academics/history/festschrift/ |
Disallow | /testing/ |
Disallow | /testing-p/ |
Disallow | /english/navsa/ |
Disallow | /fll/ |
Disallow | /hk/ |
Disallow | /webmaster/ |
Disallow | /academic/comm/ |
Disallow | /slhs/ |
Disallow | /academic/engl/ |
Disallow | /devit/ |
Disallow | /dev/ |
Disallow | /about/contact/feedback/ |