cla.purdue.edu
robots.txt

Robots Exclusion Standard data for cla.purdue.edu

Resource Scan

Scan Details

Site Domain cla.purdue.edu
Base Domain purdue.edu
Scan Status Ok
Last Scan2025-07-26T06:15:39+00:00
Next Scan 2025-08-25T06:15:39+00:00

Last Scan

Scanned2025-07-26T06:15:39+00:00
URL https://cla.purdue.edu/robots.txt
Domain IPs 128.210.7.106
Response IP 128.210.7.106
Found Yes
Hash 399948440dc824269bcca0469c171c37825db133435567fd90a5f260876e9ff1
SimHash 800041ad7070

Groups

*

Rule Path
Disallow /.ssh/
Disallow /images/
Disallow /styles/
Disallow /scripts/
Disallow /clamonitor/
Disallow /App_Data/
Disallow /aspnet_client/
Disallow /videos/
Disallow /aspx_admins/
Disallow /_template/
Disallow /academics/history/festschrift/
Disallow /testing/
Disallow /testing-p/
Disallow /english/navsa/
Disallow /fll/
Disallow /hk/
Disallow /webmaster/
Disallow /academic/comm/
Disallow /slhs/
Disallow /academic/engl/
Disallow /devit/
Disallow /dev/
Disallow /about/contact/feedback/