cs.purdue.edu
robots.txt

Robots Exclusion Standard data for cs.purdue.edu

Resource Scan

Scan Details

Site Domain cs.purdue.edu
Base Domain purdue.edu
Scan Status Ok
Last Scan2025-07-03T02:33:13+00:00
Next Scan 2025-08-02T02:33:13+00:00

Last Scan

Scanned2025-07-03T02:33:13+00:00
URL https://cs.purdue.edu/robots.txt
Domain IPs 128.10.19.120
Response IP 128.10.19.120
Found Yes
Hash 13143c7f8f1a7b287348fbc8d4c21e444dfdc58167d8bec815a6003278084a61
SimHash 1449d1f0afd6

Groups

*

Rule Path
Disallow /homes/spaf/Yucks/
Disallow /homes/fultz
Disallow /help
Disallow /calendar
Disallow /success/QA
Disallow /academic_programs/courses/schedule/2002
Disallow /academic_programs/courses/schedule/2003
Disallow /academic_programs/courses/schedule/2004
Disallow /academic_programs/courses/catalog/2002
Disallow /academic_programs/courses/catalog/2003
Disallow /academic_programs/courses/catalog/2004
Disallow /academic_programs/courses/schedule/2005/Spring
Disallow /academic_programs/courses/schedule/2005/Summer
Disallow /academic_programs/courses/catalog/2005/Spring
Disallow /academic_programs/courses/catalog/2005/Summer