catalog.yale.edu
robots.txt

Robots Exclusion Standard data for catalog.yale.edu

Resource Scan

Scan Details

Site Domain catalog.yale.edu
Base Domain yale.edu
Scan Status Ok
Last Scan2025-03-03T18:21:16+00:00
Next Scan 2025-04-02T18:21:16+00:00

Last Scan

Scanned2025-03-03T18:21:16+00:00
URL https://catalog.yale.edu/robots.txt
Domain IPs 12.175.6.47
Response IP 12.175.6.47
Found Yes
Hash 9d4e12b0dea6fa215f20d93371e56a21d147ffee6781130e49b1cdff22ee7b07
SimHash 8d0d7ce5319f

Groups

*

Rule Path
Disallow /archive/
Disallow /admin/
Disallow /azindex/
Disallow /catalogcontents/
Disallow /cim/
Disallow /clmail/
Disallow /courseadmin/
Disallow /courseleaf/
Disallow /css/
Disallow /dbleaf/
Disallow /depts/
Disallow /first-year-student-handbook/
Disallow /fonts/
Disallow /gallery/
Disallow /images/
Disallow /js/
Disallow /mig/
Disallow /miscadmin
Disallow /navbar/
Disallow /pagewiz/
Disallow /pdf/
Disallow /programadmin/
Disallow /responseform/
Disallow /ribbit/
Disallow /search/
Disallow /sectionrequest/
Disallow /shared/
Disallow /styles/
Disallow /tmp/
Disallow /wen/
Disallow /wiztest/
Disallow /xsearch/

Other Records

Field Value
sitemap http://catalog.yale.edu/sitemap.xml