guide.wisc.edu
robots.txt

Robots Exclusion Standard data for guide.wisc.edu

Resource Scan

Scan Details

Site Domain guide.wisc.edu
Base Domain wisc.edu
Scan Status Ok
Last Scan2025-03-03T10:05:16+00:00
Next Scan 2025-04-02T10:05:16+00:00

Last Scan

Scanned2025-03-03T10:05:16+00:00
URL https://guide.wisc.edu/robots.txt
Domain IPs 12.175.6.47
Response IP 12.175.6.47
Found Yes
Hash 335930a32151d41a34050c88a9c404223827c49bfa68481ee851dd38e8c139b1
SimHash 8b5d1ce5b1d9

Groups

*

Rule Path
Disallow /archive/
Disallow /admin/
Disallow /azindex/
Disallow /badgeadmin/
Disallow /catalogcontents/
Disallow /cim/
Disallow /clmail/
Disallow /courseadmin/
Disallow /courseleaf/
Disallow /css/
Disallow /dbleaf/
Disallow /examadmin/
Disallow /depts/
Disallow /fonts/
Disallow /gallery/
Disallow /images/
Disallow /js/
Disallow /lo/
Disallow /mig/
Disallow /migration/
Disallow /miscadmin/
Disallow /navbar/
Disallow /pagewiz/
Disallow /programadmin/
Disallow /responseform/
Disallow /ribbit/
Disallow /search/
Disallow /shared/
Disallow /styles/
Disallow /tmp/
Disallow /wiztest/
Disallow /xsearch/
Disallow /wen/
Disallow /undergraduate/index.html
Disallow /graduate/index.html

Other Records

Field Value
sitemap http://guide.wisc.edu/sitemap.xml