it.wisc.edu
robots.txt

Robots Exclusion Standard data for it.wisc.edu

Resource Scan

Scan Details

Site Domain it.wisc.edu
Base Domain wisc.edu
Scan Status Ok
Last Scan 2025-06-01T22:25:20+00:00
Next Scan 2025-07-01T22:25:20+00:00

Last Scan

Scanned 2025-06-01T22:25:20+00:00
URL https://it.wisc.edu/robots.txt
Domain IPs 128.104.80.14
Response IP 128.104.80.14
Found Yes
Hash 9af465e0507d8b7ccacfce75ac78b0c320f62ec66d73af21ddc90e3e6bb9814a
SimHash 005dd8607901
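
The Hash value above is 64 hexadecimal digits, which is the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below only assumes SHA-256 over the raw response body; `body_digest` is a hypothetical helper, not part of the scanner.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw bytes.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical body used only for illustration; it is not the real file.
example_body = b"User-agent: *\nCrawl-delay: 10\n"
digest = body_digest(example_body)
```

Comparing a freshly computed digest with the stored one is a cheap way to tell whether the file changed between scans, provided both sides hash the same bytes.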

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
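
As a rough illustration of what this wildcard group means for a crawler, the sketch below feeds an assumed reconstruction of the group into Python's standard `urllib.robotparser`. The two robots.txt lines are inferred from the report, not copied from the live file, and `ExampleCrawler` is a made-up user agent.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the "*" group from the report;
# the real file may be worded differently.
WILDCARD_GROUP = [
    "User-agent: *",
    "Crawl-delay: 10",
]

parser = RobotFileParser()
parser.parse(WILDCARD_GROUP)

# "ExampleCrawler" is a hypothetical agent that only matches the "*" group.
delay = parser.crawl_delay("ExampleCrawler")
allowed = parser.can_fetch("ExampleCrawler", "https://it.wisc.edu/any/path")

print(f"crawl delay: {delay} s, allowed: {allowed}")
```

With no Allow or Disallow rules in the group, every path is fetchable, and the only constraint is the requested delay between requests. `crawl-delay` is not part of the core Robots Exclusion Protocol, so individual crawlers may ignore it.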

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
Disallow

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

npbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

seokicks.de

Rule Path
Disallow /

adsbot-google

Rule Path
Disallow /
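
The per-agent groups above can be checked the same way a well-behaved crawler would. The sketch below rebuilds a subset of the groups as robots.txt text (an assumed reconstruction, not the live file) and queries it with `urllib.robotparser`; `is_allowed` and `ExampleCrawler` are hypothetical names introduced for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of a few groups listed in the report.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 10

User-agent: GPTBot
Disallow: /

User-agent: AhrefsBot
Disallow: /

User-agent: MJ12bot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://it.wisc.edu" + path)


for agent in ("GPTBot", "AhrefsBot", "MJ12bot", "ExampleCrawler"):
    print(agent, is_allowed(agent, "/"))
```

Agents with a `Disallow /` group are refused everywhere, while an agent with no group of its own falls back to the `*` group and is allowed.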

Other Records

Field Value
sitemap https://it.wisc.edu/sitemap_index.xml
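
A sitemap record is independent of any user-agent group. As a small sketch, `RobotFileParser.site_maps()` (available from Python 3.8) returns the declared sitemap URLs; the input line below is an assumed reconstruction of the record shown above.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the sitemap record from the report.
SITEMAP_RECORD = ["Sitemap: https://it.wisc.edu/sitemap_index.xml"]

parser = RobotFileParser()
parser.parse(SITEMAP_RECORD)

# site_maps() returns a list of URLs, or None when no sitemap is declared.
sitemaps = parser.site_maps()
print(sitemaps)
```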

Warnings

  • `user agent` is not a known field.
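
This warning is what a parser reports when a line uses `user agent` (with a space) where `user-agent` was probably intended. The report does not show the offending line, so the sample below is hypothetical, and `KNOWN_FIELDS` is an assumed list rather than the scanner's actual one.

```python
# Assumed set of recognized field names; the scanner's real list is not shown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(text: str) -> list[str]:
    """Return unrecognized field names in order of first appearance."""
    found = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


# Hypothetical robots.txt fragment with a mistyped field name.
sample = "User agent: ExampleBot\nDisallow: /\n\nUser-agent: *\nCrawl-delay: 10\n"
warnings = unknown_fields(sample)
print(warnings)
```

A parser that does not recognize the field typically skips that line, so rules written under a mistyped `user agent` line may not apply to the agent the author had in mind.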