cooperhall.son.wisc.edu
robots.txt

Robots Exclusion Standard data for cooperhall.son.wisc.edu

Resource Scan

Scan Details

Site Domain cooperhall.son.wisc.edu
Base Domain wisc.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-06-06T13:53:17+00:00
Next Scan 2024-09-04T13:53:17+00:00

Last Successful Scan

Scanned2023-05-29T13:45:38+00:00
URL https://cooperhall.son.wisc.edu/robots.txt
Domain IPs 192.0.78.12, 192.0.78.13
Response IP 192.0.78.13
Found Yes
Hash a861f3176c4bfffa4898ccbeaccbfe295faa2f62f86c64d3da509e933a6accd8
SimHash 38189e02e822

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-admin/admin-ajax.php
Disallow /author/*

ahrefssiteaudit

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

semrushbot-seoab

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mbcrawler/1.0

Rule Path
Disallow /