leadership.ucsc.edu
robots.txt

Robots Exclusion Standard data for leadership.ucsc.edu

Resource Scan

Scan Details

Site Domain leadership.ucsc.edu
Base Domain ucsc.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-18T12:58:32+00:00
Next Scan 2024-12-17T12:58:32+00:00

Last Successful Scan

Scanned2022-09-12T21:12:06+00:00
URL https://leadership.ucsc.edu/robots.txt
Response IP 18.140.226.100
Found Yes
Hash f2267bacc0a28293bc9b9121006bf38e554c9dfe32100afa660a7e083e1f6792
SimHash 191c9413c800

Groups

ahrefssiteaudit

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

semrushbot-seoab

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mbcrawler/1.0

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://leadership.ucsc.edu/sitemap.xml