soe.ucsc.edu
robots.txt

Robots Exclusion Standard data for soe.ucsc.edu

Resource Scan

Scan Details

Site Domain soe.ucsc.edu
Base Domain ucsc.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't establish SSL connection.
Last Scan2024-09-30T16:40:40+00:00
Next Scan 2024-12-29T16:40:40+00:00

Last Successful Scan

Scanned2024-05-11T16:39:42+00:00
URL http://soe.ucsc.edu/robots.txt
Redirect https://engineering.ucsc.edu/robots.txt
Redirect Domain engineering.ucsc.edu
Redirect Base ucsc.edu
Domain IPs 128.114.47.196
Redirect IPs 100.21.83.198, 44.235.65.182
Response IP 44.235.65.182
Found Yes
Hash 85ed89389a8acb3f4cafca916c5f6b7179b27ba07c7516a65bce80b30f766b5f
SimHash e0c75fc0808b

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 30

yandex

Rule Path
Disallow /

moget

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

baiduspider+(+http://www.baidu.com/search/spider.htm)

Rule Path
Disallow /

baiduspider/2.0;+http://www.baidu.com/search/spider.html

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

mozilla/5.0(compatible; baiduspider/2.0; +http://www.baidu.com/search/spider.html)

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sosospider/2.0

Rule Path
Disallow /

sosospider+

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /wp-content/mu-plugins/

Other Records

Field Value
sitemap https://engineering.ucsc.edu/wp-sitemap.xml

Warnings

  • 6 invalid lines.