blogs.umass.edu
robots.txt

Robots Exclusion Standard data for blogs.umass.edu

Resource Scan

Scan Details

Site Domain blogs.umass.edu
Base Domain umass.edu
Scan Status Ok
Last Scan 2024-06-17T10:00:46+00:00
Next Scan 2024-07-17T10:00:46+00:00

Last Scan

Scanned 2024-06-17T10:00:46+00:00
URL https://blogs.umass.edu/robots.txt
Redirect https://websites.umass.edu/robots.txt
Redirect Domain websites.umass.edu
Redirect Base umass.edu
Domain IPs 3.133.52.101, 3.135.41.1, 3.139.126.51
Redirect IPs 3.133.52.101, 3.135.41.1, 3.139.126.51
Response IP 3.135.41.1
Found Yes
Hash 351caba65483023c0d164ef86f77e404dae70a75d619878c381c52fba20026f0
SimHash e0c657c0812b

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 30
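
The two rules in the `*` group overlap: `/wp-admin/admin-ajax.php` falls under both the Disallow and the Allow path. Under RFC 9309 the longest matching path wins, and a tie goes to Allow. The sketch below illustrates that evaluation for plain prefix rules only. The function name `is_allowed` and the `RULES` list are hypothetical, wildcards (`*`, `$`) are not handled, and the `crawl-delay` record is ignored because it is not part of RFC 9309. It is not necessarily how any particular crawler evaluates this file.

```python
# Minimal sketch of RFC 9309 longest-match evaluation (prefix rules only).
# Names here are illustrative and do not come from the scan report.

def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules`.

    `rules` is a list of (kind, prefix) tuples, kind being "allow" or
    "disallow". The longest matching prefix decides; on equal length,
    "allow" wins. A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not prefix:
            continue  # an empty path matches nothing
        if path.startswith(prefix):
            longer = len(prefix) > best_len
            tie_for_allow = len(prefix) == best_len and kind == "allow"
            if longer or tie_for_allow:
                best_len = len(prefix)
                allowed = (kind == "allow")
    return allowed


# The `*` group as listed in the report above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

for candidate in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/2024/06/post/"):
    print(candidate, is_allowed(candidate, RULES))
```

Note that Python's standard `urllib.robotparser` applies rules in file order (first match wins), so it would report `/wp-admin/admin-ajax.php` as disallowed for this group. That is why the sketch implements longest-match directly.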

yandex

Rule Path
Disallow /

moget

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

baiduspider+(+http://www.baidu.com/search/spider.htm)

Rule Path
Disallow /

baiduspider/2.0;+http://www.baidu.com/search/spider.html

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

mozilla/5.0(compatible; baiduspider/2.0; +http://www.baidu.com/search/spider.html)

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sosospider/2.0

Rule Path
Disallow /

sosospider+

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /wp-content/mu-plugins/
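
Several group names above contain spaces, slashes, digits, or `+`. RFC 9309 defines a user-agent product token as letters, hyphens, and underscores only (or `*`), so those names fall outside the grammar. How a given crawler treats such lines is not stated in this report. The sketch below only flags which of the listed names conform. `is_valid_product_token` and `GROUP_NAMES` are hypothetical names introduced for illustration.

```python
import re

# RFC 9309 identifier: one or more of A-Z, a-z, "-" or "_".
PRODUCT_TOKEN = re.compile(r"[A-Za-z_-]+")


def is_valid_product_token(name):
    """True if `name` is `*` or matches the RFC 9309 identifier grammar."""
    return name == "*" or PRODUCT_TOKEN.fullmatch(name) is not None


# Group names copied from the report above.
GROUP_NAMES = [
    "*", "yandex", "moget", "ichiro", "naverbot", "yeti",
    "baiduspider", "baiduspider-video", "baiduspider-image",
    "baiduspider+",
    "baiduspider+(+http://www.baidu.com/search/spider.htm)",
    "baiduspider/2.0;+http://www.baidu.com/search/spider.html",
    "baiduspider/2.0",
    "mozilla/5.0(compatible; baiduspider/2.0; +http://www.baidu.com/search/spider.html)",
    "sogou spider", "sogou web spider", "youdaobot",
    "sosospider", "sosospider/2.0", "sosospider+",
    "yisouspider", "googlebot-image",
]

non_conforming = [n for n in GROUP_NAMES if not is_valid_product_token(n)]
for name in non_conforming:
    print("outside RFC 9309 token grammar:", name)
```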

Warnings

  • 6 invalid lines.