tucsonsentinel.com
robots.txt

Robots Exclusion Standard data for tucsonsentinel.com

Resource Scan

Scan Details

Site Domain tucsonsentinel.com
Base Domain tucsonsentinel.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-08-26T20:57:43+00:00
Next Scan 2024-11-24T20:57:43+00:00

Last Successful Scan

Scanned2023-11-01T20:40:01+00:00
URL https://tucsonsentinel.com/robots.txt
Redirect https://www.tucsonsentinel.com/robots.txt
Redirect Domain www.tucsonsentinel.com
Redirect Base tucsonsentinel.com
Domain IPs 172.66.40.173, 172.66.43.83, 2606:4700:3108::ac42:28ad, 2606:4700:3108::ac42:2b53
Redirect IPs 172.66.40.173, 172.66.43.83, 2606:4700:3108::ac42:28ad, 2606:4700:3108::ac42:2b53
Response IP 172.66.43.83
Found Yes
Hash 7b163e2000c001e7e9d7b597dc14807cd2347e1d420339eb1aa894016e5b3be9
SimHash 2c1a5901e5b5

Groups

*

Rule Path
Allow /

*

Rule Path
Disallow /events/

*

Rule Path
Disallow /calendar/

*

Rule Path
Disallow /forums/

*

Rule Path
Disallow /reader/login/

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

google-extended

Rule Path
Disallow /

applebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

applebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

ccbot/2.0

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

blp_bbot/0.1

Rule Path
Disallow /

go 1.1 package http

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

npbot

Rule Path
Disallow /

veooz

Rule Path
Disallow /

veoozbot

Rule Path
Disallow /

garlikcrawler

Rule Path
Disallow /

speedy

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

discobot

Rule Path
Disallow /

knowaboutbot

Rule Path
Disallow /

unwindfetchor

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ccbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

yahoomobile
yahoocachesystem
yahoo! slurp/site explorer
mozilla/4.05 [en]
lti/lemurproject
yahoo-blogs
yahoo-blogs/v3.9
yahoo-mmcrawler
yahoo-mmcrawler/3.x
yahooysmcm
yahooysmcm/2.0.0
yahoo-test
yahoo! mindset
y!j-bsc
y!j-bsc/1.0
y!j-bsc/1.0(http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
y!j-bsc
y!j-bsc/1.0
y!j-bsc/1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
y!j
y!j/1.0
y!j/1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
y!j
y!j/1.0
y!j/1.0 (http://help.yahoo.co.jp/help/jp/search/indexing/indexing-15.html)
mozilla/4.0 (compatible; y!j; for robot study; keyoshid)
mozilla/4.0 (compatible; y!j; for robot study; keyoshid)
mozilla/5.0 (compatible; yahoo! slurp china; http://misc.yahoo.com.cn/help.html)
mozilla/5.0 (compatible; yahoo! de slurp; http://help.yahoo.com/help/us/ysearch/slurp)
mozilla/5.0 (yahoo-test/4.0 mailto:vertical-crawl-support@yahoo-inc.com)
mozilla/5.0 (compatible; yahoo! slurp; http://help.yahoo.com/help/us/ysearch/slurp)

Rule Path
Disallow /

Other Records

Field Value
sitemap https://tucsonsentinel.com/news_sitemap.php
sitemap https://tucsonsentinel.com/sitemap.php
sitemap https://tucsonsentinel.com/sitemap_index.xml

Comments

  • The grub distributed search engine behaves very badly.
  • They totally overwhelm servers with traffic.
  • http://www.nameprotect.com/botinfo.html
  • This spider's output isn't public.
  • Entireweb
  • Foreign-language bot
  • Poorly behaved bot
  • Brandwatch
  • Russian image search engine
  • Magestic-12
  • Yoydao
  • NerdByNature.Net
  • Discovery Engine
  • No idea.
  • Gnip
  • These bots are designed to duplicate entire sites.
  • Awful Yahoo garbage bots

Warnings

  • 2 invalid lines.