
Robots Exclusion Standard data for longmontyarn.com

Resource Scan

Scan Details

Site Domain longmontyarn.com
Base Domain longmontyarn.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-04-30T20:31:21+00:00
Next Scan 2024-06-29T20:31:21+00:00
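
The gap between the last and next scan can be checked directly from the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report. For these two values the difference works out to 60 days.

```python
from datetime import datetime

# Timestamps copied from the "Last Scan" and "Next Scan" fields above.
last_scan = datetime.fromisoformat("2024-04-30T20:31:21+00:00")
next_scan = datetime.fromisoformat("2024-06-29T20:31:21+00:00")

# Both values carry a UTC offset, so the subtraction is timezone-aware.
interval = next_scan - last_scan
print(interval.days)
```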

Last Successful Scan

Scanned 2024-02-08T20:10:18+00:00
URL https://longmontyarn.com/robots.txt
Domain IPs 18.161.97.14, 18.161.97.20, 18.161.97.38, 18.161.97.41, 2600:9000:23d0:3400:12:3e9d:3a40:93a1, 2600:9000:23d0:4200:12:3e9d:3a40:93a1, 2600:9000:23d0:600:12:3e9d:3a40:93a1, 2600:9000:23d0:a00:12:3e9d:3a40:93a1, 2600:9000:23d0:aa00:12:3e9d:3a40:93a1, 2600:9000:23d0:ca00:12:3e9d:3a40:93a1, 2600:9000:23d0:ea00:12:3e9d:3a40:93a1, 2600:9000:23d0:f600:12:3e9d:3a40:93a1
Response IP 108.138.26.65
Found Yes
Hash 4626c905d2055a9b50e343d1df66b172a4815d3a7c02402ba63e848e8eb7d777
SimHash 931c55fa7fa9

Groups

*

Rule Path
Disallow (empty)

Other Records

Field Value
crawl-delay 4

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

goodzer

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

dotbot
dotbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

checkmarknetwork/1.0 (+http://www.checkmarknetwork.com/spider.html)

Rule Path
Disallow /

seekportbot
mauibot
houzzbot
baiduspider
baiduspider-image
serpstatbot
sogou blog
sogou inst spider
sogou news spider
sogou orion spider
sogou spider2
sogou web spider
uptimebot
yandex
yandexmobilebot
zoominfobot
megaindex.ru
alphaseobot-sa
proximic
amazonbot
petalbot
re-re studio
barkrowler
siteauditbot

Rule Path
Disallow /

semrushbot-ba
semrushbot

Rule Path
Disallow /
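
To see how groups like the ones listed above are typically interpreted, the sketch below feeds a partial, hand-written reconstruction to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is an assumption assembled from three of the reported groups (`*`, `ahrefsbot`, and the two Semrush agents); the original file's exact wording, casing, and ordering are not shown in this report, and `is_allowed` and `someotherbot` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed, partial reconstruction: NOT the site's actual robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow:
Crawl-delay: 4

User-agent: ahrefsbot
Disallow: /

User-agent: semrushbot-ba
User-agent: semrushbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str = "/") -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://longmontyarn.com" + path)


# Named crawlers with "Disallow /" are blocked from the whole site.
print(is_allowed("ahrefsbot"))
print(is_allowed("semrushbot"))

# Any other agent falls back to the "*" group: an empty Disallow
# permits everything, and the crawl-delay of 4 applies.
print(is_allowed("someotherbot"))
print(parser.crawl_delay("someotherbot"))
```

Under this reconstruction the two named crawlers are refused, while an unlisted agent is allowed and is asked to wait 4 seconds between requests. Other parsers may treat `crawl-delay` differently, since it is not part of the core standard.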

Comments

  • See if Semrush can behave - March 2023.
  • Update Nov 2023, no, they can't behave.

Warnings

  • 2 invalid lines.