homesteadhearth.com
robots.txt

Robots Exclusion Standard data for homesteadhearth.com

Resource Scan

Scan Details

Site Domain homesteadhearth.com
Base Domain homesteadhearth.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-06T09:49:24+00:00
Next Scan 2024-12-05T09:49:24+00:00

Last Successful Scan

Scanned2024-01-18T09:46:57+00:00
URL https://www.homesteadhearth.com/robots.txt
Domain IPs 2600:9000:2175:2800:1f:e8c:7600:93a1, 2600:9000:2175:5400:1f:e8c:7600:93a1, 2600:9000:2175:6200:1f:e8c:7600:93a1, 2600:9000:2175:9a00:1f:e8c:7600:93a1, 2600:9000:2175:ba00:1f:e8c:7600:93a1, 2600:9000:2175:d000:1f:e8c:7600:93a1, 2600:9000:2175:de00:1f:e8c:7600:93a1, 2600:9000:2175:e00:1f:e8c:7600:93a1, 52.84.45.127, 52.84.45.45, 52.84.45.54, 52.84.45.94
Response IP 18.245.31.126
Found Yes
Hash 4626c905d2055a9b50e343d1df66b172a4815d3a7c02402ba63e848e8eb7d777
SimHash 931c55fa7fa9

Groups

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 4

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

goodzer

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

dotbot
dotbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

checkmarknetwork/1.0 (+http://www.checkmarknetwork.com/spider.html)

Rule Path
Disallow /

seekportbot
mauibot
houzzbot
baiduspider
baiduspider-image
serpstatbot
sogou blog
sogou inst spider
sogou news spider
sogou orion spider
sogou spider2
sogou web spider
uptimebot
yandex
yandexmobilebot
zoominfobot
megaindex.ru
alphaseobot-sa
proximic
amazonbot
petalbot
re-re studio
barkrowler
siteauditbot

Rule Path
Disallow /

semrushbot-ba
semrushbot

Rule Path
Disallow /

Comments

  • See if Semrush can behave - March 2023.
  • Update Nov 2023, no, they can't behave.

Warnings

  • 2 invalid lines.