theplaidgiraffe.ca
robots.txt

Robots Exclusion Standard data for theplaidgiraffe.ca

Resource Scan

Scan Details

Site Domain theplaidgiraffe.ca
Base Domain theplaidgiraffe.ca
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-04-19T17:17:46+00:00
Next Scan 2024-06-18T17:17:46+00:00

Last Successful Scan

Scanned2024-02-13T16:55:04+00:00
URL https://theplaidgiraffe.ca/robots.txt
Domain IPs 18.161.111.11, 18.161.111.127, 18.161.111.14, 18.161.111.17, 2600:9000:21f8:5200:1d:72a4:2fc0:93a1, 2600:9000:21f8:5400:1d:72a4:2fc0:93a1, 2600:9000:21f8:9400:1d:72a4:2fc0:93a1, 2600:9000:21f8:9c00:1d:72a4:2fc0:93a1, 2600:9000:21f8:a000:1d:72a4:2fc0:93a1, 2600:9000:21f8:a200:1d:72a4:2fc0:93a1, 2600:9000:21f8:c00:1d:72a4:2fc0:93a1, 2600:9000:21f8:d400:1d:72a4:2fc0:93a1
Response IP 18.164.52.52
Found Yes
Hash 4626c905d2055a9b50e343d1df66b172a4815d3a7c02402ba63e848e8eb7d777
SimHash 931c55fa7fa9

Groups

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 4

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

goodzer

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

dotbot
dotbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

checkmarknetwork/1.0 (+http://www.checkmarknetwork.com/spider.html)

Rule Path
Disallow /

seekportbot
mauibot
houzzbot
baiduspider
baiduspider-image
serpstatbot
sogou blog
sogou inst spider
sogou news spider
sogou orion spider
sogou spider2
sogou web spider
uptimebot
yandex
yandexmobilebot
zoominfobot
megaindex.ru
alphaseobot-sa
proximic
amazonbot
petalbot
re-re studio
barkrowler
siteauditbot

Rule Path
Disallow /

semrushbot-ba
semrushbot

Rule Path
Disallow /

Comments

  • See if Semrush can behave - March 2023.
  • Update Nov 2023, no, they can't behave.

Warnings

  • 2 invalid lines.