theplaidgiraffe.ca
robots.txt
Robots Exclusion Standard data for theplaidgiraffe.ca
Resource Scan
Scan Details
Site Domain | theplaidgiraffe.ca |
Base Domain | theplaidgiraffe.ca |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-16T17:18:25+00:00 |
Next Scan | 2024-12-15T17:18:25+00:00 |
Last Successful Scan
Scanned | 2024-02-13T16:55:04+00:00 |
URL | https://theplaidgiraffe.ca/robots.txt |
Domain IPs | 18.161.111.11, 18.161.111.127, 18.161.111.14, 18.161.111.17, 2600:9000:21f8:5200:1d:72a4:2fc0:93a1, 2600:9000:21f8:5400:1d:72a4:2fc0:93a1, 2600:9000:21f8:9400:1d:72a4:2fc0:93a1, 2600:9000:21f8:9c00:1d:72a4:2fc0:93a1, 2600:9000:21f8:a000:1d:72a4:2fc0:93a1, 2600:9000:21f8:a200:1d:72a4:2fc0:93a1, 2600:9000:21f8:c00:1d:72a4:2fc0:93a1, 2600:9000:21f8:d400:1d:72a4:2fc0:93a1 |
Response IP | 18.164.52.52 |
Found | Yes |
Hash | 4626c905d2055a9b50e343d1df66b172a4815d3a7c02402ba63e848e8eb7d777 |
SimHash | 931c55fa7fa9 |
Groups
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
crawl-delay | 4 |
seekportbot
mauibot
houzzbot
baiduspider
baiduspider-image
serpstatbot
sogou blog
sogou inst spider
sogou news spider
sogou orion spider
sogou spider2
sogou web spider
uptimebot
yandex
yandexmobilebot
zoominfobot
megaindex.ru
alphaseobot-sa
proximic
amazonbot
petalbot
re-re studio
barkrowler
siteauditbot
Rule | Path |
---|---|
Disallow | / |
Warnings
- 2 invalid lines.
Comments