thegreenhead.com
robots.txt

Robots Exclusion Standard data for thegreenhead.com

Resource Scan

Scan Details

Site Domain thegreenhead.com
Base Domain thegreenhead.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-31T02:47:02+00:00
Next Scan 2025-01-29T02:47:02+00:00

Last Successful Scan

Scanned 2024-07-04T02:37:14+00:00
URL https://thegreenhead.com/robots.txt
Redirect https://www.thegreenhead.com/robots.txt
Redirect Domain www.thegreenhead.com
Redirect Base thegreenhead.com
Domain IPs 172.66.40.122, 172.66.43.134, 2606:4700:3108::ac42:287a, 2606:4700:3108::ac42:2b86
Redirect IPs 172.66.40.122, 172.66.43.134, 2606:4700:3108::ac42:287a, 2606:4700:3108::ac42:2b86
Response IP 172.66.43.134
Found Yes
Hash 02e66cfd04941695cf0f453ff3f502ecc446009e411d3213815170cc85f82a0b
SimHash e376d950ca99

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /link/
Disallow /link2/
Disallow /link3/
Disallow /link4/
Disallow /link5/

ahrefsbot
seznambot
brandverity
dataforseobot
barkrowler
dotbot
mj12bot
grapeshotcrawler
eyemonit uptime bot
seekportbot
petalbot
diffbot
serpstatbot
bytespider
bytedance
megaindex.ru/2.0
megaindex.ru
femtosearchbot
sogou web spider
sogou inst spider
yisouspider
baiduspider
imagesiftbot
gptbot
blexbot

Rule Path
Disallow /
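The two groups above can be checked with Python's standard urllib.robotparser module. This is a minimal sketch, not the site's actual file: the robots.txt text below is reconstructed from the parsed groups in this report, lists only a few of the named crawlers for brevity, and leaves out the one invalid line because its content is not shown here. The names ROBOTS_TXT, is_allowed and "examplebot" are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups in this report, abbreviated.
# The original file (including its one invalid line) is not reproduced here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /link/
Disallow: /link2/
Disallow: /link3/
Disallow: /link4/
Disallow: /link5/

User-agent: ahrefsbot
User-agent: mj12bot
User-agent: bytespider
User-agent: gptbot
User-agent: blexbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Return True if the given user agent may fetch the given path."""
    return parser.can_fetch(agent, "https://www.thegreenhead.com" + path)


# A crawler named in the second group is blocked from the whole site.
print(is_allowed("gptbot", "/"))
# Any other crawler falls under the "*" group: only the listed paths are blocked.
print(is_allowed("examplebot", "/"))
print(is_allowed("examplebot", "/link/abc"))
```

Note that urllib.robotparser matches rule paths by simple prefix, so under this sketch /link/abc is blocked for unnamed crawlers while a path such as /linker/ is not.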

Other Records

Field Value
sitemap https://www.thegreenhead.com/sitemaps/sitemaps.xml
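The sitemap record can be read back with the same standard-library parser. A minimal sketch, assuming Python 3.8 or later (where RobotFileParser.site_maps() is available) and a shortened, reconstructed robots.txt rather than the site's actual file; the variable names are illustrative.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a shortened reconstruction containing one rule and the sitemap record.
lines = [
    "User-agent: *",
    "Disallow: /cgi-bin/",
    "",
    "Sitemap: https://www.thegreenhead.com/sitemaps/sitemaps.xml",
]

parser = RobotFileParser()
parser.parse(lines)

# site_maps() returns a list of sitemap URLs, or None if the file declares none.
sitemaps = parser.site_maps()
print(sitemaps)
```

Sitemap records are independent of user-agent groups, which is why the report lists this one under Other Records rather than under a group.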

Warnings

  • 1 invalid line.