hats.com
robots.txt

Robots Exclusion Standard data for hats.com

Resource Scan

Scan Details

Site Domain hats.com
Base Domain hats.com
Scan Status Ok
Last Scan2024-09-26T07:24:48+00:00
Next Scan 2024-10-26T07:24:48+00:00

Last Scan

Scanned2024-09-26T07:24:48+00:00
URL https://hats.com/robots.txt
Domain IPs 151.101.1.124, 151.101.129.124, 151.101.193.124, 151.101.65.124
Response IP 151.101.1.124
Found Yes
Hash 1c015e970583b50f7fd9c3f669d08ec99b19245574538f1f96bac6e44b0c670b
SimHash 4022fd016bca

Groups

gomezagent

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

aranhabot

Rule Path
Disallow /404/
Disallow /app/
Disallow /bin/
Disallow /bollman-cs/
Disallow /cgi-bin/
Disallow /channeladvisorapi/
Disallow /edi/
Disallow /dev/
Disallow /lib/
Disallow /magento/
Disallow /pub/media/
Disallow /report/
Disallow /skin/
Disallow /stats/
Disallow /setup/
Disallow /update/
Disallow /var/
Disallow /vendor/

Other Records

Field Value
crawl-delay 10

Comments

  • block Gomez agent
  • block YandexBot
  • block SemrushBot
  • allow googlebot
  • allow Google Images bot
  • Crawlers Setup
  • Amazon Aranhabot
  • Directories
  • this is the one in media