headiil.ee
robots.txt

Robots Exclusion Standard data for headiil.ee

Resource Scan

Scan Details

Site Domain headiil.ee
Base Domain headiil.ee
Scan Status Ok
Last Scan 2024-10-01T22:24:40+00:00
Next Scan 2024-10-08T22:24:40+00:00

Last Scan

Scanned 2024-10-01T22:24:40+00:00
URL https://headiil.ee/robots.txt
Domain IPs 157.230.96.52
Response IP 157.230.96.52
Found Yes
Hash 18278777c7f3ad83263b6e1b30f110b0c4968c24d33bbd57c329dcff9df2d691
SimHash d81578f44be9
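
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest. The scanner's actual algorithm and the exact bytes it hashes are not stated, so the sketch below only illustrates the general idea of fingerprinting a robots.txt body to detect changes between scans. The function names `fingerprint` and `has_changed` are hypothetical.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 over the unmodified response body. The report
    does not say which algorithm or normalization the scanner uses.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest against the digest of a fresh body."""
    return fingerprint(body) != previous_hash


if __name__ == "__main__":
    old_body = b"User-agent: *\nDisallow: /api\n"
    new_body = b"User-agent: *\nDisallow: /api\nDisallow: /json\n"
    stored = fingerprint(old_body)
    print(has_changed(stored, old_body))  # same bytes, same digest
    print(has_changed(stored, new_body))  # any byte change alters the digest
```

An exact hash like this flags any byte-level change; a similarity hash such as the SimHash field is typically used to judge how large the change is, which this sketch does not attempt.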

Groups

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 60

netcraftsurveyagent

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 60

exabot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

googlebot

Rule Path
Disallow (empty path)

Other Records

Field Value
crawl-delay 60

googlebot-image

Rule Path
Disallow (empty path)

Other Records

Field Value
crawl-delay 60

yandex

Rule Path
Disallow (empty path)

Other Records

Field Value
crawl-delay 60

yandexbot

Rule Path
Disallow (empty path)

Other Records

Field Value
crawl-delay 60
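
The groups above show three different outcomes: a full block (Disallow /), a group with no rules at all (bingbot), and a Disallow with an empty path (googlebot, googlebot-image, yandex, yandexbot), which by convention blocks nothing. A minimal sketch of how those cases evaluate, using Python's standard `urllib.robotparser`, follows. The robots.txt text is reconstructed from this report and is an assumption: the real file's ordering, comments, and grouping may differ.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (assumption): one example of each kind of group
# listed in the scan, not the site's actual file.
EXCERPT = """\
User-agent: semrushbot
Disallow: /

User-agent: bingbot
Crawl-delay: 60

User-agent: googlebot
Disallow:
Crawl-delay: 60
"""

excerpt_parser = RobotFileParser()
excerpt_parser.parse(EXCERPT.splitlines())

URL = "https://headiil.ee/deals"

# "Disallow: /" blocks every path for that agent.
semrush_allowed = excerpt_parser.can_fetch("semrushbot", URL)

# A group with no rules leaves every path allowed.
bing_allowed = excerpt_parser.can_fetch("bingbot", URL)

# An empty Disallow value is treated as "allow everything".
google_allowed = excerpt_parser.can_fetch("googlebot", URL)

# Crawl-delay is exposed per agent; None when the group sets no delay.
google_delay = excerpt_parser.crawl_delay("googlebot")
semrush_delay = excerpt_parser.crawl_delay("semrushbot")

if __name__ == "__main__":
    print(semrush_allowed, bing_allowed, google_allowed)
    print(google_delay, semrush_delay)
```

Crawl-delay is a non-standard field, and crawlers differ in whether they honor it, so the 60-second value should be read as a request rather than a guarantee.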

dotbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

twitterbot

Rule Path
Disallow /

jamesbot

Rule Path
Disallow /

worldping-api

Rule Path
Disallow /

worldping

Rule Path
Disallow /

*

Rule Path
Disallow /refresh
Disallow /json
Disallow /deals/json
Disallow /admin/*
Disallow /phpinfo.php
Disallow /api
Disallow /README.md
Disallow /js/
Disallow /assets/
Disallow /css/

Other Records

Field Value
crawl-delay 60
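
The `*` group applies to any crawler that does not match one of the named groups above. As a rough illustration, the sketch below rebuilds that group as robots.txt text and queries it with Python's standard `urllib.robotparser`. The reconstructed text and the agent name `examplebot` are assumptions made for the example. Note that the standard-library parser does plain prefix matching and does not interpret the `*` wildcard in `/admin/*`, so that rule is left out of the checks.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in this report (assumption: the real
# file may order or format these lines differently).
DEFAULT_GROUP = """\
User-agent: *
Disallow: /refresh
Disallow: /json
Disallow: /deals/json
Disallow: /admin/*
Disallow: /phpinfo.php
Disallow: /api
Disallow: /README.md
Disallow: /js/
Disallow: /assets/
Disallow: /css/
Crawl-delay: 60
"""

default_parser = RobotFileParser()
default_parser.parse(DEFAULT_GROUP.splitlines())

AGENT = "examplebot"  # hypothetical crawler with no group of its own

# Paths that match no Disallow prefix stay crawlable.
home_allowed = default_parser.can_fetch(AGENT, "https://headiil.ee/")
deals_allowed = default_parser.can_fetch(AGENT, "https://headiil.ee/deals")

# Disallow rules are prefixes, so "/api" also covers "/api/items".
api_allowed = default_parser.can_fetch(AGENT, "https://headiil.ee/api/items")
deals_json_allowed = default_parser.can_fetch(AGENT, "https://headiil.ee/deals/json")
css_allowed = default_parser.can_fetch(AGENT, "https://headiil.ee/css/site.css")

# The fallback group's crawl-delay applies to unnamed agents.
default_delay = default_parser.crawl_delay(AGENT)

if __name__ == "__main__":
    print(home_allowed, deals_allowed)
    print(api_allowed, deals_json_allowed, css_allowed)
    print(default_delay)
```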