Robots Exclusion Standard data for valgamaalane.ee

Resource Scan

Scan Details

Site Domain valgamaalane.ee
Base Domain valgamaalane.ee
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-11-13T23:34:43+00:00
Next Scan 2024-11-27T23:34:43+00:00

Last Successful Scan

Scanned 2024-10-29T23:31:22+00:00
URL http://valgamaalane.ee/robots.txt
Domain IPs 185.154.221.183, 185.154.221.184
Response IP 185.154.221.184
Found Yes
Hash 47acc75901867eb0ff11f17ffb6732f91c160af2a159a2cb241ed22c2902f322
SimHash 2b0f4660caa1

Groups

*

Rule Path
Disallow /search*
Disallow /latest/*
Disallow /*/print/*
Disallow /print/*
Disallow /*/com/*
Disallow /mobile/*
Disallow /rest/*
Disallow /feed/*
Disallow /weather/*
Disallow /?schedule=*
Disallow /author/*
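
The Disallow paths above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how such patterns might be matched, assuming the common convention that `*` matches any run of characters, that a trailing `$` anchors the end of the path, and that each pattern is otherwise a prefix match. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical and not part of this report; Allow rules and longest-match precedence are left out.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/search*", "/latest/*", "/*/print/*", "/print/*", "/*/com/*",
    "/mobile/*", "/rest/*", "/feed/*", "/weather/*", "/?schedule=*",
    "/author/*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing '$'
    anchors the end of the path; everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, patterns=DISALLOW):
    """Return True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

For example, `is_disallowed("/search?q=valga")` and `is_disallowed("/uudised/print/123")` are both true under these assumptions, while `is_disallowed("/")` is false. The sample paths are made up for illustration.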

bingbot
msnbot
msnbot-media
yandexbot
ahrefsbot
seekportbot

Rule Path
Disallow /search*
Disallow /latest/*
Disallow /*/print/*
Disallow /print/*
Disallow /*/com/*
Disallow /mobile/*
Disallow /?schedule=*
Disallow /rest/*
Disallow /feed/*
Disallow /weather/*
Disallow /author/*

Other Records

Field Value
crawl-delay 60
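
A crawler that wants to honor the crawl-delay record for this group could read it with the standard-library `urllib.robotparser`. The sketch below is illustrative only: the robots.txt fragment is reconstructed from the parsed group above, so its exact layout is an assumption, the Disallow lines are omitted, and the helper name `delay_for` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment: the report only lists parsed groups, so the exact
# layout of the original file is an assumption. Disallow lines are omitted.
ROBOTS_FRAGMENT = """\
User-agent: bingbot
User-agent: msnbot
User-agent: msnbot-media
User-agent: yandexbot
User-agent: ahrefsbot
User-agent: seekportbot
Crawl-delay: 60
"""


def delay_for(user_agent, robots_text=ROBOTS_FRAGMENT, default=0):
    """Return the crawl delay in seconds for user_agent, or default if none applies."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    delay = parser.crawl_delay(user_agent)
    return default if delay is None else delay
```

With this fragment, `delay_for("bingbot")` gives 60, and an agent that is not named in the group falls back to the default. A polite fetch loop would then sleep for that many seconds between requests.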

mediapartners-google

Rule Path
Disallow

discobot
dotbot
yacybot
petalbot
semrushbot
grapeshot
barkrowler

Rule Path
Disallow *

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

femtosearchbot

Rule Path
Disallow *

Other Records

Field Value
sitemap https://valgamaalane.ee/sitemap