krizovkyslovnik.cz
robots.txt

Robots Exclusion Standard data for krizovkyslovnik.cz

Resource Scan

Scan Details

Site Domain krizovkyslovnik.cz
Base Domain krizovkyslovnik.cz
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-02T17:25:54+00:00
Next Scan 2024-10-16T17:25:54+00:00

Last Successful Scan

Scanned2024-09-17T17:21:57+00:00
URL https://krizovkyslovnik.cz/robots.txt
Domain IPs 89.221.217.219
Response IP 89.221.217.219
Found Yes
Hash c9894f64577fe1a91012c950276964cd8528b654d93e9b36fc611e1d2db1cda0
SimHash 2a59e4522670

Groups

*

Rule Path
Disallow /oou
Disallow /najit
Disallow /projit
Disallow /pridat
Disallow /cookies

bingbot

Rule Path
Disallow /oou
Disallow /najit
Disallow /projit
Disallow /pridat
Disallow /cookies

Other Records

Field Value
crawl-delay 300

ahrefsbot
amazonbot
archive.org_bot
baiduspider
barkrowler
bleriot
blexbot
bubing
ccbot
cliqzbot
dataforseobot
dataprovider
desu
deusu
dnyzbot
dotbot
embedly
exabot
genieo
heritrix
ia_archiver
lcc
linguee
linkdexbot
linkpadbot
ltx71
mauibot
megaindex.ru
mj12bot
msnbot
netseer
qwantify
riddler
seekport
semrushbot
semrushbot-sa
seokicks
seokicks-robot
serpstatbot
smtbot
sogou
spbot
spiderling
surdotlybot
turnitinbot
uptimebot
velenpublicwebcrawler
xovibot
yandexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://krizovkyslovnik.cz/sitemap.xml

Warnings

  • 1 invalid line.