cls.cz
robots.txt

Robots Exclusion Standard data for cls.cz

Resource Scan

Scan Details

Site Domain cls.cz
Base Domain cls.cz
Scan Status Ok
Last Scan2026-01-26T05:28:12+00:00
Next Scan 2026-02-09T05:28:12+00:00

Last Scan

Scanned2026-01-26T05:28:12+00:00
URL https://cls.cz/robots.txt
Redirect https://www.cls.cz/robots.txt
Redirect Domain www.cls.cz
Redirect Base cls.cz
Domain IPs 185.64.216.251
Redirect IPs 185.64.216.251
Response IP 185.64.216.251
Found Yes
Hash 219715333a1850f70054c794fd7782e46e0220f00f099c7397f0da3dd87e198e
SimHash f038dc888da2

Groups

ahrefsbot
andjing
aperture
arachnid
arale
aspeek
bixo
capek
ccrawler
crawwwler
dataparksearch
distributed web crawler
dotbot
ebot
gnu wget
grub
heritrix
hounder
ht://dig
hyper estraier
hyperspider
icdl crawler
icrawler
jobo
larm
metis
mj12bot
mnogosearch
nodecrawler
norconex http collector
nutch
openwebspider
opese
pavuk
petalbot/aspiegelbot
php-crawler
pycreep
pyspider
scrapy
semrushbot
sphider
stormcrawler
web harvest
webeater
weblech
websphinx
xapian
yacy

Rule Path
Disallow /

*

Rule Path
Disallow /admin
Disallow /prihlaseni
Disallow /registrace
Disallow /clanek-poslat
Disallow /clanek-pdf
Disallow /url_ext
Disallow /3366681
Disallow /vl/click
Disallow /vl/view
Disallow /linkout

mediapartners-google

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.cls.cz/sitemap.xml