blog.idnes.cz
robots.txt

Robots Exclusion Standard data for blog.idnes.cz

Resource Scan

Scan Details

Site Domain blog.idnes.cz
Base Domain idnes.cz
Scan Status Ok
Last Scan 2024-11-09T01:49:28+00:00
Next Scan 2024-11-16T01:49:28+00:00

Last Scan

Scanned 2024-11-09T01:49:28+00:00
URL https://blog.idnes.cz/robots.txt
Domain IPs 185.17.117.47
Response IP 185.17.117.47
Found Yes
Hash a35caa3903d1537b5bbbc7469858a8b1c56caf6533147fa372ef7118d351bc54
SimHash f11849409215

Groups

*

Rule Path
Disallow /*/diskuse*reakce%3D
Disallow /*/diskuse*vlakno%3D
Disallow /*/diskuse*strana%3D
Disallow /*/diskuse/
Allow /*/diskuse$
Disallow /*/diskuse*razeni%3D
Disallow /*/tisk$
Disallow /*/ankety/
Disallow /ankety.aspx*hlasuj
Disallow /export/
Disallow /data.aspx
Disallow /*/undefined
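
The wildcard rules in the `*` group can be hard to read at a glance, so here is a minimal sketch of how a crawler might evaluate them. It assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), treats `*` as "any run of characters" and a trailing `$` as "end of path". The helper names `pattern_to_regex` and `is_allowed` and the sample paths are hypothetical, and the sketch does no percent-encoding normalization, so it is an illustration rather than a complete parser.

```python
import re

# Rules of the "*" group, copied from the listing above.
RULES = [
    ("disallow", "/*/diskuse*reakce%3D"),
    ("disallow", "/*/diskuse*vlakno%3D"),
    ("disallow", "/*/diskuse*strana%3D"),
    ("disallow", "/*/diskuse/"),
    ("allow", "/*/diskuse$"),
    ("disallow", "/*/diskuse*razeni%3D"),
    ("disallow", "/*/tisk$"),
    ("disallow", "/*/ankety/"),
    ("disallow", "/ankety.aspx*hlasuj"),
    ("disallow", "/export/"),
    ("disallow", "/data.aspx"),
    ("disallow", "/*/undefined"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under the given rules."""
    best = None  # (pattern length, is_allow) of the best match so far
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow (True) beats disallow.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/novak/diskuse")` returns `True` because only the `Allow /*/diskuse$` rule matches, while `is_allowed("/novak/diskuse/123")` returns `False` because `Disallow /*/diskuse/` matches and the anchored Allow rule does not.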

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /
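
As a rough illustration of how the named groups above relate to the `*` group, the sketch below picks the group that applies to a crawler. It assumes exact, case-insensitive matching on the product token with a fallback to `*`, which is how RFC 9309 describes group selection; real crawlers may match more loosely. The function name `group_for` is hypothetical.

```python
# User-agent groups from the listing above; each one carries "Disallow /".
BLOCKED_AGENTS = {
    "machinelearning", "gptbot", "google-extended", "perplexitybot",
    "amazonbot", "claudebot", "omgilibot", "facebookbot", "applebot",
    "anthropic-ai", "bytespider", "claude-web", "diffbot",
    "imagesiftbot", "omgili", "youbot",
}


def group_for(product_token):
    """Return the name of the group that applies to a crawler token."""
    token = product_token.lower()
    # A crawler with its own group uses it; everyone else falls back to "*".
    return token if token in BLOCKED_AGENTS else "*"


def is_fully_blocked(product_token):
    """True if the crawler's group disallows the whole site."""
    return group_for(product_token) != "*"
```

With this selection rule, a token such as `GPTBot` resolves to the `gptbot` group and is blocked from the whole site, while a token with no group of its own, such as `Googlebot`, is governed by the path rules of the `*` group.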

Other Records

Field Value
sitemap https://blog.idnes.cz/sitemap?type=sitemap