tv.idnes.cz
robots.txt

Robots Exclusion Standard data for tv.idnes.cz

Resource Scan

Scan Details

Site Domain tv.idnes.cz
Base Domain idnes.cz
Scan Status Ok
Last Scan2024-11-11T22:40:19+00:00
Next Scan 2024-11-18T22:40:19+00:00

Last Scan

Scanned2024-11-11T22:40:19+00:00
URL https://tv.idnes.cz/robots.txt
Domain IPs 185.17.117.42
Response IP 185.17.117.42
Found Yes
Hash a11df7d753d68a87b6236fee55921eb6faa0ca1113a8ac35b5b6c037a8f3a303
SimHash 71109941e617

Groups

*

Rule Path
Disallow /diskuse.asp*reakce%3D
Disallow /diskuse.asp*vlakno%3D
Disallow /diskuse.asp*strana%3D
Disallow /diskuse.asp*razeni%3D
Disallow /tiskni.asp
Disallow /Tiskni.asp
Disallow /*/ankety.asp
Disallow /*/Ankety.asp
Disallow /ankety.asp*hlasuj
Disallow /clanek.aspx?c=*
Disallow /Clanek.aspx?c=*
Disallow /*/clanek.aspx?c=*
Disallow /*/Clanek.aspx?c=*
Disallow /export/
Disallow /data.aspx
Disallow /*/undefined
Disallow /*/tv-mimohp.aspx
Disallow /*/tv-podcast.aspx

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://tv.idnes.cz/sitemap?type=sitemap