pocasi.idnes.cz
robots.txt

Robots Exclusion Standard data for pocasi.idnes.cz

Resource Scan

Scan Details

Site Domain pocasi.idnes.cz
Base Domain idnes.cz
Scan Status Ok
Last Scan2024-11-12T15:49:06+00:00
Next Scan 2024-11-19T15:49:06+00:00

Last Scan

Scanned2024-11-12T15:49:06+00:00
URL https://pocasi.idnes.cz/robots.txt
Domain IPs 185.17.117.46
Response IP 185.17.117.46
Found Yes
Hash 171b621ce0b6b090cc0a1f9da7d9711664eb9d7e19c5fd73a69fd92d08e394de
SimHash 40109860e437

Groups

*

Rule Path
Disallow /diskuse.asp*reakce%3D
Disallow /diskuse.asp*vlakno%3D
Disallow /diskuse.asp*strana%3D
Disallow /diskuse.asp*razeni%3D
Disallow /tiskni.asp
Disallow /Tiskni.asp
Disallow /*/ankety.asp
Disallow /*/Ankety.asp
Disallow /ankety.asp*hlasuj
Disallow /clanek.aspx?c=*
Disallow /Clanek.aspx?c=*
Disallow /profil.aspx
Disallow /Profil.aspx
Disallow /popup/
Disallow *%26noindex*
Disallow /SocialNetworks.aspx
Allow /

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://pocasi.idnes.cz/sitemap?type=sitemap