vice.idnes.cz
robots.txt

Robots Exclusion Standard data for vice.idnes.cz

Resource Scan

Scan Details

Site Domain vice.idnes.cz
Base Domain idnes.cz
Scan Status Ok
Last Scan2024-05-19T11:58:33+00:00
Next Scan 2024-06-18T11:58:33+00:00

Last Scan

Scanned2024-05-19T11:58:33+00:00
URL https://vice.idnes.cz/robots.txt
Domain IPs 185.17.117.45
Response IP 185.17.117.45
Found Yes
Hash 56124a6613b904f57249d916f1f3077ca40ef36306b39f59792043a9cf6614f4
SimHash 00409822e233

Groups

*

Rule Path
Disallow /diskuse.asp*reakce%3D
Disallow /diskuse.asp*vlakno%3D
Disallow /diskuse.asp*strana%3D
Disallow /diskuse.asp*razeni%3D
Disallow /tiskni.asp
Disallow /Tiskni.asp
Disallow /*/ankety.asp
Disallow /*/Ankety.asp
Disallow /ankety.asp*hlasuj
Disallow /clanek.aspx?c=*
Disallow /Clanek.aspx?c=*
Disallow /profil.aspx
Disallow /Profil.aspx
Disallow /popup/
Allow /

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /