idnes.cz
robots.txt

Robots Exclusion Standard data for idnes.cz

Resource Scan

Scan Details

Site Domain idnes.cz
Base Domain idnes.cz
Scan Status Ok
Last Scan2024-05-02T08:47:16+00:00
Next Scan 2024-05-09T08:47:16+00:00

Last Scan

Scanned2024-05-02T08:47:16+00:00
URL https://idnes.cz/robots.txt
Redirect https://www.idnes.cz/robots.txt
Redirect Domain www.idnes.cz
Redirect Base idnes.cz
Domain IPs 185.17.117.32
Redirect IPs 185.17.117.32
Response IP 185.17.117.32
Found Yes
Hash e989e9857f23085a7ec723579f0c6de73eb1bd09a0aa2fcc0faa37af6dfb570c
SimHash a90449029311

Groups

*

Rule Path
Disallow /*/diskuse*reakce%3D
Disallow /*/diskuse*vlakno%3D
Disallow /*/diskuse*strana%3D
Disallow /*/diskuse/
Allow /*/diskuse$
Disallow /*/diskuse*razeni%3D
Disallow /*/tisk$
Disallow /*/ankety/
Disallow /ankety.aspx*hlasuj
Disallow /export/
Disallow /data.aspx
Disallow /*/undefined

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.idnes.cz/sitemap.xml