liberec.idnes.cz
robots.txt

Robots Exclusion Standard data for liberec.idnes.cz

Resource Scan

Scan Details

Site Domain liberec.idnes.cz
Base Domain idnes.cz
Scan Status Ok
Last Scan 2024-04-26T18:31:02+00:00
Next Scan 2024-05-26T18:31:02+00:00

Last Scan

Scanned 2024-04-26T18:31:02+00:00
URL https://liberec.idnes.cz/robots.txt
Domain IPs 185.17.117.45
Response IP 185.17.117.45
Found Yes
Hash 52a91f2dc603927456f649ac10720fe75d1b75c78eb46c6eb8b781dcf6979a8a
SimHash 08449c20e6b7

Groups

*

Rule Path
Disallow /diskuse.asp*reakce%3D
Disallow /diskuse.asp*vlakno%3D
Disallow /diskuse.asp*strana%3D
Disallow /diskuse.asp*razeni%3D
Disallow /tiskni.asp
Disallow /Tiskni.asp
Disallow /*/ankety.asp
Disallow /*/Ankety.asp
Disallow /ankety.asp*hlasuj
Disallow /clanek.aspx?c=*
Disallow /Clanek.aspx?c=*
Disallow /*/clanek.aspx?c=*
Disallow /*/Clanek.aspx?c=*
Disallow /export/
Disallow /data.aspx
Disallow /*/undefined
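
Several of the paths above use the `*` wildcard, and some appear in two capitalisations (for example /tiskni.asp and /Tiskni.asp), which suggests that matching is case-sensitive. The following is a minimal sketch of how a crawler might evaluate such rules, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and every rule is otherwise a prefix match. The names `rule_to_regex`, `is_disallowed` and `RULES` are hypothetical and chosen for illustration. The sketch compares raw strings, does no percent-encoding normalisation, and ignores Allow rules, since this file has none.

```python
import re

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is literal, and matching is case-sensitive.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(rule_to_regex(p).match(path) for p in patterns)

# A subset of the Disallow paths listed for the '*' group above.
RULES = [
    "/diskuse.asp*reakce%3D",
    "/tiskni.asp",
    "/*/ankety.asp",
    "/clanek.aspx?c=*",
    "/export/",
    "/*/undefined",
]

if __name__ == "__main__":
    for path in ["/tiskni.asp?x=1", "/zpravy/ankety.asp", "/Export/", "/zpravy/"]:
        print(path, is_disallowed(path, RULES))
```

Because `re.Pattern.match` only anchors at the start of the string, an unanchored rule behaves as a prefix match, which is why /tiskni.asp also covers /tiskni.asp?x=1.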

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
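
The three groups above apply to different crawlers: `machinelearning` and `gptbot` are blocked from the whole site, while every other crawler falls under `*`. As a rough sketch of how a crawler might pick its group, assuming a case-insensitive exact match on the crawler's product token with a fallback to `*`, consider the following. `GROUPS` and `select_group` are hypothetical names, and the `*` entry is abbreviated to a few of the paths listed earlier.

```python
# Groups from the scan, keyed by user-agent token.
# The '*' list is an abbreviated excerpt, not the full rule set.
GROUPS = {
    "*": ["/diskuse.asp*reakce%3D", "/tiskni.asp", "/export/"],
    "machinelearning": ["/"],
    "gptbot": ["/"],
}

def select_group(product_token: str, groups: dict) -> list:
    """Return the Disallow list for a crawler's product token.

    Tokens are compared case-insensitively; crawlers without a
    dedicated group fall back to the '*' group (or to no rules).
    """
    key = product_token.lower()
    if key in groups:
        return groups[key]
    return groups.get("*", [])

if __name__ == "__main__":
    print(select_group("GPTBot", GROUPS))     # dedicated group
    print(select_group("Googlebot", GROUPS))  # falls back to '*'
```

Some parsers match user-agent tokens by substring rather than exact comparison, so real crawlers may behave differently from this sketch.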

Other Records

Field Value
sitemap https://liberec.idnes.cz/sitemap.aspx?type=sitemap