sdeleni.idnes.cz
robots.txt

Robots Exclusion Standard data for sdeleni.idnes.cz

Resource Scan

Scan Details

Site Domain sdeleni.idnes.cz
Base Domain idnes.cz
Scan Status Ok
Last Scan2024-11-09T02:47:17+00:00
Next Scan 2024-11-16T02:47:17+00:00

Last Scan

Scanned2024-11-09T02:47:17+00:00
URL https://sdeleni.idnes.cz/robots.txt
Domain IPs 185.17.117.42
Response IP 185.17.117.42
Found Yes
Hash ffb3aacfe9bc0c1bcef00801837e6aa7d23247085f9200aff36a894a69ed7f01
SimHash 74149940e697

Groups

*

Rule Path
Disallow /diskuse.asp*reakce%3D
Disallow /diskuse.asp*vlakno%3D
Disallow /diskuse.asp*strana%3D
Disallow /diskuse.asp*razeni%3D
Disallow /tiskni.asp
Disallow /Tiskni.asp
Disallow /*/ankety.asp
Disallow /*/Ankety.asp
Disallow /ankety.asp*hlasuj
Disallow /clanek.aspx?c=*
Disallow /Clanek.aspx?c=*
Disallow /*/clanek.aspx?c=*
Disallow /*/Clanek.aspx?c=*
Disallow /export/
Disallow /data.aspx
Disallow /*/undefined

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://sdeleni.idnes.cz/sitemap?type=sitemap