mafra.cz
robots.txt

Robots Exclusion Standard data for mafra.cz

Resource Scan

Scan Details

Site Domain mafra.cz
Base Domain mafra.cz
Scan Status Ok
Last Scan 2025-01-31T17:13:08+00:00
Next Scan 2025-02-07T17:13:08+00:00

Last Scan

Scanned 2025-01-31T17:13:08+00:00
URL https://mafra.cz/robots.txt
Redirect https://www.mafra.cz/robots.txt
Redirect Domain www.mafra.cz
Redirect Base mafra.cz
Domain IPs 185.17.117.33
Redirect IPs 185.17.117.45
Response IP 185.17.117.45
Found Yes
Hash 0f5908b91d0b24c219856d774126a962ed41c86f2f1c9ed75bd91795a5e32eb6
SimHash 74509940e697

Groups

*

Rule Path
Disallow /diskuse.asp*reakce%3D
Disallow /diskuse.asp*vlakno%3D
Disallow /diskuse.asp*strana%3D
Disallow /diskuse.asp*razeni%3D
Disallow /tiskni.asp
Disallow /Tiskni.asp
Disallow /*/ankety.asp
Disallow /*/Ankety.asp
Disallow /ankety.asp*hlasuj
Disallow /clanek.aspx?c=*
Disallow /Clanek.aspx?c=*
Disallow /*/clanek.aspx?c=*
Disallow /*/Clanek.aspx?c=*
Disallow /export/
Disallow /data.aspx
Disallow /*/undefined
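
Several of the rules above rely on the `*` wildcard, so a plain prefix comparison is not enough to evaluate them. The following is a minimal sketch of how such patterns could be matched, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The names `rule_to_regex`, `is_disallowed`, and `RULES` are hypothetical, the rule list is only a subset of the table above, and percent-encoding normalization is deliberately left out.

```python
import re

# Subset of the "*" group rules listed in this report (illustrative only).
RULES = [
    "/diskuse.asp*reakce%3D",
    "/tiskni.asp",
    "/ankety.asp*hlasuj",
    "/*/clanek.aspx?c=*",
    "/export/",
    "/*/undefined",
]

def rule_to_regex(rule_path: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape the literal pieces, then join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(part) for part in rule_path.split("*"))
    if anchored:
        pattern += "$"
    return re.compile(pattern)

def is_disallowed(path: str, rules=RULES) -> bool:
    """Return True if any rule matches the path from its first character."""
    # re.match anchors at the start, which mirrors prefix-style matching.
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

Matching here is case-sensitive, which is consistent with the report listing both `/tiskni.asp` and `/Tiskni.asp` as separate rules.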

machinelearning

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /
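
The named groups above each carry a single `Disallow /` rule, so those agents are blocked from the whole site, while every other agent falls back to the `*` group. A rough sketch of checking this with Python's standard `urllib.robotparser` follows. The robots.txt text is reconstructed from this report rather than copied from the live file, so the exact casing and ordering are assumptions, and `ExampleBot` is a hypothetical agent name. Only non-wildcard rules are included because this parser treats `*` in a path literally.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (assumption): two named groups and one "*" rule
# taken from the tables in this report.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /export/",
    "",
    "User-agent: gptbot",
    "Disallow: /",
    "",
    "User-agent: claudebot",
    "Disallow: /",
]

rp = RobotFileParser()
rp.parse(ROBOTS_LINES)  # parse in-memory lines instead of fetching a URL

def may_fetch(agent: str, url: str) -> bool:
    """Thin wrapper around RobotFileParser.can_fetch for readability."""
    return rp.can_fetch(agent, url)

if __name__ == "__main__":
    for agent in ("gptbot", "claudebot", "ExampleBot"):
        for url in ("https://www.mafra.cz/", "https://www.mafra.cz/export/feed.xml"):
            print(agent, url, may_fetch(agent, url))
```

Under these assumptions, the named agents are refused every URL, and an unlisted agent is refused only paths under `/export/`.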