tn.nova.cz
robots.txt

Robots Exclusion Standard data for tn.nova.cz

Resource Scan

Scan Details

Site Domain tn.nova.cz
Base Domain nova.cz
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-08-04T17:09:33+00:00
Next Scan 2024-10-03T17:09:33+00:00

Last Successful Scan

Scanned2024-05-14T17:08:19+00:00
URL https://tn.nova.cz/robots.txt
Domain IPs 104.18.28.12, 104.18.29.12, 2606:4700::6812:1c0c, 2606:4700::6812:1d0c
Response IP 104.18.29.12
Found Yes
Hash 98ad6d74d6486466911b6a0802e8e3e2d17abcb4b12d6ce0b0533a27fe258d40
SimHash 5951596f6893

Groups

*

Rule Path
Disallow /*?order*
Disallow /app/
Disallow /bin/
Disallow /lbin/
Disallow /ajax/
Disallow /api/v1/cmp/
Disallow /api/v1/program/
Disallow /sport/ms-2022-fotbal/hrac

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

machinelearning

Rule Path
Disallow /

Other Records

Field Value
sitemap https://tn.nova.cz/api/v1/sitemap-index

Comments

  • Welcome, dear robots, but not all of you!