tv.nova.cz
robots.txt

Robots Exclusion Standard data for tv.nova.cz

Resource Scan

Scan Details

Site Domain tv.nova.cz
Base Domain nova.cz
Scan Status Ok
Last Scan2024-04-29T12:05:39+00:00
Next Scan 2024-05-06T12:05:39+00:00

Last Scan

Scanned2024-04-29T12:05:39+00:00
URL https://tv.nova.cz/robots.txt
Domain IPs 104.18.28.12, 104.18.29.12, 2606:4700::6812:1c0c, 2606:4700::6812:1d0c
Response IP 104.18.29.12
Found Yes
Hash f0b95e51f1a4aa1e7811a4e92641b68d11c99a352428bb0ec8c4d524987d4ffd
SimHash 09155d7dcc93

Groups

*

Rule Path
Disallow /bin/
Disallow /*?back_url
Disallow /api/v1/user/*
Disallow /uzivatelsky-profil

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

machinelearning

Rule Path
Disallow /

Other Records

Field Value
sitemap https://tv.nova.cz/api/v1/sitemap-index

Comments

  • Welcome, dear robots, but not all of you!