cn.cz
robots.txt

Robots Exclusion Standard data for cn.cz

Resource Scan

Scan Details

Site Domain cn.cz
Base Domain cn.cz
Scan Status Ok
Last Scan2024-11-07T21:47:26+00:00
Next Scan 2024-11-14T21:47:26+00:00

Last Scan

Scanned2024-11-07T21:47:26+00:00
URL https://cn.cz/robots.txt
Redirect https://www.ceskenoviny.cz/robots.txt
Redirect Domain www.ceskenoviny.cz
Redirect Base ceskenoviny.cz
Domain IPs 2a01:430:0:37::48, 80.79.27.48
Redirect IPs 2a01:430:0:37::48, 80.79.27.48
Response IP 80.79.27.48
Found Yes
Hash 0bcc845aa3bc38e017f292135d0fb33bd1d03670fcf243e9119dd35e9e0e999c
SimHash 815a9b261137

Groups

*

Rule Path
Disallow /.*
Disallow /vyhledavani/
Disallow /tema/

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

machinelearning

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.ceskenoviny.cz/sitemap.php
sitemap https://www.ceskenoviny.cz/sitemap_zpravy.php

Comments

  • CN robots.txt
  • For new training only
  • Not for training, only for user requests
  • Marker for disabling Bard and Vertex AI
  • Speech synthesis
  • Multi-purpose, commercial uses; including LLMs
  • suggested by SPIR