/.well-known/

Log In Sign Up

ciql.net
robots.txt

Robots Exclusion Standard data for ciql.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ciql.net
Base Domain	ciql.net
Scan Status	Ok
Last Scan	2025-09-02T09:57:10+00:00
Next Scan	2025-09-03T09:57:10+00:00

Last Scan

Scanned	2025-09-02T09:57:10+00:00
URL	https://ciql.net/robots.txt
Domain IPs	24.199.97.60, 2604:a880:4:1d0::681:9000
Response IP	24.199.97.60
Found	Yes
Hash	d258814c64c1ad13f8b536864a20ad4bf7ad7cc4c705e082d5faafd6353c5933
SimHash	74d45b05c7e4

Groups

*

Rule

Path

Disallow

/works/

Disallow

/works?

Disallow

/p/

Disallow

/assets/

gptbot
gptbot-user
chatgpt
chatgpt-user
ai2bot
ai2bot-dolma
aihitbot
amazonbot
anthropic-ai
applebot
applebot-extended
brightbot 1.0
bytespider
ccbot
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawlspace
diffbot
duckassistbot
facebookbot
factset_spyderbot
firecrawlagent
friendlycrawler
google-cloudvertexbot
google-extended
googleother
googleother-image
googleother-video
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
imgproxy
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalagent
meta-externalfetcher
meta-externalfetcher
novaact
oai-searchbot
omgili
omgilibot
operator
pangubot
perplexity-user
perplexitybot
petalbot
qualifiedbot
scrapy
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
tiktokspider
timpibot
velenpublicwebcrawler
webzio-extended
youbot

Rule

Path

Disallow

/

gptbot
claudebot
claude-user
claude-searchbot
ccbot
google-extended
applebot-extended
facebookbot
meta-externalagent
meta-externalfetcher
diffbot
perplexitybot
perplexityâuser
omgili
omgilibot
webzio-extended
imagesiftbot
bytespider
tiktokspider
amazonbot
youbot
semrushbot-ocob
petalbot
velenpublicwebcrawler
turnitinbot
timpibot
oai-searchbot
icc-crawler
ai2bot
ai2bot-dolma
dataforseobot
awariobot
awariosmartbot
awariorssbot
google-cloudvertexbot
pangubot
kangaroo bot
sentibot
img2dataset
meltwater
seekr
peer39_crawler
cohere-ai
cohere-training-data-crawler
duckassistbot
scrapy
cotoyogi
aihitbot
factset_spyderbot
firecrawlagent

Rule

Path

Disallow

/

*

Rule

Path

Allow

/

Back to top

Comments

Block all known AI crawlers and assistants
from using content for training AI models.
Source: https://robotstxt.com/ai
Block any non-specified AI crawlers (e.g., new
or unknown bots) from using content for training
AI models, while allowing the website to be
indexed and accessed by bots. These directives
are still experimental and may not be supported
by all AI crawlers.

Back to top

Warnings

`content-usage` is not a known field.
`disallowaitraining` is not a known field.

Back to top