cyanusai.com
robots.txt

Robots Exclusion Standard data for cyanusai.com

Resource Scan

Scan Details

Site Domain cyanusai.com
Base Domain cyanusai.com
Scan Status Ok
Last Scan2025-11-01T09:40:55+00:00
Next Scan 2025-11-08T09:40:55+00:00

Last Scan

Scanned2025-11-01T09:40:55+00:00
URL https://cyanusai.com/robots.txt
Domain IPs 104.21.93.210, 172.67.214.199, 2606:4700:3031::6815:5dd2, 2606:4700:3032::ac43:d6c7
Response IP 104.21.93.210
Found Yes
Hash 8550a987c0dfa127df7d3e2cbe78bd0c2153b65bb487cee07e65fc608198764c
SimHash 4435cb51c5d4

Groups

*

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

*

Rule Path
Allow /

google-adstxt

Rule Path
Disallow

mediapartners-google

Rule Path
Allow /app-ads.txt
Disallow

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /static/images/
Allow /static/favicon/

bingbot

Rule Path
Allow /

baiduspider

Rule Path
Allow /

yandexbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://cyanusai.com/sitemap.xml

Comments

  • As a condition of accessing this website, you agree to abide by the following
  • content signals:
  • (a) If a content-signal = yes, you may collect content for the corresponding
  • use.
  • (b) If a content-signal = no, you may not collect content for the
  • corresponding use.
  • (c) If the website operator does not include a content signal for a
  • corresponding use, the website operator neither grants nor restricts
  • permission via content signal with respect to the corresponding use.
  • The content signals and their meanings are:
  • search: building a search index and providing search results (e.g., returning
  • hyperlinks and short excerpts from your website's contents). Search does not
  • include providing AI-generated search summaries.
  • ai-input: inputting content into one or more AI models (e.g., retrieval
  • augmented generation, grounding, or other real-time taking of content for
  • generative AI search answers).
  • ai-train: training or fine-tuning AI models.
  • ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
  • RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT
  • AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
  • Cyanus AI Website Robots.txt
  • Last updated: 2025-05-06
  • Allow all bots to access the entire site by default
  • Google AdMob crawler - exactly as recommended by Google
  • Google AdSense crawler
  • Google search crawler
  • Google image crawler
  • Bing bot
  • Baidu bot (popular in China)
  • Yandex bot
  • Sitemap location

Warnings

  • `content-signal` is not a known field.