cuzi.ai
robots.txt

Robots Exclusion Standard data for cuzi.ai

Resource Scan

Scan Details

Site Domain cuzi.ai
Base Domain cuzi.ai
Scan Status Ok
Last Scan 2026-01-11T12:35:53+00:00
Next Scan 2026-02-10T12:35:53+00:00

Last Scan

Scanned 2026-01-11T12:35:53+00:00
URL https://cuzi.ai/robots.txt
Domain IPs 104.21.5.197, 172.67.133.204, 2606:4700:3032::ac43:85cc, 2606:4700:3035::6815:5c5
Response IP 104.21.5.197
Found Yes
Hash bfafff481315f3e8bdec85d89edb71925992a10005167ed6f1f4337b14e569a9
SimHash d10c8fe2ea92

Groups

*

Rule Path
Allow /

gptbot
claude-web
anthropic-ai
perplexitybot
googleother
duckassistbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://cuzi.ai/sitemap.xml
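The groups and sitemap record above can be checked with Python's standard urllib.robotparser. This is a minimal sketch, not the scanner's method: the RECONSTRUCTED text is an assumption rebuilt from the tables in this report (the real file also contains comments and fields not shown here), and build_parser is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Assumption: rebuilt from the Groups and Other Records tables in this report.
# The real file's comments, ordering, and unknown fields are not reproduced.
RECONSTRUCTED = """\
User-agent: *
Allow: /

User-agent: gptbot
User-agent: claude-web
User-agent: anthropic-ai
User-agent: perplexitybot
User-agent: googleother
User-agent: duckassistbot
Allow: /

Sitemap: https://cuzi.ai/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Hypothetical helper: parse robots.txt text without fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(RECONSTRUCTED)

# Both groups carry a single "Allow: /" rule, so every path is permitted
# for the named AI crawlers as well as for any other user agent.
print(parser.can_fetch("gptbot", "https://cuzi.ai/"))
print(parser.can_fetch("SomeOtherBot", "https://cuzi.ai/any/path"))

# Sitemap records are collected separately from the groups (Python 3.8+).
print(parser.site_maps())
```

Parsing from text rather than calling read() keeps the sketch offline and independent of the live file, which may have changed since the scan.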

Comments

  • General search engine rules
  • Disallow: /private/
  • Sitemap
  • AI crawler-specific rules
  • Direct AI crawlers to llms.txt
  • Allow AI crawlers access
  • Disallow AI crawlers access
  • Disallow: /user-content/

Warnings

  • `llm-content` is not a known field.
  • `llm-full-content` is not a known field.
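Warnings like the two above arise when a line uses a field name outside the set a parser recognizes. The sketch below shows one way such a check could work; it is not the scanner's actual logic. KNOWN_FIELDS is an assumed list, unknown_fields is a hypothetical function, and the values given to llm-content and llm-full-content in SAMPLE are placeholders, since the report does not show the real values.

```python
# Assumption: the set of field names treated as "known". The scanner's
# real list is not given in this report.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(text: str) -> list[str]:
    """Return unrecognized field names, lowercased, in first-seen order."""
    seen: list[str] = []
    for raw in text.splitlines():
        # Drop trailing comments, then surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # Field names are case-insensitive; split only on the first colon
        # so that URLs in the value stay intact.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen


# Hypothetical sample: the two llm-* values are placeholders, not the
# values used by cuzi.ai.
SAMPLE = """\
User-agent: *
Allow: /
LLM-Content: /example-path
LLM-Full-Content: /example-full-path
Sitemap: https://cuzi.ai/sitemap.xml
"""

for name in unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```

Crawlers that follow the Robots Exclusion Protocol are expected to skip lines they do not recognize, so fields like these would not normally affect the Allow and Disallow rules.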