blog.ferstar.org
robots.txt

Robots Exclusion Standard data for blog.ferstar.org

Resource Scan

Scan Details

Site Domain blog.ferstar.org
Base Domain ferstar.org
Scan Status Ok
Last Scan 2026-03-11T22:32:44+00:00
Next Scan 2026-04-10T22:32:44+00:00

Last Scan

Scanned 2026-03-11T22:32:44+00:00
URL https://blog.ferstar.org/robots.txt
Domain IPs 104.21.83.52, 172.67.214.206, 2606:4700:3033::6815:5334, 2606:4700:3035::ac43:d6ce
Response IP 104.21.83.52
Found Yes
Hash 7b50b5d3eef0d0198bb9edf370f1a149394d4343c150530171b6de79d6e4eaed
SimHash 18406ac22fb1

Groups

*

Rule Path
Disallow /categories/
Disallow /tags/
Disallow /post/issue-*
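
As a rough illustration of how a crawler might apply the three Disallow rules above, here is a minimal Python sketch. It assumes the common wildcard semantics: rules match as path prefixes, `*` matches any run of characters, and a trailing `$` anchors the end of the path. The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are hypothetical helpers and not part of the scan. Because the group has no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow rules for the "*" group, copied from the scan above.
DISALLOW = ["/categories/", "/tags/", "/post/issue-*"]

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a regex (hypothetical helper)."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))

def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the start of the URL path."""
    # re.match anchors at the beginning of the string, giving prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)
```

Under these assumptions, `is_disallowed("/tags/python/")` and `is_disallowed("/post/issue-42/")` are true, while `is_disallowed("/post/hello/")` is false. The trailing `*` in `/post/issue-*` adds nothing beyond the plain prefix `/post/issue-`, since prefix matching already covers it.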

Other Records

Field Value
sitemap https://blog.ferstar.org/sitemap.xml

Comments

  • --------------------------------------------------------------------------
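  • Cloudflare Content Signals (a content licensing protocol for the AI era)
  • --------------------------------------------------------------------------
  • Allow traditional search engines (Google, Bing) to index the site, preserving the baseline of 16k+ unique visitors (UV)
  • search: yes
  • Allow real-time AI citation (e.g. Perplexity, ChatGPT with web browsing), which brings you targeted traffic
  • ai-input: yes
  • Explicitly refuse AI model training (to keep large models from absorbing the content as a form of rewritten plagiarism)
  • ai-train: no
  • --------------------------------------------------------------------------
  • Standard robots rules
  • --------------------------------------------------------------------------
  • Path blocking: reduces wasted crawler scans of admin pages or empty category pages
  • Sitemap hint: ensures Googlebot can directly find the latest URL structure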
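
The three signals in the comments above (search, ai-input, ai-train) could be read by a client roughly as follows. This is a sketch under an assumption: the scan only shows the signals inside comments, so the single-line form `Content-Signal: search=yes, ai-input=yes, ai-train=no` used here follows Cloudflare's published Content Signals format and is not confirmed to appear in this file. `parse_content_signal` and `EXAMPLE` are hypothetical names.

```python
def parse_content_signal(line: str) -> dict[str, bool]:
    """Parse one 'Content-Signal: key=yes, key=no' line into a dict (hypothetical helper)."""
    name, _, value = line.partition(":")
    if name.strip().lower() != "content-signal":
        raise ValueError("not a Content-Signal line")
    signals: dict[str, bool] = {}
    for item in value.split(","):
        key, sep, flag = item.partition("=")
        if not sep:
            continue  # skip fragments without '='
        flag = flag.strip().lower()
        if flag in ("yes", "no"):
            signals[key.strip().lower()] = (flag == "yes")
    return signals

# Assumed directive line, built from the values listed in the comments above.
EXAMPLE = "Content-Signal: search=yes, ai-input=yes, ai-train=no"
```

With that assumed line, `parse_content_signal(EXAMPLE)` maps `search` and `ai-input` to `True` and `ai-train` to `False`. Values other than `yes` or `no` are ignored, so a missing key can be read as "no preference expressed".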