markaicode.com
robots.txt

Robots Exclusion Standard data for markaicode.com

Resource Scan

Scan Details

Site Domain markaicode.com
Base Domain markaicode.com
Scan Status Ok
Last Scan2025-10-31T22:31:04+00:00
Next Scan 2025-11-07T22:31:04+00:00

Last Scan

Scanned2025-10-31T22:31:04+00:00
URL https://markaicode.com/robots.txt
Domain IPs 104.21.78.89, 172.67.219.18, 2606:4700:3032::6815:4e59, 2606:4700:3035::ac43:db12
Response IP 104.21.78.89
Found Yes
Hash 383b2b2edc6a89778b806602ad2a7a9a98b89db64e4f07fbf16f84cbde9a1281
SimHash 7d158e76e4bb

Groups

*

Rule Path
Allow /
Disallow /search/
Disallow /*?*
Disallow /admin/
Disallow /private/
Disallow /*.json$
Disallow /*_print$
Allow /posts/
Allow /images/
Allow /categories/
Allow /languages/
Allow /difficulty/
Allow /authors/

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

yandexbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

duckduckbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://markaicode.com/sitemap.xml
sitemap https://markaicode.com/sitemap-tags.xml
sitemap https://markaicode.com/sitemap-categories.xml
sitemap https://markaicode.com/sitemap-posts-1.xml
sitemap https://markaicode.com/sitemap-posts-2.xml
sitemap https://markaicode.com/sitemap-posts-3.xml
sitemap https://markaicode.com/sitemap-posts-4.xml
sitemap https://markaicode.com/sitemap-posts-5.xml
sitemap https://markaicode.com/sitemap-posts-6.xml
sitemap https://markaicode.com/sitemap-posts-7.xml
sitemap https://markaicode.com/sitemap-posts-8.xml

Comments

  • 允许搜索引擎访问以下目录
  • SEO optimization: exclude low-value tag pages (< 5 articles)
  • This prevents thin content from being indexed
  • 网站地图
  • 移除重复的User-agent配置
  • 阻止AI训练爬虫(可选)
  • 允许其他搜索引擎