calyp.cn
robots.txt

Robots Exclusion Standard data for calyp.cn

Resource Scan

Scan Details

Site Domain calyp.cn
Base Domain calyp.cn
Scan Status Ok
Last Scan 2026-03-16T01:51:48+00:00
Next Scan 2026-04-15T01:51:48+00:00

Last Scan

Scanned 2026-03-16T01:51:48+00:00
URL https://calyp.cn/robots.txt
Domain IPs 104.21.28.72, 172.67.144.161, 2606:4700:3032::ac43:90a1, 2606:4700:3037::6815:1c48
Response IP 104.21.28.72
Found Yes
Hash 5eefdd11b20ff5c89a6fa52ddda3c121e05c8036453200034a20ecfe54a9d075
SimHash 8079d0f27aa0

Groups

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

serankingbacklinksbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

iboubot

Rule Path
Disallow /

iboubot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

leakix

Rule Path
Disallow /

l9scan

Rule Path
Disallow /

amzn-searchbot

Rule Path Comment
Disallow /article_page/ Block directory article_page
Disallow /category_collection/ Block directory category_collection

Other Records

Field Value
crawl-delay 60

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-searchbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

facebookbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

*

Rule Path
Disallow
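
As a rough illustration of how the groups above would be interpreted by a crawler, the sketch below feeds a small subset of them to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a hypothetical reconstruction from this report (the real file's exact wording, ordering, and comments are not shown here), and the `allowed` helper is an illustrative name, not part of any scanner.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of four groups listed in the scan;
# this is NOT the site's actual robots.txt text.
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: amzn-searchbot
Disallow: /article_page/
Disallow: /category_collection/
Crawl-delay: 60

User-agent: googlebot
Allow: /

User-agent: *
Disallow:
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://calyp.cn" + path)


if __name__ == "__main__":
    for agent, path in [
        ("gptbot", "/"),
        ("amzn-searchbot", "/article_page/1"),
        ("amzn-searchbot", "/about"),
        ("googlebot", "/anything"),
        ("someotherbot", "/anything"),
    ]:
        print(agent, path, allowed(agent, path))
    print("amzn-searchbot crawl delay:", parser.crawl_delay("amzn-searchbot"))
```

Note that the final `*` group has an empty `Disallow` value, which by convention places no restriction on agents that match no other group.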

Comments

  • Block unnecessary ...
  • ========= Low-threat crawlers, rate-limited =========
  • ========= Block AI crawlers =========
  • ========= Block malicious/high-risk crawlers =========

Warnings

  • 5 invalid lines.