cece.com
robots.txt

Robots Exclusion Standard data for cece.com

Resource Scan

Scan Details

Site Domain cece.com
Base Domain cece.com
Scan Status Ok
Last Scan2025-11-21T05:10:53+00:00
Next Scan 2025-12-05T05:10:53+00:00

Last Scan

Scanned2025-11-21T05:10:53+00:00
URL https://cece.com/robots.txt
Domain IPs 81.70.125.124
Response IP 81.70.125.124
Found Yes
Hash 1a3ff6a2fcb3857d49e22e0385860c38304c2aefdba21a101bb57b07e8d5a2dc
SimHash 081224f6cf03

Groups

*

Rule Path
Allow /

baiduspider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 0.5

googlebot

Rule Path
Allow /

sogou web spider

Rule Path
Allow /

Other Records

Field Value
sitemap https://cece.com/sitemap.xml
sitemap https://cece.com/sitemap.xml

Comments

  • 特别允许百度爬虫
  • 百度专用站点地图
  • 允许其他主要搜索引擎
  • 禁止爬取的目录和文件
  • 禁止管理相关路径
  • 网站地图
  • 百度快速收录声明
  • 重要内容目录,建议优先收录

Warnings

  • 15 invalid lines.