eic.org.cn
robots.txt

Robots Exclusion Standard data for eic.org.cn

Resource Scan

Scan Details

Site Domain eic.org.cn
Base Domain eic.org.cn
Scan Status Ok
Last Scan2025-09-18T11:15:43+00:00
Next Scan 2025-10-18T11:15:43+00:00

Last Scan

Scanned2025-09-18T11:15:43+00:00
URL https://eic.org.cn/robots.txt
Redirect https://www.eic.org.cn/robots.txt
Redirect Domain www.eic.org.cn
Redirect Base eic.org.cn
Domain IPs 123.57.145.4
Redirect IPs 180.163.146.116
Response IP 180.163.146.116
Found Yes
Hash e3aec8d703782a1067034b48bc99603b9cf1707ae717162d2602a96cb6cb428a
SimHash 08341e32f681

Groups

*

Rule Path
Disallow /admin/
Disallow /api/
Disallow /tmp/
Disallow /private/

baiduspider

Rule Path
Allow /

googlebot

Rule Path
Allow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://yourdomain.com/sitemap.xml

Comments

  • 全局爬虫规则
  • 禁止访问管理页面和API端点
  • 禁止访问临时文件和私人目录
  • 允许特定爬虫访问
  • 禁止恶意爬虫
  • 站点地图