sdjky.net
robots.txt

Robots Exclusion Standard data for sdjky.net

Resource Scan

Scan Details

Site Domain sdjky.net
Base Domain sdjky.net
Scan Status Ok
Last Scan2025-12-18T03:14:25+00:00
Next Scan 2026-01-17T03:14:25+00:00

Last Scan

Scanned2025-12-18T03:14:25+00:00
URL http://www.sdjky.net/robots.txt
Domain IPs 180.163.146.116
Response IP 180.163.146.116
Found Yes
Hash e8ab094a7d69f0e8aceebd3a62ba635a853e73a5b1047d67017a67ef5f2862c8
SimHash 90862ac60b01

Groups

*

Rule Path
Disallow /

Comments

  • robots.txt for www.sdjky.net
  • 声明所有搜索引擎爬虫(* 表示通é
  • 禁止抓取网站根目录及所有子目录下的å†
  • 可选:明确声明禁止抓取的特定文件类型或目录(如果需要更细粒度控制)
  • Disallow: /images/ # 禁止抓取图片目录
  • Disallow: /admin/ # 禁止抓取后台管理目录
  • Disallow: *.pdf # 禁止抓取所有PDF文件(部分爬虫支持通é

Warnings

  • 3 invalid lines.