control.blog.sina.com.cn
robots.txt

Robots Exclusion Standard data for control.blog.sina.com.cn

Resource Scan

Scan Details

Site Domain control.blog.sina.com.cn
Base Domain sina.com.cn
Scan Status Ok
Last Scan 2024-04-29T03:58:07+00:00
Next Scan 2024-05-29T03:58:07+00:00

Last Scan

Scanned 2024-04-29T03:58:07+00:00
URL https://control.blog.sina.com.cn/robots.txt
Domain IPs 49.7.37.19
Response IP 49.7.37.19
Found Yes
Hash d75e706fb21467f1a1ff41ab7d6c23de3d662141eba411556cfaf4bffe18b82d
SimHash a9100a820710

Groups

*

Rule Path
Allow /admin/blogmove/
Disallow /admin/
Disallow /include/
Disallow /html/
Disallow /queue/
Disallow /config/
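
The group above can be checked with a short script. This is a minimal sketch that assumes the rules are evaluated with Python's standard `urllib.robotparser`; the crawler name `ExampleBot` and the sample paths are hypothetical and were not part of the scan. The robots.txt text is rebuilt from the rule table rather than fetched from the site.

```python
from urllib.robotparser import RobotFileParser

# robots.txt body rebuilt from the rule table for the "*" group
ROBOTS_TXT = """\
User-agent: *
Allow: /admin/blogmove/
Disallow: /admin/
Disallow: /include/
Disallow: /html/
Disallow: /queue/
Disallow: /config/
"""

BASE = "https://control.blog.sina.com.cn"

# Parse the text directly instead of downloading it
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(path, agent="ExampleBot"):
    """Return True if `agent` (a hypothetical crawler) may fetch `path`."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # Sample paths are illustrative only
    for path in ["/admin/blogmove/index.php", "/admin/login.php",
                 "/config/db.inc", "/help/", "/"]:
        print(path, allowed(path))
```

Because `Allow /admin/blogmove/` is listed before `Disallow /admin/`, a parser that applies the first matching rule and one that applies the longest matching rule both permit paths under `/admin/blogmove/` while blocking the rest of `/admin/`.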

Comments

  • User-Agent code of the search engine being restricted; * means all
  • Directories that may not be searched; when Disallow: is empty, all directories are open
  • Directories open to search:
  • /
  • /advice/
  • /help/
  • /lm/
  • /main/
  • /myblog/
  • Search engine User-Agent code reference table
  • Search engine User-Agent code
  • AltaVista Scooter
  • Infoseek Infoseek
  • Hotbot Slurp
  • AOL Search Slurp
  • Excite ArchitextSpider
  • Google Googlebot
  • Goto Slurp
  • Lycos Lycos
  • MSN MSNBOT
  • Netscape Googlebot
  • NorthernLight Gulliver
  • WebCrawler ArchitextSpider
  • Iwon Slurp
  • Fast Fast
  • DirectHit Grabber
  • Yahoo Web Pages Googlebot
  • Looksmart Web Pages Slurp
  • Baiduspider Baidu