bhi.com.cn
robots.txt

Robots Exclusion Standard data for bhi.com.cn

Resource Scan

Scan Details

Site Domain bhi.com.cn
Base Domain bhi.com.cn
Scan Status Ok
Last Scan 2026-01-04T03:27:15+00:00
Next Scan 2026-02-03T03:27:15+00:00

Last Scan

Scanned 2026-01-04T03:27:15+00:00
URL https://www.bhi.com.cn/robots.txt
Domain IPs 218.241.155.118
Response IP 218.241.155.118
Found Yes
Hash 8b1f60afb9dfd21a06b97a24c7b3e70e5d9f24ca23e7b48379535d6630741476
SimHash 741fda735767

Groups

*

Rule Path
Disallow
Disallow /DownLoadInfo/
Disallow /Templete/
Disallow /Generate/
Disallow /Tools/
Disallow /ExportProject/
Disallow /*.pdf$
Disallow /Login/
Disallow /ExportProject/
Disallow /CreatIndexFile/
Disallow /MemberCenter/
Disallow /mobile_nzj/
Disallow /NavigationPage/
Disallow /Public/
Disallow /Public2/
Disallow /Solr/
Disallow /BhiHome/

Other Records

Field Value
crawl-delay 10
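
The Disallow paths in the `*` group mix plain prefixes with one wildcard pattern (`/*.pdf$`). As a rough illustration of how such patterns are commonly evaluated, here is a minimal sketch, assuming the usual convention that `*` matches any run of characters and a trailing `$` anchors the end of the URL path. The helper names (`pattern_to_regex`, `is_disallowed`) and the sample paths in the usage note are hypothetical and not part of the scan data; the empty Disallow row is left out on the assumption that an empty path matches nothing.

```python
import re

# Disallow paths copied from the "*" group above (duplicate row dropped).
# The empty Disallow row is omitted: assumed to match no URL.
DISALLOW = [
    "/DownLoadInfo/", "/Templete/", "/Generate/", "/Tools/",
    "/ExportProject/", "/*.pdf$", "/Login/", "/CreatIndexFile/",
    "/MemberCenter/", "/mobile_nzj/", "/NavigationPage/",
    "/Public/", "/Public2/", "/Solr/", "/BhiHome/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper)."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape each literal piece, then join the pieces with ".*" for every "*".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

Under these assumptions, `is_disallowed("/Login/index.html")` and `is_disallowed("/files/report.pdf")` are both true, while `/files/report.pdf?download=1` is not matched because the `$` anchor requires the path to end in `.pdf`. Matching is case-sensitive, so `/login/` would not be covered by the `/Login/` rule.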

baiduspider

Rule Path
Allow /News/
Allow /Projects/
Allow /products/
Allow /DynamicTopic/

Other Records

Field Value
crawl-delay 2

bingbot

Rule Path
Allow /News/
Allow /Projects/
Allow /products/
Allow /DynamicTopic/

Other Records

Field Value
crawl-delay 1
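
The three crawl-delay values listed so far (10 for `*`, 2 for baiduspider, 1 for bingbot) suggest a per-crawler lookup with a fallback to the `*` group. The following is a minimal sketch of that lookup, assuming the value is a delay in seconds and that a group applies when its name appears, case-insensitively, in the crawler's User-Agent string. `crawl-delay` is a non-standard extension, so not every crawler honors it. The function name and the sample User-Agent strings are hypothetical.

```python
# crawl-delay values copied from the groups above; unit assumed to be seconds.
CRAWL_DELAY_SECONDS = {"*": 10, "baiduspider": 2, "bingbot": 1}


def crawl_delay_for(user_agent: str) -> int:
    """Pick the delay of the first named group found in the User-Agent,
    falling back to the "*" group (simplified substring matching)."""
    ua = user_agent.lower()
    for token, delay in CRAWL_DELAY_SECONDS.items():
        if token != "*" and token in ua:
            return delay
    return CRAWL_DELAY_SECONDS["*"]
```

For example, a User-Agent containing `bingbot/2.0` would get a delay of 1, one containing `Baiduspider` would get 2, and any other crawler would fall back to 10.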

yisouspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

python requests

Rule Path
Disallow /

apache httpclient

Rule Path
Disallow /

jsoup?

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.bhi.com.cn/SiteMap/SiteMap.html

Comments

  • Search engine optimization
  • Block known crawling tools

Warnings

  • 4 invalid lines.