hdzgcn.com
robots.txt

Robots Exclusion Standard data for hdzgcn.com

Resource Scan

Scan Details

Site Domain hdzgcn.com
Base Domain hdzgcn.com
Scan Status Ok
Last Scan2026-01-10T06:40:27+00:00
Next Scan 2026-02-09T06:40:27+00:00

Last Scan

Scanned2026-01-10T06:40:27+00:00
URL http://www.hdzgcn.com/robots.txt
Domain IPs 13.35.33.118, 13.35.33.163, 13.35.33.189, 13.35.33.70, 2600:9000:213e:2200:1b:47fe:5a00:21, 2600:9000:213e:3e00:1b:47fe:5a00:21, 2600:9000:213e:7400:1b:47fe:5a00:21, 2600:9000:213e:8a00:1b:47fe:5a00:21, 2600:9000:213e:ba00:1b:47fe:5a00:21, 2600:9000:213e:bc00:1b:47fe:5a00:21, 2600:9000:213e:de00:1b:47fe:5a00:21, 2600:9000:213e:e800:1b:47fe:5a00:21
Response IP 13.35.33.70
Found Yes
Hash 95ce72679715c38b594bfa0e0d76c8e087d80ade1b66e6782a6b497e3e77a672
SimHash 71079dd90f17

Groups

*

Rule Path
Disallow /m-login
Disallow /ce_make.html
Disallow /verify110manu.html
Disallow /ce_cust_403.html
Disallow /verify430manu.html
Disallow /cyproductlist_1/
Disallow /verify.html

baiduspider/2.0

Rule Path
Disallow /producer/
Disallow /thirdcode/
Disallow /nportal/
Disallow /icp
Disallow *api*
Disallow *.php
Disallow *.asp
Disallow /admin/
Disallow /npublic/
Disallow /site.txt

Other Records

Field Value
sitemap http://www.hdzgcn.com/sitemap.xml