cnnbht.com
robots.txt

Robots Exclusion Standard data for cnnbht.com

Resource Scan

Scan Details

Site Domain cnnbht.com
Base Domain cnnbht.com
Scan Status Ok
Last Scan2024-11-15T18:43:11+00:00
Next Scan 2024-11-16T18:43:11+00:00

Last Scan

Scanned2024-11-15T18:43:11+00:00
URL https://www.cnnbht.com/robots.txt
Domain IPs 13.33.88.106, 13.33.88.128, 13.33.88.90, 13.33.88.91, 2600:9000:223b:1000:3:b4b8:b1c0:93a1, 2600:9000:223b:1400:3:b4b8:b1c0:93a1, 2600:9000:223b:3600:3:b4b8:b1c0:93a1, 2600:9000:223b:400:3:b4b8:b1c0:93a1, 2600:9000:223b:e600:3:b4b8:b1c0:93a1, 2600:9000:223b:ec00:3:b4b8:b1c0:93a1, 2600:9000:223b:fa00:3:b4b8:b1c0:93a1, 2600:9000:223b:fc00:3:b4b8:b1c0:93a1
Response IP 13.33.88.106
Found Yes
Hash b2e4c876c54b96ea27e60ab54a57a9ac3103dabdcde977cdfa2940bcfb72124a
SimHash 75040ddc8f11

Groups

*

Rule Path
Disallow /m-login
Disallow /ce_make.html
Disallow /cyproductlist_1/
Disallow /verify.html

baiduspider/2.0

Rule Path
Disallow /producer/
Disallow /thirdcode/
Disallow /nportal/
Disallow /icp
Disallow *api*
Disallow *.php
Disallow *.asp
Disallow /admin/
Disallow /npublic/
Disallow /thirdcode/
Disallow /site.txt

Other Records

Field Value
sitemap https://www.cnnbht.com/sitemap.xml