whghjt.com
robots.txt

Robots Exclusion Standard data for whghjt.com

Resource Scan

Scan Details

Site Domain whghjt.com
Base Domain whghjt.com
Scan Status Ok
Last Scan 2024-04-24T20:49:17+00:00
Next Scan 2024-05-24T20:49:17+00:00

Last Scan

Scanned 2024-04-24T20:49:17+00:00
URL https://www.whghjt.com/robots.txt
Domain IPs 108.157.254.12, 108.157.254.128, 108.157.254.4, 108.157.254.85, 2600:9000:2753:3a00:1d:b209:4e40:93a1, 2600:9000:2753:5400:1d:b209:4e40:93a1, 2600:9000:2753:5800:1d:b209:4e40:93a1, 2600:9000:2753:6a00:1d:b209:4e40:93a1, 2600:9000:2753:8200:1d:b209:4e40:93a1, 2600:9000:2753:ba00:1d:b209:4e40:93a1, 2600:9000:2753:d800:1d:b209:4e40:93a1, 2600:9000:2753:e000:1d:b209:4e40:93a1
Response IP 108.157.254.12
Found Yes
Hash b66021f1e3800fb5158012193ca2e5f075ceffe57fcbef3c61319620b2e409a6
SimHash 75051ddd8d11

Groups

*

Rule Path
Disallow /m-login
Disallow /ce_make.html
Disallow /cyproductlist_1/
Disallow /verify.html

baiduspider/2.0

Rule Path
Disallow /producer/
Disallow /thirdcode/
Disallow /nportal/
Disallow /icp
Disallow *api*
Disallow *.php
Disallow *.asp
Disallow /admin/
Disallow /npublic/
Disallow /thirdcode/
Disallow /site.txt
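
The sketch below shows one way to check a URL path against the two groups listed above. It is illustrative only, not the scanner's or any crawler's actual implementation. It assumes the common wildcard convention in which `*` matches any run of characters, a trailing `$` anchors the end of the path, and every pattern is otherwise a prefix match from the start of the path. The names `GROUPS`, `pattern_to_regex`, and `is_disallowed` are hypothetical. Because both groups contain only Disallow rules, no Allow-versus-Disallow precedence logic is needed here.

```python
import re

# Rules copied from the two groups listed above; keys are the group
# names exactly as the scan reports them.
GROUPS = {
    "*": ["/m-login", "/ce_make.html", "/cyproductlist_1/", "/verify.html"],
    "baiduspider/2.0": [
        "/producer/", "/thirdcode/", "/nportal/", "/icp", "*api*", "*.php",
        "*.asp", "/admin/", "/npublic/", "/thirdcode/", "/site.txt",
    ],
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (assumed semantics)."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape literal pieces and let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def is_disallowed(group: str, path: str) -> bool:
    """Return True if any Disallow pattern in the group matches the path."""
    # re.match anchors at the start of the path, giving prefix matching.
    return any(pattern_to_regex(p).match(path) for p in GROUPS[group])
```

Under these assumptions, `is_disallowed("baiduspider/2.0", "/v1/api/items")` is true because of the `*api*` rule, while `is_disallowed("*", "/admin/")` is false because `/admin/` appears only in the `baiduspider/2.0` group.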

Other Records

Field Value
sitemap https://www.whghjt.com/sitemap.xml