hj-cleanroom.com
robots.txt

Robots Exclusion Standard data for hj-cleanroom.com

Resource Scan

Scan Details

Site Domain hj-cleanroom.com
Base Domain hj-cleanroom.com
Scan Status Ok
Last Scan2024-10-31T03:36:32+00:00
Next Scan 2024-11-30T03:36:32+00:00

Last Scan

Scanned2024-10-31T03:36:32+00:00
URL https://www.hj-cleanroom.com/robots.txt
Domain IPs 108.157.254.106, 108.157.254.120, 108.157.254.126, 108.157.254.73, 2600:9000:2753:6200:1e:ebae:9a00:93a1, 2600:9000:2753:7800:1e:ebae:9a00:93a1, 2600:9000:2753:8a00:1e:ebae:9a00:93a1, 2600:9000:2753:9400:1e:ebae:9a00:93a1, 2600:9000:2753:ac00:1e:ebae:9a00:93a1, 2600:9000:2753:d400:1e:ebae:9a00:93a1, 2600:9000:2753:e000:1e:ebae:9a00:93a1, 2600:9000:2753:fe00:1e:ebae:9a00:93a1
Response IP 108.157.254.126
Found Yes
Hash d0926d95e1893ad9c0f78cd46c01da7602a16f0b2899d3a2094bf1ebfd949113
SimHash 75848ddd8f11

Groups

*

Rule Path
Disallow /m-login
Disallow /ce_make.html
Disallow /cyproductlist_1/
Disallow /verify.html

baiduspider/2.0

Rule Path
Disallow /producer/
Disallow /thirdcode/
Disallow /nportal/
Disallow /icp
Disallow *api*
Disallow *.php
Disallow *.asp
Disallow /admin/
Disallow /npublic/
Disallow /thirdcode/
Disallow /site.txt

Other Records

Field Value
sitemap https://www.hj-cleanroom.com/sitemap.xml