jsarchi.com
robots.txt

Robots Exclusion Standard data for jsarchi.com

Resource Scan

Scan Details

Site Domain jsarchi.com
Base Domain jsarchi.com
Scan Status Ok
Last Scan 2025-10-22T18:18:45+00:00
Next Scan 2025-11-21T18:18:45+00:00

Last Scan

Scanned 2025-10-22T18:18:45+00:00
URL http://www.jsarchi.com/robots.txt
Domain IPs 1.56.98.184, 116.162.168.167, 116.169.183.220, 175.43.23.215, 211.95.142.138, 221.204.15.51, 221.204.209.225, 42.56.64.131, 42.56.81.77, 60.13.97.57
Response IP 42.56.81.77
Found Yes
Hash 792429c11d8d87339403354eb0f5f2d39eeda0c2ad7a485d53682fb5b98c74f7
SimHash 75079ddd8d17
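
The scan does not say which algorithm produced the Hash value. Its 64 hexadecimal characters match the length of a SHA-256 digest, so the sketch below assumes SHA-256 purely for illustration. It shows how a content hash of this kind could be used to tell whether a robots.txt body changed between two scans. The function name and the sample bodies are hypothetical and are not the real file contents.

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical bodies standing in for two consecutive scans.
previous = fingerprint(b"User-agent: *\nDisallow: /m-login\n")
current = fingerprint(b"User-agent: *\nDisallow: /m-login\nDisallow: /verify.html\n")

# Any byte-level difference yields a different digest.
changed = previous != current
print(len(previous), changed)
```

An exact hash only flags that something changed. A similarity hash such as the SimHash field is typically used to judge how much changed, which this sketch does not attempt.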

Groups

*

Rule Path
Disallow /m-login
Disallow /ce_make.html
Disallow /verify110manu.html
Disallow /ce_cust_403.html
Disallow /verify430manu.html
Disallow /cyproductlist_1/
Disallow /verify.html
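
As a rough illustration of how the "*" group above applies to a crawler, the following sketch feeds those rules to Python's standard urllib.robotparser and checks a few paths. The directive layout is a reconstruction from the table, not a copy of the live file, and the agent name ExampleBot and the test paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Rules copied from the "*" group listed above (layout reconstructed).
STAR_GROUP = """\
User-agent: *
Disallow: /m-login
Disallow: /ce_make.html
Disallow: /verify110manu.html
Disallow: /ce_cust_403.html
Disallow: /verify430manu.html
Disallow: /cyproductlist_1/
Disallow: /verify.html
"""

parser = RobotFileParser()
parser.parse(STAR_GROUP.splitlines())

def allowed(path, agent="ExampleBot"):
    """Return True if the (hypothetical) crawler may fetch the given path."""
    return parser.can_fetch(agent, "http://www.jsarchi.com" + path)

# Hypothetical paths chosen to exercise the prefix rules.
for path in ("/", "/m-login", "/cyproductlist_1/page2.html", "/products.html"):
    print(path, allowed(path))
```

urllib.robotparser matches rules by simple path prefix, so /cyproductlist_1/ blocks everything beneath that directory while /verify.html blocks only paths that begin with that exact string.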

baiduspider/2.0

Rule Path
Disallow /producer/
Disallow /thirdcode/
Disallow /nportal/
Disallow /icp
Disallow *api*
Disallow *.php
Disallow *.asp
Disallow /admin/
Disallow /npublic/
Disallow /site.txt
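
Several rules in the baiduspider/2.0 group use the "*" wildcard, which Python's urllib.robotparser does not interpret. The sketch below is one possible matcher, assuming the common convention that "*" matches any run of characters, that a trailing "$" anchors the end of the path, and that everything else is a literal prefix. How a particular crawler treats patterns that do not start with "/" (such as *api*) is not stated in the scan, so that behaviour is an assumption here. The function names and test paths are hypothetical.

```python
import re

# Patterns copied from the "baiduspider/2.0" group listed above.
BAIDU_DISALLOW = [
    "/producer/", "/thirdcode/", "/nportal/", "/icp", "*api*",
    "*.php", "*.asp", "/admin/", "/npublic/", "/site.txt",
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*', a trailing '$' anchors the end of the path, and all
    other characters are matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

RULES = [pattern_to_regex(p) for p in BAIDU_DISALLOW]

def is_disallowed(path):
    """Return True if any rule in the group matches the path."""
    return any(rule.match(path) for rule in RULES)

# Hypothetical paths; note how broad the *api* pattern is.
for path in ("/index.php", "/v1/api/users", "/capital.html", "/about.html"):
    print(path, is_disallowed(path))
```

Under these assumptions, *api* matches any path containing the letters "api", including unrelated ones such as /capital.html, so a pattern like this can block more than intended.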

Other Records

Field Value
sitemap http://www.jsarchi.com/sitemap.xml
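
A crawler can discover the sitemap from this record without guessing its location. The sketch below, which assumes Python 3.8 or later for the site_maps() method, parses a single reconstructed Sitemap line and reads the URL back. The directive spelling is a reconstruction of the record above, not a copy of the live file.

```python
from urllib.robotparser import RobotFileParser

parser = RobotFileParser()
# Reconstructed from the "sitemap" record listed above.
parser.parse(["Sitemap: http://www.jsarchi.com/sitemap.xml"])

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)
```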