
Robots Exclusion Standard data for cpf.org.cn

Resource Scan

Scan Details

Site Domain cpf.org.cn
Base Domain cpf.org.cn
Scan Status Ok
Last Scan 2024-11-14T01:48:44+00:00
Next Scan 2024-12-14T01:48:44+00:00

Last Scan

Scanned 2024-11-14T01:48:44+00:00
URL https://www.cpf.org.cn/robots.txt
Domain IPs 13.33.88.24, 13.33.88.53, 13.33.88.54, 13.33.88.77, 2600:9000:223b:1c00:19:ea2e:a540:93a1, 2600:9000:223b:5400:19:ea2e:a540:93a1, 2600:9000:223b:600:19:ea2e:a540:93a1, 2600:9000:223b:8000:19:ea2e:a540:93a1, 2600:9000:223b:8600:19:ea2e:a540:93a1, 2600:9000:223b:9c00:19:ea2e:a540:93a1, 2600:9000:223b:a00:19:ea2e:a540:93a1, 2600:9000:223b:ca00:19:ea2e:a540:93a1
Response IP 13.33.88.54
Found Yes
Hash ff61be98fb42e81f14fa1eee91992b73964885b4a0fdd1e2b03e28fb1d86b1f4
SimHash 75859ddc0f11

Groups

*

Rule Path
Disallow /m-login
Disallow /ce_make.html
Disallow /cyproductlist_1/
Disallow /verify.html

baiduspider/2.0

Rule Path
Disallow /producer/
Disallow /thirdcode/
Disallow /nportal/
Disallow /icp
Disallow *api*
Disallow *.php
Disallow *.asp
Disallow /admin/
Disallow /npublic/
Disallow /thirdcode/
Disallow /site.txt
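
To illustrate how the two groups above could be applied to a request path, here is a minimal Python sketch. It assumes the common wildcard convention in which `*` matches any run of characters, a trailing `$` anchors the end of the path, and every other rule is a prefix match. The names `GROUPS`, `pattern_to_regex`, and `is_disallowed` are hypothetical helpers for this example and are not part of the scan data. Selecting a group from a crawler's user-agent string is left out; the group name is passed in directly.

```python
import re

# Rules transcribed from the two groups listed in this scan.
# The duplicate /thirdcode/ entry is kept exactly as reported.
GROUPS = {
    "*": ["/m-login", "/ce_make.html", "/cyproductlist_1/", "/verify.html"],
    "baiduspider/2.0": [
        "/producer/", "/thirdcode/", "/nportal/", "/icp", "*api*",
        "*.php", "*.asp", "/admin/", "/npublic/", "/thirdcode/", "/site.txt",
    ],
}


def pattern_to_regex(pattern):
    """Convert a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any characters, a trailing '$'
    anchors the end, and everything else is a literal prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path, group):
    """Return True if any Disallow rule in the named group matches the path."""
    return any(pattern_to_regex(rule).match(path) for rule in GROUPS[group])


if __name__ == "__main__":
    for group, path in [
        ("*", "/m-login"),
        ("*", "/index.html"),
        ("baiduspider/2.0", "/foo/api/bar"),
        ("baiduspider/2.0", "/page.php?id=1"),
    ]:
        print(group, path, is_disallowed(path, group))
```

Because neither group contains an Allow rule, the sketch does not implement longest-match precedence between Allow and Disallow; a fuller implementation would need it.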

Other Records

Field Value
sitemap https://www.cpf.org.cn/sitemap.xml