cgcable.com
robots.txt

Robots Exclusion Standard data for cgcable.com

Resource Scan

Scan Details

Site Domain cgcable.com
Base Domain cgcable.com
Scan Status Ok
Last Scan2024-05-08T01:13:03+00:00
Next Scan 2024-06-07T01:13:03+00:00

Last Scan

Scanned2024-05-08T01:13:03+00:00
URL https://www.cgcable.com/robots.txt
Domain IPs 13.33.30.103, 13.33.30.121, 13.33.30.13, 13.33.30.8, 2600:9000:229f:1200:d:46ad:2e40:93a1, 2600:9000:229f:1400:d:46ad:2e40:93a1, 2600:9000:229f:3000:d:46ad:2e40:93a1, 2600:9000:229f:5600:d:46ad:2e40:93a1, 2600:9000:229f:7000:d:46ad:2e40:93a1, 2600:9000:229f:8200:d:46ad:2e40:93a1, 2600:9000:229f:a400:d:46ad:2e40:93a1, 2600:9000:229f:fe00:d:46ad:2e40:93a1
Response IP 13.33.30.121
Found Yes
Hash 8b278fe7c517e4d5682fd8a92a16f7a0712994fbb0843f40b7f40a1fc609bc2e
SimHash 75059ddd8f11

Groups

*

Rule Path
Disallow /m-login
Disallow /ce_make.html
Disallow /cyproductlist_1/
Disallow /verify.html

baiduspider/2.0

Rule Path
Disallow /producer/
Disallow /thirdcode/
Disallow /nportal/
Disallow /icp
Disallow *api*
Disallow *.php
Disallow *.asp
Disallow /admin/
Disallow /npublic/
Disallow /thirdcode/
Disallow /site.txt

Other Records

Field Value
sitemap https://www.cgcable.com/sitemap.xml