csdn.net
robots.txt
Robots Exclusion Standard data for csdn.net
Resource Scan
Scan Details
Site Domain | csdn.net |
Base Domain | csdn.net |
Scan Status | Ok |
Last Scan | 2024-11-11T12:59:15+00:00 |
Next Scan | 2024-11-25T12:59:15+00:00 |
Last Scan
Scanned | 2024-11-11T12:59:15+00:00 |
URL | https://www.csdn.net/robots.txt |
Domain IPs | 117.149.203.17, 123.129.227.84, 183.240.163.241, 220.185.184.6, 220.185.184.63 |
Response IP | 183.240.163.241 |
Found | Yes |
Hash | 07c013448465727c51bd1aebff845bd92404064634f11496abf9071a9b9911ab |
SimHash | 3902f6cd6dd0 |
Groups
*
Rule | Path |
---|---|
Disallow | /scripts |
Disallow | /public |
Disallow | /css/ |
Disallow | /images/ |
Disallow | /content/ |
Disallow | /ui/ |
Disallow | /js/ |
Disallow | /scripts/ |
Disallow | /article_preview.html* |
Disallow | /tag/ |
Disallow | /*?* |
Disallow | /link/ |
Disallow | /tags/ |
Disallow | /news/ |
Disallow | /xuexi/ |