csdn.net
robots.txt
Robots Exclusion Standard data for csdn.net
Resource Scan
Scan Details
Site Domain | csdn.net |
Base Domain | csdn.net |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-02-23T12:06:54+00:00 |
Next Scan | 2024-05-23T12:06:54+00:00 |
Last Successful Scan
Scanned | 2023-10-20T12:04:22+00:00 |
URL | https://www.csdn.net/robots.txt |
Domain IPs | 206.119.108.230, 206.119.110.229 |
Response IP | 206.119.110.229 |
Found | Yes |
Hash | 07c013448465727c51bd1aebff845bd92404064634f11496abf9071a9b9911ab |
SimHash | 3902f6cd6dd0 |
Groups
*
Rule | Path |
---|---|
Disallow | /scripts |
Disallow | /public |
Disallow | /css/ |
Disallow | /images/ |
Disallow | /content/ |
Disallow | /ui/ |
Disallow | /js/ |
Disallow | /scripts/ |
Disallow | /article_preview.html* |
Disallow | /tag/ |
Disallow | /*?* |
Disallow | /link/ |
Disallow | /tags/ |
Disallow | /news/ |
Disallow | /xuexi/ |