news.csdn.net
robots.txt

Robots Exclusion Standard data for news.csdn.net

Resource Scan

Scan Details

Site Domain news.csdn.net
Base Domain csdn.net
Scan Status Ok
Last Scan 2024-05-10T10:32:41+00:00
Next Scan 2024-06-09T10:32:41+00:00

Last Scan

Scanned 2024-05-10T10:32:41+00:00
URL https://news.csdn.net/robots.txt
Redirect https://www.csdn.net/robots.txt
Redirect Domain www.csdn.net
Redirect Base csdn.net
Domain IPs 120.46.76.152
Redirect IPs 117.149.203.62, 123.129.227.28, 220.185.184.16
Response IP 117.149.203.62
Found Yes
Hash 07c013448465727c51bd1aebff845bd92404064634f11496abf9071a9b9911ab
SimHash 3902f6cd6dd0
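
The Hash field is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body (an assumption, not something the scan states), a fetched copy of the file could be compared against the reported value roughly as follows. The helper names `sha256_hex` and `matches_report` are hypothetical.

```python
import hashlib

# Value copied from the Hash field above.
REPORTED_HASH = "07c013448465727c51bd1aebff845bd92404064634f11496abf9071a9b9911ab"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body with the reported hash.

    Assumes the scanner hashed the unmodified response body with SHA-256;
    if it normalized the text or used another algorithm, this returns False
    even for identical content.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch does not by itself mean the file changed, since the scanner's exact hashing input is unknown.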

Groups

*

Rule Path
Disallow /scripts
Disallow /public
Disallow /css/
Disallow /images/
Disallow /content/
Disallow /ui/
Disallow /js/
Disallow /scripts/
Disallow /article_preview.html*
Disallow /tag/
Disallow /*?*
Disallow /link/
Disallow /tags/
Disallow /news/
Disallow /xuexi/
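
To illustrate how the rules above apply to request paths, here is a minimal sketch of prefix matching with `*` wildcards, following the matching behavior described in RFC 9309. It is not the scanner's or any crawler's actual implementation; `rule_to_regex` and `is_disallowed` are hypothetical helper names. Because the group contains no Allow rules, a path is blocked as soon as any Disallow pattern matches, so longest-match precedence is not needed here.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/scripts", "/public", "/css/", "/images/", "/content/", "/ui/", "/js/",
    "/scripts/", "/article_preview.html*", "/tag/", "/*?*", "/link/",
    "/tags/", "/news/", "/xuexi/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    pattern = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if the path (including any query string) matches a rule."""
    # re.match anchors at the start of the string, giving prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    for sample in ["/", "/article/details/1", "/a/b?x=1", "/tags/python"]:
        print(sample, is_disallowed(sample))
```

Note that `/*?*` blocks every URL carrying a query string, and that `/scripts` without a trailing slash already covers `/scripts/`, so the later `/scripts/` entry is redundant under this matching model.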