cnblogs.com
robots.txt

Robots Exclusion Standard data for cnblogs.com

Resource Scan

Scan Details

Site Domain cnblogs.com
Base Domain cnblogs.com
Scan Status Ok
Last Scan2024-11-13T06:00:46+00:00
Next Scan 2024-11-20T06:00:46+00:00

Last Scan

Scanned2024-11-13T06:00:46+00:00
URL https://cnblogs.com/robots.txt
Redirect https://www.cnblogs.com/robots.txt
Redirect Domain www.cnblogs.com
Redirect Base cnblogs.com
Domain IPs 101.37.97.51, 2400:3200:1300::e70
Redirect IPs 2400:3200:1300::e70, 8.222.133.242
Response IP 8.222.133.242
Found Yes
Hash e74fbea7201c834585c47426d4cb2960bca753f9838f62bfafb69046c01f9e22
SimHash 590ccc230711

Groups

*

Rule Path
Allow /
Disallow /?*
Disallow /*/tag/*/?*
Disallow /*/tag/*/default.html?*
Disallow /index.html*
Disallow /default.aspx*

Other Records

Field Value
sitemap https://www.cnblogs.com/sitemap.xml