51cg5.com
robots.txt
Robots Exclusion Standard data for 51cg5.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | 51cg5.com |
Base Domain | 51cg5.com |
Scan Status | Ok |
Last Scan | 2025-04-09T23:17:55+00:00 |
Next Scan | 2025-05-09T23:17:55+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-04-09T23:17:55+00:00 |
URL | https://51cg5.com/robots.txt |
Domain IPs | 104.21.85.139, 172.67.206.111, 2606:4700:3034::6815:558b, 2606:4700:3037::ac43:ce6f |
Response IP | 104.21.85.139 |
Found | Yes |
Hash | f8e65021e6660d518fc9386aed22ba1e4dad02f7c2b24f8e2a3623fbe83e915c |
SimHash | f105fe19ff71 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /admin* |
Disallow | /danmaku/* |
Disallow | /feed/* |
Disallow | /archives/*/comment-page-* |
Disallow | /comment-page-*?replyTo=* |
Disallow | /*.html/comment-page-* |
Disallow | /upload*/xiao/* |
Disallow | /tag/*xiaohaole.cc* |
Disallow | /tag/*xiaohaola.com* |
Disallow | /tag/*kaapm.com* |
Disallow | /*?replyTo=* |
Disallow | /*?ysclid=* |
Disallow | /*?continueFlag=* |
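
The Disallow paths above rely on the `*` wildcard, which common crawlers treat as "any sequence of characters", with each rule matched from the start of the URL path (query string included). The following is a minimal sketch of that matching, not the scanner's or any crawler's actual implementation. The names `DISALLOW_PATTERNS`, `pattern_to_regex`, and `is_disallowed` are hypothetical, and the sketch assumes no Allow rules apply, which holds for this group.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW_PATTERNS = [
    "/admin*",
    "/danmaku/*",
    "/feed/*",
    "/archives/*/comment-page-*",
    "/comment-page-*?replyTo=*",
    "/*.html/comment-page-*",
    "/upload*/xiao/*",
    "/tag/*xiaohaole.cc*",
    "/tag/*xiaohaola.com*",
    "/tag/*kaapm.com*",
    "/*?replyTo=*",
    "/*?ysclid=*",
    "/*?continueFlag=*",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into an anchored regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the URL; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces, then rejoin them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches the path plus query."""
    return any(
        pattern_to_regex(p).match(path_and_query) for p in DISALLOW_PATTERNS
    )


if __name__ == "__main__":
    for candidate in ["/admin/login", "/archives/123/", "/archives/123/?replyTo=5"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/feed/*` blocks everything below `/feed/` but not the bare path `/feed`, because the rule requires the trailing slash.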
Other Records
Field | Value |
---|---|
sitemap | https://51cg5.com/sitemap.xml |
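
The sitemap record sits outside any user-agent group. As a rough sketch, Python's standard `urllib.robotparser` can extract it from robots.txt text. The `ROBOTS_TXT` string below is a partial reconstruction from this report (the raw file body is not shown here, and only two of the Disallow rules are reproduced), so treat it as an assumption rather than the site's actual file. Note that the standard-library parser does not interpret `*` inside paths, so it is used here only to read the sitemap entry.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of part of the file, based on the tables above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin*
Disallow: /feed/*

Sitemap: https://51cg5.com/sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() (Python 3.8+) returns a list of sitemap URLs, or None if absent.
sitemaps = parser.site_maps()
print(sitemaps)
```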