Robots Exclusion Standard data for 51cg5.com

Resource Scan

Scan Details

Site Domain 51cg5.com
Base Domain 51cg5.com
Scan Status Ok
Last Scan 2025-04-09T23:17:55+00:00
Next Scan 2025-05-09T23:17:55+00:00
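
The two timestamps above imply a rescan interval, which can be checked with a short sketch. The variable names are illustrative, and the report itself does not state that the interval is fixed.

```python
from datetime import datetime

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2025-04-09T23:17:55+00:00")
next_scan = datetime.fromisoformat("2025-05-09T23:17:55+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)
```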

Last Scan

Scanned 2025-04-09T23:17:55+00:00
URL https://51cg5.com/robots.txt
Domain IPs 104.21.85.139, 172.67.206.111, 2606:4700:3034::6815:558b, 2606:4700:3037::ac43:ce6f
Response IP 104.21.85.139
Found Yes
Hash f8e65021e6660d518fc9386aed22ba1e4dad02f7c2b24f8e2a3623fbe83e915c
SimHash f105fe19ff71
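
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a content hash of a fetched robots.txt body could be computed and compared between scans as follows. The function names `content_hash` and `has_changed` are hypothetical, and the exact bytes the scanner hashes (raw body, normalized text, and so on) are not known.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Report whether a freshly fetched body differs from the stored hash."""
    return content_hash(body) != previous_hash


# Hash recorded in the report above.
recorded = "f8e65021e6660d518fc9386aed22ba1e4dad02f7c2b24f8e2a3623fbe83e915c"

# Any body whose digest differs from the recorded one counts as a change.
print(has_changed(recorded, b"User-agent: *\n"))
```

An exact hash only detects that something changed. The SimHash field presumably serves the complementary purpose of measuring how similar two versions are, but its construction is not described here.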

Groups

*

Rule Path
Disallow /admin*
Disallow /danmaku/*
Disallow /feed/*
Disallow /archives/*/comment-page-*
Disallow /comment-page-*?replyTo=*
Disallow /*.html/comment-page-*
Disallow /upload*/xiao/*
Disallow /tag/*xiaohaole.cc*
Disallow /tag/*xiaohaola.com*
Disallow /tag/*kaapm.com*
Disallow /*?replyTo=*
Disallow /*?ysclid=*
Disallow /*?continueFlag=*
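
To illustrate how these patterns apply to concrete URLs, here is a minimal sketch of wildcard matching in the style described by the Robots Exclusion Protocol: `*` matches any run of characters, a trailing `$` anchors the end, and every pattern is matched from the start of the path. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and real crawlers may differ in details such as percent-encoding and case handling. Because this group contains only Disallow rules, no Allow/Disallow precedence logic is needed.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW = [
    "/admin*",
    "/danmaku/*",
    "/feed/*",
    "/archives/*/comment-page-*",
    "/comment-page-*?replyTo=*",
    "/*.html/comment-page-*",
    "/upload*/xiao/*",
    "/tag/*xiaohaole.cc*",
    "/tag/*xiaohaola.com*",
    "/tag/*kaapm.com*",
    "/*?replyTo=*",
    "/*?ysclid=*",
    "/*?continueFlag=*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal segments and join them with ".*" in place of each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


for candidate in ["/admin/login", "/archives/123/comment-page-2",
                  "/post.html?replyTo=5", "/archives/123/", "/feed"]:
    print(candidate, is_disallowed(candidate))
```

Under this sketch, `/feed` without a trailing slash is not blocked, because the rule `/feed/*` requires the slash, while `/admin*` blocks anything beginning with `/admin`.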

Other Records

Field Value
sitemap https://51cg5.com/sitemap.xml