hlcgw.com
robots.txt
Robots Exclusion Standard data for hlcgw.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | hlcgw.com |
Base Domain | hlcgw.com |
Scan Status | Ok |
Last Scan | 2025-08-05T04:16:50+00:00 |
Next Scan | 2025-09-04T04:16:50+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-08-05T04:16:50+00:00 |
URL | https://hlcgw.com/robots.txt |
Domain IPs | 104.21.40.108, 172.67.150.187, 2606:4700:3033::6815:286c, 2606:4700:3033::ac43:96bb |
Response IP | 104.21.40.108 |
Found | Yes |
Hash | 32096f2311485d1433463c68bba9a09fb853cd2358bc2dc27b448b081a8db8b5 |
SimHash | e005be9bbff1 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin* |
Disallow | /danmaku/* |
Disallow | /feed/* |
Disallow | /archives/*/comment-page-* |
Disallow | /comment-page-*?replyTo=* |
Disallow | /*.html/comment-page-* |
Disallow | /upload*/xiao/* |
Disallow | /tag/*xiaohaole.cc* |
Disallow | /tag/*xiaohaola.com* |
Disallow | /tag/*kaapm.com* |
Disallow | /*?replyTo=* |
Disallow | /*?ysclid=* |
Disallow | /*?continueFlag=* |
Disallow | /usr/* |
Disallow | /upload* |
Disallow | /cdn-cgi/* |
Disallow | /cmt/ |
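
The rules above rely on `*` wildcards, which Python's standard `urllib.robotparser` does not expand. The following is a minimal sketch of how a crawler might test a path against this group, assuming the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end, and every pattern is otherwise a prefix match. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical and not part of the scan data.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/admin*",
    "/danmaku/*",
    "/feed/*",
    "/archives/*/comment-page-*",
    "/comment-page-*?replyTo=*",
    "/*.html/comment-page-*",
    "/upload*/xiao/*",
    "/tag/*xiaohaole.cc*",
    "/tag/*xiaohaola.com*",
    "/tag/*kaapm.com*",
    "/*?replyTo=*",
    "/*?ysclid=*",
    "/*?continueFlag=*",
    "/usr/*",
    "/upload*",
    "/cdn-cgi/*",
    "/cmt/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if the path (including any query string) matches a rule.

    re.match anchors at the start of the string, which gives the
    prefix-match behaviour assumed here.
    """
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for sample in ["/admin/login", "/archives/123/", "/post.html?replyTo=5", "/cmt"]:
        print(sample, is_disallowed(sample))
```

Because the group contains no Allow rules, the sketch does not need the longest-match precedence that a full parser would apply when Allow and Disallow rules overlap.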
Other Records
Field | Value |
---|---|
sitemap | https://hlcgw.com/sitemap.xml |