hlcgw.com
robots.txt

Robots Exclusion Standard data for hlcgw.com

Resource Scan

Scan Details

Site Domain hlcgw.com
Base Domain hlcgw.com
Scan Status Ok
Last Scan2025-08-05T04:16:50+00:00
Next Scan 2025-09-04T04:16:50+00:00

Last Scan

Scanned2025-08-05T04:16:50+00:00
URL https://hlcgw.com/robots.txt
Domain IPs 104.21.40.108, 172.67.150.187, 2606:4700:3033::6815:286c, 2606:4700:3033::ac43:96bb
Response IP 104.21.40.108
Found Yes
Hash 32096f2311485d1433463c68bba9a09fb853cd2358bc2dc27b448b081a8db8b5
SimHash e005be9bbff1

Groups

*

Rule Path
Disallow /admin*
Disallow /danmaku/*
Disallow /feed/*
Disallow /archives/*/comment-page-*
Disallow /comment-page-*?replyTo=*
Disallow /*.html/comment-page-*
Disallow /upload*/xiao/*
Disallow /tag/*xiaohaole.cc*
Disallow /tag/*xiaohaola.com*
Disallow /tag/*kaapm.com*
Disallow /*?replyTo=*
Disallow /*?ysclid=*
Disallow /*?continueFlag=*
Disallow /usr/*
Disallow /upload*
Disallow /cdn-cgi/*
Disallow /cmt/

Other Records

Field Value
sitemap https://hlcgw.com/sitemap.xml