wwcmsh.net
robots.txt

Robots Exclusion Standard data for wwcmsh.net

Resource Scan

Scan Details

Site Domain wwcmsh.net
Base Domain wwcmsh.net
Scan Status Ok
Last Scan 5/3/2025, 5:35:48 AM
Next Scan 6/2/2025, 5:35:48 AM

Last Scan

Scanned 5/3/2025, 5:35:48 AM
URL https://www.wwcmsh.net/robots.txt
Domain IPs 104.21.4.243, 172.67.187.61, 2606:4700:3031::6815:4f3, 2606:4700:3033::ac43:bb3d
Response IP 172.67.187.61
Found Yes
Hash fdf5e337891fe22f98acdf38b7a589db7004219af57d9d7d60b7d419b7d89634
SimHash 5118d951a7b3

Groups

*

Rule Path
Disallow /admin*
Disallow /*?replyTo=
Disallow /*?replytocom=
Disallow /feed/
Disallow /cmt/
Disallow /*comment-page-*

googlebot

Rule Path
Disallow
Allow /

yandex

Rule Path
Disallow

bingbot

Rule Path
Disallow
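
To illustrate how the groups above might be evaluated, here is a minimal Python sketch of wildcard path matching in the style of RFC 9309: `*` matches any run of characters, the longest matching rule wins, and an empty Disallow matches nothing. The helper names (`rule_to_regex`, `is_allowed`) and the sample request paths are assumptions made for illustration; individual crawlers may resolve rules differently.

```python
import re

def rule_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the end.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    pattern = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))

def is_allowed(rules, url_path):
    """Return True if url_path may be fetched under the given rules.

    rules is a list of (directive, path) pairs. The longest matching
    path wins; on a tie, Allow wins. No matching rule means allowed.
    """
    best_len = -1
    allowed = True
    for directive, path in rules:
        if not path:  # an empty Disallow matches nothing
            continue
        if rule_to_regex(path).match(url_path):
            is_allow = directive.lower() == "allow"
            if len(path) > best_len or (len(path) == best_len and is_allow):
                best_len = len(path)
                allowed = is_allow
    return allowed

# Rules copied from the "*" and "googlebot" groups in this report.
STAR_GROUP = [
    ("Disallow", "/admin*"),
    ("Disallow", "/*?replyTo="),
    ("Disallow", "/*?replytocom="),
    ("Disallow", "/feed/"),
    ("Disallow", "/cmt/"),
    ("Disallow", "/*comment-page-*"),
]
GOOGLEBOT_GROUP = [("Disallow", ""), ("Allow", "/")]

# Hypothetical request paths, chosen only to exercise the rules.
for path in ["/admin/login", "/post/1?replyTo=5", "/blog/feed/"]:
    print(path, is_allowed(STAR_GROUP, path))
```

Under this sketch, the googlebot, yandex, and bingbot groups place no restrictions on the crawler, since each contains only an empty Disallow (plus `Allow /` for googlebot), while the `*` group blocks admin, reply, feed, and paginated-comment URLs for all other agents.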

Other Records

Field Value
sitemap https://18hlw.com/sitemap.xml

Warnings

  • `clean-param` is not a known field.
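
A warning like the one above could be produced by comparing each field name in the file against a list of recognized fields. The following Python sketch shows one way to do that. The `KNOWN_FIELDS` set, the `find_unknown_fields` helper, and the sample file content are all assumptions for illustration: the scanner's actual list of known fields is not stated in this report, and the `Clean-param` value shown is a placeholder, not the value from the scanned file. (`Clean-param` is a directive documented by Yandex and is not part of the core Robots Exclusion Protocol.)

```python
# Assumed set of recognized fields; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

def find_unknown_fields(robots_txt):
    """Return unrecognized field names (lowercased), in order of first appearance."""
    unknown = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in unknown:
            unknown.append(field)
    return unknown

# Hypothetical file content; the Clean-param value is a placeholder.
SAMPLE = """\
User-agent: *
Disallow: /feed/
Clean-param: example_param /example/
Sitemap: https://18hlw.com/sitemap.xml
"""

for field in find_unknown_fields(SAMPLE):
    print(f"`{field}` is not a known field.")
```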