buhta.ws
robots.txt

Robots Exclusion Standard data for buhta.ws

Resource Scan

Scan Details

Site Domain buhta.ws
Base Domain buhta.ws
Scan Status Ok
Last Scan2025-09-21T20:09:19+00:00
Next Scan 2025-09-28T20:09:19+00:00

Last Scan

Scanned2025-09-21T20:09:19+00:00
URL https://buhta.ws/robots.txt
Domain IPs 185.197.160.32, 193.42.110.247, 193.42.111.138
Response IP 193.42.110.247
Found Yes
Hash 38243067b41ef4da2336b982efb12f6b18390adbfa74733d28af2aa32b8ae09a
SimHash 7d0db0700531

Groups

googlebot

Rule Path
Disallow /nogooglebot/

*

Rule Path
Allow /
Disallow /engine/go.php
Disallow /user/
Disallow /newposts/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /*do%3Ddownload
Disallow /*do%3Dgo
Disallow /*do%3Dchangemail

Other Records

Field Value
sitemap https://buhta.ws/sitemap.xml