hl-inside.me
robots.txt

Robots Exclusion Standard data for hl-inside.me

Resource Scan

Scan Details

Site Domain hl-inside.me
Base Domain hl-inside.me
Scan Status Ok
Last Scan 2025-10-25T00:26:04+00:00
Next Scan 2025-11-24T00:26:04+00:00
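
The two timestamps above imply a rescan interval, which can be checked with a small sketch. The report does not state its schedule, so the interval computed here is only what these two values suggest; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Scan Details listing above.
last_scan = datetime.fromisoformat("2025-10-25T00:26:04+00:00")
next_scan = datetime.fromisoformat("2025-11-24T00:26:04+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```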

Last Scan

Scanned 2025-10-25T00:26:04+00:00
URL https://hl-inside.me/robots.txt
Domain IPs 104.21.19.104, 172.67.185.196, 2606:4700:3032::6815:1368, 2606:4700:3033::ac43:b9c4
Response IP 172.67.185.196
Found Yes
Hash 98ef9b2528752f21af4ca28e5a86f52393de69c8ef87ab92a71f3d6aef5edc98
SimHash 0560c920c7b2

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /includes/
Allow /includes/css/
Allow /includes/js/
Disallow /login/
Disallow /go/
Disallow /trash/
Disallow /zaglushka/
Disallow /half-life/
Disallow /steam/
Disallow /topmenu/
Disallow */index.php

combine

Rule Path
Disallow /earth/

huaweisymantecspider

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /
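
The groups above can be read as a lookup: a crawler uses the group that names it, and falls back to `*` otherwise. The following is a minimal sketch of that evaluation, assuming the longest-match precedence described in RFC 9309 (the most specific pattern wins, and Allow wins a tie). The rule data is transcribed from the listing above; the function names, the sample agent `Googlebot`, and the sample paths are illustrative assumptions, and `user_agent` is expected to be a bare product token rather than a full User-Agent header.

```python
import re

# Rules transcribed from the Groups listing; keys are lowercase product tokens.
GROUPS = {
    "*": [
        ("disallow", "/cgi-bin/"),
        ("disallow", "/includes/"),
        ("allow", "/includes/css/"),
        ("allow", "/includes/js/"),
        ("disallow", "/login/"),
        ("disallow", "/go/"),
        ("disallow", "/trash/"),
        ("disallow", "/zaglushka/"),
        ("disallow", "/half-life/"),
        ("disallow", "/steam/"),
        ("disallow", "/topmenu/"),
        ("disallow", "*/index.php"),
    ],
    "combine": [("disallow", "/earth/")],
    "huaweisymantecspider": [("disallow", "/")],
    "wbsearchbot": [("disallow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent` (a product token)."""
    # A crawler with its own group ignores the '*' group entirely.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); longer wins, Allow wins ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed.
    return True if best is None else best[1]


print(is_allowed("Googlebot", "/includes/css/site.css"))  # True
print(is_allowed("Googlebot", "/includes/config.php"))    # False
print(is_allowed("Googlebot", "/news/index.php"))         # False
print(is_allowed("combine", "/login/"))                   # True
print(is_allowed("wbsearchbot", "/"))                     # False
```

Note the `combine` case: because that crawler has its own group, only `/earth/` is closed to it, and the `*` rules such as `/login/` do not apply under this reading.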

Warnings

  • `host` is not a known field.
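
A warning like the one above typically comes from comparing each field name in the file against a list of recognized fields. `Host` is not among the fields defined in RFC 9309; it is a non-standard extension. The sketch below shows one way such a check might work. The scanner's actual field list and the site's original robots.txt are not shown in this report, so `KNOWN_FIELDS`, `unknown_fields`, and the `SAMPLE` text (including the `example.com` values) are hypothetical.

```python
# Assumed set of recognized fields; the scanner's real list is not given.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_txt):
    """Return unrecognized field names in first-seen order, lowercased."""
    found = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue  # blank or malformed line
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


# Hypothetical sample input, not the site's actual file.
SAMPLE = """\
User-agent: *
Disallow: /login/
Host: example.com
Sitemap: https://example.com/sitemap.xml
"""

for field in unknown_fields(SAMPLE):
    print(f"`{field}` is not a known field.")
```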