htmlguardian.org
robots.txt

Robots Exclusion Standard data for htmlguardian.org

Resource Scan

Scan Details

Site Domain htmlguardian.org
Base Domain htmlguardian.org
Scan Status Ok
Last Scan2025-10-06T00:46:37+00:00
Next Scan 2025-11-05T00:46:37+00:00

Last Scan

Scanned2025-10-06T00:46:37+00:00
URL https://htmlguardian.org/robots.txt
Domain IPs 2a02:4780:b:1003:0:e47:b9fa:2, 89.117.9.125
Response IP 89.117.9.125
Found Yes
Hash 65200bd1150b9032746c6ae3d6f9e860fd65904f7e0f5bd20fc9a117fe84d252
SimHash ac296b40cfd2

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /data/