badhtml.com
robots.txt

Robots Exclusion Standard data for badhtml.com

Resource Scan

Scan Details

Site Domain badhtml.com
Base Domain badhtml.com
Scan Status Ok
Last Scan 2025-12-08T17:42:23+00:00
Next Scan 2025-12-15T17:42:23+00:00

Last Scan

Scanned 2025-12-08T17:42:23+00:00
URL https://badhtml.com/robots.txt
Domain IPs 104.21.75.237, 172.67.183.163, 2606:4700:3030::ac43:b7a3, 2606:4700:3036::6815:4bed
Response IP 172.67.183.163
Found Yes
Hash cf4cab9b144c48f1187d2af1a2dafcc8c2a5771f03794b8fdaccd960d011ea54
SimHash 6d58604c8a93
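
The Hash value above can serve as a change detector between scans. The report does not name the algorithm. A 64-digit hex string is consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. That is an assumption, as is the idea that the scanner hashes the bytes without normalizing them first. The helper names are hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "cf4cab9b144c48f1187d2af1a2dafcc8c2a5771f03794b8fdaccd960d011ea54"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a freshly fetched body against the reported hash.

    Assumption: the scanner hashes the unmodified bytes with SHA-256.
    A mismatch may mean the file changed, or that this assumption is wrong.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch on its own does not prove the file changed, since the scanner's exact hashing input is not documented in the report.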

Groups

*

Rule Path
Disallow /donttrackthis/
Disallow /cgi-bin/

Other Records

Field Value
sitemap https://badhtml.com/sitemap.xml
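
The group and sitemap records above can be checked against sample URLs with Python's standard urllib.robotparser. The robots.txt text below is reconstructed from the report, so its exact layout, ordering, and any comments in the real file are assumptions. The user agent name ExampleBot and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; the real file's layout is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /donttrackthis/
Disallow: /cgi-bin/

Sitemap: https://badhtml.com/sitemap.xml
"""

# Parse the rules from text instead of fetching them over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the agent may fetch the given path on badhtml.com."""
    return parser.can_fetch(agent, "https://badhtml.com" + path)


print(allowed("/"))                         # site root, not covered by any rule
print(allowed("/donttrackthis/page.html"))  # under a Disallow prefix
print(allowed("/cgi-bin/test.cgi"))         # under a Disallow prefix
print(parser.site_maps())                   # sitemap URLs declared in the file
```

Because the only group is the wildcard group, every user agent gets the same answer here. Paths outside the two disallowed prefixes are permitted by default.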