Robots Exclusion Standard data for htmlkit.com

Resource Scan

Scan Details

Site Domain htmlkit.com
Base Domain htmlkit.com
Scan Status Ok
Last Scan 2025-10-16T04:15:55+00:00
Next Scan 2025-11-15T04:15:55+00:00

Last Scan

Scanned 2025-10-16T04:15:55+00:00
URL https://www.htmlkit.com/robots.txt
Domain IPs 104.21.1.170, 172.67.129.167, 2606:4700:3032::6815:1aa, 2606:4700:3037::ac43:81a7
Response IP 104.21.1.170
Found Yes
Hash 6a15c5a8f19b310d44fea03ca8451a58d89c4308c59ad44dd00613da05b658be
SimHash 80442a82e3d2

Groups

*

Rule Path
Disallow /dl/
Disallow /e/
Disallow /html-kit/pe*
Disallow /pe*

ia_archiver
rufusbot

Rule Path
Disallow /

sbider

Rule Path
Disallow /
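To illustrate how the groups listed above might be applied, here is a minimal Python sketch of a matcher. It assumes the common robots.txt conventions that `*` in a path matches any run of characters, that a trailing `$` anchors the end of the path, and that a crawler falls back to the `*` group when no named group matches it. The `GROUPS` layout and the function names `pattern_to_regex`, `rules_for`, and `is_allowed` are hypothetical, chosen for this example only, and the user-agent comparison is a simplified substring check rather than a full implementation of the standard.

```python
import re

# Groups transcribed from the scan listing; the dict layout is an assumption
# made for this sketch, not the scanner's own data model.
GROUPS = {
    ("*",): ["/dl/", "/e/", "/html-kit/pe*", "/pe*"],
    ("ia_archiver", "rufusbot"): ["/"],
    ("sbider",): ["/"],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters and a trailing '$' anchors the end;
    everything else is matched literally as a path prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def rules_for(user_agent):
    """Return the Disallow patterns of the group that applies to user_agent."""
    ua = user_agent.lower()
    for agents, patterns in GROUPS.items():
        # Simplified check: a named agent applies if it appears in the UA string.
        if any(a != "*" and a.lower() in ua for a in agents):
            return patterns
    return GROUPS[("*",)]


def is_allowed(user_agent, path):
    """True if no Disallow pattern in the applicable group matches path."""
    return not any(pattern_to_regex(p).match(path) for p in rules_for(user_agent))


if __name__ == "__main__":
    for agent, path in [
        ("Googlebot", "/"),
        ("Googlebot", "/dl/file.zip"),
        ("Googlebot", "/people"),
        ("ia_archiver", "/index.html"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, the `/pe*` rule behaves like a plain `/pe` prefix, so a path such as `/people` is blocked for generic crawlers, while the three named agents are blocked from every path.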