htmlpublish.com
robots.txt
Robots Exclusion Standard data for htmlpublish.com
Resource Scan
Scan Details
| Site Domain | htmlpublish.com |
| Base Domain | htmlpublish.com |
| Scan Status | Ok |
| Last Scan | 2025-11-01T00:14:42+00:00 |
| Next Scan | 2025-12-01T00:14:42+00:00 |
Last Scan
| Scanned | 2025-11-01T00:14:42+00:00 |
| URL | https://htmlpublish.com/robots.txt |
| Redirect | https://www.htmlpublish.com/robots.txt |
| Redirect Domain | www.htmlpublish.com |
| Redirect Base | htmlpublish.com |
| Domain IPs | 104.21.72.238, 172.67.187.213, 2606:4700:3030::6815:48ee, 2606:4700:3032::ac43:bbd5 |
| Redirect IPs | 104.21.72.238, 172.67.187.213, 2606:4700:3030::6815:48ee, 2606:4700:3032::ac43:bbd5 |
| Response IP | 172.67.187.213 |
| Found | Yes |
| Hash | 1cbfa68f830bed2eb5e3fd10eb1ca6671e96b7d61fd9348d52cf3c81978e4db9 |
| SimHash | 0a0cd531a713 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
| Disallow | /terms/ |
| Disallow | /privacy/ |
| Disallow | /diagnostics/ |
| Disallow | /share/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.htmlpublish.com/sitemap.xml |