html.design
robots.txt

Robots Exclusion Standard data for html.design

Resource Scan

Scan Details

Site Domain html.design
Base Domain html.design
Scan Status Ok
Last Scan2025-12-08T00:12:24+00:00
Next Scan 2025-12-15T00:12:24+00:00

Last Scan

Scanned2025-12-08T00:12:24+00:00
URL https://html.design/robots.txt
Domain IPs 104.21.76.129, 172.67.195.122, 2606:4700:3033::ac43:c37a, 2606:4700:3034::6815:4c81
Response IP 104.21.76.129
Found Yes
Hash 0e549bfa4c085d4734e0ef2c47a31b40536cabed1e7b97aeb0c292f12d28766e
SimHash e114d84b6113

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /?s=
Disallow /search/
Disallow /tag/
Disallow /cart/
Disallow /checkout/
Disallow /purchase-confirmation/
Disallow /account/
Allow /wp-admin/admin-ajax.php

oai-searchbot

Rule Path
Disallow

google-extended

Rule Path
Disallow

msaibot

Rule Path
Disallow

Other Records

Field Value
sitemap https://html.design/sitemap_index.xml