clean.io
robots.txt

Robots Exclusion Standard data for clean.io

Resource Scan

Scan Details

Site Domain clean.io
Base Domain clean.io
Scan Status Ok
Last Scan2025-09-17T12:45:36+00:00
Next Scan 2025-10-17T12:45:36+00:00

Last Scan

Scanned2025-09-17T12:45:36+00:00
URL https://clean.io/robots.txt
Redirect https://www.clean.io/robots.txt
Redirect Domain www.clean.io
Redirect Base clean.io
Domain IPs 104.18.32.230, 172.64.155.26, 2606:4700:4403::6812:20e6, 2a06:98c1:3107::ac40:9b1a
Redirect IPs 104.18.32.230, 172.64.155.26, 2606:4700:4403::6812:20e6, 2a06:98c1:3107::ac40:9b1a
Response IP 172.64.155.26
Found Yes
Hash 9b05d6182b69ad680c20b836061c2a542e2122e17a135d73bf4d2987346dd8df
SimHash 36fdcee0deb3

Groups

hubspotcontentsearchbot

Rule Path
Disallow /sample-*
Disallow /blog/sample-*
Disallow /cleanad-customer-updates/
Disallow /cleanad-customer-updates/*
Disallow /cleancart-customer-updates/
Disallow /cleancart-customer-updates/*
Disallow /page/*
Disallow /author/*
Disallow /tag/*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*

*

Rule Path
Disallow /page/
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*

Other Records

Field Value
sitemap https://www.clean.io/sitemap.xml