carbonclean.com
robots.txt

Robots Exclusion Standard data for carbonclean.com

Resource Scan

Scan Details

Site Domain carbonclean.com
Base Domain carbonclean.com
Scan Status Ok
Last Scan2024-11-05T06:54:15+00:00
Next Scan 2024-12-05T06:54:15+00:00

Last Scan

Scanned2024-11-05T06:54:15+00:00
URL https://carbonclean.com/robots.txt
Redirect https://www.carbonclean.com/robots.txt
Redirect Domain www.carbonclean.com
Redirect Base carbonclean.com
Domain IPs 3.220.40.113
Redirect IPs 199.60.103.226, 199.60.103.30, 2606:2c40::c73c:671e, 2606:2c40::c73c:67e2
Response IP 199.60.103.226
Found Yes
Hash 5a4296eb669e029e60c7cc5b5ed38696774570381f9f6cb351b5127dec7a68d0
SimHash 7e55c670c5b3

Groups

*

Rule Path
Disallow /sample-*
Disallow /blog/sample-*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*

Other Records

Field Value
sitemap https://www.carbonclean.com/sitemap.xml