explore.darwinbox.com
robots.txt

Robots Exclusion Standard data for explore.darwinbox.com

Resource Scan

Scan Details

Site Domain explore.darwinbox.com
Base Domain darwinbox.com
Scan Status Ok
Last Scan2025-08-01T10:16:56+00:00
Next Scan 2025-08-31T10:16:56+00:00

Last Scan

Scanned2025-08-01T10:16:56+00:00
URL https://explore.darwinbox.com/robots.txt
Domain IPs 199.60.103.227, 199.60.103.29, 2606:2c40::c73c:671d, 2606:2c40::c73c:67e3
Response IP 199.60.103.29
Found Yes
Hash 282c3044f78185abebb2472fcd373eca681f96350a1a1d4cdc66bf0c6bc110b5
SimHash b160677d47b3

Groups

*

Rule Path
Allow /lp/*
Allow /resources/*
Disallow /resources/*?utm
Disallow /resources/*/?utm
Disallow /lp/*?utm
Disallow /lp/*/?utm
Disallow /tp/*
Disallow /test
Disallow *?hsCtaTracking*
Disallow *?__hstc*
Disallow *?hsLang*
Disallow /cs/c*
Disallow /lp/cloud*
Disallow /lp/download-certificate*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*