corelight.com
robots.txt

Robots Exclusion Standard data for corelight.com

Resource Scan

Scan Details

Site Domain corelight.com
Base Domain corelight.com
Scan Status Ok
Last Scan 2024-09-25T18:01:05+00:00
Next Scan 2024-10-25T18:01:05+00:00

Last Scan

Scanned 2024-09-25T18:01:05+00:00
URL https://corelight.com/robots.txt
Domain IPs 199.60.103.106, 199.60.103.6
Response IP 199.60.103.6
Found Yes
Hash f3515ed6c99d5856d934dc12742f42a61b1f3b00893047ca4193c7f8c0ede3e0
SimHash 31eccc554d90
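
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the scan does not name the algorithm. As a rough sketch under that assumption, a fetched robots.txt body could be fingerprinted as follows. The function name content_hash is hypothetical, and whether the scanner hashes the raw bytes or a normalized form of the file is not known.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scan's "Hash" field is SHA-256 over the raw
    response bytes. The source does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Illustrative input only, not the real file content.
    sample = b"User-agent: *\nDisallow: /form-confirmation\n"
    print(content_hash(sample))
```

Comparing digests from two scans is a cheap way to tell whether the file changed at all; it says nothing about how much it changed.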

Groups

*

Rule Path
Disallow /new-blog*
Disallow /corelight-advanced-analytics
Disallow /company/careers/3207421-0
Disallow /form-confirmation
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*

hubspotcontentsearchbot

Rule Path
Disallow /company/careers/
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
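
To illustrate how the two groups above apply to a crawler, here is a minimal sketch of wildcard path matching in the style of the Robots Exclusion Protocol, where `*` matches any run of characters and a trailing `$` anchors the end of the path. The names GROUPS, rule_to_regex, and is_allowed are hypothetical. The sketch simplifies group selection to an exact, case-insensitive user-agent match with a fallback to `*`, and it relies on both groups containing only Disallow rules, so any matching rule means the path is blocked.

```python
import re

# Disallow rules copied from the scan above, keyed by user-agent group.
GROUPS = {
    "*": [
        "/new-blog*",
        "/corelight-advanced-analytics",
        "/company/careers/3207421-0",
        "/form-confirmation",
        "/_hcms/preview/",
        "/hs/manage-preferences/",
        "/hs/preferences-center/",
        "/*?*hs_preview=*",
        "/*?*hsCacheBuster=*",
    ],
    "hubspotcontentsearchbot": [
        "/company/careers/",
        "/_hcms/preview/",
        "/hs/manage-preferences/",
        "/hs/preferences-center/",
        "/*?*hs_preview=*",
        "/*?*hsCacheBuster=*",
    ],
}


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces, then join them with ".*" for each "*".
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(user_agent: str, url_path: str) -> bool:
    """Return True if no Disallow rule in the selected group matches."""
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return not any(rule_to_regex(rule).match(url_path) for rule in rules)


if __name__ == "__main__":
    print(is_allowed("Googlebot", "/company/careers/"))
    print(is_allowed("hubspotcontentsearchbot", "/company/careers/"))
```

Under these rules a generic crawler may fetch the careers listing, while hubspotcontentsearchbot may not. A crawler that matches a named group uses only that group, so /new-blog paths are not blocked for hubspotcontentsearchbot.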

Other Records

Field Value
sitemap https://corelight.com/sitemap.xml

Comments

  • Block just HubSpot Site Search Indexing