Robots Exclusion Standard data for corelight.com

Resource Scan

Scan Details

Site Domain corelight.com
Base Domain corelight.com
Scan Status Ok
Last Scan 2024-05-28T17:57:54+00:00
Next Scan 2024-06-27T17:57:54+00:00
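
The gap between the two timestamps above can be checked with a few lines of Python. This is only a sketch of the arithmetic: the variable names are illustrative, and the report itself does not say how the next scan time is chosen, so the 30-day interval is an observation rather than a documented schedule.

```python
from datetime import datetime

# Timestamps copied from the "Last Scan" and "Next Scan" rows.
last_scan = datetime.fromisoformat("2024-05-28T17:57:54+00:00")
next_scan = datetime.fromisoformat("2024-06-27T17:57:54+00:00")

# Subtracting two timezone-aware datetimes gives a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 30
```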

Last Scan

Scanned 2024-05-28T17:57:54+00:00
URL https://corelight.com/robots.txt
Domain IPs 199.60.103.106, 199.60.103.6
Response IP 199.60.103.106
Found Yes
Hash 45476e896772d295fc0c38856d6658e33b1d617974444e38c7e10acb0c015acd
SimHash 31c4de554d90

Groups

*

Rule Path
Disallow /new-blog*
Disallow /corelight-advanced-analytics
Disallow /company/careers/3207421-0
Disallow /form-confirmation
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/

hubspotcontentsearchbot

Rule Path
Disallow /company/careers/
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/

Other Records

Field Value
sitemap https://corelight.com/sitemap.xml

Comments

  • Block just HubSpot Site Search Indexing
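
The groups, sitemap record, and comment listed above can be reassembled into a robots.txt body and queried with Python's standard `urllib.robotparser`. This is a sketch under assumptions: the ordering of the groups and the position of the comment in the real file are not shown in this report, and `ExampleBot` is a hypothetical crawler name used only for illustration. The standard-library parser matches rule paths as plain prefixes, so the `/new-blog*` wildcard rule is deliberately not exercised here.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; group order and comment placement
# are assumptions, not a copy of the live file.
ROBOTS_TXT = """\
# Block just HubSpot Site Search Indexing
User-agent: hubspotcontentsearchbot
Disallow: /company/careers/
Disallow: /_hcms/preview/
Disallow: /hs/manage-preferences/
Disallow: /hs/preferences-center/

User-agent: *
Disallow: /new-blog*
Disallow: /corelight-advanced-analytics
Disallow: /company/careers/3207421-0
Disallow: /form-confirmation
Disallow: /_hcms/preview/
Disallow: /hs/manage-preferences/
Disallow: /hs/preferences-center/

Sitemap: https://corelight.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://corelight.com"

# The HubSpot search bot has its own group, which blocks all of /company/careers/.
hubspot_careers = parser.can_fetch("hubspotcontentsearchbot", BASE + "/company/careers/")

# "ExampleBot" (hypothetical) falls back to the "*" group, which blocks only
# one specific careers path, so the careers index stays fetchable.
generic_careers = parser.can_fetch("ExampleBot", BASE + "/company/careers/")
generic_form = parser.can_fetch("ExampleBot", BASE + "/form-confirmation")

# Sitemap records are global, not tied to any user-agent group.
sitemaps = parser.site_maps()

print(hubspot_careers, generic_careers, generic_form, sitemaps)
```

The contrast between the two careers checks reflects the comment in the file: the broader `/company/careers/` block applies only to HubSpot's site search indexer, while other crawlers are restricted from a single careers URL.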