insight.ca
robots.txt

Robots Exclusion Standard data for insight.ca

Resource Scan

Scan Details

Site Domain insight.ca
Base Domain insight.ca
Scan Status Ok
Last Scan2025-06-03T04:54:32+00:00
Next Scan 2025-07-03T04:54:32+00:00

Last Scan

Scanned2025-06-03T04:54:32+00:00
URL http://insight.ca/robots.txt
Redirect https://ca.insight.com/robots.txt
Redirect Domain ca.insight.com
Redirect Base insight.com
Domain IPs 198.206.188.98
Redirect IPs 104.69.158.211
Response IP 104.69.158.211
Found Yes
Hash c447ad7413839c4b4a3e664f2e7b2b21a5d1481c68c13d4fc4a4683180fe7990
SimHash 3b0c244243d1

Groups

*

Rule Path
Disallow /*?*
Disallow */.html
Disallow /en_CA/search*.html
Disallow /insightweb/
Disallow /ca/en/why-insight/sales-engagement.html
Disallow /ca/fr/why-insight/sales-engagement.html
Disallow /flytrap/
Disallow /content/dam/insight-web/*/solutions/service-provider/microsite/assets/
Disallow /content/dam/insight-web/*/pdfs/
Allow /insightweb/*.css$
Allow /*?qtype=
Allow /*?pq=

Other Records

Field Value
sitemap https://ca.insight.com/sitemap.xml

Comments

  • Robots Exclusion Protocol file for Insight.com
  • v6
  • Reference (de-facto protocol)
  • http://www.robotstxt.org/norobots-rfc.txt