insight.ca
robots.txt
Robots Exclusion Standard data for insight.ca
Resource Scan
Scan Details
Site Domain | insight.ca |
Base Domain | insight.ca |
Scan Status | Ok |
Last Scan | 2025-06-03T04:54:32+00:00 |
Next Scan | 2025-07-03T04:54:32+00:00 |
Last Scan
Scanned | 2025-06-03T04:54:32+00:00 |
URL | http://insight.ca/robots.txt |
Redirect | https://ca.insight.com/robots.txt |
Redirect Domain | ca.insight.com |
Redirect Base | insight.com |
Domain IPs | 198.206.188.98 |
Redirect IPs | 104.69.158.211 |
Response IP | 104.69.158.211 |
Found | Yes |
Hash | c447ad7413839c4b4a3e664f2e7b2b21a5d1481c68c13d4fc4a4683180fe7990 |
SimHash | 3b0c244243d1 |
Groups
*
Rule | Path |
---|---|
Disallow | /*?* |
Disallow | */.html |
Disallow | /en_CA/search*.html |
Disallow | /insightweb/ |
Disallow | /ca/en/why-insight/sales-engagement.html |
Disallow | /ca/fr/why-insight/sales-engagement.html |
Disallow | /flytrap/ |
Disallow | /content/dam/insight-web/*/solutions/service-provider/microsite/assets/ |
Disallow | /content/dam/insight-web/*/pdfs/ |
Allow | /insightweb/*.css$ |
Allow | /*?qtype= |
Allow | /*?pq= |
Other Records
Field | Value |
---|---|
sitemap | https://ca.insight.com/sitemap.xml |
Comments