insight.com
robots.txt
Robots Exclusion Standard data for insight.com
Resource Scan
Scan Details
Site Domain | insight.com |
Base Domain | insight.com |
Scan Status | Ok |
Last Scan | 2025-05-19T05:13:53+00:00 |
Next Scan | 2025-06-18T05:13:53+00:00 |
Last Scan
Scanned | 2025-05-19T05:13:53+00:00 |
URL | https://insight.com/robots.txt |
Redirect | https://www.insight.com/robots.txt |
Redirect Domain | www.insight.com |
Redirect Base | insight.com |
Domain IPs | 198.206.188.155 |
Redirect IPs | 104.69.44.54 |
Response IP | 104.69.44.54 |
Found | Yes |
Hash | 06e3826a023dae9abb8d52987facefcf27cc46f5d47e63755970df795df01662 |
SimHash | 3e0cb4424bb1 |
Groups
*
Rule | Path |
---|---|
Disallow | /*?* |
Disallow | */.html |
Disallow | /en_US/search*.html |
Disallow | /insightweb/ |
Disallow | /us/en/why-insight/sales-engagement.html |
Disallow | /flytrap/ |
Disallow | /content/dam/insight-web/*/solutions/service-provider/microsite/assets/ |
Disallow | /content/dam/insight-web/*/pdfs/ |
Allow | /insightweb/*.css$ |
Allow | /*?qtype= |
Allow | /*?pq= |
Allow | /*?identifier=shopping |
Allow | /*?partnermessage |
Other Records
Field | Value |
---|---|
sitemap | https://www.insight.com/sitemap.xml |
Comments