insight.de
robots.txt

Robots Exclusion Standard data for insight.de

Resource Scan

Scan Details

Site Domain insight.de
Base Domain insight.de
Scan Status Ok
Last Scan2025-07-02T11:46:14+00:00
Next Scan 2025-08-01T11:46:14+00:00

Last Scan

Scanned2025-07-02T11:46:14+00:00
URL https://www.insight.de/robots.txt
Redirect https://de.insight.com/robots.txt
Redirect Domain de.insight.com
Redirect Base insight.com
Domain IPs 199.181.77.75
Redirect IPs 23.50.81.110
Response IP 104.69.158.211
Found Yes
Hash 6151123defc19b85ef2f6a039428d021570d1d10cc9ef360d980c146e87d9169
SimHash 3f1c14425bd1

Groups

*

Rule Path
Disallow /*?*
Disallow */.html
Disallow /de_DE/search*.html
Disallow /insightweb/
Disallow /de/de/why-insight/sales-engagement.html
Disallow /flytrap/
Disallow /content/dam/insight-web/*/solutions/service-provider/microsite/assets/
Disallow /content/dam/insight-web/*/pdfs/
Allow /insightweb/*.css$
Allow /*?qtype=
Allow /*?pq=
Allow /*?identifier=shopping
Allow /*?partnermessage

Other Records

Field Value
sitemap https://de.insight.com/sitemap.xml

Comments

  • Robots Exclusion Protocol file for Insight.com
  • v6
  • Reference (de-facto protocol)
  • http://www.robotstxt.org/norobots-rfc.txt