imusa.com.co
robots.txt

Robots Exclusion Standard data for imusa.com.co

Resource Scan

Scan Details

Site Domain imusa.com.co
Base Domain imusa.com.co
Scan Status Ok
Last Scan2024-09-17T06:16:20+00:00
Next Scan 2024-10-17T06:16:20+00:00

Last Scan

Scanned2024-09-17T06:16:20+00:00
URL https://www.imusa.com.co/robots.txt
Domain IPs 13.33.88.125, 13.33.88.41, 13.33.88.66, 13.33.88.70, 2600:9000:223b:3000:7:7c8b:ee80:93a1, 2600:9000:223b:3400:7:7c8b:ee80:93a1, 2600:9000:223b:4800:7:7c8b:ee80:93a1, 2600:9000:223b:6600:7:7c8b:ee80:93a1, 2600:9000:223b:7e00:7:7c8b:ee80:93a1, 2600:9000:223b:9000:7:7c8b:ee80:93a1, 2600:9000:223b:aa00:7:7c8b:ee80:93a1, 2600:9000:223b:b800:7:7c8b:ee80:93a1
Response IP 13.33.88.66
Found Yes
Hash 1a55ab8d3ac72c1e1b55448957854552d10f6bb2fd8c52d16e9f97ae8517a562
SimHash ccd8e5235fd7

Groups

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /account$
Disallow /login$
Disallow /login/*
Disallow /checkout/*
Disallow /busca/*
Disallow /quick-view/*
Disallow /espiar/*

*

Rule Path
Allow /*.css
Allow /*.jpeg
Allow /*.js
Allow /*.png
Allow /*.webp
Allow /*.jpg
Allow /*.svg
Allow /*.woff
Allow /*.gif
Allow /*.ico
Disallow /*?
Allow /*?page=
Disallow /*/b$
Disallow /*/d$
Allow /*graphql
Allow /api/catalog_system/
Allow /api/dataentities/SF/search?_where=address
Allow /api/dataentities/WU/search?_where=rol
Allow /*arquivos/ids/*width%3D

*

Rule Path
Allow /*skuId%3D
Allow /*idsku%3D

Other Records

Field Value
sitemap https://www.imusa.com.co/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.
  • ALLOWs QUE DEBEN ESTAR SIEMPRE
  • URLS NO DEBEN APARECER
  • Urls rastreables
  • vtexassets RASTREABLES IMG
  • ALLOW URL PARAMETROS: IMPORTANTE MERCHANT
  • SITEMAPS