crocs.com.ec
robots.txt

Robots Exclusion Standard data for crocs.com.ec

Resource Scan

Scan Details

Site Domain crocs.com.ec
Base Domain crocs.com.ec
Scan Status Ok
Last Scan2024-10-29T23:08:24+00:00
Next Scan 2024-11-28T23:08:24+00:00

Last Scan

Scanned2024-10-29T23:08:24+00:00
URL https://www.crocs.com.ec/robots.txt
Domain IPs 13.33.88.105, 13.33.88.46, 13.33.88.51, 13.33.88.8, 2600:9000:223b:3800:19:29ab:a8c0:93a1, 2600:9000:223b:5600:19:29ab:a8c0:93a1, 2600:9000:223b:5a00:19:29ab:a8c0:93a1, 2600:9000:223b:9a00:19:29ab:a8c0:93a1, 2600:9000:223b:b800:19:29ab:a8c0:93a1, 2600:9000:223b:da00:19:29ab:a8c0:93a1, 2600:9000:223b:f400:19:29ab:a8c0:93a1, 2600:9000:223b:fa00:19:29ab:a8c0:93a1
Response IP 13.33.88.105
Found Yes
Hash eb613f8745849289b908d0c264f649ef77de6021db0c3b5bd0bed10404384319
SimHash 64d025569ea1

Groups

*

Rule Path
Disallow /img/
Disallow /account/
Disallow /login/
Disallow /checkout/
Disallow /busca/
Disallow /quick-view/
Disallow /espiar/
Disallow /buscapagina/

*

Rule Path
Allow /*.css
Allow /*.jpeg
Allow /*.js
Allow /*.png
Allow /*.webp
Allow /*.jpg
Allow /*.svg
Allow /*.woff

*

Rule Path Comment
Disallow /secure/ -
Disallow /account/ -
Disallow /admin/ -
Disallow /busca/ -
Disallow /login/ -
Disallow /buscapagina -
Disallow /buscavazia -
Disallow /buscavazia/* -
Disallow /checkout/ -
Disallow /checkout/* -
Disallow /checkout /*
Disallow /checkout/ /cart
Disallow /checkout/cart/add -
Disallow /checkout /
Disallow /control/ -
Disallow /espiar/ -
Disallow /files/ -
Disallow /img/ -
Disallow /lista-de-deseos -
Disallow /login -
Disallow /quick-view/ -
Disallow /Sistema/ -
Disallow /Sistema/404 -
Disallow /Sistema/buscavazia -
Disallow /wishlist -
Disallow /te-ayudamos -
Allow /_v/public/graphql/v1?workspace= -
Allow /_v/segment/graphql/v1?workspace= -
Allow /_v/private/graphql/v1?workspace= -
Allow /api/vtexid/pub/authentication/providers?scope= -
Allow /_v/segment/routing/vtex.store%402.x/*device%3D* -
Allow /*?__pickRuntime=*device=* -
Allow /_v/public/vtex.styles-graphql/v1/font/*.otf -
Allow /_v/public/vtex.styles-graphql/v1/fonts/*workspace%3D* -
Allow /api/catalog_system/pub/products/search?fq= -
Allow /api/sessions?items=* -
Allow /arquivos/ids/*width%3D* -
Allow /assets/vtex.file-manager-graphql/images* -

*

Rule Path
Allow /*?idsku*
Disallow /*?
Disallow /*%26
Disallow /*%
Disallow /*/d$
Disallow /*/b$
Allow /*?skuId=
Allow /*?page=
Allow /*utm_
Allow /*?width

*

Rule Path
Disallow /*.aspx

Other Records

Field Value
sitemap https://www.crocs.com.ec/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.
  • ALLOWs QUE DEBEN ESTAR SIEMPRE
  • URLS NO DEBEN APARECER
  • JSON RASTREABLES
  • vtexassets RASTREABLES IMG
  • DISALLOW URL PARAMETROS
  • ALLOW URL PARAMETROS: IMPORTANTE MERCHANT
  • EVITAR 404 ERRORES