pepeganga.com
robots.txt

Robots Exclusion Standard data for pepeganga.com

Resource Scan

Scan Details

Site Domain pepeganga.com
Base Domain pepeganga.com
Scan Status Ok
Last Scan 2024-11-16T04:59:02+00:00
Next Scan 2024-12-16T04:59:02+00:00

Last Scan

Scanned 2024-11-16T04:59:02+00:00
URL https://www.pepeganga.com/robots.txt
Domain IPs 108.156.133.123, 108.156.133.31, 108.156.133.35, 108.156.133.88, 2600:9000:2755:1000:a:9b59:c7c0:93a1, 2600:9000:2755:2200:a:9b59:c7c0:93a1, 2600:9000:2755:7600:a:9b59:c7c0:93a1, 2600:9000:2755:7a00:a:9b59:c7c0:93a1, 2600:9000:2755:9a00:a:9b59:c7c0:93a1, 2600:9000:2755:a400:a:9b59:c7c0:93a1, 2600:9000:2755:e400:a:9b59:c7c0:93a1, 2600:9000:2755:fc00:a:9b59:c7c0:93a1
Response IP 108.156.133.88
Found Yes
Hash f46164ab848ebb992cdf394a8351e5ab53d75b5430159031a7041df13bd19360
SimHash 6cd005d69ea5

Groups

*

Rule Path
Disallow /img/
Disallow /account/
Disallow /login/
Disallow /checkout/
Disallow /busca/
Disallow /quick-view/
Disallow /espiar/
Disallow /buscapagina/

*

Rule Path
Allow /*.css
Allow /*.jpeg
Allow /*.js
Allow /*.png
Allow /*.webp
Allow /*.jpg
Allow /*.svg
Allow /*.woff
Allow /*.gif

*

Rule Path Comment
Disallow /secure/ -
Disallow /account/ -
Disallow /admin/ -
Disallow /busca/ -
Disallow /login/ -
Disallow /buscapagina -
Disallow /buscavazia -
Disallow /buscavazia/* -
Disallow /checkout/ -
Disallow /checkout/* -
Disallow /checkout /*
Disallow /checkout/ /cart
Disallow /checkout/cart/add -
Disallow /checkout /
Disallow /coleccion/ -
Disallow /control/ -
Disallow /espiar/ -
Disallow /files/ -
Disallow /img/ -
Disallow /lista-de-deseos -
Disallow /login -
Disallow /quick-view/ -
Disallow /Sistema/ -
Disallow /Sistema/404 -
Disallow /Sistema/buscavazia -
Disallow /wishlist -
Disallow /te-ayudamos -
Allow /_v/public/graphql/v1?workspace= -
Allow /_v/segment/graphql/v1?workspace= -
Allow /_v/private/graphql/v1?workspace= -
Allow /api/vtexid/pub/authentication/providers?scope= -
Allow /_v/segment/routing/vtex.store%402.x/*device%3D* -
Allow /*?__pickRuntime=*device=* -
Allow /_v/public/vtex.styles-graphql/v1/font/*.otf -
Allow /_v/public/vtex.styles-graphql/v1/fonts/*workspace%3D* -
Allow /*sessions?items= -
Allow /*search?fq= -
Allow /arquivos/ids/*width%3D* -

*

Rule Path
Disallow /*?
Disallow /*%26
Disallow /*%
Disallow /*/d$
Allow /*?idsku=
Allow /*?skuId=
Allow /*?page=
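
The group above blocks every URL with a query string (Disallow /*?) and then re-allows three specific parameters. This works only for crawlers that apply the longest-match rule from RFC 9309, where the most specific matching pattern wins and Allow wins a tie. The sketch below illustrates that evaluation for this one group. The function names and sample paths are hypothetical, pattern length is used as a stand-in for specificity, and percent-encoding normalization is deliberately left out, so treat it as an approximation rather than a reference matcher.

```python
import re

# Rules copied from the group above, in file order.
RULES = [
    ("disallow", "/*?"),
    ("disallow", "/*%26"),
    ("disallow", "/*%"),
    ("disallow", "/*/d$"),
    ("allow", "/*?idsku="),
    ("allow", "/*?skuId="),
    ("allow", "/*?page="),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules.

    Assumption: specificity is approximated by pattern length, and an
    Allow rule wins when two matching patterns have equal length.
    A path that matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    # Hypothetical sample paths, not taken from the scanned site.
    for sample in ["/juguetes?page=2", "/juguetes?order=price", "/producto/d"]:
        print(sample, is_allowed(sample))
```

Under these assumptions, a paginated listing such as /juguetes?page=2 stays crawlable because /*?page= is longer than /*?, while any other query string is blocked.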

*

Rule Path
Disallow /*.aspx

Other Records

Field Value
sitemap https://www.pepeganga.com/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.
  • ALLOWS THAT MUST ALWAYS BE PRESENT
  • URLS THAT MUST NOT APPEAR
  • CRAWLABLE JSON
  • vtexassets CRAWLABLE IMG
  • DISALLOW URL PARAMETERS
  • ALLOW URL PARAMETERS: IMPORTANT FOR MERCHANT
  • AVOID 404 ERRORS
  • SITEMAPS