sigo.com.gt
robots.txt

Robots Exclusion Standard data for sigo.com.gt

Resource Scan

Scan Details

Site Domain sigo.com.gt
Base Domain sigo.com.gt
Scan Status Ok
Last Scan 2026-01-25T00:29:17+00:00
Next Scan 2026-02-24T00:29:17+00:00

Last Scan

Scanned 2026-01-25T00:29:17+00:00
URL https://sigo.com.gt/robots.txt
Domain IPs 192.124.249.89
Response IP 192.124.249.89
Found Yes
Hash 9a555b8a1c32538231a9aaccb3dac0c82415f1a8ac22a75f5c98df6598c05faf
SimHash e0b45162c515

Groups

*

Rule Path
Allow */modules/*.css
Allow */modules/*.js
Allow */modules/*.png
Allow */modules/*.jpg
Allow */modules/*.gif
Allow */modules/*.svg
Allow */modules/*.webp
Allow /js/jquery/*
Disallow /*?order=
Disallow /*?tag=
Disallow /*?id_currency=
Disallow /*?search_query=
Disallow /*?back=
Disallow /*?n=
Disallow /*%26order%3D
Disallow /*%26tag%3D
Disallow /*%26id_currency%3D
Disallow /*%26search_query%3D
Disallow /*%26back%3D
Disallow /*%26n%3D
Disallow /*controller%3Daddresses
Disallow /*controller%3Daddress
Disallow /*controller%3Dauthentication
Disallow /*controller%3Dcart
Disallow /*controller%3Ddiscount
Disallow /*controller%3Dfooter
Disallow /*controller%3Dget-file
Disallow /*controller%3Dheader
Disallow /*controller%3Dhistory
Disallow /*controller%3Didentity
Disallow /*controller%3Dimages.inc
Disallow /*controller%3Dinit
Disallow /*controller%3Dmy-account
Disallow /*controller%3Dorder
Disallow /*controller%3Dorder-slip
Disallow /*controller%3Dorder-detail
Disallow /*controller%3Dorder-follow
Disallow /*controller%3Dorder-return
Disallow /*controller%3Dorder-confirmation
Disallow /*controller%3Dpagination
Disallow /*controller%3Dpassword
Disallow /*controller%3Dpdf-invoice
Disallow /*controller%3Dpdf-order-return
Disallow /*controller%3Dpdf-order-slip
Disallow /*controller%3Dproduct-sort
Disallow /*controller%3Dregistration
Disallow /*controller%3Dsearch
Disallow /*controller%3Dstatistics
Disallow /*controller%3Dattachment
Disallow /*controller%3Dguest-tracking
Disallow /app/
Disallow /cache/
Disallow /classes/
Disallow /config/
Disallow /controllers/
Disallow /download/
Disallow /js/
Disallow /localization/
Disallow /log/
Disallow /mails/
Disallow /modules/
Disallow /override/
Disallow /pdf/
Disallow /src/
Disallow /tools/
Disallow /translations/
Disallow /upload/
Disallow /var/
Disallow /vendor/
Disallow /webservice/
Disallow /direccion
Disallow /direcciones
Disallow /iniciar-sesion
Disallow /carrito
Disallow /descuento
Disallow /seguimiento-pedido-invitado
Disallow /historial-compra
Disallow /datos-personales
Disallow /mi-cuenta
Disallow /pedido
Disallow /confirmacion-pedido
Disallow /seguimiento-pedido
Disallow /facturas-abono
Disallow /ha-olvidado-su-contrasena
Disallow /registro
Disallow /busqueda
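
The `*` group mixes `Allow` patterns such as `*/modules/*.css` with `Disallow /modules/`, so the result for a given URL depends on how a crawler resolves overlapping rules. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific pattern wins, and `Allow` wins a tie). `pattern_to_regex` and `is_allowed` are illustrative names rather than library functions, only a handful of the rules are copied, and the sample paths in the usage note are hypothetical.

```python
import re

# A few rules copied from the "*" group above, as (directive, pattern) pairs.
RULES = [
    ("allow", "*/modules/*.css"),
    ("allow", "/js/jquery/*"),
    ("disallow", "/*?order="),
    ("disallow", "/js/"),
    ("disallow", "/modules/"),
    ("disallow", "/carrito"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under longest-match precedence."""
    best = None  # (pattern length, is_allow) of the best match so far
    for directive, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "allow")
            # Longer pattern wins; on equal length, allow (True) beats disallow.
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/modules/ps_banner/style.css")` is true because the 15-character `Allow` pattern outranks the 9-character `Disallow /modules/`, while `is_allowed("/modules/ps_banner/config.xml")` is false. A crawler that applies rules in file order instead of by length could reach a different verdict.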

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

bingbot

Rule Path
Disallow

bingpreview

Rule Path
Disallow

facebookexternalhit

Rule Path
Disallow

instagram

Rule Path
Disallow

tiktokbot

Rule Path
Disallow

twitterbot

Rule Path
Disallow

whatsappcrawler

Rule Path
Disallow

linkedinbot

Rule Path
Disallow

telegrambot

Rule Path
Disallow

gptbot

Rule Path
Disallow

claudebot

Rule Path
Disallow

amazonbot

Rule Path
Disallow

applebot

Rule Path
Disallow

duckduckbot

Rule Path
Disallow
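
Each of the groups above carries a `Disallow` rule with an empty path. Under the robots.txt convention an empty `Disallow` blocks nothing, so these crawlers may fetch every URL, including paths that the `*` group disallows. The following is a minimal sketch using Python's standard `urllib.robotparser`. The excerpt is reconstructed from this report rather than copied from the live file, and `ExampleAgent` is a made-up client name.

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the groups listed in this report (assumed layout).
EXCERPT = """\
User-agent: *
Disallow: /carrito

User-agent: googlebot
Disallow:
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())

# googlebot has its own group with an empty Disallow, so nothing is blocked.
googlebot_ok = parser.can_fetch("Googlebot", "https://sigo.com.gt/carrito")

# An agent without its own group falls back to the "*" group.
other_ok = parser.can_fetch("ExampleAgent", "https://sigo.com.gt/carrito")
```

With this excerpt, `googlebot_ok` is `True` and `other_ok` is `False`.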

crawl

Rule Path
Disallow /

daum

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

seek

Rule Path
Disallow /

scanner

Rule Path
Disallow /

bw

Rule Path
Disallow /

domainstatsbot

Rule Path
Disallow /

mixrankbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

checker

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

proxmic

Rule Path
Disallow /

curl

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

robot

Rule Path
Disallow /
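
Several of the blocked groups use short generic tokens (`crawl`, `seek`, `scanner`, `checker`, `robot`, `bw`) rather than full crawler names. How widely such a token applies depends on the parser. Python's standard `urllib.robotparser`, for instance, treats a group token as a case-insensitive substring of the client's name. The sketch below assumes that parser and a reconstructed excerpt; the user-agent strings `MyRobot/2.0` and `ExampleFetcher/1.0` are made up, and other crawlers may match tokens differently.

```python
from urllib.robotparser import RobotFileParser

# Two of the blocked groups, reconstructed from this report (assumed layout).
EXCERPT = """\
User-agent: robot
Disallow: /

User-agent: semrushbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())

url = "https://sigo.com.gt/"

# Matches the "semrushbot" group by name.
semrush_ok = parser.can_fetch("SemrushBot", url)

# Matches the "robot" group only because "myrobot" contains "robot".
myrobot_ok = parser.can_fetch("MyRobot/2.0", url)

# Matches neither group; this excerpt has no "*" group, so the fetch is allowed.
fetcher_ok = parser.can_fetch("ExampleFetcher/1.0", url)
```

Here `semrush_ok` and `myrobot_ok` are both `False`, while `fetcher_ok` is `True`.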

yandexbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

yandeximages

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

pinterestbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

discover

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

feed

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20
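
The groups above set only a `crawl-delay`, a field outside the core robots.txt standard that some crawlers read as the number of seconds to wait between requests and others ignore. The following is a minimal sketch of a client that honours it, assuming Python's `urllib.robotparser` and a reconstructed excerpt. `schedule` is an illustrative helper, not a library function, and the paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Two of the delayed groups, reconstructed from this report (assumed layout).
EXCERPT = """\
User-agent: yandexbot
Crawl-delay: 20

User-agent: pinterestbot
Crawl-delay: 15
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())


def schedule(agent: str, paths: list[str]) -> list[tuple[int, str]]:
    """Pair each path with the earliest offset (in seconds) at which to fetch it."""
    delay = parser.crawl_delay(agent) or 0  # None when no delay is declared
    return [(i * delay, path) for i, path in enumerate(paths)]


plan = schedule("YandexBot", ["/", "/zapatos", "/ofertas"])
```

For `YandexBot` the plan spaces the three requests 20 seconds apart (offsets 0, 20, and 40). An agent with no declared delay gets offsets of 0 throughout.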

Other Records

Field Value
sitemap https://www.sigo.com.gt/1_index_sitemap.xml

Comments

  • robots.txt automatically generated by PrestaShop e-commerce open-source solution
  • https://www.prestashop.com - https://www.prestashop.com/forums
  • This file is to prevent the crawling and indexing of certain parts of your site by web crawlers and spiders run by sites like Yahoo! and Google. By telling these "robots" where not to go on your site, you save bandwidth and server resources.
  • For more information about the robots.txt standard, see: https://www.robotstxt.org/robotstxt.html
  • Allow Directives
  • Private pages
  • Directories for www.sigo.com.gt
  • Files
  • Sitemap
  • ==============================
  • IMPORTANT BOTS (ALLOWED)
  • ==============================
  • ==============================
  • PERMANENT BLOCK
  • ==============================
  • ==============================
  • SECONDARY BOTS (SLOW)
  • ==============================

Warnings

  • 2 invalid lines.