gauss.com.br
robots.txt

Robots Exclusion Standard data for gauss.com.br

Resource Scan

Scan Details

Site Domain gauss.com.br
Base Domain gauss.com.br
Scan Status Ok
Last Scan2026-02-05T21:01:58+00:00
Next Scan 2026-03-07T21:01:58+00:00

Last Scan

Scanned2026-02-05T21:01:58+00:00
URL https://gauss.com.br/robots.txt
Redirect https://www.gauss.com.br/robots.txt
Redirect Domain www.gauss.com.br
Redirect Base gauss.com.br
Domain IPs 45.164.92.143
Redirect IPs 45.164.92.143
Response IP 45.164.92.143
Found Yes
Hash 424ea93311be7d7bb1dfa5f32cd0fbe36768275c3ffd6ea8ac68c38213a3720c
SimHash 2a34c77046e4

Groups

bingbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

imagespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

imagesift

Rule Path
Disallow /

scrapedia

Rule Path
Disallow /

scrapedia-receive

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /admin/
Disallow /private/

Other Records

Field Value
crawl-delay 1

googlebot-image

Rule Path
Disallow /admin/
Disallow /private/

facebookexternalhit

Rule Path
Disallow /admin/
Disallow /private/

twitterbot

Rule Path
Disallow /admin/
Disallow /private/

linkedinbot

Rule Path
Disallow /admin/
Disallow /private/

*

Rule Path
Disallow /admin/
Disallow /private/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /cgi-bin/
Disallow /*.php$
Disallow /*?*

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap https://www.gauss.com.br/sitemap.xml
sitemap https://www.gauss.com.br/sitemap_index.xml

Comments

  • robots.txt - Bloqueio de Bots Problemáticos
  • Gerado em: 01/Jul/2025
  • ==========================================
  • BLOQUEIO DE BOTS IDENTIFICADOS NOS LOGS
  • ==========================================
  • Microsoft Bing Bot - Causando erros 500
  • ==========================================
  • OUTROS BOTS PROBLEMÁTICOS
  • ==========================================
  • Scrapers e bots maliciosos
  • ==========================================
  • BOTS ADICIONAIS COMUNS PROBLEMÁTICOS
  • ==========================================
  • Bots que frequentemente causam sobrecarga
  • ==========================================
  • BOTS PERMITIDOS (Importantes para SEO)
  • ==========================================
  • Google (sempre permitir)
  • Facebook (para compartilhamentos)
  • Twitter (para cards)
  • LinkedIn
  • ==========================================
  • CONFIGURAÇÕES GERAIS
  • ==========================================
  • Todos os outros bots (regra geral)
  • ==========================================
  • SITEMAP
  • ==========================================
  • Sitemap principal (substitua pela URL real)
  • ==========================================
  • NOTAS IMPORTANTES
  • ==========================================
  • ATENÇÃO: robots.txt é apenas uma "sugestão"
  • Bots maliciosos podem ignorar estas regras
  • Use sempre .htaccess como backup para bloqueio efetivo
  • Para verificar se está funcionando:
  • https://www.gauss.com.br/robots.txt
  • Teste com Google Search Console:
  • https://search.google.com/search-console/robots-txt-tester