codigoderecarga.app
robots.txt

Robots Exclusion Standard data for codigoderecarga.app

Resource Scan

Scan Details

Site Domain codigoderecarga.app
Base Domain codigoderecarga.app
Scan Status Ok
Last Scan2026-02-17T02:06:52+00:00
Next Scan 2026-02-24T02:06:52+00:00

Last Scan

Scanned2026-02-17T02:06:52+00:00
URL https://codigoderecarga.app/robots.txt
Domain IPs 104.26.0.113, 104.26.1.113, 172.67.68.138, 2606:4700:20::681a:171, 2606:4700:20::681a:71, 2606:4700:20::ac43:448a
Response IP 104.26.0.113
Found Yes
Hash 761f8e3e20fdd73653e87040567ea25f1bda8dc3cf178e753f5c0cd22d5fe707
SimHash 6b1fdda02534

Groups

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

yandex

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

slurp

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /admin/
Disallow /admin/*

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://codigoderecarga.com/sitemap.xml

Comments

  • ============================================
  • CRAWLERS DE BUSCA PERMITIDOS
  • ============================================
  • ============================================
  • BLOQUEAR CRAWLERS DE IA / LLM
  • ============================================
  • OpenAI
  • Anthropic
  • Common Crawl (usado para treinar LLMs)
  • Google AI (Bard/Gemini)
  • Meta AI
  • Cohere
  • Perplexity
  • Apple AI
  • Amazon AI
  • You.com AI
  • Scrapers genericos
  • ============================================
  • REGRAS GERAIS
  • ============================================
  • Sitemap
  • Crawl delay