santilimonche.com
robots.txt

Robots Exclusion Standard data for santilimonche.com

Resource Scan

Scan Details

Site Domain santilimonche.com
Base Domain santilimonche.com
Scan Status Ok
Last Scan2026-02-03T11:46:48+00:00
Next Scan 2026-03-05T11:46:48+00:00

Last Scan

Scanned2026-02-03T11:46:48+00:00
URL https://santilimonche.com/robots.txt
Redirect https://www.santilimonche.com/robots.txt
Redirect Domain www.santilimonche.com
Redirect Base santilimonche.com
Domain IPs 34.175.201.153
Redirect IPs 34.175.201.153
Response IP 34.175.201.153
Found Yes
Hash be7388ace1ab3c7f04384bea8698686e70545cd22721448756dfea435b42c18f
SimHash 6114ce2225b7

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow */wp-content/podscache/*
Disallow /*?replytocom=
Allow /

gptbot

Rule Path
Allow /
Allow /llms.txt

chatgpt-user

Rule Path
Allow /
Allow /llms.txt

google-extended

Rule Path
Allow /
Allow /llms.txt

anthropic-ai

Rule Path
Allow /
Allow /llms.txt

ccbot

Rule Path
Allow /
Allow /llms.txt

cohere-ai

Rule Path
Allow /
Allow /llms.txt

Other Records

Field Value
sitemap https://www.santilimonche.com/sitemap_index.xml

Comments

  • SEO: Bloquear parametros de comentarios
  • ============================================
  • AI Crawlers - Friendly Configuration
  • ============================================
  • OpenAI GPT (ChatGPT, GPT-4, etc.)
  • OpenAI ChatGPT-User (browsing mode)
  • Google Bard/Gemini
  • Anthropic Claude
  • Common Crawl (used by many AI companies)
  • Cohere AI