valsequilloactualidad.es
robots.txt

Robots Exclusion Standard data for valsequilloactualidad.es

Resource Scan

Scan Details

Site Domain valsequilloactualidad.es
Base Domain valsequilloactualidad.es
Scan Status Ok
Last Scan2026-02-25T00:59:36+00:00
Next Scan 2026-03-04T00:59:36+00:00

Last Scan

Scanned2026-02-25T00:59:36+00:00
URL https://valsequilloactualidad.es/robots.txt
Domain IPs 162.55.89.36
Response IP 162.55.89.36
Found Yes
Hash 81d0413e155ce3f5cade28bc2c911acf611975c42f1d691529ecbf4891752397
SimHash 89944568367a

Groups

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

claude-web

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

ccbot

Rule Path
Allow /

youbot

Rule Path
Allow /

bytespider

Rule Path
Allow /

diffbot

Rule Path
Allow /

facebookbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

omgili

Rule Path
Allow /

omgilibot

Rule Path
Allow /

bingbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

grapeshot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

adbeat_bot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

nutch

Rule Path
Disallow /

dow jones searchbot

Rule Path
Disallow /

linkdex

Rule Path
Disallow /

linkdex.com

Rule Path
Disallow /

linkdex.com/v2.0

Rule Path
Disallow /

flamingo_searchengine+(+http://www.flamingosearch.com/bot)

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /

owlin bot v. 3.0

Rule Path
Disallow /

owlin bot v3

Rule Path
Disallow /

owlin bot

Rule Path
Disallow /

owlin

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

domainappender

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

orangebot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

yoozbot

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

Comments

  • robots.txt para folioepress.com
  • =====================================
  • PERMITIR BOTS DE IA Y LLMs PRINCIPALES
  • =====================================
  • OpenAI (ChatGPT)
  • Anthropic (Claude)
  • Perplexity
  • Common Crawl (fuente de datos para LLMs)
  • You.com
  • ByteDance (para modelos de IA)
  • Diffbot (extracción de datos estructurados)
  • Facebook (para IA de Meta)
  • Google Extended (Bard/Gemini)
  • Omgili (agregador de contenido para IA)
  • Bingbot (necesario para Copilot de Microsoft)
  • =====================================
  • PERMITIR BOTS DE BÚSQUEDA PRINCIPALES
  • =====================================
  • =====================================
  • BLOQUEAR BOTS PROBLEMÁTICOS Y ABUSIVOS
  • =====================================

Warnings

  • 6 invalid lines.