elcorreo.es
robots.txt

Robots Exclusion Standard data for elcorreo.es

Resource Scan

Scan Details

Site Domain elcorreo.es
Base Domain elcorreo.es
Scan Status Ok
Last Scan2024-11-07T17:15:20+00:00
Next Scan 2024-11-14T17:15:20+00:00

Last Scan

Scanned2024-11-07T17:15:20+00:00
URL http://elcorreo.es/robots.txt
Redirect https://www.elcorreoweb.es/robots.txt
Redirect Domain www.elcorreoweb.es
Redirect Base elcorreoweb.es
Domain IPs 31.214.178.55
Redirect IPs 199.232.194.133, 199.232.198.133
Response IP 151.101.198.133
Found Yes
Hash a1189690cd69bdf7568aa61a8828afb94b881a1f14a14d46d1180580f924ffeb
SimHash 79785b5cc4b4

Groups

googlebot-news

Rule Path
Allow /

googlebot

Rule Path
Allow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

google-extended

Rule Path
Allow /vida-y-estilo/
Allow /ocio/
Allow /sociedad/
Allow /economia/
Allow /viajes/
Disallow /

*

Rule Path
Disallow /*?p=
Disallow /deportes/futbol/
Disallow /deportes/baloncesto/
Disallow /clip/
Disallow /cds-statics/assets/fonts/
Disallow /fonts/

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.elcorreoweb.es/sitemap_google_news_65205.xml
sitemap https://www.elcorreoweb.es/sitemap_videos_actual_65205.xml
sitemap https://www.elcorreoweb.es/sitemap_fotos_actual_65205.xml

Comments

  • Bots Google
  • Bloqueo para bots GPT
  • Reglas generales para todos los bots
  • Bots Redes Sociales
  • Sitemaps