elperiodico.es
robots.txt

Robots Exclusion Standard data for elperiodico.es

Resource Scan

Scan Details

Site Domain elperiodico.es
Base Domain elperiodico.es
Scan Status Ok
Last Scan 2024-11-19T13:17:27+00:00
Next Scan 2024-11-20T13:17:27+00:00

Last Scan

Scanned 2024-11-19T13:17:27+00:00
URL http://elperiodico.es/robots.txt
Redirect http://www.elperiodico.com/robots.txt
Redirect Domain www.elperiodico.com
Redirect Base elperiodico.com
Domain IPs 195.76.147.109
Redirect IPs 199.232.194.133, 199.232.198.133
Response IP 146.75.42.133
Found Yes
Hash afb164167e11cd14c1972782a9b3e269a3ceda175baec91debdbbd3470bed993
SimHash fd3c8f1c06a6

Groups

googlebot-news

Rule Path
Allow /

googlebot

Rule Path
Allow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
Allow /economia/
Allow /extra/
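
The google-extended group mixes a blanket Disallow with two narrower Allow rules, so the outcome depends on how a crawler resolves conflicts. The sketch below assumes the longest-match precedence described in RFC 9309 (the rule with the longest matching path wins, and Allow wins a tie). The function name `is_allowed` and the constant `GOOGLE_EXTENDED` are illustrative and not part of the scan data; the sketch handles plain prefixes only, not wildcards.

```python
def is_allowed(path, rules):
    """Return True if `path` may be crawled under `rules`.

    `rules` is a list of (directive, prefix) tuples. Assumes RFC 9309
    longest-match precedence; plain prefix matching only.
    """
    best_len = -1
    allowed = True  # a path matched by no rule is allowed by default
    for directive, prefix in rules:
        if path.startswith(prefix):
            is_allow = directive.lower() == "allow"
            # The longer prefix wins; on equal length, Allow wins.
            if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
                best_len = len(prefix)
                allowed = is_allow
    return allowed


# Rules copied from the google-extended group above.
GOOGLE_EXTENDED = [
    ("Disallow", "/"),
    ("Allow", "/economia/"),
    ("Allow", "/extra/"),
]

for sample in ("/economia/empresas", "/extra/", "/deportes/"):
    print(sample, is_allowed(sample, GOOGLE_EXTENDED))
```

Under that assumption, paths below /economia/ and /extra/ stay open to google-extended while the rest of the site is closed. A parser that applies the first matching rule in file order instead would treat everything as disallowed, because Disallow / is listed first.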

*

Rule Path
Disallow /*/blogscat/
Disallow /*/buscador*
Disallow /*/component/
Disallow /*/ext_resources/ads/
Disallow /*/ext_resources/portadas/
Disallow /*/UpdatedNewsElPeriodico.xml
Disallow /airbag/
Disallow /andorra/
Disallow /blogs/
Disallow /blogscat/
Disallow /buscador/ca/
Disallow /buscador/es/
Disallow /ca/
Disallow /campions/
Disallow /comentar.asp/
Disallow /component/
Disallow /comunes/
Disallow /domingo/
Disallow /edaragon/
Disallow /edasturias/
Disallow /edcastellon/
Disallow /edcordoba/
Disallow /edextremadura/
Disallow /edicion/
Disallow /estudiant/
Disallow /includes/
Disallow /info/
Disallow /mileniales/
Disallow /onbcn/
Disallow /r/
Disallow /stats/
Disallow /suep/
Disallow /swf/
Disallow /verano/
Disallow /viernes/
Disallow /vivo/
Disallow /alminuto.asp
Disallow /alta.asp
Disallow /archivo_titulares.asp
Disallow /comentar.asp
Disallow /default.asp
Disallow /encuestas.asp
Disallow /envio.asp
Disallow /envn.asp
Disallow /foros.asp
Disallow /galerias.asp
Disallow /print.asp
Disallow /r.asp
Disallow /rss.asp
Disallow /tickersp2.asp
Disallow /valorada.asp
Disallow /valorar.asp
Disallow /verpdf.asp
Disallow /verpdfmynews.asp
Disallow /videos2.asp
Disallow /verde-y-azul/
Disallow /buscando-respuestas/
Disallow /*/newsletters/
Disallow /deportes/futbol/
Disallow /deportes/baloncesto/
Disallow /clip/
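
Several paths in the * group use the `*` wildcard (for example /*/buscador* and /*/newsletters/). As a minimal sketch of how such patterns can be tested against a URL path, the snippet below translates a pattern into a regular expression, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `pattern_to_regex` and `matches` are hypothetical.

```python
import re


def pattern_to_regex(pattern):
    """Compile a robots.txt path pattern into a regular expression.

    Assumes '*' matches any sequence of characters and a trailing '$'
    anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def matches(pattern, path):
    """True if `path` matches `pattern` starting from the beginning of the path."""
    return pattern_to_regex(pattern).match(path) is not None


print(matches("/*/buscador*", "/es/buscador?q=x"))
print(matches("/*/newsletters/", "/newsletters/"))
```

Note that a pattern such as /*/newsletters/ requires at least one leading path segment, so it would not cover a top-level /newsletters/ path under this interpretation.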

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.elperiodico.com/es/google-news.xml
sitemap https://www.elperiodico.com/es/sitemap-noticias.xml
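
The sitemap records can also be read programmatically. The sketch below uses Python's standard `urllib.robotparser` (its `site_maps()` method requires Python 3.8 or later). The `ROBOTS_EXCERPT` text is a reconstruction from the records in this report, not the verbatim file, and it is parsed offline rather than fetched from the site.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (an assumption): one group and the two sitemap
# records listed in this report, not the verbatim robots.txt.
ROBOTS_EXCERPT = """\
User-agent: gptbot
Disallow: /

Sitemap: https://www.elperiodico.com/es/google-news.xml
Sitemap: https://www.elperiodico.com/es/sitemap-noticias.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

# site_maps() returns the Sitemap URLs in file order, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)

# can_fetch() reports whether a given user agent may request a URL.
print(parser.can_fetch("gptbot", "https://www.elperiodico.com/"))
```

Be aware that `urllib.robotparser` applies the first matching rule in file order, so it may disagree with longest-match crawlers on groups that combine Disallow and Allow rules.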

Comments

  • Google bots
  • Block for GPT bots
  • Sitemaps.