huelva24.com
robots.txt

Robots Exclusion Standard data for huelva24.com

Resource Scan

Scan Details

Site Domain huelva24.com
Base Domain huelva24.com
Scan Status Ok
Last Scan 2024-09-20T13:19:43+00:00
Next Scan 2024-09-27T13:19:43+00:00

Last Scan

Scanned 2024-09-20T13:19:43+00:00
URL https://huelva24.com/robots.txt
Redirect https://www.huelva24.com/robots.txt
Redirect Domain www.huelva24.com
Redirect Base huelva24.com
Domain IPs 23.215.7.21, 23.215.7.28
Redirect IPs 23.32.29.107, 23.32.29.91
Response IP 23.52.40.49
Found Yes
Hash b24b87cc99d29c105e2ed1b66089735b8dd7a0448d7f9b17d26bb6b3e93a5528
SimHash e03c0278a1b0
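
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch, assuming SHA-256 over the raw response body (the function name content_hash and the sample body are hypothetical, not taken from the scanner), a fingerprint of this shape could be computed as follows:

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes; the report does not say so.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real huelva24.com file.
sample = b"User-agent: *\nDisallow: /tmp/\n"
digest = content_hash(sample)
print(len(digest))  # 64 hex characters, the same length as the Hash field
```

Comparing the digest between two scans is a cheap way to detect that the file changed at all; it says nothing about how much it changed.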

Groups

*

Rule Path
Disallow */interactivo/comun/contactar.html
Disallow */interactivo/comun/condiciones.html
Disallow */interactivo/comun/privacidad.html
Disallow */interactivo/comun/publicidad.html
Disallow */registro/
Disallow */popups
Disallow /agencias/*/0$
Disallow /*-preview.html
Disallow /eltiempo/
Disallow /preview/
Disallow /_catalogo/
Disallow /includes/manuales/_catalogo/
Disallow /recortes/
Disallow /recortes/*
Disallow /elecciones/generales/resultados/
Disallow /*/undefined/*
Disallow /css/
Disallow /css-abc/
Disallow /cssp/
Disallow /iframepubli.html
Disallow /img/
Disallow /includes/
Disallow /includes-abc/
Disallow /includes_comun/
Disallow /includes_e/
Disallow /includes_m/
Disallow /js/
Disallow /js-abc/
Disallow /Media/
Disallow /MM/
Disallow /RC/
Disallow /SysConfig/
Disallow /tmp/
Disallow /Zona-C/
Disallow /deportes/futbol/directos/*/2010-2011/*
Disallow /deportes/futbol/directos/*/2011-2012/*
Disallow /deportes/futbol/directos/*/2012-2013/*
Disallow /deportes/futbol/directos/*/2013-2014/*
Disallow /deportes/futbol/directos/*/2014-2015/*
Disallow /deportes/futbol/directos/*/2015-2016/*
Disallow /gurme/*/busqueda/?
Disallow /*-nt.html$
Disallow /*-di.html$
Disallow /*-vid.html$
Disallow /*-ga.html$
Disallow /*-ft.html$
Disallow /*-aud.html$
Disallow /*-nts.html$
Disallow /*-dis.html$
Disallow /*-vis.html$
Disallow /*-gas.html$
Disallow /*-fts.html$
Disallow /*-auds.html$
Disallow */aggregate*
Disallow /includes_comun/*
Disallow /*?cartelera_cine
Disallow /acd/
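
Many of the paths above use the two special characters defined for robots.txt matching: * stands for any sequence of characters, and a trailing $ anchors the rule to the end of the URL path. The sketch below shows one way to evaluate such rules by translating them to regular expressions. The helper names rule_to_regex and is_disallowed are hypothetical, only Disallow rules are considered, and Allow precedence and longest-match ordering are deliberately left out.

```python
import re


def rule_to_regex(rule_path: str) -> "re.Pattern[str]":
    """Translate a robots.txt path rule into a compiled regular expression."""
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape literal segments and join them with ".*" for each "*" wildcard.
    pattern = ".*".join(re.escape(part) for part in rule_path.split("*"))
    if anchored:
        pattern += "$"
    return re.compile(pattern)


def is_disallowed(url_path: str, rules: list) -> bool:
    """Return True if any Disallow rule matches from the start of url_path."""
    return any(rule_to_regex(rule).match(url_path) for rule in rules)


# A few rules copied from the "*" group above.
rules = ["*/registro/", "/agencias/*/0$", "/*-nt.html$", "/css/"]

print(is_disallowed("/agencias/efe/0", rules))               # True
print(is_disallowed("/agencias/efe/01", rules))              # False
print(is_disallowed("/huelva/noticia-nt.html", rules))       # True
print(is_disallowed("/huelva/noticia-nt.html?x=1", rules))   # False
```

The last two calls show the effect of $: the rule /*-nt.html$ blocks the bare page but not the same page with a query string appended.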

twitterbot

Rule Path
Allow *

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /
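
A crawler obeys only the group whose user-agent token matches its own, and falls back to the * group when no specific group exists. The following sketch illustrates that selection for the groups listed above. The GROUPS table and the select_group function are hypothetical names, the * entry is abbreviated to two rules, and matching is simplified to a case-insensitive comparison of the whole token.

```python
# Groups from the scan, keyed by lowercase user-agent token.
# The "*" entry is abbreviated; the full list appears in the table above.
GROUPS = {
    "*": [("disallow", "/tmp/"), ("disallow", "/preview/")],
    "twitterbot": [("allow", "*")],
    "chatgpt-user": [("disallow", "/")],
    "gptbot": [("disallow", "/")],
    "google-extended": [("disallow", "/")],
    "perplexitybot": [("disallow", "/")],
}


def select_group(product_token: str) -> list:
    """Return the rules for a crawler, falling back to the "*" group."""
    return GROUPS.get(product_token.lower(), GROUPS["*"])


print(select_group("GPTBot"))      # [('disallow', '/')]
print(select_group("Twitterbot"))  # [('allow', '*')]
print(select_group("Googlebot"))   # falls back to the "*" rules
```

Because twitterbot has its own group, none of the * rules apply to it; the four AI-related agents are each blocked from the whole site.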

Other Records

Field Value
sitemap https://www.huelva24.com/sitemap.incremental.xml
sitemap https://www.huelva24.com/sitemap-video.xml
sitemap https://www.huelva24.com/sitemap.xml
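
Sitemap records are independent of the user-agent groups and apply to every crawler. As a rough sketch, assuming the raw file uses the usual "Sitemap: URL" line syntax (the report shows only the parsed field and value), the records could be collected like this; extract_sitemaps is a hypothetical helper name.

```python
def extract_sitemaps(robots_txt: str) -> list:
    """Collect the values of all sitemap records in a robots.txt text."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop trailing comments, then split on the first colon only so the
        # "https://" in the value is left intact.
        line = line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Assumed raw form of the three records listed above.
raw = """\
Sitemap: https://www.huelva24.com/sitemap.incremental.xml
Sitemap: https://www.huelva24.com/sitemap-video.xml
Sitemap: https://www.huelva24.com/sitemap.xml
"""
for url in extract_sitemaps(raw):
    print(url)
```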

Comments

  • new
  • Sitemaps
  • User Agents
  • CSV 925679