lagacetadesalamanca.es
robots.txt

Robots Exclusion Standard data for lagacetadesalamanca.es

Resource Scan

Scan Details

Site Domain lagacetadesalamanca.es
Base Domain lagacetadesalamanca.es
Scan Status Ok
Last Scan2024-06-02T09:40:36+00:00
Next Scan 2024-06-09T09:40:36+00:00

Last Scan

Scanned2024-06-02T09:40:36+00:00
URL https://lagacetadesalamanca.es/robots.txt
Redirect https://www.lagacetadesalamanca.es/robots.txt
Redirect Domain www.lagacetadesalamanca.es
Redirect Base lagacetadesalamanca.es
Domain IPs 34.111.196.5
Redirect IPs 23.44.5.120, 23.44.5.97
Response IP 125.56.219.17
Found Yes
Hash aa49c1576206144a5a40af69e40148cfe6592df8fc9b312630b663d3cb286e28
SimHash aa2593741271

Groups

mediapartners-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

twitterbot

Rule Path
Disallow *

*

Rule Path
Disallow /acd/
Disallow /fcgi-bin/
Disallow /prensa/
Disallow */registro/
Disallow */popups
Disallow /*preview.html
Disallow /preview/
Disallow /_catalogo/
Disallow /includes/manuales/_catalogo/
Disallow /*?ns_
Disallow /zHomePrueba/index.html
Disallow /pruebaspubli.html
Disallow /_cabeceraExterna/
Disallow /_config/
Disallow /externo/
Disallow /includes/
Disallow /metas.html
Disallow /_minicabecera/
Disallow /MM/
Disallow /modulos/
Disallow /RC/
Disallow /SysConfig/
Disallow /4900/vocento.lagacetadesalamanca/
Disallow /apoyos/documentos/
Disallow /gl-d/
Disallow */gl-d$
Disallow /ht-d/
Disallow */ht-d$
Disallow /bb-d/
Disallow */bb-d$
Disallow /module-amp/
Disallow */aggregate*
Disallow /backend/
Disallow */guia-tv/
Disallow /hemeroteca/*.html

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.lagacetadesalamanca.es/sitemap.xml
sitemap https://www.lagacetadesalamanca.es/sitemap.incremental.xml
sitemap https://www.lagacetadesalamanca.es/sitemap-video.xml
sitemap https://www.lagacetadesalamanca.es/sitemap-temas.xml

Comments

  • Robots www.lagacetadesalamanca.es
  • User Agents
  • mobile
  • Sitemaps