amp.diaridegirona.cat
robots.txt

Robots Exclusion Standard data for amp.diaridegirona.cat

Resource Scan

Scan Details

Site Domain amp.diaridegirona.cat
Base Domain diaridegirona.cat
Scan Status Ok
Last Scan2024-11-13T11:07:44+00:00
Next Scan 2024-11-20T11:07:44+00:00

Last Scan

Scanned2024-11-13T11:07:44+00:00
URL https://amp.diaridegirona.cat/robots.txt
Redirect https://www.diaridegirona.cat/robots.txt
Redirect Domain www.diaridegirona.cat
Redirect Base diaridegirona.cat
Domain IPs 199.232.194.133, 199.232.198.133
Redirect IPs 199.232.194.133, 199.232.198.133
Response IP 146.75.42.133
Found Yes
Hash ff64a4392e6d23a399fb08ac851ace3cba8ce49f71e3815965e64c136508a997
SimHash f8375b54c4e5

Groups

googlebot-news

Rule Path
Allow /

googlebot

Rule Path
Allow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

google-extended

Rule Path
Allow /vida-i-estil/
Allow /oci/
Allow /societat/
Allow /economia/
Disallow /

*

Rule Path
Disallow /cercador
Disallow /motogp/
Disallow /tendencias21/
Disallow /buscando-respuestas/
Disallow /verde-y-azul/
Disallow /economia/declaracion-renta/
Disallow /*?p=
Disallow /esports/futbol/
Disallow /clip/
Disallow /cds-statics/assets/fonts/

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.diaridegirona.cat/sitemap_google_news_f0f36.xml
sitemap https://www.diaridegirona.cat/sitemap_general_f0f36.xml

Comments

  • Bots Google
  • Bloqueo para bots GPT
  • Reglas generales para todos los bots
  • Bots Redes Sociales
  • Sitemaps