revistacuore.com
robots.txt

Robots Exclusion Standard data for revistacuore.com

Resource Scan

Scan Details

Site Domain revistacuore.com
Base Domain revistacuore.com
Scan Status Ok
Last Scan2024-09-28T13:39:48+00:00
Next Scan 2024-10-05T13:39:48+00:00

Last Scan

Scanned2024-09-28T13:39:48+00:00
URL http://revistacuore.com/robots.txt
Redirect https://www.elperiodico.com/cuore/robots.txt
Redirect Domain www.elperiodico.com
Redirect Base elperiodico.com
Domain IPs 195.57.161.165
Redirect IPs 199.232.194.133, 199.232.198.133
Response IP 146.75.42.133
Found Yes
Hash a12fc6bfecb195b2f3af4bfc3a47cb26191fcefc4635bc2e9b00be00e88f6498
SimHash 781d5b10c4f5

Groups

googlebot-news

Rule Path
Allow /

googlebot

Rule Path
Allow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /*?p=
Disallow /_amp/

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.elperiodico.com/cuore/sitemapToday.xml
sitemap https://www.elperiodico.com/cuore/sitemapNews.xml

Comments

  • Bots Google
  • Bloqueo para bots GPT
  • Reglas generales para todos los bots
  • Bots Redes Sociales
  • Sitemaps