www-origin.levante-emv.com
robots.txt

Robots Exclusion Standard data for www-origin.levante-emv.com

Resource Scan

Scan Details

Site Domain www-origin.levante-emv.com
Base Domain levante-emv.com
Scan Status Ok
Last Scan 2024-11-10T16:56:54+00:00
Next Scan 2024-11-17T16:56:54+00:00

Last Scan

Scanned 2024-11-10T16:56:54+00:00
URL http://www-origin.levante-emv.com/robots.txt
Response IP 213.4.119.37
Found Yes
Hash b30b7ce0640645ca7df17d29b2fcd520124b0a9c6f38574cfcb2a42940010d26
SimHash 692653d4c434

Groups

googlebot-news

Rule Path
Allow /

googlebot

Rule Path
Allow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

google-extended

Rule Path
Allow /vida-y-estilo/
Allow /ocio/
Allow /sociedad/
Allow /economia/
Allow /viajes/
Disallow /
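The google-extended group above lists its Allow lines before a catch-all Disallow, so even a simple first-match parser resolves it as intended. A minimal sketch using Python's standard-library urllib.robotparser follows. The robots.txt text is reconstructed from the scanned groups and is an assumption about the real file's layout, and the helper name check is hypothetical. Note that this parser applies rules in file order and does not interpret the * and $ wildcards, so it is only suitable for simple groups like this one.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan above; the real file's exact layout is assumed.
ROBOTS_LINES = """\
User-agent: google-extended
Allow: /vida-y-estilo/
Allow: /ocio/
Allow: /sociedad/
Allow: /economia/
Allow: /viajes/
Disallow: /

User-agent: gptbot
Disallow: /
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def check(agent, path):
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "http://www-origin.levante-emv.com" + path)


print(check("Google-Extended", "/economia/empresas/"))  # allowed section
print(check("Google-Extended", "/deportes/"))           # falls through to Disallow /
print(check("GPTBot", "/economia/"))                    # blocked everywhere
```

Because the parser stops at the first matching line, the result depends on the Allow lines preceding Disallow /, which is the order this scan reports.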

*

Rule Path
Allow /ocio/cine/cartelera/
Disallow /ocio/cine/cartelera/alicante/$
Disallow /economia/declaracion-renta/
Disallow /hemeroteca/buscador/
Allow /ocio/hosteleria/*/valencia_m/*_s/
Allow /ocio/hosteleria/*/valencia_m/*_p/
Disallow /ocio/hosteleria/*/*_s/
Disallow /ocio/hosteleria/*/*_p/
Disallow /ocio/hosteleria/*/*_m/
Disallow /tour-francia/
Disallow /medio-ambiente/
Disallow /tiempo/
Disallow /*?p=
Disallow /deportes/futbol/
Disallow /deportes/baloncesto/
Disallow /clip/
Disallow /cds-statics/assets/fonts/
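Several rules in the * group overlap (for example the cartelera Allow and the alicante/$ Disallow), so the outcome depends on how a crawler ranks matches. The sketch below assumes the precedence described in RFC 9309: the longest matching pattern wins, and Allow wins a tie. It treats * as "any sequence of characters" and a trailing $ as "end of path". The names RULES, pattern_to_regex, and is_allowed are hypothetical, only a subset of the rules is included, and individual crawlers may behave differently.

```python
import re

# A subset of the `*` group rules from the scan above, as (directive, pattern) pairs.
RULES = [
    ("allow", "/ocio/cine/cartelera/"),
    ("disallow", "/ocio/cine/cartelera/alicante/$"),
    ("allow", "/ocio/hosteleria/*/valencia_m/*_s/"),
    ("disallow", "/ocio/hosteleria/*/*_s/"),
    ("disallow", "/tiempo/"),
    ("disallow", "/*?p="),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal pieces and join them with ".*" where the pattern had "*".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Longest matching pattern wins; Allow wins a tie; no match means allowed."""
    best_len, allowed = -1, True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


for path in [
    "/ocio/cine/cartelera/valencia/",
    "/ocio/cine/cartelera/alicante/",
    "/ocio/hosteleria/restaurantes/valencia_m/centro_s/",
    "/ocio/hosteleria/restaurantes/alicante_m/centro_s/",
    "/valencia/noticia.html?p=2",
]:
    print(path, "->", "allowed" if is_allowed(path) else "blocked")
```

Under these assumptions, the $ anchor blocks only the exact /ocio/cine/cartelera/alicante/ URL and leaves deeper paths under it allowed, while the valencia_m Allow patterns outrank the shorter hosteleria Disallow patterns.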

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.levante-emv.com/sitemap_google_news_d8592.xml
sitemap https://www.levante-emv.com/sitemap_index_d8592.xml

Comments

  • Google bots
  • Block for GPT bots
  • General rules for all bots
  • Social network bots
  • Sitemaps