ilmessaggero.it
robots.txt

Robots Exclusion Standard data for ilmessaggero.it

Resource Scan

Scan Details

Site Domain ilmessaggero.it
Base Domain ilmessaggero.it
Scan Status Ok
Last Scan2024-12-23T01:16:34+00:00
Next Scan 2024-12-30T01:16:34+00:00

Last Scan

Scanned2024-12-23T01:16:34+00:00
URL https://ilmessaggero.it/robots.txt
Redirect https://www.ilmessaggero.it/robots.txt
Redirect Domain www.ilmessaggero.it
Redirect Base ilmessaggero.it
Domain IPs 34.149.236.87
Redirect IPs 34.149.236.87
Response IP 34.149.236.87
Found Yes
Hash cd475347dce263c84176a42d19413d9bd88d4961d113559fc7dda1aa50a10eb8
SimHash b08d49249f33

Groups

*

Rule Path
Disallow /SitoMessaggero/
Disallow /abbonamenti/
Disallow /includes/_stampa_articolo.php*
Disallow /cache/
Disallow /include/
Disallow /u/
Disallow /includes/
Disallow /query_cache/
Disallow /cgi-bin/
Disallow /hermes/
Disallow /?sez=JSON*
Disallow *sez%3DJSON*
Disallow *sez%3DAJAX*
Disallow /view.php*
Disallow /aprifoto.php*
Disallow /boxnewspl.php
Disallow /boxlocali_1news.php
Disallow /view
Disallow /articolo_app.php
Disallow /ELEZIONI2014/
Disallow /sfoglia_giornale.php
Disallow /articolo.php
Disallow /OMNIROMAviewnews2.php
Disallow /ANSAviewnews.php
Disallow /ANSAtitnews2.php
Disallow /tag.php*
Disallow /fotogallery.php
Disallow /video.php
Disallow /boxcasa_3news.php
Disallow /blogger.php
Disallow /pdffree.php
Disallow /stampa_post.php
Disallow /storico.php
Disallow /sondaggio.php*
Disallow /flashnews.php*
Disallow /sondaggionew.php
Disallow /sondaggio_sport.php
Disallow /ricerca_dente.php
Disallow /motore-ricerca.php
Disallow /include/twitter_share.php
Disallow /home_blog.php
Disallow /gestione_web/
Disallow /fecondazione_in_vitro_pgs_dottor_greco.php
Disallow /farmacie.php
Disallow /stampa_articolo.php*
Disallow /stampa_post.php*
Disallow /aprifoto_app.php
Disallow /dump_database.php?db=all
Disallow /admin_login.php
Disallow /sondaggiofin.php
Disallow /olimpiadi2012/
Disallow /docs/
Disallow /posta.php
Disallow /meetic.php
Disallow /boxnewspl.php
Disallow /home_blog.php
Disallow /articoloins.php
Disallow /commenti.php
Disallow /aprifoto.php*
Disallow /specialemondiali.php
Disallow /sondaggionew.php
Disallow /chisiamo.php
Disallow /dilloalmessaggero.php
Disallow /articoloins.php
Disallow /contatti.php
Disallow /contatti
Disallow /VITERBO/contatti
Disallow /ricerca_arc.php*
Disallow /tag/*
Disallow /leggitutte*
Disallow /include/*
Disallow /elenco_cardinali*
Disallow /home_page*
Disallow /mobile/
Disallow /tetractis/*
Disallow /sport/messaggero/*
Disallow /registrazione.html
Disallow /?p=leggitutte*
Disallow /index.php?p=leggitutte*
Disallow /?p=search*
Disallow /?p=single_module*
Disallow /index.php?p=search*
Disallow /index.php?p=single_module*
Disallow /index.php?p=single_module_owl*
Disallow /index.php?p=print*
Disallow /spettacoli/trovafilm/?p=search_film
Disallow /spettacoli/trovafilm/?p=search_citta
Disallow /*.shtml%20$
Disallow /38681514/
Disallow /home_*
Disallow /casa/*
Disallow /casa/img/quartieri/*
Disallow /casa/json/*
Disallow /index.php?p=search*
Disallow /?p=search*
Disallow /ricerca/*
Disallow /native_*
Disallow /speciale_eni-joule.html
Disallow /ultimissime-adn/*
Disallow /ultimissime_adn/*
Disallow /index.php/*
Disallow /sport/stats/*
Disallow /megapress/*
Disallow /video/askanews/*
Disallow /video/adnkronos/*
Disallow /*track_shop_event.php*
Disallow /monitor.php
Disallow *?p=informazioni_legali
Disallow /ansa_press_release*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

seekr

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.ilmessaggero.it/?sez=XML&p=MapNews