ilpiccolo.gelocal.it
robots.txt

Robots Exclusion Standard data for ilpiccolo.gelocal.it

Resource Scan

Scan Details

Site Domain ilpiccolo.gelocal.it
Base Domain gelocal.it
Scan Status Ok
Last Scan2024-05-24T22:31:52+00:00
Next Scan 2024-05-31T22:31:52+00:00

Last Scan

Scanned2024-05-24T22:31:52+00:00
URL https://ilpiccolo.gelocal.it/robots.txt
Domain IPs 13.33.30.108, 13.33.30.124, 13.33.30.33, 13.33.30.96
Response IP 13.33.30.124
Found Yes
Hash 1b64e9f0f06fb3971993e87f35ca8d019b7dfd53117e9fd7ab9711cc57741871
SimHash 686043620423

Groups

*

Rule Path
Allow /
Disallow /trieste/cronaca/2023/09/03/news/pedinata_violentata_ascensore_trieste_arrestato-13024236/
Disallow /trieste/cronaca/2020/09/22/news/ubriaco-al-volante-aggredisce-i-carabinieri-e-viene-arrestato-1.39338366
Disallow /trieste/cronaca/2017/11/14/news/camion_fantasma_nella_centrale_a2a3883495/
Disallow /trieste/cronaca/2017/10/06/news/rifiuti_in_centrale_11_imputati_e_50_testi3874661/
Disallow /trieste/cronaca/2015/05/12/news/mazzette-in-provincia-per-le-assunzioni-1.11405951
Disallow /trieste/cronaca/2015/04/17/news/a2a_truffata_in_aula_a_ottobre-3669820/
Disallow /trieste/cronaca/2014/04/30/news/permessi-di-soggiorno-illegali-condannato-ex-commercialista-1.9133372
Disallow /trieste/cronaca/2013/10/19/news/beffatelecamere_a_cormons_nuovo_slittamento-3472937/amp/
Disallow /trieste/cronaca/2013/02/16/news/tenta-di-rubare-alla-caritas-arrestato-1.6547316
Disallow /trieste/cronaca/2012/12/04/news/cavana-prostitute-al-lavoro-in-un-residence-1.6142510
Disallow /trieste/cronaca/2012/11/24/news/immigrazione-clandestina-sgominata-una-banda-1.6085339
Disallow /trieste/cronaca/2012/06/20/news/altri_guai_per_samuele_grassi_il_truffatore_con_il_postepay-3372506/amp/
Disallow /trieste/cronaca/2012/06/20/news/altri_guai_per_samuele_grassi_il_truffatore_con_il_postepay-3372506/
Disallow /trieste/cronaca/2012/04/19/news/dopo_la_truffa_colpaccio_al_casino-3360225/amp/
Disallow /trieste/cronaca/2012/04/18/news/ex_goleador_truffa_edicolante_per_490_euro-3360035/
Disallow /trieste/cronaca/2012/02/24/news/gorizia_abusi_su_una_bimba_in_carcere_ex_assessore-3349548/
Disallow /trieste/cronaca/2011/11/30/news/scarcerati_i_due_analisti_di_san_dorligo-3334774/
Disallow /trieste/cronaca/2011/11/12/news/a2a_una_sola_mela_marcia_in_centrale-3331308/
Disallow /trieste/cronaca/2011/11/09/news/subito_sospeso_da_a2a_il_dipendente_infedele-3330688/
Disallow /trieste/cronaca/2011/11/09/news/maxitruffa_sui_rifiuti_8_arresti-3330675/
Disallow /trieste/cronaca/2011/09/27/news/rubavano-rame-in-cimitero-arrestati-1.839087
Disallow /trieste/cronaca/2010/10/21/news/circolo-miani-spese-personalicon-soldi-pubblici-fogar-nel-mirino-1.18289
Disallow /stampa-articolo/
Disallow /ricerca?query=
Disallow /italia/2024/04/16/news/vendeva_auto_non_sue_e_non_le_consegnava_arrestato-14228342/
Disallow /italia/2024/04/16/news/vendeva_auto_non_sue_e_non_le_consegnava_arrestato-14228342
Disallow /dettaglio/*?edizione=
Disallow /dettaglio-news/*?edizione=
Disallow /cultura-e-spettacoli/2024/05/13/news/elio_e_tenores_di_neoneli_sodalizio_trentennale_diventa_un_film-14299980/
Disallow /blaize/datalayer

googlebot-news

Rule Path
Disallow /tecnologia/
Disallow /salute/
Disallow /moda-e-beauty/
Disallow /la-zampa/
Disallow /il-gusto/
Disallow /green-and-blue/

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://ilpiccolo.gelocal.it/sitemap-n.xml