corriereadriatico.it
robots.txt

Robots Exclusion Standard data for corriereadriatico.it

Resource Scan

Scan Details

Site Domain corriereadriatico.it
Base Domain corriereadriatico.it
Scan Status Ok
Last Scan2025-05-17T09:36:17+00:00
Next Scan 2025-05-24T09:36:17+00:00

Last Scan

Scanned2025-05-17T09:36:17+00:00
URL https://corriereadriatico.it/robots.txt
Redirect https://www.corriereadriatico.it/robots.txt
Redirect Domain www.corriereadriatico.it
Redirect Base corriereadriatico.it
Domain IPs 34.149.236.87
Redirect IPs 34.149.236.87
Response IP 34.149.236.87
Found Yes
Hash 818f028331d4d4f0f58007b00a8c8b5920883f71ce360ac8d38c8f8c35110d03
SimHash d3083b248119

Groups

*

Rule Path
Disallow /cache/
Disallow /includes/
Disallow /query_cache/
Disallow /cgi-bin/
Disallow *sez%3DJSON*
Disallow *sez%3DAJAX*
Disallow /view.php*
Disallow /view
Disallow /ELEZIONI2014
Disallow /ANSAviewnews2.php*
Disallow /ANSAviewnews.php*
Disallow /articolo.php*
Disallow /articoloins.php
Disallow /articolo_app.php*
Disallow /aprifoto.php*
Disallow /mobile/*
Disallow /sondaggio.php*
Disallow /tag.php*
Disallow /ricerca.php*
Disallow /fotogallery.php*
Disallow /video.php*
Disallow /sondaggio.php*
Disallow /*.aspx
Disallow *p%3Dflashnews*
Disallow /twitter_share.php
Disallow /box_tuttomercato/index_tm.php
Disallow /foto/*
Disallow /casa/*
Disallow /box_ajax*
Disallow /dump_database.php?db=all
Disallow /admin_login.php
Disallow /sicurezza_stradale*
Disallow /dettaglio.php*
Disallow /box_pl*
Disallow /diretta_europei.php*
Disallow /mobile/
Disallow /38681514/
Disallow /flashnews/
Disallow *p%3Dall_news*
Disallow *?p=search*
Disallow /native_*
Disallow /speciale_eni-joule.html
Disallow /ultimissime_adn/*
Disallow /u/*
Disallow /index.php/*
Disallow /sport/stats/*
Disallow /megapress/*
Disallow /?p=single_module*
Disallow /index.php?p=single_module*
Disallow /index.php?p=single_module_owl*
Disallow /*track_shop_event.php*
Disallow /track_shop_event.php*
Disallow /ricerca/*
Disallow /video/askanews/*
Disallow /video/adnkronos/*
Disallow *?p=informazioni_legali

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

seekr

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.corriereadriatico.it/?sez=XML&p=MapNews