quotidianodipuglia.it
robots.txt

Robots Exclusion Standard data for quotidianodipuglia.it

Resource Scan

Scan Details

Site Domain quotidianodipuglia.it
Base Domain quotidianodipuglia.it
Scan Status Ok
Last Scan2024-11-09T05:41:22+00:00
Next Scan 2024-11-16T05:41:22+00:00

Last Scan

Scanned2024-11-09T05:41:22+00:00
URL https://quotidianodipuglia.it/robots.txt
Redirect https://www.quotidianodipuglia.it/robots.txt
Redirect Domain www.quotidianodipuglia.it
Redirect Base quotidianodipuglia.it
Domain IPs 34.149.236.87
Redirect IPs 34.149.236.87
Response IP 34.149.236.87
Found Yes
Hash 936b173cb47f687535e4db5f013470832320718e7fa0aeafd3cfbf3f4d2fcf29
SimHash 030c786cb213

Groups

*

Rule Path
Disallow /cache/
Disallow /includes/
Disallow /query_cache/
Disallow /cgi-bin/
Disallow *sez%3DJSON*
Disallow *sez%3DAJAX*
Disallow /view.php
Disallow /view
Disallow /view.php
Disallow /tag.php
Disallow /ELEZIONI2014
Disallow /ANSAviewnews2.php
Disallow /articoloins.php
Disallow /articolo_app.php
Disallow /stampa_articolo.php
Disallow /tag.php
Disallow /fotogallery.php
Disallow /video.php
Disallow /dump_database.php?db=all
Disallow /admin_login.php
Disallow *?p=sondaggio*
Disallow *?p=sondaggio
Disallow *?p=print*
Disallow /casa/*
Disallow /CASA/*
Disallow /flashnews/*
Disallow /video.php*
Disallow /sondaggio.php*
Disallow /aprifoto.php*
Disallow /articoloins_app.php*
Disallow /stampa_articoloins.php*
Disallow /twitter_share.php*
Disallow /specialemondiali.php*
Disallow /box_ajax_pl.php*
Disallow /mobile/
Disallow /docs/
Disallow /posta.php
Disallow /meetic.php
Disallow /boxnewspl.php
Disallow /home_blog.php
Disallow /articoloins.php
Disallow /commenti.php
Disallow /aprifoto.php*
Disallow /specialemondiali.php
Disallow /sondaggionew.php
Disallow /chisiamo.php
Disallow /dilloalmessaggero.php
Disallow /articoloins.php
Disallow /contatti.php
Disallow /contatti
Disallow /ricerca_arc.php*
Disallow /tag/*
Disallow /leggitutte*
Disallow /include/*
Disallow /home_page*
Disallow /mobile/
Disallow /tetractis/*
Disallow /sport/messaggero/*
Disallow /registrazione.html
Disallow /?p=leggitutte*
Disallow /index.php?p=leggitutte*
Disallow /?p=single_module*
Disallow /index.php?p=search*
Disallow /index.php?p=single_module*
Disallow /index.php?p=single_module_owl*
Disallow /index.php?p=print*
Disallow /*.shtml%20$
Disallow /38681514/
Disallow /home_*
Disallow /index.php?p=search*
Disallow /?p=search*
Disallow /ricerca/*
Disallow /native_*
Disallow /speciale_eni-joule.html
Disallow /ultimissime_adn/*
Disallow /index.php/*
Disallow /t/*/0*
Disallow /t/*/1*
Disallow /t/*/2*
Disallow /t/*/3*
Disallow /t/*/4*
Disallow /t/*/5*
Disallow /t/*/6*
Disallow /t/*/7*
Disallow /t/*/8*
Disallow /t/*/9*
Disallow /t/*/a*
Disallow /t/*/b*
Disallow /t/*/c*
Disallow /t/*/d*
Disallow /t/*/e*
Disallow /t/*/f*
Disallow /t/*/g*
Disallow /t/*/h*
Disallow /t/*/i*
Disallow /t/*/j*
Disallow /t/*/k*
Disallow /t/*/l*
Disallow /t/*/m*
Disallow /t/*/n*
Disallow /t/*/o*
Disallow /t/*/p*
Disallow /t/*/q*
Disallow /t/*/r*
Disallow /t/*/s*
Disallow /t/*/t*
Disallow /t/*/u*
Disallow /t/*/v*
Disallow /t/*/w*
Disallow /t/*/x*
Disallow /t/*/y*
Disallow /t/*/z*
Disallow /t/*/A*
Disallow /t/*/B*
Disallow /t/*/C*
Disallow /t/*/D*
Disallow /t/*/E*
Disallow /t/*/F*
Disallow /t/*/G*
Disallow /t/*/H*
Disallow /t/*/I*
Disallow /t/*/J*
Disallow /t/*/K*
Disallow /t/*/L*
Disallow /t/*/M*
Disallow /t/*/N*
Disallow /t/*/O*
Disallow /t/*/P*
Disallow /t/*/Q*
Disallow /t/*/R*
Disallow /t/*/S*
Disallow /t/*/T*
Disallow /t/*/U*
Disallow /t/*/V*
Disallow /t/*/W*
Disallow /t/*/X*
Disallow /t/*/Y*
Disallow /t/*/Z*
Disallow /t/*/?*
Disallow /t/*/%*
Disallow /*track_shop_event.php*
Disallow /monitor.php
Disallow /video/askanews/*
Disallow /video/adnkronos/*
Disallow *?p=informazioni_legali
Disallow /ricerca/*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

seekr

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.quotidianodipuglia.it/?sez=XML&p=MapNews