linternaute.com
robots.txt

Robots Exclusion Standard data for linternaute.com

Resource Scan

Scan Details

Site Domain linternaute.com
Base Domain linternaute.com
Scan Status Ok
Last Scan2024-09-22T16:04:02+00:00
Next Scan 2024-09-29T16:04:02+00:00

Last Scan

Scanned2024-09-22T16:04:02+00:00
URL https://linternaute.com/robots.txt
Redirect http://www.linternaute.com/robots.txt
Redirect Domain www.linternaute.com
Redirect Base linternaute.com
Domain IPs 194.169.240.7, 195.248.251.103
Redirect IPs 23.50.81.198
Response IP 104.69.44.107
Found Yes
Hash 460839424f9e21950ed0dcd610055c0ec796e1ad0af39853c1992cd9ca4d972e
SimHash 6086c36879e1

Groups

mediapartners-google*

Rule Path
Disallow

*

Rule Path
Disallow /*f_impression
Disallow /*?print
Disallow /*zone%3Dprinter
Disallow /*f_imprimer
Disallow */imprimer/
Disallow /*f_zone%3Dimpression
Disallow */imprimer.php
Disallow /*xhr
Disallow /recherche/
Disallow /biographie/*/impression/
Disallow /voyage/*/magazine/
Disallow /musique/albums/
Disallow /cartes/*f_destinataire_id_personne
Disallow /cartes/abus/
Disallow /cartes/cartepostale/
Disallow /cartes/envoi_ok/
Disallow /cartes/cgi/carte_personnalisee/
Disallow /femmes/cartes/abus/
Disallow /femmes/cartes/cartepostale/
Disallow /femmes/cartes/envoi_ok/
Disallow /femmes/cartes/cgi/carte_personnalisee/
Disallow /histoire/cgi/
Disallow /voyage/moncompte/
Disallow /ville/recherche-bureau-vote/
Disallow /cinema/mes-films/
Disallow /cinema/cgi/avis/depose_avis.php
Disallow /histoire/cgi/evenement/
Disallow /histoire/cgi/mail/
Disallow /actualite/depeche/impression/
Disallow /404/
Disallow /aidememoire/
Disallow /alerte_mail/
Disallow /bin/
Disallow /boutique/
Disallow /cgi-bin/
Disallow /coldroite/
Disallow /communiq/
Disallow /communiquer/
Disallow /concours/
Disallow /dcforum/
Disallow /dev/
Disallow /ecouter_voir/
Disallow /ericson/
Disallow /etc/
Disallow /formations/
Disallow /forums/
Disallow /galerie/
Disallow /htdig/
Disallow /html/
Disallow /html_externe/
Disallow /htmlexterne/
Disallow /images/
Disallow /include/
Disallow /internaute/
Disallow /intuition/
Disallow /itineraires/
Disallow /kelkoo/
Disallow /ksearch/
Disallow /lib/
Disallow /mailling/
Disallow /monmobile/
Disallow /newsletter/
Disallow /partenariat/
Disallow /pollit_files/
Disallow /poisson/
Disallow /programme/
Disallow /pub/
Disallow /publiredac/
Disallow /question/
Disallow /sauvegarde/
Disallow /sponsor/
Disallow /studio/
Disallow /style/
Disallow /surfer/
Disallow /top/
Disallow /tvmag/
Disallow /webcam/
Disallow /webpassion/
Disallow /webutile/
Disallow /auto/accident/*%2C
Disallow /auto/accident/*/*/*-
Disallow /cinema/*/libelledistribution/
Disallow /restaurant/avis/
Disallow /restaurant/cgi/
Disallow /restaurant/avis_depose_par/
Disallow /restaurant/cgi/avis/avis_depose.php
Disallow /restaurant/liste/*/*-*/
Disallow /ville/avis/*/*/*/
Disallow /voyage2/
Disallow /voyage/*/hotel/*order%3D
Disallow /voyage/*/hotel/*date_check
Disallow /voyage/*/hotel/*id_fiche%3D
Disallow /voyage/climat/*%2C
Disallow /petites-annonces/
Disallow /wifi/fiche/*/ville/
Disallow /pratique/informatique/logiciels/windows-xp/1630/
Disallow /*/temoignage/reagir/
Disallow /expression/cgi/recherche/recherche.php*f_terme%3D
Disallow /actualite/depeches/
Allow /actualite/depeches/$

trendkite-akashic-crawler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

Comments

  • linternaute.com
  • Block https://opensiteexplorer.org/dotbot
  • Block http://ahrefs.com/robot/
  • Block https://dataforseo.com/dataforseo-bot
  • Block https://www.semrush.com/bot/