leparisien.fr
robots.txt

Robots Exclusion Standard data for leparisien.fr

Resource Scan

Scan Details

Site Domain leparisien.fr
Base Domain leparisien.fr
Scan Status Ok
Last Scan 2024-11-15T09:53:20+00:00
Next Scan 2024-11-22T09:53:20+00:00
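
The two timestamps above are exactly seven days apart. A minimal sketch of that scheduling arithmetic, assuming a fixed weekly re-scan interval (inferred from the timestamps, not stated by the report; `RESCAN_INTERVAL` and `next_scan` are hypothetical names):

```python
from datetime import datetime, timedelta

# Assumption: the scanner re-checks once a week. This is inferred from the
# Last Scan / Next Scan values, not documented anywhere in the report.
RESCAN_INTERVAL = timedelta(days=7)


def next_scan(last_scan_iso: str) -> str:
    """Return the ISO 8601 timestamp of the next scheduled scan."""
    last_scan = datetime.fromisoformat(last_scan_iso)
    return (last_scan + RESCAN_INTERVAL).isoformat()


print(next_scan("2024-11-15T09:53:20+00:00"))  # 2024-11-22T09:53:20+00:00
```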

Last Scan

Scanned 2024-11-15T09:53:20+00:00
URL https://leparisien.fr/robots.txt
Redirect https://www.leparisien.fr:443/robots.txt
Redirect Domain www.leparisien.fr
Redirect Base leparisien.fr
Domain IPs 35.71.153.23, 52.223.41.196
Redirect IPs 23.52.171.130, 23.52.171.144, 2600:1413:b000:13::b857:c188, 2600:1413:b000:13::b857:c189
Response IP 23.45.207.170
Found Yes
Hash 48949ed8f9d0b1b79b8a3cd47453ba32379403df7c5fcd37a596b2cfdb0a6d0c
SimHash 751a52a00d06
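
The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm, so the following is only a sketch under the assumption that the raw response body is hashed with SHA-256; `body_fingerprint` and `has_changed` are hypothetical helpers.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Hex digest of the raw robots.txt bytes (assumed algorithm: SHA-256)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """True when the fetched body no longer matches the stored fingerprint."""
    return body_fingerprint(body) != previous_hash.lower()
```

Comparing the stored value with a fresh digest is one way a scanner could decide whether the file needs to be re-parsed.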

Groups

*
*

Rule Path
Disallow /pf/resources/dist/fonts/*
Disallow /pf/api/
Disallow /pf/resources/dist/images/*
Disallow /pf/resources/plugins/*
Disallow /codes-promo/redirect/*
Disallow /recherche/
Disallow /commentaire/
Disallow /annuaire-mairie-telephone.php
Disallow /*.swf$
Disallow /internals/
Disallow /blocs/pub/
Disallow /you/
Disallow /partager/
Disallow /diaporama-photos/
Disallow /leparisien-img/
Disallow /reactions/
Disallow /*/vote?
Disallow /article/comments/*
Disallow /espace-securise/shield/
Disallow */resultats/bac/diplome/*
Disallow */resultats/bts/diplome/*
Disallow */resultats/bac-pro/diplome/*
Disallow */resultats/bac-techno/diplome/*
Disallow */resultats/bp/diplome/*
Disallow */resultats/cap/diplome/*
Disallow */resultats/brevet/diplome/*
Disallow */resultats/mc-4/diplome/*
Disallow */resultats/mc-5/diplome/*
Disallow */resultats/dcs/diplome/*
Disallow /2018/01/0*/
Disallow /2018/02/0*/
Disallow /2018/03/0*/
Disallow /2018/04/0*/
Disallow /2018/05/0*/
Disallow /2018/06/0*/
Disallow /2018/07/0*/
Disallow /2018/08/0*/
Disallow /2018/09/0*/
Disallow /2018/10/0*/
Disallow /2018/11/0*/
Disallow /2018/12/0*/
Disallow /2019/01/0*/
Disallow /2019/02/0*/
Disallow /2019/03/0*/
Disallow /2019/04/0*/
Disallow /2019/05/0*/
Disallow /2019/06/0*/
Disallow /2019/07/0*/
Disallow /2019/08/0*/
Disallow /2019/09/0*/
Disallow /2019/10/0*/
Disallow /2019/11/0*/
Disallow /2019/12/0*/
Disallow /2020/01/0*/
Disallow /2020/02/0*/
Disallow /2020/03/0*/
Disallow /2020/04/0*/
Disallow /2020/05/0*/
Disallow /2020/06/0*/
Disallow /2020/07/0*/
Disallow /2020/08/0*/
Disallow /2020/09/0*/
Disallow /2020/10/0*/
Disallow /2020/11/0*/
Disallow /2020/12/0*/
Disallow /2018/01/1*/
Disallow /2018/02/1*/
Disallow /2018/03/1*/
Disallow /2018/04/1*/
Disallow /2018/05/1*/
Disallow /2018/06/1*/
Disallow /2018/07/1*/
Disallow /2018/08/1*/
Disallow /2018/09/1*/
Disallow /2018/10/1*/
Disallow /2018/11/1*/
Disallow /2018/12/1*/
Disallow /2019/01/1*/
Disallow /2019/02/1*/
Disallow /2019/03/1*/
Disallow /2019/04/1*/
Disallow /2019/05/1*/
Disallow /2019/06/1*/
Disallow /2019/07/1*/
Disallow /2019/08/1*/
Disallow /2019/09/1*/
Disallow /2019/10/1*/
Disallow /2019/11/1*/
Disallow /2019/12/1*/
Disallow /2020/01/1*/
Disallow /2020/02/1*/
Disallow /2020/03/1*/
Disallow /2020/04/1*/
Disallow /2020/05/1*/
Disallow /2020/06/1*/
Disallow /2020/07/1*/
Disallow /2020/08/1*/
Disallow /2020/09/1*/
Disallow /2020/10/1*/
Disallow /2020/11/1*/
Disallow /2020/12/1*/
Disallow /2018/01/2*/
Disallow /2018/02/2*/
Disallow /2018/03/2*/
Disallow /2018/04/2*/
Disallow /2018/05/2*/
Disallow /2018/06/2*/
Disallow /2018/07/2*/
Disallow /2018/08/2*/
Disallow /2018/09/2*/
Disallow /2018/10/2*/
Disallow /2018/11/2*/
Disallow /2018/12/2*/
Disallow /2019/01/2*/
Disallow /2019/02/2*/
Disallow /2019/03/2*/
Disallow /2019/04/2*/
Disallow /2019/05/2*/
Disallow /2019/06/2*/
Disallow /2019/07/2*/
Disallow /2019/08/2*/
Disallow /2019/09/2*/
Disallow /2019/10/2*/
Disallow /2019/11/2*/
Disallow /2019/12/2*/
Disallow /2020/01/2*/
Disallow /2020/02/2*/
Disallow /2020/03/2*/
Disallow /2020/04/2*/
Disallow /2020/05/2*/
Disallow /2020/06/2*/
Disallow /2020/07/2*/
Disallow /2020/08/2*/
Disallow /2020/09/2*/
Disallow /2020/10/2*/
Disallow /2020/11/2*/
Disallow /2020/12/2*/
Disallow /2018/01/3*/
Disallow /2018/02/3*/
Disallow /2018/03/3*/
Disallow /2018/04/3*/
Disallow /2018/05/3*/
Disallow /2018/06/3*/
Disallow /2018/07/3*/
Disallow /2018/08/3*/
Disallow /2018/09/3*/
Disallow /2018/10/3*/
Disallow /2018/11/3*/
Disallow /2018/12/3*/
Disallow /2019/01/3*/
Disallow /2019/02/3*/
Disallow /2019/03/3*/
Disallow /2019/04/3*/
Disallow /2019/05/3*/
Disallow /2019/06/3*/
Disallow /2019/07/3*/
Disallow /2019/08/3*/
Disallow /2019/09/3*/
Disallow /2019/10/3*/
Disallow /2019/11/3*/
Disallow /2019/12/3*/
Disallow /2020/01/3*/
Disallow /2020/02/3*/
Disallow /2020/03/3*/
Disallow /2020/04/3*/
Disallow /2020/05/3*/
Disallow /2020/06/3*/
Disallow /2020/07/3*/
Disallow /2020/08/3*/
Disallow /2020/09/3*/
Disallow /2020/10/3*/
Disallow /2020/11/3*/
Disallow /2020/12/3*/
Allow *.jpg$
Allow *.jpeg$
Disallow /widgets/*
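
The rules above rely on the `*` wildcard and the `$` end anchor. A minimal sketch of how such patterns could be evaluated, assuming the longest-matching-pattern precedence described in RFC 9309 with Allow winning ties; `pattern_to_regex`, `is_allowed`, and the `RULES` subset are illustrative names, not part of the scanned file, and real crawlers may treat patterns that start with `*` differently.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # '*' matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules) -> bool:
    """Apply the longest matching rule; a path with no match is allowed."""
    best_len, allowed = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            longer = len(pattern) > best_len
            tie_allow = len(pattern) == best_len and kind == "allow"
            if longer or tie_allow:
                best_len, allowed = len(pattern), kind == "allow"
    return allowed


# A small subset of the rules listed above.
RULES = [
    ("disallow", "/recherche/"),
    ("disallow", "/*.swf$"),
    ("disallow", "/*/vote?"),
    ("disallow", "/2019/05/1*/"),
    ("allow", "*.jpg$"),
]

print(is_allowed("/photo.jpg", RULES))             # True
print(is_allowed("/recherche/paris", RULES))       # False
print(is_allowed("/2019/05/12/story.jpg", RULES))  # False
```

Under this precedence the dated-archive rule outranks `Allow *.jpg$` for the last path because its pattern is longer.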

googlebot-news
googlebot-image

Rule Path
Allow /images/

mediapartners-google

Rule Path
Allow /

flipboard

Rule Path
Allow /

flipboardproxy

Rule Path
Allow /

adequat

Rule Path
Disallow /

adequat-systems

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

alvinetspider

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

amisoftware

Rule Path
Disallow /

antenne hatena

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

apocalxexplorerbot

Rule Path
Disallow /

argus

Rule Path
Disallow /

ask n read

Rule Path
Disallow /

asknread.com

Rule Path
Disallow /

asterias

Rule Path
Disallow /

augure

Rule Path
Disallow /

augure

Rule Path
Disallow /

auramundi

Rule Path
Disallow /

backdoorbot/1.0

Rule Path
Disallow /

bizinformation

Rule Path
Disallow /

black hole

Rule Path
Disallow /

bloodhound

Rule Path
Disallow /

bloomberg

Rule Path
Disallow /

blowfish/1.0

Rule Path
Disallow /

botalot

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

bullseye/1.0

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

cegbfeieh

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

cherrypickerelite/1.0

Rule Path
Disallow /

cherrypickerse/1.0

Rule Path
Disallow /

cision

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

coexel

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

corporama

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

crescent

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

cydralspider

Rule Path
Disallow /

digimind

Rule Path
Disallow /

disco pump 3.1

Rule Path
Disallow /

dittospyder

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

downloadexpress

Rule Path
Disallow /

edd

Rule Path
Disallow /

ellisphere

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

europresse

Rule Path
Disallow /

explore

Rule Path
Disallow /

eureka

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

factiva

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

fetch

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

foobot

Rule Path
Disallow /

gammaspider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

harvest/1.5

Rule Path
Disallow /

hloader

Rule Path
Disallow /

httplib

Rule Path
Disallow /

httrack

Rule Path
Disallow /

httrack 3.0

Rule Path
Disallow /

humanlinks

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

igentia

Rule Path
Disallow /

indexer

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

infoseek

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

kantar

Rule Path
Disallow /

kbcrawl

Rule Path
Disallow /

kenjin spider

Rule Path
Disallow /

knowings

Rule Path
Disallow /

larbin

Rule Path
Disallow /

leadbox

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

libweb/clshttp

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linko

Rule Path
Disallow /

linkscan/8.1a unix

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

lwp-trivial/1.34

Rule Path
Disallow /

manageo

Rule Path
Disallow /

mata hari

Rule Path
Disallow /

mediacompil

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

mention

Rule Path
Disallow /

microsoft url control - 5.01.4511

Rule Path
Disallow /

microsoft url control - 6.00.8169

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

miixpc/4.2

Rule Path
Disallow /

mister pix

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

moget

Rule Path
Disallow /

moget/2.1

Rule Path
Disallow /

moreover

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

ms search 4.0 robot

Rule Path
Disallow /

ms search 5.0 robot

Rule Path
Disallow /

mytwip

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

netants

Rule Path
Disallow /

netattache

Rule Path
Disallow /

netmechanic

Rule Path
Disallow /

newscan-online

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

npbot

Rule Path
Disallow /

objectssearch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

openfind

Rule Path
Disallow /

openindexspider

Rule Path
Disallow /

opinion-tracker

Rule Path
Disallow /

pimptrain

Rule Path
Disallow /

proxem

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

psbot

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

qwam content intelligence

Rule Path
Disallow /

raven

Rule Path
Disallow /

readability.com

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

rma

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

score3

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

sightupbot

Rule Path
Disallow /

sindup

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spankbot

Rule Path
Disallow /

spanner

Rule Path
Disallow /

speedy

Rule Path
Disallow /

spotter

Rule Path
Disallow /

suggybot

Rule Path
Disallow /

superbot

Rule Path
Disallow /

superbot/2.6

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

talkwalker

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

the intraformant

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

tighttwatbot

Rule Path
Disallow /

titan

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

toscrawler

Rule Path
Disallow /

trendeo

Rule Path
Disallow /

trendybuzz

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

true_robot/1.0

Rule Path
Disallow /

tunitinbot

Rule Path
Disallow /

turingos

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

up2news

Rule Path
Disallow /

urlpouls

Rule Path
Disallow /

urly warning

Rule Path
Disallow /

vecteurplus

Rule Path
Disallow /

verif

Rule Path
Disallow /

verticalsearch

Rule Path
Disallow /

vci

Rule Path
Disallow /

vsw

Rule Path
Disallow /

wapspider

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

web image collector

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webbandit/3.50

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webcopy

Rule Path
Disallow /

webedia

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

webmirror

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website extractor

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

website quester

Rule Path
Disallow /

webster pro

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webstripper/2.02

Rule Path
Disallow /

webzinger

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

wikiofeedbot

Rule Path
Disallow /

winello

Rule Path
Disallow /

winhttrack

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

xenu link sleuth/1.3.8

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yandex

Rule Path
Disallow /

youmag

Rule Path
Disallow /

yrspider

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zite

Rule Path
Disallow /

zookabot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

claudebot

Rule Path
Disallow /
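
Each group above applies only to the crawlers it names; any other crawler falls back to the `*` group. A minimal sketch of that selection step, assuming a case-insensitive exact match on the product token; `select_group` and the `GROUPS` sample are hypothetical and cover only a few of the groups listed.

```python
def select_group(user_agent_token: str, groups):
    """Return the rules of the group naming this crawler, else the '*' group."""
    token = user_agent_token.lower()
    for agents, rules in groups:
        if token in (agent.lower() for agent in agents):
            return rules
    for agents, rules in groups:
        if "*" in agents:
            return rules
    return []  # no applicable group: nothing is restricted


# Sample of the groups in this report: (user agents, rules).
GROUPS = [
    (["*"], [("disallow", "/recherche/")]),
    (["googlebot-news", "googlebot-image"], [("allow", "/images/")]),
    (["gptbot"], [("disallow", "/")]),
    (["claudebot"], [("disallow", "/")]),
]

print(select_group("ClaudeBot", GROUPS))     # [('disallow', '/')]
print(select_group("examplebot", GROUPS))    # [('disallow', '/recherche/')]
```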

Other Records

Field Value
sitemap https://www.leparisien.fr/arc/outboundfeeds/news-sitemap/?from=0&outputType=xml&_website=leparisien
sitemap https://www.leparisien.fr/arc/outboundfeeds/news-sitemap-index/?from=0&outputType=xml&_website=leparisien
sitemap https://www.leparisien.fr/arc/outboundfeeds/news-sitemap-index/?from=0&outputType=xml&_website=leparisien
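
The sitemap index URL appears twice in the records above. A minimal sketch of collecting `Sitemap` records from raw robots.txt lines while dropping repeats, assuming the usual `Sitemap: <url>` line form; `unique_sitemaps` is a hypothetical helper.

```python
def unique_sitemaps(lines):
    """Return sitemap URLs in first-seen order, without duplicates."""
    seen = set()
    result = []
    for line in lines:
        # Split on the first ':' only, so the URL's own colons are preserved.
        field, _, value = line.partition(":")
        if field.strip().lower() != "sitemap":
            continue
        url = value.strip()
        if url and url not in seen:
            seen.add(url)
            result.append(url)
    return result


BASE = "https://www.leparisien.fr/arc/outboundfeeds"
QUERY = "?from=0&outputType=xml&_website=leparisien"
lines = [
    f"Sitemap: {BASE}/news-sitemap/{QUERY}",
    f"Sitemap: {BASE}/news-sitemap-index/{QUERY}",
    f"Sitemap: {BASE}/news-sitemap-index/{QUERY}",
]
print(len(unique_sitemaps(lines)))  # 2
```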

Comments

  • Robots exclus (English: "Excluded robots")
  • (ASCII-art banner drawn with "@" characters; layout lost in extraction)

Warnings

  • 4 invalid lines.