amp.rfi.fr
robots.txt

Robots Exclusion Standard data for amp.rfi.fr

Resource Scan

Scan Details

Site Domain amp.rfi.fr
Base Domain rfi.fr
Scan Status Ok
Last Scan 2024-11-12T12:26:04+00:00
Next Scan 2024-11-19T12:26:04+00:00

Last Scan

Scanned 2024-11-12T12:26:04+00:00
URL https://amp.rfi.fr/robots.txt
Redirect https://www.rfi.fr/robots.txt
Redirect Domain www.rfi.fr
Redirect Base rfi.fr
Domain IPs 23.50.81.66, 2600:1413:b000:883::2bc9, 2600:1413:b000:8a0::2bc9
Redirect IPs 23.50.81.66, 2600:1413:b000:380::2bc9, 2600:1413:b000:389::2bc9
Response IP 173.223.89.66
Found Yes
Hash 90b757c60455b6597d68a8bbbfad928ebed9615f0d1531697662fee9435e9b8d
SimHash 721a52400403
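
The Hash field gives a cheap way to tell whether the file changed between scans. The report does not name the hash function. The 64-hex-digit value is consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body; the function name and the sample body are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return a hex digest of a robots.txt body (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file served by www.rfi.fr.
sample = b"User-agent: *\nDisallow:\n"
digest = body_digest(sample)

print(len(digest))  # 64 hex digits, the same length as the Hash field
# Comparing this digest with the one stored from the previous scan
# shows whether the body changed at all.
```

A digest like this changes completely on any one-byte edit, which may be why the report also lists a SimHash for near-duplicate comparison.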

Groups

*

Rule Path
Disallow

googlebot

Rule Path
Disallow */_ws/urgent
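
The googlebot rule above uses a `*` wildcard. Here is a minimal sketch of how such a pattern could be matched against a URL path, assuming the RFC 9309 semantics: `*` matches any sequence of characters and a trailing `$` anchors the end of the path. The function names and the sample paths are hypothetical and do not come from the scan.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def path_matches(pattern: str, path: str) -> bool:
    """True if the URL path is covered by the robots.txt pattern."""
    return pattern_to_regex(pattern).match(path) is not None


URGENT = "*/_ws/urgent"  # the googlebot Disallow path from the scan

# Hypothetical paths, chosen only to exercise the pattern.
print(path_matches(URGENT, "/fr/_ws/urgent"))   # True
print(path_matches(URGENT, "/_ws/urgent/123"))  # True, rules match by prefix
print(path_matches(URGENT, "/fr/afrique/"))     # False
```

Under these semantics the rule covers `/_ws/urgent` at any depth, because the leading `*` absorbs whatever precedes it.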

adequat

Rule Path
Disallow /

adequat-systems

Rule Path
Disallow /

admantx bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

alvinetspider

Rule Path
Disallow /

amisoftware

Rule Path
Disallow /

antenne hatena

Rule Path
Disallow /

apocalxexplorerbot

Rule Path
Disallow /

argus

Rule Path
Disallow /

ask n read

Rule Path
Disallow /

asknread.com

Rule Path
Disallow /

asterias

Rule Path
Disallow /

augure

Rule Path
Disallow /

augure

Rule Path
Disallow /

auramundi

Rule Path
Disallow /

babya discoverer

Rule Path
Disallow /

backdoorbot/1.0

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

bitlybot

Rule Path
Disallow /

bizinformation

Rule Path
Disallow /

black hole

Rule Path
Disallow /

blekkobot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bloodhound

Rule Path
Disallow /

blowfish/1.0

Rule Path
Disallow /

botalot

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

bullseye/1.0

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

cegbfeieh

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

cherrypickerelite/1.0

Rule Path
Disallow /

cherrypickerse/1.0

Rule Path
Disallow /

cision

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

coexel

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

corporama

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

crescent

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

cydralspider

Rule Path
Disallow /

digimind

Rule Path
Disallow /

disco pump 3.1

Rule Path
Disallow /

discobot

Rule Path
Disallow /

dittospyder

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotmic dotbot

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

downloadexpress

Rule Path
Disallow /

edd

Rule Path
Disallow /

ellisphere

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

eureka

Rule Path
Disallow /

europresse

Rule Path
Disallow /

exabot

Rule Path
Disallow /

explore

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

ezooms robot

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

fetch

Rule Path
Disallow /

firstrain

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

foobot

Rule Path
Disallow /

gammaspider

Rule Path
Disallow /

gnowitnewsbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

harvest/1.5

Rule Path
Disallow /

hloader

Rule Path
Disallow /

houzzbot

Rule Path
Disallow /

httplib

Rule Path
Disallow /

httrack

Rule Path
Disallow /

httrack 3.0

Rule Path
Disallow /

humanlinks

Rule Path
Disallow /

igentia

Rule Path
Disallow /

indexer

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

infoseek

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

kbcrawl

Rule Path
Disallow /

kenjin spider

Rule Path
Disallow /

knowings

Rule Path
Disallow /

lamarkbot

Rule Path
Disallow /

larbin

Rule Path
Disallow /

leadbox

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

libweb/clshttp

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

linko

Rule Path
Disallow /

linkscan/8.1a unix

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

lwp-trivial/1.34

Rule Path
Disallow /

manageo

Rule Path
Disallow /

mata hari

Rule Path
Disallow /

maxpointcrawler

Rule Path
Disallow /

mediacompil

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

meltawer

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

mention

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

miixpc/4.2

Rule Path
Disallow /

mister pix

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

moget

Rule Path
Disallow /

moget/2.1

Rule Path
Disallow /

moreover

Rule Path
Disallow /

ms search 4.0 robot

Rule Path
Disallow /

ms search 5.0 robot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

mytwip

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

netants

Rule Path
Disallow /

netattache

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

netmechanic

Rule Path
Disallow /

newscan-online

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

npbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

objectssearch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

openfind

Rule Path
Disallow /

openindexspider

Rule Path
Disallow /

opinion-tracker

Rule Path
Disallow /

perl lwp

Rule Path
Disallow /

pimptrain

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

proxem

Rule Path
Disallow /

psbot

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

qwam content intelligence

Rule Path
Disallow /

raven

Rule Path
Disallow /

readability.com

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

riddler

Rule Path
Disallow /

rma

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

score3

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sightupbot

Rule Path
Disallow /

sindup

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spankbot

Rule Path
Disallow /

spanner

Rule Path
Disallow /

speedy

Rule Path
Disallow /

spotter

Rule Path
Disallow /

sputnikbot

Rule Path
Disallow /

suggybot

Rule Path
Disallow /

superbot

Rule Path
Disallow /

superbot/2.6

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

talkwater

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

the intraformant

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

tighttwatbot

Rule Path
Disallow /

titan

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

toscrawler

Rule Path
Disallow /

trendeo

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

trendybuzz

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

true_robot/1.0

Rule Path
Disallow /

turingos

Rule Path
Disallow /

up2news

Rule Path
Disallow /

updownerbot

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

urlpouls

Rule Path
Disallow /

urly warning

Rule Path
Disallow /

vci

Rule Path
Disallow /

vecteurplus

Rule Path
Disallow /

verif

Rule Path
Disallow /

verticalsearch

Rule Path
Disallow /

vsw

Rule Path
Disallow /

wapspider

Rule Path
Disallow /

web image collector

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webbandit/3.50

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webcopy

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

webmirror

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website extractor

Rule Path
Disallow /

website quester

Rule Path
Disallow /

webster pro

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webstripper/2.02

Rule Path
Disallow /

webzinger

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

wget

Rule Path
Disallow /

wikiofeedbot

Rule Path
Disallow /

winello

Rule Path
Disallow /

winhttrack

Rule Path
Disallow /

wiseguys robot

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

xenu link sleuth/1.3.8

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

youmag

Rule Path
Disallow /

yrspider

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zite

Rule Path
Disallow /

zookabot

Rule Path
Disallow /

zyborg
criteobot/0.1

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

grapeshot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

ina dlweb

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

inkl

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

kantar

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

outbrain

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

proximic

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

weborama-fetcher

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

archive.org_bot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

baiduspider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

coccocbot-web

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

factiva

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

newsnow

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

petalbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

proximic

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

soso spider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

sogou spider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

turnitinbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

turnitin robot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

yandex

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

yandexbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

yeti

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10
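
The groups above can be checked programmatically. Here is a minimal sketch using Python's standard `urllib.robotparser` on a hand-written excerpt that mirrors three groups from the scan (`*`, `gptbot`, `yandex`) and one sitemap line. The excerpt is a reconstruction rather than the original file, and the page URL and the `examplebot` agent are hypothetical. The standard-library parser does plain prefix matching, so the googlebot wildcard rule is left out, and `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt; the real file lists many more groups.
EXCERPT = """\
User-agent: *
Disallow:

User-agent: gptbot
Disallow: /

User-agent: yandex
Allow: /
Crawl-delay: 10

Sitemap: https://www.rfi.fr/sitemaps/fr/index.xml
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())

PAGE = "https://www.rfi.fr/fr/"  # hypothetical page URL

print(parser.can_fetch("gptbot", PAGE))      # False: the group disallows /
print(parser.can_fetch("examplebot", PAGE))  # True: falls back to the * group
print(parser.crawl_delay("yandex"))          # 10
print(parser.site_maps())                    # ['https://www.rfi.fr/sitemaps/fr/index.xml']
```

An empty `Disallow` in the `*` group means nothing is blocked, so any agent without a group of its own may fetch every path.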

Other Records

Field Value
sitemap https://www.rfi.fr/sitemaps/fr/index.xml
sitemap https://www.rfi.fr/sitemaps/en/index.xml
sitemap https://www.rfi.fr/sitemaps/es/index.xml
sitemap https://www.rfi.fr/sitemaps/pt/index.xml
sitemap https://www.rfi.fr/sitemaps/br/index.xml
sitemap https://www.rfi.fr/sitemaps/cn/index.xml
sitemap https://www.rfi.fr/sitemaps/tw/index.xml
sitemap https://www.rfi.fr/sitemaps/km/index.xml
sitemap https://www.rfi.fr/sitemaps/ha/index.xml
sitemap https://www.rfi.fr/sitemaps/fa/index.xml
sitemap https://www.rfi.fr/sitemaps/sw/index.xml
sitemap https://www.rfi.fr/sitemaps/ma/index.xml
sitemap https://www.rfi.fr/sitemaps/ff/index.xml
sitemap https://www.rfi.fr/sitemaps/ro/index.xml
sitemap https://www.rfi.fr/sitemaps/ru/index.xml
sitemap https://www.rfi.fr/sitemaps/uk/index.xml
sitemap https://www.rfi.fr/sitemaps/vi/index.xml
sitemap https://www.rfi.fr/sitemaps/fr/news.xml
sitemap https://www.rfi.fr/sitemaps/en/news.xml
sitemap https://www.rfi.fr/sitemaps/es/news.xml
sitemap https://www.rfi.fr/sitemaps/pt/news.xml
sitemap https://www.rfi.fr/sitemaps/br/news.xml
sitemap https://www.rfi.fr/sitemaps/cn/news.xml
sitemap https://www.rfi.fr/sitemaps/tw/news.xml
sitemap https://www.rfi.fr/sitemaps/km/news.xml
sitemap https://www.rfi.fr/sitemaps/ha/news.xml
sitemap https://www.rfi.fr/sitemaps/fa/news.xml
sitemap https://www.rfi.fr/sitemaps/sw/news.xml
sitemap https://www.rfi.fr/sitemaps/ro/news.xml
sitemap https://www.rfi.fr/sitemaps/ru/news.xml
sitemap https://www.rfi.fr/sitemaps/uk/news.xml
sitemap https://www.rfi.fr/sitemaps/vi/news.xml
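
The sitemap records follow one URL shape, `/sitemaps/<code>/<kind>.xml`, so they can be grouped by kind to compare coverage. The sketch below rebuilds the listed URLs from the codes in the scan and reports which codes have an index sitemap but no news sitemap. Calling the path segment an "edition code" is an assumption, and the helper names are hypothetical.

```python
from urllib.parse import urlparse

# Codes copied from the sitemap records above, in listed order.
INDEX_CODES = "fr en es pt br cn tw km ha fa sw ma ff ro ru uk vi".split()
NEWS_CODES = "fr en es pt br cn tw km ha fa sw ro ru uk vi".split()

urls = [f"https://www.rfi.fr/sitemaps/{c}/index.xml" for c in INDEX_CODES]
urls += [f"https://www.rfi.fr/sitemaps/{c}/news.xml" for c in NEWS_CODES]


def split_sitemap(url: str) -> tuple:
    """Return (edition code, kind) for a /sitemaps/<code>/<kind>.xml URL."""
    _, code, filename = urlparse(url).path.strip("/").split("/")
    return code, filename.rsplit(".", 1)[0]


by_kind = {}
for url in urls:
    code, kind = split_sitemap(url)
    by_kind.setdefault(kind, set()).add(code)

missing_news = sorted(by_kind["index"] - by_kind["news"])
print(len(by_kind["index"]), len(by_kind["news"]))  # 17 15
print(missing_news)                                 # ['ff', 'ma']
```

In this scan, `ma` and `ff` are the only codes with an index sitemap and no news sitemap.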

Comments

  • France Medias Monde [2024-03-22] - francemediasmonde.com
  • RFI - rfi.fr - HTTPS
  • Sitemaps
  • News Sitemaps
  • General rules
  • Bots blocking
  • Partners
  • Too fast bots

Warnings

  • 6 invalid lines.