rfimusic.com
robots.txt

Robots Exclusion Standard data for rfimusic.com

Resource Scan

Scan Details

Site Domain rfimusic.com
Base Domain rfimusic.com
Scan Status Ok
Last Scan 2024-11-14T02:16:53+00:00
Next Scan 2024-12-14T02:16:53+00:00

Last Scan

Scanned 2024-11-14T02:16:53+00:00
URL http://rfimusic.com/robots.txt
Redirect http://musique.rfi.fr/robots.txt
Redirect Domain musique.rfi.fr
Redirect Base rfi.fr
Domain IPs 217.70.184.38
Redirect IPs 23.50.81.66, 2600:1413:b000:380::2bc9, 2600:1413:b000:389::2bc9
Response IP 23.50.81.66
Found Yes
Hash 63feb85487ba69caeff8bb5acaf2a03a0f89a9d70e164c67bcd96f365c3d5f5d
SimHash 329e5240c673
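
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not say which algorithm the scanner uses, so the following is only a sketch under that assumption. It shows how a content fingerprint of this kind can be computed to detect changes between scans. The function name `robots_fingerprint` and the sample body are hypothetical.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 is used here only because its digest length
    (64 hex characters) matches the Hash field in the report.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file content.
sample = b"User-agent: *\nDisallow: /admin/\n"
digest = robots_fingerprint(sample)

# Any change in the body, even one byte, yields a different digest.
changed = robots_fingerprint(sample + b"Disallow: /search/\n")
print(digest != changed)
```

Comparing the digest from one scan with the digest from the next is enough to tell whether the file changed. It does not say how much it changed, which is the kind of question a similarity hash such as the SimHash field is usually meant to answer.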

Groups

*

Rule Path
Allow /misc/*.css$
Allow /misc/*.css?
Allow /misc/*.js$
Allow /misc/*.js?
Allow /misc/*.gif
Allow /misc/*.jpg
Allow /misc/*.jpeg
Allow /misc/*.png
Allow /modules/*.css$
Allow /modules/*.css?
Allow /modules/*.js$
Allow /modules/*.js?
Allow /modules/*.gif
Allow /modules/*.jpg
Allow /modules/*.jpeg
Allow /modules/*.png
Allow /profiles/*.css$
Allow /profiles/*.css?
Allow /profiles/*.js$
Allow /profiles/*.js?
Allow /profiles/*.gif
Allow /profiles/*.jpg
Allow /profiles/*.jpeg
Allow /profiles/*.png
Allow /themes/*.css$
Allow /themes/*.css?
Allow /themes/*.js$
Allow /themes/*.js?
Allow /themes/*.gif
Allow /themes/*.jpg
Allow /themes/*.jpeg
Allow /themes/*.png
Disallow /includes/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /scripts/
Disallow /themes/
Disallow /CHANGELOG.txt
Disallow /cron.php
Disallow /INSTALL.mysql.txt
Disallow /INSTALL.pgsql.txt
Disallow /INSTALL.sqlite.txt
Disallow /install.php
Disallow /INSTALL.txt
Disallow /LICENSE.txt
Disallow /MAINTAINERS.txt
Disallow /update.php
Disallow /UPGRADE.txt
Disallow /xmlrpc.php
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /node/add/
Disallow /search/
Disallow /user/register/
Disallow /user/password/
Disallow /user/login/
Disallow /user/logout/
Disallow /?q=admin%2F
Disallow /?q=comment%2Freply%2F
Disallow /?q=filter%2Ftips%2F
Disallow /?q=node%2Fadd%2F
Disallow /?q=search%2F
Disallow /?q=user%2Fpassword%2F
Disallow /?q=user%2Fregister%2F
Disallow /?q=user%2Flogin%2F
Disallow /?q=user%2Flogout%2F

Other Records

Field Value
crawl-delay 10
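
The `*` group above mixes Allow patterns containing `*` and `$` with plain Disallow prefixes for the same directories. The sketch below illustrates how such rules are commonly resolved: the longest matching pattern wins, and Allow wins a tie, as described in RFC 9309. It is a minimal illustration, not the scanner's or any crawler's actual implementation. The names `pattern_to_regex`, `is_allowed`, and `RULES` are hypothetical, and `RULES` holds only a small subset of the rules listed above.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Only two characters are special: '*' matches any run of characters
    and a trailing '$' anchors the pattern at the end of the path.
    Everything else, including '?', is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(rules, path: str) -> bool:
    """Apply longest-match precedence; Allow wins when lengths are equal."""
    best_len = -1
    allowed = True  # no matching rule means the path may be fetched
    for verdict, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and verdict == "allow"):
                best_len = length
                allowed = verdict == "allow"
    return allowed


# Subset of the '*' group from the listing above.
RULES = [
    ("allow", "/misc/*.css$"),
    ("allow", "/misc/*.css?"),
    ("allow", "/misc/*.png"),
    ("disallow", "/misc/"),
    ("disallow", "/admin/"),
    ("disallow", "/?q=admin%2F"),
]

for candidate in ("/misc/style.css", "/misc/style.css?v=2",
                  "/misc/readme.txt", "/admin/config", "/news/article"):
    print(candidate, is_allowed(RULES, candidate))
```

Under these rules a stylesheet under `/misc/` stays fetchable because the Allow pattern is longer than the `/misc/` Disallow prefix, while other files in that directory remain blocked.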

adequat

Rule Path
Disallow /

adequat-systems

Rule Path
Disallow /

admantx bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

alvinetspider

Rule Path
Disallow /

amisoftware

Rule Path
Disallow /

antenne hatena

Rule Path
Disallow /

apocalxexplorerbot

Rule Path
Disallow /

argus

Rule Path
Disallow /

ask n read

Rule Path
Disallow /

asknread.com

Rule Path
Disallow /

asterias

Rule Path
Disallow /

augure

Rule Path
Disallow /

augure

Rule Path
Disallow /

auramundi

Rule Path
Disallow /

babya discoverer

Rule Path
Disallow /

backdoorbot/1.0

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

bitlybot

Rule Path
Disallow /

bizinformation

Rule Path
Disallow /

black hole

Rule Path
Disallow /

blekkobot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bloodhound

Rule Path
Disallow /

blowfish/1.0

Rule Path
Disallow /

botalot

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

bullseye/1.0

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

cegbfeieh

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

cherrypickerelite/1.0

Rule Path
Disallow /

cherrypickerse/1.0

Rule Path
Disallow /

cision

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

coexel

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

corporama

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

crescent

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

cydralspider

Rule Path
Disallow /

digimind

Rule Path
Disallow /

disco pump 3.1

Rule Path
Disallow /

discobot

Rule Path
Disallow /

dittospyder

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotmic dotbot

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

downloadexpress

Rule Path
Disallow /

edd

Rule Path
Disallow /

ellisphere

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

eureka

Rule Path
Disallow /

europresse

Rule Path
Disallow /

exabot

Rule Path
Disallow /

explore

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

ezooms robot

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

fetch

Rule Path
Disallow /

firstrain

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

foobot

Rule Path
Disallow /

gammaspider

Rule Path
Disallow /

gnowitnewsbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

harvest/1.5

Rule Path
Disallow /

hloader

Rule Path
Disallow /

houzzbot

Rule Path
Disallow /

httplib

Rule Path
Disallow /

httrack

Rule Path
Disallow /

httrack 3.0

Rule Path
Disallow /

humanlinks

Rule Path
Disallow /

igentia

Rule Path
Disallow /

indexer

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

infoseek

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

kbcrawl

Rule Path
Disallow /

kenjin spider

Rule Path
Disallow /

knowings

Rule Path
Disallow /

lamarkbot

Rule Path
Disallow /

larbin

Rule Path
Disallow /

leadbox

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

libweb/clshttp

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

linko

Rule Path
Disallow /

linkscan/8.1a unix

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

lwp-trivial/1.34

Rule Path
Disallow /

manageo

Rule Path
Disallow /

mata hari

Rule Path
Disallow /

maxpointcrawler

Rule Path
Disallow /

mediacompil

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

meltawer

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

mention

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

miixpc/4.2

Rule Path
Disallow /

mister pix

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

moget

Rule Path
Disallow /

moget/2.1

Rule Path
Disallow /

moreover

Rule Path
Disallow /

ms search 4.0 robot

Rule Path
Disallow /

ms search 5.0 robot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

mytwip

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

netants

Rule Path
Disallow /

netattache

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

netmechanic

Rule Path
Disallow /

newscan-online

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

npbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

objectssearch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

openfind

Rule Path
Disallow /

openindexspider

Rule Path
Disallow /

opinion-tracker

Rule Path
Disallow /

perl lwp

Rule Path
Disallow /

pimptrain

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

proxem

Rule Path
Disallow /

psbot

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

qwam content intelligence

Rule Path
Disallow /

raven

Rule Path
Disallow /

readability.com

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

riddler

Rule Path
Disallow /

rma

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

score3

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sightupbot

Rule Path
Disallow /

sindup

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

sitesucker

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spankbot

Rule Path
Disallow /

spanner

Rule Path
Disallow /

speedy

Rule Path
Disallow /

spotter

Rule Path
Disallow /

sputnikbot

Rule Path
Disallow /

suggybot

Rule Path
Disallow /

superbot

Rule Path
Disallow /

superbot/2.6

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

talkwater

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

the intraformant

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

tighttwatbot

Rule Path
Disallow /

titan

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

toscrawler

Rule Path
Disallow /

trendeo

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

trendybuzz

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

true_robot/1.0

Rule Path
Disallow /

turingos

Rule Path
Disallow /

up2news

Rule Path
Disallow /

updownerbot

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

urlpouls

Rule Path
Disallow /

urly warning

Rule Path
Disallow /

vci

Rule Path
Disallow /

vecteurplus

Rule Path
Disallow /

verif

Rule Path
Disallow /

verticalsearch

Rule Path
Disallow /

vsw

Rule Path
Disallow /

wapspider

Rule Path
Disallow /

web image collector

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webbandit/3.50

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webcopy

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

webmirror

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website extractor

Rule Path
Disallow /

website quester

Rule Path
Disallow /

webster pro

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webstripper/2.02

Rule Path
Disallow /

webzinger

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

wget

Rule Path
Disallow /

wikiofeedbot

Rule Path
Disallow /

winello

Rule Path
Disallow /

winhttrack

Rule Path
Disallow /

wiseguys robot

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

xenu link sleuth/1.3.8

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

youmag

Rule Path
Disallow /

yrspider

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zite

Rule Path
Disallow /

zookabot

Rule Path
Disallow /

zyborg
criteobot/0.1

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

grapeshot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

ina dlweb

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

inkl

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

kantar

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

outbrain

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

proximic

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

weborama-fetcher

Rule Path
Allow /

Other Records

Field Value
crawl-delay 5

archive.org_bot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

baiduspider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

coccocbot-web

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

factiva

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

petalbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

proximic

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

soso spider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

sogou spider

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

turnitinbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

turnitin robot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

yandex

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

yandexbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

yeti

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10
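
The groups above fall into three kinds: a default `*` group, user agents that are blocked entirely, and user agents that are allowed with a crawl-delay. The sketch below shows how a client might query such groups with Python's standard `urllib.robotparser`. The robots.txt text is a hypothetical three-group excerpt rebuilt from the listing, not the real file, and the user agent name `somebot` is made up. Note that `urllib.robotparser` matches paths by plain prefix and does not interpret `*` or `$` inside paths, so the excerpt uses prefix rules only.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt: one rule from the '*' group, one blocked agent,
# and one allowed agent with its own crawl-delay.
EXCERPT = """\
User-agent: *
Disallow: /admin/
Crawl-delay: 10

User-agent: gptbot
Disallow: /

User-agent: grapeshot
Allow: /
Crawl-delay: 5
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())

base = "http://musique.rfi.fr"

# A blocked agent may not fetch anything.
print(parser.can_fetch("gptbot", base + "/"))

# An allowed agent gets its own group, so the '*' rules do not apply to it.
print(parser.can_fetch("grapeshot", base + "/admin/"))
print(parser.crawl_delay("grapeshot"))

# An agent without its own group falls back to the '*' group.
print(parser.can_fetch("somebot", base + "/admin/users"))
print(parser.crawl_delay("somebot"))
```

A crawler that honours crawl-delay would wait the returned number of seconds between requests. Support for this field varies between crawlers, and it is not part of RFC 9309.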

Comments

  • France Medias Monde [2024-03-22] - francemediasmonde.com
  • RFIMusique - musique.rfi.fr - HTTPS
  • CSS, JS, Images
  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • Bots blocking
  • Partners
  • Too fast bots

Warnings

  • 6 invalid lines.