paristoversailles.com
robots.txt

Robots Exclusion Standard data for paristoversailles.com

Resource Scan

Scan Details

Site Domain paristoversailles.com
Base Domain paristoversailles.com
Scan Status Ok
Last Scan 2024-11-08T16:09:13+00:00
Next Scan 2024-12-08T16:09:13+00:00

Last Scan

Scanned 2024-11-08T16:09:13+00:00
URL https://paristoversailles.com/robots.txt
Domain IPs 2001:41d0:1:1b00:213:186:33:3, 46.105.204.15
Response IP 46.105.204.15
Found Yes
Hash 604e23314313644e0a608ce89f3690a45f1013f638dae0bbc215ac490e6c581e
SimHash 6372430c4e03

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-login.php
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /wp-content/backups
Disallow /category/*/*
Disallow */trackback
Disallow */feed
Disallow */comments
Disallow /*?
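Several paths in this group use wildcards. As a minimal sketch of how such patterns can be evaluated, the Python below translates a rule path into a regular expression, assuming the RFC 9309 semantics where `*` matches any run of characters and a trailing `$` anchors the end of the path. The function names and the request paths are hypothetical, chosen only for illustration.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the URL path (RFC 9309 style wildcards).
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape every literal piece, then join the pieces with '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def matches(pattern: str, path: str) -> bool:
    # re.match anchors at the start, because rule paths are prefixes.
    return pattern_to_regex(pattern).match(path) is not None


# Hypothetical request paths tested against rules from the '*' group.
print(matches("/wp-admin", "/wp-admin/options.php"))    # True
print(matches("/category/*/*", "/category/news/2024"))  # True
print(matches("/category/*/*", "/category/news"))       # False
print(matches("*/feed", "/blog/post/feed"))             # True
print(matches("/*?", "/page?id=3"))                     # True
print(matches("/*?", "/page"))                          # False
```

Under these assumptions, `/*?` blocks any URL that carries a query string, and `/category/*/*` blocks only category paths at least two levels deep.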

googlebot

Rule Path
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*.swf$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Allow /*.css$
Allow /*.js$
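This group mixes Disallow and Allow rules, so a crawler needs a way to pick one when several match. The sketch below assumes the RFC 9309 convention that the longest matching pattern wins and that Allow wins a tie. The helper names and the sample paths are hypothetical, and real crawlers may differ in detail.

```python
import re

# Rules of the googlebot group, copied from the table above.
GOOGLEBOT_RULES = [
    ("disallow", "/*.php$"), ("disallow", "/*.inc$"),
    ("disallow", "/*.gz$"), ("disallow", "/*.swf$"),
    ("disallow", "/*.wmv$"), ("disallow", "/*.cgi$"),
    ("disallow", "/*.xhtml$"),
    ("allow", "/*.css$"), ("allow", "/*.js$"),
]


def to_regex(pattern):
    """Compile a rule path; '*' is a wildcard, trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    body = ".*".join(re.escape(piece) for piece in core.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if the path may be fetched under the given rules."""
    best = None  # (pattern length, is_allow) of the best match so far
    for kind, pattern in rules:
        if not pattern:  # an empty rule path matches nothing
            continue
        if to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow (True) wins.
            if best is None or candidate > best:
                best = candidate
    # With no matching rule, fetching is allowed by default.
    return True if best is None else best[1]


print(is_allowed(GOOGLEBOT_RULES, "/index.php"))                 # False
print(is_allowed(GOOGLEBOT_RULES, "/wp-includes/js/jquery.js"))  # True
print(is_allowed(GOOGLEBOT_RULES, "/about/"))                    # True
```

Under RFC 9309 a crawler obeys only the group that matches its user agent most specifically, so a crawler identifying as googlebot would evaluate these rules on their own rather than combining them with the `*` group.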

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

abonti
adbeat_bot
adequat
adequat-systems
ahrefsbot
ahrefsbot/5.0
aihitbot
alexa
alexibot
amisoftware
apocalxexplorerbot
archive.org_bot
asknread.com
asterias
augure
auramundi
backdoorbot/1.0
backlinkcrawler
baiduspider-favo
baiduspider-cpro
baiduspider-ads
baidu
baiduspider-news
baiduspider-video
baiduspider-image
baiduspider
baidu
bizinformation
black hole
blexbot
blowfish/1.0
botalot
builtbottough
bullseye/1.0
bunnyslippers
ccbot
cegbfeieh
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
cision
cliqzbot
coccoc/1.0
coexel
compspybot
converacrawler
copyrightcheck
corporama
cosmos
crescent
crescent internet toolpak http ole control v.1.0
curious george
digimind
disco pump 3.1
discobot
dittospyder
domaintuno
dotbot
dotbot/1.1
dubaiindex
ellisphere
ecairn-grabber/1.0
emailcollector
emailsiphon
emailwolf
erocrawler
exabot
exabot/3.0
extractorpro
ezooms
fetch
flamingo_searchengine
flipboard
flipboardproxy
foobot
genieo/1.0
gozaikbot
grapeshotcrawler
grouphigh/1.0
grup-client
harvest/1.5
hloader
httplib
httrack
httrack 3.0
hubspot links crawler 1.0
humanlinks
ia_archiver
ia_archiver-web.archive.org
icjobs
igentia
infohelfer
infonavirobot
infoseek
ips-agent
istellabot
it2media-domain-crawler
jamesbot
jetbot
jennybot
k2spider
kbcrawl
kenjin spider
kimengi/nineconnections.com
knowings
kraken/0.1
leadbox
lexibot
libweb/clshttp
libwww
linkdexbot
linkdexbot/2.0
linkextractorpro
linkfluence
linkscan/8.1a unix
linkscan
linkwalker
livelapbot
livelapbot/0.2
lssrocketcrawler
lwp-trivial
lwp-trivial/1.34
macinroy
mail.ru_bot
mata hari
meltawer
mention
metauri api/2.0
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
miixpc
miixpc/4.2
moreover
moreover/5.1
mj12bot
mj12bot/v1.4.5
mlbot
moget
moget/2.1
msiecrawler
ms search 4.0 robot
ms search 5.0 robot
mytwip
naverbot
ncbot
nerdybot
netants
netattache
netattache light 1.1
netestate
netlyzer fastprobe
netmechanic
netseer
newslebot
newslebot/1.0
newsnow
newsbin
nicerspro
nutch-1.4
obot
offline explorer
openfind
openfind data gathere
opinion-tracker
openhosebot/2.1
owlin bot
owlin bot v3
pagesinventory
paperlibot
paperlibot/2.1
petalbot
pixray*
propowerbot/2.14
prowebwalker
proximic
proxem
psbot
quepasacreep
queryn metasearch
queryseekerspider
qwam content intelligence
readability.com
rebelmouse/0.1
repomonkey
repomonkey bait & tackle/v1.01
rma
rogerbot
rogerbot/1.0
rssingbot
scoop.it
screaming frog seo spider
semvisubot
semvisubot 2.0
seznambot
shrook/2.93y
sightupbot
sindup
sistrix
sitebot
sitecheck.internetseer.com
siteexplorer
sitesnagger
spotter
socialshare
socialshare/1.0
sociallymap
sogou web spider
sogou web spider/4.0
sosospider
spankbot
spanner
spbot
speedy
spiderbot/nutch-1.7
spundge/0.1
spiderlytics
ssearch
suggybot
superbot
superbot/2.6
superfeedr bot/2.0
surveybot
suzuran
synthesio
szukacz
szukacz/1.4
talkwater
teleport
teleportpro
telesoft
the intraformant
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
trendeo
trendybuzz
true_robot
true_robot/1.0
turingos
turnitinbot
u
unisterbot
urlpouls
urly warning
vecteurplus
verticalsearch
vci
vci webviewer vci webviewer win32
vsw
wasalive-bot
waybackarchive.org
wbsearchbot
web image collector
webauto
webbandit
webbandit/3.50
webcopier
webcopy
webenhancer
webmasterworldforumbot
webmirror
webreaper
websauger
website extractor
website quester
webster pro
webstripper
webstripper/2.02
webzip
webzip/4.0
wget
wget
wget/1.5.3
wget/1.6
wikiofeedbot
wikiwix-bot-3.0
winhttrack
winello
woriobot
worldwebheritage.org/1.0
wotbox
www-collector-e
xenu link sleuth
xenu link sleuth/1.3.8
xenu's
xenu's link sleuth 1.1c
yandexbot
yandeximages
yisouspider
youmag
yrspider
zaclysbot/1.2
zeus
zealbot
zite
admantx
ubermetrics
ubermetrics-technologies

Rule Path
Disallow /
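The group above applies a single `Disallow /` to every listed user agent. As a rough illustration, the sketch below feeds a small excerpt to Python's standard `urllib.robotparser`. The excerpt is reconstructed from this scan and is not the literal file text, the agent name `ExampleBrowser` is hypothetical, and only plain prefix rules are used because the standard-library parser does not implement `*` or `$` wildcards.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (an assumption, not the literal robots.txt):
# one plain rule from the '*' group and two agents from the blocked group.
EXCERPT = """\
User-agent: *
Disallow: /wp-admin

User-agent: ahrefsbot
User-agent: mj12bot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())  # parse in memory, no network access

BASE = "https://paristoversailles.com"

# Agents named in the blocked group are refused everywhere.
print(parser.can_fetch("AhrefsBot/7.0", BASE + "/"))         # False
print(parser.can_fetch("MJ12bot/v1.4.5", BASE + "/about/"))  # False

# Any other agent falls back to the '*' group.
print(parser.can_fetch("ExampleBrowser", BASE + "/about/"))     # True
print(parser.can_fetch("ExampleBrowser", BASE + "/wp-admin/"))  # False
```

Compliance is voluntary: the rules only affect crawlers that read and honor robots.txt.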

Other Records

Field Value
sitemap http://www.paristoversailles.com/sitemap_index.xml

Comments

  • Prevent indexing of sensitive directories
  • Prevent indexing of sensitive files
  • Allow Google Image
  • Allow Google AdSense
  • Point the spider to our sitemap
  • User-agent: SemrushBot
  • User-agent: SemrushBot-SA

Warnings

  • 5 invalid lines.