camping-duguesclin.com
robots.txt

Robots Exclusion Standard data for camping-duguesclin.com

Resource Scan

Scan Details

Site Domain camping-duguesclin.com
Base Domain camping-duguesclin.com
Scan Status Ok
Last Scan 2024-09-28T01:40:13+00:00
Next Scan 2024-10-28T01:40:13+00:00

Last Scan

Scanned 2024-09-28T01:40:13+00:00
URL https://camping-duguesclin.com/robots.txt
Redirect https://www.camping-duguesclin.com/robots.txt
Redirect Domain www.camping-duguesclin.com
Redirect Base camping-duguesclin.com
Domain IPs 54.36.220.60
Redirect IPs 54.36.220.60
Response IP 54.36.220.60
Found Yes
Hash 3757d242f9d74d2d6126e8082831bc62112269c2d71a0e39b3454abdf2d00bd8
SimHash 89cfcd3636d9

Groups

*

Rule Path
Disallow /*guestbook-add.html
Disallow /*account.html
Disallow /*clickAd-*
Disallow /*getpagecontent*
Disallow /*login*
Disallow /*logout*
Disallow /*newsletters.html
Disallow /*password.html
Disallow /*rechercher.html
Disallow /*subscribe.html
Disallow /*imagezoom*
Disallow /action-*
Disallow /content/*
Disallow /mon-compte/*
Disallow /newsletter/*
Disallow /page-introuvable/
Disallow /recherche/*
Disallow /*index.php*
Disallow /*.asp*
Disallow /htm/*
Disallow /htm_ang/*
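
The wildcard rules above can be checked against a path with a small matcher. The following is a minimal sketch, assuming the common interpretation of robots.txt patterns: `*` matches any run of characters, a trailing `$` anchors the end of the path, and everything else is a prefix match. The function names and the sample paths (such as `/tarifs.html`) are hypothetical and not part of the scan data. Only a subset of the rules is copied in, and since this group has no Allow lines, no precedence logic is needed.

```python
import re

# A subset of the Disallow patterns listed for the "*" group.
DISALLOW = [
    "/*guestbook-add.html",
    "/*login*",
    "/action-*",
    "/content/*",
    "/page-introuvable/",
    "/*index.php*",
    "/*.asp*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the path start."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where the pattern had "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored_end else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the given URL path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    # Sample paths are illustrative only.
    for sample in ["/fr/login.html", "/content/images/a.jpg", "/tarifs.html"]:
        print(sample, "blocked" if is_disallowed(sample) else "allowed")
```

Note that `/page-introuvable/` carries no wildcard, so a path without the trailing slash would not be matched by that rule under this interpretation.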

ahrefsbot
baiduspider
barkrowler
blexbot
bubing
catchbot
cfnetwork
cliqzbot
coccocbot-web
digincore
domaincrawler
domaincrawler/3.0
dotbot
duppies
exalead
exaleadcloudview
exensa-dew
faviconizer
garlik
garlikcrawler
grapeshot
iccrawler - icjobs
linkdexbot
linkdexbot/2.0
linkdexbot/2.2
linkpadbot
mail.ru
megaindex
melodibot
mj12bot
obot
panscient.com
proximic
python-urllib
r6_feedfetcher
r6_commentreader
root/0.1
semrushbot
semrushbot-sa
seznambot
sistrix
smtbot
sogou spider
spbot
spinn3r
titan
turnitinbot
turnitinbot
twengabot-2.0
ucrawler/1.0
ucrawler
uptimebot
wotbox
yandex
yisouspider
zoombot

Rule Path
Disallow /
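
The effect of the two groups can be illustrated with Python's standard `urllib.robotparser`. This is a sketch under stated assumptions: the robots.txt text below is a shortened reconstruction from the tables in this report (three of the listed user agents, one non-wildcard rule from the `*` group), not the file as served, and `examplebot` and `/tarifs.html` are hypothetical names used only for illustration. The standard-library parser is used here for group selection; it compares rule paths as plain prefixes, so the wildcard rules of the `*` group are left out.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of the two groups (assumption: not the full file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /page-introuvable/

User-agent: ahrefsbot
User-agent: mj12bot
User-agent: semrushbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.camping-duguesclin.com"

# A crawler named in the second group is blocked from the whole site.
blocked_everywhere = not parser.can_fetch("MJ12bot/v1.4.8", BASE + "/")

# A crawler not named anywhere falls back to the "*" group.
fallback_allowed = parser.can_fetch("examplebot", BASE + "/tarifs.html")
fallback_blocked = not parser.can_fetch("examplebot", BASE + "/page-introuvable/")

if __name__ == "__main__":
    print(blocked_everywhere, fallback_allowed, fallback_blocked)
```

A crawler selects the most specific group that names it, so the agents listed above never reach the `*` rules; for them the single `Disallow /` is the only rule that applies.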

Other Records

Field Value
sitemap https://www.camping-duguesclin/sitemap-1-google.xml

Comments

  • robots interdits ("forbidden robots")