africavivre.com
robots.txt

Robots Exclusion Standard data for africavivre.com

Resource Scan

Scan Details

Site Domain africavivre.com
Base Domain africavivre.com
Scan Status Ok
Last Scan2026-03-23T03:19:44+00:00
Next Scan 2026-04-22T03:19:44+00:00

Last Scan

Scanned2026-03-23T03:19:44+00:00
URL https://africavivre.com/robots.txt
Domain IPs 176.31.75.116
Response IP 176.31.75.116
Found Yes
Hash 6267265ea447bcf0ba53009028310437792f5ec608e04b67f674fa3bd9a93f1a
SimHash 1378cb01d296

Groups

*

Rule Path
Disallow /*?tag=
Disallow /*?id_currency=
Disallow /*?back=
Disallow /*?order=
Disallow /*%26tag%3D
Disallow /*%26id_currency%3D
Disallow /*%26search_query%3D
Disallow /*%26back%3D
Disallow /*%26order%3D
Disallow /*controller%3Daddresses
Disallow /*controller%3Daddress
Disallow /*controller%3Dauthentication
Disallow /*controller%3Dcart
Disallow /*controller%3Ddiscount
Disallow /*controller%3Dfooter
Disallow /*controller%3Dget-file
Disallow /*controller%3Dheader
Disallow /*controller%3Dhistory
Disallow /*controller%3Didentity
Disallow /*controller%3Dimages.inc
Disallow /*controller%3Dinit
Disallow /*controller%3Dmy-account
Disallow /*controller%3Dorder
Disallow /*controller%3Dorder-opc
Disallow /*controller%3Dorder-slip
Disallow /*controller%3Dorder-detail
Disallow /*controller%3Dorder-follow
Disallow /*controller%3Dorder-return
Disallow /*controller%3Dorder-confirmation
Disallow /*controller%3Dpagination
Disallow /*controller%3Dpassword
Disallow /*controller%3Dpdf-invoice
Disallow /*controller%3Dpdf-order-return
Disallow /*controller%3Dpdf-order-slip
Disallow /*controller%3Dproduct-sort
Disallow /*controller%3Dsearch
Disallow /*controller%3Dstatistics
Disallow /*controller%3Dattachment
Disallow /*controller%3Dguest-tracking
Disallow /*/classes/
Disallow /*/config/
Disallow /*/download/
Disallow /*/mails/
Disallow /*/translations/
Disallow /*/tools/
Disallow /mot-de-passe-oublie
Disallow /adresse
Disallow /adresses
Disallow /authentification
Disallow /panier
Disallow /bons-de-reduction
Disallow /historique-des-commandes
Disallow /identite
Disallow /mon-compte
Disallow /details-de-la-commande
Disallow /avoirs
Disallow /commande
Disallow /commande-rapide
Disallow /suivi-commande-invite
Disallow /modules/gsnippetsreviews/
Disallow /themes/ba16/js/link.js
Allow */tools/*.js
Disallow /promotions
Disallow /index.php?controller=advancedsearch4%3F*
Disallow /*searchcron
Disallow /*cron

adsbot-google

Rule Path
Disallow /*searchcron
Disallow /*cron

facebookexternalhit

Rule Path
Allow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /

addsearchbot
ai2bot
ai2bot-dolma
aihitbot
amazonbot
andibot
anthropic-ai
applebot
applebot-extended
awario
bedrockbot
bigsur.ai
brightbot 1.0
bytespider
ccbot
claude-searchbot
claude-user
claude-web
claudebot
cloudvertexbot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawlspace
datenbank crawler
devin
diffbot
duckassistbot
echobot bot
echoboxbot
facebookbot
factset_spyderbot
firecrawlagent
friendlycrawler
gemini-deep-research
google-extended
googleother
googleother-image
googleother-video
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
linerbot
meta-externalagent
meta-externalagent
meta-externalfetcher
meta-externalfetcher
meta-webindexer
mistralai-user
mistralai-user/1.0
mycentralaiscraperbot
netestate imprint crawler
novaact
oai-searchbot
omgili
omgilibot
openai
operator
pangubot
panscient
panscient.com
perplexity-user
perplexitybot
petalbot
phindbot
poseidon research crawler
qualifiedbot
quillbot
quillbot.com
sbintuitionsbot
scrapy
semrushbot-ocob
semrushbot-swa
shapbot
sidetrade indexer bot
terracotta
thinkbot
tiktokspider
timpibot
velenpublicwebcrawler
wardbot
webzio-extended
wpbot
yak
yandexadditional
yandexadditionalbot
youbot

Rule Path
Disallow /

webzio-extended
wpbot
yak
yandexadditional
yandexadditionalbot
youbot

Rule Path
Disallow /

yandexadditional
yandexadditionalbot
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.laboutiqueafricavivre.com/1_index_sitemap.xml

Comments

  • Controllers
  • Directories
  • Files
  • Modules
  • Allow Bots
  • Disallow Bots
  • User-agent: GPTBot
  • Disallow: /
  • User-agent: ChatGPT-User
  • Disallow: /
  • User-agent: ChatGPT Agent
  • User-agent: ChatGPT-User
  • User-agent: facebookexternalhit
  • User-agent: Google-CloudVertexBot
  • User-agent: Google-Firebase
  • User-agent: GoogleAgent-Mariner
  • User-agent: GPTBot
  • This robots.txt has bin erased by DC-handler
  • This robots.txt has bin erased by DC-handler YaK
  • This robots.txt has bin erased by DC-handler

Warnings

  • `t` is not a known field.