meubliz.com
robots.txt

Robots Exclusion Standard data for meubliz.com

Resource Scan

Scan Details

Site Domain meubliz.com
Base Domain meubliz.com
Scan Status Ok
Last Scan2025-05-24T06:07:37+00:00
Next Scan 2025-05-31T06:07:37+00:00

Last Scan

Scanned2025-05-24T06:07:37+00:00
URL https://meubliz.com/robots.txt
Redirect https://www.meubliz.com/robots.txt
Redirect Domain www.meubliz.com
Redirect Base meubliz.com
Domain IPs 2001:41d0:1:1b00:213:186:33:17, 213.186.33.17
Redirect IPs 2001:41d0:1:1b00:213:186:33:17, 213.186.33.17
Response IP 213.186.33.17
Found Yes
Hash 73348330e6cbf3565f3347bafe3034410d5a01e623a700eca7fe8510e2cf8ad5
SimHash 54967145aefc

Groups

*

Rule Path
Disallow /adserver/

googlebot

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

storebot-google

Rule Path
Allow /

google-inspectiontool

Rule Path
Allow /

googleother

Rule Path
Allow /

googleother-image

Rule Path
Allow /

googleother-video

Rule Path
Allow /

bingbot

Rule Path
Allow /

adidxbot

Rule Path
Allow /

criteobot/0.1

Rule Path
Allow /

google-cloudvertexbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

bingpreview

Rule Path
Disallow /

adbeat_bot

Rule Path
Disallow /

admantx

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

claritybot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

linespider

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

pinterest

Rule Path
Disallow /

proximic

Rule Path
Disallow /

qwarrybot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

snapchatads/1.0

Rule Path
Disallow /

snap url preview service; bot; snapchat

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

startmebot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

ttd-content

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

taboolabot/3.7

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

qwantify-prod/1.0

Rule Path
Disallow /

brightbot 1.0

Rule Path
Disallow /

pernod ricard - clicktobuy crawlerbot/1.0

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

yandexrenderresourcesbot/1.0

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bingsapphire

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatglm-spider

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

copilotsapphire

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

duckassistbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

novaact

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

openai

Rule Path
Disallow /

operator

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

perplexity-user

Rule Path
Disallow /

poesearchbot

Rule Path
Disallow /

spawning-ai

Rule Path
Disallow /

summalybot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

webzio

Rule Path
Disallow /

youbot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

nicecrawler

Rule Path
Disallow /

ahrefssiteaudit

Rule Path
Disallow /

chrome-lighthouse

Rule Path
Disallow /

dark visitor server

Rule Path
Disallow /

deadlinkchecker

Rule Path
Disallow /

eyeotabot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

siteauditbot

Rule Path
Disallow /

t3versionsbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

w3c_css_validator

Rule Path
Disallow /

w3c_validator

Rule Path
Disallow /

wellknownbot

Rule Path
Disallow /

yakazbot

Rule Path
Disallow /

bazqux

Rule Path
Disallow /

bitlybot

Rule Path
Disallow /

bublupbot

Rule Path
Disallow /

discordbot

Rule Path
Disallow /

embedly

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

feedly

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

freshrss

Rule Path
Disallow /

friendica

Rule Path
Disallow /

google-read-aloud

Rule Path
Disallow /

hatena

Rule Path
Disallow /

iframely

Rule Path
Disallow /

inoreader

Rule Path
Disallow /

linkedinbot

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

mastodon

Rule Path
Disallow /

miniflux

Rule Path
Disallow /

newsblur

Rule Path
Disallow /

nextcloud

Rule Path
Disallow /

pinterestbot

Rule Path
Disallow /

pocketparser

Rule Path
Disallow /

redditbot

Rule Path
Disallow /

serendeputybot

Rule Path
Disallow /

simplepie

Rule Path
Disallow /

skypeuripreview

Rule Path
Disallow /

slackbot-linkexpanding

Rule Path
Disallow /

snap url preview service

Rule Path
Disallow /

snapchat

Rule Path
Disallow /

startmebot

Rule Path
Disallow /

superfeedr

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

synapse

Rule Path
Disallow /

telegrambot

Rule Path
Disallow /

twitterbot

Rule Path
Disallow /

viber

Rule Path
Disallow /

vkshare

Rule Path
Disallow /

whatsapp

Rule Path
Disallow /

yahoo link preview

Rule Path
Disallow /

tinytinyrss

Rule Path
Disallow /

*

Rule Path
Disallow /societe

Other Records

Field Value
crawl-delay 2

*

Rule Path
Disallow /forum_mobilier_et_arts_decoratifs/viewforum.php/memberlist.php
Disallow /forum_mobilier_et_arts_decoratifs/viewforum.php/viewtopic.php
Disallow /forum_mobilier_et_arts_decoratifs/viewtopic.php?p=*
Disallow /forum_mobilier_et_arts_decoratifs/memberlist.php
Disallow /forum_mobilier_et_arts_decoratifs/app.php/feed/*
Disallow /forum_mobilier_et_arts_decoratifs/feed/*
Disallow /forum_mobilier_et_arts_decoratifs/search.php?author_id=*

*

Rule Path
Disallow /cpad/

Comments

  • General
  • Modif:20/05/2025
  • no forum pic 01/04/2025
  • no forum feed 01/04/2025
  • nouveaux robots 20/05/2025
  • adserver
  • Robots autorisés
  • Adsense
  • https://www.bing.com/webmasters/help/which-crawlers-does-bing-use-8c184ec0
  • https://www.criteo.com/criteo-crawler/
  • Google et Bing Disallow
  • Robots bloqués
  • Nouveaux
  • IA Desactivation
  • Anthropic
  • Nouvel version de bot ChatGPT depuis été 2023
  • facebook Le robot indexation Meta-ExternalAgent pour entraînement de modèles d’IA ou’amélioration de produits en indexant directement le contenu.
  • Archivers - https://darkvisitors.com/agents
  • Developer Helpers - https://darkvisitors.com/agents
  • Fetchers - https://darkvisitors.com/agents
  • Tous les operations suivntes s'appliques tous les autres robots
  • Ne pas scanner ces repertoires
  • Ne pas scanner ces repertoires forum
  • priorité aux viewtopic.php?t= (topic plutôt que chaque post individuellement)
  • Disallow picture (pour l'instant) car non prises en compte de toute facon
  • Disallow: /forum_mobilier_et_arts_decoratifs/download/file.php?id=*
  • Conservation des ressources bots de referencement
  • User-agent: *
  • Temporaire ne pas référencer les pager de search.php (pour l'instant)
  • Disallow: /forum_mobilier_et_arts_decoratifs/search.php?*
  • Temporaire ne pas lire les requetes de query (pour l'instant)
  • Disallow: /*?*
  • Ne pas autoriser l'access aux chargement de publicité cpad

Warnings

  • 2 invalid lines.