guinee-manutention.com
robots.txt

Robots Exclusion Standard data for guinee-manutention.com

Resource Scan

Scan Details

Site Domain guinee-manutention.com
Base Domain guinee-manutention.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-03-17T15:22:40+00:00
Next Scan 2024-06-15T15:22:40+00:00
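
For context on the failure above: a "client error" is an HTTP 4xx response. RFC 9309 describes how a crawler may treat each status class when fetching robots.txt. The sketch below illustrates that mapping; the function name and the returned labels are hypothetical, and this scanner's own handling is not documented here.

```python
def robots_policy_for_status(status: int) -> str:
    """Map an HTTP status for /robots.txt to a crawl policy (after RFC 9309).

    The returned labels are illustrative names, not part of any standard.
    """
    if 200 <= status < 300:
        return "parse"          # file fetched: parse it and obey its rules
    if 300 <= status < 400:
        return "follow"         # redirect: follow it (a handful of hops at most)
    if 400 <= status < 500:
        return "allow_all"      # "unavailable": crawler may access any path
    if 500 <= status < 600:
        return "disallow_all"   # "unreachable": assume everything is disallowed
    return "retry"              # anything else: hypothetical fallback


if __name__ == "__main__":
    for code in (200, 301, 404, 503):
        print(code, robots_policy_for_status(code))
```

Under this reading, a 4xx on the latest scan would leave a compliant crawler free to fetch any path, regardless of the rules recorded in the last successful scan.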

Last Successful Scan

Scanned 2022-11-18T09:28:52+00:00
URL https://guinee-manutention.com/robots.txt
Redirect https://www.guinee-manutention.com/robots.txt
Redirect Domain www.guinee-manutention.com
Redirect Base guinee-manutention.com
Response IP 141.94.139.84
Found Yes
Hash 49d8771e02e1159551f8b152eedf178e8bed5f8e05c42e56d40591110f1f1cd4
SimHash 633352274917

Groups

ahrefsbot

Rule Path
Disallow /

semrushbot-sa
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-bm
semrushbot-seoab
semrushbot

Rule Path
Disallow /

baiduspider
baiduspider-image
baiduspider-video
baiduspider-news
baiduspider-favo
baiduspider-cpro
baiduspider-ads

Rule Path
Disallow /

ahrefsbot
alexibot
apocalxexplorerbot
asterias
backdoorbot/1.0
backlinkcrawler
bizinformation
black hole
blackbird
blowfish/1.0
botalot
builtbottough
bullseye/1.0
bunnyslippers
cegbfeieh
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
disco pump 3.1
dittospyder
emailcollector
emailsiphon
emailwolf
erocrawler
extractorpro
flamingo_searchengine
foobot
grapeshot
harvest/1.5
hloader
httplib
httrack
httrack 3.0
humanlinks
igentia
infonavirobot
jennybot
kenjin spider
lexibot
libweb/clshttp
linkextractorpro
linkscan/8.1a unix
linkwalker
lwp-trivial
lwp-trivial/1.34
mata hari
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
mj12bot
mlbot
moget
moget/2.1
naverbot
netants
netattache
netattache light 1.1
netmechanic
nicerspro
niki-bot
offline explorer
openfind
openfind data gathere
propowerbot/2.14
proximic
prowebwalker
psbot
quepasacreep
queryn metasearch
repomonkey
repomonkey bait & tackle/v1.01
rma
seekportbot
sightupbot
sitebot
sitesnagger
sogou web spider
sosospider
spankbot
spanner
speedy
suggybot
superbot
superbot/2.6
sputnikbot
suzuran
szukacz/1.4
teleport
telesoft
the intraformant
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
turnitinbot
urlpouls
urly warning
vci
vci webviewer vci webviewer win32
web image collector
webauto
webbandit
webbandit/3.50
webcopier
webcopy
webenhancer
webmasterworldforumbot
webmirror
webreaper
websauger
website extractor
website quester
webster pro
webstripper
webstripper/2.02
webzip
webzip/4.0
wget
wget/1.5.3
wget/1.6
wikiofeedbot
wikiwix-bot-3.0
winhttrack
www-collector-e
xenu's
xenu's link sleuth 1.1c
yrspider
zealbot
zeus

Rule Path
Disallow /
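
The groups above block named crawlers entirely, while every other crawler falls through to the `*` group. A minimal sketch of that selection, using Python's standard `urllib.robotparser` on a two-group excerpt: the sample text is a reconstruction in robots.txt syntax, not the original file, and `ExampleBot` is a made-up user agent.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (assumption): one blocked crawler plus the wildcard group.
SAMPLE = """\
User-agent: ahrefsbot
Disallow: /

User-agent: *
Disallow: /account
"""

parser = RobotFileParser()
parser.parse(SAMPLE.splitlines())

BASE = "https://www.guinee-manutention.com"

# A named crawler matches its own group and is blocked everywhere.
blocked_bot = parser.can_fetch("AhrefsBot/7.0", BASE + "/")

# An unlisted crawler falls back to the "*" group.
other_bot_account = parser.can_fetch("ExampleBot", BASE + "/account")
other_bot_home = parser.can_fetch("ExampleBot", BASE + "/")

print(blocked_bot, other_bot_account, other_bot_home)  # False False True
```

Note that `urllib.robotparser` matches user agents case-insensitively and ignores the version suffix after `/`, which is why `AhrefsBot/7.0` hits the `ahrefsbot` group.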

*

Rule Path
Disallow /buyers.js
Disallow /ad/overview/
Disallow /ad/compare
Disallow /sheet-stats/
Disallow /ad-book/
Disallow /bookmark/
Disallow /flags
Disallow /getfeaturedads/
Disallow /account
Disallow /account/
Disallow /criteo
Disallow /facebooklogin
Disallow /facebookUserData
Disallow /googleLogin
Disallow /linkedinLogin
Disallow /dbvi/picture-form
Disallow /dbvi/comment-form
Disallow /media-center/form-share-picture
Disallow /media-center/form-add-picture
Disallow /media-center/set-picture-rating
Disallow /contact/autocompleteContactForm
Disallow /*?*dir=
Disallow /translate
Disallow /translate/check
Disallow /contact/*/callme
Disallow /contact/*/ask-rdv
Disallow /contact/v2/send.html
Disallow /feed/backpage
Disallow /*?*q=
Disallow /search/similars
Disallow /redirect-to-auction-site-*
Disallow /btn-partner
Disallow /*/*-search-geo-brand
Disallow /*/*-search-geo-category
Disallow /bookmarksAds/*
Disallow /tuv/*
Disallow /*?*crn=0
Disallow /*?*hay=
Disallow /*?*ob=
Disallow /*?*dir=
Disallow /*?*fam=
Disallow /*?*cat=
Disallow /*?*ctr=
Disallow /*?*brd=
Disallow /*?*rgn=
Disallow /*?*var=
Disallow /*?*wrrt=
Disallow /*?*wd=
Disallow /*?*transmi=
Disallow /*?*pr=
Disallow /*?*ph=
Disallow /*?*vi=
Disallow /*?*pt=
Disallow /*?*rct=
Disallow /*?*brd=
Disallow /*?*no_ctr=
Disallow /*?*dpt=
Disallow /*?*tpe=
Disallow /*?*mt=
Disallow /*?*svar=
Disallow /*?*mlf=
Disallow /*?*mlt=
Disallow /*?*multi_brd=
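
Many of the paths above use the `*` wildcard, which matches any run of characters; plain paths match as prefixes. Python's `urllib.robotparser` does not interpret wildcards, so the sketch below converts a pattern to a regular expression by hand. The helper names and the sample request paths are hypothetical, and the sketch ignores Allow rules, longest-match precedence, and percent-encoding normalization.

```python
import re


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the end.
    Without '$', the pattern matches as a prefix of the request path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns: list) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


# A few patterns copied from the "*" group above.
RULES = ["/account", "/*?*dir=", "/contact/*/callme", "/tuv/*"]

if __name__ == "__main__":
    for sample in ("/account/settings", "/listing?ob=1&dir=asc",
                   "/contact/42/callme", "/about"):
        print(sample, is_disallowed(sample, RULES))
```

Because matching is prefix-based, `/account` alone already covers `/account/`, so the separate `/account/` rule in the list is redundant but harmless.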

linguee bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5
sitemap https://www.guinee-manutention.com/sitemap.xml
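
A crawler can read both of these records with Python's standard `urllib.robotparser` (`crawl_delay` needs Python 3.6+, `site_maps` needs 3.8+). The scan lists crawl-delay outside any group, so which user agent it applied to is not recoverable; the sample below places it under `*` purely for illustration, and `ExampleBot` is a made-up user agent.

```python
from urllib.robotparser import RobotFileParser

# Illustrative reconstruction (assumption): crawl-delay attached to the "*" group.
SAMPLE = """\
User-agent: *
Crawl-delay: 5
Disallow: /account

Sitemap: https://www.guinee-manutention.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(SAMPLE.splitlines())

# Seconds to wait between requests, or None if no delay applies to this agent.
delay = parser.crawl_delay("ExampleBot")

# List of sitemap URLs declared in the file, or None if there are none.
sitemaps = parser.site_maps()

print(delay, sitemaps)
```

Crawl-delay is not part of RFC 9309, so support varies between crawlers; Sitemap records apply to all crawlers regardless of where they appear in the file.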

Warnings

  • 2 invalid lines.