jeans-meile.de
robots.txt

Robots Exclusion Standard data for jeans-meile.de

Resource Scan

Scan Details

Site Domain jeans-meile.de
Base Domain jeans-meile.de
Scan Status Ok
Last Scan 2024-06-08T21:01:59+00:00
Next Scan 2024-06-15T21:01:59+00:00

Last Scan

Scanned 2024-06-08T21:01:59+00:00
URL https://jeans-meile.de/robots.txt
Redirect https://www.jeans-meile.de/robots.txt
Redirect Domain www.jeans-meile.de
Redirect Base jeans-meile.de
Domain IPs 138.201.120.221, 2a01:4f8:172:241c::2
Redirect IPs 138.201.120.221, 2a01:4f8:172:241c::2
Response IP 138.201.120.221
Found Yes
Hash fed9d6182c0a27ca08bff8b6c4283e8cab7795a083eb1a824d7cb811a5fb5317
SimHash eb8f7352cdf1

Groups

adsbot-google
adsbot-google-mobile

Rule Path
Allow /

asterias
backdoorbot/1.0
baiduspider
black hole
blexbot
blowfish/1.0
botalot
bspider
builtbottough
bullseye/1.0
bunnyslippers
ccbot
cegbfeieh
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
companybook crawler
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
dalvik/2.1.0
daum
dittospyder
domain re-animator bot
emailcollector
emailsiphon
emailwolf
erocrawler
extractorpro
findxbot
foobot
harvest/1.5
hloader
httplib
humanlinks
ia_archiver
infonavirobot
jennybot
jobboersebot
karriereatbot
kenjin spider
keyword density/0.9
lexibot
libweb/clshttp
linguee
linguee bot
linkextractorpro
linkscan/8.1a unix
linkwalker
lnspiderguy
ltx71
lwp-trivial
lwp-trivial/1.34
mata hari
megaindex.ru/2.0
megaindex.ru
megaindex.com
miixpc
miixpc/4.2
mister pix
moget
moget/2.1
mozilla/4
mozilla/4.0 (compatible; bullseye; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 98)
mozilla/4.0 (compatible; msie 4.0; windows nt)
mozilla/4.0 (compatible; msie 4.0; windows xp)
mozilla/4.0 (compatible; msie 4.0; windows 2000)
mozilla/4.0 (compatible; msie 4.0; windows me)
netants
nicerspro
offline explorer
openfind
openfind data gathere
propowerbot/2.14
prowebwalker
queryn metasearch
quanta-probe
quanta-probe/2.0
repomonkey
repomonkey bait & tackle/v1.01
rma
scoutjet
sitesnagger
seoscanners.net
seznambot
sogou spider
spankbot
spanner
spbot
spbot/5.0.3
suzuran
szukacz/1.4
teeraidbot
teleport
teleportpro
telesoft
the intraformant
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
urly warning
uptimebot
uptimebot/1.0
uptimerobot/2.0
vagabondo
vci
vci webviewer vci webviewer win32
web image collector
webauto
webbandit
webbandit/3.50
webcopier
webenhancer
webmasterworldforumbot
websauger
website quester
webster pro
webstripper
webzip
webzip/4.0
wget
wget/1.5.3
wget/1.6
www-collector-e
xenu's
xenu's link sleuth 1.1c
yandex
yandeximages/3.0

Rule Path
Disallow /
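
How a crawler picks one of the groups above can be sketched roughly as follows. This is a simplified illustration, not the scanner's or any particular crawler's implementation: the function name `select_group` is hypothetical, and the matching rule (case-insensitive substring, longest token wins, `*` as fallback) is an assumption loosely modelled on common crawler behaviour.

```python
def select_group(user_agent, group_tokens):
    """Pick the group token that applies to a user agent string.

    Assumption: a token applies if it occurs in the user agent,
    ignoring case; the longest such token is preferred, and the
    catch-all '*' group is used when nothing else matches.
    """
    ua = user_agent.lower()
    matches = [t for t in group_tokens if t != "*" and t.lower() in ua]
    return max(matches, key=len) if matches else "*"


# A few tokens taken from the groups listed above.
TOKENS = ["adsbot-google", "adsbot-google-mobile", "ccbot", "wget", "*"]

print(select_group("Wget/1.21.3", TOKENS))
print(select_group("AdsBot-Google-Mobile", TOKENS))
print(select_group("ExampleBrowser/1.0", TOKENS))
```

Under these assumptions, a Wget client falls into the blocked group, the Google Ads mobile bot into the fully allowed group, and an unlisted agent into the `*` group.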

*

Rule Path
Allow /
Disallow /pkginfo/
Disallow /index.php/
Disallow /control/
Disallow /customize/
Disallow /enable-cookies
Disallow /newsletter/
Disallow /poll/
Disallow /scripts/
Disallow /sendfriend/
Disallow /captcha/*
Disallow /shariff_backend
Disallow /shipping/tracking/*
Disallow /*.php$
Disallow /rss*
Disallow /api.php
Disallow /cron.php
Disallow /cron.sh
Disallow /shell/
Disallow /debit/
Disallow /error_log
Disallow /install.php
Disallow /get.php
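
The `*` group mixes a blanket `Allow /` with more specific `Disallow` patterns, some using the `*` wildcard and the `$` end anchor. A minimal sketch of how such rules might be evaluated, assuming the longest-match precedence described in RFC 9309 (the most specific pattern wins, `Allow` wins ties), is shown below. The helper names `rule_to_regex` and `is_allowed` are hypothetical, and only a subset of the rules above is included.

```python
import re


def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, allowed = -1, True
    for kind, pattern in rules:
        if rule_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


# Subset of the '*' group rules listed above.
RULES = [
    ("allow", "/"),
    ("disallow", "/newsletter/"),
    ("disallow", "/captcha/*"),
    ("disallow", "/*.php$"),
    ("disallow", "/rss*"),
]

for path in ["/jeans/", "/newsletter/subscribe", "/index.php", "/index.php?x=1"]:
    print(path, is_allowed(path, RULES))
```

Note that Python's built-in `urllib.robotparser` applies rules in file order and does not interpret `*` or `$` inside paths, so it would likely treat the leading `Allow /` as matching everything; that is why the sketch uses its own matcher.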

Other Records

Field Value
sitemap https://www.jeans-meile.de/sitemap.xml
sitemap https://www.jeans-meile.de/media/feed/sitemap-products.xml
sitemap https://www.jeans-meile.de/media/feed/imagesitemap.xml
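
The sitemap records above sit outside any user-agent group. A small sketch of how they could be pulled out of a raw robots.txt body follows; the function name `extract_sitemaps` is hypothetical, the sample text is a reconstruction for illustration rather than the actual file, and treating `#` as the start of a comment is a simplifying assumption.

```python
def extract_sitemaps(robots_txt):
    """Return the values of all 'Sitemap:' records, in file order."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop trailing comments (assumes '#' never appears inside a URL).
        line = line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Illustrative input only; not the verbatim file contents.
SAMPLE = """\
User-agent: *
Allow: /
Disallow: /newsletter/

Sitemap: https://www.jeans-meile.de/sitemap.xml
Sitemap: https://www.jeans-meile.de/media/feed/sitemap-products.xml
Sitemap: https://www.jeans-meile.de/media/feed/imagesitemap.xml
"""

for url in extract_sitemaps(SAMPLE):
    print(url)
```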

Comments

  • Allow Google Ads
  • Block Robots
  • Allow the rest of bots to crawl only Magento relevant information
  • Directories
  • Paths (clean URLs)
  • 2024-02-21-Disallow: */blog/search/
  • 2024-02-21-Disallow: /productalert/
  • 2024-02-21-Disallow: */shopby*
  • 2024-02-21-Disallow: */quickview*
  • Paths (no clean URLs)
  • Old robots + files