tischkicker.de
robots.txt

Robots Exclusion Standard data for tischkicker.de

Resource Scan

Scan Details

Site Domain tischkicker.de
Base Domain tischkicker.de
Scan Status Ok
Last Scan2024-09-17T09:22:55+00:00
Next Scan 2024-10-17T09:22:55+00:00

Last Scan

Scanned2024-09-17T09:22:55+00:00
URL https://tischkicker.de/robots.txt
Redirect https://www.tischkicker.de/robots.txt
Redirect Domain www.tischkicker.de
Redirect Base tischkicker.de
Domain IPs 178.16.59.77
Redirect IPs 178.16.59.77
Response IP 178.16.59.77
Found Yes
Hash 97dcfb0e6b883d81dae10b430629b5834b7b10c1b1b1a13a5f3af3a9806c8e63
SimHash 0b0908308791

Groups

*

Rule Path
Disallow /admin/
Disallow /app/
Disallow /doc/
Disallow /downloader/
Disallow /includes/
Disallow /lib/
Disallow /pkginfo/
Disallow /shell/
Disallow /var/
Disallow /errors/
Disallow /media/captcha/
Disallow /media/customer/
Disallow /media/dhl/
Disallow /media/downloadable/
Disallow /media/email/
Disallow /media/infortis/
Disallow /media/mgt_developertoolbar/
Disallow /media/import/
Disallow /media/sales/
Disallow /media/wysiwyg/infortis/ultimo/
Disallow /media/xmlconnect/
Disallow /index.php/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /catalog/product/gallery/
Disallow /catalogsearch/
Disallow /wishlist/
Disallow /productalert/
Disallow /customer/
Disallow /*?SID=
Disallow /*?*
Allow /*?p=2
Allow /*?p=3
Allow /*?p=4
Allow /*?p=5
Allow /*?p=6
Allow /*?p=7
Allow /*?p=8
Allow /*?-q3bqna
Disallow /index.php/
Disallow /api.php
Disallow /cron.php
Disallow /cron.sh
Disallow /callback_novalnet2magento.php
Disallow /index.php.sample
Disallow /install.php
Disallow /LICENSE.html
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /SetPrimaryCategory.php
Disallow /novalnet_css_link.php
Disallow /get.php
Disallow /*.sql$
Disallow /*.tgz$

httrack
yandex
exabot
gigabot
baiduspider
nutch
cityreview
webreaper
webcopier
offline explorer
microsoft.url.control
emailcollector
penthesilea
backlinkcrawler
sistrix
xovi
seokicks
searchmetricsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.tischkicker.de/sitemap-https-v14.xml

Comments

  • SITEMAP
  • PAGES
  • MAGENTO DIRECTORIES & FILES
  • Directories
  • Disallow: /404/
  • Disallow: /cgi-bin/
  • Disallow: /magento/
  • Disallow: /report/
  • Disallow: /stats/
  • Disallow: /media/attributes/
  • Disallow: /media/tmp/
  • Paths (clean URLs)
  • Do not crawl links with session IDs
  • Paths (no clean URLs)
  • Disallow: /*.js$
  • Disallow: /*.css$
  • Disallow: /*.php$
  • Disallow: /rss*
  • QUERY STRING BLOCKER
  • Uncomment if your site is a brand new un-cached site.
  • Do not crawl sub category pages that are sorted or filtered.
  • Do not crawl 2-nd home page copy (example.com/index.php/). Uncomment it only if you activated Magento SEO URLs.
  • Files
  • Disallow: /error_log
  • Do not crawl development files and folders: CVS, svn directories and dump files
  • ===================================
  • Schliesse folgende Spider komplett aus:
  • ===================================
  • archive.org
  • User-agent: ia_archiver