cardmat.net
robots.txt

Robots Exclusion Standard data for cardmat.net

Resource Scan

Scan Details

Site Domain cardmat.net
Base Domain cardmat.net
Scan Status Ok
Last Scan2024-05-10T16:26:46+00:00
Next Scan 2024-06-09T16:26:46+00:00

Last Scan

Scanned2024-05-10T16:26:46+00:00
URL https://cardmat.net/robots.txt
Domain IPs 81.19.159.64
Response IP 81.19.159.64
Found Yes
Hash 495853540f07cdfc6abc0e2769727a3a4c287da4652c93881c85a957c32ce282
SimHash 730b93ab4787

Groups

*

Rule Path
Allow /
Disallow /network/ajax/view/
Disallow /network/search/
Disallow /network/search?
Disallow /network/_graphics

*

Rule Path
Disallow /analytics
Disallow /cgi-bin
Disallow /netdata
Disallow /error
Disallow /class
Disallow /stats
Disallow /jobber

afilias web mining tool

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

aqua_products

Rule Path
Disallow /

asterias

Rule Path
Disallow /

b2w/0.1

Rule Path
Disallow /

backdoorbot/1.0

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

blowfish/1.0

Rule Path
Disallow /

bookmark search tool

Rule Path
Disallow /

botalot

Rule Path
Disallow /

bpimagewalker

Rule Path
Disallow /

bpimagewalker*

Rule Path
Disallow /

bdbrandprotect

Rule Path
Disallow /

birubot

Rule Path
Disallow /

bixolabs

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

botonparade

Rule Path
Disallow /

bubing

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

bullseye/1.0

Rule Path
Disallow /

bullseye

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

catchbot

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

cherrypickerse/1.0

Rule Path
Disallow /

cherrypickerelite/1.0

Rule Path
Disallow /

comodo ssl checker

Rule Path
Disallow /

comodo-certificates-spider

Rule Path
Disallow /

content crawler

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

crescent

Rule Path
Disallow /

dcpbot

Rule Path
Disallow /

discobot

Rule Path
Disallow /

dittospyder

Rule Path
Disallow /

ec2linkfinder

Rule Path
Disallow /

edisterbot

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

eurobot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

exdomain

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

fairad client

Rule Path
Disallow /

findfiles.net

Rule Path
Disallow /

findlinks

Rule Path
Disallow /

foobot

Rule Path
Disallow /

gaisbot

Rule Path
Disallow /

getright/4.2

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

gonzo

Rule Path
Disallow /

grub

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

harvest/1.5

Rule Path
Disallow /

hloader

Rule Path
Disallow /

htdig

Rule Path
Disallow /

httplib

Rule Path
Disallow /

huaweisymantecspider

Rule Path
Disallow /

humanlinks

Rule Path
Disallow /

ia_archiver/1.6

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

iccrawler - icjobs

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

icjobs

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

ips-agent

Rule Path
Disallow /

iron33/1.0.2

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

kaloogabot

Rule Path
Disallow /

kenjin spider

Rule Path
Disallow /

keyword density/0.9

Rule Path
Disallow /

larbin

Rule Path
Disallow /

lb-spider

Rule Path
Disallow /

lex

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

libweb/clshttp

Rule Path
Disallow /

linkdex.com

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linkscan/8.1a unix

Rule Path
Disallow /

linkscan

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

lnspiderguy

Rule Path
Disallow /

looksmart

Rule Path
Disallow /

lwp-trivial/1.34

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

mata hari

Rule Path
Disallow /

microsoft url control - 6.00.8169

Rule Path
Disallow /

microsoft url control - 5.01.4511

Rule Path
Disallow /

microsoft url control

Rule Path
Disallow /

miixpc/4.2

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

mister pix

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

moget/2.1

Rule Path
Disallow /

moget

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

netants

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

netmechanic

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

nutch

Rule Path
Disallow /

obot

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

oneriot

Rule Path
Disallow /

openbot

Rule Path
Disallow /

openfind data gathere

Rule Path
Disallow /

openfind

Rule Path
Disallow /

openindexspider

Rule Path
Disallow /

opidoobot

Rule Path
Disallow /

oracle ultra search

Rule Path
Disallow /

pagepeeker

Rule Path
Disallow /

perman

Rule Path
Disallow /

picmole

Rule Path
Disallow /

pixray-seeker

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

psbot

Rule Path
Disallow /

purebot

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

qualidator*

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

repomonkey bait & tackle/v1.01

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

reverseget

Rule Path
Disallow /

rma

Rule Path
Disallow /

schrein

Rule Path
Disallow /

scooter

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /

search17

Rule Path
Disallow /

searchpreview

Rule Path
Disallow /

sitedomain-bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

slysearch

Rule Path
Disallow /

spankbot

Rule Path
Disallow /

spanner

Rule Path
Disallow /

spbot

Rule Path
Disallow /

speedy

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /

suggybot

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

swebot

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

the intraformant

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

tineye

Rule Path
Disallow /

true_robot/1.0

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

turingos

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

unister*

Rule Path
Disallow /

unwindfetchor

Rule Path
Disallow /

updownerbot

Rule Path
Disallow /

url control

Rule Path
Disallow /

url_spider_pro

Rule Path
Disallow /

urly warning

Rule Path
Disallow /

vci webviewer vci webviewer win32

Rule Path
Disallow /

vci

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

web image collector

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

webinator

Rule Path
Disallow /

webmastercoffee

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webripper

Rule Path
Disallow /

websauger

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

website quester

Rule Path
Disallow /

webster pro

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip/4.0

Rule Path
Disallow /

webzip

Rule Path
Disallow /

weneobot

Rule Path
Disallow /

wget/1.6

Rule Path
Disallow /

wget/1.5.3

Rule Path
Disallow /

wget

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

xenu's link sleuth 1.1c

Rule Path
Disallow /

xenu's

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yeti-mobile

Rule Path
Disallow /

zeus 32297 webster pro v2.9 win32

Rule Path
Disallow /

zeus link scout

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zookabot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cardmat.net/sitemap.xml

Comments

  • info@cardmat.net
  • unerwünschte bots, die aber die robots.txt abfragen
  • Despictable and evil robots to keep out :)
  • User-agent: sistrix
  • Disallow: /
  • monitor:
  • "ssearch_bot (sSearch Crawler; http://www.semantissimo.de)"
  • "Mozilla/5.0 (compatible; Plukkie/1.4; http://www.botje.com/plukkie.htm)"
  • "Mozilla/5.0 (compatible; lemurwebcrawler admin@lemurproject.org; +http://boston.lti.cs.cmu.edu/crawler_12/)"
  • unerwünschte bots, die die robots.txt NICHT abfragen, gehören ggf. per Rewrite gesperrt:
  • "Mozilla/5.0+(compatible;+PiplBot;++http://www.pipl.com/bot/)" IGNORIERT ROBOTS.TXT
  • "Mozilla/5.0 (compatible; TweetmemeBot/2.11; +http://tweetmeme.com/)" IGNORIERT ROBOTS.TXT
  • in der Regel okay:
  • "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
  • "Googlebot-Image/1.0"
  • "Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)"
  • "Mozilla/5.0 (compatible; YandexImages/3.0; +http://yandex.com/bots)"
  • "Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)"
  • "Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)"
  • "Mozilla/5.0 (compatible; archive.org_bot +http://www.archive.org/details/archive.org_bot)"
  • "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)"
  • "msnbot-media/1.1 (+http://search.msn.com/msnbot.htm)"
  • "Mozilla/5.0 (compatible; OpenindexDeepSpider/Nutch-1.5-dev; +http://www.openindex.io/en/webmasters/spider.html)"
  • "CloudACL/Nutch-1.4"
  • "webcrawler (compatible; heritrix/1.14.4 ++http://www.onb.ac.at/about/webarchivierung.htm)"
  • "Mail.RU/2.0" (russ. Suchmaschine)
  • "Sosospider+(+http://help.soso.com/webspider.htm)" (chin. Suchmaschine)
  • "ia_archiver (+http://www.alexa.com/site/help/webmasters; crawler@alexa.com)" (hängt auch mit archive.org zusammen)
  • "Mozilla/5.0 (compatible; archive.org_bot +http://www.archive.org/details/archive.org_bot)"
  • "Mozilla/5.0 (compatible; ScoutJet; +http://www.scoutjet.com/)"
  • "Eurobot/1.1 (http://eurobot.ayell.eu)"
  • "Mozilla/5.0 (compatible; MSIE or Firefox mutant; not on Windows server; +http://ws.daum.net/aboutWebSearch.html) Daumoa/2.0" (koreanische Suchmaschine)
  • "Acoon v4.10.3 (www.acoon.de)"
  • "DoCoMo/2.0 P900i(c100;TB;W24H11) (compatible; ichiro/mobile goo; +http://search.goo.ne.jp/option/use/sub4/sub4-1/)" (jap. Suchmaschine)
  • "ichiro/3.0 (http://help.goo.ne.jp/help/article/1142)"
  • "frogl-bot (Version: 1.06, powered by www.frogl.de +http://www.frogl.de/pfadzurbotseite/bot.html)"
  • "Mozilla/5.0 (compatible; NerdByNature.Bot; http://www.nerdbynature.net/bot)"
  • "Agent-SharewarePlazaBot/3.0+(+http://www.SharewarePlaza.com)" IGNORIERT ROBOTS.TXT
  • "Wotbox/2.0 (bot@wotbox.com; http://www.wotbox.com)" IGNORIERT ROBOTS.TXT
  • "www.freefileszone.com PadPollbot/1.1b (+http://www.freefileszone.com/)" IGNORIERT ROBOTS.TXT
  • "Mozilla/5.0 (compatible; Sitedomain-Bot 1.0; Headers only; +http://www.sitedomain.de/sitedomain-bot/)" IGNORIERT ROBOTS.TXT - checkt auf gelöschte Domains - ruft nur Hauptseite auf
  • "emefgebot/beta (+http://emefge.de/bot.html)" IGNORIERT ROBOTS.TXT

Warnings

  • 4 invalid lines.