leasingkostencheck.de
robots.txt

Robots Exclusion Standard data for leasingkostencheck.de

Resource Scan

Scan Details

Site Domain leasingkostencheck.de
Base Domain leasingkostencheck.de
Scan Status Ok
Last Scan 2024-11-11T00:14:12+00:00
Next Scan 2024-11-18T00:14:12+00:00

Last Scan

Scanned 2024-11-11T00:14:12+00:00
URL https://leasingkostencheck.de/robots.txt
Redirect https://www.leasingkostencheck.de/robots.txt
Redirect Domain www.leasingkostencheck.de
Redirect Base leasingkostencheck.de
Domain IPs 46.163.78.70
Redirect IPs 46.163.78.70
Response IP 46.163.78.70
Found Yes
Hash 15ece85292d92afc19cf3170c6d9d3b0b7bafa041c6b6a0d20f9f6dfff850a0a
SimHash e6107159eef7
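
The Hash above is a 64-digit hex fingerprint of the fetched file; that length is consistent with SHA-256, though the scanner does not document its hashing scheme. A minimal Python sketch, assuming SHA-256 over the raw response body after the redirect to the www host:

    import hashlib
    import urllib.request

    # Fetch robots.txt; urlopen follows the 3xx redirect to the www host.
    with urllib.request.urlopen("https://leasingkostencheck.de/robots.txt") as resp:
        body = resp.read()
        final_url = resp.geturl()  # expect https://www.leasingkostencheck.de/robots.txt

    print("Final URL:", final_url)
    # Assumption: the scan's Hash field is SHA-256 of the raw body.
    print("SHA-256:", hashlib.sha256(body).hexdigest())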

Groups

*

Rule Path
Disallow /leasingangebot/
Disallow /zum-anbieter/
Disallow /incs/
Disallow /open_mailclient.php

mediapartners-google

Rule Path
Disallow (empty; an empty Disallow permits all paths)

sitecheck.internetseer.com

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /
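
Taken together, the groups above disallow four specific paths for all crawlers ("*"), grant mediapartners-google unrestricted access via its empty Disallow, and block some two dozen named site-copying and link-checking agents outright with "Disallow /". A short Python sketch, using the standard-library parser, to query these rules for a given agent and URL (the agent name "mybot" is illustrative; expected answers noted in comments):

    import urllib.robotparser

    rp = urllib.robotparser.RobotFileParser()
    rp.set_url("https://www.leasingkostencheck.de/robots.txt")
    rp.read()

    # The "*" group blocks only the four listed paths for generic crawlers.
    print(rp.can_fetch("mybot", "https://www.leasingkostencheck.de/leasingangebot/"))  # False
    print(rp.can_fetch("mybot", "https://www.leasingkostencheck.de/"))                 # True

    # Blanket "Disallow /" groups refuse the named agents everywhere.
    print(rp.can_fetch("petalbot", "https://www.leasingkostencheck.de/"))              # False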

Other Records

Field Value
sitemap https://www.leasingkostencheck.de/sitemap.xml
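
The sitemap record advertises the site's XML sitemap to crawlers. A minimal sketch for listing the URLs it contains, assuming a standard urlset document in the sitemaps.org 0.9 namespace:

    import urllib.request
    import xml.etree.ElementTree as ET

    NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}
    with urllib.request.urlopen("https://www.leasingkostencheck.de/sitemap.xml") as resp:
        root = ET.fromstring(resp.read())

    # Print every <loc> entry; works for a plain urlset, not a sitemap index.
    for loc in root.findall(".//sm:loc", NS):
        print(loc.text)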

Comments

  • Some bots are known to be trouble, particularly those designed to copy entire sites. Please obey robots.txt.
  • Sorry, wget in its recursive mode is a frequent problem. Please read the man page and use it properly; there is a --wait option you can use to set the delay between hits, for instance.
  • The 'grub' distributed client has been *very* poorly behaved. Doesn't follow robots.txt anyway, but... hits many times per second, not acceptable.
  • http://www.nameprotect.com/botinfo.html: a capture bot, downloads gazillions of pages with no public benefit.
  • http://www.webreaper.net/
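
The wget comment refers to real flags: --wait sets the delay in seconds between retrievals, and wget already honours robots.txt during recursive downloads by default. An invocation along the lines the site operator asks for (values illustrative):

    wget --recursive --level=2 --wait=2 https://www.leasingkostencheck.de/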