main-angler.de
robots.txt

Robots Exclusion Standard data for main-angler.de

Archived Snapshots

Resource Scan

Scan Details

Site Domain	main-angler.de
Base Domain	main-angler.de
Scan Status	Ok
Last Scan	2024-11-05T08:45:14+00:00
Next Scan	2024-11-19T08:45:14+00:00

Last Scan

Scanned	2024-11-05T08:45:14+00:00
URL	https://main-angler.de/robots.txt
Redirect	https://www.main-angler.de/robots.txt
Redirect Domain	www.main-angler.de
Redirect Base	main-angler.de
Domain IPs	138.201.121.237
Redirect IPs	138.201.121.237
Response IP	138.201.121.237
Found	Yes
Hash	bd070e28d3125ecb0aaa79e71d5817e4ac6ce5f244c0d750ca284f791b241413
SimHash	aa3a15584eeb

Groups

*

Rule	Path
Disallow	/administrator/
Disallow	/cache/
Disallow	/cli/
Disallow	/components/
Disallow	/images/
Disallow	/includes/
Disallow	/installation/
Disallow	/language/
Disallow	/libraries/
Disallow	/logs/
Disallow	/media/
Disallow	/modules/
Disallow	/plugins/
Disallow	/templates/
Disallow	/tmp/
Disallow	/http-bind/

Rule

Path

Disallow

/administrator/

Disallow

/cache/

Disallow

/cli/

Disallow

/components/

Disallow

/images/

Disallow

/includes/

Disallow

/installation/

Disallow

/language/

Disallow

/libraries/

Disallow

/logs/

Disallow

/media/

Disallow

/modules/

Disallow

/plugins/

Disallow

/templates/

Disallow

/tmp/

Disallow

/http-bind/

seokicks

Rule	Path
Disallow	/

Rule

Path

Disallow

seokicks-robot

Rule	Path
Disallow	/

Rule

Path

Disallow

sistrix

Rule	Path
Disallow	/

Rule

Path

Disallow

majesticseo

Rule	Path
Disallow	/

Rule

Path

Disallow

backlinkcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

xovi

Rule	Path
Disallow	/

Rule

Path

Disallow

xovibot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

spbot

Rule	Path
Disallow	/

Rule

Path

Disallow

searchmetricsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

search17

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ia_archiver

Rule	Path
Disallow	/

Rule

Path

Disallow

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

slysearch

Rule	Path
Disallow	/

Rule

Path

Disallow

findlinks

Rule	Path
Disallow	/

Rule

Path

Disallow

magpie-crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

pixray-seeker

Rule	Path
Disallow	/

Rule

Path

Disallow

ezooms

Rule	Path
Disallow	/

Rule

Path

Disallow

lb-spider

Rule	Path
Disallow	/

Rule

Path

Disallow

wbsearchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

psbot

Rule	Path
Disallow	/

Rule

Path

Disallow

huaweisymantecspider

Rule	Path
Disallow	/

Rule

Path

Disallow

ec2linkfinder

Rule	Path
Disallow	/

Rule

Path

Disallow

htdig

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

discobot

Rule	Path
Disallow	/

Rule

Path

Disallow

linkdex.com

Rule	Path
Disallow	/

Rule

Path

Disallow

seznambot

Rule	Path
Disallow	/

Rule

Path

Disallow

edisterbot

Rule	Path
Disallow	/

Rule

Path

Disallow

swebot

Rule	Path
Disallow	/

Rule

Path

Disallow

picmole

Rule	Path
Disallow	/

Rule

Path

Disallow

yeti

Rule	Path
Disallow	/

Rule

Path

Disallow

yeti-mobile

Rule	Path
Disallow	/

Rule

Path

Disallow

pagepeeker

Rule	Path
Disallow	/

Rule

Path

Disallow

catchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

yacybot

Rule	Path
Disallow	/

Rule

Path

Disallow

netestatenecrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

surveybot

Rule	Path
Disallow	/

Rule

Path

Disallow

comodosslchecker

Rule	Path
Disallow	/

Rule

Path

Disallow

comodo-certificates-spider

Rule	Path
Disallow	/

Rule

Path

Disallow

gonzo

Rule	Path
Disallow	/

Rule

Path

Disallow

schrein

Rule	Path
Disallow	/

Rule

Path

Disallow

afiliaswebminingtool

Rule	Path
Disallow	/

Rule

Path

Disallow

suggybot

Rule	Path
Disallow	/

Rule

Path

Disallow

bdbrandprotect

Rule	Path
Disallow	/

Rule

Path

Disallow

bpimagewalker

Rule	Path
Disallow	/

Rule

Path

Disallow

updownerbot

Rule

Path

Disallow

lex

Rule

Path

Disallow

contentcrawler

Rule

Path

Disallow

dcpbot

Rule

Path

Disallow

kaloogabot

Rule

Path

Disallow

mlbot

Rule

Path

Disallow

icjobs

Rule

Path

Disallow

obot

Rule

Path

Disallow

webmastercoffee

Rule

Path

Disallow

qualidator

Rule

Path

Disallow

webinator

Rule

Path

Disallow

thunderstone

Rule

Path

Disallow

larbin

Rule

Path

Disallow

opidoobot

Rule

Path

Disallow

ips-agent

Rule

Path

Disallow

tineye

Rule

Path

Disallow

unisterbot

Rule

Path

Disallow

unister

Rule

Path

Disallow

reverseget

Rule

Path

Disallow

dotbot

Rule

Path

Disallow

Comments

If the Joomla site is installed within a folder such as at
e.g. www.example.com/joomla/ the robots.txt file MUST be
moved to the site root at e.g. www.example.com/robots.txt
AND the joomla folder name MUST be prefixed to the disallowed
path, e.g. the Disallow rule for the /administrator/ folder
MUST be changed to read Disallow: /joomla/administrator/
For more information about the robots.txt standard, see:
http://www.robotstxt.org/orig.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html

Warnings

2 invalid lines.

main-angler.derobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

seokicks

seokicks-robot

sistrix

majesticseo

backlinkcrawler

xovi

xovibot

mj12bot

spbot

searchmetricsbot

search17

ahrefsbot

ia_archiver

turnitinbot

slysearch

findlinks

magpie-crawler

pixray-seeker

ezooms

lb-spider

wbsearchbot

psbot

huaweisymantecspider

ec2linkfinder

htdig

semrushbot

discobot

linkdex.com

seznambot

edisterbot

swebot

picmole

yeti

yeti-mobile

pagepeeker

catchbot

yacybot

netestatenecrawler

surveybot

comodosslchecker

comodo-certificates-spider

gonzo

schrein

afiliaswebminingtool

suggybot

bdbrandprotect

bpimagewalker

updownerbot

lex

contentcrawler

dcpbot

kaloogabot

mlbot

icjobs

obot

webmastercoffee

qualidator

webinator

thunderstone

larbin

opidoobot

ips-agent

tineye

unisterbot

unister

reverseget

dotbot

Comments

Warnings

main-angler.de
robots.txt