fotografie.at
robots.txt

Robots Exclusion Standard data for fotografie.at

Archived Snapshots

Resource Scan

Scan Details

Site Domain	fotografie.at
Base Domain	fotografie.at
Scan Status	Ok
Last Scan	2024-09-30T17:30:29+00:00
Next Scan	2024-10-07T17:30:29+00:00

Last Scan

Scanned	2024-09-30T17:30:29+00:00
URL	https://fotografie.at/robots.txt
Domain IPs	185.51.10.86
Response IP	185.51.10.86
Found	Yes
Hash	6507424c7daeeaaa0039e6350c33b2a973edec2955156e4a71436209fd8b8d3b
SimHash	b2187149eff6

Groups

stress-agent

Rule	Path
Disallow	/

Rule

Path

Disallow

fast

Rule	Path
Disallow	/

Rule

Path

Disallow

scooter

Rule	Path
Disallow	/

Rule

Path

Disallow

sitecheck.internetseer.com

Rule	Path
Disallow	/

Rule

Path

Disallow

zealbot

Rule	Path
Disallow	/

Rule

Path

Disallow

msiecrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

sitesnagger

Rule	Path
Disallow	/

Rule

Path

Disallow

webstripper

Rule	Path
Disallow	/

Rule

Path

Disallow

webcopier

Rule	Path
Disallow	/

Rule

Path

Disallow

fetch

Rule	Path
Disallow	/

Rule

Path

Disallow

offline explorer

Rule	Path
Disallow	/

Rule

Path

Disallow

teleport

Rule	Path
Disallow	/

Rule

Path

Disallow

teleportpro

Rule	Path
Disallow	/

Rule

Path

Disallow

webzip

Rule	Path
Disallow	/

Rule

Path

Disallow

linko

Rule	Path
Disallow	/

Rule

Path

Disallow

httrack

Rule	Path
Disallow	/

Rule

Path

Disallow

microsoft.url.control

Rule	Path
Disallow	/

Rule

Path

Disallow

xenu

Rule	Path
Disallow	/

Rule

Path

Disallow

larbin

Rule	Path
Disallow	/

Rule

Path

Disallow

libwww

Rule	Path
Disallow	/

Rule

Path

Disallow

zyborg

Rule	Path
Disallow	/

Rule

Path

Disallow

download ninja

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot-sa

Rule	Path
Disallow	/

Rule

Path

Disallow

wget

Rule	Path
Disallow	/

Rule

Path

Disallow

grub-client

Rule	Path
Disallow	/

Rule

Path

Disallow

k2spider

Rule	Path
Disallow	/

Rule

Path

Disallow

npbot

Rule	Path
Disallow	/

Rule

Path

Disallow

webreaper

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

robots.txt zu http://www.fotografie.at/
these robots have been bad once:
Some bots are known to be trouble, particularly those designed to copy
entire sites. Please obey robots.txt.
Sorry, wget in its recursive mode is a frequent problem.
Please read the man page and use it properly; there is a
--wait option you can use to set the delay between hits,
for instance.
The 'grub' distributed client has been *very* poorly behaved.
Doesn't follow robots.txt anyway, but...
Hits many times per second, not acceptable
http://www.nameprotect.com/botinfo.html
A capture bot, downloads gazillions of pages with no public benefit
http://www.webreaper.net/

fotografie.atrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

stress-agent

fast

scooter

sitecheck.internetseer.com

zealbot

msiecrawler

sitesnagger

webstripper

webcopier

fetch

offline explorer

teleport

teleportpro

webzip

linko

httrack

microsoft.url.control

xenu

larbin

libwww

zyborg

download ninja

semrushbot

semrushbot-sa

wget

grub-client

k2spider

npbot

webreaper

Comments

fotografie.at
robots.txt