jjj.de
robots.txt

Robots Exclusion Standard data for jjj.de

Resource Scan

Scan Details

Site Domain jjj.de
Base Domain jjj.de
Scan Status Ok
Last Scan2025-10-18T10:17:37+00:00
Next Scan 2025-11-17T10:17:37+00:00

Last Scan

Scanned2025-10-18T10:17:37+00:00
URL https://jjj.de/robots.txt
Domain IPs 2a01:4f8:121:1254::2, 78.46.105.101
Response IP 78.46.105.101
Found Yes
Hash 5f864464ecc196657bfa43adebd465b9114e2dec94df8db1b475144b10b9842e
SimHash 343af6e21f03

Groups

*

Rule Path
Disallow /stupid-bot/

myonid

Rule Path
Disallow /

pipl

Rule Path
Disallow /

yasni

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

iccrawler - icjobs

Rule Path
Disallow /

acoon-robot

Rule Path
Disallow /

acoon

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

aboutusbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

ec2linkfinder

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

careerbot

Rule Path
Disallow /

Comments

  • robots.txt for www.jjj.de:
  • ill-behaved bots show up in access-log
  • (the directory /stupid-bot/ does not exist):
  • identity gatherer:
  • identity gatherer:
  • identity gatherer:
  • identity gatherer:
  • brandwatch.com:
  • icjobs.de:
  • acoon.de:
  • warebay.com:
  • http://www.picsearch.com/bot.html:
  • ill-behaved:
  • sistrix.net sistrix.de:
  • aboutus.org mediaways.net:
  • seokicks.de:
  • amazonaws.com:
  • https://ahrefs.com/robot/index.php
  • http://www.career-x.de/bot.html
  • :
  • User-agent: Presto
  • Disallow: /
  • :
  • User-agent:
  • Disallow: /

Warnings

  • 2 invalid lines.