korrekturlesen.org
robots.txt

Robots Exclusion Standard data for korrekturlesen.org

Resource Scan

Scan Details

Site Domain korrekturlesen.org
Base Domain korrekturlesen.org
Scan Status Ok
Last Scan2025-11-30T21:52:34+00:00
Next Scan 2025-12-07T21:52:34+00:00

Last Scan

Scanned2025-11-30T21:52:34+00:00
URL http://korrekturlesen.org/robots.txt
Domain IPs 92.51.162.160
Response IP 92.51.162.160
Found Yes
Hash 77fbb0b76484b7ef9b162ee6d6476cde1eba4f0a9c777009450bc32ad084907c
SimHash 943de6685fbb

Groups

*

Rule Path
Disallow /images/
Disallow /plesk-stat/
Disallow /impressum.php
Disallow /widerruf.php
Disallow /hinweis-ie.php
Disallow /agb.php
Disallow /links.php

ia_archiver

Rule Path
Disallow /

wget
webzip
webmirror
webcopy

Rule Path
Disallow /

iccrawler - icjobs
turnitinbot
wotbox
acoon
grapeshot
sistrix
proximic
careerbot
baiduspider
baiduspider-image
mj12bot
wbsearchbot
dotbot
obot
ahrefsbot
blexbot
loadtimebot
archive-org.com

Rule Path
Disallow /

Comments

  • archive.org sperren
  • Downloader
  • weitere Bots

Warnings

  • 1 invalid line.