production-guide-saarland.de
robots.txt

Robots Exclusion Standard data for production-guide-saarland.de

Resource Scan

Scan Details

Site Domain production-guide-saarland.de
Base Domain production-guide-saarland.de
Scan Status Ok
Last Scan2024-10-02T15:01:07+00:00
Next Scan 2024-11-01T15:01:07+00:00

Last Scan

Scanned2024-10-02T15:01:07+00:00
URL https://production-guide-saarland.de/robots.txt
Redirect https://www.production-guide-saarland.de/robots.txt
Redirect Domain www.production-guide-saarland.de
Redirect Base production-guide-saarland.de
Domain IPs 89.238.70.138
Redirect IPs 89.238.70.138
Response IP 89.238.70.138
Found Yes
Hash 6e7b15f3d6cb7f8d8cf8b4a14e03e6ff0abab1ed3f61a4d0b5dc2f964f62cd05
SimHash ba30151a4560

Groups

seokicks

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

majesticseo

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

xovi

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

search17

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

slysearch

Rule Path
Disallow /

findlinks

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

pixray-seeker

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

lb-spider

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

huaweisymantecspider

Rule Path
Disallow /

ec2linkfinder

Rule Path
Disallow /

htdig

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

discobot

Rule Path
Disallow /

linkdex.com

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

edisterbot

Rule Path
Disallow /

swebot

Rule Path
Disallow /

picmole

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yeti-mobile

Rule Path
Disallow /

pagepeeker

Rule Path
Disallow /

catchbot

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

netestatenecrawler

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

comodosslchecker

Rule Path
Disallow /

comodo-certificates-spider

Rule Path
Disallow /

gonzo

Rule Path
Disallow /

schrein

Rule Path
Disallow /

afiliaswebminingtool

Rule Path
Disallow /

suggybot

Rule Path
Disallow /

bdbrandprotect

Rule Path
Disallow /

bpimagewalker

Rule Path
Disallow /

updownerbot

Rule Path
Disallow /

lex

Rule Path
Disallow /

contentcrawler

Rule Path
Disallow /

dcpbot

Rule Path
Disallow /

kaloogabot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

icjobs

Rule Path
Disallow /

obot

Rule Path
Disallow /

webmastercoffee

Rule Path
Disallow /

qualidator

Rule Path
Disallow /

webinator

Rule Path
Disallow /

scooter

Rule Path
Disallow /

thunderstone

Rule Path
Disallow /

larbin

Rule Path
Disallow /

opidoobot

Rule Path
Disallow /

ips-agent

Rule Path
Disallow /

tineye

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

unister

Rule Path
Disallow /

reverseget

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html

Warnings

  • 2 invalid lines.