amazedmag.de
robots.txt

Robots Exclusion Standard data for amazedmag.de

Resource Scan

Scan Details

Site Domain amazedmag.de
Base Domain amazedmag.de
Scan Status Ok
Last Scan2024-09-29T16:46:44+00:00
Next Scan 2024-10-06T16:46:44+00:00

Last Scan

Scanned2024-09-29T16:46:44+00:00
URL https://amazedmag.de/robots.txt
Redirect https://www.amazedmag.de/robots.txt
Redirect Domain www.amazedmag.de
Redirect Base amazedmag.de
Domain IPs 85.13.154.104
Redirect IPs 85.13.154.104
Response IP 85.13.154.104
Found Yes
Hash 8f9ef9b96822d44ea349763540eacc91dcd0303e17b9bc8a780753bbb9d9e331
SimHash b227fd6a6160

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-includes/
Allow /wp-includes/js/
Allow /wp-includes/images/
Disallow /trackback/
Disallow /wp-login.php
Disallow /wp-register.php

Other Records

Field Value
crawl-delay 20

mj12bot

Rule Path
Disallow /

domainappender

Rule Path
Disallow /

mozilla/5.0 (compatible; domainappender /1.0; +http://www.profound.net/domainappender)

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

advbot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

publiclibraryarchive.org

Rule Path
Disallow /

memorybot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

abonti

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

mixbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

bubing

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

screenerbot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

bpimagewalker/2.0

Rule Path
Disallow /

lipperhey

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

feedbooster

Rule Path
Disallow /

nutch

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

spbot

Rule Path
Disallow /

exb language crawler

Rule Path
Disallow /

Comments

  • This virtual robots.txt file was created by the Virtual Robots.txt WordPress plugin: https://www.wordpress.org/plugins/pc-robotstxt/
  • Block MJ12bot as it is just noise
  • 2015.09.21 DomainAppender (SB)
  • 2015.09.21 Baiduspider (SB)
  • 2015.06.27 crawler for SentiOne
  • 2015.04.06 SEO indexer
  • 2015.02.10 AdvBot "classify web content"
  • 2015.01.30 XoviBot SEO bot
  • 2015.02.19 ??? parked domain
  • 2014.12.26. Internet Memory Research
  • 2014.09.26. SimilarTech, Lead Generation, Competitive Intelligence based on Web Tech Analysis
  • 2014.09.26. XOVI Suite, SEO & Online Marketing Tool
  • 2014.09.18. WebSearch
  • 2014.09.11. The web search API
  • entries without date
  • SEO services
  • panscient.com
  • tiscali.it search bot
  • search engine
  • search engine
  • Mixdata : data for big business
  • chinese search engine
  • chinese search engine
  • scalable, fully distributed crawler
  • ??? search engine
  • search engine
  • the Internet Archive's open-source, extensible, scalable, archival-quality Web crawler
  • kostenlose Backlinkchecker von Torsten Rückert Internetdiestleistungen
  • part of Ware Bay Best Buys Search engine
  • Web crawler
  • analyses the structure of the WWW
  • search engine
  • seo
  • brand protection
  • seo
  • seo
  • search engine
  • seo
  • plagiarism check
  • search engine www.sengine.info
  • news
  • Apache Nutch based
  • news portal
  • seo moz
  • seo
  • language