theflyingdragon.nl
robots.txt

Robots Exclusion Standard data for theflyingdragon.nl

Resource Scan

Scan Details

Site Domain theflyingdragon.nl
Base Domain theflyingdragon.nl
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-09-19T10:02:48+00:00
Next Scan 2024-10-03T10:02:48+00:00

Last Successful Scan

Scanned 2024-04-24T08:56:52+00:00
URL https://theflyingdragon.nl/robots.txt
Redirect https://www.theflyingdragon.nl/robots.txt
Redirect Domain www.theflyingdragon.nl
Redirect Base theflyingdragon.nl
Domain IPs 2a02:2968:1:0:1c00:d4ff:fe00:5bc, 62.84.246.226
Redirect IPs 2a02:2968:1:0:1c00:d4ff:fe00:5bc, 62.84.246.226
Response IP 62.84.246.226
Found Yes
Hash ed474844490db14a80ede49ed56dcc702e62fa67362df63a2387bc4bc5c3f465
SimHash 76e04dd1e0c8
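
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a change check between two scans might be sketched as follows. The function names and the sample bytes are hypothetical and are not taken from the scanner.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def body_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return fingerprint(body) != previous_hash.lower()


# Hypothetical sample body; the real file is much longer.
sample = b"User-agent: *\nDisallow: /admin/\n"
stored = fingerprint(sample)
```

An exact digest only says whether the bytes differ at all. The separate SimHash field suggests the scanner also tracks near-duplicate similarity, which this sketch does not attempt.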

Groups

*

Rule Path
Disallow /includes/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /themes/
Disallow /scripts/
Disallow /files/piwik/
Disallow /cron.php
Disallow /update.php
Disallow /install.php
Disallow /INSTALL.txt
Disallow /INSTALL.mysql.txt
Disallow /INSTALL.pgsql.txt
Disallow /CHANGELOG.txt
Disallow /MAINTAINERS.txt
Disallow /LICENSE.txt
Disallow /UPGRADE.txt
Disallow /xmlrpc.php
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /logout
Disallow /contact/
Disallow /node/add/
Disallow /search/
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /sites/all/
Disallow /?q=admin%2F
Disallow /?q=comment%2Freply%2F
Disallow /?q=filter%2Ftips%2F
Disallow /?q=logout%2F
Disallow /?q=contact%2F
Disallow /?q=logout%2F
Disallow /?q=node%2Fadd%2F
Disallow /?q=search%2F
Disallow /?q=user%2Fpassword
Disallow /?q=user%2Fregister
Disallow /?q=user%2Flogin
Disallow /?q=user%2Flogout
Disallow /?q=sites%2Fall%2F
Disallow /files/
Disallow /images/
Disallow /imgs/
Disallow /js/
Disallow /email/
Disallow /taxonomy/
Disallow /tracker/
Disallow /feed/
Disallow /print/
Disallow /printmail/
Disallow /page%3D*
Disallow /from%3D*
Disallow /sort%3D*
Disallow /size%3D*
Disallow /warning.html
Disallow /now/bugoff
Disallow /ads.txt
Disallow /?q=files%2F
Disallow /?q=images%2F
Disallow /?q=imgs%2F
Disallow /?q=js%2F
Disallow /?q=email%2F
Disallow /?q=taxonomy%2F
Disallow /?q=tracker%2F
Disallow /?q=feed%2F
Disallow /?q=print
Disallow /?q=printmail
Disallow /?q=page=*
Disallow /?q=from=*
Disallow /?q=sort=*
Disallow /?q=size=*
Disallow /?q=warning.html
Disallow /?q=now%2Fbugoff

Other Records

Field Value
crawl-delay 10
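
As a rough illustration of how a crawler might read the `*` group and its crawl-delay record, the sketch below feeds a short hand-written excerpt to Python's standard `urllib.robotparser`. The excerpt is reconstructed from a few rows of this report rather than copied from the original file, and the agent name `examplebot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt based on a few rows of the report above.
EXCERPT = """\
User-agent: *
Disallow: /admin/
Disallow: /cron.php
Crawl-delay: 10

User-agent: petalbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(EXCERPT.splitlines())

# "examplebot" has no group of its own, so it falls back to the "*" group.
delay = parser.crawl_delay("examplebot")
admin_allowed = parser.can_fetch("examplebot", "https://theflyingdragon.nl/admin/")
home_allowed = parser.can_fetch("examplebot", "https://theflyingdragon.nl/")

# petalbot has its own group that disallows everything.
petal_allowed = parser.can_fetch("petalbot", "https://theflyingdragon.nl/")
```

Crawlers are free to ignore crawl-delay, since it is not part of the core standard, so the value 10 is best read as a request rather than a guarantee.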

bingbot
googlebot

Rule Path
Allow /*.js$
Allow /*.js?
Allow /*.css$
Allow /*.css?
Allow /*.png
Allow /*.jpg
Allow /*.jpeg
Allow /*.webp
Allow /*.svg
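
The Allow rows above rely on the `*` wildcard and the `$` end anchor. One way to evaluate such patterns is to translate them into regular expressions. The sketch below is a minimal version that assumes `*` matches any run of characters, a trailing `$` anchors the end of the path, and everything else is a literal prefix match. The helper names are hypothetical.

```python
import re


def robots_pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with ".*" for "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def matches(pattern: str, path: str) -> bool:
    """Return True if the path (including any query string) matches the pattern."""
    return robots_pattern_to_regex(pattern).match(path) is not None


# Sample paths are made up for illustration.
js_exact = matches("/*.js$", "/js/app.js")
js_with_query_anchored = matches("/*.js$", "/js/app.js?v=2")
js_with_query = matches("/*.js?", "/js/app.js?v=2")
png_prefix = matches("/*.png", "/files/logo.png.bak")
```

This shows why the report lists both `/*.js$` and `/*.js?`: the anchored form stops matching once a query string is appended, and the second form covers that case. The unanchored image patterns such as `/*.png` match any path containing that extension, not only paths ending in it.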

petalbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yandexcalendar

Rule Path
Disallow /

yandexmobilebot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

mr.4x3 powered

Rule Path
Disallow /

adscanner

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

aranhabot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image
baiduspider-news
baiduspider-favo
baiduspider-ads
baiduspider-cpro

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

barkrowler/0.9

Rule Path
Disallow /

bidswitchbot

Rule Path
Disallow /

blekkobot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

boardreader

Rule Path
Disallow /

bubing

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot
ccbot/2.0

Rule Path
Disallow /

cincraw/1.0

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

cowbot/1.0

Rule Path
Disallow /

crawlson/1.0

Rule Path
Disallow /

curious george

Rule Path
Disallow /

curious george - www.analyticsseo.com

Rule Path
Disallow /

curious george - www.analyticsseo.com/crawler

Rule Path
Disallow /

dataforseobot/1.0

Rule Path
Disallow /

datanyze

Rule Path
Disallow /

dataprovider

Rule Path
Disallow /

daum

Rule Path
Disallow /

deskyobot

Rule Path
Disallow /

deskyobot/1.0

Rule Path
Disallow /

df bot 1.0

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

feedly

Rule Path
Disallow /

feedburner

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

filibot/1.0

Rule Path
Disallow /

genieo

Rule Path
Disallow /

genieo/1.0

Rule Path
Disallow /

getintent crawler

Rule Path
Disallow /

gluten free crawler/1.0

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

grouphigh

Rule Path
Disallow /

grouphigh/1.0

Rule Path
Disallow /

hybridbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ias_crawler

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

ioncrawl

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

istellabot/1.01.18

Rule Path
Disallow /

istellabot/1.01.18 +http://www.tiscali.it/

Rule Path
Disallow /

istellabot/1.10.2 +http://www.tiscali.it/

Rule Path
Disallow /

mozilla/5.0 (compatible; istellabot/1.01.18 +http://www.tiscali.it/)

Rule Path
Disallow /

james bot

Rule Path
Disallow /

leikibot

Rule Path
Disallow /

libcurl

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

lipperhey

Rule Path
Disallow /

livelap

Rule Path
Disallow /

lssrocket

Rule Path
Disallow /

ltx71
ltx71+-+(http://ltx71.com/)

Rule Path
Disallow /

magpie

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

mauibot (crawler.feedback+wc@gmail.com)

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

moget

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

mr.4x3 powered

Rule Path
Disallow /

netseer

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

obot/2.3.1

Rule Path
Disallow /

owler

Rule Path
Disallow /

pandalytics/1.0

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

pi-monster

Rule Path
Disallow /

pinterestbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

safednsbot

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

semantic-visions.com

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

siteauditbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

splitsignalbot

Rule Path
Disallow /

semrushbot-coub

Rule Path
Disallow /

serpstatbot/1.0

Rule Path
Disallow /

seobility

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

http://site.ru

Rule Path
Disallow /

site.ru

Rule Path
Disallow /

sjuupbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou web spider/4.0

Rule Path
Disallow /

sputnikbot

Rule Path
Disallow /

sputnik

Rule Path
Disallow /

sputnikbot/2.3

Rule Path
Disallow /

steeler

Rule Path
Disallow /

steeler/3.5

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

theoldreader.com

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

tracemyfile

Rule Path
Disallow /

tracemyfile/1.0

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

twingly recon-klondike/1.0

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

viglink

Rule Path
Disallow /

weborama-fetcher

Rule Path
Disallow /

woorankreview/2.0

Rule Path
Disallow /

xenu's link sleuth 1.1c

Rule Path
Disallow /

yak/1.0

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yeti-mobile

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

zgrab

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

Comments

  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • My additions
  • Directories
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • https://gist.github.com/demyanovs/e5a1ac424e62ce01641bcc95afe0564c

Warnings

  • 2 invalid lines.