paardenkliniekdeveluwe.nl
robots.txt

Robots Exclusion Standard data for paardenkliniekdeveluwe.nl

Resource Scan

Scan Details

Site Domain paardenkliniekdeveluwe.nl
Base Domain paardenkliniekdeveluwe.nl
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-09-14T08:25:35+00:00
Next Scan 2024-09-28T08:25:35+00:00

Last Successful Scan

Scanned 2024-04-19T07:40:28+00:00
URL https://paardenkliniekdeveluwe.nl/robots.txt
Redirect https://www.paardenkliniekdeveluwe.nl/robots.txt
Redirect Domain www.paardenkliniekdeveluwe.nl
Redirect Base paardenkliniekdeveluwe.nl
Domain IPs 2a02:2968:1:0:1c00:d4ff:fe00:5bc, 62.84.246.226
Redirect IPs 2a02:2968:1:0:1c00:d4ff:fe00:5bc, 62.84.246.226
Response IP 62.84.246.226
Found Yes
Hash 53a6d3c9aa128116de8e4585e8119e50ffba3568506b14e93de0c9762f1ffcda
SimHash 76f041d9e2f8

Groups

*

Rule Path
Disallow /includes/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /themes/
Disallow /scripts/
Disallow sites/default/files/matomo/
Disallow /cron.php
Disallow /update.php
Disallow /install.php
Disallow /INSTALL.txt
Disallow /INSTALL.mysql.txt
Disallow /INSTALL.pgsql.txt
Disallow /CHANGELOG.txt
Disallow /MAINTAINERS.txt
Disallow /LICENSE.txt
Disallow /UPGRADE.txt
Disallow /xmlrpc.php
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /logout/
Disallow /contact/
Disallow /logout/
Disallow /node/add/
Disallow /search/
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /sites/all/libraries/
Disallow /?q=admin%2F
Disallow /?q=comment%2Freply%2F
Disallow /?q=filter%2Ftips%2F
Disallow /?q=logout%2F
Disallow /?q=contact%2F
Disallow /?q=logout%2F
Disallow /?q=node%2Fadd%2F
Disallow /?q=search%2F
Disallow /?q=user%2Fpassword
Disallow /?q=user%2Fregister
Disallow /?q=user%2Flogin
Disallow /?q=user%2Flogout
Disallow /?q=sites%2Fall%2Flibraries%2F
Disallow /files/
Disallow /images/
Disallow /imgs/
Disallow /js/
Disallow /email/
Disallow /taxonomy/
Disallow /tracker/
Disallow /feed/
Disallow /print/
Disallow /printmail/
Disallow /warning.html
Disallow /?q=files%2F
Disallow /?q=images%2F
Disallow /?q=imgs%2F
Disallow /?q=js%2F
Disallow /?q=email%2F
Disallow /?q=taxonomy%2F
Disallow /?q=tracker%2F
Disallow /?q=feed%2F
Disallow /?q=print%2F
Disallow /?q=printmail%2F
Disallow /?q=warning.html

Other Records

Field Value
crawl-delay 10
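
The effect of the `*` group above can be checked with Python's standard `urllib.robotparser`. This is a minimal sketch, not the site's actual file: `SAMPLE` is a hand-reconstructed subset of the listed rules (the original robots.txt text is not reproduced in this report), and `examplebot` is a hypothetical crawler name used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a small subset of the rules listed in this report, rewritten
# in robots.txt syntax. The real file is much longer.
SAMPLE = """\
User-agent: *
Crawl-delay: 10
Disallow: /admin/
Disallow: /search/
Disallow: /user/login

User-agent: petalbot
Disallow: /
"""

BASE = "https://www.paardenkliniekdeveluwe.nl"

parser = RobotFileParser()
parser.parse(SAMPLE.splitlines())

# "examplebot" is hypothetical; it has no group of its own, so it falls
# back to the "*" group.
print(parser.can_fetch("examplebot", BASE + "/admin/settings"))  # False
print(parser.can_fetch("examplebot", BASE + "/"))                # True
print(parser.crawl_delay("examplebot"))                          # 10

# petalbot has a dedicated group that blocks the whole site.
print(parser.can_fetch("petalbot", BASE + "/"))                  # False
```

Note that `urllib.robotparser` matches rules by plain path prefix and does not interpret `*` wildcards inside paths, so it is only suitable for prefix-style rules such as the ones in this group.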

bingbot
googlebot

Rule Path
Allow /*.js*
Allow /*.css*
Allow /*.png*
Allow /*.jpg*
Allow /*.jpeg*
Allow /*.gif*
Allow /*.webp*
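
The Allow rules in this group rely on the `*` wildcard, which RFC 9309 defines as matching any sequence of characters (with `$` marking the end of the path). The following is a rough sketch of that matching, written for illustration only; `pattern_to_regex` and `matches` are hypothetical helper names, and real crawlers may differ in details such as percent-encoding normalisation.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece and join the pieces with '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def matches(pattern: str, path: str) -> bool:
    """Return True if the URL path matches the pattern from its start."""
    return pattern_to_regex(pattern).match(path) is not None

print(matches("/*.js*", "/misc/drupal.js"))              # True
print(matches("/*.css*", "/themes/site/style.css?v=3"))  # True
print(matches("/*.png*", "/node/12"))                    # False
print(matches("/*.js*", "/data.json"))                   # True
```

The last call shows a side effect of the trailing wildcard: `/*.js*` also matches paths such as `/data.json`, because any characters may follow `.js`. A pattern ending in `$` (for example `/*.js$`) would restrict the match to paths that end in `.js`.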

petalbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yandexcalendar

Rule Path
Disallow /

yandexmobilebot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

mr.4x3 powered

Rule Path
Disallow /

adscanner

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

aranhabot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image
baiduspider-news
baiduspider-favo
baiduspider-ads
baiduspider-cpro

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

barkrowler/0.9

Rule Path
Disallow /

bidswitchbot

Rule Path
Disallow /

blekkobot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

boardreader

Rule Path
Disallow /

bubing

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot
ccbot/2.0

Rule Path
Disallow /

cincraw/1.0

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

cowbot/1.0

Rule Path
Disallow /

crawlson/1.0

Rule Path
Disallow /

curious george

Rule Path
Disallow /

curious george - www.analyticsseo.com

Rule Path
Disallow /

curious george - www.analyticsseo.com/crawler

Rule Path
Disallow /

dataforseobot/1.0

Rule Path
Disallow /

datanyze

Rule Path
Disallow /

dataprovider

Rule Path
Disallow /

daum

Rule Path
Disallow /

deskyobot

Rule Path
Disallow /

deskyobot/1.0

Rule Path
Disallow /

df bot 1.0

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

feedly

Rule Path
Disallow /

feedburner

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

filibot/1.0

Rule Path
Disallow /

genieo

Rule Path
Disallow /

genieo/1.0

Rule Path
Disallow /

getintent crawler

Rule Path
Disallow /

gluten free crawler/1.0

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

grouphigh

Rule Path
Disallow /

grouphigh/1.0

Rule Path
Disallow /

hybridbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ias_crawler

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

ioncrawl

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

istellabot/1.01.18

Rule Path
Disallow /

istellabot/1.01.18 +http://www.tiscali.it/

Rule Path
Disallow /

istellabot/1.10.2 +http://www.tiscali.it/

Rule Path
Disallow /

mozilla/5.0 (compatible; istellabot/1.01.18 +http://www.tiscali.it/)

Rule Path
Disallow /

james bot

Rule Path
Disallow /

leikibot

Rule Path
Disallow /

libcurl

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

lipperhey

Rule Path
Disallow /

livelap

Rule Path
Disallow /

lssrocket

Rule Path
Disallow /

ltx71
ltx71+-+(http://ltx71.com/)

Rule Path
Disallow /

magpie

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

mauibot (crawler.feedback+wc@gmail.com)

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

moget

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

mr.4x3 powered

Rule Path
Disallow /

netseer

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

obot/2.3.1

Rule Path
Disallow /

owler

Rule Path
Disallow /

pandalytics/1.0

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

pi-monster

Rule Path
Disallow /

pinterestbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

safednsbot

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

semantic-visions.com

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

siteauditbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

splitsignalbot

Rule Path
Disallow /

semrushbot-coub

Rule Path
Disallow /

serpstatbot/1.0

Rule Path
Disallow /

seobility

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

http://site.ru

Rule Path
Disallow /

site.ru

Rule Path
Disallow /

sjuupbot

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou web spider/4.0

Rule Path
Disallow /

sputnikbot

Rule Path
Disallow /

sputnik

Rule Path
Disallow /

sputnikbot/2.3

Rule Path
Disallow /

steeler

Rule Path
Disallow /

steeler/3.5

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

theoldreader.com

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

tracemyfile

Rule Path
Disallow /

tracemyfile/1.0

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

twingly recon-klondike/1.0

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

viglink

Rule Path
Disallow /

weborama-fetcher

Rule Path
Disallow /

woorankreview/2.0

Rule Path
Disallow /

xenu's link sleuth 1.1c

Rule Path
Disallow /

yak/1.0

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yeti-mobile

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

zgrab

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

Comments

  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • My additions
  • Directories
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • https://gist.github.com/demyanovs/e5a1ac424e62ce01641bcc95afe0564c

Warnings

  • 2 invalid lines.