pluimveeweb.nl
robots.txt

Robots Exclusion Standard data for pluimveeweb.nl

Resource Scan

Scan Details

Site Domain pluimveeweb.nl
Base Domain pluimveeweb.nl
Scan Status Ok
Last Scan 2024-11-16T10:42:32+00:00
Next Scan 2024-11-23T10:42:32+00:00

Last Scan

Scanned 2024-11-16T10:42:32+00:00
URL https://pluimveeweb.nl/robots.txt
Redirect https://www.pluimveeweb.nl/robots.txt
Redirect Domain www.pluimveeweb.nl
Redirect Base pluimveeweb.nl
Domain IPs 81.18.172.205
Redirect IPs 81.18.172.205
Response IP 81.18.172.205
Found Yes
Hash 692455d767a96e94ff9f5ba0e12acf58c005ea9c1245fb6f322ac7ecdcc45412
SimHash 42280a423db0

Groups

adidxbot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
baiduspider
bingpreview
blexbot
careerbot
cliqzbot
dotbot
grapeshot
ichiro
icjobs
linkdexbot
magpie-crawler
megaindex
mj12bot
moget
naverbot
owlin
owlin bot
owlin bot v. 3.0
proximic
queryseekerspider
scrapy
scrapybot
semrush
semrushbot
sentibot
seokicks-robot
sogou
sogou spider
tkbot
trendkite-akashic-crawler
vagabondo
wbsearchbot
yandex
yandexbot
yeti
youdaobot

Rule Path
Disallow /

bingbot
msnbot
msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
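
The two groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch, not the scanner's own method. ROBOTS_SNIPPET is an assumed reconstruction of a small part of the file (three of the blocked agents plus bingbot), since the report lists parsed groups rather than the raw text, and the variable names are illustrative.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the file, based on the groups listed above.
# The real robots.txt names many more agents in the first group.
ROBOTS_SNIPPET = """\
User-agent: ahrefsbot
User-agent: mj12bot
User-agent: semrushbot
Disallow: /

User-agent: bingbot
Crawl-delay: 10
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_SNIPPET)
home = "https://www.pluimveeweb.nl/"

# Agents in the first group are blocked from every path.
ahrefs_ok = parser.can_fetch("AhrefsBot", home)

# bingbot has no Disallow rules, only a crawl delay.
bing_ok = parser.can_fetch("bingbot", home)
bing_delay = parser.crawl_delay("bingbot")

print(ahrefs_ok, bing_ok, bing_delay)
```

Agent names are compared case-insensitively by the parser, which is why "AhrefsBot" matches the lowercase group entry.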

*

Rule Path
Disallow /tiab/
Disallow /cancel-oidc/
Disallow /redirect/*
Disallow /zoek/*
Disallow /kennispartner/*/home/
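
Several of the rules above contain the * wildcard, which urllib.robotparser compares as a literal prefix rather than expanding. The sketch below shows one way to evaluate such patterns, assuming the common convention that * matches any run of characters and a trailing $ anchors the end of the path. The helper names and the sample paths are hypothetical. It covers only Disallow rules, which is all this group contains.

```python
import re

# Disallow rules for the "*" group, as listed in the scan above.
RULES = [
    "/tiab/",
    "/cancel-oidc/",
    "/redirect/*",
    "/zoek/*",
    "/kennispartner/*/home/",
]


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches the start of path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the beginning, giving prefix semantics.
    return re.match(regex, path) is not None


def is_disallowed(path: str) -> bool:
    """Check a URL path against every Disallow rule in the group."""
    return any(rule_matches(rule, path) for rule in RULES)


# Hypothetical sample paths, for illustration only.
for sample in ["/zoek/?q=kip", "/kennispartner/acme/home/", "/kennispartner/acme/", "/nieuws/"]:
    print(sample, is_disallowed(sample))
```

Only the wildcard rule with a middle segment, /kennispartner/*/home/, behaves differently from a plain prefix check. The trailing * on /redirect/* and /zoek/* adds nothing beyond the prefix itself.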

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.pluimveeweb.nl/sitemap/

Comments

  • Horrible bandwidth eating robots
  • Other robots
  • User-agent: *
  • Disallow: /