veld-post.nl
robots.txt

Robots Exclusion Standard data for veld-post.nl

Resource Scan

Scan Details

Site Domain veld-post.nl
Base Domain veld-post.nl
Scan Status Ok
Last Scan 2024-09-24T12:33:04+00:00
Next Scan 2024-10-01T12:33:04+00:00

Last Scan

Scanned 2024-09-24T12:33:04+00:00
URL https://www.veld-post.nl/robots.txt
Domain IPs 81.18.172.205
Response IP 81.18.172.205
Found Yes
Hash 5a2cf5450b90885d55f4adc79e9d8dd8c72dfc152ff9527672fb80955b6769ca
SimHash 42080a423db2

Groups

adidxbot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
baiduspider
bingpreview
blexbot
careerbot
cliqzbot
dotbot
grapeshot
ichiro
icjobs
linkdexbot
magpie-crawler
megaindex
mj12bot
moget
naverbot
owlin
owlin bot
owlin bot v. 3.0
proximic
queryseekerspider
scrapy
scrapybot
semrush
semrushbot
sentibot
seokicks-robot
sogou
sogou spider
tkbot
trendkite-akashic-crawler
vagabondo
wbsearchbot
yandex
yandexbot
yeti
youdaobot

Rule Path
Disallow /
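
The effect of this group can be checked offline with Python's standard `urllib.robotparser`. This is a minimal sketch: the excerpt is rebuilt from the scan data above (only three of the listed agents are included, and the real file's layout may differ), and `examplebot` is a hypothetical agent that is not part of the group.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt rebuilt from the scan data; the actual file's
# ordering and formatting may differ.
EXCERPT = """\
User-agent: ahrefsbot
User-agent: mj12bot
User-agent: semrushbot
Disallow: /
"""


def is_allowed(user_agent, url):
    """Return True if the excerpt lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(EXCERPT.splitlines())  # parse in memory, no network access
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    site = "https://www.veld-post.nl/"
    print(is_allowed("ahrefsbot", site))   # agent listed in the group
    print(is_allowed("examplebot", site))  # hypothetical agent, not listed
```

With only this excerpt loaded, the listed agent is refused and the unlisted one is allowed, because no other group applies to it here.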

bingbot
msnbot
msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
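
A crawler could read this delay with `RobotFileParser.crawl_delay` from the standard library. The sketch below again uses an excerpt rebuilt from the scan data, not the actual file, and assumes the three agents share a single group.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt rebuilt from the scan data for this group.
EXCERPT = """\
User-agent: bingbot
User-agent: msnbot
User-agent: msnbot-media
Crawl-delay: 10
"""


def delay_for(user_agent):
    """Return the crawl delay in seconds for user_agent, or None if unset."""
    parser = RobotFileParser()
    parser.parse(EXCERPT.splitlines())
    return parser.crawl_delay(user_agent)


if __name__ == "__main__":
    print(delay_for("bingbot"))
```

`crawl-delay` is not part of RFC 9309, so whether a given crawler honors it is up to that crawler.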

*

Rule Path
Disallow /tiab/
Disallow /cancel-oidc/
Disallow /redirect/*
Disallow /zoek/*
Disallow /kennispartner/*/home/
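
Several of these paths use the `*` wildcard, which Python's `urllib.robotparser` treats as a literal character rather than a pattern. The sketch below is one way to match such rules, assuming RFC 9309 semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path). The function name `rule_matches` and the sample paths are hypothetical.

```python
import re


def rule_matches(pattern, path):
    """Return True if a robots.txt path pattern matches the start of path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal parts and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start, matching prefix semantics.
    return re.match(regex, path) is not None


if __name__ == "__main__":
    rules = ["/tiab/", "/cancel-oidc/", "/redirect/*", "/zoek/*",
             "/kennispartner/*/home/"]
    for path in ["/zoek/fiets", "/kennispartner/acme/home/", "/nieuws/"]:
        blocked = any(rule_matches(rule, path) for rule in rules)
        print(path, "blocked" if blocked else "allowed")
```

Under these assumptions a trailing `*`, as in `/zoek/*`, behaves the same as the bare prefix `/zoek/`, while the wildcard in the middle of `/kennispartner/*/home/` is what a plain prefix match cannot express.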

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.veld-post.nl/sitemap/

Comments

  • Horrible bandwidth eating robots
  • Other robots
  • User-agent: *
  • Disallow: /