trekkerweb.nl
robots.txt

Robots Exclusion Standard data for trekkerweb.nl

Resource Scan

Scan Details

Site Domain trekkerweb.nl
Base Domain trekkerweb.nl
Scan Status Ok
Last Scan 2024-11-11T06:27:29+00:00
Next Scan 2024-11-18T06:27:29+00:00

Last Scan

Scanned 2024-11-11T06:27:29+00:00
URL https://trekkerweb.nl/robots.txt
Redirect https://www.trekkerweb.nl/robots.txt
Redirect Domain www.trekkerweb.nl
Redirect Base trekkerweb.nl
Domain IPs 81.18.172.205
Redirect IPs 81.18.172.205
Response IP 81.18.172.205
Found Yes
Hash 0473245aae87ed324d3d0b5f5645c44d4dd60f99506c425a34bcebc6e3b085cb
SimHash 420803c23fb0

Groups

adidxbot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
amazonbot
baiduspider
bingpreview
blexbot
careerbot
cliqzbot
dotbot
grapeshot
ichiro
icjobs
linkdexbot
magpie-crawler
megaindex
mj12bot
moget
naverbot
owlin
owlin bot
owlin bot v. 3.0
proximic
queryseekerspider
scrapy
scrapybot
semrush
semrushbot
sentibot
seokicks-robot
sogou
sogou spider
tkbot
trendkite-akashic-crawler
vagabondo
wbsearchbot
yandex
yandexbot
yeti
youdaobot

Rule Path
Disallow /

bingbot
msnbot
msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
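
The two groups above can be checked with Python's standard-library urllib.robotparser. This is a minimal sketch, not the site's actual file: ROBOTS_FRAGMENT is an assumed reconstruction from the scan data, it lists only two of the blocked agents, and the real robots.txt may order or group its records differently.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction based only on the scan data above; only two of the
# blocked user agents are listed to keep the example short.
ROBOTS_FRAGMENT = """\
User-agent: ahrefsbot
User-agent: semrushbot
Disallow: /

User-agent: bingbot
User-agent: msnbot
User-agent: msnbot-media
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

# A blocked agent may not fetch anything under "/".
blocked = parser.can_fetch("ahrefsbot", "https://www.trekkerweb.nl/")

# The bingbot group has no Disallow rules, so every path is allowed,
# but the group carries a crawl delay of 10 seconds.
allowed = parser.can_fetch("bingbot", "https://www.trekkerweb.nl/")
delay = parser.crawl_delay("bingbot")

print(blocked, allowed, delay)
```

Here a crawler identifying as bingbot would be expected to wait 10 seconds between requests, while the blocked agents are refused every path.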

*

Rule Path
Disallow /tiab/
Disallow /cancel-oidc/
Disallow /redirect/*
Disallow /zoek/*
Disallow /merkpartner/*/home/
Disallow /assets/components/*
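
Several of the rules above use the * wildcard, which Python's urllib.robotparser does not interpret as a wildcard. The sketch below shows one simplified way to match such patterns, loosely following the wildcard semantics described in RFC 9309. The helper names (pattern_to_regex, is_disallowed) and the sample paths are hypothetical, and Allow rules and longest-match precedence are deliberately left out.

```python
import re

# Disallow patterns for the "*" group, copied from the scan data above.
DISALLOW_PATTERNS = [
    "/tiab/",
    "/cancel-oidc/",
    "/redirect/*",
    "/zoek/*",
    "/merkpartner/*/home/",
    "/assets/components/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW_PATTERNS)


# Hypothetical paths, used only to illustrate the matching.
print(is_disallowed("/merkpartner/acme/home/"))  # wildcard in the middle
print(is_disallowed("/merkpartner/acme/"))       # no "/home/" segment
print(is_disallowed("/zoek/trekker"))            # under /zoek/
print(is_disallowed("/nieuws/"))                 # not listed at all
```

Because matching is prefix-based, the trailing * on rules such as /zoek/* does not change the result; it is the * in the middle of /merkpartner/*/home/ that needs wildcard support.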

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.trekkerweb.nl/sitemap/
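
The sitemap record is independent of any user-agent group. As a small sketch, it can be read back with urllib.robotparser (site_maps() requires Python 3.8 or later); the one-line input here is an assumed reconstruction of that record, not the full file.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the sitemap record from the scan data above.
SITEMAP_RECORD = "Sitemap: https://www.trekkerweb.nl/sitemap/"

sitemap_parser = RobotFileParser()
sitemap_parser.parse([SITEMAP_RECORD])

# site_maps() returns a list of sitemap URLs, or None if none were declared.
sitemaps = sitemap_parser.site_maps()
print(sitemaps)
```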

Comments

  • Horrible bandwidth eating robots
  • Other robots
  • User-agent: *
  • Disallow: /