vork.org
robots.txt

Robots Exclusion Standard data for vork.org

Resource Scan

Scan Details

Site Domain vork.org
Base Domain vork.org
Scan Status Ok
Last Scan2024-09-19T22:48:19+00:00
Next Scan 2024-09-26T22:48:19+00:00

Last Scan

Scanned2024-09-19T22:48:19+00:00
URL https://www.vork.org/robots.txt
Domain IPs 81.18.172.205
Response IP 81.18.172.205
Found Yes
Hash dc5022a111107a8e367cba7c288d5754c678b549e6919205196c157bafd679fb
SimHash 42080ac235b0

Groups

adidxbot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
baiduspider
bingpreview
blexbot
careerbot
cliqzbot
dotbot
grapeshot
ichiro
icjobs
linkdexbot
magpie-crawler
megaindex
mj12bot
moget
naverbot
owlin
owlin bot
owlin bot v. 3.0
proximic
queryseekerspider
scrapy
scrapybot
semrush
semrushbot
sentibot
seokicks-robot
sogou
sogou spider
tkbot
trendkite-akashic-crawler
vagabondo
wbsearchbot
yandex
yandexbot
yeti
youdaobot

Rule Path
Disallow /

bingbot
msnbot
msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /tiab/
Disallow /cancel-oidc/
Disallow /redirect/*
Disallow /zoek/*
Disallow /kennispartner/*/home/

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.vork.org/sitemap/

Comments

  • Horrible bandwidth eating robots
  • Other robots
  • User-agent: *
  • Disallow: /