pigbusiness.nl
robots.txt

Robots Exclusion Standard data for pigbusiness.nl

Resource Scan

Scan Details

Site Domain pigbusiness.nl
Base Domain pigbusiness.nl
Scan Status Ok
Last Scan2024-11-10T06:02:46+00:00
Next Scan 2024-11-17T06:02:46+00:00

Last Scan

Scanned2024-11-10T06:02:46+00:00
URL https://pigbusiness.nl/robots.txt
Redirect https://www.pigbusiness.nl/robots.txt
Redirect Domain www.pigbusiness.nl
Redirect Base pigbusiness.nl
Domain IPs 81.18.172.205
Redirect IPs 81.18.172.205
Response IP 81.18.172.205
Found Yes
Hash 9f6cbc166d3ea14b35ee7e769342eed7c752965bd9e1cfa04020f7766bc3e4fd
SimHash 42080a4235b0

Groups

adidxbot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
baiduspider
bingpreview
blexbot
careerbot
cliqzbot
dotbot
grapeshot
ichiro
icjobs
linkdexbot
magpie-crawler
megaindex
mj12bot
moget
naverbot
owlin
owlin bot
owlin bot v. 3.0
proximic
queryseekerspider
scrapy
scrapybot
semrush
semrushbot
sentibot
seokicks-robot
sogou
sogou spider
tkbot
trendkite-akashic-crawler
vagabondo
wbsearchbot
yandex
yandexbot
yeti
youdaobot

Rule Path
Disallow /

bingbot
msnbot
msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /tiab/
Disallow /cancel-oidc/
Disallow /redirect/*
Disallow /zoek/*
Disallow /kennispartner/*/home/

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.pigbusiness.nl/sitemap/

Comments

  • Horrible bandwidth eating robots
  • Other robots
  • User-agent: *
  • Disallow: