gezondheidaanhuis.nl
robots.txt

Robots Exclusion Standard data for gezondheidaanhuis.nl

Resource Scan

Scan Details

Site Domain gezondheidaanhuis.nl
Base Domain gezondheidaanhuis.nl
Scan Status Ok
Last Scan2024-09-16T14:01:25+00:00
Next Scan 2024-09-30T14:01:25+00:00

Last Scan

Scanned2024-09-16T14:01:25+00:00
URL https://gezondheidaanhuis.nl/robots.txt
Domain IPs 185.173.21.126, 2a0b:3100:100:51::21:126
Response IP 185.173.21.126
Found Yes
Hash 56c2b6e28fc60939a12130fd731c831593d4ca8386525604e3167e18e38192e1
SimHash 032ded70a833

Groups

rogerbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

openlinkprofiler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

leipzig corpora collection

Rule Path
Disallow /

linguee

Rule Path
Disallow /

hubspot

Rule Path
Disallow /

quant

Rule Path
Disallow /

seoscanners

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

alphabot

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

magebee

Rule Path
Disallow /

domain reanimator

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

seoscanners.net/1

Rule Path
Disallow /

seoscanners.net

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.gezondheidaanhuis.nl/sitemap.xml