janwuts.nl
robots.txt

Robots Exclusion Standard data for janwuts.nl

Resource Scan

Scan Details

Site Domain janwuts.nl
Base Domain janwuts.nl
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-09-12T17:01:03+00:00
Next Scan 2025-12-11T17:01:03+00:00

Last Successful Scan

Scanned2023-08-18T16:22:22+00:00
URL https://www.janwuts.nl/robots.txt
Domain IPs 149.210.182.34
Response IP 149.210.182.34
Found Yes
Hash b40eb06f1a91694c05b4e34263a821be95c87c29fcd864a99f8e2a3931704a05
SimHash 037d17c2e623

Groups

*

Rule Path
Disallow /*suggestie%3D1

*

Rule Path
Disallow /help/
Disallow /cgi/comments.cgi
Disallow /cgi/ecard.cgi
Disallow /cgi/formmail
Disallow /cgi/login.cgi
Disallow /cgi/register.cgi
Disallow /cgi/search.cgi
Disallow */empty
Disallow *.cgi/filter/
Disallow *.cgi/option/
Disallow /*xml%3D1

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

googlebot-image

Rule Path
Disallow /*static-storage/

webcrawler

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

kingkevinbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Other Records

Field Value
sitemap http://voorraad.autodatawheelerdelta.nl/voorraadlijst/sitemap.xml
sitemap http://voorraad.autodatawheelerdelta.nl/voorraadlijst/sitemap_occasions.xml

Comments

  • robots.txt for voorraadlijst (voorraad.autodatawheelerdelta.nl)
  • autodiscovery sitemaps
  • ongebruikelijke robots