bndestem.nl
robots.txt

Robots Exclusion Standard data for bndestem.nl

Resource Scan

Scan Details

Site Domain bndestem.nl
Base Domain bndestem.nl
Scan Status Ok
Last Scan2024-11-09T17:24:12+00:00
Next Scan 2024-11-16T17:24:12+00:00

Last Scan

Scanned2024-11-09T17:24:12+00:00
URL https://bndestem.nl/robots.txt
Redirect https://www.bndestem.nl/robots.txt
Redirect Domain www.bndestem.nl
Redirect Base bndestem.nl
Domain IPs 2600:1413:b000:6::17d5:2bcc, 2600:1413:b000:6::17d5:2bcd, 96.17.96.12, 96.17.96.28
Redirect IPs 2600:1413:b000:6::17d5:2bcc, 2600:1413:b000:6::17d5:2bcd, 96.17.96.12, 96.17.96.28
Response IP 23.209.46.154
Found Yes
Hash 728b69d01268e036293d4efc197b32f9537ad9dc2bb8577463a322270c7e453a
SimHash 69398b58ddf5

Groups

*

Rule Path
Disallow /*webview
Disallow /auth
Disallow /*widget*
Disallow /*?*otag=
Disallow /*?*abo_type=
Disallow /*?*utm_source=
Disallow /*?*currentArticleId=
Disallow /*?*articleUrl=
Disallow /zoeken?query=
Disallow /inloggen?*
Disallow /login?*
Disallow *~ab9e5892*
Disallow *~af7ac112*
Disallow *~a8eacee1*
Disallow *~aa82a19d*
Disallow *~a2575106*
Disallow *~ae3c4c34*
Disallow *?*redirect_url=*

twitterbot

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.bndestem.nl/sitemap.xml
sitemap https://www.bndestem.nl/sitemap-news.xml

Comments

  • Alle auteurs-, naburige en databankrechten die op de inhoud en opmaak van de DPG Media websites
  • en DPG Media apps rusten, worden door DPG Media BV uitdrukkelijk voorbehouden. De inhoud van de
  • DPG Media websites en apps is uitsluitend voor persoonlijk, niet-commercieel gebruik en het is
  • niet toegestaan om gegevens zoals tekst, afbeeldingen, audio, video of code van de websites of
  • de apps door middel van scraping (of een andere geautomatiseerde werkwijze) te vergaren.
  • Zie ook de Gebruikersvoorwaarden van DPG Media B.V. op www.dpgmedia.nl/gebruiksvoorwaarden
  • All copyrights, neighbouring rights and database rights in the content and layout of the
  • DPG Media websites and DPG Media apps are explicitly reserved by DPG Media BV. The content of
  • the DPG Media websites and DPG Media apps is for personal, non-commercial use only and it is not
  • allowed to collect data such as text, images, audio, video or code from the websites or from the
  • apps by means of scraping (or any other automated method).
  • See also the terms of use of DPG Media B.V. at www.dpgmedia.nl/gebruiksvoorwaarden
  • Tell robots which pages are not very interesting
  • Articles which should not be listed in google search index:
  • tu-e-zet-directeur-op-non-actief~ab9e5892/
  • Tell robots not to crawl redirect urls