gelderlander.nl
robots.txt

Robots Exclusion Standard data for gelderlander.nl

Resource Scan

Scan Details

Site Domain gelderlander.nl
Base Domain gelderlander.nl
Scan Status Ok
Last Scan2024-11-11T02:10:25+00:00
Next Scan 2024-11-18T02:10:25+00:00

Last Scan

Scanned2024-11-11T02:10:25+00:00
URL https://gelderlander.nl/robots.txt
Redirect https://www.gelderlander.nl/robots.txt
Redirect Domain www.gelderlander.nl
Redirect Base gelderlander.nl
Domain IPs 23.54.118.43, 23.54.118.52
Redirect IPs 2600:1413:b000:6::17d5:2bc5, 2600:1413:b000:6::17d5:2bd8, 96.17.96.18, 96.17.96.5
Response IP 23.44.4.138
Found Yes
Hash 8d44941e4ce3ba100708b326fe45368db09f3f67d27b99a380423c9c7fc53753
SimHash 61399b58dd75

Groups

*

Rule Path
Disallow /*webview
Disallow /auth
Disallow /*widget*
Disallow /*?*otag=
Disallow /*?*abo_type=
Disallow /*?*utm_source=
Disallow /*?*currentArticleId=
Disallow /*?*articleUrl=
Disallow /zoeken?query=
Disallow /inloggen?*
Disallow /login?*
Disallow *~ab9e5892*
Disallow *~af7ac112*
Disallow *~a2575106*
Disallow *?*redirect_url=*

twitterbot

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.gelderlander.nl/sitemap.xml
sitemap https://www.gelderlander.nl/sitemap-news.xml

Comments

  • Alle auteurs-, naburige en databankrechten die op de inhoud en opmaak van de DPG Media websites
  • en DPG Media apps rusten, worden door DPG Media BV uitdrukkelijk voorbehouden. De inhoud van de
  • DPG Media websites en apps is uitsluitend voor persoonlijk, niet-commercieel gebruik en het is
  • niet toegestaan om gegevens zoals tekst, afbeeldingen, audio, video of code van de websites of
  • de apps door middel van scraping (of een andere geautomatiseerde werkwijze) te vergaren.
  • Zie ook de Gebruikersvoorwaarden van DPG Media B.V. op www.dpgmedia.nl/gebruiksvoorwaarden
  • All copyrights, neighbouring rights and database rights in the content and layout of the
  • DPG Media websites and DPG Media apps are explicitly reserved by DPG Media BV. The content of
  • the DPG Media websites and DPG Media apps is for personal, non-commercial use only and it is not
  • allowed to collect data such as text, images, audio, video or code from the websites or from the
  • apps by means of scraping (or any other automated method).
  • See also the terms of use of DPG Media B.V. at www.dpgmedia.nl/gebruiksvoorwaarden
  • Tell robots which pages are not very interesting
  • Articles which should not be listed in google search index:
  • tu-e-zet-directeur-op-non-actief~ab9e5892/
  • Tell robots not to crawl redirect urls