fd.nl
robots.txt

Robots Exclusion Standard data for fd.nl

Resource Scan

Scan Details

Site Domain fd.nl
Base Domain fd.nl
Scan Status Ok
Last Scan 2024-06-07T13:25:48+00:00
Next Scan 2024-06-14T13:25:48+00:00

Last Scan

Scanned 2024-06-07T13:25:48+00:00
URL https://fd.nl/robots.txt
Domain IPs 52.212.209.27, 54.72.108.29, 79.125.61.59
Response IP 54.72.108.29
Found Yes
Hash d4eb4906f20da82f964c7dfd966180deba3dbaac445e4a822ac2b9f339481a9f
SimHash 7231497923b3

Groups

*

Rule Path
Disallow *?view=*
Disallow *?filter=*
Disallow *?order=*
Disallow *?sortBy=*
Disallow /search?*
Disallow /mijn-nieuws
Disallow /binaries/*
Disallow /*.pdf$
Disallow /*?*login=
Disallow
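
The rules in the `*` group rely on the `*` wildcard and the `$` end anchor. As a rough illustration of how a crawler might evaluate them, the sketch below translates each pattern into a regular expression and tests a path against the list. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, the empty `Disallow` entry is treated as matching nothing, and `Allow` precedence is ignored because the group lists no `Allow` rules. It is a simplified sketch, not the matcher used by any particular crawler.

```python
import re

# Disallow patterns copied from the "*" group above; the empty entry is kept
# to show that an empty Disallow value blocks nothing.
RULES = [
    "*?view=*", "*?filter=*", "*?order=*", "*?sortBy=*",
    "/search?*", "/mijn-nieuws", "/binaries/*", "/*.pdf$", "/*?*login=", "",
]

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end of
    the path. Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns=RULES) -> bool:
    """Return True if any non-empty pattern matches from the start of path."""
    return any(rule_to_regex(p).match(path) for p in patterns if p)

print(is_disallowed("/search?q=rente"))   # query on /search
print(is_disallowed("/jaarverslag.pdf"))  # path ending in .pdf
print(is_disallowed("/beurs"))            # plain section path
```

Under these assumptions, `/search?q=rente` and `/jaarverslag.pdf` are blocked while `/beurs` is not; a URL such as `/jaarverslag.pdf?download=1` also falls outside the `/*.pdf$` rule because of the end anchor.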

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /
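
The per-agent groups above block the listed crawlers from the whole site, while every other agent falls back to the `*` group. A minimal sketch of that selection using Python's standard `urllib.robotparser` follows. The `ROBOTS_TXT` text is a reconstruction from the groups in this report, not the file as served, and the `*` group is reduced to a single prefix rule because `urllib.robotparser` does not interpret `*` or `$` wildcards. The agent name `ExampleBot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report (assumption: the served file may differ
# in order, casing, and formatting). The "*" group is simplified on purpose.
ROBOTS_TXT = """\
User-agent: *
Disallow: /mijn-nieuws

User-agent: ccbot
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: claude-web
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: trendictionbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

def allowed(agent: str, url: str) -> bool:
    """Return True if the parsed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)

# A named crawler is blocked everywhere; an unlisted agent only hits
# the rules of the "*" group.
print(allowed("GPTBot", "https://fd.nl/beurs"))
print(allowed("ExampleBot", "https://fd.nl/beurs"))
print(allowed("ExampleBot", "https://fd.nl/mijn-nieuws"))
```

Agent matching in `urllib.robotparser` is case-insensitive, so `GPTBot` selects the `gptbot` group.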

Other Records

Field Value
sitemap https://fd.nl/sitemap/sitemap_index.xml
sitemap https://fd.nl/sitemap/sitemap_google_news.xml
sitemap https://fd.nl/sitemap/sitemap-journalist.xml
sitemap https://fd.nl/sitemap/sitemap-tags.xml
sitemap https://fd.nl/sitemap/sitemap-channels.xml

Comments

  • all use of FD content is subject to the Terms & Conditions and Copyright Policy set out on FD.nl/copyright
  • scraping, harvesting or any (other) type of data- or text-mining of FD content is not allowed