nhnieuws.nl
robots.txt

Robots Exclusion Standard data for nhnieuws.nl

Resource Scan

Scan Details

Site Domain nhnieuws.nl
Base Domain nhnieuws.nl
Scan Status Ok
Last Scan2024-11-09T16:39:28+00:00
Next Scan 2024-11-16T16:39:28+00:00

Last Scan

Scanned2024-11-09T16:39:28+00:00
URL https://nhnieuws.nl/robots.txt
Redirect https://www.nhnieuws.nl/robots.txt
Redirect Domain www.nhnieuws.nl
Redirect Base nhnieuws.nl
Domain IPs 104.22.16.152, 104.22.17.152, 172.67.6.69, 2606:4700:10::6816:1098, 2606:4700:10::6816:1198, 2606:4700:10::ac43:645
Redirect IPs 104.22.16.152, 104.22.17.152, 172.67.6.69, 2606:4700:10::6816:1098, 2606:4700:10::6816:1198, 2606:4700:10::ac43:645
Response IP 104.22.17.152
Found Yes
Hash cd417ef9ec5f47aed51dbe18d172b683839d08892c2d97ea4448137e3db8b867
SimHash 501e41e0e003

Groups

*

Rule Path
Disallow /api/
Disallow /app/
Disallow /zoek/
Disallow /_next/
Disallow /nieuws/n*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.nhnieuws.nl/sitemap-news.xml