Robots Exclusion Standard data for gld.nl

Resource Scan

Scan Details

Site Domain gld.nl
Base Domain gld.nl
Scan Status Ok
Last Scan 2024-05-18T01:57:27+00:00
Next Scan 2024-05-25T01:57:27+00:00

Last Scan

Scanned 2024-05-18T01:57:27+00:00
URL https://gld.nl/robots.txt
Redirect https://www.gld.nl/robots.txt
Redirect Domain www.gld.nl
Redirect Base gld.nl
Domain IPs 212.114.113.68
Redirect IPs 13.226.2.115, 13.226.2.60, 13.226.2.61, 13.226.2.62, 2600:9000:215a:1000:b:106a:1d80:93a1, 2600:9000:215a:4e00:b:106a:1d80:93a1, 2600:9000:215a:5800:b:106a:1d80:93a1, 2600:9000:215a:6e00:b:106a:1d80:93a1, 2600:9000:215a:7200:b:106a:1d80:93a1, 2600:9000:215a:8e00:b:106a:1d80:93a1, 2600:9000:215a:c800:b:106a:1d80:93a1, 2600:9000:215a:d000:b:106a:1d80:93a1
Response IP 18.165.171.125
Found Yes
Hash 9c5c83978aa43589686a8ea162556cb3e7ad32ceb6d27047c7386909d7010bc8
SimHash 64182950a400

Groups

*

Rule Path
Allow /
Disallow /content/
Disallow /embedded/
Disallow /data/
Disallow /inc/
Disallow /*/-
Disallow /nos/
Disallow /zoeken?q=*
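
The "*" group mixes a blanket Allow with several Disallow paths, two of which use wildcards. The following is a minimal sketch of how a crawler might evaluate these rules, assuming it follows the longest-match precedence and the `*` / `$` wildcard semantics of RFC 9309. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and used only for illustration; percent-encoding normalization is omitted, and individual crawlers may behave differently.

```python
import re

# Rules of the "*" group as listed above: (directive, path pattern).
RULES = [
    ("allow", "/"),
    ("disallow", "/content/"),
    ("disallow", "/embedded/"),
    ("disallow", "/data/"),
    ("disallow", "/inc/"),
    ("disallow", "/*/-"),
    ("disallow", "/nos/"),
    ("disallow", "/zoeken?q=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule is an Allow.

    A path that matches no rule at all is treated as allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # The longer pattern wins; on a tie, Allow wins.
            if length > best_len or (length == best_len and directive == "allow"):
                best_len = length
                allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for path in ["/", "/content/x", "/zoeken?q=test", "/zoeken", "/nieuws/-foo"]:
        print(path, is_allowed(path))
```

Under these assumptions, "Allow /" matches every path but has the shortest pattern, so any of the Disallow rules overrides it wherever they also match.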

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /
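
Each named group above carries a single "Disallow /" rule, so a crawler only has to work out which group applies to it. The sketch below illustrates that selection, assuming the crawler compares its product token case-insensitively against the group names and falls back to "*" when no named group matches. `BLOCKED_AGENTS`, `select_group`, and `is_fully_blocked` are hypothetical names, and real crawlers typically extract the product token from a longer User-Agent string first.

```python
# Product tokens that have their own "Disallow /" group in the listing above.
BLOCKED_AGENTS = {
    "amazonbot", "anthropic-ai", "awariorssbot", "awariosmartbot",
    "baiduspider", "bytespider", "ccbot", "chatgpt-user", "claudebot",
    "claude-web", "cohere-ai", "dataforseobot", "facebookbot",
    "google-extended", "gptbot", "magpie-crawler", "mj12bot", "omgili",
    "omgilibot", "peer39_crawler", "peer39_crawler/1.0", "perplexitybot",
    "sentibot",
}


def select_group(product_token):
    """Return the name of the group that applies to a crawler.

    Matching is case-insensitive; crawlers without a named group
    fall back to the catch-all "*" group.
    """
    token = product_token.strip().lower()
    return token if token in BLOCKED_AGENTS else "*"


def is_fully_blocked(product_token):
    """True if the crawler's own group disallows the whole site."""
    return select_group(product_token) != "*"


if __name__ == "__main__":
    for agent in ["GPTBot", "ClaudeBot", "Googlebot"]:
        print(agent, select_group(agent), is_fully_blocked(agent))
```

A crawler that lands in the "*" group is not unrestricted; it remains subject to the path rules listed for that group.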

Other Records

Field Value
sitemap https://www.gld.nl/sitemap/sitemap.xml.gz