gld.nl
robots.txt

Robots Exclusion Standard data for gld.nl

Resource Scan

Scan Details

Site Domain gld.nl
Base Domain gld.nl
Scan Status Ok
Last Scan2024-11-16T17:44:22+00:00
Next Scan 2024-11-23T17:44:22+00:00

Last Scan

Scanned2024-11-16T17:44:22+00:00
URL https://gld.nl/robots.txt
Redirect https://www.gld.nl/robots.txt
Redirect Domain www.gld.nl
Redirect Base gld.nl
Domain IPs 85.10.128.132
Redirect IPs 216.137.52.106, 216.137.52.107, 216.137.52.29, 216.137.52.65, 2600:9000:2181:1800:b:106a:1d80:93a1, 2600:9000:2181:4a00:b:106a:1d80:93a1, 2600:9000:2181:5400:b:106a:1d80:93a1, 2600:9000:2181:7000:b:106a:1d80:93a1, 2600:9000:2181:9200:b:106a:1d80:93a1, 2600:9000:2181:a200:b:106a:1d80:93a1, 2600:9000:2181:e800:b:106a:1d80:93a1, 2600:9000:2181:ec00:b:106a:1d80:93a1
Response IP 18.165.122.18
Found Yes
Hash 064c399e2913e4a526ed71812f470476151d42b910c6ba821d8aa29dd8bd7817
SimHash 701309518402

Groups

*

Rule Path
Allow /
Disallow /content/
Disallow /embedded/
Disallow /data/
Disallow /inc/
Disallow /*/-
Disallow /nos/
Disallow /zoeken?q=*

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

duckassistbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gelderland.undefined.egeniq.regiogroei.nl/sitemap/sitemap.xml.gz