l1.nl
robots.txt

Robots Exclusion Standard data for l1.nl

Resource Scan

Scan Details

Site Domain l1.nl
Base Domain l1.nl
Scan Status Ok
Last Scan2024-06-15T09:13:52+00:00
Next Scan 2024-06-22T09:13:52+00:00

Last Scan

Scanned2024-06-15T09:13:52+00:00
URL https://l1.nl/robots.txt
Redirect https://www.l1.nl/robots.txt
Redirect Domain www.l1.nl
Redirect Base l1.nl
Domain IPs 93.119.12.159
Redirect IPs 13.226.225.120, 13.226.225.19, 13.226.225.77, 13.226.225.97, 2600:9000:21f8:2400:1a:f2b5:cb00:93a1, 2600:9000:21f8:7000:1a:f2b5:cb00:93a1, 2600:9000:21f8:a00:1a:f2b5:cb00:93a1, 2600:9000:21f8:ba00:1a:f2b5:cb00:93a1, 2600:9000:21f8:d400:1a:f2b5:cb00:93a1, 2600:9000:21f8:f800:1a:f2b5:cb00:93a1, 2600:9000:21f8:fa00:1a:f2b5:cb00:93a1, 2600:9000:21f8:fc00:1a:f2b5:cb00:93a1
Response IP 3.160.246.50
Found Yes
Hash df869afb20f1e477783b68dd7b226ce2760131ee758f7ec34529c3c7bbc8c08a
SimHash 64108b58a400

Groups

*

Rule Path
Allow /
Disallow /content/
Disallow /embedded/
Disallow /data/
Disallow /inc/
Disallow /*/-
Disallow /nos/
Disallow /zoeken?q=*

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.l1.nl/sitemap/sitemap.xml.gz