l1nieuws.nl
robots.txt

Robots Exclusion Standard data for l1nieuws.nl

Resource Scan

Scan Details

Site Domain l1nieuws.nl
Base Domain l1nieuws.nl
Scan Status Ok
Last Scan 2024-09-26T07:02:41+00:00
Next Scan 2024-10-03T07:02:41+00:00

Last Scan

Scanned 2024-09-26T07:02:41+00:00
URL https://l1nieuws.nl/robots.txt
Redirect https://www.l1nieuws.nl/robots.txt
Redirect Domain www.l1nieuws.nl
Redirect Base l1nieuws.nl
Domain IPs 93.119.12.159
Redirect IPs 2600:9000:2022:3c00:1:87dc:ed00:93a1, 2600:9000:2022:5400:1:87dc:ed00:93a1, 2600:9000:2022:7400:1:87dc:ed00:93a1, 2600:9000:2022:8000:1:87dc:ed00:93a1, 2600:9000:2022:ca00:1:87dc:ed00:93a1, 2600:9000:2022:ce00:1:87dc:ed00:93a1, 2600:9000:2022:d800:1:87dc:ed00:93a1, 2600:9000:2022:de00:1:87dc:ed00:93a1, 54.230.112.111, 54.230.112.12, 54.230.112.63, 54.230.112.91
Response IP 52.85.49.126
Found Yes
Hash 004f81e929bcdd96ab8b6bd23b3cd67ca1b42d495daf3a963409023920352dba
SimHash 701351518503

Groups

*

Rule Path
Allow /
Disallow /content/
Disallow /embedded/
Disallow /data/
Disallow /inc/
Disallow /*/-
Disallow /nos/
Disallow /zoeken?q=*
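
How a crawler might evaluate the `*` group above can be sketched in a few lines. This is a minimal illustration, not the scanner's or any crawler's actual code. It assumes longest-match precedence with Allow winning ties, as described in RFC 9309, and treats `*` in a path as a wildcard. The names `STAR_GROUP`, `pattern_to_regex`, and `is_allowed` are hypothetical, as are the sample paths passed to them.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
STAR_GROUP = [
    ("allow", "/"),
    ("disallow", "/content/"),
    ("disallow", "/embedded/"),
    ("disallow", "/data/"),
    ("disallow", "/inc/"),
    ("disallow", "/*/-"),
    ("disallow", "/nos/"),
    ("disallow", "/zoeken?q=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally, starting at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins, Allow wins a tie,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


# Hypothetical paths, chosen only to exercise the rules.
print(is_allowed("/nieuws/artikel", STAR_GROUP))
print(is_allowed("/content/foo", STAR_GROUP))
print(is_allowed("/zoeken?q=maastricht", STAR_GROUP))
```

Under these assumptions, `Allow /` only acts as a fallback: any path that also matches one of the longer Disallow patterns is blocked. Parsers that apply rules in file order instead of by longest match could reach a different result for this group, because `Allow /` is listed first.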

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /
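
Which group applies to a given crawler can be sketched as a simple lookup. This is a simplified illustration under stated assumptions: it compares a crawler's product token case-insensitively against the group names and falls back to `*` when none matches. `GROUP_NAMES` holds only a subset of the groups listed above, and `select_group` is a hypothetical helper, not part of any library.

```python
# A subset of the group names listed above; each of these has "Disallow /".
GROUP_NAMES = {"gptbot", "claudebot", "ccbot", "baiduspider", "perplexitybot"}


def select_group(product_token, group_names):
    """Pick the group for a crawler's product token.

    Assumption: an exact, case-insensitive match on the token selects
    that group; otherwise the catch-all "*" group applies.
    """
    token = product_token.lower()
    return token if token in group_names else "*"


print(select_group("GPTBot", GROUP_NAMES))
print(select_group("Googlebot", GROUP_NAMES))
```

With this lookup, a crawler named in its own group is governed by that group's `Disallow /` alone, while any crawler not listed falls under the path rules of the `*` group.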

Other Records

Field Value
sitemap https://www.l1nieuws.nl/sitemap/sitemap.xml.gz