ndla.no
robots.txt

Robots Exclusion Standard data for ndla.no

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ndla.no
Base Domain	ndla.no
Scan Status	Ok
Last Scan	2026-01-15T18:41:59+00:00
Next Scan	2026-01-29T18:41:59+00:00

Last Scan

Scanned	2026-01-15T18:41:59+00:00
URL	https://ndla.no/robots.txt
Domain IPs	108.157.188.25, 108.157.188.36, 108.157.188.58, 108.157.188.7
Response IP	13.226.2.84
Found	Yes
Hash	a7535697448912882d84f508848becf0e571f070a022032d990c3ea66efc40a7
SimHash	44098f40a888

Groups

*

Rule	Path
Disallow	/health/
Disallow	/oembed/
Disallow	/lti/
Disallow	/search
Disallow	//search
Disallow	/article-iframe/
Disallow	/embed-iframe/
Disallow	login
Disallow	logout
Disallow	*minndla/
Disallow	/history/

Rule

Path

Disallow

/health/

Disallow

/oembed/

Disallow

/lti/

Disallow

/search

Disallow

/*/search*

Disallow

*/article-iframe/*

Disallow

*/embed-iframe/*

Disallow

*login*

Disallow

*logout*

Disallow

*minndla/

Disallow

/history/

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

claude-web

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

imagesiftbot

Rule	Path
Disallow	/

Rule

Path

Disallow

omigilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

perplexitybot

Rule	Path
Disallow	/

Rule

Path

Disallow

diffbot

Rule	Path
Disallow	/

Rule

Path

Disallow

cohere-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

Comments

Updated ndla.no 03.04.2024 with additions
ndla-frontend paths
Min NDLA
status.ndla.no
AI/LLM CRAWLERS, fetched from snl.no
Common Crawl
OpenAI
Anthropic
Bytedance
Hive
Webz
Perplexity
Diffbot
Diffbot

ndla.norobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

ccbot

gptbot

anthropic-ai

claudebot

claude-web

bytespider

imagesiftbot

omigilibot

perplexitybot

diffbot

cohere-ai

Comments

ndla.no
robots.txt