en.duolingo.com
robots.txt

Robots Exclusion Standard data for en.duolingo.com

Resource Scan

Scan Details

Site Domain en.duolingo.com
Base Domain duolingo.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-06-29T20:16:38+00:00
Next Scan 2025-09-27T20:16:38+00:00

Last Successful Scan

Scanned2023-06-03T08:13:19+00:00
URL https://en.duolingo.com/robots.txt
Domain IPs 107.20.92.148, 18.235.93.35, 34.206.161.149, 44.207.115.234, 44.208.117.102, 52.45.97.77, 54.164.172.241, 54.210.212.163
Response IP 54.164.172.241
Found Yes
Hash 32c4463dc8b336ff356bf479de868ed0abcc0e6b16494d4c4db7d35b459bef75
SimHash 3c103d3bb1f5

Groups

*

Rule Path
Disallow /sessions/
Disallow /extend_session/
Disallow /session_element_solutions/
Disallow /friendships/
Disallow /vocabularies/
Disallow /diagnostics/
Disallow /skills/
Disallow /translations/
Disallow /translation/
Disallow /contribute/
Disallow /documents/
Disallow /matchmaker/
Disallow /logger/
Disallow /events/
Disallow /facebook/
Disallow /twitter/
Disallow /words/
Disallow /translate_jobs/
Disallow /oauth/
Disallow /data/
Disallow /unsubscribe
Disallow /deactivate
Disallow /pm/
Disallow /profile/
Disallow /register/

twitterbot

Rule Path
Allow /profile/

facebookexternalhit

Rule Path
Allow /profile/

Other Records

Field Value
sitemap https://en.duolingo.com/sitemap_index.xml

Comments

  • Certain social media sites are added to allow crawlers to access page markup when links to /profile are shared.