join.duolingo.com
robots.txt

Robots Exclusion Standard data for join.duolingo.com

Resource Scan

Scan Details

Site Domain join.duolingo.com
Base Domain duolingo.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-13T05:28:24+00:00
Next Scan 2024-12-12T05:28:24+00:00

Last Successful Scan

Scanned2024-05-17T03:35:58+00:00
URL https://join.duolingo.com/robots.txt
Domain IPs 18.204.191.147, 18.211.133.226, 3.209.163.89, 3.82.80.119, 34.205.204.152, 44.194.254.74, 54.80.87.54, 54.89.72.121
Response IP 35.174.127.75
Found Yes
Hash 4cfb1c35d2d35f849805fc852f6196b8f25246fc4696cd24f183b6dc77375784
SimHash 3c103d3ab1f5

Groups

*

Rule Path
Disallow /sessions/
Disallow /extend_session/
Disallow /session_element_solutions/
Disallow /friendships/
Disallow /vocabularies/
Disallow /diagnostics/
Disallow /skills/
Disallow /translations/
Disallow /translation/
Disallow /contribute/
Disallow /documents/
Disallow /matchmaker/
Disallow /logger/
Disallow /events/
Disallow /facebook/
Disallow /twitter/
Disallow /words/
Disallow /translate_jobs/
Disallow /oauth/
Disallow /data/
Disallow /unsubscribe
Disallow /deactivate
Disallow /pm/
Disallow /profile/
Disallow /register/

twitterbot

Rule Path
Allow /profile/

facebookexternalhit

Rule Path
Allow /profile/

Other Records

Field Value
sitemap https://join.duolingo.com/sitemap_index.xml

Comments

  • Certain social media sites are added to allow crawlers to access page markup when links to /profile are shared.