nl-nl.duolingo.com
robots.txt

Robots Exclusion Standard data for nl-nl.duolingo.com

Resource Scan

Scan Details

Site Domain nl-nl.duolingo.com
Base Domain duolingo.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-07-04T20:01:16+00:00
Next Scan 2024-10-02T20:01:16+00:00

Last Successful Scan

Scanned2023-06-03T07:21:48+00:00
URL https://nl-nl.duolingo.com/robots.txt
Domain IPs 34.199.76.114, 44.207.115.234, 52.200.242.98, 52.203.130.97, 52.45.97.77, 54.156.54.3, 54.164.172.241, 54.236.197.142
Response IP 44.208.117.102
Found Yes
Hash 0dde99ce5666ad2b42506c6a5ee780b24380db4c25db3935b911fdbac0fb676b
SimHash 3c103d38b1f1

Groups

*

Rule Path
Disallow /sessions/
Disallow /extend_session/
Disallow /session_element_solutions/
Disallow /friendships/
Disallow /vocabularies/
Disallow /diagnostics/
Disallow /skills/
Disallow /translations/
Disallow /translation/
Disallow /contribute/
Disallow /documents/
Disallow /matchmaker/
Disallow /logger/
Disallow /events/
Disallow /facebook/
Disallow /twitter/
Disallow /words/
Disallow /translate_jobs/
Disallow /oauth/
Disallow /data/
Disallow /unsubscribe
Disallow /deactivate
Disallow /pm/
Disallow /profile/
Disallow /register/

twitterbot

Rule Path
Allow /profile/

facebookexternalhit

Rule Path
Allow /profile/

Other Records

Field Value
sitemap https://nl-nl.duolingo.com/sitemap_index.xml

Comments

  • Certain social media sites are added to allow crawlers to access page markup when links to /profile are shared.