ar.duolingo.com
robots.txt

Robots Exclusion Standard data for ar.duolingo.com

Resource Scan

Scan Details

Site Domain ar.duolingo.com
Base Domain duolingo.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-04-06T09:24:58+00:00
Next Scan 2024-07-05T09:24:58+00:00

Last Successful Scan

Scanned 2023-06-08T08:57:24+00:00
URL https://ar.duolingo.com/robots.txt
Domain IPs 174.129.17.119, 18.208.65.239, 3.232.217.122, 35.171.144.123, 50.19.196.20, 52.7.105.14, 54.236.197.142, 54.81.146.42
Response IP 52.200.242.98
Found Yes
Hash 1b07ba763d86a35556e41fb4019640df439689bbe2a85089fb4227ff53492117
SimHash 3c102d3bb1f9

Groups

*

Rule Path
Disallow /sessions/
Disallow /extend_session/
Disallow /session_element_solutions/
Disallow /friendships/
Disallow /vocabularies/
Disallow /diagnostics/
Disallow /skills/
Disallow /translations/
Disallow /translation/
Disallow /contribute/
Disallow /documents/
Disallow /matchmaker/
Disallow /logger/
Disallow /events/
Disallow /facebook/
Disallow /twitter/
Disallow /words/
Disallow /translate_jobs/
Disallow /oauth/
Disallow /data/
Disallow /unsubscribe
Disallow /deactivate
Disallow /pm/
Disallow /profile/
Disallow /register/

twitterbot

Rule Path
Allow /profile/

facebookexternalhit

Rule Path
Allow /profile/

Other Records

Field Value
sitemap https://ar.duolingo.com/sitemap_index.xml

Comments

  • Certain social media sites are added to allow crawlers to access page markup when links to /profile are shared.
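
The groups above can be checked with Python's standard urllib.robotparser. The following is a minimal sketch, not the scanner's own method. It assumes a hand-reconstructed subset of the rules (the exact layout of the original file is not shown in this report), and the user agent "ExampleBot", the helper names, and the profile path are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a reconstructed subset of the rules listed in this report.
# The real file's exact text and ordering are not reproduced here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /sessions/
Disallow: /unsubscribe
Disallow: /profile/
Disallow: /register/

User-agent: twitterbot
Allow: /profile/

User-agent: facebookexternalhit
Allow: /profile/

Sitemap: https://ar.duolingo.com/sitemap_index.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from memory, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, path: str) -> bool:
    """Return True if user_agent may fetch the given path on this host."""
    return parser.can_fetch(user_agent, "https://ar.duolingo.com" + path)


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical crawler with no group of its own,
# so it falls back to the "*" group.
for agent in ("ExampleBot", "Twitterbot", "facebookexternalhit/1.1"):
    print(agent, is_allowed(parser, agent, "/profile/someuser"))
```

In this sketch the generic crawler is refused /profile/ while the two social media crawlers are allowed, which matches the comment above. Because a crawler with its own group uses only that group, this parser also reports paths such as /sessions/ as allowed for twitterbot; other robots.txt implementations may differ in details.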