hyperskill.org
robots.txt

Robots Exclusion Standard data for hyperskill.org

Resource Scan

Scan Details

Site Domain hyperskill.org
Base Domain hyperskill.org
Scan Status Ok
Last Scan 2025-06-08T15:55:19+00:00
Next Scan 2025-06-15T15:55:19+00:00

Last Scan

Scanned 2025-06-08T15:55:19+00:00
URL https://hyperskill.org/robots.txt
Domain IPs 104.26.2.247, 104.26.3.247, 172.67.70.250, 2606:4700:20::681a:2f7, 2606:4700:20::681a:3f7, 2606:4700:20::ac43:46fa
Response IP 104.26.3.247
Found Yes
Hash 3dd4d6823831ce0f6e69efb7440f6902ab6f964c66602f2f6a1139609b57c391
SimHash 19499c9387f7

Groups

*

Rule Path
Disallow /admin/
Disallow /apply-coupon/
Disallow /content/
Disallow /debug/
Disallow /delete-account-confirmation/
Disallow /delete-account/
Disallow /join/
Disallow /media/
Disallow /unsubscribe/
Disallow /ws/
Allow /media/sitemap-*.xml
Allow /media/sitemap-*.xml.gz
Allow /media/sitemaps/

twitterbot

Rule Path
Allow /join/

facebookexternalhit

Rule Path
Allow /join/

linkedinbot

Rule Path
Allow /join/

linkchecker

Rule Path
Allow /admin/
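
How the groups above combine for a given crawler can be sketched in Python. This is a minimal illustration, not the site's or any crawler's actual implementation. It assumes the longest-match evaluation described in RFC 9309: a crawler uses only the group that names it and falls back to `*` otherwise, the longest matching pattern wins, and Allow wins a tie. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical, and user agents are matched by exact lowercase name, which is simpler than the product-token matching real crawlers perform.

```python
import re

# Rules transcribed from the groups listed in this report.
GROUPS = {
    "*": [
        ("disallow", "/admin/"),
        ("disallow", "/apply-coupon/"),
        ("disallow", "/content/"),
        ("disallow", "/debug/"),
        ("disallow", "/delete-account-confirmation/"),
        ("disallow", "/delete-account/"),
        ("disallow", "/join/"),
        ("disallow", "/media/"),
        ("disallow", "/unsubscribe/"),
        ("disallow", "/ws/"),
        ("allow", "/media/sitemap-*.xml"),
        ("allow", "/media/sitemap-*.xml.gz"),
        ("allow", "/media/sitemaps/"),
    ],
    "twitterbot": [("allow", "/join/")],
    "facebookexternalhit": [("allow", "/join/")],
    "linkedinbot": [("allow", "/join/")],
    "linkchecker": [("allow", "/admin/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `user_agent` may fetch `path` under the rules above."""
    # Assumption: exact lowercase lookup; unknown agents use the '*' group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("googlebot", "/media/sitemap-1.xml")` is allowed because the longer Allow pattern overrides `Disallow /media/`, while `is_allowed("googlebot", "/join/abc")` is disallowed and `is_allowed("Twitterbot", "/join/abc")` is allowed. Note that Python's standard `urllib.robotparser` does not expand `*` inside paths, which is why this sketch uses its own matcher.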

Other Records

Field Value
sitemap https://hyperskill.org/sitemap.xml

Comments

  • Allow major social network crawlers to access /join/ for preview purposes
  • Specific rule for LinkChecker