Skip to main content

Riiid Competitive Analysis

Company Overview

  • Founded: 2014
  • Headquarters: Seoul, South Korea
  • Founder/CEO: YJ Jang (Yang-Je Jang)
  • Funding: $274M+ total raised
  • Key investors: SoftBank Vision Fund 2 ($179M Series D, 2020), IMM Investment, DST Global
  • Valuation: ~$1B+ (unicorn, post-Series D)
  • Products: Santa (TOEIC/TOEFL prep app), Riiid TUTOR (B2B AI engine)
  • Users: 2.5M+ Santa app users (primarily South Korea and Japan)
  • Employees: 300+ (2023)

Mission: "Democratize education with AI" — making expert-level personalized tutoring universally accessible

Recognition:

  • World Economic Forum Technology Pioneer (2021)
  • CB Insights AI 100 (2021)
  • Published largest open education dataset (EdNet) for global AI research

Market Position

Position: AI-first adaptive test preparation platform (Korea/Japan) + B2B AI licensing globally

Primary market: English proficiency test preparation (TOEIC, TOEFL, TEPS) in East Asia

Target audience:

  • B2C: Korean and Japanese professionals/students taking TOEIC for employment/university admission
  • B2B: EdTech companies, publishers, and educational institutions seeking adaptive AI engine

Market context:

  • TOEIC is mandatory for job applications at most Korean conglomerates (Samsung, LG, Hyundai, etc.)
  • ~2M TOEIC tests taken annually in South Korea alone
  • Captive, highly motivated user base with clear outcome metric (score improvement)

Competitive position:

  • Dominant in Korean AI TOEIC prep (vs. YBM, Hackers, traditional institutes)
  • First to apply DKT at commercial scale for standardized test prep
  • Expanding: TOEFL (US/global), general English, corporate L&D

Business Model

Dual revenue model:

B2C — Santa App:

  • Freemium: basic adaptive practice free
  • Premium subscription: $15-25/month (full adaptive features, AI tutor, unlimited practice)
  • One-time course purchases: $50-200 for structured prep courses
  • Primary markets: South Korea, Japan; expanding to SE Asia, US

B2B — Riiid TUTOR API:

  • White-label AI engine for other EdTech platforms and publishers
  • SaaS licensing: annual contracts, pricing based on student volume
  • Target customers: textbook publishers, LMS vendors, corporate training platforms
  • Revenue undisclosed but reported as significant growth driver post-2021

Economics:

  • TOEIC prep market: highly recurring (professionals retake to improve scores for promotions)
  • B2B provides higher ACV (Annual Contract Value) with lower CAC than B2C

AI & Personalization Technology

Core Framework: Deep Knowledge Tracing (DKT)

Riiid built its platform on Deep Knowledge Tracing — using neural networks to model the temporal evolution of a student's knowledge state.

How DKT works at Riiid:

  1. Student interaction sequences (question ID, response, timestamp, response time) are fed into LSTM/Transformer neural network
  2. Model maintains a hidden state representing estimated knowledge across skill dimensions
  3. After each interaction, model updates probability estimates for all related skills
  4. Next question selected to maximize expected information gain given current knowledge state
  5. System predicts final TOEIC score based on practice performance trajectory

Technical evolution:

  • 2016-2018: LSTM-based DKT (standard architecture)
  • 2019-2020: Transformer-based attention mechanisms (better long-range dependency modeling)
  • 2021+: SAINT (Self-Attentive model for Knowledge Tracing) — Riiid's proprietary architecture, published at NeurIPS 2020

SAINT Architecture

Riiid's published SAINT model (Separated Self-AttentIve Neural Knowledge Tracing):

  • Separates exercise embedding and response embedding into two distinct encoder streams
  • Enables deeper attention layers than previous architectures
  • Achieved state-of-the-art on EdNet dataset benchmarks at time of publication
  • Key innovation: exercise content features (difficulty, concept tags) separated from response history → reduces interference between content modeling and performance modeling

Score Prediction Engine

Riiid's commercially differentiating feature:

  • Predicts TOEIC score from as few as 30-50 adaptive practice questions
  • Claimed accuracy: within ±50 points of actual TOEIC score (on a 990-point scale) with ~90% confidence
  • Updates prediction in real-time as student completes more practice
  • Score trajectory visualization: shows projected score over time with continued practice

Why this works for TOEIC:

  • TOEIC is a standardized exam with stable psychometric properties
  • Large historical dataset (EdNet) enables accurate IRT parameter estimation
  • Stable item pool allows calibration → score prediction becomes tractable

EdNet Dataset — Strategic Research Asset

What is EdNet?

EdNet is the largest publicly released dataset of student-AI interactions in education history:

  • Scale: 131M student-AI interaction logs
  • Students: 784,309 unique students
  • Timespan: 2 years of Santa app usage data
  • Content: 13,169 unique questions with metadata (difficulty, concept tags, response times)
  • Released: January 2020 (arXiv:1912.03072), available on GitHub

Why EdNet Matters

For AI research:

  • Enabled dozens of independent research groups to develop and benchmark knowledge tracing models
  • SAINT, DKT variants, BERT-based KT models all benchmarked on EdNet
  • Created a standard evaluation framework for the field

For Riiid:

  • Positioned Riiid as a scientific leader, not just a product company
  • Attracted top ML researchers and academic collaborators
  • Generated enormous brand credibility with AI/ML community
  • Enabled benchmarking of their own models against global competition

Strategic insight: By open-sourcing their dataset, Riiid converted proprietary data into a public research ecosystem that ultimately benefited their own model development and talent acquisition.

Santa TOEIC Product Deep-Dive

User Experience Flow

  1. Onboarding diagnostic: 30-question adaptive assessment → initial score prediction
  2. Personalized curriculum: AI generates study plan based on score gap and target date
  3. Daily adaptive practice: 20-30 questions, difficulty adjusts per session
  4. AI feedback: Wrong answer explanations, similar question recommendations
  5. Progress tracking: Score trajectory chart, weak area radar graph
  6. Listening practice: Audio questions with speech recognition for pronunciation
  7. AI Tutor (Santa): LLM-powered conversational tutor for grammar/vocabulary questions

Santa for TOEFL & Global Expansion

  • Launched Santa TOEFL (2022): applying same DKT approach to TOEFL iBT
  • English grammar and vocabulary products for general learners
  • Expanding Japanese market: TOEIC also high-stakes in Japan

Riiid TUTOR (Enterprise B2B API)

What it is: White-label adaptive AI engine that other EdTech companies embed into their platforms

Core capabilities:

  • Knowledge state modeling per learner
  • Adaptive question selection from client's item bank
  • Learning outcome prediction
  • Performance analytics and dashboards
  • REST API with LTI integration support

Target customers:

  • Large publishers (textbooks with digital components)
  • Corporate L&D platforms
  • National testing organizations
  • School district LMS vendors

Strategic rationale: SoftBank investment thesis was partly B2B API → Riiid becomes "AWS of adaptive learning"

Status (2024-2026): B2B growth reported but scale of adoption not publicly confirmed. SoftBank's Vision Fund 2 has faced broader portfolio pressure, which indirectly affects Riiid's expansion capital.

Research & Evidence

PublicationVenueKey Finding
EdNet dataset paper (arXiv:1912.03072)KDD 2020Largest education AI dataset; enables reproducible KT research
SAINT model (arXiv:2010.12042)NeurIPS 2020 WorkshopState-of-the-art on EdNet KT benchmarks at publication
SAINT+ (arXiv:2010.12042v2)EDM 2021Extended SAINT with response time features
Score prediction paperInternal blog/whitepaperClaims >90% accuracy within ±50 TOEIC points

Evidence quality: Academic publications (SAINT, EdNet) are peer-reviewed and independently reproducible — strong scientific credibility. Score prediction claims are self-reported without independent validation.

Weaknesses & Limitations

TOEIC concentration risk: ~80%+ of B2C revenue dependent on a single exam type in two countries (Korea, Japan). TOEIC score requirements could be relaxed by Korean conglomerates — already happening at some companies.

Score prediction novelty wearing off: As competitors adopt similar approaches, the core differentiator commoditizes.

Limited depth beyond test prep: DKT works best for subjects with structured, sequential skills and large item pools. Generalization to open-ended subjects (writing, critical thinking) is technically unproven.

B2B API monetization slower than expected: Building API business requires long enterprise sales cycles. SoftBank's portfolio pressures may limit runway for extended sales cycles.

Geography: Despite international ambitions, >90% of actual users remain in Korea/Japan. US market penetration (where TOEIC is irrelevant) requires entirely different product strategy.

LLM disruption threat: GPT-4 class models can now generate TOEIC-style questions and provide explanations at low cost. Riiid's value proposition narrows if LLMs commoditize adaptive practice.

Startup Implications

Lessons from Riiid's strategy:

Open data as moat: EdNet dataset release was counterintuitive (giving away your data?), yet it created scientific credibility, academic partnerships, and a talent magnet that no amount of marketing could replicate. Open-sourcing carefully chosen assets can build network effects.

Standardized exams as beachhead: TOEIC/TOEFL's structured nature (fixed difficulty parameters, stable item pools, clear outcome metric) makes score prediction tractable. This is a better initial target than open-ended learning. Beachhead in high-stakes, measurable tests → expand from proof point.

Score prediction as hook: "You will score X on the exam" is a uniquely compelling promise. Prediction creates accountability and urgency that generic "improve your skills" products lack. Any test-prep product should consider score prediction as primary UX feature.

B2B API ambition requires patience: The "AWS of adaptive learning" vision requires 5-7 year sales cycle investment. Under-capitalized startups should focus B2C first, use revenue to fund B2B development.

DKT vs. IRT trade-off: IRT is more interpretable and explainable to educators; DKT is more accurate but opaque. For K-12 school sales, explainability often matters more than marginal accuracy improvement. Choose based on customer.