Work/EdTech · 2M learners

EdTech · 2M learners · 2024-2025

Time-to-mastery,
cut by 62%.

A K-12 math platform was failing 40% of its learners with a one-size-fits-all curriculum. We built an adaptive AI tutor that calibrates difficulty per learner and explains stuck moments.

Client

EdTech · K-12 math

Engagement

Squad · 10 months

Team

2 PMs · 1 designer · 5 engineers

Status

2M learners

Stack

claude · pgvector · next.js · postgres

01 · The challenge

40% of learners failing the linear curriculum.

A K-12 math platform serving 2M learners had a one-size-fits-all curriculum. 40% of learners stalled at one of three predictable choke points (fractions, algebraic substitution, geometric reasoning) and churned within 30 days. The product team had been adding more practice problems for two years. Nothing moved the metric. The Head of Learning had a sharper question: what if difficulty calibrated to the learner instead of the grade?

02 · Our approach

Adaptive difficulty, real-time explanation.

We built an adaptive engine that tracks where each learner gets stuck, generates explanations targeted to the specific misconception (not the topic), and adjusts difficulty per-problem rather than per-grade. The model retrieves prior worked examples from the curriculum corpus, finds the closest analog to the learner's mistake, and walks them through it in their own grade-appropriate language. Two-thirds of churn came from three choke points. Targeting those alone delivered most of the gain.

03 · What we built

What we shipped.

Misconception classifier

Every wrong answer gets classified into one of 200+ known misconceptions, not just 'wrong.' The taxonomy was built with the learning team from a decade of student work.

Targeted explanation generator

Claude pulls a worked example from the curriculum corpus that matches the misconception and rewrites it in the learner's reading level. Same math, right level.

Difficulty controller

The next problem is selected to be solvable by the learner's current capability, not the grade-level average. Stuck learners get unstuck instead of buried.

Parent and teacher dashboards

Visible progress, where the learner is stuck this week, and what got unstuck. Parents see the work.

04 · How it shipped

Delivery cadence.

Month 1 to 2

Misconception taxonomy

Built with the learning team. Curriculum corpus indexed and tagged.

Month 3 to 5

Adaptive engine build

Limited rollout to one grade band. Eval pipeline weekly from day one.

Month 6 to 8

Multi-grade rollout

All K-8 math grades. Eval and content tuning.

Month 9 to 10

Parent and teacher tooling

Dashboards. Scaled to 2M learners. Retainer for ongoing eval.

05 · The results

Numbers their team verified.

62%

Faster time-to-mastery

Across all grade levels. Largest gains on previously stuck learners.

−28%

30-day churn

Stuck-then-churn cohort cut by more than half. $9M+ retained revenue.

+21pts

Parent NPS

Best predictor of long-term retention.

We've had AI features for two years that nobody used. This is the first thing the team built that learners and parents both notice.

Head of Learning

EdTech · 2M learners

06 · What we learned

Misconceptions, not topics.

Generic 'you got it wrong, try again' feedback did nothing. Misconception-specific explanations moved every metric.
Parent NPS turned out to be the long-term retention predictor, not learner NPS. Parents who could see 'what got unstuck this week' renewed.
Two-thirds of churn came from three choke points. Targeting those alone delivered most of the gain. The rest came from steady tuning.

More work

Other case studies.

Your turn

The next case study
could have your numbers.

Book a discovery call See more work

Loading…

40% of learners failing the linear curriculum.

Adaptive difficulty, real-time explanation.

Misconceptions, not topics.

Generic 'you got it wrong, try again' feedback did nothing. Misconception-specific explanations moved every metric.

Parent NPS turned out to be the long-term retention predictor, not learner NPS. Parents who could see 'what got unstuck this week' renewed.

Two-thirds of churn came from three choke points. Targeting those alone delivered most of the gain. The rest came from steady tuning.

Time-to-mastery,cut by 62%.

40% of learners failing the linear curriculum.

Adaptive difficulty, real-time explanation.

What we shipped.

Misconception classifier

Targeted explanation generator

Difficulty controller

Parent and teacher dashboards

Delivery cadence.

Misconception taxonomy

Adaptive engine build

Multi-grade rollout

Parent and teacher tooling

Numbers their team verified.

Misconceptions, not topics.

Other case studies.

Invoice processing, reborn

L1 IT support automated

Denials, recovered

The next case studycould have your numbers.

Time-to-mastery,cut by 62%.

40% of learners failing the linear curriculum.

Adaptive difficulty, real-time explanation.

What we shipped.

Misconception classifier

Targeted explanation generator

Difficulty controller

Parent and teacher dashboards

Delivery cadence.

Misconception taxonomy

Adaptive engine build

Multi-grade rollout

Parent and teacher tooling

Numbers their team verified.

Misconceptions, not topics.

Other case studies.

Invoice processing, reborn

L1 IT support automated

Denials, recovered

The next case studycould have your numbers.

Time-to-mastery,
cut by 62%.

The next case study
could have your numbers.

Time-to-mastery,
cut by 62%.

The next case study
could have your numbers.