KodiLingo: Strengthening the Concept — A Founder’s Action Plan

Short answer: yes. Every one of these weaknesses is fixable, and most are fixable on paper, this week, for the price of your own time. The concept note isn’t wrong — it’s unsequenced. You’ve written a capability inventory (“what could this platform contain”) when what you need is a market position plus a buildable plan (“what do I ship first, for whom, to prove what, by when, with what”). The fix is not more features. It is the opposite: pick the one defensible wedge, write four cheap paper artifacts that resolve scope/economics/compliance before any code, recruit one teacher, and build a thin offline loop end-to-end. The hardest discipline here is saying “not in v1” to 11 of your 13 pillars. That discipline is the differentiation.

One thing to internalize up front: your binding constraint is your own build-hours plus lesson-authoring effort, not code cleverness. Schedule around that and you win; ignore it and the 13-pillar wishlist calcifies into an unshippable mass maintained by one exhausted person.

Executive summary

#	Weakness	Strengthenable?	One-line fix
1	Wishlist, not a buildable plan (no MVP/phasing)	Yes	RICE-score ~50 features in R/CSV, draw an MVP line at ~12 effort-weeks, park 11 pillars in a “Won’t-yet” bin.
2	No competitor / differentiation analysis	Yes	Build a competitor×feature matrix; claim the one empty cell: a CBE coding lesson that is an English-literacy exercise.
3	No technical architecture	Yes	One-page ADR: offline-first PWA on a per-school box; nginx + FastAPI/Node + SQLite; cloud AI optional/async.
4	Offline-first vs cloud-AI tension	Yes	Split “AI” into 3 tiers (on-device / classroom-server / cloud); ship the typed-answer floor first.
5	Child-data privacy named, not designed	Partially	Pseudonymous-by-default + ODPC registration + Kenya-resident data copy; it’s a real workstream, not a policy page.
6	Cloud-AI unit economics / vendor lock-in	Yes	Provider-abstraction layer + caching/quotas; local default only where it works; pronunciation/tutor are cloud-only paid.
7	Heavy hardware/VR as v1 dependency	Yes	Virtual-first behind one device API; robotics/VR become externally-funded add-ons, never a gate.
8	No metrics / scale / team / budget / timeline	Yes	One living one-page operating plan: learning-outcome North Star, offline event layer, milestone-gated hiring.

“Partially” on privacy is deliberate: it’s fully addressable, but it forces a real product downgrade (offline child speech is weak) and recurring infra cost (Kenya-resident hosting), so it isn’t free paperwork.

1. From 13-pillar wishlist to a buildable plan

The problem. A solo founder has one scarce resource — build-hours — and a 13-pillar, ~50-feature wishlist silently assumes those hours are infinite. With no MVP and no phasing, every feature sits at equal priority (which is no priority), the offline-vs-cloud tradeoff gets made implicitly and badly, and you can’t pitch a school or grant officer because “we do everything” is unfundable. Worse, the wishlist hides whether the central bet — that fusing Code + Speak beats buying Scratch and Duolingo separately — is even true. Until you ship the thinnest fused slice into one real classroom, that bet is untested faith.

Can it be strengthened? Yes — and this is the highest-leverage move in the whole concept, because the act of prioritizing mechanically produces your MVP.

How. Use RICE as the primary scoring method (it plays to your R/data instincts and the Confidence term automatically punishes low-confidence cloud-AI-on-Kenyan-child-speech features), with two mandatory overlays: a MoSCoW “Won’t-yet” bin to formally park beloved ideas on the record, and a one-page dependency graph so you never schedule a feature before its prerequisites. Then commit to one MVP and resist adding to it.

The walking-skeleton MVP — the heart that proves the fused value:

A single guided lesson where a child (a) drags Blockly blocks to solve one coding micro-challenge, and (b) must answer what their code does — and the app checks it and only then marks the lesson complete and awards XP. Code and Speak are fused in one loop; you cannot finish without doing both.

Critical correction the original plan buried: the speak step is the heart of the bet and the part that breaks offline. The browser Web Speech API is not “free, zero infra” — in Chrome it streams audio to Google’s cloud and fails in the exact low-bandwidth classroom you target. So v1’s reliable default is a typed answer with a deterministic keyword check (does the text contain “forward”, “three”, “steps” — never an LLM grader). Speech capture is a best-effort enhancement layered on top, with a swappable engine interface. Pronunciation scoring is deferred entirely.

Treat the headline “10–14 weeks” as 4–6 calendar months part-time, because the slowest part is not code — it’s authoring 5–8 CBE-mapped lessons a 6–8-year-old can finish unassisted, which is curriculum design, outside your wheelhouse.

What you need to do.

Build the prioritization sheet (R/CSV): one row per feature. Columns: Pillar | Feature | Reach(0-5) | Impact(1/2/3) | Confidence(0.5/0.8/1.0) | Effort_weeks (your guess ×1.5) | RICE = R*I*C/E | MoSCoW | Depends_on | Phase. Pre-fill ~50 features. (Half a day, this week, step 1.)
Score honestly, sort by RICE, draw the MVP line at ~12 cumulative effort-weeks. Any feature whose core path needs cloud speech gets Confidence ≤0.8 and is flagged “offline alternative required.” Park VR, Unity/Godot, robotics, AI grading, parent/admin portals, Swahili UI, pronunciation scoring in “Won’t-yet.” (2–3 hours.)
Write a one-page Walking-Skeleton Spec and tape it above your desk: Blockly micro-challenge → mandatory typed answer + keyword check → XP → progress to SQLite → teacher list view. Pin it to ONE CBE lower-primary strand (sequencing/movement). (2–3 hours.)
Draw the MVP dependency graph; confirm nothing in the skeleton secretly depends on a Won’t-yet feature. Build end-to-end and thin before wide — one complete lesson beats ten half-lessons. (Half a day, then the multi-week build.)
Pre-declare a success metric AND a kill criterion before any classroom test: success = “≥70% of kids complete the fused lesson unassisted in <15 min”; kill = “if kids ignore or resent the speak step, or it adds no engagement over plain Blockly, the Code+Speak fusion has failed.” Treat one classroom as a signal, not proof.

2. Competitor and differentiation analysis

The problem. A 13-pillar note with no map of who already occupies each pillar is a plan to rediscover the market by building. Kenya has ~235 EdTech startups; incumbents are better-funded than you (Zeraki $1.8M, Kidato $1.4M, Eneza 6.5M+ users) and the coding tools you cite as inspiration (Scratch, Code.org) are free. Present KodiLingo as “coding + gamification + AI + language + robotics + dashboards” and every pillar maps to an incumbent who does that one thing better and cheaper. You cannot out-resource Scratch on coding, Duolingo on language, and Zeraki on admin simultaneously.

Can it be strengthened? Yes — and doing the analysis reveals KodiLingo’s actual reason to exist.

How. Build a one-page competitor×feature matrix and claim the empty cell. The genuinely unoccupied position is narrow: not “CBE + offline + coding” (Elimutab, MsingiPack, iLearn already partly occupy that — include them or you flatter yourself), but coding lessons where the same activity is also a structured English-literacy exercise. Positioning sentence:

“The only tool where a CBE coding lesson is an English-literacy exercise for the same Kenyan Grade 4–8 child.”

Offline-first is a differentiator against cloud-only global tools, not a moat against local offline incumbents — don’t oversell it. The Computer Science for All Bill (2025, still a proposal) is a tailwind, not a guarantee; treat it as informational conversations, not pilot inclusion you can assume.

What you need to do.

Trial the products, then build the matrix. Rows: Elimutab, MsingiPack, iLearn, eLimu, Kytabu, Eneza, Zeraki + Scratch, Code.org, Tynker (~$180/yr), Kodable, Duolingo ABC. Columns: CBE-aligned, Offline, Coding, English/literacy, Swahili, Teacher dashboard, Price, AI. Fill cells from real product behavior, not marketing pages. Circle the empty intersection. (1–2 days.)
Rewrite page 1 of the concept around the positioning sentence; move robotics, VR, labs, AI grading, parent portal to a labeled “Phase 2/3 Roadmap.” (1 day.)
Run 15–20 discovery conversations with a pre-committed kill rule (below) before writing product code. Raise the bar above polite agreement: proceed only if buyers name the coding+English gap unprompted AND either state a per-pupil price or commit to a paid pilot. (8–12 weeks if you lack a network; budget some travel.)
Set a hard stop: no buyer commits to a paid pilot within ~8 weeks → stop or pivot the wedge based on what they did say hurts.

3. Technical architecture

The problem. Without an architecture the 13 pillars are unschedulable and uncostable, and the two headline promises physically conflict: offline-first targets rural schools with intermittent power and no bandwidth; cloud-heavy AI assumes fast always-on internet per child. The note also implicitly scopes building a block engine, a speech engine, and an AI tutor from scratch — each a company-sized undertaking.

Can it be strengthened? Yes. The architecture is your single highest-leverage artifact — it converts an unfinishable wishlist into a buildable v1 and is what makes the concept fundable.

How. Adopt an offline-first PWA served from a per-school local box, with cloud AI as an optional, async, bandwidth-bound bonus.

Layer	Choice
Platform	PWA served from a per-school box over classroom WiFi; no app install; works with zero internet
Stack	nginx + FastAPI (or Node) + SQLite on the box; service-worker PWA; thin cloud tier (Postgres) only for sync + parent/admin portal
Data model (v1, exactly these)	`school, user, class, enrollment, lesson, content_pack, attempt, progress, assessment` — nothing more
Auth	4 roles, but not 4 equal logins: students use class-scoped PIN/avatar (no email, no child PII); teachers/admins email+password; parents read-only on cloud. Row-level by `school_id` + class membership
Sync	Content packs flow DOWN as signed/checksummed files; attempts flow UP as an append-only, idempotent, last-write-wins queue
Build-vs-reuse	Embed the block runtime — do not build a block engine. Decide Blockly-vs-MakeCode here (see §7) so the runtime is chosen once

The keystone property: content is data (signed packs), not code — you ship new lessons/subjects without redeploying, and conflicting AI features become optional sync-time enhancements rather than blockers.

Honest corrections to fold in: budget ~US$200–300 delivered per box in 2026 (the RAM shortage roughly doubled Pi 5 prices; price a refurb x86 mini-PC as a serious alternative); plan ~5–9 months to a real pilot for an R/data founder new to offline sync; the idempotent sync queue (the one truly hard piece) is 2–4 weeks; “embed Blockly” is multi-week lesson-runner/serialization/packaging work, not a script tag.

What you need to do.

Write the one-page architecture ADR stating platform, stack, the NOT-in-v1 list (VR, robotics, on-device speech, AI tutor, public gallery, marketplace, multi-language beyond English), and the Blockly-vs-MakeCode decision. (Half a day.)
Create the 9-table SQLite schema, migrations in one SQL file in git, seed one school/teacher/class/3 students. (1–2 days.)
Engineer for power loss from day one: boot from SSD not microSD; SQLite WAL mode; nightly local snapshot to SSD/USB (sync-up is not a backup). (Fold into the skeleton/sync steps.)
Define a concrete sync-size budget and a manual “teacher’s-phone, weekly” fallback; don’t assume a nightly hotspot.
Cap v1 at 1–3 pilot schools and flag fleet management (remote update, SD corruption, theft) as an unsolved scaling problem, not “your time.”

4. Offline-first vs cloud-AI tension

The problem. “Works offline in Kenyan classrooms” and “AI tutor / pronunciation coach / AI grading everywhere” are physically in tension on the exact device class the market owns (2–3GB-RAM Tecno/Infinix phones; 81% of African devices sold in 2025 were under $200). A useful fully-on-device LLM is not feasible on that hardware in 2026. A technically literate grant assessor will expose “offline-first + cloud-AI everywhere” in one demo.

Can it be strengthened? Yes — by splitting the single marketing word “AI” into three computationally distinct workloads tagged by where they can physically run.

How. A three-tier classification, wrapped in a graceful-degradation UX contract and a sync-when-online queue, with the infeasible pieces explicitly deferred.

Tier	Runs where	Features
0 — On-device (zero network)	The child’s browser/phone	Block coding, cached lessons, deterministic code checks, all gamification, deterministic hint tree, typed-answer keyword check
1 — Classroom server (LAN, no internet)	The per-school box	LanguageTool self-host, spaCy, class analytics, sync hub. (LLM/ASR here only after a benchmark — see below)
2 — Cloud (best-effort, queued offline)	Hosted APIs	LLM tutor, generative grading, role-play, AI-portfolio prose

Corrections the original over-promised: offline Swahili child ASR is the one genuinely hard unknown — Vosk has no Swahili model, the impressive Swahili WERs are fine-tuned results (and Whisper beats MMS once you have ~400h of data you don’t have), and Omnilingual’s “300M edge” figure is an A100 datacenter number with no published phone/Pi benchmark. So: ship the online path on a cloud ASR API and label offline-Swahili-ASR “R&D” until a spike with a hard pass/fail bar proves otherwise. Likewise, a 1–3B LLM serving 30 kids offline on a $50–150 Pi is unproven (Ollama gives ~24.7s p99 latency at 50 users) — that likely needs a $300–600 GPU mini-PC, which changes your monetization math.

What you need to do.

Write the one-page AI-Tiering Table into the note — every AI feature tagged Tier 0/1/2, with an “offline behavior” column AND a “what audio/text leaves the device” column (the latter doubles as your privacy artifact). (1–2 days.)
State plainly what is NOT feasible on target hardware (fully-on-device LLM tutor on 2–3GB phones). Naming the boundary is your strongest credibility signal.
Ship the genuine floor: Tier-0 typed-answer loop + deterministic hint tree; Tier-2 cloud LLM via a store-and-forward SQLite outbox. Never promise real-time offline AI.
Run two gated spikes (Swahili/accented-child ASR accuracy + concurrency/throughput) on the actual target phone and candidate box, each with a pre-defined pass/fail bar. Budget months and consented data collection, not 1–2 weeks.
Source a transcribed Swahili child-speech corpus up front (Common Voice / Kencorpus / African-NLP); you will likely need to fine-tune.

5. Child-data privacy — named, not designed

The problem. Your entire user base is minors in Kenya — the most legally protected data category. The Data Protection Act 2019 (s.33) prohibits processing a child’s data without verifiable parental consent; “provision of education” is a mandatory ODPC registration sector regardless of company size, so the small-business exemption does not save you; cloud AI pushes children’s voice to overseas processors (a cross-border transfer); and basic/early-childhood education is an express strategic-interest category, very likely requiring a Kenya-resident serving copy of children’s data. The ODPC is actively enforcing (KES 26M+ in fines by late 2024; the first penalty was KES 5M against Oppo for posting one person’s photo — a direct warning for a moderated gallery of children’s work). Schools’ procurement will demand your ODPC certificate and DPIA before signing, so this is a sales gate, not just a legal one.

Can it be strengthened? Partially — fully addressable, but it’s two intertwined problems (compliance and a product downgrade), a real parallel workstream that will slow feature delivery for months, not a privacy page.

How. Minimize first, then govern. Two zero-code deliverables this week: a controllership map (B2B: school = controller, KodiLingo = processor under a s.42 written DPA; B2C camps = KodiLingo controller) and a data inventory (one row per data element: field, who collects, lawful basis, purpose, retention, leaves-Kenya?). Then re-architect child identity to pseudonymous-by-default (teacher-provisioned opaque usernames + avatars; no child name/face/precise-DOB/email; gallery shows avatars and handles only — the exact pattern that drew the Oppo fine).

What you need to do.

Decide controllership and build the data inventory (do them together; the inventory’s lawful-basis column needs controllership fixed first). (1–2 days, this week.)
Re-architect to pseudonymous child accounts; push real identity + parental consent to the school. (Days to a couple of weeks.)
Register with the ODPC (~KES 4,000 initial, KES 2,000 renewal, valid 24 months) and stand up: retention schedule, child- and parent-facing privacy notice, breach plan (ODPC within 72h), a manual data-subject-rights intake/log, yourself as DPO.
Plan Kenya-resident hosting with at least one serving copy of children’s data in-country — a recurring infra cost; budget it.
Draft both consent paths (lawyer-reviewed school DPA; B2C DOB-gate → parent-email → disclosed-purpose consent-link → immutable log → withdrawal) and conduct a children’s-data DPIA using the inventory as input.
Verify each cloud vendor (OpenAI/Google/Azure) will actually sign a DPA for children’s data and a Kenya-suitable transfer basis; have a drop-or-de-identify fallback.
Budget honestly, as one line: ~KES 4,000 registration + ~US$1,500–4,000 of Kenyan data-protection counsel across DPA/notice/DPIA/transfer reviews (not “a few hundred dollars” five times) + recurring Kenya hosting.

6. Cloud-AI unit economics and vendor lock-in

The problem. “AI everywhere” is metered per-minute/per-token; your customers are schools whose per-child software budget is a few dollars a year. Pipe every kid’s speech to Azure and every tutor turn to OpenAI and your variable cost per active student can exceed the license price — you lose money on every user and get worse as you succeed. Hard-wiring vendor SDKs also creates lock-in in a region with forex risk and intermittent connectivity.

Can it be strengthened? Yes — this is the weakness most amenable to engineering discipline.

How. Three layers of one design: a provider-abstraction layer (built before you couple to any SDK), local/open-source as the default driver — but only where it works, and caching + batching + per-student quotas on top, exposed to customers as free/paid/premium tiers priced off a measured cost ceiling.

Reconcile the contradiction the portfolio flagged: “make local the default” is true only for LanguageTool/spaCy/narration/deterministic hints. Pronunciation grading and open-ended tutoring are explicitly cloud-only paid-tier — not “local default with cloud fallback” — because Vosk is likely unusable for graded Kenyan-child speech. The arithmetic: a capped, active paid student costs roughly $0.35–0.55/month (~$4–7/yr) of cloud COGS if you cache and quota; the cost bombs are pronunciation assessment (Azure bills it only at the ~$1.32/audio-hr real-time rate — never cheap-in-batch) and real-time voice (~$0.017/min → $12–25+/yr for a chatty kid). Caching the identical-lesson speech/TTS across a class cuts 80–95%; quotas bound the tail.

What you need to do.

Build a one-page COGS spreadsheet (R/data — your turf) computing cost/student/month from usage knobs × 2026 unit prices, with cache-hit and batch multipliers. Add two terms the original omitted: per-school box hardware + support cost, and a pupils-per-box concurrency ceiling. Output the break-even school count where local-first actually beats cached cloud. (Half a day.)
Write the provider-abstraction interface before any vendor call: Transcribe, AssessPronunciation, Tutor, Synthesize, each with a local and a cloud driver, selected by tier + connectivity. (2–4 days.)
Stand up the open-source stack (LanguageTool self-host, spaCy, narration) as the free-tier/offline default. (1–2 weeks.)
Implement caching (content-hash per locale), batching for non-real-time grading, and hard per-student quotas. (1–2 weeks.)
Tier as free/paid/premium, validate the price against the per-child software/ICT line item (not tuition) from 2–3 real low-fee schools, and decide box financing (sell/lease/grant) in the same conversation.
Ship per-student cost telemetry with the first cloud driver so margin erosion is an alert, not a year-end surprise.

7. Heavy hardware / VR as a v1 dependency

The problem. The note lists micro:bit/Arduino/Pi, a virtual simulator, virtual labs, AND VR at the same priority tier. They are not equal. Physical kits and VR are the biggest threat to adoption for reasons unrelated to software quality: per-unit cost in hard currency, import friction, breakage/theft, no mains power, no on-site technician, untrained teachers. If v1 adoption depends on a school owning kit, the product is dead on arrival in exactly the schools it serves. A simulator and virtual labs are pure software that ship over the same offline channel, cost nothing per student, and never break.

Can it be strengthened? Yes. The mistake isn’t wanting hardware — it’s letting hardware be a dependency instead of an optional layer.

How. Virtual-first behind a hardware-agnostic device API (motor.forward(), led.set(), sensor.distance()), with two backends: a software simulator (the v1 default) and a hardware adapter (deferred). Demote physical robotics and VR out of v1 explicitly.

Build-vs-embed correction: before building a bespoke p5.js/Phaser simulator, spend half a day deciding — MakeCode (already in your tech menu) ships a micro:bit simulator AND flashes real hardware from one block program, delivering ~70% of the “write once, sim-or-hardware” promise for free. Bespoke sim only for robot/lab worlds MakeCode can’t express. Make this decision here and in §3 so the coding runtime is chosen once. And don’t freeze the device abstraction with zero hardware in hand: spec it provisionally, build the sim, then buy exactly one micro:bit (~US$20) to validate against real serial/flash timing before locking it.

Don’t forget: “Tier-0” is not zero-hardware — every classroom still needs a delivery device. The note already names the answer (BRCK Kio Kit, a Kenyan-made ~US$5,000 rugged offline box already in schools and refugee camps; or a local Pi server serving BYOD phones). Validate that as your primary Tier-0 target — it’s also a ready distribution partner.

What you need to do.

Write a one-page hardware-tiering doc: Tier 0 (core, every school) = simulator + virtual labs; Tier 1 (optional, funded) = micro:bit shared cart/lease; Tier 2 (deferred) = VR. State plainly: NO Tier 1/2 item is a v1 dependency. (1–2 hours.)
Decide build-vs-embed (lean MakeCode) in a half-day spike before the runtime hardens.
Validate the Kio Kit / local-server delivery path and name the minimum device spec (RAM/browser/WASM, 7-inch touch, Android 9/Chrome 90).
Buy ONE micro:bit (~$20) to validate the abstraction against real hardware before freezing the API — not a class set.
Draft a hardware-pilot funding ask with real landed numbers (a 15-board cart + sensors + lockable case ≈ US$300–600 before duty; lease/insurance covers breakage), framed as “kits dropping into a platform already used by N kids.” Treat funding as a 6–18 month hypothesis the software roadmap never depends on.
Park VR; optionally prototype one WebXR/Three.js scene on existing phones for the immersive marketing hook (zero CapEx).

8. No metrics, scale, team, budget, or timeline

The problem. The note is 13 pillars with no way to tell whether any of them worked, how big the bet is, or when you’ll know. Without metrics you can’t prioritize; without a measured learning gain you can’t raise money (East-African funders fund outcomes, not feature lists); without a timeline/budget you can’t see that the full scope implies a 10–15-person team and a multi-year burn you can’t personally fund. This is the cheapest possible intervention — writing and a spreadsheet — and it de-risks the other seven weaknesses.

Can it be strengthened? Yes — it’s the most fixable weakness, and fixing it changes how every other weakness gets judged.

How. Maintain ONE living one-page operating plan: a learning-outcome North Star (“weekly active learners who completed a lesson AND showed a measured skill improvement” — this rejects vanity DAU), a metric tree (Acquisition/Activation/Engagement/Learning-outcome/Retention/Referral, each tagged leading vs lagging), an offline-queued event layer built into the first MVP, and a milestone-gated hiring ladder whose first non-founder spend is a CBE-pedagogy contractor, not another engineer.

Corrections: DeepSpeech is dead (Mozilla archived it June 2025) — strike it; use Vosk or whisper.cpp, and remember Vosk transcribes but does not score pronunciation. Kenya rebranded CBC → CBE (April 2024, 2-6-3-3 structure) — switch all school/funder-facing language or you read as out of date. Define your learning-outcome instrument up front (one narrow objectively-scorable Blockly skill, fixed pre/post tasks) and describe it to funders honestly as a signal, not causal proof (uncontrolled n=20–40). Don’t route child events to a US-hosted Google Sheet — that’s regulated data (§5).

What you need to do.

Write the North Star sentence and the metric tree (six buckets, one metric each, leading/lagging tags, numbers blank). (½ day.)
Spec the 10 day-one events (session_start, lesson_started, lesson_completed, quiz_scored, skill_node_mastered, hint_requested, streak_incremented, project_published, teacher_login, sync_completed) queued offline in SQLite — this schema IS your sync-queue spine. (½ day to spec, build in Phase 0.)
Fill the three-phase table (0–3 / 3–9 / 9–18 mo): scale target, budget band, team shape, one unlock milestone each (using bands below; get two real local quotes — a CBE curriculum day-rate and a teacher-PD day-rate — before trusting any number).
Write the milestone-gated hiring ladder; mark your FIRST investment as a part-time CBE lower-primary curriculum contractor (~US$300–800/mo part-time).
Add a Funder-Readiness checklist (offline MVP; signed pilot + 4 weeks data; measured learning gain; retention number; willingness-to-pay/CSR LOI); target small catalytic grants/competitions first, not big foundations.
Calendar a recurring 2-hour monthly metrics review. A plan nobody updates rots.

Cross-cutting issues you must own

These span multiple plans and are owned by none — call them out explicitly:

Content/lesson authoring is the real bottleneck, owned by no plan and your weakest skill. Authoring 5–8 CBE-mapped Blockly lessons (block challenges + narration + vocabulary + deterministic checks + strand mapping) is the critical path. Recruit one practicing CBE lower-primary teacher in week one as a co-author — a co-founder-grade dependency.
Curriculum currency: CBC → CBE everywhere school- and funder-facing.
Founder bandwidth / runway is a hard scalar. Summed, this portfolio is multiple full-time months for one part-time, unfunded, non-pedagogy person. Run it against ONE calendar (below) and an honest runway figure; assume part-time roughly doubles every timeline.
Go-to-market / buyer identity is unresolved (B2B school / B2G CS-for-All / CSR / B2C camps). Who pays determines the data model (controller vs processor), consent flow, price ceiling, and AI economics. Pin it via the discovery conversations before architecture and pricing harden.
Teacher onboarding / change management is undesigned. The pilot depends on a teacher running a fused lesson on flaky hardware with 30 kids — that needs a short PD asset and a “when tech fails” fallback script (also a monetization line and a consent prerequisite).
IP / licensing inventory (Blockly Apache-2.0, MakeCode, Scratch-blocks trademark, p5.js, LanguageTool LGPL/AGPL self-host vs cloud, Vosk Apache, Phaser, model weights) — a one-page license inventory before shipping or fundraising.
Content localization ≠ UI localization. Swahili content (narration, vocabulary, reading) is the pedagogically central, larger effort; decide which lessons are bilingual from day one. UI Swahili is the deferrable QA-surface item.
Device/delivery reality below the app layer: decide the single “what physical thing sits in the classroom, who owns/powers/repairs it” question, with power-loss, mic-in-noisy-room, and fleet-management failure modes.

Contradictions to resolve once, portfolio-wide

Contradiction	Resolution
Web Speech API as v1 default	Typed answer + deterministic keyword check is the reliable default; speech is best-effort only.
Pronunciation in Phase 1	Vocabulary/reading in Phase 1; pronunciation scoring deferred behind a passing ASR spike, never on the Phase-1 critical path.
“Local AI default” broadly	Local default ONLY for LanguageTool/spaCy/narration/hints; pronunciation grading + tutoring are cloud-only paid.
Bespoke simulator vs MakeCode	Decide build-vs-embed (lean MakeCode) before the Blockly runtime hardens.
Cloud vendors vs data residency	Confirm each vendor signs a DPA for children’s data + lawful Kenya transfer before pricing any cloud-AI tier; bake a drop-or-de-identify fallback into the COGS sheet.

Missing weaknesses the critic surfaced

Is the Code+Speak fusion pedagogically sound, or just novel? Half-day literature scan + ask 2–3 CBE teachers whether narrating code aids learning or adds friction; pre-register the kill criterion.
Reliability/data-loss in real classrooms is under-owned → SQLite WAL + nightly local snapshot + idempotent sync from the first build; test a hard power-pull mid-lesson before any pilot.
Child safeguarding beyond privacy (moderation, no public chat, mandatory-reporting posture) → one-page safeguarding policy: avatars/handles only, no public chat in v1, galleries deferred, named escalation contact.
Accessibility floor, not all-deferred → keyboard-operable Blockly + audio narration on every lesson + sufficient contrast (narration also de-risks low-literacy learners).
Learning-outcome instrument undefined → one narrow objectively-scorable Blockly skill, fixed pre/post, honestly framed as a signal.
Key-person / bus-factor risk → name the CBE teacher as a co-founder-grade dependency; content in git, schema documented.

Do this first

Four tracks against one calendar, not eight “do-this-first” lists.

Week 1 — paper, parallel (~3–4 focused days total). Cheap, high-leverage, unblocks everything:

The one-page wedge + positioning + NOT-in-v1 doc.
The RICE + COGS + data-inventory sheets in R/CSV (scope, unit economics, compliance resolved on paper before code).
The architecture ADR — with the Blockly-vs-MakeCode and controllership decisions made here.
The North Star + offline event schema.

Weeks 1–3 — start now, run for months in the background (do NOT write product code until discovery passes its gate):

Recruit the CBE teacher co-author (the true bottleneck).
Open ODPC registration + draft DPA/consent path (Day-0 legal blocker for any child data).
Run 15–20 discovery conversations with the pre-committed kill rule.
Outreach for ONE pilot classroom — school-term lead time is the long pole.

Weeks 2–12 — sequential build, gated on discovery + the teacher co-author:

Build the offline-first walking skeleton end-to-end (Blockly → typed-answer keyword check → XP → SQLite with WAL + snapshot → teacher list), runnable fully offline on a real low-end device. Prove the idempotent sync queue early — it’s the one genuinely hard piece and the spine of your evidence pipeline.
Author 5–8 CBE lessons WITH the teacher (the slow part).
Layer privacy controls into the skeleton (PIN login / no child PII / safeguarding policy), not after.
Run the pilot with pre-declared success metric AND kill criterion, written consent, and a device/mic check.

Strictly sequential: discovery gate → build · teacher co-author → lesson authoring · offline coding loop proven → any speech/AI → any cloud-AI pricing. Parallelizable: the four paper artifacts; teacher recruiting + compliance + discovery + pilot outreach. Defer wholesale past the pilot: pronunciation scoring, AI tutor/grading, robotics hardware, VR, classroom-server LLM box, parent/admin portals, Swahili UI, moderated gallery.

Suggested phased roadmap

	Phase 0 (0–3 mo) Build the wedge	Phase 1 (3–9 mo) Prove learning + retention	Phase 2 (9–18 mo) Prove it travels + costs work	Phase 3 (post-funding) Expand
Scope	Blockly + ~10 CBE lessons (1 strand) + 1-branch skill tree + points/streak + typed-answer fused step + event layer	Offline content packs + sync; teacher dashboard; pre/post quiz; ONE language feature (vocabulary/reading)	School admin/bulk upload; parent progress view; rubric assessment; FIRST cloud-AI feature (AI hint) with hard offline fallback	AI tutor, adaptive learning, robotics pilot, virtual labs, gallery, VR (premium)
Architecture	PWA + SQLite on a laptop/Pi; walking skeleton; ADR locked	Idempotent sync queue proven; signed content packs; per-school box	Thin cloud sync tier; Kenya-resident data copy	Classroom-server LLM box (only if benchmarked)
AI	None on critical path; deterministic checks/hints only	ASR spike (gated); offline vocabulary; pronunciation deferred	First cloud AI behind abstraction + caching + quotas; cloud-only paid tier	Tier-2 features at scale, vendor-abstracted
Privacy	Pseudonymous accounts; controllership map; data inventory	ODPC registered; DPIA; consent flow; safeguarding policy	DPAs signed; transfer basis verified; Kenya hosting live	Audited governance; renewals
Metrics	Event layer live; North Star instrumented	Measured learning gain + week-4 retention + “I’d keep using this”	Repeatable onboarding + credible cost-per-learner + WTP/CSR LOI	Cohort outcomes, renewal rates
Scale / team	1 school, ~20–40 learners; you + CBE teacher (PT)	2–4 schools, ~150–400; you + 2 PT contractors	8–15 schools, ~1,000–3,000; you + 1 FT + 1–2 PT	Funded team
Budget band	~US$0–300 + ~$20 box	~US$1k–6k	~US$15k–60k+	Grant/investment-funded
Unlock milestone	A stranger completes a lesson offline, visible in your dashboard	A measured pre/post gain + a retention number + a teacher who’d keep using it	Repeatable onboarding + credible cost-per-learner + an LOI — this is where funding conversations get real	—

Budget bands are order-of-magnitude planning placeholders for a solo R/data founder — replace with your own RICE Effort and two real local quotes.

Bottom line

Yes — all eight weaknesses are strengthenable, and the work is mostly paper and discipline before it is code. The single reframe that fixes the most: stop treating this as a 13-pillar platform and treat it as one narrow, defensible wedge — a CBE coding lesson that is an English-literacy exercise — shipped offline-first, typed-answer-first, into one real classroom with a pre-declared kill criterion. Write the four paper artifacts (wedge, RICE/COGS/inventory, architecture ADR, operating plan) this week; recruit one CBE teacher; start ODPC registration and discovery in parallel; then build a thin loop end-to-end while the lessons are co-authored. The privacy track is real legal-and-product work, not a policy page; the offline-vs-cloud tension is resolved by honesty (typed default, cloud AI as a bandwidth-bound bonus, pronunciation deferred behind a benchmark); and your binding constraint throughout is your own hours plus lesson-authoring, so schedule the teacher and the lessons as the critical path and let the architecture be boring on purpose. Do that, and within ~6 months you’ll know — from real Kenyan kids on real low-end devices — whether KodiLingo should exist. That answer, not another pillar, is what unlocks everything else.