6.6 KiB
Roadmap
This document summarizes the current prioritized improvement roadmap for Didactopus as a learner-facing system.
The ordering is intentional. The project should first strengthen the graph-grounded mentor loop that defines the real learner task, then use that stable backbone for local-model evaluation, accessibility work, and broader UX improvements.
Priorities
1. Graph-grounded conversational mentor loop
Status: in progress
Why first:
- It defines the actual learner-facing interaction Didactopus is trying to support.
- It makes later benchmarking and accessibility work target a real session model rather than an abstract idea.
- It uses the graph and source-corpus artifacts already present in the repository.
Near-term scope:
- continue strengthening the learner session backend
- make mentor, practice, and evaluator turns consistently source-grounded
- improve trust-preserving feedback behavior
- extend the session flow beyond one short interaction
Current code anchors:
didactopus.learner_sessiondidactopus.learner_session_demodidactopus.graph_retrievaldidactopus.ocw_rolemesh_transcript_demo
2. Local-model adequacy benchmark for constrained hardware
Status: planned
Why next:
- The learner loop should be benchmarked as soon as its task shape is stable.
- Adequate local models on low-cost hardware would materially improve access in underserved regions.
- Didactopus does not need a single perfect model; it needs role-adequate behavior.
Primary questions:
- Which models are adequate for
mentor,practice, andevaluatorroles? - What latency, memory, and throughput are acceptable on Raspberry Pi-class hardware?
- Which roles can degrade gracefully to smaller models?
Expected outputs:
- benchmark tasks grounded in the MIT OCW pack
- per-role adequacy scores
- recommended deployment profiles for low-end, laptop, and stronger local systems
3. Accessibility-first learner interaction
Status: planned
Why high priority:
- Didactopus has clear potential for learners who do not have access to enough teachers or tutors.
- Blind learners and other accessibility-focused use cases benefit directly from structured, guided interaction.
- Voice and text accessibility can build on the same learner-session backend.
Target features:
- screen-reader-friendly learner output
- accessible HTML alternatives to purely visual artifacts
- text-first navigation of concept neighborhoods and progress
- explicit structural cues in explanations and feedback
4. Voice interaction with local STT and TTS
Status: planned
Why after accessibility baseline:
- The project should first ensure that the session structure is accessible in text.
- Voice interaction is more useful once the mentor loop and pending-response behavior are stable.
Target features:
- speech-to-text input for learner answers
- text-to-speech output for mentor, practice, and evaluator turns
- spoken waiting notices during slow local-model responses
- repeat, interrupt, and slow-down controls
5. Learner workbench UI
Status: planned
Why important:
- The repository has review-focused interfaces and generated artifacts, but the learner path is still fragmented.
- A dedicated learner workbench would make Didactopus more usable as a personal mentor rather than only a pipeline/demo system.
Target features:
- current concept and why-it-matters view
- prerequisite chain and supporting lessons
- grounded source excerpts
- active practice task
- evaluator feedback
- recommended next step
6. Adaptive diagnostics and practice refinement
Status: planned
Why this matters:
- Learners need clearer answers to “what am I weak at?” and “what should I do next?”
- The repository already has evidence and evaluator machinery that can be surfaced in learner terms.
Target features:
- weak-dimension summaries by concept
- misconception tracking
- remedial branch suggestions
- hint ladders and difficulty control
- oral, short-answer, and compare-and-contrast practice modes
7. Source-grounded citation transparency
Status: planned
Why it matters:
- Trust depends on showing what is grounded in source material and what is model inference.
- This is especially important for learners using local models with variable quality.
Target features:
- lesson and source-fragment references in explanations
- explicit distinction between cited source support and model inference
- easier inspection of concept-to-source provenance
8. Pack quality, review, and concept-graph curation improvements
Status: planned
Why later:
- These are important, but they mainly improve the quality of the learning substrate rather than the immediate learner interaction.
- The graph-first path should first prove out the learner experience it supports.
Target features:
- concept merge and split workflows
- alias handling across packs
- impact analysis for concept edits
- stronger review support for noisy or broad concepts
- improved source coverage QA
9. Incremental re-ingestion and course updates
Status: planned
Why useful:
- External course repositories are now part of the intended workflow.
- Didactopus should avoid full rebuilds when only part of a source tree changes.
Target features:
- changed-file detection
- stable concept and fragment IDs where possible
- graph and pack diffs
- preservation of learner evidence across source updates
10. Richer multimodal and notation support
Status: longer-term
Why longer-term:
- This work is valuable but more specialized and technically demanding than the earlier roadmap items.
Examples:
- spoken math rendering improvements
- diagram descriptions
- accessible handling of image-heavy source materials
- EPUB and other learner-friendly export targets
Guiding Principles
- Use the graph and source corpus before relying on model prior knowledge.
- Optimize for guided learning, not answer offloading.
- Prefer role-adequate local models over chasing a single best model.
- Keep accessibility and low-cost deployment in scope from the start, not as cleanup work.
- Preserve provenance and license compliance as first-class constraints.
- Advance the current roadmap without assuming abundant compute, fluent English, expert supervision, or mature learners.
Suggested Implementation Sequence
- Strengthen
didactopus.learner_sessioninto the standard session backend. - Build a small model-benchmark harness around that backend.
- Add accessible learner HTML and text-first outputs.
- Add local TTS and STT support to the same session flow.
- Expand adaptive practice and diagnostics.
- Improve review, impact analysis, and incremental update support.