Didactopus/docs/roadmap.md

6.6 KiB

Roadmap

This document summarizes the current prioritized improvement roadmap for Didactopus as a learner-facing system.

The ordering is intentional. The project should first strengthen the graph-grounded mentor loop that defines the real learner task, then use that stable backbone for local-model evaluation, accessibility work, and broader UX improvements.

Priorities

1. Graph-grounded conversational mentor loop

Status: in progress

Why first:

  • It defines the actual learner-facing interaction Didactopus is trying to support.
  • It makes later benchmarking and accessibility work target a real session model rather than an abstract idea.
  • It uses the graph and source-corpus artifacts already present in the repository.

Near-term scope:

  • continue strengthening the learner session backend
  • make mentor, practice, and evaluator turns consistently source-grounded
  • improve trust-preserving feedback behavior
  • extend the session flow beyond one short interaction

Current code anchors:

  • didactopus.learner_session
  • didactopus.learner_session_demo
  • didactopus.graph_retrieval
  • didactopus.ocw_rolemesh_transcript_demo

2. Local-model adequacy benchmark for constrained hardware

Status: planned

Why next:

  • The learner loop should be benchmarked as soon as its task shape is stable.
  • Adequate local models on low-cost hardware would materially improve access in underserved regions.
  • Didactopus does not need a single perfect model; it needs role-adequate behavior.

Primary questions:

  • Which models are adequate for mentor, practice, and evaluator roles?
  • What latency, memory, and throughput are acceptable on Raspberry Pi-class hardware?
  • Which roles can degrade gracefully to smaller models?

Expected outputs:

  • benchmark tasks grounded in the MIT OCW pack
  • per-role adequacy scores
  • recommended deployment profiles for low-end, laptop, and stronger local systems

3. Accessibility-first learner interaction

Status: planned

Why high priority:

  • Didactopus has clear potential for learners who do not have access to enough teachers or tutors.
  • Blind learners and other accessibility-focused use cases benefit directly from structured, guided interaction.
  • Voice and text accessibility can build on the same learner-session backend.

Target features:

  • screen-reader-friendly learner output
  • accessible HTML alternatives to purely visual artifacts
  • text-first navigation of concept neighborhoods and progress
  • explicit structural cues in explanations and feedback

4. Voice interaction with local STT and TTS

Status: planned

Why after accessibility baseline:

  • The project should first ensure that the session structure is accessible in text.
  • Voice interaction is more useful once the mentor loop and pending-response behavior are stable.

Target features:

  • speech-to-text input for learner answers
  • text-to-speech output for mentor, practice, and evaluator turns
  • spoken waiting notices during slow local-model responses
  • repeat, interrupt, and slow-down controls

5. Learner workbench UI

Status: planned

Why important:

  • The repository has review-focused interfaces and generated artifacts, but the learner path is still fragmented.
  • A dedicated learner workbench would make Didactopus more usable as a personal mentor rather than only a pipeline/demo system.

Target features:

  • current concept and why-it-matters view
  • prerequisite chain and supporting lessons
  • grounded source excerpts
  • active practice task
  • evaluator feedback
  • recommended next step

6. Adaptive diagnostics and practice refinement

Status: planned

Why this matters:

  • Learners need clearer answers to “what am I weak at?” and “what should I do next?”
  • The repository already has evidence and evaluator machinery that can be surfaced in learner terms.

Target features:

  • weak-dimension summaries by concept
  • misconception tracking
  • remedial branch suggestions
  • hint ladders and difficulty control
  • oral, short-answer, and compare-and-contrast practice modes

7. Source-grounded citation transparency

Status: planned

Why it matters:

  • Trust depends on showing what is grounded in source material and what is model inference.
  • This is especially important for learners using local models with variable quality.

Target features:

  • lesson and source-fragment references in explanations
  • explicit distinction between cited source support and model inference
  • easier inspection of concept-to-source provenance

8. Pack quality, review, and concept-graph curation improvements

Status: planned

Why later:

  • These are important, but they mainly improve the quality of the learning substrate rather than the immediate learner interaction.
  • The graph-first path should first prove out the learner experience it supports.

Target features:

  • concept merge and split workflows
  • alias handling across packs
  • impact analysis for concept edits
  • stronger review support for noisy or broad concepts
  • improved source coverage QA

9. Incremental re-ingestion and course updates

Status: planned

Why useful:

  • External course repositories are now part of the intended workflow.
  • Didactopus should avoid full rebuilds when only part of a source tree changes.

Target features:

  • changed-file detection
  • stable concept and fragment IDs where possible
  • graph and pack diffs
  • preservation of learner evidence across source updates

10. Richer multimodal and notation support

Status: longer-term

Why longer-term:

  • This work is valuable but more specialized and technically demanding than the earlier roadmap items.

Examples:

  • spoken math rendering improvements
  • diagram descriptions
  • accessible handling of image-heavy source materials
  • EPUB and other learner-friendly export targets

Guiding Principles

  • Use the graph and source corpus before relying on model prior knowledge.
  • Optimize for guided learning, not answer offloading.
  • Prefer role-adequate local models over chasing a single best model.
  • Keep accessibility and low-cost deployment in scope from the start, not as cleanup work.
  • Preserve provenance and license compliance as first-class constraints.
  • Advance the current roadmap without assuming abundant compute, fluent English, expert supervision, or mature learners.

Suggested Implementation Sequence

  1. Strengthen didactopus.learner_session into the standard session backend.
  2. Build a small model-benchmark harness around that backend.
  3. Add accessible learner HTML and text-first outputs.
  4. Add local TTS and STT support to the same session flow.
  5. Expand adaptive practice and diagnostics.
  6. Improve review, impact analysis, and incremental update support.