6.6 KiB

Raw Permalink Blame History

Roadmap

This document summarizes the current prioritized improvement roadmap for Didactopus as a learner-facing system.

The ordering is intentional. The project should first strengthen the graph-grounded mentor loop that defines the real learner task, then use that stable backbone for local-model evaluation, accessibility work, and broader UX improvements.

Priorities

1. Graph-grounded conversational mentor loop

Status: in progress

Why first:

It defines the actual learner-facing interaction Didactopus is trying to support.
It makes later benchmarking and accessibility work target a real session model rather than an abstract idea.
It uses the graph and source-corpus artifacts already present in the repository.

Near-term scope:

continue strengthening the learner session backend
make mentor, practice, and evaluator turns consistently source-grounded
improve trust-preserving feedback behavior
extend the session flow beyond one short interaction

Current code anchors:

didactopus.learner_session
didactopus.learner_session_demo
didactopus.graph_retrieval
didactopus.ocw_rolemesh_transcript_demo

2. Local-model adequacy benchmark for constrained hardware

Status: planned

Why next:

The learner loop should be benchmarked as soon as its task shape is stable.
Adequate local models on low-cost hardware would materially improve access in underserved regions.
Didactopus does not need a single perfect model; it needs role-adequate behavior.

Primary questions:

Which models are adequate for mentor, practice, and evaluator roles?
What latency, memory, and throughput are acceptable on Raspberry Pi-class hardware?
Which roles can degrade gracefully to smaller models?

Expected outputs:

benchmark tasks grounded in the MIT OCW pack
per-role adequacy scores
recommended deployment profiles for low-end, laptop, and stronger local systems

3. Accessibility-first learner interaction

Status: planned

Why high priority:

Didactopus has clear potential for learners who do not have access to enough teachers or tutors.
Blind learners and other accessibility-focused use cases benefit directly from structured, guided interaction.
Voice and text accessibility can build on the same learner-session backend.

Target features:

screen-reader-friendly learner output
accessible HTML alternatives to purely visual artifacts
text-first navigation of concept neighborhoods and progress
explicit structural cues in explanations and feedback

4. Voice interaction with local STT and TTS

Status: planned

Why after accessibility baseline:

The project should first ensure that the session structure is accessible in text.
Voice interaction is more useful once the mentor loop and pending-response behavior are stable.

Target features:

speech-to-text input for learner answers
text-to-speech output for mentor, practice, and evaluator turns
spoken waiting notices during slow local-model responses
repeat, interrupt, and slow-down controls

5. Learner workbench UI

Status: planned

Why important:

The repository has review-focused interfaces and generated artifacts, but the learner path is still fragmented.
A dedicated learner workbench would make Didactopus more usable as a personal mentor rather than only a pipeline/demo system.

Target features:

current concept and why-it-matters view
prerequisite chain and supporting lessons
grounded source excerpts
active practice task
evaluator feedback
recommended next step

6. Adaptive diagnostics and practice refinement

Status: planned

Why this matters:

Learners need clearer answers to “what am I weak at?” and “what should I do next?”
The repository already has evidence and evaluator machinery that can be surfaced in learner terms.

Target features:

weak-dimension summaries by concept
misconception tracking
remedial branch suggestions
hint ladders and difficulty control
oral, short-answer, and compare-and-contrast practice modes

7. Source-grounded citation transparency

Status: planned

Why it matters:

Trust depends on showing what is grounded in source material and what is model inference.
This is especially important for learners using local models with variable quality.

Target features:

lesson and source-fragment references in explanations
explicit distinction between cited source support and model inference
easier inspection of concept-to-source provenance

8. Pack quality, review, and concept-graph curation improvements

Status: planned

Why later:

These are important, but they mainly improve the quality of the learning substrate rather than the immediate learner interaction.
The graph-first path should first prove out the learner experience it supports.

Target features:

concept merge and split workflows
alias handling across packs
impact analysis for concept edits
stronger review support for noisy or broad concepts
improved source coverage QA

9. Incremental re-ingestion and course updates

Status: planned

Why useful:

External course repositories are now part of the intended workflow.
Didactopus should avoid full rebuilds when only part of a source tree changes.

Target features:

changed-file detection
stable concept and fragment IDs where possible
graph and pack diffs
preservation of learner evidence across source updates

10. Richer multimodal and notation support

Status: longer-term

Why longer-term:

This work is valuable but more specialized and technically demanding than the earlier roadmap items.

Examples:

spoken math rendering improvements
diagram descriptions
accessible handling of image-heavy source materials
EPUB and other learner-friendly export targets

Guiding Principles

Use the graph and source corpus before relying on model prior knowledge.
Optimize for guided learning, not answer offloading.
Prefer role-adequate local models over chasing a single best model.
Keep accessibility and low-cost deployment in scope from the start, not as cleanup work.
Preserve provenance and license compliance as first-class constraints.
Advance the current roadmap without assuming abundant compute, fluent English, expert supervision, or mature learners.

Suggested Implementation Sequence

Strengthen didactopus.learner_session into the standard session backend.
Build a small model-benchmark harness around that backend.
Add accessible learner HTML and text-first outputs.
Add local TTS and STT support to the same session flow.
Expand adaptive practice and diagnostics.
Improve review, impact analysis, and incremental update support.

6.6 KiB Raw Permalink Blame History

Roadmap

Priorities

1. Graph-grounded conversational mentor loop

2. Local-model adequacy benchmark for constrained hardware

3. Accessibility-first learner interaction

4. Voice interaction with local STT and TTS

5. Learner workbench UI

6. Adaptive diagnostics and practice refinement

7. Source-grounded citation transparency

8. Pack quality, review, and concept-graph curation improvements

9. Incremental re-ingestion and course updates

10. Richer multimodal and notation support

Guiding Principles

Suggested Implementation Sequence

6.6 KiB

Raw Permalink Blame History