Add evidence trail learner workbench
This commit is contained in:
parent
59d45c2942
commit
074999fbe1
|
|
@ -13,3 +13,5 @@ build/
|
||||||
configs/config.yaml
|
configs/config.yaml
|
||||||
tmp-*
|
tmp-*
|
||||||
codex*
|
codex*
|
||||||
|
ops/
|
||||||
|
webui/node_modules/
|
||||||
|
|
|
||||||
17
README.md
17
README.md
|
|
@ -37,6 +37,15 @@ Didactopus is built around a few core ideas:
|
||||||
- explanations, critique, and next-step advice should preserve learner trust
|
- explanations, critique, and next-step advice should preserve learner trust
|
||||||
- local and low-cost deployment matter for access
|
- local and low-cost deployment matter for access
|
||||||
|
|
||||||
|
It should also operate under a scientific-virtues outlook. In practice that
|
||||||
|
means Didactopus should reinforce habits such as:
|
||||||
|
|
||||||
|
- curiosity about the question rather than premature closure
|
||||||
|
- honesty about what is observed versus what is inferred
|
||||||
|
- skepticism toward weakly supported claims, including model-generated claims
|
||||||
|
- attentiveness to source quality, caveats, and uncertainty
|
||||||
|
- willingness to revise when better evidence changes the picture
|
||||||
|
|
||||||
In practice, that means Didactopus tries to help with:
|
In practice, that means Didactopus tries to help with:
|
||||||
|
|
||||||
- topic structure
|
- topic structure
|
||||||
|
|
@ -198,6 +207,14 @@ It is also meant to support the pedagogy around learning:
|
||||||
- exporting evidence and capability artifacts
|
- exporting evidence and capability artifacts
|
||||||
- supporting multilingual and accessible outputs
|
- supporting multilingual and accessible outputs
|
||||||
|
|
||||||
|
Operationally, the scientific-virtues framing means Didactopus should:
|
||||||
|
|
||||||
|
- separate observation from interpretation in learner-facing flows
|
||||||
|
- reward justified revision rather than answer persistence
|
||||||
|
- surface uncertainty explicitly instead of smoothing it away
|
||||||
|
- push learners toward source comparison and evidence quality checks
|
||||||
|
- avoid presenting confident unsupported synthesis as settled knowledge
|
||||||
|
|
||||||
This is why the repository contains review workspaces, validation flows, knowledge graphs, and capability export machinery rather than only a chat interface.
|
This is why the repository contains review workspaces, validation flows, knowledge graphs, and capability export machinery rather than only a chat interface.
|
||||||
|
|
||||||
## Grounded AI Learner And Skill Production
|
## Grounded AI Learner And Skill Production
|
||||||
|
|
|
||||||
|
|
@ -22,6 +22,8 @@ Near-term scope:
|
||||||
- make mentor, practice, and evaluator turns consistently source-grounded
|
- make mentor, practice, and evaluator turns consistently source-grounded
|
||||||
- improve trust-preserving feedback behavior
|
- improve trust-preserving feedback behavior
|
||||||
- extend the session flow beyond one short interaction
|
- extend the session flow beyond one short interaction
|
||||||
|
- make scientific virtues operational in the session loop by separating observation from interpretation, preserving uncertainty, and rewarding justified revision
|
||||||
|
- replace stubbed provider output in learner-facing pilot flows with configured real model backends where available
|
||||||
|
|
||||||
Current code anchors:
|
Current code anchors:
|
||||||
|
|
||||||
|
|
@ -102,6 +104,25 @@ Target features:
|
||||||
- active practice task
|
- active practice task
|
||||||
- evaluator feedback
|
- evaluator feedback
|
||||||
- recommended next step
|
- recommended next step
|
||||||
|
- first external pilot should use the `evidence-trail` evo-edu pack as a learner-workbench test case
|
||||||
|
|
||||||
|
Current progress:
|
||||||
|
|
||||||
|
- the first external pilot pack now exists at `domain-packs/evidence-trail/`
|
||||||
|
- `pack_to_frontend` output is generated and copied into `webui/public/packs/evidence-trail-pack.json`
|
||||||
|
- the web UI now has a learner-workbench launcher and `Evidence Trail` pilot mode in addition to the review workbench
|
||||||
|
- the learner pilot exposes question, observation, interpretation, uncertainty, and revision-trigger fields directly in the UI
|
||||||
|
- scientific virtues are now reflected in the UI framing and in backend learner-session prompt construction
|
||||||
|
- the backend now exposes `POST /api/learner-workbench/session`
|
||||||
|
- end-to-end verification succeeded locally: the API starts, the endpoint returns structured concept/session output, and the frontend/backend contract is working
|
||||||
|
|
||||||
|
Immediate next steps:
|
||||||
|
|
||||||
|
- replace current stubbed mentor/practice/evaluator text with a configured real provider path
|
||||||
|
- enrich the `Evidence Trail` pack with grounded source fragments so returned guidance is based on more than pack metadata
|
||||||
|
- persist learner-session state instead of treating each call as a stateless step
|
||||||
|
- connect learner progress, evidence, and revision history to the standard backend session model
|
||||||
|
- define deployment notes for running the learner workbench against the local API outside development mode
|
||||||
|
|
||||||
Current pilot state:
|
Current pilot state:
|
||||||
|
|
||||||
|
|
@ -208,13 +229,18 @@ Examples:
|
||||||
- Keep accessibility and low-cost deployment in scope from the start, not as cleanup work.
|
- Keep accessibility and low-cost deployment in scope from the start, not as cleanup work.
|
||||||
- Preserve provenance and license compliance as first-class constraints.
|
- Preserve provenance and license compliance as first-class constraints.
|
||||||
- Advance the current roadmap without assuming abundant compute, fluent English, expert supervision, or mature learners.
|
- Advance the current roadmap without assuming abundant compute, fluent English, expert supervision, or mature learners.
|
||||||
|
- Treat scientific virtues as operational principles: encourage curiosity, honesty about evidence, skepticism toward weak claims, attentiveness to caveats, and revision when the evidence changes.
|
||||||
|
- Separate observation from interpretation in learner-facing guidance so the system does not blur grounded support with model inference.
|
||||||
|
- Frame revision as progress rather than as failure, especially in mentor and evaluator feedback.
|
||||||
|
|
||||||
## Suggested Implementation Sequence
|
## Suggested Implementation Sequence
|
||||||
|
|
||||||
1. Strengthen `didactopus.learner_session` into the standard session backend.
|
1. Strengthen `didactopus.learner_session` into the standard session backend.
|
||||||
2. Fold the learner-workbench pilot into that backend without losing its stronger study-state framing.
|
2. Fold the learner-workbench pilot into that backend without losing its stronger study-state framing.
|
||||||
3. Build a small model-benchmark harness around the unified learner backend.
|
3. Replace stubbed learner-workbench provider output with a configured real model backend.
|
||||||
4. Add accessible learner HTML and text-first outputs.
|
4. Ground the `evidence-trail` pilot in richer source fragments and persisted learner state.
|
||||||
5. Add local TTS and STT support to the same session flow.
|
5. Build a small model-benchmark harness around the unified learner backend.
|
||||||
6. Expand adaptive practice and diagnostics.
|
6. Add accessible learner HTML and text-first outputs.
|
||||||
7. Improve review, impact analysis, and incremental update support.
|
7. Add local TTS and STT support to the same session flow.
|
||||||
|
8. Expand adaptive practice and diagnostics.
|
||||||
|
9. Improve review, impact analysis, and incremental update support.
|
||||||
|
|
|
||||||
File diff suppressed because it is too large
Load Diff
Loading…
Reference in New Issue