Didactopus/examples/model-benchmark/model_benchmark.md

15 lines
658 B
Markdown

# Didactopus Local Model Benchmark
- Provider: `stub`
- Hardware profile: `pi-minimal`
- Primary concept: Independent Reasoning and Careful Comparison
- Secondary concept: Thermodynamics and Entropy
- Overall adequacy: borderline (0.667)
- Recommended use: Use with caution; responses should stay in review.
## Role Results
- `mentor` via `local-demo`: borderline (0.65), latency 0.027 ms
Notes: Did not ask a focused learner question.
- `practice` via `local-demo`: adequate (1.0), latency 0.004 ms
- `evaluator` via `local-demo`: inadequate (0.35), latency 0.003 ms
Notes: Did not acknowledge learner strengths.; Did not provide a concrete next step.