Didactopus/examples/model-benchmark/model_benchmark.md

658 B

Didactopus Local Model Benchmark

  • Provider: stub
  • Hardware profile: pi-minimal
  • Primary concept: Independent Reasoning and Careful Comparison
  • Secondary concept: Thermodynamics and Entropy
  • Overall adequacy: borderline (0.667)
  • Recommended use: Use with caution; responses should stay in review.

Role Results

  • mentor via local-demo: borderline (0.65), latency 0.027 ms Notes: Did not ask a focused learner question.
  • practice via local-demo: adequate (1.0), latency 0.004 ms
  • evaluator via local-demo: inadequate (0.35), latency 0.003 ms Notes: Did not acknowledge learner strengths.; Did not provide a concrete next step.