658 B
658 B
Didactopus Local Model Benchmark
- Provider:
stub - Hardware profile:
pi-minimal - Primary concept: Independent Reasoning and Careful Comparison
- Secondary concept: Thermodynamics and Entropy
- Overall adequacy: borderline (0.667)
- Recommended use: Use with caution; responses should stay in review.
Role Results
mentorvialocal-demo: borderline (0.65), latency 0.027 ms Notes: Did not ask a focused learner question.practicevialocal-demo: adequate (1.0), latency 0.004 msevaluatorvialocal-demo: inadequate (0.35), latency 0.003 ms Notes: Did not acknowledge learner strengths.; Did not provide a concrete next step.