GenieHive/tests
welberr b4e5a1af7d P0: remove dead default_strategy field; fix benchmark quality score
Remove RoutingConfig.default_strategy: the field was never read by
resolve_route() or any other code path, creating a false impression
that routing behaviour was configurable. Also removed from all three
example config files.

Fix _benchmark_quality_score: the previous implementation used max()
for correctness signals and then *added* speed bonuses on top, allowing
the score to accumulate past 1.0 before the final clamp. Speed bonuses
were therefore dead weight whenever pass_rate or quality_score was
already ≥ 0.65. Replace with an explicit weighted average: correctness
(pass_rate / quality_score) carries 0.65 and a normalised speed
component carries 0.35. When no correctness signal is available the
speed component carries full weight. Score is always in [0, 1] without
needing a clamp.
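
A minimal sketch of the weighted average described above, for
illustration only. The function name, its parameters, the
tokens-per-second speed measure and the max_tokens_per_sec ceiling are
all hypothetical; the commit message does not say how speed is
normalised or how pass_rate and quality_score are combined, so this
sketch assumes the stronger of the two correctness signals is used and
that a missing speed measurement contributes zero.

```python
from typing import Optional

CORRECTNESS_WEIGHT = 0.65
SPEED_WEIGHT = 1.0 - CORRECTNESS_WEIGHT  # 0.35; the two weights sum to exactly 1.0


def _unit(value: float) -> float:
    """Normalise one input signal into [0, 1] (assumption: inputs may be out of range)."""
    return max(0.0, min(1.0, value))


def benchmark_quality_score_sketch(
    pass_rate: Optional[float] = None,
    quality_score: Optional[float] = None,
    tokens_per_sec: Optional[float] = None,
    *,
    max_tokens_per_sec: float = 100.0,  # hypothetical normalisation ceiling, must be > 0
) -> float:
    # Correctness: assumed to be the stronger of the available signals.
    signals = [_unit(s) for s in (pass_rate, quality_score) if s is not None]
    correctness = max(signals) if signals else None

    # Speed: assumed linear normalisation against a fixed ceiling.
    speed = None
    if tokens_per_sec is not None:
        speed = _unit(tokens_per_sec / max_tokens_per_sec)

    if correctness is None and speed is None:
        return 0.0  # empty input scores zero
    if correctness is None:
        return speed  # speed-only case: speed carries full weight
    if speed is None:
        return CORRECTNESS_WEIGHT * correctness  # assumption: missing speed counts as 0

    # Both components are in [0, 1] and the weights sum to 1,
    # so the result is in [0, 1] without a final clamp.
    return CORRECTNESS_WEIGHT * correctness + SPEED_WEIGHT * speed
```

Under these assumptions a fast run can never outscore a markedly more
correct one by speed alone, and adding a speed measurement never lowers
a score that was computed from correctness only.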

Add test_benchmark_quality_score_stays_bounded_and_weighted to lock in
the corrected behaviour: bounded at 1.0, correctness-dominant, speed-
only case non-zero, empty input zero, speed bonus never hurts.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-27 10:45:04 -04:00
conftest.py Initial commit 2026-04-07 13:17:28 -04:00
test_benchmark_runner.py Add benchmarked route matching and request shaping 2026-04-07 14:45:32 -04:00
test_benchmarks.py Add benchmarked route matching and request shaping 2026-04-07 14:45:32 -04:00
test_control_chat.py Add benchmarked route matching and request shaping 2026-04-07 14:45:32 -04:00
test_control_registry.py P0: remove dead default_strategy field; fix benchmark quality score 2026-04-27 10:45:04 -04:00
test_demo_flow.py Initial commit 2026-04-07 13:17:28 -04:00
test_node_inventory.py Initial commit 2026-04-07 13:17:28 -04:00
test_smoke.py Initial commit 2026-04-07 13:17:28 -04:00