GenieHive/scripts
welsberr e4f8b14437 Add smoke test, enable Ollama discovery in singlebox config, update demo doc
scripts/smoke_test.py: end-to-end validation script covering health, cluster
state, model catalog, route resolution, non-streaming chat (role + direct
asset), streaming chat (SSE validation + reasoning-strip check), embeddings,
and Ollama discovery metrics. Auto-detects targets from /v1/models; accepts
--chat-role, --chat-asset, --embed-asset overrides. Exit 0 if all pass/skip,
exit 1 on any failure.

configs/node.singlebox.ollama.example.yaml: add discover_protocol: "ollama"
to both services so the config works out of the box for Ollama discovery
testing without manual edits.

docs/llm_demo.md: update Current Readiness to reflect v1 complete feature set;
add Smoke Test section; add New Capabilities section covering streaming,
routing strategies, Ollama discovery, and role catalogs.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-27 15:13:31 -04:00
..
check_singlebox_health.sh Initial commit 2026-04-07 13:17:28 -04:00
demo_client_agent.py Initial commit 2026-04-07 13:17:28 -04:00
demo_inspect.sh Initial commit 2026-04-07 13:17:28 -04:00
ingest_benchmark_report.py Add benchmarked route matching and request shaping 2026-04-07 14:45:32 -04:00
launch_p40_triple.sh Initial commit 2026-04-07 13:17:28 -04:00
p40_triple_cpu.sh Initial commit 2026-04-07 13:17:28 -04:00
p40_triple_gpu0.sh Initial commit 2026-04-07 13:17:28 -04:00
p40_triple_gpu1.sh Initial commit 2026-04-07 13:17:28 -04:00
run_benchmark_workload.py Add benchmarked route matching and request shaping 2026-04-07 14:45:32 -04:00
run_control.sh Initial commit 2026-04-07 13:17:28 -04:00
run_control_p40_lan.sh Initial commit 2026-04-07 13:17:28 -04:00
run_control_p40_zerotier.sh Initial commit 2026-04-07 13:17:28 -04:00
run_control_singlebox.sh Initial commit 2026-04-07 13:17:28 -04:00
run_node.sh Initial commit 2026-04-07 13:17:28 -04:00
run_node_singlebox.sh Initial commit 2026-04-07 13:17:28 -04:00
smoke_test.py Add smoke test, enable Ollama discovery in singlebox config, update demo doc 2026-04-27 15:13:31 -04:00
start_p40_triple_llamacpp.sh Initial commit 2026-04-07 13:17:28 -04:00
test_qwen35_server_cuda_container.sh Initial commit 2026-04-07 13:17:28 -04:00
tmux_session_status.sh Initial commit 2026-04-07 13:17:28 -04:00