Mike Swanson
4aadf16a9f
feat: add qwen3:8b for DESKTOP-0O8A1RL, update Ollama routing
Benchmarked 2026-05-16 on DESKTOP-0O8A1RL (RTX 5070 Ti Laptop, 12 GB VRAM):
- qwen3:8b: 100% VRAM fit (10.9/10.9 GB) -> 74-86 tok/s
- qwen3:14b: 73% VRAM (11.3/15.6 GB split) -> 17-18 tok/s (4.8x slower)
- qwen3.6: 41% VRAM (11.3/27.5 GB split) -> 17-19 tok/s
qwen3:14b overflows 12 GB VRAM at runtime (9.3 GB GGUF = 15.6 GB loaded).
qwen3:8b fits entirely in VRAM and matches the reference machine speed.
Updated OLLAMA.md: added qwen3:8b to models table, per-machine routing
table, benchmark results. Updated CLAUDE.md model one-liner.
Routing: qwen3:8b for prose on DESKTOP-0O8A1RL, qwen3:14b everywhere else,
qwen3.6 for strict-format tasks on all machines.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-16 16:25:57 -07:00
..
2026-03-20 16:42:01 -07:00
2026-05-16 12:59:49 -07:00
2026-05-12 17:40:37 -07:00
2026-04-27 13:15:52 -07:00
2026-05-15 15:52:15 -07:00
2026-04-22 14:06:22 -07:00
2026-05-15 06:20:59 -07:00
2026-05-01 16:52:12 -07:00
2026-05-16 12:59:49 -07:00
2026-01-17 16:23:52 -07:00
2026-01-18 11:51:47 -07:00
2026-03-25 03:46:07 -07:00
2026-01-20 16:21:06 -07:00
2026-01-20 16:21:06 -07:00
2026-05-16 16:25:57 -07:00
2026-01-20 16:21:06 -07:00
2026-05-15 15:52:15 -07:00
2026-04-18 08:54:20 -07:00
2026-05-12 08:45:33 -07:00
2026-04-20 12:14:43 -07:00
2026-01-15 18:55:45 -07:00
2026-05-15 06:10:15 -07:00
2026-04-14 06:32:16 -07:00
2026-05-06 13:46:23 -07:00
2026-05-06 13:46:23 -07:00
2026-04-21 19:34:56 -07:00
2026-04-16 18:56:26 -07:00
2026-02-01 16:23:47 -07:00
2026-01-20 16:21:06 -07:00
2026-05-16 16:25:57 -07:00
2026-04-21 19:01:27 -07:00
2026-01-17 12:51:43 -07:00
2026-04-20 12:09:17 -07:00
2026-02-17 10:49:35 -07:00
2026-01-20 16:21:06 -07:00
2026-05-14 10:48:29 -07:00
2026-01-17 06:00:26 -07:00
2026-01-17 06:00:26 -07:00
2026-01-17 06:00:26 -07:00
2026-01-17 06:00:26 -07:00
2026-01-17 06:00:26 -07:00
2026-01-17 16:23:52 -07:00
2026-05-15 17:29:39 -07:00
2026-02-01 16:23:47 -07:00
2026-04-27 13:15:52 -07:00
2026-04-21 19:48:59 -07:00
2026-04-21 19:28:38 -07:00