Old Zhang's AI Learning
Jun 11, 2026 · Artificial Intelligence
Distilling Claude Opus: Qwen 9B Coding Model Runs on Consumer GPUs – Real‑World Benchmarks
The Qwopus3.5‑9B‑Coder model, fine‑tuned for agentic coding, tool calling and logical reasoning, offers three formats (Safetensors, GGUF, GGUF+MTP), runs on a 16 GB Mac mini via LM‑Studio, achieves up to 35% throughput gain with MTP, scores 85 on HermesAgent‑20, 100 on ToolCall‑15, and 53.89% on SWE‑bench, matching Claude Opus 4.6 in a 31‑tool adversarial test while highlighting its training tricks and current limitations.
Agentic CodingLLM BenchmarkQwen
0 likes · 11 min read
