Qwen3-Coder 30B-A3B
Alibaba's Qwen3-Coder 30B with MoE architecture. Top coding model in <30B class.
Task Fit
Tool use, repo work, terminal workflows, and coding benchmarks.
Code generation, debugging, refactoring, and benchmark signal.
Not marked for chat in the current library.
Not marked for rag in the current library.
Not marked for vision in the current library.
Not marked for image generation in the current library.
Not marked for video generation in the current library.
Not marked for voice in the current library.
Source Confidence
Variants and Quant Artifacts
Choose the artifact first; hardware fit follows from RAM, VRAM, format, and runtime.
| Quant | Format | Quality | Min RAM | Reco RAM | Runtime | Action |
|---|---|---|---|---|---|---|
| Q4_K_M | gguf | balanced | 24GB | 48GB | ollama, llama.cpp, lm-studio | Plan with this |
| Q5_K_M | gguf | balanced | 24GB | 48GB | ollama, llama.cpp, lm-studio | Plan with this |
| Q8_0 | gguf | high | 35GB | 48GB | ollama, llama.cpp, lm-studio | Plan with this |
| FP16 | gguf | high | 68GB | 48GB | ollama, llama.cpp, lm-studio | Plan with this |
Recommended Hardware
Lowest estimated 5-year cost that can run this model.
Plenty of fast-memory headroom for this model.
Highest local performance signal among compatible hardware.
Benchmarks
Source and Review
Similar Models
Alibaba's flagship Qwen3. Competitive with GPT-4 class models.
Alibaba's open-weight Qwen3.6 27B. Strong coding-agent, reasoning, long-context, and vision-language model with 262K native context.
Qwen3 30B with MoE. Same family as Coder, general-purpose.