DeepSeek-R1-Distill 70B
DeepSeek-R1 reasoning distilled into a Llama 70B base model. Strong general reasoning.
Task Fit
Code agent: not marked in the current library.
Coding: code generation, debugging, refactoring, and benchmark signal.
General assistant: general writing, Q&A, and assistant use.
Document QA: benefits from long context and instruction following.
Vision: not marked in the current library.
Image generation: not marked in the current library.
Video generation: not marked in the current library.
Voice: not marked in the current library.
Variants and Quant Artifacts
Choose the artifact first; hardware fit follows from RAM, VRAM, format, and runtime.
| Quant | Format | Quality | Min RAM | Recommended RAM | Runtime |
|---|---|---|---|---|---|
| Q4_K_M | GGUF | balanced | 48 GB | 80 GB | ollama, llama.cpp, lm-studio |
| Q5_K_M | GGUF | balanced | 50 GB | 80 GB | ollama, llama.cpp, lm-studio |
| Q8_0 | GGUF | high | 78 GB | 80 GB | ollama, llama.cpp, lm-studio |
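
The "artifact first, hardware second" rule can be sketched as a small lookup. The Python below is only an illustration: the minimum and recommended RAM figures are copied from the table, while the names `QuantArtifact`, `pick_artifact`, and `meets_recommended` are hypothetical. It also assumes that a larger minimum RAM means a higher-quality quant, which the table suggests but does not state.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class QuantArtifact:
    quant: str
    min_ram_gb: int   # minimum RAM from the table
    reco_ram_gb: int  # recommended RAM from the table


# Figures copied from the table above; names are illustrative only.
ARTIFACTS = [
    QuantArtifact("Q4_K_M", 48, 80),
    QuantArtifact("Q5_K_M", 50, 80),
    QuantArtifact("Q8_0", 78, 80),
]


def pick_artifact(available_gb: float) -> Optional[QuantArtifact]:
    """Return the largest artifact whose minimum RAM fits, or None.

    Assumption: a larger minimum RAM implies a higher-quality quant.
    """
    fitting = [a for a in ARTIFACTS if a.min_ram_gb <= available_gb]
    if not fitting:
        return None
    return max(fitting, key=lambda a: a.min_ram_gb)


def meets_recommended(artifact: QuantArtifact, available_gb: float) -> bool:
    """True when the machine also reaches the recommended RAM figure."""
    return available_gb >= artifact.reco_ram_gb


if __name__ == "__main__":
    for ram in (32, 64, 96):
        choice = pick_artifact(ram)
        if choice is None:
            print(f"{ram} GB: no listed artifact fits")
        else:
            ok = meets_recommended(choice, ram)
            print(f"{ram} GB: {choice.quant} (meets recommended: {ok})")
```

Under these assumptions, a machine that clears only the minimum figure can load the artifact but sits below the recommended headroom, so the second check is worth reporting separately.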
Recommended Hardware
Lowest estimated 5-year cost that can run this model.
Enough effective VRAM with a balanced 5-year cost.
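
How the 5-year cost is estimated is not stated here. One plausible composition, shown as a rough Python sketch, is purchase price plus electricity over the period. The function name `five_year_cost` and every parameter (`purchase_price`, `avg_watts`, `hours_per_day`, `price_per_kwh`) are assumptions for illustration, not the method behind the recommendations above.

```python
def five_year_cost(
    purchase_price: float,
    avg_watts: float,
    hours_per_day: float,
    price_per_kwh: float,
    years: int = 5,
) -> float:
    """Hypothetical estimate: hardware price plus electricity over `years`.

    All inputs are caller-supplied assumptions; nothing here comes from
    the listing itself.
    """
    # Energy used over the whole period, in kilowatt-hours.
    kwh = (avg_watts / 1000.0) * hours_per_day * 365 * years
    return purchase_price + kwh * price_per_kwh


def cheapest(options: dict) -> str:
    """Return the name of the option with the lowest estimated cost.

    `options` maps a label to the keyword arguments of five_year_cost.
    """
    return min(options, key=lambda name: five_year_cost(**options[name]))
```

With a model like this, comparing two candidate machines reduces to calling `cheapest` on their assumed price and power figures, after first confirming that each has enough memory for the chosen artifact.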
Similar Models
DeepSeek-R1 reasoning model distilled to 32B. Excellent at math & logic.
DeepSeek's efficient MoE coder. 16B total / 2.4B active.
Alibaba's flagship Qwen3. Competitive with GPT-4 class models.