Gemma 3n 4B
Google's Gemma 3n is a multimodal (text, vision, audio) model at the 4B size.
Task Fit
Not marked for code agent in the current library.
Not marked for code in the current library.
General writing, Q&A, and assistant use.
Not marked for RAG in the current library.
Image or visual understanding, not necessarily image generation.
Not marked for image generation in the current library.
Not marked for video generation in the current library.
Speech recognition, TTS, or audio workflows.
Variants and Quant Artifacts
Choose the artifact first; hardware fit follows from RAM, VRAM, format, and runtime.
| Quant | Format | Quality | Min RAM | Recommended RAM | Runtime |
|---|---|---|---|---|---|
| Q4_K_M | GGUF | balanced | 5 GB | 8 GB | ollama, llama.cpp, lm-studio |
| Q8_0 | GGUF | high | 7 GB | 8 GB | ollama, llama.cpp, lm-studio |
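
The "artifact first, hardware second" rule can be sketched as a small lookup over the minimum and recommended RAM figures in the table above. This is only an illustrative sketch: the `Artifact` class and the `pick_artifact` and `meets_recommended` helpers are hypothetical names, not part of any library. It also assumes that the artifact with the larger minimum RAM is the higher-quality one, which holds for the two rows listed here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Artifact:
    quant: str
    quality: str
    min_ram_gb: int
    reco_ram_gb: int


# Values copied from the table above (both artifacts are GGUF files).
ARTIFACTS = [
    Artifact("Q4_K_M", "balanced", 5, 8),
    Artifact("Q8_0", "high", 7, 8),
]


def pick_artifact(available_ram_gb: float) -> Optional[Artifact]:
    """Return the most demanding artifact whose minimum RAM fits, or None."""
    fitting = [a for a in ARTIFACTS if a.min_ram_gb <= available_ram_gb]
    if not fitting:
        return None
    # Assumption: a larger minimum RAM means a higher-quality quant.
    return max(fitting, key=lambda a: a.min_ram_gb)


def meets_recommended(artifact: Artifact, available_ram_gb: float) -> bool:
    """True when the machine also reaches the recommended RAM figure."""
    return available_ram_gb >= artifact.reco_ram_gb


if __name__ == "__main__":
    for ram in (4, 6, 8):
        choice = pick_artifact(ram)
        if choice is None:
            print(f"{ram} GB: no listed artifact fits")
        else:
            note = "meets" if meets_recommended(choice, ram) else "below"
            print(f"{ram} GB: {choice.quant} ({note} recommended RAM)")
```

A machine with 6 GB free would land on Q4_K_M while still sitting below the recommended figure, which is the kind of case where the runtime may work but with little headroom.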
Recommended Hardware
Lowest estimated 5-year cost that can run this model.
Enough unified/system memory with a balanced 5-year cost.
Highest local performance signal among compatible hardware.
Benchmarks
No benchmark data is available for this model yet.