Model Detail · Needs review

Gemma 4 E4B It QAT Q4 0 GGUF

Google's Gemma 4 E4B It QAT Q4 0 GGUF. Auto-imported from Hugging Face seed; review memory and benchmark details before using for final ranking.

Google · GEMMA4 · apache-2.0 · 2026-06-06

Parameters

Dense model

Context window

131K

Long context

Architecture

decoder-only transformer

dense

Quality score

Planner signal

Task Fit

Code Agent: Not a fit

Not marked for code agent in the current library.

Code: Not a fit

Not marked for code in the current library.

Chat: Supported

General writing, Q&A, and assistant use.

RAG: Supported

Document QA benefits from long context and instruction following.

Vision: Supported

Image or visual understanding, not necessarily image generation.

Image Generation: Not a fit

Not marked for image generation in the current library.

Video Generation: Not a fit

Not marked for video generation in the current library.

Voice: Not a fit

Not marked for voice in the current library.

Source Confidence

Overall: low · 40/100

Parameters: Auto-estimated

Task fit: Auto-inferred

Memory: Estimated from artifact

License: Source / seed

Benchmarks: Missing

Hardware fit: Calculated

Review flags

task capabilities auto-inferred · memory estimated · missing benchmarks
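Because memory is flagged as estimated, a rough cross-check of the weight size may be useful during review. The sketch below is an assumption-laden estimate, not the method this catalog uses: it relies on the llama.cpp block layouts for Q4_0 and Q8_0 (32 weights plus one 16-bit scale per block), and `ASSUMED_PARAMS` is a hypothetical placeholder, since this page does not state a parameter count.

```python
# Bits per weight for two llama.cpp block formats, derived from their layout:
# each block stores 32 quantized weights plus one 16-bit scale.
BITS_PER_WEIGHT = {
    "Q4_0": (32 * 4 + 16) / 32,  # 4.5 bits per weight
    "Q8_0": (32 * 8 + 16) / 32,  # 8.5 bits per weight
}


def estimate_weight_gb(param_count: float, quant: str) -> float:
    """Approximate size of the quantized weights in GB (10^9 bytes)."""
    total_bits = param_count * BITS_PER_WEIGHT[quant]
    return total_bits / 8 / 1e9


# Hypothetical value for illustration only; the page lists the
# parameter count as auto-estimated and does not give a number.
ASSUMED_PARAMS = 4e9

for quant in BITS_PER_WEIGHT:
    size = estimate_weight_gb(ASSUMED_PARAMS, quant)
    print(f"{quant}: ~{size:.2f} GB of weights")
```

The result covers weights only. Real GGUF files may keep some tensors at higher precision, and the KV cache and runtime overhead add to the total, so treat the output as a lower bound when comparing against the RAM figures on this page.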

Variants and Quant Artifacts

Choose the artifact first; hardware fit follows from RAM, VRAM, format, and runtime.

2 artifacts

Quant	Format	Quality	Min RAM	Recommended RAM	Runtime	Action
Q4_0	gguf	balanced	8GB	12GB	llama.cpp, lm-studio	Plan with this
Q8_0	gguf	high	8GB	12GB	llama.cpp, lm-studio	Plan with this
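The artifact-first rule above can be sketched as a small fit check. This is a minimal illustration built only from the two table rows; the `Artifact` class, the `fit` function, and the labels "comfortable", "tight", and "does not fit" are hypothetical names, not part of the catalog.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Artifact:
    quant: str
    fmt: str
    quality: str
    min_ram_gb: int
    reco_ram_gb: int
    runtimes: tuple


# Rows copied from the artifact table above.
ARTIFACTS = [
    Artifact("Q4_0", "gguf", "balanced", 8, 12, ("llama.cpp", "lm-studio")),
    Artifact("Q8_0", "gguf", "high", 8, 12, ("llama.cpp", "lm-studio")),
]


def fit(artifact: Artifact, ram_gb: int, runtime: str) -> str:
    """Classify how well an artifact fits a machine (labels are illustrative)."""
    if runtime not in artifact.runtimes:
        return "unsupported runtime"
    if ram_gb >= artifact.reco_ram_gb:
        return "comfortable"
    if ram_gb >= artifact.min_ram_gb:
        return "tight"
    return "does not fit"


for artifact in ARTIFACTS:
    print(artifact.quant, fit(artifact, ram_gb=16, runtime="llama.cpp"))
```

Both rows currently list the same 8GB and 12GB figures, so this check cannot tell Q4_0 and Q8_0 apart by memory. That is consistent with the "memory estimated" review flag and is worth confirming before planning around either artifact.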

Recommended Hardware

Cheapest That Works

Minisforum UM890 (Ryzen 9 8945HS)

Compatible

Lowest estimated 5-year cost that can run this model.

32GB RAM · $597 / 5y

Best Value

Mac mini M4 Pro 32GB

Compatible

Plenty of fast-memory headroom for this model.

32GB RAM · $1,499 / 5y

Best Performance

NVIDIA RTX 6000 Blackwell 96GB

Compatible

Highest local performance signal among compatible hardware.

96GB VRAM · $12,376 / 5y
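As a rough illustration of the "Cheapest That Works" pick, the sketch below selects the lowest 5-year cost among the three compatible options listed above and spreads it evenly over five years. The even split is an assumption, as this page does not say how the 5-year estimate is composed, and the function names are hypothetical.

```python
# (name, memory_gb, five_year_cost_usd) copied from the hardware cards above.
# All three entries are marked Compatible on this page.
HARDWARE = [
    ("Minisforum UM890 (Ryzen 9 8945HS)", 32, 597),
    ("Mac mini M4 Pro 32GB", 32, 1499),
    ("NVIDIA RTX 6000 Blackwell 96GB", 96, 12376),
]


def cheapest(options):
    """Return the option with the lowest 5-year cost."""
    return min(options, key=lambda option: option[2])


def annualized(cost_usd: float, years: int = 5) -> float:
    """Spread a multi-year cost evenly per year (a simplifying assumption)."""
    return cost_usd / years


name, memory_gb, cost = cheapest(HARDWARE)
print(f"{name}: ${annualized(cost):.2f} per year")
```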

Benchmarks

No benchmark data is available for this model yet.

Source and Review

Hugging Face: google/gemma-4-E4B-it-qat-q4_0-gguf

Ollama: Not mapped

Verification: Needs review

Artifact source: auto-estimated-from-hf-seed

Default variant: Gemma 4 E4B It QAT Q4 0 GGUF Coder

Tool calling: Not marked

Similar Models

Gemma 4 31B It QAT W4a16 Ct

Google · 31B · 32GB RAM

Google's Gemma 4 31B It QAT W4a16 Ct. Auto-imported from Hugging Face seed; review memory and benchmark details before using for final ranking.

Gemma 4 12B It QAT W4a16 Ct

Google · 12B · 16GB RAM

Google's Gemma 4 12B It QAT W4a16 Ct. Auto-imported from Hugging Face seed; review memory and benchmark details before using for final ranking.

Gemma 4 E2B It QAT Mobile Transformers

Google · 2B · 8GB RAM

Google's Gemma 4 E2B It QAT Mobile Transformers. Auto-imported from Hugging Face seed; review memory and benchmark details before using for final ranking.