Model Detail human-review-recommended

Phi-4 14B

Microsoft's compact 14B. Punches way above its weight, especially on math.

MicrosoftPHI4mit2025-01

Parameters

14B

Dense model

Context window

16K

Standard context

Architecture

decoder only transformer

dense

Quality score

Planner signal

Task Fit

Code AgentNot a fit

Not marked for code agent in the current library.

CodeSupported

Code generation, debugging, refactoring, and benchmark signal.

ChatSupported

General writing, Q&A, and assistant use.

RAGSupported

Document QA benefits from long context and instruction following.

VisionNot a fit

Not marked for vision in the current library.

Image GenerationNot a fit

Not marked for image generation in the current library.

Video GenerationNot a fit

Not marked for video generation in the current library.

VoiceNot a fit

Not marked for voice in the current library.

Source Confidence

Overallhigh · 94/100

ParametersReviewed / seeded

Task fitReviewed / seeded

MemorySeeded artifact

LicenseSource / seed

BenchmarksAvailable

Hardware fitCalculated

Variants and Quant Artifacts

Choose the artifact first; hardware fit follows from RAM, VRAM, format, and runtime.

3 artifacts

Quant	Format	Quality	Min RAM	Reco RAM	Runtime	Action
Q4_K_M	gguf	balanced	12GB	24GB	ollama, llama.cpp, lm-studio	Plan with this
Q5_K_M	gguf	balanced	12GB	24GB	ollama, llama.cpp, lm-studio	Plan with this
Q8_0	gguf	high	18GB	24GB	ollama, llama.cpp, lm-studio	Plan with this

Recommended Hardware

Cheapest That Works

NVIDIA RTX 3060 12GB

Compatible

Lowest estimated 5-year cost that can run this model.

12GB VRAM$1,570 / 5y

Best Value

NVIDIA RTX 5090 32GB

Compatible

Plenty of fast-memory headroom for this model.

32GB VRAM$4,788 / 5y

Best Performance

NVIDIA RTX 6000 Blackwell 96GB

Compatible

Highest local performance signal among compatible hardware.

96GB VRAM$12,376 / 5y

Benchmarks

Gsm8k92%

MMLU84%

Humaneval82%

Source and Review

Hugging Facemicrosoft/phi-4

Ollamaphi4:14b

Verificationhuman-review-recommended

Artifact sourceseeded

Default variantPhi-4 14B Coder

Tool callingNot marked

Similar Models

DeepSeek-R1-Distill 70B

DeepSeek · 70B · 80GB RAM

DeepSeek-R1 distilled to Llama 70B base. Strong general reasoning.

DeepSeek-R1-Distill 32B

DeepSeek · 32B · 48GB RAM

DeepSeek-R1 reasoning model distilled to 32B. Excellent at math & logic.

Qwen3 235B-A22B

Alibaba · 235B · 192GB RAM

Alibaba's flagship Qwen3. Competitive with GPT-4 class models.