Model Detail · Needs review

GLM 5.1 FP8

GLM 5.1 FP8 from Z.ai. This entry was auto-imported from a Hugging Face seed; review the memory and benchmark details before using it for final ranking.

Z.ai · GLM · MIT · 2026-04-16
Parameters: 355B (32B active)
Context window: 131K (long context)
Architecture: mixture of experts transformer (moe)
Quality score: 94 (planner signal)

Task Fit

Code Agent: Supported

Tool use, repo work, terminal workflows, and coding benchmarks.

Code: Not a fit

Not marked for code in the current library.

Chat: Supported

General writing, Q&A, and assistant use.

RAG: Supported

Document QA benefits from long context and instruction following.

Vision: Not a fit

Not marked for vision in the current library.

Image Generation: Not a fit

Not marked for image generation in the current library.

Video Generation: Not a fit

Not marked for video generation in the current library.

Voice: Not a fit

Not marked for voice in the current library.

Source Confidence

Overall: low · 40/100
Parameters: Auto-estimated
Task fit: Auto-inferred
Memory: Estimated from artifact
License: Source / seed
Benchmarks: Missing
Hardware fit: Calculated
Review flags: task capabilities auto-inferred · memory estimated · missing benchmarks

Variants and Quant Artifacts

Choose the artifact first; hardware fit follows from RAM, VRAM, format, and runtime.

3 artifacts
Quant    Format        Quality    Min RAM   Reco RAM   Runtime
FP8      safetensors   balanced   64GB      96GB       transformers, vllm
Q4_K_M   gguf          balanced   32GB      48GB       llama.cpp, lm-studio
FP16     safetensors   high       1TB       2TB        transformers, vllm
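
Because the memory figures on this page are flagged as estimated, a quick weights-only calculation can serve as a sanity check when reviewing them. The sketch below multiplies the listed total parameter count by an assumed number of bits per weight. The 4.85 bits used for Q4_K_M is an approximation, not a value from this page, and the estimate excludes KV cache, activations, and runtime overhead. For a mixture-of-experts model, all experts typically have to be resident in memory, so the total parameter count is used rather than the active count.

```python
# Weights-only memory estimate: parameters x bits per weight / 8.
# Assumptions: bits-per-weight values are nominal; Q4_K_M at ~4.85 bits is
# an approximation. KV cache, activations, and runtime overhead are excluded.

BITS_PER_WEIGHT = {
    "FP16": 16.0,
    "FP8": 8.0,
    "Q4_K_M": 4.85,  # assumed average; varies with the model's tensor mix
}


def weights_gb(total_params: float, quant: str) -> float:
    """Return the approximate size of the weights in decimal gigabytes."""
    bits = BITS_PER_WEIGHT[quant]
    return total_params * bits / 8 / 1e9


if __name__ == "__main__":
    total_params = 355e9  # listed total parameter count (auto-estimated)
    for quant in ("FP8", "Q4_K_M", "FP16"):
        print(f"{quant}: ~{weights_gb(total_params, quant):.0f} GB of weights")
```

Under these assumptions the FP8 weights alone come to roughly 355 GB and the Q4_K_M weights to roughly 215 GB, both well above the Min RAM values listed in the table. That gap is consistent with the "memory estimated" review flag and is worth resolving before planning hardware around these rows.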

Recommended Hardware

Cheapest That Works
AMD Radeon Pro W7900 48GB
Compatible

Lowest estimated 5-year cost that can run this model.

48GB VRAM · $5,640 / 5y
Best Value
NVIDIA H200 141GB
Compatible

Plenty of fast-memory headroom for this model.

141GB VRAM · $37,162 / 5y
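
To see how the two recommendations relate to the artifact list, the following sketch compares each card's VRAM against the Min RAM and Reco RAM figures exactly as listed on this page. The `Artifact` class, the `fit` function, and the three-way classification are hypothetical and only illustrate one plausible rule; they are not the site's actual hardware-fit calculation, and the inputs carry the same "estimated" caveat as the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Artifact:
    quant: str
    min_gb: int
    reco_gb: int


# Figures copied from the artifact table as listed (flagged as estimated).
ARTIFACTS = [
    Artifact("FP8", 64, 96),
    Artifact("Q4_K_M", 32, 48),
    Artifact("FP16", 1000, 2000),  # 1TB / 2TB, treated as decimal GB
]


def fit(artifact: Artifact, memory_gb: int) -> str:
    """Hypothetical classification; not the site's actual rule."""
    if memory_gb >= artifact.reco_gb:
        return "recommended"
    if memory_gb >= artifact.min_gb:
        return "minimum"
    return "does not fit"


def runnable(memory_gb: int) -> list[str]:
    """List the quant names that meet at least the minimum on this device."""
    return [a.quant for a in ARTIFACTS if fit(a, memory_gb) != "does not fit"]


if __name__ == "__main__":
    for name, vram in (("Radeon Pro W7900", 48), ("H200", 141)):
        print(name, runnable(vram))
```

With the listed figures, a 48GB card meets only the Q4_K_M row, while a 141GB card also clears the recommended figure for FP8, which matches the "headroom" wording above.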

Benchmarks

No benchmark data is available for this model yet.

Source and Review

Hugging Face: zai-org/GLM-5.1-FP8
Ollama: Not mapped
Verification: Needs review
Artifact source: auto-estimated-from-hf-seed
Default variant: GLM 5.1 FP8 Coder
Tool calling: Supported
