Model Detail Needs review

GLM 4.7 Flash

Z.ai's GLM 4.7 Flash. Auto-imported from Hugging Face seed; review memory and benchmark details before using for final ranking.

Z.aiGLMmit2026-01-29
Parameters
32B
Dense model
Context window
131K
Long context
Architecture
mixture of experts transformer
dense
Quality score
94
Planner signal

Task Fit

Code AgentSupported

Tool use, repo work, terminal workflows, and coding benchmarks.

CodeNot a fit

Not marked for code in the current library.

ChatSupported

General writing, Q&A, and assistant use.

RAGSupported

Document QA benefits from long context and instruction following.

VisionNot a fit

Not marked for vision in the current library.

Image GenerationNot a fit

Not marked for image generation in the current library.

Video GenerationNot a fit

Not marked for video generation in the current library.

VoiceNot a fit

Not marked for voice in the current library.

Source Confidence

Overalllow · 40/100
ParametersAuto-estimated
Task fitAuto-inferred
MemoryEstimated from artifact
LicenseSource / seed
BenchmarksMissing
Hardware fitCalculated
Review flags
task capabilities auto-inferredmemory estimatedmissing benchmarks

Variants and Quant Artifacts

Choose the artifact first; hardware fit follows from RAM, VRAM, format, and runtime.

4 artifacts
QuantFormatQualityMin RAMReco RAMRuntimeAction
Q4_K_Mggufbalanced24GB32GBllama.cpp, lm-studio Plan with this
Q5_K_Mggufbalanced24GB32GBllama.cpp, lm-studio Plan with this
Q8_0ggufhigh40GB64GBllama.cpp, lm-studio Plan with this
FP16safetensorshigh80GB128GBtransformers, vllm Plan with this

Recommended Hardware

Cheapest That Works
NVIDIA RTX 4060 Ti 16GB
Compatible

Lowest estimated 5-year cost that can run this model.

16GB VRAM$1,772 / 5y
Best Value
NVIDIA Jetson AGX Orin 64GB
Compatible

Enough unified/system memory with a balanced 5-year cost.

64GB RAM$2,553 / 5y
Best Performance
NVIDIA RTX 6000 Blackwell 96GB
Compatible

Highest local performance signal among compatible hardware.

96GB VRAM$12,376 / 5y

Benchmarks

No benchmark data is available for this model yet.

Source and Review

Hugging Facezai-org/GLM-4.7-Flash
OllamaNot mapped
VerificationNeeds review
Artifact sourceauto-estimated-from-hf-seed
Default variantGLM 4.7 Flash Coder
Tool callingSupported

Similar Models