Hardware Detail | GPU | Previous gen

NVIDIA L40S 48GB

NVIDIA L40S 48GB profile for local AI model planning.

NVIDIA | NVIDIA GPU | 2023-08 | 48GB VRAM

AI Memory: 48GB VRAM (runtime headroom is estimated)
LLM Score: 78 (server)
Power: 310W (average AI load)
Value Score: 6.4 (planner signal)

Task Fit

Vision: 97

Strong fit for vision workloads in the current planner.

Image generation: 97

Strong fit for image generation workloads in the current planner.

Chat: 96

Strong fit for chat workloads in the current planner.

Code: 94

Strong fit for code workloads in the current planner.

RAG: 94

Strong fit for RAG workloads in the current planner.

Agent: 91

Strong fit for agent workloads in the current planner.

Video generation: 82

Usable for video generation, with tradeoffs around speed, memory, or runtime maturity.

Embedding: 75

Usable for embedding, with tradeoffs around speed, memory, or runtime maturity.

Voice: 71

Usable for voice, with tradeoffs around speed, memory, or runtime maturity.

Runtime Support

Ollama: Excellent
llama.cpp: Excellent
LM Studio: Excellent
vLLM: Excellent
SGLang: Good
ComfyUI: Excellent
diffusers: Excellent

Specs

CPU cores
GPU cores
System RAM
Unified memory
VRAM: 48GB
Usable model memory: 48GB
CUDA: Supported
Metal: No
ROCm: No

Cost and Ops

Hardware price: $7,500
Build cost: $1,800
UPS / accessories: $200
5-year estimate: $10,386
Expected lifespan: 6 years
Residual value: 18%
Build required: Yes
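
The page does not state how the 5-year estimate is derived, so the following is only a rough sketch of how the listed cost items might be combined. The dollar amounts, wattage, and residual percentage come from the listing above; the daily usage hours, the electricity price, and the choice to apply residual value to the hardware price alone are assumptions made for illustration, and all names (`CostInputs`, `upfront_cost`, and so on) are hypothetical. It is not expected to reproduce the $10,386 figure.

```python
from dataclasses import dataclass


@dataclass
class CostInputs:
    # Values taken from the listing.
    hardware_price: float = 7500.0
    build_cost: float = 1800.0
    accessories: float = 200.0        # UPS / accessories
    power_watts: float = 310.0        # average AI load
    residual_fraction: float = 0.18   # residual value
    # Assumptions, not values from the page.
    hours_per_day: float = 8.0
    price_per_kwh: float = 0.15
    years: int = 5


def upfront_cost(c: CostInputs) -> float:
    """Sum of the one-time purchase and build items."""
    return c.hardware_price + c.build_cost + c.accessories


def energy_cost(c: CostInputs) -> float:
    """Electricity cost over the period at the assumed usage and price."""
    kwh = c.power_watts / 1000.0 * c.hours_per_day * 365 * c.years
    return kwh * c.price_per_kwh


def residual_value(c: CostInputs) -> float:
    """Assumed resale value, applied to the hardware price only."""
    return c.hardware_price * c.residual_fraction


l40s = CostInputs()
print(f"Upfront:  ${upfront_cost(l40s):,.0f}")
print(f"Energy:   ${energy_cost(l40s):,.0f}")
print(f"Residual: ${residual_value(l40s):,.0f}")
```

Changing `hours_per_day` or `price_per_kwh` shows how sensitive a multi-year estimate is to usage and local electricity rates.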

Models This Hardware Can Run

Estimated from default model artifacts, VRAM, unified memory, and recommended RAM.

62 compatible defaults
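
As a rough illustration of this kind of estimate, the sketch below checks whether a model's weights plausibly fit in 48GB of VRAM. It is not the planner's actual method: the function names are hypothetical, and the 20% overhead for KV cache, activations, and runtime buffers is an assumed placeholder rather than a measured value.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8.0


def fits_in_vram(
    params_billion: float,
    bits_per_weight: float,
    vram_gb: float = 48.0,           # from the listing
    overhead_fraction: float = 0.2,  # assumption: KV cache, activations, buffers
) -> bool:
    """Return True if weights plus assumed overhead fit in VRAM."""
    needed = weight_memory_gb(params_billion, bits_per_weight) * (1.0 + overhead_fraction)
    return needed <= vram_gb


# Hypothetical model sizes, given as (parameters in billions, bits per weight).
for params, bits in [(8, 16), (34, 8), (70, 4), (70, 16)]:
    verdict = "fits" if fits_in_vram(params, bits) else "does not fit"
    print(f"{params}B at {bits}-bit: {verdict}")
```

Real headroom depends on context length, batch size, and the runtime, so a check like this is only a first-pass filter.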

Best For

rack servers, vLLM, team inference

Source Confidence

Specs: Reviewed
Price: Estimated
Power: Estimated
Performance: Estimated
Updated: 2026-06-18

Similar Hardware