Apple Silicon RAM Calculator

Can your Mac run this model? Select a model and quantization to see which chips have enough RAM — and how fast they'll run it.

Quick reference: RAM by model size

Approximate VRAM/unified memory needed at each quantization. Add ~2–4 GB for OS and runtime overhead.

| Model size | Q4_K_M | Q5_K_M | Q6_K | Q8_0 | Minimum Mac |
|---|---|---|---|---|---|
| 1B | ~0.8 GB | ~1.0 GB | ~1.2 GB | ~1.5 GB | 8 GB (any M-series) |
| 3B | ~2.0 GB | ~2.5 GB | ~3.0 GB | ~3.5 GB | 8 GB (any M-series) |
| 7B–8B | ~4.5 GB | ~5.5 GB | ~6.5 GB | ~8.5 GB | 16 GB (M-series base) |
| 14B | ~9 GB | ~11 GB | ~13 GB | ~16 GB | 24 GB (M Pro+) |
| 32B | ~20 GB | ~24 GB | ~29 GB | ~35 GB | 36–48 GB (M Max) |
| 70B | ~43 GB | ~53 GB | ~63 GB | ~75 GB | 64 GB (M Max 64 GB+) |
| 105B | ~65 GB | ~79 GB | ~94 GB | ~112 GB | 128 GB (M Max 128 GB) |
| 235B (MoE) | ~130–140 GB | ~160 GB | ~190 GB | ~240 GB | 192 GB (M Ultra) |
| 405B | ~245 GB | ~300 GB | | | 512 GB (M3 Ultra) |
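As a rough illustration of how to read the table, the sketch below adds the overhead allowance to a table value and compares the total with a Mac's RAM. The names `WEIGHTS_GB`, `required_gb`, and `fits` are hypothetical helpers, not part of this site's calculator, and the default of 4 GB overhead is simply the upper end of the 2–4 GB range quoted above.

```python
# Approximate weight sizes in GB, copied from three rows of the table above.
WEIGHTS_GB = {
    "7B-8B": {"Q4_K_M": 4.5, "Q5_K_M": 5.5, "Q6_K": 6.5, "Q8_0": 8.5},
    "14B": {"Q4_K_M": 9, "Q5_K_M": 11, "Q6_K": 13, "Q8_0": 16},
    "70B": {"Q4_K_M": 43, "Q5_K_M": 53, "Q6_K": 63, "Q8_0": 75},
}


def required_gb(model, quant, overhead_gb=4.0):
    """Weights plus OS/runtime overhead (assumed 4 GB, the top of the 2-4 GB range)."""
    return WEIGHTS_GB[model][quant] + overhead_gb


def fits(model, quant, ram_gb, overhead_gb=4.0):
    """True if the estimated total fits in the given amount of unified memory."""
    return required_gb(model, quant, overhead_gb) <= ram_gb


print(required_gb("70B", "Q4_K_M"))  # 47.0
print(fits("70B", "Q4_K_M", 64))     # True
print(fits("70B", "Q6_K", 64))       # False: 63 + 4 = 67 GB
```

This is only a capacity check. It says nothing about speed, and long contexts can push runtime memory beyond the flat overhead assumed here.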

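MoE (Mixture of Experts) models like Qwen 3 235B A22B activate only a fraction of their parameters for each token (about 22B here), so they generate tokens faster than a dense model of the same total size. All experts still have to be resident in memory, so RAM requirements follow the total parameter count, not the active count: a 235B MoE model at Q4 needs ~130–140 GB.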

Apple Silicon RAM tiers

| Chip | RAM options | Largest model at Q4_K_M | Best for |
|---|---|---|---|
| M4, M3, M2, M1 (base) | 8–32 GB | 8B (16 GB) · 14B (24 GB) | 7B–8B daily use |
| M4 Pro, M3 Pro, M2 Pro | 24–64 GB | 14B (24 GB) · 32B (48+ GB) | 14B daily, occasional 32B |
| M4 Max, M3 Max, M2 Max | 36–128 GB | 32B (48 GB) · 70B (128 GB) | 32B–70B inference |
| M2 Ultra, M3 Ultra | 64–512 GB | 235B (192+ GB) · 405B (512 GB) | Maximum model size |

About quantization and quality

| Quantization | Size vs F32 | Quality | Speed | Recommended use |
|---|---|---|---|---|
| Q2_K | ~25% | Noticeably degraded | Fastest | When RAM is severely limited |
| Q3_K_M | ~35% | Somewhat degraded | Very fast | When RAM is tight |
| Q4_K_M | ~45% | Good, minimal loss | Fast | Best daily driver |
| Q5_K_M | ~55% | Very good | Moderately fast | Quality-focused use |
| Q6_K | ~65% | Excellent, near full | Moderate | High-quality tasks |
| Q8_0 | ~83% | Near-lossless | Slower | Benchmarking, max quality |
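One way to combine the quality ranking with the RAM figures from the quick reference is to pick the highest-quality quantization that still fits. The sketch below does that for two model sizes. `QUALITY_ORDER`, `SIZES_GB`, and `best_quant` are hypothetical names, the sizes are the approximate values from the quick-reference table, and the 4 GB overhead is an assumption taken from the upper end of the 2–4 GB allowance.

```python
# Quantizations from best to worst quality, following the quality ranking above.
QUALITY_ORDER = ["Q8_0", "Q6_K", "Q5_K_M", "Q4_K_M"]

# Approximate sizes in GB from the quick-reference table (two rows only).
SIZES_GB = {
    "32B": {"Q4_K_M": 20, "Q5_K_M": 24, "Q6_K": 29, "Q8_0": 35},
    "70B": {"Q4_K_M": 43, "Q5_K_M": 53, "Q6_K": 63, "Q8_0": 75},
}


def best_quant(model, ram_gb, overhead_gb=4.0):
    """Return the highest-quality quantization that fits, or None if none does."""
    budget = ram_gb - overhead_gb  # memory left for weights after OS/runtime overhead
    for quant in QUALITY_ORDER:
        if SIZES_GB[model][quant] <= budget:
            return quant
    return None


print(best_quant("70B", 64))  # Q5_K_M
print(best_quant("32B", 36))  # Q6_K
print(best_quant("70B", 36))  # None
```

Choosing by quality alone ignores the speed column: a lower quantization that also fits will generally run faster, so Q4_K_M may still be the better daily choice.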

Related tools and guides

benchmarks.json — full dataset  ·  chips.json — chip summaries  ·  benchmarks.csv — CSV export