
Qwen 3 32B — Apple Silicon Benchmarks

Measured inference speed for Qwen 3 32B across 3 Apple Silicon chips, reported in tokens per second at multiple quantization levels. All figures come from real runs, not estimates.

Quantizations measured: Q4_K_M, iQ2_K_S

Benchmark rows: 3
Chip tiers covered: 3
Fastest avg tok/s: 22.0, on M4 Max (40-core GPU, 64 GB)
Minimum RAM observed: 11 GB

Benchmark results for Qwen 3 32B

Rows are sorted by avg tok/s, descending. Click a source badge to see the original measurement page.

Chip | Quant | RAM req. | Context | Avg tok/s | Prompt tok/s | Runtime | Source
M4 Max (40-core GPU, 64 GB) | Q4_K_M | 20.0 GB | 128k | 22.0 tok/s | - | factory harness | factory lab
M4 Max (32-core GPU) | iQ2_K_S | 11.0 GB | 4k | 13.2 tok/s | - | - | ref
M4 Max (128 GB) | Q4_K_M | - | 10k | 11.7 tok/s | - | LM Studio | ref
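
The summary figures and the sort order used on this page can be reproduced from the three rows above. The following is a minimal sketch in Python that hard-codes those rows. The field names (`chip`, `quant`, `ram_gb`, `context`, `avg_tok_s`) are hypothetical and are not taken from benchmarks.json, whose schema is not shown here. A missing RAM value is represented as `None` and skipped when computing the minimum.

```python
from typing import List, Optional, TypedDict


class Row(TypedDict):
    # Hypothetical field names chosen for this sketch only.
    chip: str
    quant: str
    ram_gb: Optional[float]  # None when the table lists no RAM requirement
    context: str
    avg_tok_s: float


# The three rows exactly as listed in the table above.
ROWS: List[Row] = [
    {"chip": "M4 Max (128 GB)", "quant": "Q4_K_M",
     "ram_gb": None, "context": "10k", "avg_tok_s": 11.7},
    {"chip": "M4 Max (40-core GPU, 64 GB)", "quant": "Q4_K_M",
     "ram_gb": 20.0, "context": "128k", "avg_tok_s": 22.0},
    {"chip": "M4 Max (32-core GPU)", "quant": "iQ2_K_S",
     "ram_gb": 11.0, "context": "4k", "avg_tok_s": 13.2},
]


def rank_by_speed(rows: List[Row]) -> List[Row]:
    """Return rows sorted by average tokens per second, fastest first."""
    return sorted(rows, key=lambda r: r["avg_tok_s"], reverse=True)


def summarize(rows: List[Row]) -> dict:
    """Compute the headline figures: row count, fastest row, minimum RAM."""
    ranked = rank_by_speed(rows)
    known_ram = [r["ram_gb"] for r in rows if r["ram_gb"] is not None]
    return {
        "rows": len(rows),
        "fastest_chip": ranked[0]["chip"],
        "fastest_avg_tok_s": ranked[0]["avg_tok_s"],
        "min_ram_gb": min(known_ram) if known_ram else None,
    }


summary = summarize(ROWS)
for row in rank_by_speed(ROWS):
    print(f'{row["chip"]:30} {row["quant"]:8} {row["avg_tok_s"]:5.1f} tok/s')
print(summary)
```

Skipping rows with no RAM figure, rather than treating them as zero, keeps the minimum meaningful when a measurement does not report memory use.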

benchmarks.json — full dataset  ·  models.json — model summaries  ·  benchmarks.csv — CSV export
