← Canonical rankings

M3 Ultra — benchmark record

M3 Ultra local LLM benchmarks across 256 GB, 512 GB RAM tiers on Apple Silicon Mac. 26 published rows across 15 models with explicit evidence state and RAM-tier comparison. Peak published speed is 370.0 tok/s.

26Benchmark rows
15Models tested
2RAM configurations
370.0Fastest avg tok/s

Each configuration differs only in unified memory. More RAM = larger models fit. Throughput is similar across RAM tiers at the same model size.

All benchmark rows — M3 Ultra

Sorted by avg tok/s descending. Click source badge to see original measurement.

Chip (RAM)ModelQuantRAM req.Avg tok/sPrompt tok/sRuntimeSource
M3 Ultra (256 GB)Qwen 3 0.6B4bit370.0 tok/sMLXref
M3 Ultra (256 GB)Qwen3.5-9B4bit5.1 GB106.0 tok/sMLXref
M3 Ultra (256 GB)Qwen3.5-35B-A3B4bit19.6 GB95.0 tok/sMLXref
M3 Ultra (256 GB)Qwen3.5-35B-A3B8bit36.9 GB80.0 tok/sMLXref
M3 Ultra (256 GB)Qwen3-Coder-Next4bit44.9 GB74.0 tok/sMLXref
M3 Ultra (256 GB)Qwen3-Coder-Next6bit64.8 GB66.0 tok/sMLXref
M3 Ultra (256 GB)GLM-4.7-Flash8bit58.0 tok/sMLXref
M3 Ultra (256 GB)Qwen3.5-122B-A10BMXFP465.0 GB57.0 tok/sMLXref
M3 Ultra (256 GB)GLM-4.5-Air4bit60.3 GB54.0 tok/sMLXref
M3 Ultra (256 GB)Devstral Small 2 24B4bit13.4 GB47.0 tok/sMLXref
M3 Ultra (256 GB)Qwen3.5-122B-A10B8bit129.8 GB43.0 tok/sMLXref
M3 Ultra (512 GB)Devstral Small 1.14bit43.0 tok/sLM Studioref
M3 Ultra (512 GB)Qwen3.5-397B-A17Bq4.1bit40.2 tok/sMLXref
M3 Ultra (256 GB)Qwen3.5-27B4bit15.3 GB38.0 tok/sMLXref
M3 Ultra (512 GB)Qwen 3 235B-A22B4bit27.4 tok/sLM Studioref
M3 Ultra (256 GB)Qwen 3 235B-A22BQ427.0 tok/sMLXref
M3 Ultra (512 GB)GLM-54bit391.8 GB16.7 tok/s187.0 tok/sMLXref
M3 Ultra (512 GB)Llama 3.3 70B4bit15.5 tok/s150.0 tok/sLM Studioref
M3 Ultra (512 GB)GLM-54bit394.1 GB13.7 tok/s180.1 tok/sMLXref
M3 Ultra (512 GB)GLM-54bit396.7 GB13.2 tok/s154.1 tok/sMLXref
M3 Ultra (512 GB)GLM-54bit402.7 GB12.0 tok/s117.4 tok/sMLXref
M3 Ultra (512 GB)Gemma 3 27Bbf1652.6 GB11.2 tok/sLM Studioref
M3 Ultra (512 GB)GLM-54bit415.4 GB10.7 tok/s77.7 tok/sMLXref
M3 Ultra (512 GB)Llama 3.3 70B4bit9.6 tok/s103.0 tok/sLM Studioref
M3 Ultra (512 GB)Llama 3.3 70B8bit8.5 tok/s150.0 tok/sLM Studioref
M3 Ultra (512 GB)Llama 3.3 70B8bit6.5 tok/s101.0 tok/sLM Studioref

benchmarks.json — full dataset  ·  chips.json — chip summaries  ·  benchmarks.csv — CSV export

Data from in-house lab measurements plus community-published benchmarks. See all chip families →