← Canonical rankings

M5 Max — benchmark record

M5 Max local LLM benchmarks across 48 GB, 64 GB, 128 GB RAM tiers on Apple Silicon Mac. 60 published rows across 36 models with explicit evidence state and RAM-tier comparison. Peak published speed is 158.0 tok/s.

60Benchmark rows
36Models tested
3RAM configurations
158.0Fastest avg tok/s

Each configuration differs only in unified memory. More RAM = larger models fit. Throughput is similar across RAM tiers at the same model size.

All benchmark rows — M5 Max

Sorted by avg tok/s descending. Click source badge to see original measurement.

Chip (RAM)ModelQuantRAM req.Avg tok/sPrompt tok/sRuntimeSource
M5 Max (128 GB)Gemma 4 E2BQ4_K - Medium158.0 tok/sMLXref
M5 Max (64 GB)Qwen3.5-4BQ4_K - Medium148.0 tok/sMLXref
M5 Max (64 GB)Phi-4 Mini Instruct 3.8BQ4_K - Medium142.0 tok/sOllamaref
M5 Max (128 GB)Llama 3.1 8BQ4_K - Medium138.0 tok/sMLXref
M5 Max (64 GB)Qwen 3 4BQ4_K - Medium135.0 tok/sOllamaref
M5 Max (64 GB)Gemma 3 4BQ4_K - Medium132.0 tok/sOllamaref
M5 Max (48 GB)Qwen3.5-35B-A3B4bit128.0 tok/s3235.0 tok/sMLXref
M5 Max (128 GB)Gemma 4 E4BQ4_K - Medium128.0 tok/sMLXref
M5 Max (64 GB)Mistral 7B v0.3Q4_K - Medium122.0 tok/sOllamaref
M5 Max (64 GB)Phi-4 Mini Instruct 3.8BQ8_0112.0 tok/sMLXref
M5 Max (64 GB)Qwen3.5-9BQ4_K - Medium105.0 tok/sOllamaref
M5 Max (128 GB)Qwen 3 8BQ4_K - Medium98.0 tok/sOllamaref
M5 Max (64 GB)Ministral 3 8BQ4_K - Medium98.0 tok/sOllamaref
M5 Max (64 GB)DeepSeek R1 Distill Llama 8BQ4_K - Medium97.0 tok/sOllamaref
M5 Max (48 GB)Qwen3.5-35B-A3BQ4_K - Medium89.4 tok/s783.0 tok/sllama.cppref
M5 Max (64 GB)Mistral 7B v0.3Q8_088.0 tok/sMLXref
M5 Max (64 GB)Llama 3.1 8BQ8_082.0 tok/sOllamaref
M5 Max (128 GB)Qwen3-Coder-Next8bit87.1 GB79.3 tok/s754.9 tok/sMLXref
M5 Max (128 GB)Qwen3.5-9BQ8_078.0 tok/sMLXref
M5 Max (64 GB)DeepSeek R1 Distill Llama 8BQ8_075.0 tok/sMLXref
M5 Max (128 GB)Qwen3-Coder-Next8bit88.2 GB74.3 tok/s1802.1 tok/sMLXref
M5 Max (128 GB)Qwen3-Coder-Next8bit89.7 GB68.6 tok/s1887.2 tok/sMLXref
M5 Max (64 GB)Gemma 3 12BQ4_K - Medium68.0 tok/sOllamaref
M5 Max (128 GB)Qwen3.5-122B-A10B4bit71.9 GB65.9 tok/s881.5 tok/sMLXref
M5 Max (64 GB)Phi-4 14BQ4_K - Medium62.0 tok/sMLXref
M5 Max (64 GB)Qwen 3 30B-A3BQ4_K - Medium62.0 tok/sOllamaref
M5 Max (128 GB)Qwen3.5-122B-A10B4bit73.8 GB60.6 tok/s1239.7 tok/sMLXref
M5 Max (64 GB)Qwen 3 14BQ4_K - Medium58.0 tok/sOllamaref
M5 Max (64 GB)Ministral 3 14BQ4_K - Medium58.0 tok/sOllamaref
M5 Max (128 GB)Qwen3.6-35B-A3BQ4_K - Medium55.0 tok/sMLXref
M5 Max (128 GB)Qwen3.5-122B-A10B4bit76.4 GB54.9 tok/s1067.8 tok/sMLXref
M5 Max (64 GB)Qwen3.5-35B-A3BQ4_K - Medium52.0 tok/sMLXref
M5 Max (128 GB)Gemma 4 26B-A4BQ4_K - Medium50.0 tok/sMLXref
M5 Max (128 GB)Qwen3-Coder-Next8bit92.6 GB48.2 tok/s1432.7 tok/sMLXref
M5 Max (64 GB)Qwen3.6-35B-A3BQ4_K - Medium48.0 tok/sOllamaref
M5 Max (128 GB)Qwen3.5-35B-A3BQ4_K - Medium48.0 tok/sOllamaref
M5 Max (64 GB)Gemma 3 27BQ4_K - Medium42.0 tok/sOllamaref
M5 Max (128 GB)Mistral Small 4 119BQ4_K - Medium42.0 tok/sMLXref
M5 Max (128 GB)Qwen 3 14BQ8_042.0 tok/sMLXref
M5 Max (128 GB)Mistral Small 4 119BQ4_K - Medium38.0 tok/sOllamaref
M5 Max (64 GB)Nemotron Cascade 2 30B-A3BQ4_K - Medium35.0 tok/sOllamaref
M5 Max (128 GB)Qwen3.5-27B4bit31.6 tok/sMLXref
M5 Max (48 GB)Qwen3.5-27B4bit31.3 tok/s779.0 tok/sMLXref
M5 Max (128 GB)Qwen 3 32BQ4_K - Medium28.0 tok/sOllamaref
M5 Max (64 GB)DeepSeek R1 Distill Qwen 32BQ4_K - Medium27.0 tok/sOllamaref
M5 Max (128 GB)Gemma 4 31BQ4_K - Medium26.0 tok/sMLXref
M5 Max (128 GB)Llama 4 Scout 17B-16EQ4_K - Medium26.0 tok/sMLXref
M5 Max (48 GB)Qwen3.5-27BQ4_K - Medium23.7 tok/s171.0 tok/sllama.cppref
M5 Max (128 GB)Llama 4 Scout 17B-16EQ4_K - Medium22.0 tok/sOllamaref
M5 Max (64 GB)Gemma 4 31BQ4_K - Medium22.0 tok/sOllamaref
M5 Max (128 GB)Gemma 3 27BQ6_K20.0 tok/s391.0 tok/sllama.cppref
M5 Max (128 GB)Qwen 3 235B-A22BQ4_K - Medium18.0 tok/sMLXref
M5 Max (128 GB)Qwen3.5-27BQ6_K16.5 tok/sllama.cppref
M5 Max (128 GB)Llama 3.3 70BQ4_K - Medium15.0 tok/sMLXref
M5 Max (128 GB)Qwen 3 235B-A22BQ4_K - Medium15.0 tok/sOllamaref
M5 Max (128 GB)Qwen3.5-397B-A17B4bit13.0 tok/sflash-moeref
M5 Max (128 GB)Llama 3.3 70BQ4_K - Medium12.0 tok/sOllamaref
M5 Max (128 GB)DeepSeek R1 Distill Llama 70BQ4_K - Medium11.0 tok/sOllamaref
M5 Max (128 GB)Qwen 2.5 72BQ4_K - Medium10.0 tok/sOllamaref
M5 Max (128 GB)gpt-oss 120BQ4_K - Medium7.0 tok/sOllamaref

benchmarks.json — full dataset  ·  chips.json — chip summaries  ·  benchmarks.csv — CSV export

Data from in-house lab measurements plus community-published benchmarks. See all chip families →