Model coverage

Use this page when you're picking a model first and need to see which Macs run it well.

Frontier-priority models
18
Frontier families
10
Catalog models
60
Chip variants covered
103
Benchmark rows
432
Parameter range
0.6B – 744B

Catalog size isn't the point. What matters is whether a model family actually runs on Apple Silicon, the smallest Mac that fits it, and the fastest Mac that's been benchmarked.

qwen-3.6

2 models · 5 benchmark rows · 5 chips tested

55.0 tok/sfastest on M5 Max (128 GB)

gemma-4

4 models · 17 benchmark rows · 7 chips tested

158.0 tok/sfastest on M5 Max (128 GB)

minimax-m2

1 model · 0 benchmark rows · 0 chips tested

ModelCoverageChipsFastestSmallest fitStatus
MiniMax M2.7Detail
catalog onlyno Apple Silicon rows

mistral-small-4

1 model · 3 benchmark rows · 2 chips tested

45.0 tok/sfastest on M4 Ultra (192 GB)
ModelCoverageChipsFastestSmallest fitStatus
Mistral Small 4 119BDetail
3 rows245.0 tok/sbenchmark rows

ministral-3

2 models · 6 benchmark rows · 4 chips tested

98.0 tok/sfastest on M5 Max (64 GB)

devstral

2 models · 12 benchmark rows · 7 chips tested

47.0 tok/sfastest on M3 Ultra (256 GB)

glm-4.5

1 model · 5 benchmark rows · 2 chips tested

54.0 tok/sfastest on M3 Ultra (256 GB)
ModelCoverageChipsFastestSmallest fitStatus
GLM-4.5-AirDetail
5 rows254.0 tok/s60.3 GBbenchmark rows

magistral

1 model · 0 benchmark rows · 0 chips tested

ModelCoverageChipsFastestSmallest fitStatus
Magistral SmallDetail
catalog onlyno Apple Silicon rows

llama-4

1 model · 3 benchmark rows · 2 chips tested

30.0 tok/sfastest on M4 Ultra (192 GB)
ModelCoverageChipsFastestSmallest fitStatus
Llama 4 Scout 17B-16EDetail
3 rows230.0 tok/sbenchmark rows

glm-5.1

1 model · 0 benchmark rows · 0 chips tested

ModelCoverageChipsFastestSmallest fitStatus
GLM-5.1Detail
catalog onlyno Apple Silicon rows

nemotron-cascade-2

1 model · 3 benchmark rows · 3 chips tested

35.0 tok/sfastest on M5 Max (64 GB)

glm-5

1 model · 5 benchmark rows · 1 chip tested

16.7 tok/sfastest on M3 Ultra (512 GB)

qwen-3-coder

2 models · 7 benchmark rows · 3 chips tested

79.3 tok/sfastest on M5 Max (128 GB)

glm-4.7

1 model · 2 benchmark rows · 2 chips tested

58.0 tok/sfastest on M3 Ultra (256 GB)
ModelCoverageChipsFastestSmallest fitStatus
GLM-4.7-FlashDetail
2 rows258.0 tok/s17 GBbenchmark rows

nemotron-3

1 model · 1 benchmark rows · 1 chip tested

43.7 tok/sfastest on M1 Max (64 GB)

gpt-oss

2 models · 2 benchmark rows · 2 chips tested

10.0 tok/sfastest on M4 Ultra (192 GB)

gemma-3

3 models · 18 benchmark rows · 12 chips tested

132.0 tok/sfastest on M5 Max (64 GB)

mistral-small-3.1

1 model · 0 benchmark rows · 0 chips tested

phi-4

2 models · 10 benchmark rows · 8 chips tested

142.0 tok/sfastest on M5 Max (64 GB)

deepseek-r1

1 model · 0 benchmark rows · 0 chips tested

llama-3.3

1 model · 17 benchmark rows · 8 chips tested

18.0 tok/sfastest on M4 Ultra (192 GB)

llama-3.2

1 model · 0 benchmark rows · 0 chips tested

llama-3.1

1 model · 6 benchmark rows · 6 chips tested

138.0 tok/sfastest on M5 Max (128 GB)

mistral

1 model · 6 benchmark rows · 4 chips tested

122.0 tok/sfastest on M5 Max (64 GB)

llama-2

1 model · 5 benchmark rows · 5 chips tested

94.3 tok/sfastest on M2 Ultra (76-core GPU, 192 GB)

qwen-2-5-14b-instruct

1 model · 52 benchmark rows · 52 chips tested

36.7 tok/sfastest on M3 Ultra (80-core GPU, 256 GB)
ModelCoverageChipsFastestSmallest fitStatus
qwen-2-5-14b-instructDetail
52 rows5236.7 tok/sbenchmark rows

llama-3-1-8b-instruct

1 model · 52 benchmark rows · 52 chips tested

63.3 tok/sfastest on M3 Ultra (80-core GPU, 256 GB)
ModelCoverageChipsFastestSmallest fitStatus
llama-3-1-8b-instructDetail
52 rows5263.3 tok/sbenchmark rows

qwen-2-5-7b-instruct

1 model · 1 benchmark rows · 1 chip tested

49.7 tok/sfastest on M4 Max (128 GB)
ModelCoverageChipsFastestSmallest fitStatus
qwen-2-5-7b-instructDetail
1 rows149.7 tok/sbenchmark rows

llama-3-2-1b-instruct

1 model · 63 benchmark rows · 63 chips tested

229.0 tok/sfastest on M5 Max (32-core GPU, 36 GB)
ModelCoverageChipsFastestSmallest fitStatus
llama-3-2-1b-instructDetail
63 rows63229.0 tok/sbenchmark rows

benchmarks.json — full dataset  ·  models.json — model summaries  ·  benchmarks.csv — CSV export

Return to the canonical rankings →