Best models for this Mac

25 modelsCatalog current through February 27, 2026

MacBook Air M4 32GB 15-inch ranked for coding with a most capable bias, using the best available runtime evidence.

Use the strongest current runtime evidence for each row.M5 Max watch: 8B 61.6 tok/s · 14B 34.3 tok/s on the current 36GB public anchor.
RankModelScoreQuantTok/sRuntimeEvidenceHeadroomWhy it ranks here
1Qwen 3 32B2506bit22.0 tok/sOllamaEstimated6.6 GB6bit is the highest practical quality here. 22.0 tok/s estimated from nearby benchmark coverage, with Ollama wrapper on llama.cpp as the best runtime hint. 6.6 GB headroom leaves workable context margin.
2Devstral Small 1.12458bit43.0 tok/sLM StudioEstimated7.9 GB8bit is the highest practical quality here. 43.0 tok/s estimated from nearby benchmark coverage, with LM Studio wrapper on mixed as the best runtime hint. 7.9 GB headroom leaves workable context margin.
3Devstral Small 2 24B2458bit25.3 tok/sMLXEstimated7.9 GB8bit is the highest practical quality here. 25.3 tok/s estimated from nearby benchmark coverage, with MLX backend as the best runtime hint. 7.9 GB headroom leaves workable context margin.
4Qwen3.5-27B243Q6_K20.6 tok/sMLXEstimated8.9 GBQ6_K is the highest practical quality here. 20.6 tok/s estimated from nearby benchmark coverage, with MLX backend as the best runtime hint. 8.9 GB headroom leaves workable context margin.
5Gemma 3 27B236Q6_K14.5 tok/sLM StudioEstimated6.7 GBQ6_K is the highest practical quality here. 14.5 tok/s estimated from nearby benchmark coverage, with LM Studio wrapper on mixed as the best runtime hint. 6.7 GB headroom leaves workable context margin.
6DeepSeek R1 Distill Qwen 32B2286bitMeasure itBest availableFit-first6.6 GB6bit is the highest practical quality here. Speed still needs direct measurement. 6.6 GB headroom leaves workable context margin.
7Mistral Small 3.1 24B2238bitMeasure itBest availableFit-first7.9 GB8bit is the highest practical quality here. Speed still needs direct measurement. 7.9 GB headroom leaves workable context margin.
8Magistral Small2238bitMeasure itBest availableFit-first7.9 GB8bit is the highest practical quality here. Speed still needs direct measurement. 7.9 GB headroom leaves workable context margin.
9Nemotron-3-Nano-30B-A3B211Q6_K43.7 tok/sllama.cppEstimated8.2 GBQ6_K is the highest practical quality here. 43.7 tok/s estimated from nearby benchmark coverage, with llama.cpp backend as the best runtime hint. 8.2 GB headroom leaves workable context margin.
10Qwen3-Coder-30B-A3B210Q6_K58.5 tok/sllama.cppEstimated7.8 GBQ6_K is the highest practical quality here. 58.5 tok/s estimated from nearby benchmark coverage, with llama.cpp backend as the best runtime hint. 7.8 GB headroom leaves workable context margin.
11Qwen 3 8B2108bit63.1 tok/sLM StudioEstimated22.7 GB8bit is the highest practical quality here. 63.1 tok/s estimated from nearby benchmark coverage, with LM Studio wrapper on mixed as the best runtime hint. 22.7 GB headroom leaves workable context margin.
12Qwen 3 30B-A3B210Q6_K76.7 tok/sMLXEstimated7.4 GBQ6_K is the highest practical quality here. 76.7 tok/s estimated from nearby benchmark coverage, with MLX backend as the best runtime hint. 7.4 GB headroom leaves workable context margin.
13Qwen3.5-35B-A3B2096bit80.0 tok/sMLXEstimated6.4 GB6bit is the highest practical quality here. 80.0 tok/s estimated from nearby benchmark coverage, with MLX backend as the best runtime hint. 6.4 GB headroom leaves workable context margin.
14Llama 2 7B2078bit36.4 tok/sllama.cppEstimated21.2 GB8bit is the highest practical quality here. 36.4 tok/s estimated from nearby benchmark coverage, with llama.cpp backend as the best runtime hint. 21.2 GB headroom leaves workable context margin.
15Qwen3.5-9B2048bit15.0 tok/sllama.cppEstimated22.1 GB8bit is the highest practical quality here. 15.0 tok/s estimated from nearby benchmark coverage, with llama.cpp backend as the best runtime hint. 22.1 GB headroom leaves workable context margin.
16GLM-4.7-Flash203Q536.8 tok/sllama.cppEstimated6.7 GBQ5 is the highest practical quality here. 36.8 tok/s estimated from nearby benchmark coverage, with llama.cpp backend as the best runtime hint. 6.7 GB headroom leaves workable context margin.
17Qwen 2.5 14B1928bitMeasure itBest availableFit-first17.0 GB8bit is the highest practical quality here. Speed still needs direct measurement. 17.0 GB headroom leaves workable context margin.
18Phi-4 14B1918bitMeasure itBest availableFit-first17.5 GB8bit is the highest practical quality here. Speed still needs direct measurement. 17.5 GB headroom leaves workable context margin.
19Qwen 2.5 7B1898bitMeasure itBest availableFit-first24.0 GB8bit is the highest practical quality here. Speed still needs direct measurement. 24.0 GB headroom leaves workable context margin.
20Llama 3.1 8B1888bitMeasure itBest availableFit-first23.0 GB8bit is the highest practical quality here. Speed still needs direct measurement. 23.0 GB headroom leaves workable context margin.
21Mistral 7B v0.31888bitMeasure itBest availableFit-first23.7 GB8bit is the highest practical quality here. Speed still needs direct measurement. 23.7 GB headroom leaves workable context margin.
22Gemma 3 4B1508bit100.5 tok/sLM StudioEstimated26.4 GB8bit is the highest practical quality here. 100.5 tok/s estimated from nearby benchmark coverage, with LM Studio wrapper on mixed as the best runtime hint. 26.4 GB headroom leaves workable context margin.
23Qwen 3 4B1508bit143.2 tok/sMLXEstimated26.6 GB8bit is the highest practical quality here. 143.2 tok/s estimated from nearby benchmark coverage, with MLX backend as the best runtime hint. 26.6 GB headroom leaves workable context margin.
24Qwen 3 0.6B1458bit184.4 tok/sLM StudioEstimated30.8 GB8bit is the highest practical quality here. 184.4 tok/s estimated from nearby benchmark coverage, with LM Studio wrapper on mixed as the best runtime hint. 30.8 GB headroom leaves workable context margin.
25Llama 3.2 1B1248bitMeasure itBest availableFit-first30.1 GB8bit is the highest practical quality here. Speed still needs direct measurement. 30.1 GB headroom leaves workable context margin.
32GBUnified memory
$1,699MSRP
macbook_airForm factor
m4Chip