| 1 | Mac Studio M3 Ultra 256GB | 330 | 8bit | 10.2 tok/s Fastest evidence path: 8bit · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 223.0 GB | 131k | Estimated | $7,499 | 8bit is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 223.0 GB headroom remains at this quantization. |
| 2 | MacBook Pro M5 Max 128GB 16-inch | 273 | 8bit | 28.0 tok/s Fastest evidence path: 8bit · 28.0 tok/s · Ollama · Estimated | Ollama | Fits | 95.0 GB | 131k | Estimated | $5,399 | 8bit is the current best practical quantization. 28.0 tok/s is estimated from nearby benchmark coverage. 95.0 GB headroom remains at this quantization. |
| 3 | Mac Pro M2 Ultra 192GB | 266 | 8bit | 10.2 tok/s Fastest evidence path: 8bit · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 159.0 GB | 131k | Estimated | $6,999 | 8bit is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 159.0 GB headroom remains at this quantization. |
| 4 | Mac Studio M4 Max 128GB | 208 | 8bit | 11.7 tok/s Fastest evidence path: 8bit · 11.7 tok/s · LM Studio · Estimated | LM Studio | Fits | 95.0 GB | 131k | Estimated | $4,499 | 8bit is the current best practical quantization. 11.7 tok/s is estimated from nearby benchmark coverage. 95.0 GB headroom remains at this quantization. |
| 5 | MacBook Pro M4 Max 128GB 16-inch | 208 | 8bit | 11.7 tok/s Fastest evidence path: 8bit · 11.7 tok/s · LM Studio · Estimated | LM Studio | Fits | 95.0 GB | 131k | Estimated | $5,999 | 8bit is the current best practical quantization. 11.7 tok/s is estimated from nearby benchmark coverage. 95.0 GB headroom remains at this quantization. |
| 6 | Mac Studio M4 Max 64GB | 185 | 8bit | 22.0 tok/s Fastest evidence path: 8bit · 22.0 tok/s · Ollama · Estimated | Ollama | Fits | 31.0 GB | 96k | Estimated | $2,999 | 8bit is the current best practical quantization. 22.0 tok/s is estimated from nearby benchmark coverage. 31.0 GB headroom remains at this quantization. |
| 7 | MacBook Pro M4 Max 64GB 16-inch | 185 | 8bit | 22.0 tok/s Fastest evidence path: 8bit · 22.0 tok/s · Ollama · Estimated | Ollama | Fits | 31.0 GB | 96k | Estimated | $4,499 | 8bit is the current best practical quantization. 22.0 tok/s is estimated from nearby benchmark coverage. 31.0 GB headroom remains at this quantization. |
| 8 | Mac Studio M3 Ultra 96GB | 170 | 8bit | 10.2 tok/s Fastest evidence path: 8bit · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 63.0 GB | 131k | Estimated | $3,999 | 8bit is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 63.0 GB headroom remains at this quantization. |
| 9 | Mac Mini M4 32GB | 127 | 6bit | 15.0 tok/s Fastest evidence path: 6bit · 15.0 tok/s · Ollama · Estimated | Ollama | Fits | 6.6 GB | 16k | Estimated | $799 | 6bit is the current best practical quantization. 15.0 tok/s is estimated from nearby benchmark coverage. 6.6 GB headroom remains at this quantization. |
| 10 | MacBook Air M4 32GB 13-inch | 127 | 6bit | 15.0 tok/s Fastest evidence path: 6bit · 15.0 tok/s · Ollama · Estimated | Ollama | Fits | 6.6 GB | 16k | Estimated | $1,499 | 6bit is the current best practical quantization. 15.0 tok/s is estimated from nearby benchmark coverage. 6.6 GB headroom remains at this quantization. |
| 11 | MacBook Air M4 32GB 15-inch | 127 | 6bit | 15.0 tok/s Fastest evidence path: 6bit · 15.0 tok/s · Ollama · Estimated | Ollama | Fits | 6.6 GB | 16k | Estimated | $1,699 | 6bit is the current best practical quantization. 15.0 tok/s is estimated from nearby benchmark coverage. 6.6 GB headroom remains at this quantization. |
| 12 | Mac Mini M4 Pro 48GB | 122 | 8bit | 10.2 tok/s Fastest evidence path: 8bit · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 15.0 GB | 40k | Estimated | $1,599 | 8bit is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 15.0 GB headroom remains at this quantization. |
| 13 | MacBook Pro M4 Pro 48GB 14-inch | 122 | 8bit | 10.2 tok/s Fastest evidence path: 8bit · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 15.0 GB | 40k | Estimated | $2,499 | 8bit is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 15.0 GB headroom remains at this quantization. |
| 14 | Mac Studio M4 Max 48GB | 122 | 8bit | 10.2 tok/s Fastest evidence path: 8bit · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 15.0 GB | 40k | Estimated | $2,499 | 8bit is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 15.0 GB headroom remains at this quantization. |
| 15 | MacBook Pro M4 Pro 48GB 16-inch | 122 | 8bit | 10.2 tok/s Fastest evidence path: 8bit · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 15.0 GB | 40k | Estimated | $2,999 | 8bit is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 15.0 GB headroom remains at this quantization. |
| 16 | MacBook Pro M4 Max 48GB 14-inch | 122 | 8bit | 10.2 tok/s Fastest evidence path: 8bit · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 15.0 GB | 40k | Estimated | $3,499 | 8bit is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 15.0 GB headroom remains at this quantization. |
| 17 | MacBook Pro M4 Max 48GB 16-inch | 122 | 8bit | 10.2 tok/s Fastest evidence path: 8bit · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 15.0 GB | 40k | Estimated | $3,999 | 8bit is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 15.0 GB headroom remains at this quantization. |
| 18 | Mac Studio M4 Max 36GB | 109 | Q6_K | 10.2 tok/s Fastest evidence path: Q6_K · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 8.5 GB | 21k | Estimated | $1,999 | Q6_K is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 8.5 GB headroom remains at this quantization. |
| 19 | MacBook Pro M4 Max 36GB 14-inch | 109 | Q6_K | 10.2 tok/s Fastest evidence path: Q6_K · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 8.5 GB | 21k | Estimated | $2,999 | Q6_K is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 8.5 GB headroom remains at this quantization. |
| 20 | MacBook Pro M4 Max 36GB 16-inch | 109 | Q6_K | 10.2 tok/s Fastest evidence path: Q6_K · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 8.5 GB | 21k | Estimated | $3,499 | Q6_K is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 8.5 GB headroom remains at this quantization. |
| 21 | Mac Mini M4 24GB | 99 | Q4_K_M | 10.2 tok/s Fastest evidence path: Q4_K_M · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 4.0 GB | 10k | Estimated | $599 | Q4_K_M is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 4.0 GB headroom remains at this quantization. |
| 22 | MacBook Air M4 24GB 13-inch | 99 | Q4_K_M | 10.2 tok/s Fastest evidence path: Q4_K_M · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 4.0 GB | 10k | Estimated | $1,299 | Q4_K_M is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 4.0 GB headroom remains at this quantization. |
| 23 | Mac Mini M4 Pro 24GB | 99 | Q4_K_M | 10.2 tok/s Fastest evidence path: Q4_K_M · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 4.0 GB | 10k | Estimated | $1,399 | Q4_K_M is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 4.0 GB headroom remains at this quantization. |
| 24 | MacBook Air M4 24GB 15-inch | 99 | Q4_K_M | 10.2 tok/s Fastest evidence path: Q4_K_M · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 4.0 GB | 10k | Estimated | $1,499 | Q4_K_M is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 4.0 GB headroom remains at this quantization. |
| 25 | MacBook Pro M4 Pro 24GB 14-inch | 99 | Q4_K_M | 10.2 tok/s Fastest evidence path: Q4_K_M · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 4.0 GB | 10k | Estimated | $1,999 | Q4_K_M is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 4.0 GB headroom remains at this quantization. |
| 26 | MacBook Pro M4 Pro 24GB 16-inch | 99 | Q4_K_M | 10.2 tok/s Fastest evidence path: Q4_K_M · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 4.0 GB | 10k | Estimated | $2,499 | Q4_K_M is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 4.0 GB headroom remains at this quantization. |
| 27 | Mac Mini M4 16GB | 68 | mlx-dynamic-2.7bpw | 10.2 tok/s Fastest evidence path: mlx-dynamic-2.7bpw · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 3.2 GB | 11k | Estimated | $499 | mlx-dynamic-2.7bpw is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 3.2 GB headroom remains at this quantization. |
| 28 | MacBook Air M4 16GB 13-inch | 68 | mlx-dynamic-2.7bpw | 10.2 tok/s Fastest evidence path: mlx-dynamic-2.7bpw · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 3.2 GB | 11k | Estimated | $1,099 | mlx-dynamic-2.7bpw is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 3.2 GB headroom remains at this quantization. |
| 29 | MacBook Air M4 16GB 15-inch | 68 | mlx-dynamic-2.7bpw | 10.2 tok/s Fastest evidence path: mlx-dynamic-2.7bpw · 10.2 tok/s · Ollama · Estimated | Ollama | Fits | 3.2 GB | 11k | Estimated | $1,299 | mlx-dynamic-2.7bpw is the current best practical quantization. 10.2 tok/s is estimated from nearby benchmark coverage. 3.2 GB headroom remains at this quantization. |