| 1 | Mac Studio M3 Ultra 256GB | 540 | 8bit | 62.0 tok/s Fastest evidence path: 8bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 226.3 GB | 131k | Estimated | $7,499 | 8bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 226.3 GB headroom remains at this quantization. |
| 2 | Mac Pro M2 Ultra 192GB | 476 | 8bit | 62.0 tok/s Fastest evidence path: 8bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 162.3 GB | 131k | Estimated | $6,999 | 8bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 162.3 GB headroom remains at this quantization. |
| 3 | Mac Studio M4 Max 128GB | 445 | 8bit | 70.2 tok/s Fastest evidence path: 8bit · 70.2 tok/s · LM Studio · Estimated | LM Studio | Fits | 98.3 GB | 131k | Estimated | $4,499 | 8bit is the current best practical quantization. 70.2 tok/s is estimated from nearby benchmark coverage. 98.3 GB headroom remains at this quantization. |
| 4 | MacBook Pro M4 Max 128GB 16-inch | 445 | 8bit | 70.2 tok/s Fastest evidence path: 8bit · 70.2 tok/s · LM Studio · Estimated | LM Studio | Fits | 98.3 GB | 131k | Estimated | $5,999 | 8bit is the current best practical quantization. 70.2 tok/s is estimated from nearby benchmark coverage. 98.3 GB headroom remains at this quantization. |
| 5 | Mac Studio M4 Max 64GB | 440 | 8bit | 84.9 tok/s Fastest evidence path: Q4 · 92.1 tok/s · MLX · Trusted reference | MLX | Fits | 34.3 GB | 131k | Estimated | $2,999 | 8bit is the current best practical quantization. 84.9 tok/s is estimated from nearby benchmark coverage. 34.3 GB headroom remains at this quantization. |
| 6 | MacBook Pro M4 Max 64GB 16-inch | 440 | 8bit | 84.9 tok/s Fastest evidence path: Q4 · 92.1 tok/s · MLX · Trusted reference | MLX | Fits | 34.3 GB | 131k | Estimated | $4,499 | 8bit is the current best practical quantization. 84.9 tok/s is estimated from nearby benchmark coverage. 34.3 GB headroom remains at this quantization. |
| 7 | MacBook Pro M5 Max 128GB 16-inch | 412 | 8bit | 62.0 tok/s Fastest evidence path: 8bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 98.3 GB | 131k | Estimated | $5,399 | 8bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 98.3 GB headroom remains at this quantization. |
| 8 | Mac Studio M3 Ultra 96GB | 380 | 8bit | 62.0 tok/s Fastest evidence path: 8bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 66.3 GB | 131k | Estimated | $3,999 | 8bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 66.3 GB headroom remains at this quantization. |
| 9 | Mac Studio M4 Max 36GB | 320 | 8bit | 62.0 tok/s Fastest evidence path: 8bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 6.3 GB | 18k | Estimated | $1,999 | 8bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 6.3 GB headroom remains at this quantization. |
| 10 | MacBook Pro M4 Max 36GB 14-inch | 320 | 8bit | 62.0 tok/s Fastest evidence path: 8bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 6.3 GB | 18k | Estimated | $2,999 | 8bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 6.3 GB headroom remains at this quantization. |
| 11 | MacBook Pro M4 Max 36GB 16-inch | 320 | 8bit | 62.0 tok/s Fastest evidence path: 8bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 6.3 GB | 18k | Estimated | $3,499 | 8bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 6.3 GB headroom remains at this quantization. |
| 12 | Mac Mini M4 32GB | 315 | Q6_K | 62.0 tok/s Fastest evidence path: Q6_K · 62.0 tok/s · MLX · Estimated | MLX | Fits | 7.4 GB | 37k | Estimated | $799 | Q6_K is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 7.4 GB headroom remains at this quantization. |
| 13 | MacBook Air M4 32GB 13-inch | 315 | Q6_K | 62.0 tok/s Fastest evidence path: Q6_K · 62.0 tok/s · MLX · Estimated | MLX | Fits | 7.4 GB | 37k | Estimated | $1,499 | Q6_K is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 7.4 GB headroom remains at this quantization. |
| 14 | MacBook Air M4 32GB 15-inch | 315 | Q6_K | 62.0 tok/s Fastest evidence path: Q6_K · 62.0 tok/s · MLX · Estimated | MLX | Fits | 7.4 GB | 37k | Estimated | $1,699 | Q6_K is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 7.4 GB headroom remains at this quantization. |
| 15 | Mac Mini M4 Pro 48GB | 312 | 8bit | 55.0 tok/s Fastest evidence path: 8bit · 55.0 tok/s · MLX · Community row | MLX | Fits | 18.3 GB | 130k | Community row | $1,599 | 8bit is the current best practical quantization. 55.0 tok/s is backed by direct benchmark coverage. 18.3 GB headroom remains at this quantization. |
| 16 | MacBook Pro M4 Pro 48GB 14-inch | 312 | 8bit | 55.0 tok/s Fastest evidence path: 8bit · 55.0 tok/s · MLX · Community row | MLX | Fits | 18.3 GB | 130k | Community row | $2,499 | 8bit is the current best practical quantization. 55.0 tok/s is backed by direct benchmark coverage. 18.3 GB headroom remains at this quantization. |
| 17 | MacBook Pro M4 Pro 48GB 16-inch | 312 | 8bit | 55.0 tok/s Fastest evidence path: 8bit · 55.0 tok/s · MLX · Community row | MLX | Fits | 18.3 GB | 130k | Community row | $2,999 | 8bit is the current best practical quantization. 55.0 tok/s is backed by direct benchmark coverage. 18.3 GB headroom remains at this quantization. |
| 18 | Mac Mini M4 16GB | 276 | 3bit | 62.0 tok/s Fastest evidence path: 3bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 4.1 GB | 27k | Estimated | $499 | 3bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 4.1 GB headroom remains at this quantization. |
| 19 | MacBook Air M4 16GB 13-inch | 276 | 3bit | 62.0 tok/s Fastest evidence path: 3bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 4.1 GB | 27k | Estimated | $1,099 | 3bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 4.1 GB headroom remains at this quantization. |
| 20 | MacBook Air M4 16GB 15-inch | 276 | 3bit | 62.0 tok/s Fastest evidence path: 3bit · 62.0 tok/s · MLX · Estimated | MLX | Fits | 4.1 GB | 27k | Estimated | $1,299 | 3bit is the current best practical quantization. 62.0 tok/s is estimated from nearby benchmark coverage. 4.1 GB headroom remains at this quantization. |
| 21 | Mac Studio M4 Max 48GB | 252 | 8bit | 42.0 tok/s Fastest evidence path: 8bit · 42.0 tok/s · Ollama · Estimated | Ollama | Fits | 18.3 GB | 130k | Estimated | $2,499 | 8bit is the current best practical quantization. 42.0 tok/s is estimated from nearby benchmark coverage. 18.3 GB headroom remains at this quantization. |
| 22 | MacBook Pro M4 Max 48GB 14-inch | 252 | 8bit | 42.0 tok/s Fastest evidence path: 8bit · 42.0 tok/s · Ollama · Estimated | Ollama | Fits | 18.3 GB | 130k | Estimated | $3,499 | 8bit is the current best practical quantization. 42.0 tok/s is estimated from nearby benchmark coverage. 18.3 GB headroom remains at this quantization. |
| 23 | MacBook Pro M4 Max 48GB 16-inch | 252 | 8bit | 42.0 tok/s Fastest evidence path: 8bit · 42.0 tok/s · Ollama · Estimated | Ollama | Fits | 18.3 GB | 130k | Estimated | $3,999 | 8bit is the current best practical quantization. 42.0 tok/s is estimated from nearby benchmark coverage. 18.3 GB headroom remains at this quantization. |
| 24 | Mac Mini M4 24GB | 199 | 5bit | 35.0 tok/s Fastest evidence path: 5bit · 35.0 tok/s · MLX · Estimated | MLX | Fits | 5.0 GB | 23k | Estimated | $599 | 5bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 5.0 GB headroom remains at this quantization. |
| 25 | MacBook Air M4 24GB 13-inch | 199 | 5bit | 35.0 tok/s Fastest evidence path: 5bit · 35.0 tok/s · MLX · Estimated | MLX | Fits | 5.0 GB | 23k | Estimated | $1,299 | 5bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 5.0 GB headroom remains at this quantization. |
| 26 | Mac Mini M4 Pro 24GB | 199 | 5bit | 35.0 tok/s Fastest evidence path: 5bit · 35.0 tok/s · MLX · Estimated | MLX | Fits | 5.0 GB | 23k | Estimated | $1,399 | 5bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 5.0 GB headroom remains at this quantization. |
| 27 | MacBook Air M4 24GB 15-inch | 199 | 5bit | 35.0 tok/s Fastest evidence path: 5bit · 35.0 tok/s · MLX · Estimated | MLX | Fits | 5.0 GB | 23k | Estimated | $1,499 | 5bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 5.0 GB headroom remains at this quantization. |
| 28 | MacBook Pro M4 Pro 24GB 14-inch | 199 | 5bit | 35.0 tok/s Fastest evidence path: 5bit · 35.0 tok/s · MLX · Estimated | MLX | Fits | 5.0 GB | 23k | Estimated | $1,999 | 5bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 5.0 GB headroom remains at this quantization. |
| 29 | MacBook Pro M4 Pro 24GB 16-inch | 199 | 5bit | 35.0 tok/s Fastest evidence path: 5bit · 35.0 tok/s · MLX · Estimated | MLX | Fits | 5.0 GB | 23k | Estimated | $2,499 | 5bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 5.0 GB headroom remains at this quantization. |