| 1 | Mac Studio M3 Ultra 256GB | 387 | 8bit | 43.0 tok/s Fastest evidence path: 3bit · 57.0 tok/s · MLX · Estimated | MLX | Fits | 141.1 GB | 262k | Community row | $7,499 | 8bit is the current best practical quantization. 43.0 tok/s is backed by direct benchmark coverage. 141.1 GB headroom remains at this quantization. |
| 2 | Mac Pro M2 Ultra 192GB | 371 | 8bit | 57.0 tok/s Fastest evidence path: 8bit · 57.0 tok/s · MLX · Estimated | MLX | Fits | 77.1 GB | 262k | Estimated | $6,999 | 8bit is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 77.1 GB headroom remains at this quantization. |
| 3 | MacBook Pro M5 Max 128GB 16-inch | 336 | Q6_K | 60.6 tok/s Fastest evidence path: Q6_K · 60.6 tok/s · MLX · Estimated | MLX | Fits | 33.6 GB | 165k | Estimated | $5,399 | Q6_K is the current best practical quantization. 60.6 tok/s is estimated from nearby benchmark coverage. 33.6 GB headroom remains at this quantization. |
| 4 | Mac Studio M4 Max 128GB | 322 | Q6_K | 57.0 tok/s Fastest evidence path: Q6_K · 57.0 tok/s · MLX · Estimated | MLX | Fits | 33.6 GB | 165k | Estimated | $4,499 | Q6_K is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 33.6 GB headroom remains at this quantization. |
| 5 | MacBook Pro M4 Max 128GB 16-inch | 322 | Q6_K | 57.0 tok/s Fastest evidence path: Q6_K · 57.0 tok/s · MLX · Estimated | MLX | Fits | 33.6 GB | 165k | Estimated | $5,999 | Q6_K is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 33.6 GB headroom remains at this quantization. |
| 6 | Mac Studio M3 Ultra 96GB | 306 | 5bit | 57.0 tok/s Fastest evidence path: 5bit · 57.0 tok/s · MLX · Estimated | MLX | Fits | 23.7 GB | 110k | Estimated | $3,999 | 5bit is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 23.7 GB headroom remains at this quantization. |
| 7 | Mac Studio M4 Max 64GB | 263 | Q3_K_L | 57.0 tok/s Fastest evidence path: Q3_K_L · 57.0 tok/s · MLX · Estimated | MLX | Fits | 11.2 GB | 26k | Estimated | $2,999 | Q3_K_L is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 11.2 GB headroom remains at this quantization. |
| 8 | MacBook Pro M4 Max 64GB 16-inch | 263 | Q3_K_L | 57.0 tok/s Fastest evidence path: Q3_K_L · 57.0 tok/s · MLX · Estimated | MLX | Fits | 11.2 GB | 26k | Estimated | $4,499 | Q3_K_L is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 11.2 GB headroom remains at this quantization. |
| 9 | Mac Mini M4 Pro 48GB | 260 | mlx-dynamic-2.7bpw | 57.0 tok/s Fastest evidence path: mlx-dynamic-2.7bpw · 57.0 tok/s · MLX · Estimated | MLX | Fits | 8.4 GB | 21k | Estimated | $1,599 | mlx-dynamic-2.7bpw is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 8.4 GB headroom remains at this quantization. |
| 10 | MacBook Pro M4 Pro 48GB 14-inch | 260 | mlx-dynamic-2.7bpw | 57.0 tok/s Fastest evidence path: mlx-dynamic-2.7bpw · 57.0 tok/s · MLX · Estimated | MLX | Fits | 8.4 GB | 21k | Estimated | $2,499 | mlx-dynamic-2.7bpw is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 8.4 GB headroom remains at this quantization. |
| 11 | Mac Studio M4 Max 48GB | 260 | mlx-dynamic-2.7bpw | 57.0 tok/s Fastest evidence path: mlx-dynamic-2.7bpw · 57.0 tok/s · MLX · Estimated | MLX | Fits | 8.4 GB | 21k | Estimated | $2,499 | mlx-dynamic-2.7bpw is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 8.4 GB headroom remains at this quantization. |
| 12 | MacBook Pro M4 Pro 48GB 16-inch | 260 | mlx-dynamic-2.7bpw | 57.0 tok/s Fastest evidence path: mlx-dynamic-2.7bpw · 57.0 tok/s · MLX · Estimated | MLX | Fits | 8.4 GB | 21k | Estimated | $2,999 | mlx-dynamic-2.7bpw is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 8.4 GB headroom remains at this quantization. |
| 13 | MacBook Pro M4 Max 48GB 14-inch | 260 | mlx-dynamic-2.7bpw | 57.0 tok/s Fastest evidence path: mlx-dynamic-2.7bpw · 57.0 tok/s · MLX · Estimated | MLX | Fits | 8.4 GB | 21k | Estimated | $3,499 | mlx-dynamic-2.7bpw is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 8.4 GB headroom remains at this quantization. |
| 14 | MacBook Pro M4 Max 48GB 16-inch | 260 | mlx-dynamic-2.7bpw | 57.0 tok/s Fastest evidence path: mlx-dynamic-2.7bpw · 57.0 tok/s · MLX · Estimated | MLX | Fits | 8.4 GB | 21k | Estimated | $3,999 | mlx-dynamic-2.7bpw is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 8.4 GB headroom remains at this quantization. |
| 15 | Mac Studio M4 Max 36GB | 258 | IQ2_XS | 57.0 tok/s Fastest evidence path: IQ2_XS · 57.0 tok/s · MLX · Estimated | MLX | Fits | 6.3 GB | 19k | Estimated | $1,999 | IQ2_XS is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 6.3 GB headroom remains at this quantization. |
| 16 | MacBook Pro M4 Max 36GB 14-inch | 258 | IQ2_XS | 57.0 tok/s Fastest evidence path: IQ2_XS · 57.0 tok/s · MLX · Estimated | MLX | Fits | 6.3 GB | 19k | Estimated | $2,999 | IQ2_XS is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 6.3 GB headroom remains at this quantization. |
| 17 | MacBook Pro M4 Max 36GB 16-inch | 258 | IQ2_XS | 57.0 tok/s Fastest evidence path: IQ2_XS · 57.0 tok/s · MLX · Estimated | MLX | Fits | 6.3 GB | 19k | Estimated | $3,499 | IQ2_XS is the current best practical quantization. 57.0 tok/s is estimated from nearby benchmark coverage. 6.3 GB headroom remains at this quantization. |
| 18 | Mac Mini M4 16GB | 0 | F32 | — | MLX | No | -439.7 GB | — | Estimated | $499 | Qwen3.5-122B-A10B does not fit on Mac Mini M4 16GB at the current practical quantization. |
| 19 | Mac Mini M4 24GB | 0 | F32 | — | MLX | No | -431.7 GB | — | Estimated | $599 | Qwen3.5-122B-A10B does not fit on Mac Mini M4 24GB at the current practical quantization. |
| 20 | Mac Mini M4 32GB | 0 | F32 | — | MLX | No | -423.7 GB | — | Estimated | $799 | Qwen3.5-122B-A10B does not fit on Mac Mini M4 32GB at the current practical quantization. |
| 21 | MacBook Air M4 16GB 13-inch | 0 | F32 | — | MLX | No | -439.7 GB | — | Estimated | $1,099 | Qwen3.5-122B-A10B does not fit on MacBook Air M4 16GB 13-inch at the current practical quantization. |
| 22 | MacBook Air M4 24GB 13-inch | 0 | F32 | — | MLX | No | -431.7 GB | — | Estimated | $1,299 | Qwen3.5-122B-A10B does not fit on MacBook Air M4 24GB 13-inch at the current practical quantization. |
| 23 | MacBook Air M4 16GB 15-inch | 0 | F32 | — | MLX | No | -439.7 GB | — | Estimated | $1,299 | Qwen3.5-122B-A10B does not fit on MacBook Air M4 16GB 15-inch at the current practical quantization. |
| 24 | Mac Mini M4 Pro 24GB | 0 | F32 | — | MLX | No | -431.7 GB | — | Estimated | $1,399 | Qwen3.5-122B-A10B does not fit on Mac Mini M4 Pro 24GB at the current practical quantization. |
| 25 | MacBook Air M4 32GB 13-inch | 0 | F32 | — | MLX | No | -423.7 GB | — | Estimated | $1,499 | Qwen3.5-122B-A10B does not fit on MacBook Air M4 32GB 13-inch at the current practical quantization. |
| 26 | MacBook Air M4 24GB 15-inch | 0 | F32 | — | MLX | No | -431.7 GB | — | Estimated | $1,499 | Qwen3.5-122B-A10B does not fit on MacBook Air M4 24GB 15-inch at the current practical quantization. |
| 27 | MacBook Air M4 32GB 15-inch | 0 | F32 | — | MLX | No | -423.7 GB | — | Estimated | $1,699 | Qwen3.5-122B-A10B does not fit on MacBook Air M4 32GB 15-inch at the current practical quantization. |
| 28 | MacBook Pro M4 Pro 24GB 14-inch | 0 | F32 | — | MLX | No | -431.7 GB | — | Estimated | $1,999 | Qwen3.5-122B-A10B does not fit on MacBook Pro M4 Pro 24GB 14-inch at the current practical quantization. |
| 29 | MacBook Pro M4 Pro 24GB 16-inch | 0 | F32 | — | MLX | No | -431.7 GB | — | Estimated | $2,499 | Qwen3.5-122B-A10B does not fit on MacBook Pro M4 Pro 24GB 16-inch at the current practical quantization. |