| 1 | Mac Studio M3 Ultra 256GB | 616 | 8bit | 80.0 tok/s | MLX | Fits | Measured | $7,499 | 8bit is the current best practical quantization. 80.0 tok/s is directly measured here. 222.3 GB headroom remains at this quantization. |
| 2 | Mac Pro M2 Ultra 192GB | 544 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $6,999 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 158.3 GB headroom remains at this quantization. |
| 3 | Mac Studio M4 Max 128GB | 480 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $4,499 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 94.3 GB headroom remains at this quantization. |
| 4 | MacBook Pro M4 Max 128GB 16-inch | 480 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $5,999 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 94.3 GB headroom remains at this quantization. |
| 5 | Mac Studio M3 Ultra 96GB | 448 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $3,999 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 62.3 GB headroom remains at this quantization. |
| 6 | Mac Studio M4 Max 64GB | 416 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $2,999 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 30.3 GB headroom remains at this quantization. |
| 7 | MacBook Pro M4 Max 64GB 16-inch | 416 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $4,499 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 30.3 GB headroom remains at this quantization. |
| 8 | Mac Mini M4 Pro 48GB | 400 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $1,599 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 14.3 GB headroom remains at this quantization. |
| 9 | MacBook Pro M4 Pro 48GB 14-inch | 400 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $2,499 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 14.3 GB headroom remains at this quantization. |
| 10 | Mac Studio M4 Max 48GB | 400 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $2,499 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 14.3 GB headroom remains at this quantization. |
| 11 | MacBook Pro M4 Pro 48GB 16-inch | 400 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $2,999 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 14.3 GB headroom remains at this quantization. |
| 12 | MacBook Pro M4 Max 48GB 14-inch | 400 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $3,499 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 14.3 GB headroom remains at this quantization. |
| 13 | MacBook Pro M4 Max 48GB 16-inch | 400 | 8bit | 80.0 tok/s | MLX | Fits | Estimated | $3,999 | 8bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 14.3 GB headroom remains at this quantization. |
| 14 | Mac Studio M4 Max 36GB | 388 | Q6_K | 80.0 tok/s | MLX | Fits | Estimated | $1,999 | Q6_K is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 8.1 GB headroom remains at this quantization. |
| 15 | MacBook Pro M4 Max 36GB 14-inch | 388 | Q6_K | 80.0 tok/s | MLX | Fits | Estimated | $2,999 | Q6_K is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 8.1 GB headroom remains at this quantization. |
| 16 | MacBook Pro M4 Max 36GB 16-inch | 388 | Q6_K | 80.0 tok/s | MLX | Fits | Estimated | $3,499 | Q6_K is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 8.1 GB headroom remains at this quantization. |
| 17 | Mac Mini M4 32GB | 386 | 6bit | 80.0 tok/s | MLX | Fits | Estimated | $799 | 6bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 6.4 GB headroom remains at this quantization. |
| 18 | MacBook Air M4 32GB 13-inch | 386 | 6bit | 80.0 tok/s | MLX | Fits | Estimated | $1,499 | 6bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 6.4 GB headroom remains at this quantization. |
| 19 | MacBook Air M4 32GB 15-inch | 386 | 6bit | 80.0 tok/s | MLX | Fits | Estimated | $1,699 | 6bit is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 6.4 GB headroom remains at this quantization. |
| 20 | Mac Mini M4 24GB | 378 | Q4_K_M | 80.0 tok/s | MLX | Fits | Estimated | $599 | Q4_K_M is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 4.2 GB headroom remains at this quantization. |
| 21 | MacBook Air M4 24GB 13-inch | 378 | Q4_K_M | 80.0 tok/s | MLX | Fits | Estimated | $1,299 | Q4_K_M is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 4.2 GB headroom remains at this quantization. |
| 22 | Mac Mini M4 Pro 24GB | 378 | Q4_K_M | 80.0 tok/s | MLX | Fits | Estimated | $1,399 | Q4_K_M is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 4.2 GB headroom remains at this quantization. |
| 23 | MacBook Air M4 24GB 15-inch | 378 | Q4_K_M | 80.0 tok/s | MLX | Fits | Estimated | $1,499 | Q4_K_M is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 4.2 GB headroom remains at this quantization. |
| 24 | MacBook Pro M4 Pro 24GB 14-inch | 378 | Q4_K_M | 80.0 tok/s | MLX | Fits | Estimated | $1,999 | Q4_K_M is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 4.2 GB headroom remains at this quantization. |
| 25 | MacBook Pro M4 Pro 24GB 16-inch | 378 | Q4_K_M | 80.0 tok/s | MLX | Fits | Estimated | $2,499 | Q4_K_M is the current best practical quantization. 80.0 tok/s is estimated from nearby benchmark coverage. 4.2 GB headroom remains at this quantization. |
| 26 | Mac Mini M4 16GB | 33 | Q2_K | 1.3 tok/s | llama.cpp | Fits | Estimated | $499 | Q2_K is the current best practical quantization. 1.3 tok/s is estimated from nearby benchmark coverage. 4.2 GB headroom remains at this quantization. |
| 27 | MacBook Air M4 16GB 13-inch | 33 | Q2_K | 1.3 tok/s | llama.cpp | Fits | Estimated | $1,099 | Q2_K is the current best practical quantization. 1.3 tok/s is estimated from nearby benchmark coverage. 4.2 GB headroom remains at this quantization. |
| 28 | MacBook Air M4 16GB 15-inch | 33 | Q2_K | 1.3 tok/s | llama.cpp | Fits | Estimated | $1,299 | Q2_K is the current best practical quantization. 1.3 tok/s is estimated from nearby benchmark coverage. 4.2 GB headroom remains at this quantization. |