| 1 | Mac Studio M3 Ultra 256GB | 736 | 8bit | 106.0 tok/s Fastest evidence path: 8bit · 106.0 tok/s · MLX · Estimated | MLX | Fits | 246.1 GB | 262k | Estimated | $7,499 | 8bit is the current best practical quantization. 106.0 tok/s is estimated from nearby benchmark coverage. 246.1 GB headroom remains at this quantization. |
| 2 | MacBook Pro M5 Max 128GB 16-inch | 496 | 8bit | 78.0 tok/s Fastest evidence path: 8bit · 78.0 tok/s · MLX · Estimated | MLX | Fits | 118.1 GB | 262k | Estimated | $5,399 | 8bit is the current best practical quantization. 78.0 tok/s is estimated from nearby benchmark coverage. 118.1 GB headroom remains at this quantization. |
| 3 | Mac Mini M4 24GB | 448 | 8bit | 92.0 tok/s Fastest evidence path: 8bit · 92.0 tok/s · MLX · Estimated | MLX | Fits | 14.1 GB | 94k | Estimated | $599 | 8bit is the current best practical quantization. 92.0 tok/s is estimated from nearby benchmark coverage. 14.1 GB headroom remains at this quantization. |
| 4 | MacBook Air M4 24GB 13-inch | 448 | 8bit | 92.0 tok/s Fastest evidence path: 8bit · 92.0 tok/s · MLX · Estimated | MLX | Fits | 14.1 GB | 94k | Estimated | $1,299 | 8bit is the current best practical quantization. 92.0 tok/s is estimated from nearby benchmark coverage. 14.1 GB headroom remains at this quantization. |
| 5 | Mac Mini M4 Pro 24GB | 448 | 8bit | 92.0 tok/s Fastest evidence path: 8bit · 92.0 tok/s · MLX · Estimated | MLX | Fits | 14.1 GB | 94k | Estimated | $1,399 | 8bit is the current best practical quantization. 92.0 tok/s is estimated from nearby benchmark coverage. 14.1 GB headroom remains at this quantization. |
| 6 | MacBook Air M4 24GB 15-inch | 448 | 8bit | 92.0 tok/s Fastest evidence path: 8bit · 92.0 tok/s · MLX · Estimated | MLX | Fits | 14.1 GB | 94k | Estimated | $1,499 | 8bit is the current best practical quantization. 92.0 tok/s is estimated from nearby benchmark coverage. 14.1 GB headroom remains at this quantization. |
| 7 | MacBook Pro M4 Pro 24GB 14-inch | 448 | 8bit | 92.0 tok/s Fastest evidence path: 8bit · 92.0 tok/s · MLX · Estimated | MLX | Fits | 14.1 GB | 94k | Estimated | $1,999 | 8bit is the current best practical quantization. 92.0 tok/s is estimated from nearby benchmark coverage. 14.1 GB headroom remains at this quantization. |
| 8 | MacBook Pro M4 Pro 24GB 16-inch | 448 | 8bit | 92.0 tok/s Fastest evidence path: 8bit · 92.0 tok/s · MLX · Estimated | MLX | Fits | 14.1 GB | 94k | Estimated | $2,499 | 8bit is the current best practical quantization. 92.0 tok/s is estimated from nearby benchmark coverage. 14.1 GB headroom remains at this quantization. |
| 9 | Mac Pro M2 Ultra 192GB | 388 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 182.1 GB | 262k | Estimated | $6,999 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 182.1 GB headroom remains at this quantization. |
| 10 | Mac Studio M4 Max 128GB | 324 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 118.1 GB | 262k | Estimated | $4,499 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 118.1 GB headroom remains at this quantization. |
| 11 | MacBook Pro M4 Max 128GB 16-inch | 324 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 118.1 GB | 262k | Estimated | $5,999 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 118.1 GB headroom remains at this quantization. |
| 12 | Mac Studio M3 Ultra 96GB | 292 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 86.1 GB | 262k | Estimated | $3,999 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 86.1 GB headroom remains at this quantization. |
| 13 | Mac Studio M4 Max 64GB | 260 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 54.1 GB | 262k | Estimated | $2,999 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 54.1 GB headroom remains at this quantization. |
| 14 | MacBook Pro M4 Max 64GB 16-inch | 260 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 54.1 GB | 262k | Estimated | $4,499 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 54.1 GB headroom remains at this quantization. |
| 15 | Mac Mini M4 Pro 48GB | 244 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 38.1 GB | 261k | Estimated | $1,599 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 38.1 GB headroom remains at this quantization. |
| 16 | MacBook Pro M4 Pro 48GB 14-inch | 244 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 38.1 GB | 261k | Estimated | $2,499 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 38.1 GB headroom remains at this quantization. |
| 17 | Mac Studio M4 Max 48GB | 244 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 38.1 GB | 261k | Estimated | $2,499 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 38.1 GB headroom remains at this quantization. |
| 18 | MacBook Pro M4 Pro 48GB 16-inch | 244 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 38.1 GB | 261k | Estimated | $2,999 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 38.1 GB headroom remains at this quantization. |
| 19 | MacBook Pro M4 Max 48GB 14-inch | 244 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 38.1 GB | 261k | Estimated | $3,499 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 38.1 GB headroom remains at this quantization. |
| 20 | MacBook Pro M4 Max 48GB 16-inch | 244 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 38.1 GB | 261k | Estimated | $3,999 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 38.1 GB headroom remains at this quantization. |
| 21 | Mac Studio M4 Max 36GB | 232 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 26.1 GB | 178k | Estimated | $1,999 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 26.1 GB headroom remains at this quantization. |
| 22 | MacBook Pro M4 Max 36GB 14-inch | 232 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 26.1 GB | 178k | Estimated | $2,999 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 26.1 GB headroom remains at this quantization. |
| 23 | MacBook Pro M4 Max 36GB 16-inch | 232 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 26.1 GB | 178k | Estimated | $3,499 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 26.1 GB headroom remains at this quantization. |
| 24 | Mac Mini M4 32GB | 228 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 22.1 GB | 150k | Estimated | $799 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 22.1 GB headroom remains at this quantization. |
| 25 | MacBook Air M4 32GB 13-inch | 228 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 22.1 GB | 150k | Estimated | $1,499 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 22.1 GB headroom remains at this quantization. |
| 26 | MacBook Air M4 32GB 15-inch | 228 | 8bit | 35.0 tok/s Fastest evidence path: 8bit · 35.0 tok/s · llama.cpp · Estimated | llama.cpp | Fits | 22.1 GB | 150k | Estimated | $1,699 | 8bit is the current best practical quantization. 35.0 tok/s is estimated from nearby benchmark coverage. 22.1 GB headroom remains at this quantization. |
| 27 | Mac Mini M4 16GB | 88 | 8bit | 4.1 tok/s Fastest evidence path: Q4_K_M · 72.0 tok/s · LM Studio · Trusted reference | llama.cpp | Fits | 6.1 GB | 39k | Estimated | $499 | 8bit is the current best practical quantization. 4.1 tok/s is estimated from nearby benchmark coverage. 6.1 GB headroom remains at this quantization. |
| 28 | MacBook Air M4 16GB 13-inch | 88 | 8bit | 4.1 tok/s Fastest evidence path: Q4_K_M · 72.0 tok/s · LM Studio · Trusted reference | llama.cpp | Fits | 6.1 GB | 39k | Estimated | $1,099 | 8bit is the current best practical quantization. 4.1 tok/s is estimated from nearby benchmark coverage. 6.1 GB headroom remains at this quantization. |
| 29 | MacBook Air M4 16GB 15-inch | 88 | 8bit | 4.1 tok/s Fastest evidence path: Q4_K_M · 72.0 tok/s · LM Studio · Trusted reference | llama.cpp | Fits | 6.1 GB | 39k | Estimated | $1,299 | 8bit is the current best practical quantization. 4.1 tok/s is estimated from nearby benchmark coverage. 6.1 GB headroom remains at this quantization. |