| 1 | Mac Studio M3 Ultra 256GB | 486 | 8bit | 47.0 tok/s (fastest evidence path: 8bit · 47.0 tok/s · MLX · Estimated) | MLX | Fits | 231.9 GB | 262k | Estimated | $7,499 | 8bit is the current best practical quantization. 47.0 tok/s is estimated from nearby benchmark coverage. 231.9 GB headroom remains at this quantization. |
| 2 | Mac Pro M2 Ultra 192GB | 327 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 167.9 GB | 262k | Estimated | $6,999 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 167.9 GB headroom remains at this quantization. |
| 3 | Mac Studio M4 Max 128GB | 263 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 103.9 GB | 262k | Estimated | $4,499 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 103.9 GB headroom remains at this quantization. |
| 4 | MacBook Pro M5 Max 128GB 16-inch | 263 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 103.9 GB | 262k | Estimated | $5,399 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 103.9 GB headroom remains at this quantization. |
| 5 | MacBook Pro M4 Max 128GB 16-inch | 263 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 103.9 GB | 262k | Estimated | $5,999 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 103.9 GB headroom remains at this quantization. |
| 6 | Mac Studio M3 Ultra 96GB | 231 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 71.9 GB | 262k | Estimated | $3,999 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 71.9 GB headroom remains at this quantization. |
| 7 | Mac Studio M4 Max 64GB | 199 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 39.9 GB | 207k | Estimated | $2,999 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 39.9 GB headroom remains at this quantization. |
| 8 | MacBook Pro M4 Max 64GB 16-inch | 199 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 39.9 GB | 207k | Estimated | $4,499 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 39.9 GB headroom remains at this quantization. |
| 9 | Mac Mini M4 Pro 48GB | 183 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 23.9 GB | 118k | Estimated | $1,599 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 23.9 GB headroom remains at this quantization. |
| 10 | MacBook Pro M4 Pro 48GB 14-inch | 183 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 23.9 GB | 118k | Estimated | $2,499 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 23.9 GB headroom remains at this quantization. |
| 11 | Mac Studio M4 Max 48GB | 183 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 23.9 GB | 118k | Estimated | $2,499 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 23.9 GB headroom remains at this quantization. |
| 12 | MacBook Pro M4 Pro 48GB 16-inch | 183 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 23.9 GB | 118k | Estimated | $2,999 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 23.9 GB headroom remains at this quantization. |
| 13 | MacBook Pro M4 Max 48GB 14-inch | 183 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 23.9 GB | 118k | Estimated | $3,499 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 23.9 GB headroom remains at this quantization. |
| 14 | MacBook Pro M4 Max 48GB 16-inch | 183 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 23.9 GB | 118k | Estimated | $3,999 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 23.9 GB headroom remains at this quantization. |
| 15 | Mac Studio M4 Max 36GB | 171 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 11.9 GB | 51k | Estimated | $1,999 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 11.9 GB headroom remains at this quantization. |
| 16 | MacBook Pro M4 Max 36GB 14-inch | 171 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 11.9 GB | 51k | Estimated | $2,999 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 11.9 GB headroom remains at this quantization. |
| 17 | MacBook Pro M4 Max 36GB 16-inch | 171 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 11.9 GB | 51k | Estimated | $3,499 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 11.9 GB headroom remains at this quantization. |
| 18 | Mac Mini M4 32GB | 167 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 7.9 GB | 28k | Estimated | $799 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 7.9 GB headroom remains at this quantization. |
| 19 | MacBook Air M4 32GB 13-inch | 167 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 7.9 GB | 28k | Estimated | $1,499 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 7.9 GB headroom remains at this quantization. |
| 20 | MacBook Air M4 32GB 15-inch | 167 | 8bit | 23.4 tok/s (fastest evidence path: 8bit · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 7.9 GB | 28k | Estimated | $1,699 | 8bit is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 7.9 GB headroom remains at this quantization. |
| 21 | Mac Mini M4 24GB | 157 | Q6_K | 23.4 tok/s (fastest evidence path: Q6_K · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 3.9 GB | 10k | Estimated | $599 | Q6_K is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 3.9 GB headroom remains at this quantization. |
| 22 | MacBook Air M4 24GB 13-inch | 157 | Q6_K | 23.4 tok/s (fastest evidence path: Q6_K · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 3.9 GB | 10k | Estimated | $1,299 | Q6_K is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 3.9 GB headroom remains at this quantization. |
| 23 | Mac Mini M4 Pro 24GB | 157 | Q6_K | 23.4 tok/s (fastest evidence path: Q6_K · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 3.9 GB | 10k | Estimated | $1,399 | Q6_K is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 3.9 GB headroom remains at this quantization. |
| 24 | MacBook Air M4 24GB 15-inch | 157 | Q6_K | 23.4 tok/s (fastest evidence path: Q6_K · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 3.9 GB | 10k | Estimated | $1,499 | Q6_K is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 3.9 GB headroom remains at this quantization. |
| 25 | MacBook Pro M4 Pro 24GB 14-inch | 157 | Q6_K | 23.4 tok/s (fastest evidence path: Q6_K · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 3.9 GB | 10k | Estimated | $1,999 | Q6_K is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 3.9 GB headroom remains at this quantization. |
| 26 | MacBook Pro M4 Pro 24GB 16-inch | 157 | Q6_K | 23.4 tok/s (fastest evidence path: Q6_K · 23.4 tok/s · llama.cpp · Estimated) | llama.cpp | Fits | 3.9 GB | 10k | Estimated | $2,499 | Q6_K is the current best practical quantization. 23.4 tok/s is estimated from nearby benchmark coverage. 3.9 GB headroom remains at this quantization. |
| 27 | Mac Mini M4 16GB | 57 | q4.1bit | 0.1 tok/s (fastest evidence path: Q4_0 · 3.4 tok/s · llama.cpp · Community row) | llama.cpp | Fits | 2.8 GB | 11k | Estimated | $499 | q4.1bit is the current best practical quantization. 0.1 tok/s is estimated from nearby benchmark coverage. 2.8 GB headroom remains at this quantization. |
| 28 | MacBook Air M4 16GB 13-inch | 57 | q4.1bit | 0.1 tok/s (fastest evidence path: Q4_0 · 3.4 tok/s · llama.cpp · Community row) | llama.cpp | Fits | 2.8 GB | 11k | Estimated | $1,099 | q4.1bit is the current best practical quantization. 0.1 tok/s is estimated from nearby benchmark coverage. 2.8 GB headroom remains at this quantization. |
| 29 | MacBook Air M4 16GB 15-inch | 57 | q4.1bit | 0.1 tok/s (fastest evidence path: Q4_0 · 3.4 tok/s · llama.cpp · Community row) | llama.cpp | Fits | 2.8 GB | 11k | Estimated | $1,299 | q4.1bit is the current best practical quantization. 0.1 tok/s is estimated from nearby benchmark coverage. 2.8 GB headroom remains at this quantization. |
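
The headroom figures in the rows above look consistent with a simple subtraction: total unified memory minus a fixed footprint for the chosen quantization. The table does not state this, so the sketch below treats it as an assumption and only back-computes the footprint each row would imply. The function name `implied_footprint_gb` and the four sample rows (copied from the table) are illustrative choices.

```python
def implied_footprint_gb(total_memory_gb: float, headroom_gb: float) -> float:
    """Footprint implied by a row, ASSUMING headroom = total memory - footprint."""
    return round(total_memory_gb - headroom_gb, 1)


# (device, quantization, total memory in GB, headroom in GB), copied from the table.
sample_rows = [
    ("Mac Studio M3 Ultra 256GB", "8bit", 256, 231.9),
    ("Mac Mini M4 32GB", "8bit", 32, 7.9),
    ("Mac Mini M4 24GB", "Q6_K", 24, 3.9),
    ("Mac Mini M4 16GB", "q4.1bit", 16, 2.8),
]

# Map each quantization to the set of footprints its rows imply.
footprints_by_quant: dict[str, set[float]] = {}
for _device, quant, total_gb, headroom_gb in sample_rows:
    footprints_by_quant.setdefault(quant, set()).add(
        implied_footprint_gb(total_gb, headroom_gb)
    )

for quant, footprints in footprints_by_quant.items():
    print(f"{quant}: implied footprint {sorted(footprints)} GB")
```

Under that reading, both sampled 8bit rows imply the same 24.1 GB footprint, the Q6_K row implies 20.1 GB, and the q4.1bit row implies 13.2 GB. If a row implied a different footprint for the same quantization, the subtraction assumption would not hold for it.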