← All benchmarks

Qwen 2.5 14B — Apple Silicon Benchmarks

Measured inference speed for Qwen 2.5 14B across 1 Apple Silicon chip. Tokens per second at multiple quantization levels. Real runs, not estimates.

Quantizations measured: Q3_K_L

1Benchmark rows
1Chip tiers covered
18.6Fastest avg tok/s (M4 Pro (12-core GPU))
8 GBMinimum RAM observed

Benchmark results for Qwen 2.5 14B

Rows sorted by avg tok/s descending. Click source badge to see original measurement page.

ChipQuantRAM req.ContextAvg tok/sPrompt tok/sRuntimeSource
M4 Pro (12-core GPU)Q3_K_L8.0 GB4k18.6 tok/sref

benchmarks.json — full dataset  ·  models.json — model summaries  ·  benchmarks.csv — CSV export

See all models →