Model Quant Measured Estimated RAM used App Source Date
Qwen 2.5 0.5B Qwen 2.5 0.5B Q8 185 tok/s 100 tok/s +85% 0.9 GB Ollama editorial 2026-03-20
Phi-4 Mini Phi-4 Mini Q4 58 tok/s 15 tok/s +287% 2.4 GB LM Studio editorial 2026-03-20