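| Model | Quant | Measured | Estimated | Measured vs. estimated | RAM used | App | Source | Date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Phi-4 Mini | Q5 | 68 tok/s | 26 tok/s | +162% | 2.9 GB | Ollama | editorial | 2026-03-20 |
| Mistral 7B | Q4 | 48 tok/s | 14 tok/s | +243% | 4.3 GB | LM Studio | editorial | 2026-03-20 |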
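The percentage in each row appears to be the relative gain of the measured throughput over the estimated one, rounded to the nearest whole percent. The sketch below reproduces the two figures under that assumption; the function name `percent_gain` is hypothetical and not taken from the source.

```python
def percent_gain(measured_tps: float, estimated_tps: float) -> int:
    """Relative gain of measured over estimated throughput, in whole percent.

    Assumption: the table's percentage is (measured - estimated) / estimated.
    """
    if estimated_tps <= 0:
        raise ValueError("estimated throughput must be positive")
    return round((measured_tps - estimated_tps) / estimated_tps * 100)


# Rows from the table: (model, measured tok/s, estimated tok/s)
rows = [
    ("Phi-4 Mini", 68, 26),
    ("Mistral 7B", 48, 14),
]

for model, measured, estimated in rows:
    print(f"{model}: {percent_gain(measured, estimated):+d}%")
```

Both rows match the table (+162% and +243%), which supports reading the column as measured relative to estimated rather than the reverse.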