Model Quant Measured Estimated RAM used App Source Date
Mistral 7B Mistral 7B Q4 200 tok/s 69 tok/s +190% 4.3 GB LM Studio editorial 2026-03-20
Phi-4 Mini Phi-4 Mini Q8 200 tok/s 129 tok/s +55% 4.6 GB Ollama editorial 2026-03-20