Model Quant Measured Estimated RAM used App Source Date
Gemma 3 1B Gemma 3 1B Q8 130 tok/s 91 tok/s +43% 1.5 GB Ollama editorial 2026-03-15
Phi-4 Mini Phi-4 Mini Q4 82 tok/s 26 tok/s +215% 2.4 GB LM Studio editorial 2026-03-15