Model Quant Measured Estimated RAM used App Source Date
Llama 3.2 1B Llama 3.2 1B Q8 48 tok/s 25 tok/s +92% 1.8 GB Ollama editorial 2026-03-15
Phi-4 Mini Phi-4 Mini Q4 28 tok/s 9 tok/s +211% 2.4 GB LM Studio editorial 2026-03-15