Model Quant Measured Estimated RAM used App Source Date
Gemma 3 1B Gemma 3 1B Q8 62 tok/s 38 tok/s +63% 1.5 GB LM Studio editorial 2026-03-10
Phi-4 Mini Phi-4 Mini Q4 40 tok/s 19 tok/s +111% 2.4 GB LM Studio editorial 2026-03-10