Gaming PC (RTX 4070) Benchmarks

BENCHMARK RESULTS (75)

Model	Quant	Measured	Estimated (measured vs. estimate)	RAM used	App	Source	Date
🗣️ KittenTTS Mini	FP16	219 tok/s	200 tok/s +10%	0.2 GB	Xybrid CLI	community	2026-01-02
LFM2.5 1.2B	FP16	218 tok/s	200 tok/s +9%	2.6 GB	Xybrid CLI	editorial	2026-01-06
🗣️ OuteTTS 0.3 500M	FP16	218 tok/s	200 tok/s +9%	1.2 GB	Xybrid CLI	community	2026-04-21
Bonsai Image 4B	Q2	217 tok/s	200 tok/s +9%	2.2 GB	Xybrid CLI	community	2026-03-15
🗣️ KittenTTS Nano	FP16	215 tok/s	200 tok/s +8%	0.1 GB	Piper	editorial	2026-05-11
🗣️ Kokoro 82M	FP16	215 tok/s	200 tok/s +8%	0.4 GB	Piper	community	2026-04-08
Whisper Small	FP16	212 tok/s	200 tok/s +6%	0.6 GB	Whisper Transcription	community	2026-01-03
Whisper Medium	FP16	211 tok/s	200 tok/s +6%	1.7 GB	Xybrid CLI	editorial	2026-01-17
SmolLM2 135M	FP16	210 tok/s	200 tok/s +5%	0.3 GB	Jan	community	2026-03-16
🗣️ NeuTTS Air	FP16	210 tok/s	200 tok/s +5%	1.9 GB	Xybrid CLI	community	2026-05-24
Gemma 3 1B	FP16	207 tok/s	200 tok/s +4%	2.8 GB	Xybrid CLI	community	2026-05-25
Qwen 2.5 Coder 0.5B	FP16	207 tok/s	200 tok/s +4%	1.5 GB	Jan	community	2026-04-22
🔢 all-MiniLM-L6-v2	FP16	207 tok/s	200 tok/s +4%	0.1 GB	LM Studio	editorial	2026-01-15
👁️ SmolVLM 500M	FP16	206 tok/s	200 tok/s +3%	1.3 GB	Jan	community	2026-01-06
🔢 Nomic Embed Text	FP16	204 tok/s	200 tok/s +2%	0.3 GB	Jan	community	2026-05-23
Qwen 2.5 0.5B	FP16	199 tok/s	200 tok/s +0%	1.3 GB	Xybrid CLI	community	2026-04-02
TinyLlama 1.1B	FP16	199 tok/s	200 tok/s +0%	2.4 GB	LM Studio	community	2026-04-04
Ternary Bonsai 1.7B	Q2	197 tok/s	200 tok/s -1%	0.5 GB	Xybrid CLI	community	2026-02-23
Ternary Bonsai 8B	Q2	197 tok/s	200 tok/s -1%	1.9 GB	Xybrid CLI	community	2026-05-16
Whisper Tiny	FP16	197 tok/s	200 tok/s -1%	0.1 GB	Xybrid CLI	community	2026-01-03
🎙️ Distil-Whisper Large V3	FP16	193 tok/s	200 tok/s -4%	1.9 GB	Whisper Transcription	community	2026-04-04
🔢 BGE Small	FP16	191 tok/s	200 tok/s -4%	0.1 GB	LM Studio	editorial	2026-04-12
🔢 GTE Large	FP16	191 tok/s	200 tok/s -4%	0.8 GB	Ollama	community	2026-04-19
🎨 Stable Diffusion Turbo	FP16	191 tok/s	200 tok/s -4%	2.5 GB	Ollama	community	2026-03-16
Wav2Vec2 Base	FP16	191 tok/s	200 tok/s -4%	0.3 GB	Whisper Transcription	community	2026-04-03
SmolLM2 360M	FP16	188 tok/s	200 tok/s -6%	1.0 GB	LM Studio	community	2026-05-26
Bonsai 8B (1-bit)	Q1	185 tok/s	200 tok/s -7%	1.3 GB	Xybrid CLI	community	2026-01-18
Llama 3.2 1B	FP16	180 tok/s	187 tok/s -4%	3.5 GB	Xybrid CLI	community	2026-04-03
Qwen 3.5 0.8B	FP16	179 tok/s	200 tok/s -10%	2.1 GB	LM Studio	editorial	2026-03-15
Ternary Bonsai 4B	Q2	176 tok/s	200 tok/s -12%	1.1 GB	Xybrid CLI	editorial	2026-04-22
Whisper Large V3	FP16	161 tok/s	163 tok/s -1%	3.8 GB	Xybrid CLI	community	2026-04-23
StableLM 2 1.6B	FP16	156 tok/s	153 tok/s +2%	3.8 GB	LM Studio	editorial	2026-05-08
🧠 DeepSeek R1 Distill 1.5B	FP16	156 tok/s	153 tok/s +2%	3.6 GB	Jan	community	2026-02-07
🗣️ Dia 1.6B	FP16	154 tok/s	153 tok/s +1%	3.6 GB	Piper	community	2026-05-11
👁️ Moondream 2B	FP16	147 tok/s	136 tok/s +8%	4.3 GB	Xybrid CLI	community	2026-04-20
Qwen 2.5 Coder 1.5B	FP16	144 tok/s	158 tok/s -9%	3.6 GB	Jan	community	2026-05-14
SmolLM2 1.7B	FP16	134 tok/s	148 tok/s -9%	3.8 GB	Jan	community	2026-04-10
Mistral 7B	Q4	128 tok/s	61 tok/s +110%	4.3 GB	LM Studio	editorial	2026-02-10
Qwen 3.5 2B	FP16	128 tok/s	120 tok/s +7%	4.6 GB	Xybrid CLI	community	2026-02-07
Gemma 4 E2B	FP16	123 tok/s	112 tok/s +10%	5.1 GB	Ollama	community	2026-04-08
Qwen 2.5 7B	Q4	120 tok/s	62 tok/s +94%	4.6 GB	Ollama	community	2026-02-12
Phi-4 Mini	Q8	118 tok/s	65 tok/s +82%	4.6 GB	LM Studio	editorial	2026-02-10
🎨 Stable Diffusion 3.5 Medium	FP16	107 tok/s	110 tok/s -3%	5.5 GB	Xybrid CLI	community	2026-05-15
Qwen 2.5 3B	FP16	87 tok/s	79 tok/s +10%	7.2 GB	Xybrid CLI	editorial	2026-02-08
👨‍💻 StarCoder2 3B	FP16	80 tok/s	81 tok/s -1%	6.9 GB	Ollama	community	2026-01-01
👨‍💻 DeepSeek Coder 6.7B	Q8	78 tok/s	70 tok/s +11%	7.9 GB	Xybrid CLI	community	2026-01-03
🎨 SDXL Turbo	FP16	77 tok/s	78 tok/s -1%	7.3 GB	LM Studio	editorial	2026-05-10
Gemma 3 12B	Q4	74 tok/s	53 tok/s +40%	7.2 GB	LM Studio	editorial	2026-03-08
Qwen 2.5 Coder 3B	FP16	73 tok/s	79 tok/s -8%	8.3 GB	Jan	community	2026-03-27
Llama 3.2 3B	FP16	69 tok/s	76 tok/s -9%	7.5 GB	LM Studio	editorial	2026-03-16
Phi-4 Mini	FP16	69 tok/s	65 tok/s +6%	10.1 GB	Ollama	editorial	2026-04-06
Gemma 3n E4B	Q8	66 tok/s	69 tok/s -4%	9.5 GB	Xybrid CLI	community	2026-03-25
Mistral 7B	Q8	66 tok/s	61 tok/s +8%	8.7 GB	Xybrid CLI	community	2026-02-09
Qwen 3.5 4B	FP16	65 tok/s	61 tok/s +7%	8.8 GB	Xybrid CLI	community	2026-05-25
👁️ LLaVA 1.6 7B	Q8	65 tok/s	67 tok/s -3%	8.4 GB	LM Studio	editorial	2026-03-01
Gemma 3n E2B	FP16	64 tok/s	57 tok/s +12%	10.7 GB	Jan	community	2026-03-24
🧠 DeepSeek R1 Distill 7B	Q8	64 tok/s	62 tok/s +3%	8.6 GB	Xybrid CLI	community	2026-01-03
Qwen 2.5 Coder 7B	Q8	63 tok/s	62 tok/s +2%	10.3 GB	Jan	community	2026-04-19
Qwen 2.5 7B	Q8	61 tok/s	62 tok/s -2%	8.6 GB	Xybrid CLI	community	2026-03-28
🎨 FLUX.1 Schnell	Q6	58 tok/s	54 tok/s +7%	10.9 GB	Ollama	community	2026-05-12
Mistral Nemo 12B	Q6	57 tok/s	54 tok/s +6%	12.2 GB	Xybrid CLI	community	2026-01-03
Gemma 3 4B	FP16	56 tok/s	57 tok/s -2%	11.1 GB	Xybrid CLI	editorial	2026-01-19
🧠 Qwen3 8B	Q8	56 tok/s	58 tok/s -3%	10.8 GB	Xybrid CLI	community	2026-05-15
Qwen 2.5 VL 7B	Q8	56 tok/s	59 tok/s -5%	10.3 GB	Xybrid CLI	community	2026-01-20
Gemma 4 E4B	FP16	53 tok/s	56 tok/s -5%	10.6 GB	Xybrid CLI	community	2026-05-15
🧠 DeepSeek R1 Distill 8B	Q8	53 tok/s	59 tok/s -10%	9.9 GB	Jan	community	2026-05-13
Llama 3.1 8B	Q8	52 tok/s	59 tok/s -12%	9.1 GB	LM Studio	community	2026-03-14
Gemma 4 31B	Q2	52 tok/s	48 tok/s +8%	11.1 GB	Xybrid CLI	community	2026-01-05
Gemma 4 26B A4B	Q2	51 tok/s	50 tok/s +2%	11.1 GB	LM Studio	editorial	2026-04-05
LFM2.5 8B A1B	Q8	51 tok/s	57 tok/s -11%	9.9 GB	LM Studio	editorial	2026-01-06
Gemma 3 12B	Q6	49 tok/s	53 tok/s -8%	10.5 GB	Xybrid CLI	community	2026-02-27
Phi-4 Medium	Q6	49 tok/s	47 tok/s +4%	13.8 GB	Ollama	editorial	2026-03-04
Qwen 3.5 9B	Q8	47 tok/s	52 tok/s -10%	11.4 GB	Jan	community	2026-04-08
👨‍💻 Laguna XS.2	Q4	27 tok/s	27 tok/s +0%	21.5 GB	LM Studio	editorial	2026-05-11
Qwen 3.5 35B A3B	Q4	23 tok/s	25 tok/s -8%	25.6 GB	LM Studio	editorial	2026-05-23
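
The percentage shown next to each estimate appears to be the relative difference between the measured and the estimated throughput. The sketch below assumes that reading; the function names `deviation_pct` and `format_deviation` are hypothetical and are not part of any benchmark tool listed here.

```python
def deviation_pct(measured: float, estimated: float) -> float:
    """Relative difference of measured vs. estimated throughput, in percent."""
    if estimated <= 0:
        raise ValueError("estimated throughput must be positive")
    return (measured - estimated) / estimated * 100.0


def format_deviation(measured: float, estimated: float) -> str:
    """Render the deviation as a signed whole percentage, e.g. '+110%'."""
    # Assumption: the table rounds to a whole percent and always shows a sign.
    return f"{deviation_pct(measured, estimated):+.0f}%"


if __name__ == "__main__":
    # Rows taken from the table: (model, measured tok/s, estimated tok/s)
    rows = [
        ("Mistral 7B Q4", 128, 61),
        ("Gemma 3 12B Q4", 74, 53),
        ("Llama 3.1 8B Q8", 52, 59),
    ]
    for model, measured, estimated in rows:
        print(f"{model}: {format_deviation(measured, estimated)}")
```

For these three rows the result matches the table (+110%, +40%, -12%). The table's percentages may have been computed from unrounded throughput values, so recomputing them from the rounded figures can differ by a point on some rows.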

Gaming PC (RTX 4070) 32GB