onmydevice
.ai
⌘
⌘
Devices
Models
Benchmarks
Apps
Replace
Blog
TinyLlama 1.1B
780 MB
Chat
2048 context
·
Requires Low+ tier device
Popular tiny model trained on 3T tokens
MIN TIER
Low
SMALLEST QUANT
Q2 · 0.4 GB
CONTEXT WINDOW
2048 tokens
PARAMETERS
1.1B
QUANTIZATION OPTIONS
Quant
File size
Quality
FP16
2.2 GB
100%
Best quality
Q8
1.2 GB
95%
Q6
0.9 GB
85%
Q5
0.8 GB
78%
Q4
0.7 GB
70%
Q3
0.5 GB
58%
Q2
0.4 GB
42%
DEVICE COMPATIBILITY
Runs on these devices (47)
💻
MacBook Air M4
16GB
macOS
FP16
·
~55 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M4 Pro
36GB
macOS
FP16
·
~124 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M1 Pro
16GB
macOS
FP16
·
~91 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M1 Pro
32GB
macOS
FP16
·
~91 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M1 Max
32GB
macOS
FP16
·
~182 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M1 Max
64GB
macOS
FP16
·
~182 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M2 Pro
16GB
macOS
FP16
·
~91 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M2 Pro
32GB
macOS
FP16
·
~91 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M2 Max
32GB
macOS
FP16
·
~182 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M2 Max
64GB
macOS
FP16
·
~182 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M3 Pro
18GB
macOS
FP16
·
~68 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M3 Pro
36GB
macOS
FP16
·
~68 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M3 Max
36GB
macOS
FP16
·
~182 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M3 Max
96GB
macOS
FP16
·
~182 tok/s
·
2.2 GB
Runs great
🖥️
Gaming PC (RTX 4070)
32GB
Windows
FP16
·
~200 tok/s
·
2.2 GB
Runs great
🖥️
Gaming PC (RTX 3060)
32GB
Windows
FP16
·
~164 tok/s
·
2.2 GB
Runs great
🖥️
Gaming PC (RTX 4080)
32GB
Windows
FP16
·
~200 tok/s
·
2.2 GB
Runs great
🖥️
Gaming PC (RTX 4090)
64GB
Windows
FP16
·
~200 tok/s
·
2.2 GB
Runs great
🤖
Atom 1
32GB
Linux
FP16
·
~93 tok/s
·
2.2 GB
Runs great
🤖
Atom 1
64GB
Linux
FP16
·
~124 tok/s
·
2.2 GB
Runs great
🤖
Atom 1
128GB
Linux
FP16
·
~124 tok/s
·
2.2 GB
Runs great
🖥️
Mac Mini M2 Pro
16GB
macOS
FP16
·
~91 tok/s
·
2.2 GB
Runs great
🖥️
Mac Mini M2 Pro
32GB
macOS
FP16
·
~91 tok/s
·
2.2 GB
Runs great
🖥️
Mac Mini M4
16GB
macOS
FP16
·
~55 tok/s
·
2.2 GB
Runs great
🖥️
Mac Mini M4
32GB
macOS
FP16
·
~55 tok/s
·
2.2 GB
Runs great
🖥️
Mac Mini M4 Pro
24GB
macOS
FP16
·
~124 tok/s
·
2.2 GB
Runs great
🖥️
Mac Mini M4 Pro
48GB
macOS
FP16
·
~124 tok/s
·
2.2 GB
Runs great
🖥️
Mac Studio M4 Max
64GB
macOS
FP16
·
~200 tok/s
·
2.2 GB
Runs great
🖥️
Mac Pro M2 Ultra
192GB
macOS
FP16
·
~200 tok/s
·
2.2 GB
Runs great
💻
Snapdragon X Elite Laptop
16GB
Windows
FP16
·
~62 tok/s
·
2.2 GB
Runs great
💻
MacBook Air M3
16GB
macOS
FP16
·
~45 tok/s
·
2.2 GB
Runs great
💻
MacBook Air M2
8GB
macOS
FP16
·
~45 tok/s
·
2.2 GB
Runs great
🖥️
Mac Mini M2
8GB
macOS
FP16
·
~45 tok/s
·
2.2 GB
Runs great
🎮
Steam Deck OLED
16GB
Linux
FP16
·
~40 tok/s
·
2.2 GB
Runs great
📱
iPad Pro M4
16GB
iOS
FP16
·
~38 tok/s
·
2.2 GB
Runs great
💻
MacBook Air M1
8GB
macOS
FP16
·
~31 tok/s
·
2.2 GB
Runs great
💻
MacBook Air M1
16GB
macOS
FP16
·
~31 tok/s
·
2.2 GB
Runs great
💻
MacBook Pro M1
16GB
macOS
FP16
·
~31 tok/s
·
2.2 GB
Runs great
🖥️
Mac Mini M1
8GB
macOS
FP16
·
~31 tok/s
·
2.2 GB
Runs great
🖥️
Mac Mini M1
16GB
macOS
FP16
·
~31 tok/s
·
2.2 GB
Runs great
📱
Galaxy S25 Ultra
12GB
Android
FP16
·
~24 tok/s
·
2.2 GB
Runs great
📱
OnePlus 13
16GB
Android
FP16
·
~24 tok/s
·
2.2 GB
Runs great
📱
Pixel 9 Pro
16GB
Android
FP16
·
~22 tok/s
·
2.2 GB
Runs great
🍓
Raspberry Pi 5
8GB
Linux
FP16
·
~15 tok/s
·
2.2 GB
Runs great
📱
iPhone 16 Pro
8GB
iOS
FP16
·
~22 tok/s
·
2.2 GB
Runs well
📱
Galaxy S24
8GB
Android
FP16
·
~19 tok/s
·
2.2 GB
Runs well
📱
iPhone 15
6GB
iOS
FP16
·
~13 tok/s
·
2.2 GB
Tight fit
RUN WITH THESE APPS
Beginner
Free
LM Studio
Beautiful desktop app for running local AI models
Tinkerer
Free
Ollama
Lightweight CLI tool for running models in the background
Beginner
Free
Jan
Open-source ChatGPT-like desktop app
Beginner
Free
LocallyAI
Run AI models privately on your iPhone and iPad
HOW TO RUN
Step-by-step run guides coming soon — check the apps above to get started today.