44 open-source models, on your device.
Filter by provider, use case, or size. Every model runs locally. No cloud, no upload, no compromise.
Qwen · 11

Qwen 3 VL 2B · Featured
Analyzes images, reads text from screenshots, and answers questions about photos in 32 languages.
1.11 GB · Vision · Thinking

Qwen 3 4B
Handles complex questions, multi-step instructions, and conversations in 29+ languages.
2.5 GB · Thinking · Multilingual

Qwen 3 VL 4B
Advanced image analysis for charts, diagrams, documents, and complex visual reasoning tasks.
2.5 GB · Vision · Thinking

Qwen 2.5 Coder 3B
Writes, explains, and refactors code in 92+ programming languages.
2.1 GB · Coding

Qwen 3 4B Thinking
Shows step-by-step reasoning for math, logic puzzles, and complex problems.
2.5 GB · Thinking

Qwen 2.5 3B
Handles translation, creative writing, and longer documents in 29+ languages.
2.1 GB · Multilingual

Qwen 3 1.7B
Balanced model for reasoning, logic tasks, and multilingual conversations.
1.83 GB · Thinking · Multilingual

Qwen 2.5 Coder 1.5B
Fast code explanations, bug fixes, and completions in 92+ programming languages.
1.12 GB · Coding · Fast

Qwen 2.5 1.5B
Everyday conversations, writing help, and basic tasks in 29+ languages.
1.12 GB · Multilingual

Qwen 3 0.6B
Handles quick questions, simple conversations, and basic tasks with thinking capability.
444 MB · Thinking · Fast

Qwen 2.5 0.5B
Ultra-fast responses for very simple questions when speed matters most.
491 MB · Fast
Meta · 3

Llama 3.2 3B · Featured
Strong reasoning and detailed explanations for complex conversations in 8 languages.
2.0 GB · Multilingual

Llama 3.2 3B Uncensored
Same as Llama 3.2 3B but answers all questions without refusals or content filters.
2.0 GB · Uncensored · Multilingual

Llama 3.2 1B
Fast and lightweight for summarization, Q&A, and basic tasks in 8 languages.
0.8 GB · Multilingual · Fast
Microsoft · 2

Phi-4 Mini
Excels at math, logic, and reasoning - competes with much larger models on benchmarks.
2.49 GB · New · Thinking

Phi-3 Mini
Solid math and logic performance trained on high-quality textbook data.
2.2 GB
Google · 5

Gemma 3 4B · Featured
Top-tier reasoning and 140+ language support with efficient memory usage.
2.49 GB · Recommended · Multilingual

Gemma 3 1B
Supports 140+ languages in under 1 GB - great for less common languages.
806 MB · Multilingual · Fast

Gemma 2 2B
Reliable general chat with precise instruction following.
1.71 GB

CodeGemma 2B
Specialized for code completion and fill-in-the-middle tasks.
1.63 GB · Coding

Gemma 3 270M
Extremely small (253 MB) for very basic tasks when storage is critical.
253 MB · Fast
DeepSeek · 2

DeepSeek R1 1.5B
Shows detailed step-by-step reasoning before answering. Great for learning.
1.12 GB · Thinking

DeepSeek Coder 1.3B
Fast code help in 87 languages. Lightweight but has older training data.
870 MB · Coding · Fast
Stability AI · 3

Stable Code 3B
Writes and explains code in 18 languages with good context for code review.
1.70 GB · Coding

StableLM Zephyr 3B
Reliable general chat with consistent, helpful responses.
1.71 GB

StableLM 2 Zephyr 1.6B
Conversational model supporting 7 European languages under 1 GB.
983 MB · Multilingual · Fast
Mistral AI · 1

Ministral 3B · Featured
Detailed answers and strong instruction following in multiple languages.
2.15 GB · Multilingual
01.AI · 1

Yi Coder 1.5B
Fast code completions and debugging in 52 languages under 1 GB.
960 MB · Coding · Fast
NVIDIA · 1

Nemotron Mini 4B
Strong chat performance for longer conversations.
2.70 GB
IBM · 3

Granite 4.0 Micro · Featured
Handles complex multi-step tasks with strong instruction following and low memory usage.
1.94 GB · Multilingual

Granite 4.0 1B
Reliable performance for business and professional tasks under 1 GB.
901 MB · Multilingual · Fast

Granite 4.0 350M
Basic tasks with multilingual support at only 223 MB.
223 MB · Fast
Hugging Face · 4

SmolLM3 3B · Featured
Punches above its weight with strong reasoning and instruction following under 2 GB.
1.92 GB · New

SmolLM2 1.7B
Good conversations and instruction following at just over 1 GB.
1.06 GB

SmolLM2 360M
Quick responses and basic reasoning at only 390 MB.
390 MB · Fast

SmolLM2 135M
Tiny model for quick testing. Download a featured model for better responses.
112 MB · Fast
OpenAI · 1

GPT-2
Historic GPT-2 from 2019 - the model that started the modern AI wave. Text completion only.
113 MB · Fast
Liquid AI · 1

LFM2.5 1.2B · Featured
Fast and smart in a tiny package. Handles everyday questions and tasks with surprisingly good quality.
731 MB · New · Multilingual
Community · 6

Hermes 3 3B
Roleplay and creative writing with consistent character voices.
2.02 GB · Roleplay

Dolphin 2.6 3B
Uncensored model with no content filters for open conversations.
1.79 GB · Uncensored · Roleplay

Danube 3 4B
Balanced 4B model with fast responses for general tasks.
2.39 GB

Orca Mini 3B
Shows step-by-step reasoning - trained on GPT-4 thinking traces.
1.98 GB

TinyLlama 1.1B
Basic chat under 700 MB - runs fast on almost any device.
670 MB · Fast

Danube 3 500M
Ultra-fast responses for basic tasks with multilingual support.
550 MB · Fast