KernelAI

44 AI Models.
All on your device.

From lightning-fast tiny models to powerful vision AI. Choose the perfect model for your needs, all running privately on your iPhone.

44 Total Models
14 Providers
2 Vision Models
6 Coding Models


โญ Featured Models

FEATURED
Qwen

Qwen 3 VL 2B

Analyzes images, reads text from screenshots, and answers questions about photos in 32 languages.

Vision, Thinking
1.11 GB
FEATURED
Meta

Llama 3.2 3B

Strong reasoning and detailed explanations for complex conversations in 8 languages.

Multilingual
2.0 GB
FEATURED
Google

Gemma 3 4B

Top-tier reasoning and 140+ language support with efficient memory usage.

Recommended, Multilingual
2.49 GB
FEATURED
Mistral AI

Ministral 3B

Detailed answers and strong instruction following in multiple languages.

Multilingual
2.15 GB
FEATURED
IBM

Granite 4.0 Micro

Handles complex multi-step tasks with strong instruction following and low memory usage.

Multilingual
1.94 GB
FEATURED
Hugging Face

SmolLM3 3B

Punches above its weight with strong reasoning and instruction following under 2GB.

New
1.92 GB
FEATURED
Liquid AI

LFM2.5 1.2B

Fast and smart in a tiny package. Handles everyday questions and tasks with surprisingly good quality.

New, Multilingual, Fast
731 MB
Qwen

11 models

Qwen

Qwen 3 VL 2B

Analyzes images, reads text from screenshots, and answers questions about photos in 32 languages.

Vision, Thinking
1.11 GB
Qwen

Qwen 3 4B

Handles complex questions, multi-step instructions, and conversations in 29+ languages.

Thinking, Multilingual
2.5 GB
Qwen

Qwen 3 VL 4B

Advanced image analysis for charts, diagrams, documents, and complex visual reasoning tasks.

Vision, Thinking
2.5 GB
Qwen

Qwen 2.5 Coder 3B

Writes, explains, and refactors code in 92+ programming languages.

Coding
2.1 GB
Qwen

Qwen 3 4B Thinking

Shows step-by-step reasoning for math, logic puzzles, and complex problems.

Thinking
2.5 GB
Qwen

Qwen 2.5 3B

Handles translation, creative writing, and longer documents in 29+ languages.

Multilingual
2.1 GB
Qwen

Qwen 3 1.7B

Balanced model for reasoning, logic tasks, and multilingual conversations.

Thinking, Multilingual
1.83 GB
Qwen

Qwen 2.5 Coder 1.5B

Fast code explanations, bug fixes, and completions in 92+ programming languages.

Coding, Fast
1.12 GB
Qwen

Qwen 2.5 1.5B

Everyday conversations, writing help, and basic tasks in 29+ languages.

Multilingual
1.12 GB
Qwen

Qwen 3 0.6B

Handles quick questions, simple conversations, and basic tasks with thinking capability.

Thinking, Fast
444 MB
Qwen

Qwen 2.5 0.5B

Ultra-fast responses for very simple questions when speed matters most.

Fast
491 MB
Meta

3 models

Meta

Llama 3.2 3B

Strong reasoning and detailed explanations for complex conversations in 8 languages.

Multilingual
2.0 GB
Meta

Llama 3.2 3B Uncensored

Same as Llama 3.2 3B but answers all questions without refusals or content filters.

Uncensored, Multilingual
2.0 GB
Meta

Llama 3.2 1B

Fast and lightweight for summarization, Q&A, and basic tasks in 8 languages.

Multilingual, Fast
0.8 GB
Microsoft

2 models

Microsoft

Phi-4 Mini

Excels at math, logic, and reasoning - competes with much larger models on benchmarks.

New, Thinking
2.49 GB
Microsoft

Phi-3 Mini

Solid math and logic performance trained on high-quality textbook data.

2.2 GB
Google

5 models

Google

Gemma 3 4B

Top-tier reasoning and 140+ language support with efficient memory usage.

Recommended, Multilingual
2.49 GB
Google

Gemma 3 1B

Supports 140+ languages in under 1GB - great for less common languages.

Multilingual, Fast
806 MB
Google

Gemma 2 2B

Reliable general chat with precise instruction following.

1.71 GB
Google

CodeGemma 2B

Specialized for code completion and fill-in-the-middle tasks.

Coding
1.63 GB
Google

Gemma 3 270M

Extremely small (253MB) for very basic tasks when storage is critical.

Fast
253 MB
DeepSeek

2 models

DeepSeek

DeepSeek R1 1.5B

Shows detailed step-by-step reasoning before answering. Great for learning.

Thinking
1.12 GB
DeepSeek

DeepSeek Coder 1.3B

Fast code help in 87 languages. Lightweight but has older training data.

Coding, Fast
870 MB
Stability AI

3 models

Stability AI

Stable Code 3B

Writes and explains code in 18 languages with good context for code review.

Coding
1.70 GB
Stability AI

StableLM Zephyr 3B

Reliable general chat with consistent, helpful responses.

1.71 GB
Stability AI

StableLM 2 Zephyr 1.6B

Conversational model supporting 7 European languages under 1GB.

Multilingual, Fast
983 MB
Mistral AI

1 model

Mistral AI

Ministral 3B

Detailed answers and strong instruction following in multiple languages.

Multilingual
2.15 GB
01.AI

1 model

01.AI

Yi Coder 1.5B

Fast code completions and debugging in 52 languages under 1GB.

Coding, Fast
960 MB
NVIDIA

1 model

NVIDIA

Nemotron Mini 4B

Strong chat performance for longer conversations.

2.70 GB
IBM

3 models

IBM

Granite 4.0 Micro

Handles complex multi-step tasks with strong instruction following and low memory usage.

Multilingual
1.94 GB
IBM

Granite 4.0 1B

Reliable performance for business and professional tasks under 1GB.

Multilingual, Fast
901 MB
IBM

Granite 4.0 350M

Basic tasks with multilingual support at only 223MB.

Fast
223 MB
Hugging Face

4 models

Hugging Face

SmolLM3 3B

Punches above its weight with strong reasoning and instruction following under 2GB.

New
1.92 GB
Hugging Face

SmolLM2 1.7B

Good conversations and instruction following at just over 1GB.

1.06 GB
Hugging Face

SmolLM2 360M

Quick responses and basic reasoning at only 390MB.

Fast
390 MB
Hugging Face

SmolLM2 135M

Tiny model for quick testing. Download a featured model for better responses.

Fast
112 MB
OpenAI

1 model

OpenAI

GPT-2

Historic GPT-2 from 2019 - the model that started the modern AI wave. Text completion only.

Fast
113 MB
Liquid AI

1 model

Liquid AI

LFM2.5 1.2B

Fast and smart in a tiny package. Handles everyday questions and tasks with surprisingly good quality.

New, Multilingual, Fast
731 MB

Community

6 models

Community

Hermes 3 3B

Roleplay and creative writing with consistent character voices.

Roleplay
2.02 GB
Community

Dolphin 2.6 3B

Uncensored model with no content filters for open conversations.

Uncensored, Roleplay
1.79 GB
Community

Danube 3 4B

Balanced 4B model with fast responses for general tasks.

2.39 GB
Community

Orca Mini 3B

Shows step-by-step reasoning - trained on GPT-4 thinking traces.

1.98 GB
Community

TinyLlama 1.1B

Basic chat under 700MB - runs fast on almost any device.

Fast
670 MB
Community

Danube 3 500M

Ultra-fast responses for basic tasks with multilingual support.

Fast
550 MB

Ready to try these models?

Download KernelAI and start chatting with any of these 44 models privately on your device.
