Ollama
Run Llama, Mistral, Gemma, and 100+ models locally with one CLI command.
Add Ollama to your hut →Ollama is the simplest way to run large language models locally — one command (`ollama run llama3`) downloads a model and starts a conversation. It supports Llama, Mistral, Gemma, Phi, Qwen, DeepSeek, and 100+ other models, choosing the best quantisation for your hardware automatically. It also runs a local server with an OpenAI-compatible API, so any tool built for the OpenAI API works with local models by changing just the base URL. No account, no API key, no data leaving your machine.
Completely free and open-source. Most often compared to LM Studio — Ollama's edge is the CLI-first workflow and the local API server that integrates with developer tools; LM Studio's is the beginner-friendly desktop GUI for users who aren't comfortable with the terminal.
| Made by | Ollama |
|---|---|
| Pricing | Free and open-source |
| Best for | Running LLMs locally, privacy-first AI, local OpenAI-compatible API, developer use |
Alternatives to Ollama
- LM Studio
Desktop GUI for downloading and running local LLMs — beginner-friendly, GPU-optimised.
- Jan
Offline-first desktop AI app — runs local models with OpenAI-compatible API on localhost.
- AnythingLLM
All-in-one private AI workspace — chat with documents, multi-user, local or cloud models.
- Civitai (local runner)
AUTOMATIC1111's Stable Diffusion web UI — the standard local interface for running open image models.
- GPT4All
Nomic's desktop app to run open LLMs privately on your machine.
- Open WebUI
Self-hosted, feature-rich web UI for Ollama and local models.