Local & on-device AI

Ollama

Name: Ollama
Author: Ollama

Run Llama, Mistral, Gemma, and 100+ models locally with one CLI command.

Ollama is the simplest way to run large language models locally — one command (`ollama run llama3`) downloads a model and starts a conversation. It supports Llama, Mistral, Gemma, Phi, Qwen, DeepSeek, and 100+ other models, choosing the best quantisation for your hardware automatically. It also runs a local server with an OpenAI-compatible API, so any tool built for the OpenAI API works with local models by changing just the base URL. No account, no API key, no data leaving your machine.

Completely free and open-source. Most often compared to LM Studio — Ollama's edge is the CLI-first workflow and the local API server that integrates with developer tools; LM Studio's is the beginner-friendly desktop GUI for users who aren't comfortable with the terminal.

Made by	Ollama
Pricing	Free and open-source
Best for	Running LLMs locally, privacy-first AI, local OpenAI-compatible API, developer use

local
cli
open-source
llm-runner
privacy

Alternatives to Ollama

LM Studio
Desktop GUI for downloading and running local LLMs — beginner-friendly, GPU-optimised.
Jan
Offline-first desktop AI app — runs local models with OpenAI-compatible API on localhost.
AnythingLLM
All-in-one private AI workspace — chat with documents, multi-user, local or cloud models.
Civitai (local runner)
AUTOMATIC1111's Stable Diffusion web UI — the standard local interface for running open image models.
GPT4All
Nomic's desktop app to run open LLMs privately on your machine.
Open WebUI
Self-hosted, feature-rich web UI for Ollama and local models.

See all 8 Ollama alternatives →