ollama/README.md

36 KiB

  ollama

Ollama

Get up and running with large language models.

macOS

Download

Windows

Download

Linux

curl -fsSL https://ollama.com/install.sh | sh

Manual install instructions

Docker

The official Ollama Docker image ollama/ollama is available on Docker Hub.

Libraries

Community

Quickstart

To run and chat with Gemma 3:

ollama run gemma3

Models

Model Parameters Size Download
Gemma 3 1B 815MB ollama run gemma3:1b
Gemma 3 4B 3.3GB ollama run gemma3
DeepSeek-R1 7B 4.7GB ollama run deepseek-r1
gpt-oss 20B 14GB ollama run gpt-oss

See a full list of models on ollama.com

CLI Reference

Download a model

ollama pull gemma3

Remove a model

ollama rm gemma3

Create a model

ollama create is used to create a model from a Modelfile.

ollama create mymodel -f ./Modelfile

Show model information

ollama show gemma3

Copy a model

ollama cp gemma3 my-model

List models on your computer

ollama list

Multiline input

For multiline input, you can wrap text with """:

>>> """Hello,
... world!
... """
I'm a basic program that prints the famous "Hello, world!" message to the console.

Multimodal models

ollama run gemma3 "What's in this image? /Users/jmorgan/Desktop/smile.png"

Pass the prompt as an argument

ollama run gemma3 "Summarize this file: $(cat README.md)"

List which models are currently loaded

ollama ps

Stop a model which is currently running

ollama stop gemma3

Start Ollama

To run Ollama's server, use:

ollama serve

Building

See the development guide

REST API

Ollama has a REST API for running and managing models.

Generate a response

curl http://localhost:11434/api/generate -d '{
  "model": "gemma3",
  "prompt":"Why is the sky blue?"
}'

See the API documentation for more information.

Community Integrations

Web & Desktop

  • Open WebUI
  • SwiftChat (macOS with ReactNative)
  • Enchanted (macOS native)
  • Hollama
  • Lollms-Webui
  • LibreChat
  • Bionic GPT
  • HTML UI
  • Saddle
  • TagSpaces (A platform for file-based apps, utilizing Ollama for the generation of tags and descriptions)
  • Chatbot UI
  • Chatbot UI v2
  • Typescript UI
  • Minimalistic React UI for Ollama Models
  • Ollamac
  • big-AGI
  • Cheshire Cat assistant framework
  • Amica
  • chatd
  • Ollama-SwiftUI
  • Dify.AI
  • MindMac
  • NextJS Web Interface for Ollama
  • Msty
  • Chatbox
  • WinForm Ollama Copilot
  • NextChat with Get Started Doc
  • Alpaca WebUI
  • OllamaGUI
  • OpenAOE
  • Odin Runes
  • LLM-X (Progressive Web App)
  • AnythingLLM (Docker + MacOs/Windows/Linux native app)
  • Ollama Basic Chat: Uses HyperDiv Reactive UI
  • Ollama-chats RPG
  • IntelliBar (AI-powered assistant for macOS)
  • Jirapt (Jira Integration to generate issues, tasks, epics)
  • ojira (Jira chrome plugin to easily generate descriptions for tasks)
  • QA-Pilot (Interactive chat tool that can leverage Ollama models for rapid understanding and navigation of GitHub code repositories)
  • ChatOllama (Open Source Chatbot based on Ollama with Knowledge Bases)
  • CRAG Ollama Chat (Simple Web Search with Corrective RAG)
  • RAGFlow (Open-source Retrieval-Augmented Generation engine based on deep document understanding)
  • StreamDeploy (LLM Application Scaffold)
  • chat (chat web app for teams)
  • Lobe Chat with Integrating Doc
  • Ollama RAG Chatbot (Local Chat with multiple PDFs using Ollama and RAG)
  • BrainSoup (Flexible native client with RAG & multi-agent automation)
  • macai (macOS client for Ollama, ChatGPT, and other compatible API back-ends)
  • RWKV-Runner (RWKV offline LLM deployment tool, also usable as a client for ChatGPT and Ollama)
  • Ollama Grid Search (app to evaluate and compare models)
  • Olpaka (User-friendly Flutter Web App for Ollama)
  • Casibase (An open source AI knowledge base and dialogue system combining the latest RAG, SSO, ollama support, and multiple large language models.)
  • OllamaSpring (Ollama Client for macOS)
  • LLocal.in (Easy to use Electron Desktop Client for Ollama)
  • Shinkai Desktop (Two click install Local AI using Ollama + Files + RAG)
  • AiLama (A Discord User App that allows you to interact with Ollama anywhere in Discord)
  • Ollama with Google Mesop (Mesop Chat Client implementation with Ollama)
  • R2R (Open-source RAG engine)
  • Ollama-Kis (A simple easy-to-use GUI with sample custom LLM for Drivers Education)
  • OpenGPA (Open-source offline-first Enterprise Agentic Application)
  • Painting Droid (Painting app with AI integrations)
  • Kerlig AI (AI writing assistant for macOS)
  • AI Studio
  • Sidellama (browser-based LLM client)
  • LLMStack (No-code multi-agent framework to build LLM agents and workflows)
  • BoltAI for Mac (AI Chat Client for Mac)
  • Harbor (Containerized LLM Toolkit with Ollama as default backend)
  • PyGPT (AI desktop assistant for Linux, Windows, and Mac)
  • Alpaca (An Ollama client application for Linux and macOS made with GTK4 and Adwaita)
  • AutoGPT (AutoGPT Ollama integration)
  • Go-CREW (Powerful Offline RAG in Golang)
  • PartCAD (CAD model generation with OpenSCAD and CadQuery)
  • Ollama4j Web UI - Java-based Web UI for Ollama built with Vaadin, Spring Boot, and Ollama4j
  • PyOllaMx - macOS application capable of chatting with both Ollama and Apple MLX models.
  • Cline - Formerly known as Claude Dev is a VSCode extension for multi-file/whole-repo coding
  • Cherry Studio (Desktop client with Ollama support)
  • ConfiChat (Lightweight, standalone, multi-platform, and privacy-focused LLM chat interface with optional encryption)
  • Archyve (RAG-enabling document library)
  • crewAI with Mesop (Mesop Web Interface to run crewAI with Ollama)
  • Tkinter-based client (Python tkinter-based Client for Ollama)
  • LLMChat (Privacy focused, 100% local, intuitive all-in-one chat interface)
  • Local Multimodal AI Chat (Ollama-based LLM Chat with support for multiple features, including PDF RAG, voice chat, image-based interactions, and integration with OpenAI.)
  • ARGO (Locally download and run Ollama and Huggingface models with RAG and deep research on Mac/Windows/Linux)
  • OrionChat - OrionChat is a web interface for chatting with different AI providers
  • G1 (Prototype of using prompting strategies to improve the LLM's reasoning through o1-like reasoning chains.)
  • Web management (Web management page)
  • Promptery (desktop client for Ollama.)
  • Ollama App (Modern and easy-to-use multi-platform client for Ollama)
  • chat-ollama (a React Native client for Ollama)
  • SpaceLlama (Firefox and Chrome extension to quickly summarize web pages with ollama in a sidebar)
  • YouLama (Webapp to quickly summarize any YouTube video, supporting Invidious as well)
  • DualMind (Experimental app allowing two models to talk to each other in the terminal or in a web interface)
  • ollamarama-matrix (Ollama chatbot for the Matrix chat protocol)
  • ollama-chat-app (Flutter-based chat app)
  • Perfect Memory AI (Productivity AI assists personalized by what you have seen on your screen, heard, and said in the meetings)
  • Hexabot (A conversational AI builder)
  • Reddit Rate (Search and Rate Reddit topics with a weighted summation)
  • OpenTalkGpt (Chrome Extension to manage open-source models supported by Ollama, create custom models, and chat with models from a user-friendly UI)
  • VT (A minimal multimodal AI chat app, with dynamic conversation routing. Supports local models via Ollama)
  • Nosia (Easy to install and use RAG platform based on Ollama)
  • Witsy (An AI Desktop application available for Mac/Windows/Linux)
  • Abbey (A configurable AI interface server with notebooks, document storage, and YouTube support)
  • Minima (RAG with on-premises or fully local workflow)
  • aidful-ollama-model-delete (User interface for simplified model cleanup)
  • Perplexica (An AI-powered search engine & an open-source alternative to Perplexity AI)
  • Ollama Chat WebUI for Docker (Support for local docker deployment, lightweight ollama webui)
  • AI Toolkit for Visual Studio Code (Microsoft-official VSCode extension to chat, test, evaluate models with Ollama support, and use them in your AI applications.)
  • MinimalNextOllamaChat (Minimal Web UI for Chat and Model Control)
  • Chipper AI interface for tinkerers (Ollama, Haystack RAG, Python)
  • ChibiChat (Kotlin-based Android app to chat with Ollama and Koboldcpp API endpoints)
  • LocalLLM (Minimal Web-App to run ollama models on it with a GUI)
  • Ollamazing (Web extension to run Ollama models)
  • OpenDeepResearcher-via-searxng (A Deep Research equivalent endpoint with Ollama support for running locally)
  • AntSK (Out-of-the-box & Adaptable RAG Chatbot)
  • MaxKB (Ready-to-use & flexible RAG Chatbot)
  • yla (Web interface to freely interact with your customized models)
  • LangBot (LLM-based instant messaging bots platform, with Agents, RAG features, supports multiple platforms)
  • 1Panel (Web-based Linux Server Management Tool)
  • AstrBot (User-friendly LLM-based multi-platform chatbot with a WebUI, supporting RAG, LLM agents, and plugins integration)
  • Reins (Easily tweak parameters, customize system prompts per chat, and enhance your AI experiments with reasoning model support.)
  • Flufy (A beautiful chat interface for interacting with Ollama's API. Built with React, TypeScript, and Material-UI.)
  • Ellama (Friendly native app to chat with an Ollama instance)
  • screenpipe Build agents powered by your screen history
  • Ollamb (Simple yet rich in features, cross-platform built with Flutter and designed for Ollama. Try the web demo.)
  • Writeopia (Text editor with integration with Ollama)
  • AppFlowy (AI collaborative workspace with Ollama, cross-platform and self-hostable)
  • Lumina (A lightweight, minimal React.js frontend for interacting with Ollama servers)
  • Tiny Notepad (A lightweight, notepad-like interface to chat with ollama available on PyPI)
  • macLlama (macOS native) (A native macOS GUI application for interacting with Ollama models, featuring a chat interface.)
  • GPTranslate (A fast and lightweight, AI powered desktop translation application written with Rust and Tauri. Features real-time translation with OpenAI/Azure/Ollama.)
  • ollama launcher (A launcher for Ollama, aiming to provide users with convenient functions such as ollama server launching, management, or configuration.)
  • ai-hub (AI Hub supports multiple models via API keys and Chat support via Ollama API.)
  • Mayan EDMS (Open source document management system to organize, tag, search, and automate your files with powerful Ollama driven workflows.)
  • Serene Pub (Beginner friendly, open source AI Roleplaying App for Windows, Mac OS and Linux. Search, download and use models with Ollama all inside the app.)
  • Andes (A Visual Studio Code extension that provides a local UI interface for Ollama models)
  • Clueless (Open Source & Local Cluely: A desktop application LLM assistant to help you talk to anything on your screen using locally served Ollama models. Also undetectable to screenshare)

Cloud

Terminal

Apple Vision Pro

  • SwiftChat (Cross-platform AI chat app supporting Apple Vision Pro via "Designed for iPad")
  • Enchanted

Database

  • pgai - PostgreSQL as a vector database (Create and search embeddings from Ollama models using pgvector)
  • MindsDB (Connects Ollama models with nearly 200 data platforms and apps)
  • chromem-go with example
  • Kangaroo (AI-powered SQL client and admin tool for popular databases)

Package managers

Libraries

Mobile

  • SwiftChat (Lightning-fast Cross-platform AI chat app with native UI for Android, iOS, and iPad)
  • Enchanted
  • Maid
  • Ollama App (Modern and easy-to-use multi-platform client for Ollama)
  • ConfiChat (Lightweight, standalone, multi-platform, and privacy-focused LLM chat interface with optional encryption)
  • Ollama Android Chat (No need for Termux, start the Ollama service with one click on an Android device)
  • Reins (Easily tweak parameters, customize system prompts per chat, and enhance your AI experiments with reasoning model support.)

Extensions & Plugins

Supported backends

  • llama.cpp project founded by Georgi Gerganov.

Observability

  • Opik is an open-source platform to debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards. Opik supports native intergration to Ollama.
  • Lunary is the leading open-source LLM observability platform. It provides a variety of enterprise-grade features such as real-time analytics, prompt templates management, PII masking, and comprehensive agent tracing.
  • OpenLIT is an OpenTelemetry-native tool for monitoring Ollama Applications & GPUs using traces and metrics.
  • HoneyHive is an AI observability and evaluation platform for AI agents. Use HoneyHive to evaluate agent performance, interrogate failures, and monitor quality in production.
  • Langfuse is an open source LLM observability platform that enables teams to collaboratively monitor, evaluate and debug AI applications.
  • MLflow Tracing is an open source LLM observability tool with a convenient API to log and visualize traces, making it easy to debug and evaluate GenAI applications.