ollama

Commit Graph

Author	SHA1	Message	Date
ParthSareen	c1a6aa8be5	docs: add JavaScript example for tool calling	2026-01-05 19:19:34 -08:00
ParthSareen	515c46c176	docs: add Claude Code integration guide	2026-01-05 19:19:34 -08:00
ParthSareen	ed1e17bb35	anthropic: fix error handling and update docs - Add proper error handling for JSON marshal in StreamConverter to prevent corrupted streams when tool arguments cannot be serialized - Add tests for unmarshalable arguments and mixed validity scenarios - Fix documentation typo and update recommended models to qwen3-coder	2026-01-05 19:19:34 -08:00
ParthSareen	6229df5b90	anthropic: add unit and integration tests - Unit tests for transformation functions (FromMessagesRequest, ToMessagesResponse) - Unit tests for error handling and edge cases - Middleware integration tests with httptest - Fix lint issues (gofmt) - Fix unused struct fields in StreamConverter - Add fallback for crypto/rand errors	2026-01-05 19:19:34 -08:00
ParthSareen	f760ae1fdd	api: add Anthropic Messages API compatibility layer Add middleware to support the Anthropic Messages API format at /v1/messages. This enables tools like Claude Code to work with Ollama models through the Anthropic API interface. Features: - Request/response transformation between Anthropic and internal formats - Streaming support with SSE events (message_start, content_block_delta, etc.) - Tool calling support (tool_use and tool_result content blocks) - Thinking/extended thinking block support - Image content block support (base64) - System prompt handling - Multi-turn conversation support - Proper stop_reason mapping (end_turn, max_tokens, tool_use) - Error responses in Anthropic format New files: - anthropic/anthropic.go: Types and transformation functions - middleware/anthropic.go: Request/response middleware	2026-01-05 19:19:34 -08:00
Harry V. Kiselev	d087e46bd1	docs/capabilities/vision: fix curl related code snippet (#13615 )	2026-01-03 17:27:46 -05:00
Nhan Nguyen	e1bdc23dd2	docs: fix tool name mismatch and trailing commas in api.md example (#13559 ) The tool calling example used "get_temperature" for tool_calls but defined the tool as "get_weather". Also removed trailing commas that made the JSON invalid. Fixes #13031	2026-01-03 02:14:53 -05:00
lif	f5f74e12c1	docs: add version note for /v1/responses API (#13596 ) Signed-off-by: majiayu000 <1835304752@qq.com>	2026-01-03 01:58:20 -05:00
Vallabh Mahajan	18fdcc94e5	docs: fix broken .md links and render issues (#13550 )	2025-12-23 12:44:55 -05:00
Jeffrey Morgan	8852220f59	add REQUIRES command to Modelfile (#13361 )	2025-12-18 13:21:29 -08:00
Devon Rifkin	9f7822851c	docs: add docs for v1/responses and rework openai compat section (#13416 ) * docs: add docs for v1/responses and rework openai compat section I reworked the examples to be separated by topic and to be fully runnable (i.e., they now log output instead of just suggesting how a call might be made). We now use `<CodeGroup>`s so that each example has a dropdown on the docs site for users to choose, which makes the examples a lot more digestible (since you only see approx 1/3 of the code you used to). I also added a new tool to extract code examples into files so that it's easier to actually run them and check that they work. ## Example ```shell go run docs/tools/extract-examples/main.go docs/api/openai-compatibility.mdx ``` Output: ``` Extracting code examples to: /var/folders/vq/wfm2g6k917d3ldzpjdxc8ph00000gn/T/mdx-examples-3271754368 - 01_basic.py - 01_basic.js - 01_basic.sh - 02_responses.py - 02_responses.js - 02_responses.sh - 03_vision.py - 03_vision.js - 03_vision.sh Extracted 9 file(s) to /var/folders/vq/wfm2g6k917d3ldzpjdxc8ph00000gn/T/mdx-examples-3271754368 To run examples: cd /var/folders/vq/wfm2g6k917d3ldzpjdxc8ph00000gn/T/mdx-examples-3271754368 npm install # for JS examples then run individual files with `node file.js`, `python file.py`, `bash file.sh` ``` In the future we should consider actually running the examples in CI and having some sort of acceptance test so we can automatically detect when our examples break. So this is just a start in that direction. * Update docs/api/openai-compatibility.mdx Co-authored-by: Parth Sareen <parth.sareen@ollama.com> * Update docs/api/openai-compatibility.mdx Co-authored-by: Parth Sareen <parth.sareen@ollama.com> --------- Co-authored-by: Parth Sareen <parth.sareen@ollama.com>	2025-12-11 17:39:40 -08:00
Alexander Gusak	93d45d7a04	docs: fix link to modelfile.mdx (#13220 )	2025-12-11 16:14:45 -08:00
Nathan Hook	cc9555aff0	Update user message format for temperature query (#13256 )	2025-12-02 15:08:39 -08:00
hello_world	20aee96706	Add Vulkan GPU support instructions in development.md (#13265 ) Added Vulkan SDK installation instructions and environment variable setup for building with Vulkan support.	2025-12-02 13:37:32 -08:00
Ondrej Kokes	0c2489605d	docs: fix output formatting in faq.mdx (#13231 ) There were a few Markdown typos in one FAQ answer. It now renders as a proper ascii table.	2025-11-28 19:19:21 -05:00
EntropyYue	8b1b89a984	docs: remove deprecated parameters (#13237 )	2025-11-26 11:03:09 +09:00
Lhiam Andrei Lingco	8ed1adf3db	docs: fix typo in vscode.mdx (#13116 )	2025-11-18 13:18:42 -08:00
Jeffrey Morgan	aa676b313f	docs: link to ollama.com instead of hardcoding list of cloud models (#13110 )	2025-11-16 20:56:09 -08:00
Parth Sareen	ce29f695b4	docs: add logprobs to openapi (#13090 )	2025-11-14 14:14:58 -08:00
nicole pardal	482bec824f	embeddings: added cli command to embedding docs (#12993 )	2025-11-13 13:24:13 -08:00
Kowyo	684a9a8c5a	docs: fix typo (VSCode -> VS Code) (#13072 )	2025-11-12 20:49:33 -08:00
Daniel Hiltgen	6286d9a3a5	Enable Vulkan with a temporary opt-in setting (#12931 ) * docs: vulkan information * Revert "CI: Set up temporary opt-out Vulkan support (#12614)" This reverts commit `8b6e5baee7`. * vulkan: temporary opt-in for Vulkan support Revert this once we're ready to enable by default. * win: add vulkan CI build	2025-11-12 08:40:38 -08:00
Jeffrey Morgan	cb1cb06478	docs: rename api-reference.md back to api.md since redirect stopped working (#13056 )	2025-11-11 15:53:06 -08:00
Jeffrey Morgan	2d5e066c8c	docs: fix openapi.yaml warnings, rename api.md to api-reference.md (#12904 )	2025-11-11 15:39:35 -08:00
Bruce MacDonald	15968714bd	docs/openapi: document that delete and copy responses are empty (#13055 ) Some route endpoints return an empty response with a 200 OK. These should be documented in the OpenAPI doc. Note that the previous deletion response was not correct.	2025-11-11 15:07:21 -08:00
Sheikh	6df4208836	docs: fix metal gpu section header (#13045 )	2025-11-10 21:51:22 -08:00
Parth Sareen	755ac3b069	docs: update n8n URL for Ollama (#12994 )	2025-11-07 20:07:26 -08:00
Daniel Hiltgen	60b8973559	doc: re-add login autostart faq and GPU updates (#12975 ) * doc: re-add login autostart faq This appears to have been accidentally dropped during the doc migration. * docs: GPU updates lost on the doc update * review comments: improve windows login disable instructions	2025-11-07 11:21:44 -08:00
Tomoya Fujita	d2ef679d42	docs: fix 404 link to modelfile documentation (#12996 )	2025-11-07 10:06:46 -08:00
nicole pardal	1ca608bcd1	embeddings: added embedding command for cl (#12795 ) Co-authored-by: A-Akhil <akhilrahul70@gmail.com> This PR introduces a new ollama embed command that allows users to generate embeddings directly from the command line. Added ollama embed MODEL [TEXT...] command for generating text embeddings Supports both direct text arguments and stdin piping for scripted workflows Outputs embeddings as JSON arrays (one per line)	2025-11-05 11:58:03 -08:00
Jeffrey Morgan	93e45f0f0d	docs: temporarily restore api.md and cleanup docs paths (#12818 )	2025-10-28 23:25:48 -07:00
Jeffrey Morgan	a342160803	docs: fix root api documentation page (#12813 )	2025-10-28 19:17:54 -07:00
Jeffrey Morgan	f6c29409dc	docs: add new cloud model + fix openai redirect (#12812 )	2025-10-28 19:09:07 -07:00
Parth Sareen	d828517e78	docs: update readme and links (#12809 )	2025-10-28 16:20:02 -07:00
Parth Sareen	3d99d9779a	docs: add docs for docs.ollama.com (#12805 )	2025-10-28 13:18:48 -07:00
Parth Sareen	6d02a43a75	docs: rename to mdx to setup docs site (#12804 )	2025-10-28 13:04:31 -07:00
Parth Sareen	5483497d7a	Revert "docs: add reference to docs.ollama.com (#12800 )" (#12803 ) This reverts commit `934dd9e196`.	2025-10-28 12:52:49 -07:00
Parth Sareen	934dd9e196	docs: add reference to docs.ollama.com (#12800 )	2025-10-28 12:44:02 -07:00
Daniel Hiltgen	270679932f	cuda: tidy up CC settings (#12668 ) 8.7 is Jetpack only, so no need on x86 builds 10.3 covers [G]B300	2025-10-16 16:39:30 -07:00
Daniel Hiltgen	70d9e363e1	doc: remove AMD EOL GPUs (#12567 )	2025-10-10 17:16:29 -07:00
Daniel Hiltgen	303be9304c	docs: improve accuracy of LLM library docs (#12530 )	2025-10-07 16:21:07 -07:00
Daniel Hiltgen	bd15eba4e4	Bring back escape valve for llm libraries and fix Jetpack6 crash (#12529 ) * Bring back escape valve for llm libraries If the new discovery logic picks the wrong library, this gives users the ability to force a specific one using the same pattern as before. This can also potentially speed up bootstrap discovery if one of the libraries takes a long time to load and ultimately bind to no devices. For example unsupported AMD iGPUS can sometimes take a while to discover and rule out. * Bypass extra discovery on jetpack systems On at least Jetpack6, cuda_v12 appears to expose the iGPU, but crashes later on in cublasInit so if we detect a Jetpack, short-circuit and use that variant.	2025-10-07 16:06:14 -07:00
Daniel Hiltgen	c68f367ef6	Update GGML to b6646 (#12245 ) Notable EOLs with this change: - MacOS v12 and v13 are no longer supported (v14+ required) - AMD gfx900 and gfx906 are no longer supported	2025-10-02 14:47:10 -07:00
Daniel Hiltgen	bc8909fb38	Use runners for GPU discovery (#12090 ) This revamps how we discover GPUs in the system by leveraging the Ollama runner. This should eliminate inconsistency between our GPU discovery and the runners capabilities at runtime, particularly for cases where we try to filter out unsupported GPUs. Now the runner does that implicitly based on the actual device list. In some cases free VRAM reporting can be unreliable which can leaad to scheduling mistakes, so this also includes a patch to leverage more reliable VRAM reporting libraries if available. Automatic workarounds have been removed as only one GPU leveraged this, which is now documented. This GPU will soon fall off the support matrix with the next ROCm bump. Additional cleanup of the scheduler and discovery packages can be done in the future once we have switched on the new memory management code, and removed support for the llama runner.	2025-10-01 15:12:32 -07:00
jmorganca	af060eb250	docs: update cloud.md for cloud models	2025-09-22 13:09:17 -03:00
jmorganca	ae5c33008e	docs: move turbo.md to cloud.md	2025-09-22 13:09:17 -03:00
Daniel Hiltgen	93c64ea1b1	doc: show how to clear the cgo cache (#12298 )	2025-09-15 15:45:35 -07:00
Michael Yang	feb18cd710	feat: add dimensions field to embed requests (#12242 ) * feat: add field to truncate embeddings * add openai embeddings for dimensions	2025-09-11 10:36:10 -07:00
Daniel Hiltgen	17a023f34b	Add v12 + v13 cuda support (#12000 ) * Add support for upcoming NVIDIA Jetsons The latest Jetsons with JetPack 7 are moving to an SBSA compatible model and will not require building a JetPack specific variant. * cuda: bring back dual versions This adds back dual CUDA versions for our releases, with v11 and v13 to cover a broad set of GPUs and driver versions. * win: break up native builds in build_windows.ps1 * v11 build working on windows and linux * switch to cuda v12.8 not JIT * Set CUDA compression to size * enhance manual install linux docs	2025-09-10 12:05:18 -07:00
Daniel Hiltgen	950d33aa30	docs: show how to debug nvidia init failures (#12216 ) This debug setting can help troubleshoot obscure initialization failures.	2025-09-08 11:39:00 -07:00

1 2 3 4 5 ...

519 Commits