ollama

History

Daniel Hiltgen 1c6669e64c Re-remove cuda v11 (#10694 ) * Re-remove cuda v11 Revert the revert - drop v11 support requiring drivers newer than Feb 23 This reverts commit `c6bcdc4223`. * Simplify layout With only one version of the GPU libraries, we can simplify things down somewhat. (Jetsons still require special handling) * distinct sbsa variant for linux arm64 This avoids accidentally trying to load the sbsa cuda libraries on a jetson system which results in crashes. * temporary prevent rocm+cuda mixed loading		2025-06-23 14:07:00 -07:00
..
0001-ggml-backend-malloc-and-free-using-the-same-compiler.patch	llama: update to commit de4c07f93 (#10655 )	2025-05-12 12:17:26 -07:00
0002-pretokenizer.patch	llama: update to commit de4c07f93 (#10655 )	2025-05-12 12:17:26 -07:00
0003-embeddings.patch	llama: update to commit de4c07f93 (#10655 )	2025-05-12 12:17:26 -07:00
0004-clip-unicode.patch	llama: update to commit de4c07f93 (#10655 )	2025-05-12 12:17:26 -07:00
0005-solar-pro.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0006-fix-deepseek-deseret-regex.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0007-maintain-ordering-for-rules-for-grammar.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0008-ensure-KV-cache-is-fully-defragmented.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0009-sort-devices-by-score.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0010-add-phony-target-ggml-cpu-for-all-cpu-variants.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0011-remove-amx.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0012-fix-string-arr-kv-loading.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0013-ollama-debug-tensor.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0014-add-ollama-vocab-for-grammar-support.patch	chore: update mllama to use ollama engine (#10637 )	2025-05-13 17:36:02 -07:00
0015-add-argsort-and-cuda-copy-for-i32.patch	model: add Qwen2.5-VL support (#10385 )	2025-05-13 20:58:02 -07:00
0016-graph-memory-reporting-on-failure.patch	ggml: Report graph memory for failed allocations	2025-05-22 14:38:09 -07:00
0017-ggml-Export-GPU-UUIDs.patch	Revert "Revert "ggml: Export GPU UUIDs" (#11115 )" (#11117 )	2025-06-18 07:30:49 -07:00
0018-temporary-prevent-rocm-cuda-mixed-loading.patch	Re-remove cuda v11 (#10694 )	2025-06-23 14:07:00 -07:00