ollama/ml
Michael Yang d475d1f081 fix: qwen2.5vl metal argsort 2025-12-08 17:18:24 -08:00
..
backend fix: qwen2.5vl metal argsort 2025-12-08 17:18:24 -08:00
nn model: add rnj-1 inference support (#13354) 2025-12-08 16:49:17 -08:00
backend.go ggml: Enable flash attention for vision encoders 2025-12-04 15:19:06 -08:00
device.go CUDA: filter devices on secondary discovery (#13317) 2025-12-03 12:58:16 -08:00
path.go cpu: always ensure LibOllamaPath included (#12890) 2025-10-31 14:37:29 -07:00