ollama/ml/backend/ggml
Daniel Hiltgen c146a138e3
ggml: handle all streams (#13350)
Follow up from #12992

Free all streams, and keep the alloc logic aligned across streams.
2025-12-05 16:10:33 -08:00
ggml ggml: handle all streams (#13350) 2025-12-05 16:10:33 -08:00
ggml.go ggml: Enable flash attention for vision encoders 2025-12-04 15:19:06 -08:00
ggml_test.go ml: add slice operation (#12870) 2025-11-13 13:28:21 -08:00
quantization.go chore: fix some inconsistent function name in comment 2025-08-13 09:50:27 -07:00
threads.go ollama debug tensor 2025-03-11 14:49:19 -07:00
threads_debug.go ollama debug tensor 2025-03-11 14:49:19 -07:00