ollama

History

Jesse Gross d773b7d671 backend: API to support full precision matmul Most tensor backends try to optimize performance by using a lower precision for matmuls. However, some operations (such as kq) on some models are sensitive to this and require full precision.		2025-02-13 17:09:26 -08:00
..
ggml	fix: harden backend loading (#9024 )	2025-02-11 15:36:53 -08:00
ggml.go	backend: API to support full precision matmul	2025-02-13 17:09:26 -08:00