ollama/ml/backend/ggml
Jesse Gross d773b7d671 backend: API to support full precision matmul
Most tensor backends try to optimize performance by using a lower
precision for matmuls. However, some operations (such as kq) on
some models are sensitive to this and require full precision.
2025-02-13 17:09:26 -08:00
..
ggml fix: harden backend loading (#9024) 2025-02-11 15:36:53 -08:00
ggml.go backend: API to support full precision matmul 2025-02-13 17:09:26 -08:00