ollama/ml/backend/ggml
Jesse Gross acef9b4c1b ggml: Use assigned layers when reporting loading stats
Reporting params.NumGPULayers can be misleading because it is the
requested number of layers, not the actual number that is loaded.
While the two are often the same, they can differ, for example when
the GPU backend is missing.
2025-07-11 14:21:50 -07:00
ggml              ggml: Report ordinal IDs for AMD GPUs on Windows          2025-07-09 10:35:31 -07:00
ggml.go           ggml: Use assigned layers when reporting loading stats    2025-07-11 14:21:50 -07:00
quantization.go   Move quantization to new backend (#10363)                 2025-05-06 11:20:48 -07:00
threads.go        ollama debug tensor                                       2025-03-11 14:49:19 -07:00
threads_debug.go  ollama debug tensor                                       2025-03-11 14:49:19 -07:00