ollama/ml/backend/ggml
Jesse Gross 6e68feda00
ollamarunner: Memory usage reporting
This provides granular information about the backend memory allocations
required by the runner:
 - Per backend
 - Per layer
 - Weights, cache and graph
 - Allocation status

This can be used for debugging and validating memory estimates.
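
As a rough illustration of the breakdown described above, the following is a minimal sketch and not the code in ggml.go. Every name in it (AllocStatus, Region, BackendUsage, Total, FailedBytes, Report) is hypothetical and chosen only for this example. It assumes each backend records a size and an allocation status per layer for weights and cache, plus one graph region.

```go
package main

import (
	"fmt"
	"strings"
)

// AllocStatus is a hypothetical enum for the outcome of an allocation.
type AllocStatus int

const (
	NotAttempted AllocStatus = iota
	Allocated
	Failed
)

func (s AllocStatus) String() string {
	switch s {
	case Allocated:
		return "allocated"
	case Failed:
		return "failed"
	default:
		return "not attempted"
	}
}

// Region is one requested allocation: its size in bytes and its status.
type Region struct {
	Size   uint64
	Status AllocStatus
}

// BackendUsage (hypothetical) groups what one backend was asked to allocate.
type BackendUsage struct {
	Name    string
	Weights []Region // indexed by layer
	Cache   []Region // indexed by layer
	Graph   Region
}

// regions returns every region of the backend in a flat slice.
func (b BackendUsage) regions() []Region {
	all := append([]Region{}, b.Weights...)
	all = append(all, b.Cache...)
	return append(all, b.Graph)
}

// Total returns the bytes requested, whether or not allocation succeeded.
func (b BackendUsage) Total() uint64 {
	var total uint64
	for _, r := range b.regions() {
		total += r.Size
	}
	return total
}

// FailedBytes returns the bytes belonging to regions whose allocation failed.
func (b BackendUsage) FailedBytes() uint64 {
	var failed uint64
	for _, r := range b.regions() {
		if r.Status == Failed {
			failed += r.Size
		}
	}
	return failed
}

// Report renders one line per layer, followed by a line for the graph.
func (b BackendUsage) Report() string {
	var sb strings.Builder
	fmt.Fprintf(&sb, "backend %s: %d bytes requested\n", b.Name, b.Total())
	layers := len(b.Weights)
	if len(b.Cache) > layers {
		layers = len(b.Cache)
	}
	for i := 0; i < layers; i++ {
		var w, c Region // zero value if this layer has no entry
		if i < len(b.Weights) {
			w = b.Weights[i]
		}
		if i < len(b.Cache) {
			c = b.Cache[i]
		}
		fmt.Fprintf(&sb, "  layer %d: weights %d (%s), cache %d (%s)\n",
			i, w.Size, w.Status, c.Size, c.Status)
	}
	fmt.Fprintf(&sb, "  graph: %d (%s)\n", b.Graph.Size, b.Graph.Status)
	return sb.String()
}
```

Keeping the requested size even for failed regions is a deliberate choice in this sketch: it lets a report show how much memory would have been needed, which is the kind of figure that helps when comparing against a memory estimate.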
2025-12-29 06:38:06 -06:00
ggml              ggml: Report graph memory for failed allocations   2025-12-29 06:38:06 -06:00
ggml.go           ollamarunner: Memory usage reporting               2025-12-29 06:38:06 -06:00
quantization.go   Move quantization to new backend (#10363)          2025-12-29 06:37:52 -06:00
threads.go        ollama debug tensor                                2025-03-11 14:49:19 -07:00
threads_debug.go  ollama debug tensor                                2025-03-11 14:49:19 -07:00