ollama

History

Jesse Gross 6e68feda00 ollamarunner: Memory usage reporting This provides granular information about the backend memory allocations required by the runner: - Per backend - Per layer - Weights, cache and graph - Allocation status This can be used for debugging and validating memory estimates.		2025-12-29 06:38:06 -06:00
..
ggml	ggml: Report graph memory for failed allocations	2025-12-29 06:38:06 -06:00
ggml.go	ollamarunner: Memory usage reporting	2025-12-29 06:38:06 -06:00
quantization.go	Move quantization to new backend (#10363 )	2025-12-29 06:37:52 -06:00
threads.go	ollama debug tensor	2025-03-11 14:49:19 -07:00
threads_debug.go	ollama debug tensor	2025-03-11 14:49:19 -07:00