ollama

Jesse Gross 73d6a82cce ollamarunner: Memory usage reporting	2025-05-22 14:38:09 -07:00

This provides granular information about the backend memory allocations required by the runner:
- Per backend
- Per layer
- Weights, cache and graph
- Allocation status

This can be used for debugging and validating memory estimates.

ggml	ggml: Report graph memory for failed allocations	2025-05-22 14:38:09 -07:00
ggml.go	ollamarunner: Memory usage reporting	2025-05-22 14:38:09 -07:00
quantization.go	Move quantization to new backend (#10363)	2025-05-06 11:20:48 -07:00
threads.go	ollama debug tensor	2025-03-11 14:49:19 -07:00
threads_debug.go	ollama debug tensor	2025-03-11 14:49:19 -07:00