ollama

History

Jesse Gross 79f6376f5b ggml: No-alloc mode Callers can set a backend buffer type to be no-alloc, meaning that it does not allocate memory for tensors or operations. This can be used for calculating memory requirements. Tensors and graphs must be recreated with no-alloc set to false before loading data. Defaults to false for newly created backend buffer types.		2025-08-08 14:57:13 -07:00
..
ggml	ggml: No-alloc mode	2025-08-08 14:57:13 -07:00
ggml.go	ggml: Support closing backends	2025-08-08 14:57:13 -07:00
ggml_test.go	gpt-oss (#11672 )	2025-08-05 12:21:16 -07:00
mxfp4_test.go	gpt-oss (#11672 )	2025-08-05 12:21:16 -07:00
quantization.go	gpt-oss (#11672 )	2025-08-05 12:21:16 -07:00
threads.go	ollama debug tensor	2025-03-11 14:49:19 -07:00
threads_debug.go	ollama debug tensor	2025-03-11 14:49:19 -07:00