ollama/llm
Devon Rifkin 558c1920fa
ggml: fix crash for array head counts
If a head count is an array, use the max value in the array.

If array values for head counts become more popular, we can consider a
more invasive change like #10225 to calculate more accurate estimates.

Fixes: #9984
2025-12-29 06:39:34 -06:00
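
The rule described in the commit message above (scalar head counts pass through, array head counts collapse to their maximum) can be sketched roughly as follows. This is an illustration only, not the code from memory.go: the function name `headCount`, its signature, and the choice of `uint32` are assumptions made for the example. Taking the maximum presumably keeps the estimate on the conservative side, at the cost of accuracy for models whose layers differ.

```go
package main

import "fmt"

// headCount is a hypothetical helper (not the actual ollama function).
// It accepts a head-count value that is either a single number or a
// per-layer array, and reduces it to one number: the scalar itself,
// or the largest element of the array.
// The boolean result is false for unsupported types and empty arrays.
func headCount(v any) (uint64, bool) {
	switch t := v.(type) {
	case uint32:
		// Scalar case: use the value as-is.
		return uint64(t), true
	case []uint32:
		// Array case: use the maximum value in the array.
		if len(t) == 0 {
			return 0, false
		}
		m := t[0]
		for _, n := range t[1:] {
			if n > m {
				m = n
			}
		}
		return uint64(m), true
	default:
		// Unknown type: report failure instead of panicking.
		return 0, false
	}
}

func main() {
	n, ok := headCount(uint32(32))
	fmt.Println(n, ok)

	n, ok = headCount([]uint32{8, 32, 16})
	fmt.Println(n, ok)
}
```

Returning a boolean rather than panicking on an unexpected type mirrors the intent of the fix, which is to avoid a crash when the value is not a plain scalar.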
..
llm_darwin.go Optimize container images for startup (#6547) 2024-09-12 12:10:30 -07:00
llm_linux.go Optimize container images for startup (#6547) 2024-09-12 12:10:30 -07:00
llm_windows.go win: lint fix (#10571) 2025-12-29 06:37:51 -06:00
memory.go ggml: fix crash for array head counts 2025-12-29 06:39:34 -06:00
memory_test.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
server.go avoid context overflow (#11175) 2025-12-29 06:38:18 -06:00
server_test.go lint: enable usetesting, disable tenv (#10594) 2025-12-29 06:37:55 -06:00
status.go Improve crash reporting (#7728) 2024-11-19 16:26:57 -08:00