ollama/llm
Jesse Gross eb10390de9 llm: Enable new memory estimates by default
New memory estimates (see #11090 for more information) are now
enabled automatically for all models running on the Ollama engine,
improving both stability and performance through more accurate sizing
and allocation. Models running on the llama engine will continue to
use the original style of memory estimation.
2025-09-11 11:21:53 -07:00
llm_darwin.go   Optimize container images for startup (#6547)              2024-09-12 12:10:30 -07:00
llm_linux.go    Optimize container images for startup (#6547)              2024-09-12 12:10:30 -07:00
llm_windows.go  win: lint fix (#10571)                                     2025-05-05 11:08:12 -07:00
memory.go       llm: Remove unneeded warning with flash attention enabled  2025-09-10 16:40:45 -07:00
memory_test.go  llm: New memory management                                 2025-08-14 15:24:01 -07:00
server.go       llm: Enable new memory estimates by default                2025-09-11 11:21:53 -07:00
server_test.go  llm: New memory management                                 2025-08-14 15:24:01 -07:00
status.go       Improve crash reporting (#7728)                            2024-11-19 16:26:57 -08:00