ollama

History

Jesse Gross eb10390de9 llm: Enable new memory estimates by default New memory estimates (see #11090 for more information) are now enabled automatically for all models running on the Ollama engine, improving both stability and performance through more accurate sizing and allocation. Models running on the llama engine will continue to use the original style of memory estimation.		2025-09-11 11:21:53 -07:00
..
llm_darwin.go	Optimize container images for startup (#6547 )	2024-09-12 12:10:30 -07:00
llm_linux.go	Optimize container images for startup (#6547 )	2024-09-12 12:10:30 -07:00
llm_windows.go	win: lint fix (#10571 )	2025-05-05 11:08:12 -07:00
memory.go	llm: Remove unneeded warning with flash attention enabled	2025-09-10 16:40:45 -07:00
memory_test.go	llm: New memory management	2025-08-14 15:24:01 -07:00
server.go	llm: Enable new memory estimates by default	2025-09-11 11:21:53 -07:00
server_test.go	llm: New memory management	2025-08-14 15:24:01 -07:00
status.go	Improve crash reporting (#7728 )	2024-11-19 16:26:57 -08:00