ollama/ml/backend
Jesse Gross 015e39a8be
ggml: Disable unused pipeline parallelism
We're not currently using it, even in cases where we could. Disabling
it improves generation performance by 10-30% with multiple GPUs.
2025-12-29 06:39:42 -06:00
..
ggml ggml: Disable unused pipeline parallelism 2025-12-29 06:39:42 -06:00
backend.go next ollama runner (#7913) 2025-02-13 16:31:21 -08:00