ollama/ml/backend
Michael Yang 0796d79d19 cuda: skip large batches
cuda panics on batches larger than 1024 so skip those and fallback to
cpu
2025-11-18 16:11:37 -08:00
..
ggml cuda: skip large batches 2025-11-18 16:11:37 -08:00
backend.go next ollama runner (#7913) 2025-02-13 16:31:21 -08:00