ollama/ml/backend/ggml
Michael Yang 0796d79d19 cuda: skip large batches
cuda panics on batches larger than 1024 so skip those and fallback to
cpu
2025-11-18 16:11:37 -08:00
..
ggml cuda: skip large batches 2025-11-18 16:11:37 -08:00
ggml.go deepseekocr 2025-11-18 16:11:37 -08:00
ggml_test.go ml: add slice operation (#12870) 2025-11-13 13:28:21 -08:00
quantization.go chore: fix some inconsistent function name in comment 2025-08-13 09:50:27 -07:00
threads.go ollama debug tensor 2025-03-11 14:49:19 -07:00
threads_debug.go ollama debug tensor 2025-03-11 14:49:19 -07:00