ollama/ml/backend/ggml
Gabe Goodhart e6a22f20d1 Merge remote-tracking branch 'origin/main' into GraniteFour
* origin/main:
docs: update modelfile.md to reflect current default num_ctx (#11189)
ggml: Use assigned layers when reporting loading stats
ggml: Disable unused pipeline parallelism
Only load supported models on new engine (#11362)
2025-07-15 14:50:19 -06:00
ggml              feat: Sync llama.cpp                                    2025-07-15 14:50:01 -06:00
ggml.go           ggml: Use assigned layers when reporting loading stats  2025-07-11 14:21:50 -07:00
quantization.go   Move quantization to new backend (#10363)               2025-05-06 11:20:48 -07:00
threads.go        ollama debug tensor                                     2025-03-11 14:49:19 -07:00
threads_debug.go  ollama debug tensor                                     2025-03-11 14:49:19 -07:00