* origin/main:
  * docs: update modelfile.md to reflect current default num_ctx (#11189)
  * ggml: Use assigned layers when reporting loading stats
  * ggml: Disable unused pipeline parallelism
  * Only load supported models on new engine (#11362)

| Name |
|---|
| ggml |
| ggml.go |
| quantization.go |
| threads.go |
| threads_debug.go |