* origin/main: docs: update modelfile.md to reflect current default num_ctx (#11189) ggml: Use assigned layers when reporting loading stats ggml: Disable unused pipeline parallelism Only load supported models on new engine (#11362) |
||
|---|---|---|
| .. | ||
| ggml | ||
| backend.go | ||