With the new version of GGML in #12245, KV cache quantization no longer causes a fallback to CPU. |
||
|---|---|---|
| .. | ||
| ggml.go | ||
| ggml_test.go | ||
| gguf.go | ||
| gguf_test.go | ||
| type.go | ||
With the new version of GGML in #12245, KV cache quantization no longer causes a fallback to CPU. |
||
|---|---|---|
| .. | ||
| ggml.go | ||
| ggml_test.go | ||
| gguf.go | ||
| gguf_test.go | ||
| type.go | ||