pali112/ollama
6a62b894c79d80667a3715fd04bed954a9458559
ollama/fs/ggml
Jesse Gross 19e6796eac llm: Support KV cache quantization with gpt-oss
With the new version of GGML in #12245, KV cache quantization no longer causes a fallback to CPU.
2025-10-03 16:31:58 -07:00
ggml_test.go    ggml: fix crash for array head counts                                      2025-04-27 11:38:06 -07:00
ggml.go         llm: Support KV cache quantization with gpt-oss                            2025-10-03 16:31:58 -07:00
gguf_test.go    convert: fix tensor sorting (#12015)                                       2025-08-26 13:57:46 -07:00
gguf.go         convert: fix tensor sorting (#12015)                                       2025-08-26 13:57:46 -07:00
type.go         convert(gptoss): mxfp4 to ggml layout to avoid jit conversion (#12018)     2025-08-26 16:41:02 -07:00