ollama/ml/nn
Grace 584e2d646f
Add deepseek v3.1 (#13063)
* Add mla for flash attention
* Revert to using chunks
2025-11-17 18:03:21 -08:00
Name              Last commit                                                     Date
fast              ml: add more rope options (#10775)                              2025-05-20 15:51:08 -07:00
pooling           chore: update models to use slice/chunk/chunksections (#12934)  2025-11-13 15:20:12 -08:00
rope              ggml update to b6840 (#12791)                                   2025-11-06 10:19:22 -08:00
attention.go      Add deepseek v3.1 (#13063)                                      2025-11-17 18:03:21 -08:00
convolution.go    fix: conv2d bias (#12834)                                       2025-10-29 11:03:43 -07:00
embedding.go      next ollama runner (#7913)                                      2025-02-13 16:31:21 -08:00
linear.go         update vendored llama.cpp and ggml (#11823)                     2025-08-14 14:42:58 -07:00
normalization.go  next ollama runner (#7913)                                      2025-02-13 16:31:21 -08:00