pali112/ollama
12a7e5ec46c2ed20611fcac3ffb9c934107afcbb
ollama/model/models
Michael Yang 12a7e5ec46 gemma3: scale in attention
2025-08-19 13:43:47 -07:00
Name       Last updated                Last commit
gemma2     2025-08-19 13:33:12 -07:00  gemma2: use fast attention
gemma3     2025-08-19 13:43:47 -07:00  gemma3: scale in attention
gemma3n    2025-07-29 12:37:06 -07:00  Increase performance for Gemma3n models on NVGPUs by enabling CUDA Graph execution (#11525)
gptoss     2025-08-14 14:42:58 -07:00  update vendored llama.cpp and ggml (#11823)
llama      2025-07-11 12:21:54 -07:00  Only load supported models on new engine (#11362)
llama4     2025-06-11 12:10:15 -07:00  use nn.Linear in place of ml.Tensor (#11049)
mistral3   2025-05-22 14:38:09 -07:00  ml: Panic rather than return error on tensor allocation failure
mllama     2025-05-22 14:38:09 -07:00  ml: Panic rather than return error on tensor allocation failure
qwen2      2025-07-11 12:21:54 -07:00  Only load supported models on new engine (#11362)
qwen3      2025-06-11 12:10:15 -07:00  use nn.Linear in place of ml.Tensor (#11049)
qwen25vl   2025-05-22 14:38:09 -07:00  ml: Panic rather than return error on tensor allocation failure
models.go  2025-08-05 12:21:16 -07:00  gpt-oss (#11672)