Michael Yang
801564fa8b
add new gemma model (#11204)
...
* update patches
* cherry pick metal mean kernel
* cherry pick cuda mean kernel
* gemma3n
2025-12-29 06:39:38 -06:00
Michael Yang
dc8ee7636b
feat: port qwen2 model (#10782)
2025-12-29 06:38:04 -06:00
Michael Yang
9215b190fa
feat: qwen3 dense and sparse models (#10708)
...
* feat: qwen3 dense
* feat: qwen3moe
* fix llama4 moe
2025-12-29 06:38:04 -06:00
Bruce MacDonald
558b0f5fe9
model: add Qwen2.5-VL support (#10385)
2025-12-29 06:37:59 -06:00
Michael Yang
0f5c45e19d
llama4
2025-12-29 06:37:44 -06:00
Bruce MacDonald
6bd0a983cd
model: support for mistral-small in the ollama runner
...
Mistral is a popular research lab that makes open-source models. This updates
the forward pass of llama-architecture models to support both llama and
mistral models by accounting for the additional metadata present in mistral
models and finding the correct dimensions for the output projection.
2025-04-03 16:57:36 -07:00
Patrick Devine
5f74d1fd47
gemma2 impl
2025-03-11 14:35:08 -07:00
Jesse Gross
6945617af5
models: Move models into their own directory
...
This allows there to be a file that is a list of models that is
not mixed into the runner code.
2025-02-13 17:09:26 -08:00