Commit Graph

3 Commits

Author SHA1 Message Date
Michael Yang 84a6567dee
fix token type 2025-12-29 06:37:43 -06:00
Parth Sareen 7cf4c146bc
llama: remove model loading for grammar (#10096) 2025-12-29 06:37:41 -06:00
Bruce MacDonald 6bd0a983cd model: support for mistral-small in the ollama runner
Mistral is a popular research lab making open source models. This updates
the forward pass of llama architecture models to support both llama models
and mistral models by accounting for additional metadata present in mistral
models, and finding the correct dimensions for the output projection.
2025-04-03 16:57:36 -07:00