ollama

Michael Yang 9213339549 fix: qwen25vl assign samebatch in multimodal input (#10789) Setting SameBatch on the vision start token is problematic because that token is shared with other inputs that also use images. This causes the input to be cached, so the runner will not see SameBatch. SameBatch will also be incorrect, since it may be for a different image. Assigning SameBatch to the input tokens resolves this by ensuring it is assigned correctly to the inputs corresponding to the image. Not setting SameBatch correctly may cause panics during inference, since images are no longer guaranteed to be in the same batch.		2025-12-29 06:38:03 -06:00
gemma2	ml: add more rope options (#10775)	2025-12-29 06:38:03 -06:00
gemma3	ml: add more rope options (#10775)	2025-12-29 06:38:03 -06:00
llama	ml: add more rope options (#10775)	2025-12-29 06:38:03 -06:00
llama4	ml: add more rope options (#10775)	2025-12-29 06:38:03 -06:00
mistral3	ml: add more rope options (#10775)	2025-12-29 06:38:03 -06:00
mllama	ml: add more rope options (#10775)	2025-12-29 06:38:03 -06:00
qwen25vl	fix: qwen25vl assign samebatch in multimodal input (#10789)	2025-12-29 06:38:03 -06:00
models.go	model: add Qwen2.5-VL support (#10385)	2025-12-29 06:37:59 -06:00