ollama/model/models/qwen3
Michael Yang 6c833d5f8d fix(qwen3): deepseek distill
DeepSeek's Qwen3 distill uses a different RoPE scheme, so support both.
2025-10-13 13:30:30 -07:00
embed.go multi-regexp pretokenizer (#12325) 2025-09-23 13:21:47 -07:00
model.go fix(qwen3): deepseek distill 2025-10-13 13:30:30 -07:00