ollama/convert
Michael Yang df4b146c49
fix: mllama quality (#10807)
* fix mllama convert

- transform attn_gate and ffn_gate
- swap attention heads for vision models

* fix mllama

the mlp gate was applied in the wrong place
2025-12-29 06:38:05 -06:00
sentencepiece chore(all): replace instances of interface with any (#10067) 2025-04-02 09:44:27 -07:00
testdata convert: import support for command-r models from safetensors (#6063) 2025-01-15 16:31:22 -08:00
convert.go model: handle multiple eos tokens (#10577) 2025-12-29 06:38:01 -06:00
convert_bert.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
convert_commandr.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
convert_gemma.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
convert_gemma2.go next ollama runner (#7913) 2025-02-13 16:31:21 -08:00
convert_gemma2_adapter.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
convert_gemma3.go fix: change default context size for gemma3 (#9744) 2025-03-13 13:59:19 -07:00
convert_llama.go fix mllama conversion (#10716) 2025-12-29 06:38:00 -06:00
convert_llama4.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
convert_llama_adapter.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
convert_mistral.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
convert_mixtral.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
convert_mllama.go fix: mllama quality (#10807) 2025-12-29 06:38:05 -06:00
convert_phi3.go Move quantization to new backend (#10363) 2025-12-29 06:37:52 -06:00
convert_qwen2.go model: add Qwen2.5-VL support (#10385) 2025-12-29 06:37:59 -06:00
convert_qwen25vl.go model: add Qwen2.5-VL support (#10385) 2025-12-29 06:37:59 -06:00
convert_test.go ggml: Separate tensor load from backend creation 2025-12-29 06:38:02 -06:00
reader.go chore: update mllama to use ollama engine (#10637) 2025-12-29 06:37:59 -06:00
reader_safetensors.go llama4 2025-12-29 06:37:44 -06:00
reader_torch.go llama4 2025-12-29 06:37:44 -06:00
sentencepiece_model.proto all: fix typos in documentation, code, and comments (#7021) 2024-12-10 12:58:06 -08:00
tensor.go model: add Qwen2.5-VL support (#10385) 2025-12-29 06:37:59 -06:00
tokenizer.go model: handle multiple eos tokens (#10577) 2025-12-29 06:38:01 -06:00
tokenizer_spm.go temporary work around for converting spm 2025-03-11 14:49:18 -07:00
tokenizer_test.go model: handle multiple eos tokens (#10577) 2025-12-29 06:38:01 -06:00