ollama/model/models
Bruce MacDonald 51ad65f831 ml: structured rope config to allow specifying context len
This commit refactors the Rotary Position Embedding (RoPE) implementation across the codebase to use a structured configuration approach instead of individual parameters.

Key changes:
- Add new RoPEConfig struct with fields for dimension, type, base frequency, and scaling
- Add RopeType enum to formalize different RoPE implementation variants
- Add YarnConfig struct and related configuration for YaRN (Yet Another RoPE extensioN) context extension
- Update RoPE method signature across all tensor interfaces and implementations
- Refactor all model implementations (llama, gemma2, gemma3, mllama) to use the new configuration structure

This change improves code organization, makes the RoPE configuration more explicit, and provides better support for different RoPE variants and context extension methods.
2025-05-12 13:49:41 -07:00
..
gemma2 ml: structured rope config to allow specifying context len 2025-05-12 13:49:41 -07:00
gemma3 ml: structured rope config to allow specifying context len 2025-05-12 13:49:41 -07:00
llama ml: structured rope config to allow specifying context len 2025-05-12 13:49:41 -07:00
llama4 model: fix build (#10416) 2025-04-25 19:24:48 -07:00
mistral3 fix token type 2025-04-25 16:59:01 -07:00
mllama ml: structured rope config to allow specifying context len 2025-05-12 13:49:41 -07:00
qwen2vl models: Move model into their own directory 2025-02-13 17:09:26 -08:00
models.go llama4 2025-04-25 16:59:20 -07:00