ollama/kvcache
Jesse Gross a8e83a7654 Disable causal attention based on batch index
Currently we are using positions, which are relative to a
sequence and may not be unique.
2025-03-11 14:49:20 -07:00
..
cache.go model: Update encoder cache to use multimodal input processing handler 2025-03-09 17:05:26 -07:00
causal.go Disable causal attention based on batch index 2025-03-11 14:49:20 -07:00
causal_test.go Fix tests and drift from main 2025-03-11 14:49:18 -07:00
encoder.go model: Update encoder cache to use multimodal input processing handler 2025-03-09 17:05:26 -07:00
wrapper.go model: Update encoder cache to use multimodal input processing handler 2025-03-09 17:05:26 -07:00