ollama/ml
Michael Yang bfce55db3d model: load non-repeated tensors into multiple backends
Some tensors are used in repeating layers but are not themselves
repeated. This change copies those tensors into the same backends as
their repeating counterparts, which minimizes copying tensors between
backends at run time.
2025-03-07 14:08:21 -08:00
backend model: load non-repeated tensors into multiple backends 2025-03-07 14:08:21 -08:00
nn attention: Remove unnecessary contiguous operations 2025-03-01 20:53:23 -08:00
backend.go ml/backend/ggml: consolidate system info logging 2025-03-04 15:14:31 -08:00
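
The idea in the latest commit message can be illustrated with a minimal Go sketch. It is not the code from this repository: the function names (`assignLayers`, `placeShared`), the backend labels, and the even-split placement policy are all assumptions made for illustration. It only shows the general shape of the approach, where a tensor that is stored once but read by every repeating layer gets a copy on each backend that hosts at least one of those layers.

```go
package main

import (
	"fmt"
	"sort"
)

// assignLayers spreads numLayers repeating layers evenly across the given
// backends. Hypothetical policy: the real loader may split layers differently.
func assignLayers(numLayers int, backends []string) map[int]string {
	placement := make(map[int]string, numLayers)
	for i := 0; i < numLayers; i++ {
		placement[i] = backends[i*len(backends)/numLayers]
	}
	return placement
}

// placeShared returns the sorted set of backends that host at least one
// repeating layer. A non-repeated tensor that those layers read would be
// copied into each of these backends, and into no others.
func placeShared(placement map[int]string) []string {
	seen := make(map[string]bool)
	for _, backend := range placement {
		seen[backend] = true
	}
	out := make([]string, 0, len(seen))
	for backend := range seen {
		out = append(out, backend)
	}
	sort.Strings(out)
	return out
}

func main() {
	// Four repeating layers split across two hypothetical backends.
	placement := assignLayers(4, []string{"gpu0", "gpu1"})
	fmt.Println(placeShared(placement)) // prints [gpu0 gpu1]
}
```

The trade-off in this sketch is a little extra memory per backend in exchange for each layer finding the shared tensor locally. A backend that hosts no repeating layer receives no copy.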