Michael Yang
db3a312edf
feat: uneven splits (#11048)
The current splitDim function only operates on tensors that are split evenly, which isn't always the case (e.g. a QKV tensor). This change allows the function to be used for arbitrary splits.
2025-12-29 06:38:14 -06:00
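To illustrate what an uneven split means here, the following is a minimal sketch and not the splitDim implementation from this commit. It assumes a row-major matrix stored in a flat slice and cuts it along the row dimension into parts of different sizes, as one might for a fused QKV tensor whose query block has more rows than its key and value blocks. The name `splitRows`, its signature, and the 4/1/1 row layout are all hypothetical.

```go
package main

import (
	"errors"
	"fmt"
)

// splitRows is a hypothetical helper (not the splitDim from this commit).
// It treats data as a row-major matrix with the given number of columns and
// cuts it along the row dimension into parts whose row counts are listed in
// sizes. The sizes do not have to be equal, but they must sum to the total
// number of rows.
func splitRows(data []float32, cols int, sizes []int) ([][]float32, error) {
	if cols <= 0 || len(data)%cols != 0 {
		return nil, errors.New("data length is not a multiple of cols")
	}
	rows := len(data) / cols

	total := 0
	for _, s := range sizes {
		if s < 0 {
			return nil, errors.New("negative split size")
		}
		total += s
	}
	if total != rows {
		return nil, fmt.Errorf("split sizes sum to %d, want %d rows", total, rows)
	}

	// Rows are contiguous in row-major order, so each part is a sub-slice
	// that shares memory with data (no copy is made).
	parts := make([][]float32, 0, len(sizes))
	offset := 0
	for _, s := range sizes {
		end := offset + s*cols
		parts = append(parts, data[offset:end])
		offset = end
	}
	return parts, nil
}

func main() {
	// Assumed layout: a fused 6x2 matrix with 4 query rows, 1 key row, 1 value row.
	data := make([]float32, 12)
	for i := range data {
		data[i] = float32(i)
	}

	parts, err := splitRows(data, 2, []int{4, 1, 1})
	if err != nil {
		panic(err)
	}
	for i, p := range parts {
		fmt.Println("part", i, "has", len(p)/2, "rows:", p)
	}
}
```

An even split is the special case where every entry in `sizes` is the same, so a caller that previously divided the dimension by a part count could pass a list of equal sizes instead.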