Default Branch

5f179ff937 · Update README.md · Updated 2026-01-06 04:19:18 -08:00

Branches

f2a4d058f9 · gofmt · Updated 2025-06-16 16:34:46 -07:00    pali112

578
5

d5eae8248d · runner: enable returning more info from runner processing · Updated 2025-06-13 16:26:57 -07:00    pali112

581
1

f5d663e370 · update patches · Updated 2025-06-07 09:40:19 -07:00    pali112

597
2

46faf61a14 · ... · Updated 2025-06-06 16:34:53 -07:00    pali112

593
1

3cf62838ce · server: do partial gguf kv read for capability check · Updated 2025-06-06 16:14:19 -07:00    pali112

593
6

478824045d · temp · Updated 2025-05-29 16:17:11 -07:00    pali112

603
1

240921fd96 · fix: omit array parsing · Updated 2025-05-15 16:35:11 -07:00    pali112

644
1

2ab70e82d0 · remove rebase debug · Updated 2025-05-15 14:57:42 -07:00    pali112

652
13

5a2cd7b48a · runner: add test for unicode token processing · Updated 2025-05-14 11:29:11 -07:00    pali112

651
1

5c76074f66 · wip · Updated 2025-05-12 19:15:42 -07:00    pali112

662
57

715952705e · model: framework for testing forward pass · Updated 2025-05-08 09:25:12 -07:00    pali112

675
1

23e8ac9428 · wip? · Updated 2025-05-07 19:00:44 -07:00    pali112

722
2

a0a1fb463a · build: disable cuda compression · Updated 2025-05-05 11:20:57 -07:00    pali112

689
1

67335dede2 · lower default NUM_PARALLEL to 2 · Updated 2025-04-29 02:03:51 -07:00    pali112

718
1

d20cd8df80 · fix incorrect chat truncation · Updated 2025-04-28 16:11:36 -07:00    pali112

722
1

f4ab82f0b4 · llama: sync · Updated 2025-04-25 16:38:05 -07:00    pali112

739
1

b4cd1118ab · checkpoint for vscode · Updated 2025-04-24 18:23:23 -07:00    pali112

773
4

7c94471d38 · ggml: more accurate estimates for head count array case · Updated 2025-04-10 16:28:34 -07:00    pali112

773
2

04950140ec · server: do not attempt to parse offset file as gguf · Updated 2025-04-09 09:41:46 -07:00    pali112

776
1

3bc9d42e2e · rebase + fix tests · Updated 2025-04-03 17:31:21 -07:00    pali112

790
2