Default Branch

5f179ff937 · Update README.md · Updated 2026-01-06 04:19:18 -08:00

Branches

b662e4706e · Remove default auto from help message · Updated 2024-07-01 16:01:01 -07:00 · pali112 · 1933 behind, 20 ahead

b7860f12ad · reference license, template, system as files · Updated 2024-07-01 09:23:08 -07:00 · pali112 · 1917 behind, 3 ahead

7893ccb68c · introduce build.go for controlling distribution builds · Updated 2024-06-30 22:18:45 -07:00 · pali112 · 1921 behind, 1 ahead

1963c00201 · Update README.md (#5214) · Updated 2024-06-30 19:00:57 -07:00 · pali112 · 1921 behind, 0 ahead · Included

0ac5cbc00e · separate deprecation changes · Updated 2024-06-28 13:22:37 -07:00 · pali112 · 2033 behind, 3 ahead

1071e17626 · lint · Updated 2024-06-28 13:14:49 -07:00 · pali112 · 1926 behind, 3 ahead

d77a174eb4 · defaut timeout · Updated 2024-06-27 14:58:31 -07:00 · pali112 · 1930 behind, 1 ahead

70d31c1e9a · use timestamp from challenge, fallback to local time · Updated 2024-06-25 10:20:25 -07:00 · pali112 · 1933 behind, 1 ahead

acbffa59e9 · llm: suppress large allocations for GGUF arrays · Updated 2024-06-23 14:26:56 -07:00 · pali112 · 1935 behind, 1 ahead

c494aea5c8 · Strip stop strings · Updated 2024-06-20 09:06:08 -07:00 · pali112 · 1950 behind, 1 ahead

9b5b69c00f · llm: update llama.cpp submodule to `7c26775` · Updated 2024-06-17 10:46:02 -07:00 · pali112 · 1981 behind, 1 ahead

9357570d59 · OpenAI Delete Endpoint · Updated 2024-06-14 16:28:22 -07:00 · pali112 · 2017 behind, 1 ahead

d63e1f5b34 · Lint · Updated 2024-06-10 09:36:05 -07:00 · pali112 · 2058 behind, 6 ahead

d8b3e09fb7 · llm: enable flash attention by default · Updated 2024-06-08 22:55:22 -07:00 · pali112 · 2047 behind, 1 ahead

05f79602f0 · server: dont error on missing `tokenizer.chat_template` · Updated 2024-06-07 09:12:08 -07:00 · pali112 · 2054 behind, 1 ahead

5dc5a295bf · added testcase · Updated 2024-06-03 17:28:05 -07:00 · pali112 · 2083 behind, 3 ahead

5f0403d208 · Isolated Deletions · Updated 2024-05-31 17:40:11 -07:00 · pali112 · 2117 behind, 7 ahead

95af97b9f3 · server: try github.com/minio/sha256-simd · Updated 2024-05-31 00:51:20 -07:00 · pali112 · 2090 behind, 1 ahead

c79fd5c168 · Reincluding Numbers · Updated 2024-05-29 12:22:36 -07:00 · pali112 · 2117 behind, 2 ahead

b73a512f24 · fix the cpu estimatedTotal memory + get the expiry time for loading models · Updated 2024-05-15 15:15:14 -07:00 · pali112 · 2195 behind, 1 ahead