Josh Yan
d62a3a1e2b
lint
2024-07-22 15:51:52 -07:00
Josh Yan
de48cd681f
clean
2024-07-22 15:51:52 -07:00
Josh Yan
5d0e078057
removed cmt and prints
2024-07-22 15:51:52 -07:00
Josh Yan
8d5739b833
removed client isLocal()
2024-07-22 15:51:52 -07:00
Josh Yan
b5ff0ed4ff
lint
2024-07-22 15:51:52 -07:00
Josh Yan
857054f9fa
lint
2024-07-22 15:51:52 -07:00
Josh Yan
6dd9be55e2
lint
2024-07-22 15:51:52 -07:00
Josh Yan
d70707a668
syscopy windows
2024-07-22 15:51:52 -07:00
Josh Yan
c88774ffeb
os copy
2024-07-22 15:51:52 -07:00
Josh Yan
34d197000d
rmv prints
2024-07-22 15:51:52 -07:00
Josh Yan
6c0a8379f6
local copy
2024-07-22 15:51:52 -07:00
Josh Yan
163ee9a8b0
isLocal firstdraft
2024-07-22 15:51:52 -07:00
Josh Yan
de7b2f3948
clean
2024-07-22 15:51:52 -07:00
Josh Yan
f27c66fb0c
rm bench
2024-07-22 15:51:52 -07:00
Josh Yan
a238191798
rm config
2024-07-22 15:51:52 -07:00
Josh Yan
6436c7a375
rm config
2024-07-22 15:51:52 -07:00
Josh Yan
896a15874e
clean
2024-07-22 15:51:52 -07:00
Josh Yan
56008688a1
local path
2024-07-22 15:51:52 -07:00
Josh Yan
d14d38e940
still works
2024-07-22 15:51:52 -07:00
Josh Yan
03df02883d
rebase
2024-07-22 15:51:52 -07:00
Josh Yan
ae49abf80a
benchmark
2024-07-22 15:51:52 -07:00
Josh Yan
2c450502db
on disk copy
2024-07-22 15:51:52 -07:00
Josh Yan
46b76aeb46
start tests
2024-07-22 15:51:52 -07:00
Josh Yan
0e01da82d6
errorsis
2024-07-22 15:51:31 -07:00
Josh Yan
6b1b85ba3d
hide initialize keypair
2024-07-22 15:41:04 -07:00
Josh Yan
5603441538
test
2024-07-22 13:58:50 -07:00
Josh Yan
76b4dfcc9e
auth
2024-07-22 13:54:02 -07:00
Daniel Hiltgen
5784c05397
Merge pull request #5854 from dhiltgen/win_exit_status
...
Refine error reporting for subprocess crash
2024-07-22 10:40:22 -07:00
Daniel Hiltgen
f14aa5435d
Merge pull request #5855 from dhiltgen/remove_max_vram
...
Remove no longer supported max vram var
2024-07-22 10:35:29 -07:00
Jeffrey Morgan
f8fedbda20
Update llama.cpp submodule commit to d94c6e0c ( #5805 )
v0.2.8-rc2
2024-07-22 12:42:00 -04:00
Jeffrey Morgan
b3e5491e41
server: collect nested tool call objects when parsing ( #5824 )
2024-07-22 12:38:03 -04:00
Daniel Hiltgen
cc269ba094
Remove no longer supported max vram var
...
The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM
scenarios. With Concurrency this was no longer wired up, and the simplistic
value doesn't map to multi-GPU setups. Users can still set `num_gpu`
to limit memory usage to avoid OOM if we get our predictions wrong.
2024-07-22 09:08:11 -07:00
Daniel Hiltgen
a3c20e3f18
Refine error reporting for subprocess crash
...
On windows, the exit status winds up being the search term many
users search for and end up piling in on issues that are unrelated.
This refines the reporting so that if we have a more detailed message
we'll suppress the exit status portion of the message.
2024-07-22 08:52:16 -07:00
Jeffrey Morgan
80ee9b5e47
Remove out of space test temporarily ( #5825 )
2024-07-21 00:22:11 -04:00
Jeffrey Morgan
5534f2cc6a
llm: consider head_dim in llama arch ( #5817 )
v0.2.8-rc1
2024-07-20 21:48:12 -04:00
Daniel Hiltgen
d321297d8a
Merge pull request #5815 from dhiltgen/win_rocm_gfx_features
...
Adjust windows ROCm discovery
2024-07-20 16:02:55 -07:00
Daniel Hiltgen
06e5d74e34
Merge pull request #5506 from dhiltgen/sched_tests
...
Refine scheduler unit tests for reliability
2024-07-20 15:48:39 -07:00
Daniel Hiltgen
5d707e6fd5
Merge pull request #5583 from dhiltgen/integration_improvements
...
Fix context exhaustion integration test for small gpus
2024-07-20 15:48:21 -07:00
Daniel Hiltgen
283948c83b
Adjust windows ROCm discovery
...
The v5 hip library returns unsupported GPUs which wont enumerate at
inference time in the runner so this makes sure we align discovery. The
gfx906 cards are no longer supported so we shouldn't compile with that
GPU type as it wont enumerate at runtime.
2024-07-20 15:17:50 -07:00
Jeffrey Morgan
1475eab95f
add patch for tekken ( #5807 )
2024-07-20 13:41:21 -04:00
Jeffrey Morgan
20090f3172
preserve last assistant message ( #5802 )
2024-07-19 20:19:26 -07:00
Jeffrey Morgan
69a2d4ccff
Fix generate test flakyness ( #5804 )
2024-07-19 19:11:25 -07:00
Josh
e8b954c646
server: validate template ( #5734 )
...
add template validation to modelfile
2024-07-19 15:24:29 -07:00
royjhan
c57317cbf0
OpenAI: Function Based Testing ( #5752 )
...
* distinguish error forwarding
* more coverage
* rm comment
2024-07-19 11:37:12 -07:00
royjhan
51b2fd299c
adjust openai chat msg processing ( #5729 )
2024-07-19 11:19:20 -07:00
Michael Yang
d0634b1596
Merge pull request #5780 from ollama/mxyng/tools
...
fix parsing tool calls: break on unexpected eofs
v0.2.7
2024-07-18 12:14:10 -07:00
Michael Yang
43606d6d6a
fix parsing tool calls
2024-07-18 12:08:11 -07:00
Jeffrey Morgan
70b1010fa5
server: check for empty tools array too ( #5779 )
2024-07-18 11:44:57 -07:00
Jeffrey Morgan
84e5721f3a
always provide content even if empty ( #5778 )
2024-07-18 11:28:19 -07:00
Jeffrey Morgan
319fb1ce03
server: only parse tool calls if tools are provided ( #5771 )
...
* server: only parse tool calls if tools are provided
* still set `resp.Message.Content`
v0.2.6
2024-07-18 08:50:23 -07:00