ollama/runner/ollamarunner
Jesse Gross b2a465296d runner: Release semaphore and improve error messages on failures
If we have an error after creating a new sequence but before
finding a slot for it, we return without releasing the semaphore.
This reduces our parallel sequences and eventually leads to deadlock.

In practice this should never happen because once we have acquired
the semaphore, we should always be able to find a slot. However, the
code is clearly not correct.
2025-03-30 19:21:54 -07:00
..
cache.go kvcache: Pass granular cache size into implementations 2025-03-21 11:20:19 -07:00
cache_test.go runner: remove cache prompt flag from ollama runner (#9826) 2025-03-17 15:11:15 -07:00
runner.go runner: Release semaphore and improve error messages on failures 2025-03-30 19:21:54 -07:00