Files
ollama/server
Bruce MacDonald 12a8b00b34 server: allow running embed models in parallel
The ability to run embedding models in parallel with other types of models
was removed due to limitations in server slot loading in a past version of
the server. This slot loading system is no longer used, and embedding models
can run in parallel with chat models.
2025-03-10 13:34:09 -07:00
..
2024-07-26 14:14:48 -07:00
2025-02-13 16:31:21 -08:00
2025-02-13 16:31:21 -08:00
2025-02-13 16:31:21 -08:00