Files

Daniel Hiltgen bd15eba4e4 Bring back escape valve for llm libraries and fix Jetpack6 crash (#12529)

* Bring back escape valve for llm libraries

If the new discovery logic picks the wrong library, this gives users the
ability to force a specific one using the same pattern as before. This
can also speed up bootstrap discovery if one of the libraries
takes a long time to load and ultimately binds to no devices. For example,
unsupported AMD iGPUs can sometimes take a while to discover and rule out.

* Bypass extra discovery on jetpack systems

On at least Jetpack6, cuda_v12 appears to expose the iGPU but crashes later in
cublasInit, so if we detect a Jetpack system, short-circuit and use that variant.

2025-10-07 16:06:14 -07:00

images

Fix import image width (#6528)

2024-08-27 14:19:47 -07:00

api.md

feat: add dimensions field to embed requests (#12242)

2025-09-11 10:36:10 -07:00

cloud.md

docs: update cloud.md for cloud models

2025-09-22 13:09:17 -03:00

development.md

doc: show how to clear the cgo cache (#12298)