ollama

History

Blake Mizerany 7893ccb68c introduce build.go for controlling distribution builds This commit aims to provide the Ollama maintainers with maximum control of the distribution build process by creating a cross-platform shim. Currently, we have no flexibility, or control of the process (pre and post) or even the quality of the build. By introducing a shim, and propagating it out to Homebrew, et al., we can soon after ensure that the build process is consistent, and reliable. This also happens to remove the requirement for go generate and the build tag hacks, but it does still support go generate in the flow, at least until we can remove it after the major distribution use the new build process. About the script: Beyond giving the Ollama maintainers drastically more control over the build process, the script also provides a few other benefits: - It is cross-platform, and can be run on any platform that supports Go (a hard requirement for building Ollama anyway). - It can can check for correct versions of cmake, and other dependencies before starting the build process, and provide helpful error messages to the user if they are not met. - It can be used to build the distribution for any platform, architecture, or build type (debug, release, etc.) with a single command. Currently, it is two commands. - It can skip parts of the build process if they are already done, such as build the C dependencies. Of course there is a -f flag to force rebuild. - So much more!		2024-06-30 22:18:45 -07:00
..
ext_server	Do not shift context for sliding window models (#5368 )	2024-06-28 19:39:31 -07:00
generate	introduce build.go for controlling distribution builds	2024-06-30 22:18:45 -07:00
llama.cpp@7c26775adb	llm: update llama.cpp commit to `7c26775` (#4896 )	2024-06-17 15:56:16 -04:00
patches	llm: architecture patch (#5316 )	2024-06-26 21:38:12 -07:00
filetype.go	Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322 )	2024-05-23 13:21:49 -07:00
ggla.go	llm: speed up gguf decoding by a lot (#5246 )	2024-06-24 21:47:52 -07:00
ggml.go	gemma2 graph	2024-06-27 13:34:52 -07:00
ggml_test.go	llm: speed up gguf decoding by a lot (#5246 )	2024-06-24 21:47:52 -07:00
gguf.go	llm: speed up gguf decoding by a lot (#5246 )	2024-06-24 21:47:52 -07:00
llm.go	revert tokenize ffi (#4761 )	2024-05-31 18:54:21 -07:00
llm_darwin_amd64.go	Switch back to subprocessing for llama.cpp	2024-04-01 16:48:18 -07:00
llm_darwin_arm64.go	Switch back to subprocessing for llama.cpp	2024-04-01 16:48:18 -07:00
llm_linux.go	Switch back to subprocessing for llama.cpp	2024-04-01 16:48:18 -07:00
llm_windows.go	Move nested payloads to installer and zip file on windows	2024-04-23 16:14:47 -07:00
memory.go	handle asymmetric embedding KVs	2024-06-20 09:57:27 -07:00
memory_test.go	llm: speed up gguf decoding by a lot (#5246 )	2024-06-24 21:47:52 -07:00
payload.go	Move libraries out of users path	2024-06-17 13:12:18 -07:00
server.go	llm: speed up gguf decoding by a lot (#5246 )	2024-06-24 21:47:52 -07:00
status.go	Switch back to subprocessing for llama.cpp	2024-04-01 16:48:18 -07:00