Add cgo implementation for llama.cpp
Run the server.cpp directly inside the Go runtime via cgo while retaining the LLM Go abstractions.
This commit is contained in:
Reference in New Issue
Block a user
Run the server.cpp directly inside the Go runtime via cgo while retaining the LLM Go abstractions.