It was implemented upstream: https://github.com/ggml-org/llama.cpp/pull/14741 Branch: GraniteFour Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>