* Move quantization logic to GGML via new backend. This moves the model-aware logic to Go code and calls GGML's quantization code for model creation. * Remove "add model quantizations". This is no longer needed now that quantization is implemented in Go+GGML code directly. |
||
|---|---|---|
| common | ||
| examples/llava | ||
| include | ||
| src | ||
| .rsync-filter | ||
| LICENSE | ||