There are two cases where we may not have an output after computing: - Prompt processing where the length of the input exceeds the batch size - Internal memory management operations such as cache defrag and shift |
||
|---|---|---|
| .. | ||
| imageproc | ||
| llama | ||
| mllama | ||
| pixtral | ||
| qwen2vl | ||
| testdata | ||
| model.go | ||
| model_test.go | ||
| process_text.go | ||
| process_text_test.go | ||