We're not currently using it, even in cases where we could. Disabling it improves generation performance by 10-30% with multiple GPUs.
| Name |
|---|
| backend |
| nn |
| backend.go |