Commit Graph

4316 Commits

Author SHA1 Message Date
Bruce MacDonald 18d52686de
Update model/models/qwen25vl/model_vision.go 2025-05-12 14:16:46 -07:00
Bruce MacDonald 2d2eb5903d use with pattern for rope 2025-05-12 14:14:03 -07:00
Bruce MacDonald 533f4c41bd add eot 2025-05-12 14:03:37 -07:00
Bruce MacDonald 31b2c06393 Update 0007-add-unpad-operator.patch 2025-05-12 13:51:46 -07:00
Bruce MacDonald 4ae23deb50 Revert "Update 0007-add-unpad-operator.patch"
This reverts commit 340359fd087dd93c99bed4b9c87ccea3e759e184.

Update 0007-add-unpad-operator.patch
2025-05-12 13:51:04 -07:00
Bruce MacDonald 5d3da85a16 remove out of date comments 2025-05-12 13:51:04 -07:00
Bruce MacDonald 8b64b456c1 Update 0007-add-unpad-operator.patch 2025-05-12 13:51:04 -07:00
Bruce MacDonald 684f0d9291 set default values for vision model in config 2025-05-12 13:51:04 -07:00
jmorganca 3308bff137 add i32 copy and argsort for cuda 2025-05-12 13:51:04 -07:00
Bruce MacDonald bf1929a3bc Delete 0017-add-ollama-vocab-for-grammar-support.patch 2025-05-12 13:49:44 -07:00
Bruce MacDonald 1a2c413225 move mask 2025-05-12 13:49:44 -07:00
Bruce MacDonald 57279f89a2 calculate block mask once, rather than in attention 2025-05-12 13:49:44 -07:00
Bruce MacDonald 9ceee25d8b chunk vision outputs 2025-05-12 13:49:44 -07:00
Bruce MacDonald 661bf04696 add picture prefix 2025-05-12 13:49:44 -07:00
Bruce MacDonald 2521a55ae6 fixes after rebase 2025-05-12 13:49:44 -07:00
Bruce MacDonald 32948ec952 increase rope base 2025-05-12 13:49:43 -07:00
Bruce MacDonald 9876c8453a update exported functions for tests 2025-05-12 13:49:43 -07:00
Bruce MacDonald 919b3d6e21 require new engine for qwen25vl arch 2025-05-12 13:49:43 -07:00
Bruce MacDonald 16b13e0cfc Revert "ropeTheta should be 1e5"
This reverts commit cc1638b26763eae7daddd44e3975a885671ef9d3.

This reverts commit
b32385591307e2d33a8f43ce1626b529d2dac83e.
2025-05-12 13:49:43 -07:00
Bruce MacDonald 75441c56f3 add comment explaining rope theta 2025-05-12 13:49:43 -07:00
Bruce MacDonald 45f96e898d ropeTheta should be 1e5 2025-05-12 13:49:43 -07:00
Bruce MacDonald 7c555d394c simplify patch creation 2025-05-12 13:49:43 -07:00
Bruce MacDonald 39ee6d2bd0 ranges for lint 2025-05-12 13:49:43 -07:00
Bruce MacDonald 47705b5168 simplify rope changes 2025-05-12 13:49:43 -07:00
Michael Yang 698a92aa4a reverse window 2025-05-12 13:49:43 -07:00
Michael Yang 150c499cae use silu 2025-05-12 13:49:43 -07:00
Michael Yang f1257a7de4 update vision rope theta default 2025-05-12 13:49:43 -07:00
Bruce MacDonald b68af0370f move sdpa to model forward pass 2025-05-12 13:49:43 -07:00
Bruce MacDonald ca981c8a49 full attn block indexes should be []int32 2025-05-12 13:49:43 -07:00
Bruce MacDonald b3da8a319e Update model_vision.go 2025-05-12 13:49:42 -07:00
Bruce MacDonald 359e1d5b19 full attention layers 2025-05-12 13:49:42 -07:00
Michael Yang bde6b46ce9 fix padding
padding was being added to offset but not to the running count
2025-05-12 13:49:42 -07:00
Bruce MacDonald ff1f74534b block attention 2025-05-12 13:49:42 -07:00
Bruce MacDonald 104f802df1 remove todos 2025-05-12 13:49:42 -07:00
Bruce MacDonald eed0ac2948 clean up vision model forward pass 2025-05-12 13:49:42 -07:00
Bruce MacDonald fcfad744ff fix patch merger 2025-05-12 13:49:42 -07:00
Michael Yang fb3c16f2a2 window index 2025-05-12 13:49:42 -07:00
Michael Yang ee869f35e4 fix image processing
python built-in `round()` rounds to the nearest even number if the value
is in the middle

https://docs.python.org/3/library/functions.html#round
2025-05-12 13:49:42 -07:00
Michael Yang ff5d1a3dc0 duplicate input embeddings 2025-05-12 13:49:42 -07:00
Michael Yang 88b231f903 use maxgridsize 2025-05-12 13:49:42 -07:00
Michael Yang 7e920c8d75 fix: patch merger and convert
convert:
- split patch embedding
- split qkv

remove duplicate PatchMerger
2025-05-12 13:49:42 -07:00
Bruce MacDonald dd8c619fba fixes after rebase 2025-05-12 13:49:42 -07:00
Bruce MacDonald 2af76d0e7a default to 32 for vision block count 2025-05-12 13:49:42 -07:00
Bruce MacDonald 8d901825f0 reshape cos and sin 2025-05-12 13:49:41 -07:00
Bruce MacDonald 04936b719f Update model_vision.go 2025-05-12 13:49:41 -07:00
Bruce MacDonald 0f0136d419 simplify by doing operations in Go rather than with tensors
Co-Authored-By: Michael Yang <2372640+mxyng@users.noreply.github.com>
2025-05-12 13:49:41 -07:00
Bruce MacDonald 80498f76de fix build 2025-05-12 13:49:41 -07:00
Bruce MacDonald f8b48aa784 Delete model_external_test.go 2025-05-12 13:49:41 -07:00
Bruce MacDonald 5ff0d538b0 wip: implementing rope 2025-05-12 13:49:41 -07:00
Bruce MacDonald eedc969c35 grid refactor 2025-05-12 13:49:41 -07:00