Skip to content

Releases: AD2605/llama.cpp

b5432

20 May 10:10
4245e62
Compare
Choose a tag to compare
sycl: disable reorder for sycl mulmat (#13536)

b5423

19 May 12:23
92ecdcc
Compare
Choose a tag to compare
mtmd : add vision support for llama 4 (#13282)

* wip llama 4 conversion

* rm redundant __init__

* fix conversion

* fix conversion

* test impl

* try this

* reshape patch_embeddings_0

* fix view

* rm ffn_post_norm

* cgraph ok

* f32 for pos embd

* add image marker tokens

* Llama4UnfoldConvolution

* correct pixel shuffle

* fix merge conflicts

* correct

* add debug_graph

* logits matched, but it still preceives the image incorrectly

* fix style

* add image_grid_pinpoints

* handle llama 4 preprocessing

* rm load_image_size

* rm unused line

* fix

* small fix 2

* add test & docs

* fix llava-1.6 test

* test: add notion of huge models

* add comment

* add warn about degraded quality

b5416

19 May 08:33
33d7aed
Compare
Choose a tag to compare
CANN: Support MOE Model MUL_MAT_ID (#13042)

Signed-off-by: noemotiovon <757486878@qq.com>

b5392

15 May 09:54
c753d7b
Compare
Choose a tag to compare
server : proper error handling for missing elements in messages array…

b5359

12 May 14:47
de4c07f
Compare
Choose a tag to compare
clip : cap max image size 1024 for qwen vl model (#13478)

b5329

09 May 14:38
611aa91
Compare
Choose a tag to compare
metal : optimize MoE for large batches (#13388)

ggml-ci

b5316

08 May 18:12
ee01d71
Compare
Choose a tag to compare
server : (webui) fix a very small misalignment (#13387)

* server : (webui) fix a very small misalignment

* restore font-bold

b5307

08 May 09:28
814f795
Compare
Choose a tag to compare
docker : disable arm64 and intel images (#13356)

b5303

07 May 12:16
bc4e112
Compare
Choose a tag to compare
llama : deci : support ffn-free with attention (#13296)

b5283

05 May 12:30
5215b91
Compare
Choose a tag to compare
clip :  fix confused naming ffn_up and ffn_down (#13290)

* clip :  fix confused naming ffn_up and ffn_down

* rm ffn_i/o/g naming

* rename n_embd, n_ff

* small fix

* no check n_ff