Releases: AD2605/llama.cpp
b5432
sycl: disable reorder for sycl mulmat (#13536)
b5423
mtmd : add vision support for llama 4 (#13282)
* wip llama 4 conversion
* rm redundant __init__
* fix conversion
* fix conversion
* test impl
* try this
* reshape patch_embeddings_0
* fix view
* rm ffn_post_norm
* cgraph ok
* f32 for pos embd
* add image marker tokens
* Llama4UnfoldConvolution
* correct pixel shuffle
* fix merge conflicts
* correct
* add debug_graph
* logits matched, but it still perceives the image incorrectly
* fix style
* add image_grid_pinpoints
* handle llama 4 preprocessing
* rm load_image_size
* rm unused line
* fix
* small fix 2
* add test & docs
* fix llava-1.6 test
* test: add notion of huge models
* add comment
* add warn about degraded quality
b5416
CANN: Support MOE Model MUL_MAT_ID (#13042)
Signed-off-by: noemotiovon <757486878@qq.com>
b5392
server : proper error handling for missing elements in messages array…
b5359
clip : cap max image size 1024 for qwen vl model (#13478)
b5329
metal : optimize MoE for large batches (#13388) ggml-ci
b5316
server : (webui) fix a very small misalignment (#13387)
* server : (webui) fix a very small misalignment
* restore font-bold
b5307
docker : disable arm64 and intel images (#13356)
b5303
llama : deci : support ffn-free with attention (#13296)
b5283
clip : fix confused naming ffn_up and ffn_down (#13290)
* clip : fix confused naming ffn_up and ffn_down
* rm ffn_i/o/g naming
* rename n_embd, n_ff
* small fix
* no check n_ff