Releases: AD2605/llama.cpp

b5854

09 Jul 16:51
26a48ad
ggml : prevent integer overflow in gguf tensor size calculation (#14595)
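
Note on the kind of check this entry refers to: the patch itself is not reproduced in these release notes, so the following is only a minimal sketch of overflow-safe size arithmetic, not the code from #14595. The function name `checked_tensor_nbytes`, the `SIZE_MAX` limit (assumed to be a 64-bit `size_t`), and the one-element-per-`type_size` layout are all assumptions made for illustration.

```python
# Assumption: sizes must fit in an unsigned 64-bit integer (size_t on 64-bit targets).
SIZE_MAX = 2**64 - 1


def checked_tensor_nbytes(dims, type_size, limit=SIZE_MAX):
    """Return the byte size of a tensor, or None if the product would exceed `limit`.

    Hypothetical helper: each multiplication is guarded by a division test,
    which is how an overflow check is usually written for fixed-width integers.
    """
    if type_size <= 0:
        return None
    n_elements = 1
    for d in dims:
        if d < 0:
            return None  # negative dimensions are never valid
        # n_elements * d would exceed limit  <=>  n_elements > limit // d
        if d != 0 and n_elements > limit // d:
            return None
        n_elements *= d
    # Same guard for the final multiplication by the element size.
    if n_elements > limit // type_size:
        return None
    return n_elements * type_size
```

A reader of an untrusted file would treat `None` as "reject the file" instead of allocating a buffer from a wrapped-around size.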

b5795

01 Jul 10:21
343b6e9
CANN: update aclnnGroupedMatmulV2 to aclnnGroupedMatmulV3 (#14411)

* [CANN]update to aclnnGroupedMatmulV2

Signed-off-by: noemotiovon <757486878@qq.com>

* Support MUL_MAT_ID on 310p

Signed-off-by: noemotiovon <757486878@qq.com>

* fix editorconfig

Signed-off-by: noemotiovon <757486878@qq.com>

---------

Signed-off-by: noemotiovon <757486878@qq.com>

b5787

30 Jun 22:29
0a5a3b5
Add Conv2d for CPU (#14388)

* Conv2D: Add CPU version

* Half decent

* Tiled approach for F32

* remove file

* Fix tests

* Support F16 operations

* add assert about size

* Review: further formatting fixes, add assert and use CPU version of fp32->fp16

b5753

25 Jun 12:14
73e53dc
opencl: ref count `ggml_backend_opencl_context` and refactor profilin…

b5716

20 Jun 10:04
d27b3ca
ggml : fix repack work size for mul_mat_id (#14292)

ggml-ci

b5688

17 Jun 17:01
860a9e4
ggml-cpu : remove the weak alias trick (#14221)

b5611

09 Jun 12:37
dc0623f
webui: fix sidebar being covered by main content (#14082)

* webui: fix sidebar being covered by main content

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* webui: update index.html.gz

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

---------

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

b5518

28 May 09:40
26b79b6
convert : fix tensor naming conflict for llama 4 vision (#13836)

* convert : fix tensor naming conflict for llama 4 vision

* add comment

b5503

27 May 10:29
f9cd683
sampling : make sure samplers return at least 1 token (#13822)

* sampling : min-p should always return at least one token

ggml-ci

* sampling : same for typical sampling

* tests : sampling tests use min_keep == 0

ggml-ci
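
Note on the behavior described in this entry: the sketch below illustrates a min-p style filter that never returns an empty candidate list, even when `min_keep == 0`. It is not the llama.cpp sampler code; the function name `min_p_filter` and the list-of-probabilities input are assumptions made for illustration.

```python
def min_p_filter(probs, min_p, min_keep=0):
    """Return indices of the tokens kept by a min-p style filter, most probable first.

    Hypothetical helper: a token survives if its probability is at least
    `min_p` times the probability of the most likely token.
    """
    if not probs:
        return []
    # Rank token indices by probability, highest first.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    threshold = min_p * probs[order[0]]
    kept = [i for i in order if probs[i] >= threshold]
    # Guarantee at least one survivor, even if the caller passes min_keep == 0.
    floor = max(min_keep, 1)
    if len(kept) < floor:
        kept = order[:floor]
    return kept
```

With `probs = [0.5, 0.3, 0.2]` and an out-of-range `min_p` of 2.0, no token clears the threshold, yet the filter still returns the top token, so later sampling stages always have something to draw from.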

b5467

23 May 16:36
8a2afb7
llama : allow custom list of swa_layers (#13726)