Releases: tinglou/llama.cpp

b5827

04 Jul 10:11
67d1ef2
batch : add optional for sequential equal split (#14511)

ggml-ci

b5797

02 Jul 03:34
de56944
ci : disable fast-math for Metal GHA CI (#14478)

* ci : disable fast-math for Metal GHA CI

ggml-ci

* cont : remove -g flag

ggml-ci

b5711

20 Jun 01:16
8f71d0f
ggml-cpu : remove unnecessary arm feature detection (#14281)

Support for Arm runtime feature detection has now been added to GGML_CPU_ALL_VARIANTS. This removes the old and not very functional code.

b5174

24 Apr 01:34
5630406
llama-mtmd-cli: SIGINT rework in mtmd vision example (#13080)

* SIGINT rework in mtmd vision example

* Applied suggestions on mtmd-cli PR

* Forgot to invert one of the conditions

* Update examples/llava/mtmd-cli.cpp

* Removed redundant exit check

---------

Co-authored-by: pl752 <maximpl752@gmail.com>
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>

b4960

26 Mar 10:23
fd7855f
doc: [MUSA] minor changes (#12583)

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

b4875

12 Mar 09:53
7841fc7
llama : Add Gemma 3 support (+ experimental vision capability) (#12343)

* llama : Add Gemma 3 text-only support

* fix python coding style

* fix compile on ubuntu

* python: fix style

* fix ubuntu compile

* fix build on ubuntu (again)

* fix ubuntu build, finally

* clip : Experimental support for Gemma 3 vision (#12344)

* clip : Experimental support for Gemma 3 vision

* fix build

* PRId64

b4819

04 Mar 07:12
becade5
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032)

Adds GGML_HIP_ROCWMMA_FATTN and rocwmma header check
Adds rocWMMA support to fattn-wmma-f16

---

Signed-off-by: Carl Klemm <carl@uvos.xyz>
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
Co-authored-by: Ben Jackson <ben@ben.com>

b4784

28 Feb 02:25
b95c8af
cmake: Fix ggml backend dependencies and installation (#11818)

* Fix dependencies between ggml and backends

ggml backends link only to ggml-base and ggml links to all backends.

* Fix installation of ggml backends

Set up GNUInstallDirs before setting the installation directory of ggml backends.

b4782

26 Feb 14:22
c3ba659
Apply suggestions from code review

b4781

26 Feb 10:45
add struct for FFI bindgen