Releases · tinglou/llama.cpp

04 Jul 10:11

67d1ef2

b5827 Latest

Latest

batch : add optional for sequential equal split (#14511)

ggml-ci

Assets 15

cudart-llama-bin-win-cuda-12.4-x64.zip

sha256:8c79a9b226de4b3cacfd1f83d24f962d0773be79f1e7b75c6af4ded7e32ae1d6

373 MB 2025-07-04T10:11:41Z
llama-b5827-bin-macos-arm64.zip

sha256:34c175c7817de95e57cb092447acaaf20d3f00ea0f9e3477a422db75168b2ea1

10.5 MB 2025-07-04T10:11:52Z
llama-b5827-bin-macos-x64.zip

sha256:7977a3d63deb3c2843a69121cf01e37b3351efaa28bb36e2a50b9ad2f74e7538

26.3 MB 2025-07-04T10:11:53Z
llama-b5827-bin-ubuntu-vulkan-x64.zip

sha256:e25607a8b99032ee368933b422950df770b5434c44c58fbb449772f5663fa1d9

20.1 MB 2025-07-04T10:11:54Z
llama-b5827-bin-ubuntu-x64.zip

sha256:d1fdb82bdafbcae0ad11daee943a8bb3b77a73787898080dad9c7883a09bd3aa

12.4 MB 2025-07-04T10:11:55Z
llama-b5827-bin-win-cpu-arm64.zip

sha256:f1811a619dcfc08168568240d8ff17fc621264c52a36957cc5685c5a695338dd

10.8 MB 2025-07-04T10:11:56Z
llama-b5827-bin-win-cpu-x64.zip

sha256:38d1a6e5bb191e775abbab4382eb4eba390447a27b6cf4d71b91a4e502a40f5e

13.6 MB 2025-07-04T10:11:57Z
llama-b5827-bin-win-cuda-12.4-x64.zip

sha256:dbe7cd7949cc716c1797ccf371ee3fbc3a292207095bfeb4ac8a5e409081de04

128 MB 2025-07-04T10:11:58Z
llama-b5827-bin-win-hip-radeon-x64.zip

sha256:fe48f8d4442cca0e4b40535ae5e2c8b1644147c28f3333e4b4922c4b70f61703

298 MB 2025-07-04T10:12:02Z
llama-b5827-bin-win-opencl-adreno-arm64.zip

sha256:38f117bf211ffde3e0de3d5ce3753e0514c7ce4998497247a6b6b5241a015554

11.1 MB 2025-07-04T10:12:12Z
Source code (zip)

2025-07-04T06:08:59Z
Source code (tar.gz)

2025-07-04T06:08:59Z

02 Jul 03:34

github-actions

b5797

de56944

b5797

ci : disable fast-math for Metal GHA CI (#14478)

* ci : disable fast-math for Metal GHA CI

ggml-ci

* cont : remove -g flag

ggml-ci

Assets 15

20 Jun 01:16

github-actions

b5711

8f71d0f

b5711

ggml-cpu : remove unnecesary arm feature detection (#14281)

Support for Arm runtime feature detection has now been added to GGML_CPU_ALL_VARIANTS. This removes the old and not very functional code.

Assets 15

24 Apr 01:34

github-actions

b5174

5630406

b5174

llama-mtmd-cli: Sigint rework in mtmd vision example (#13080)

* Sigint rework in mtmd vision example

* Applied suggestions on mtmd-cli PR

* Forgot to invert one of the conditions

* Update examples/llava/mtmd-cli.cpp

* Removed redundant exit check

---------

Co-authored-by: pl752 <maximpl752@gmail.com>
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>

Assets 26

26 Mar 10:23

github-actions

b4960

fd7855f

b4960

doc: [MUSA] minor changes (#12583)

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

Assets 26

12 Mar 09:53

github-actions

b4875

7841fc7

b4875

llama : Add Gemma 3 support (+ experimental vision capability) (#12343)

* llama : Add Gemma 3 text-only support

* fix python coding style

* fix compile on ubuntu

* python: fix style

* fix ubuntu compile

* fix build on ubuntu (again)

* fix ubuntu build, finally

* clip : Experimental support for Gemma 3 vision (#12344)

* clip : Experimental support for Gemma 3 vision

* fix build

* PRId64

Assets 26

04 Mar 07:12

github-actions

b4819

becade5

b4819

HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (#12032)

Adds GGML_HIP_ROCWMMA_FATTN and rocwmma header check
Adds rocWMMA support to fattn-wmma-f16

---

Signed-off-by: Carl Klemm <carl@uvos.xyz>
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
Co-authored-by: Ben Jackson <ben@ben.com>

Assets 25

28 Feb 02:25

github-actions

b4784

b95c8af

b4784

cmake: Fix ggml backend dependencies and installation (#11818)

* Fix dependencies between ggml and backends

ggml backends link only to ggml-base and ggml links to all backends.

* Fix installation of ggml backends

Set up GNUInstallDirs before setting the installation directory of ggml backends

Assets 25

26 Feb 14:22

github-actions

b4782

c3ba659

b4782

Apply suggestions from code review

Assets 25

26 Feb 10:45

github-actions

b4781

f583083

b4781

add struct for FFI bindgen

Assets 25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Releases: tinglou/llama.cpp

b5827

Uh oh!

b5797

Uh oh!

b5711

Uh oh!

b5174

Uh oh!

b4960

Uh oh!

b4875

Uh oh!

b4819

Uh oh!

b4784

Uh oh!

b4782

Uh oh!

b4781

Uh oh!