Skip to content

Releases: furyhawk/llama.cpp

b5787

01 Jul 04:06
0a5a3b5
Compare
Choose a tag to compare
Add Conv2d for CPU (#14388)

* Conv2D: Add CPU version

* Half decent

* Tiled approach for F32

* remove file

* Fix tests

* Support F16 operations

* add assert about size

* Review: further formatting fixes, add assert and use CPU version of fp32->fp16

b5599

06 Jun 02:48
1caae7f
Compare
Choose a tag to compare
gguf-py : add add_classifier_output_labels method to writer (#14031)

* add add_classifier_output_labels

* use add_classifier_output_labels

b5572

02 Jun 00:58
7675c55
Compare
Choose a tag to compare
gguf: fix failure on version == 0 (#13956)

b5568

01 Jun 13:25
Compare
Choose a tag to compare
sync : ggml

ggml-ci

b2998

25 May 14:34
9588f19
Compare
Choose a tag to compare
train : change default FA argument (#7528)

b2961

22 May 06:07
201cc11
Compare
Choose a tag to compare
llama : add phi3 128K model support (#7225)

* add phi3 128k support in convert-hf-to-gguf

* add phi3 128k support in cuda

* address build warnings on llama.cpp

* adjust index value in cuda long rope freq factors

* add long rope support in ggml cpu backend

* make freq factors only depend on ctx size

* remove unused rope scaling type 'su' frin gguf converter

* fix flint warnings on convert-hf-to-gguf.py

* set to the short freq factor when context size is small than trained context size

* add one line of comments

* metal : support rope freq_factors

* ggml : update ggml_rope_ext API to support freq. factors

* backends : add dev messages to support rope freq. factors

* minor : style

* tests : update to use new rope API

* backends : fix pragma semicolons

* minor : cleanup

* llama : move rope factors from KV header to tensors

* llama : remove tmp assert

* cuda : fix compile warning

* convert : read/write n_head_kv

* llama : fix uninitialized tensors

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

b2918

18 May 05:26
0583484
Compare
Choose a tag to compare
ggml : fix quants nans when all the group weights are very close to z…

b2824

09 May 03:40
4426e29
Compare
Choose a tag to compare
cmake : fix typo (#7151)