README.md (3 additions, 2 deletions)
@@ -11,10 +11,11 @@
 Inference of Meta's [LLaMA](https://arxiv.org/abs/2302.13971) model (and others) in pure C/C++
 
 > [!IMPORTANT]
-[2024 Jun 12] Binaries have been renamed w/ a `llama-` prefix. `main` is now `llama-cli`, `server` is `llama-server`, etc (https://github.com/ggerganov/llama.cpp/pull/7809)
+[2024 Aug 31] Breaking changes to the C-style sampling API: https://github.com/ggerganov/llama.cpp/pull/8643
 
 ## Recent API changes
 
+- [2024 Aug 31] Refactored `llama_sample` and `llama_grammar` APIs: https://github.com/ggerganov/llama.cpp/pull/8643
 - [2024 Jun 26] The source code and CMake build scripts have been restructured https://github.com/ggerganov/llama.cpp/pull/8006
 - [2024 Apr 21] `llama_token_to_piece` can now optionally render special tokens https://github.com/ggerganov/llama.cpp/pull/6807
 - [2024 Apr 4] State and session file functions reorganized under `llama_state_*` https://github.com/ggerganov/llama.cpp/pull/6341
@@ -26,7 +27,7 @@ Inference of Meta's [LLaMA](https://arxiv.org/abs/2302.13971) model (and others)
 
 ## Hot topics
 
-- **`convert.py` has been deprecated and moved to `examples/convert_legacy_llama.py`, please use `convert_hf_to_gguf.py`** https://github.com/ggerganov/llama.cpp/pull/7430
+- `convert.py` has been deprecated and moved to `examples/convert_legacy_llama.py`, please use `convert_hf_to_gguf.py` https://github.com/ggerganov/llama.cpp/pull/7430