Commit a08c1d2

ddpasa, name, and ngxson authored
docs : add Moondream2 pre-quantized link (#13745)
* Multimodal: Added Moondream2 model and fixed ggml.org link
* Apply suggestions from code review

---------

Co-authored-by: name <none@none.com>
Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com>
1 parent d785f9c commit a08c1d2

1 file changed: +5 -1 lines changed

docs/multimodal.md

Lines changed: 5 additions & 1 deletion
@@ -33,7 +33,7 @@ llama-server -hf ggml-org/gemma-3-4b-it-GGUF --no-mmproj-offload
 
 ## Pre-quantized models
 
-These are ready-to-use models, most of them come with `Q4_K_M` quantization by default. They can be found at the Hugging Face page of the ggml-org: https://huggingface.co/ggml-org
+These are ready-to-use models, most of them come with `Q4_K_M` quantization by default. They can be found at the Hugging Face page of the ggml-org: https://huggingface.co/collections/ggml-org/multimodal-ggufs-68244e01ff1f39e5bebeeedc
 
 Replaces the `(tool_name)` with the name of binary you want to use. For example, `llama-mtmd-cli` or `llama-server`
 
@@ -81,6 +81,10 @@ NOTE: some models may require large context window, for example: `-c 8192`
 
 # Llama 4 Scout
 (tool_name) -hf ggml-org/Llama-4-Scout-17B-16E-Instruct-GGUF
+
+# Moondream2 20250414 version
+(tool_name) -hf ggml-org/moondream2-20250414-GGUF
+
 ```
 
 **Audio models**: