Deepseek V3 support added #176

saood06 · 2025-01-23T15:56:22Z

Very direct port of ggml-org/llama.cpp#11049.

Tested working with IQ4_K_R4 and IQ4_K. No tests so far on any quant that is supported by llama.cpp so that performance can be compared.

Tested on dual socket Xeon E5-2690 v3
Prompt processing:11.5 t/s for IQ4_K, 9.8 t/s IQ4_K_R4
Token generation: 2.75 t/s for IQ4_K, 3.10 t/s for IQ4_K_R4

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

ikawrakow · 2025-01-23T17:00:50Z

@saood06

Quick question: current llama.cpp has this check for Deepseek-V3:

    } else if (tmpl_contains(LU8("<｜Assistant｜>")) && tmpl_contains(LU8("<｜User｜>")) && tmpl_contains(LU8("<｜end▁of▁sentence｜>"))) {
        return LLM_CHAT_TEMPLATE_DEEPSEEK_3;

while the check you added with this PR is

    else if (tmpl == "deepseek3" || tmpl_contains(LU8("'<｜Assistant｜>' + message['content'] + '<｜end▁of▁sentence｜>'"))) {

The check for tmpl == "deepseek3" is done before in llama.cpp, so this is not an issue, but the remainder is not the same. Is this a problem? Or would it be a problem if I just made it the same as llama.cpp ?

saood06 · 2025-01-23T18:00:03Z

The change you are referencing happened in ggml-org/llama.cpp@ec7f3ac I was not aware of that till now.

Is this a problem? Or would it be a problem if I just made it the same as llama.cpp ?

You can change it if you want but both work, based on the chat_templates for the models that have been released.

Deepseek V3 support added

00906e3

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

ikawrakow approved these changes Jan 23, 2025

View reviewed changes

ikawrakow merged commit 2195632 into ikawrakow:main Jan 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Deepseek V3 support added #176

Deepseek V3 support added #176

saood06 commented Jan 23, 2025

Uh oh!

ikawrakow commented Jan 23, 2025

Uh oh!

saood06 commented Jan 23, 2025

Uh oh!

Uh oh!

Deepseek V3 support added #176

Deepseek V3 support added #176

Conversation

saood06 commented Jan 23, 2025

Uh oh!

ikawrakow commented Jan 23, 2025

Uh oh!

saood06 commented Jan 23, 2025

Uh oh!

Uh oh!