Replies: 1 comment
-
It may also be worth looking at the DeepSeek-VL2 models, which share the same vocabulary as DeepSeek-V3. Maybe it could then be offloaded to the GPU?
-
"Based on our evaluation, the acceptance rate of the second token prediction ranges between 85% and 90% across various generation topics, demonstrating consistent reliability. This high acceptance rate enables DeepSeek-V3 to achieve a significantly improved decoding speed, delivering 1.8 times TPS (Tokens Per Second)."
(From the DeepSeek-V3 technical report)
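The quoted numbers are consistent with the standard expectation for speculative decoding: if one extra token is drafted per step and accepted with probability p, each target-model pass emits 1 + p tokens on average. A minimal sketch of that arithmetic, assuming a single drafted token, independent acceptances, and negligible draft cost (all assumptions, not stated in the report):

```python
# Back-of-the-envelope check of the quoted ~1.8x speedup.
# Assumes k drafted tokens, each accepted independently with probability p,
# and negligible drafting overhead - a simplification, not the report's model.

def expected_speedup(acceptance_rate: float, draft_tokens: int = 1) -> float:
    """Expected tokens emitted per target-model forward pass.

    With k drafted tokens, a step emits 1 + p + p^2 + ... + p^k tokens
    in expectation (the geometric run of consecutive acceptances).
    """
    p = acceptance_rate
    return sum(p ** i for i in range(draft_tokens + 1))

# The report's 85-90% acceptance rate for the second token gives:
for p in (0.85, 0.90):
    print(f"p={p}: ~{expected_speedup(p):.2f}x TPS")
```

With p between 0.85 and 0.90 this lands at roughly 1.85–1.90 tokens per pass, in line with the reported 1.8× TPS once drafting overhead is accounted for.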