dots.llm1 support and thanks #543

Iconology · 2025-06-20T03:07:05Z

Iconology
Jun 20, 2025

Hey, friend,

Out of curiosity, do you have any plans to add dots.llm1 support? The model seems interesting enough. I tried it out on mainline, but the speeds were atrocious for its size, making it unusable, at least for me. That’s why I jumped over to your fork (thanks to ubergarm) for both the insane MoE speedups and for being the godfather of, arguably, the absolute SOTA quants in my eyes.

Here's the pull request from mainline for dots:
ggml-org/llama.cpp@9ae4143

Regardless of whether it’s on your roadmap or not, I just wanted to say thank you, ikawrakow, for all that you have done and continue to do. You are one of a kind.

saood06 · 2025-06-20T03:21:14Z

saood06
Jun 20, 2025
Collaborator

The model seems interesting enough.

I agree, from a quick skim of the PR code, I don't see anything that would lead to a complicated port. I could do it if no one else gets to it first.

Especially due to this part in that PR:

The model architecture is a combination of Qwen and Deepseek parts, as
seen here:

https://github.com/huggingface/transformers/blob/ffe12627b4e84489d2ab91dd0ec00614855edc79/src/transformers/models/dots1/modular_dots1.py

2 replies

firecoperana Jul 2, 2025
Collaborator

@saood06 Are you working on it? If not, I can give a try.

saood06 Jul 3, 2025
Collaborator

#573 exists now. Testing is welcome.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

dots.llm1 support and thanks #543

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 2 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

dots.llm1 support and thanks #543

Uh oh!

Iconology Jun 20, 2025

Replies: 1 comment · 2 replies

Uh oh!

saood06 Jun 20, 2025 Collaborator

Uh oh!

firecoperana Jul 2, 2025 Collaborator

Uh oh!

saood06 Jul 3, 2025 Collaborator

Iconology
Jun 20, 2025

Replies: 1 comment 2 replies

saood06
Jun 20, 2025
Collaborator

firecoperana Jul 2, 2025
Collaborator

saood06 Jul 3, 2025
Collaborator