Replies: 2 comments 1 reply
-
I have a notebook with the AMD 5700U processor and it works. With 64 GB of RAM I can run 70B models too; speed depends on the quantization method and the model. Sure, I have to wait, but I watch movies or chat in the meantime. Even Stable Diffusion with an XL model runs on my processor.
-
With airoboros-l2-70b-2.1 (GGUF) and Q5_K quantization I get 1260.18 ms per token, but with other 70B models (GGML) and other quantizations I sometimes got under 500 ms/token. I've read about the different methods, and I don't want to lose much accuracy; the methods also use different amounts of RAM. I should note that I limited power consumption to 25 W. At the moment I'm testing phind-codellama-34b-v2.Q5_K_M.gguf.
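To put those numbers in context, here is a small back-of-the-envelope sketch converting the reported per-token latency into throughput and estimating the RAM footprint of a quantized model. The figures of ~5.5 bits per weight for Q5_K_M and 70e9 parameters for a 70B model are my assumptions, not from the thread:

```python
def tokens_per_second(ms_per_token: float) -> float:
    """Convert per-token latency (ms) into tokens per second."""
    return 1000.0 / ms_per_token

def model_size_gb(params: float, bits_per_weight: float) -> float:
    """Rough in-RAM size of a quantized model in gigabytes."""
    return params * bits_per_weight / 8 / 1e9

# 1260.18 ms/token as reported above:
print(f"{tokens_per_second(1260.18):.2f} tok/s")  # ~0.79 tok/s

# Assumed: 70B params at ~5.5 bits/weight (typical for Q5_K_M):
print(f"{model_size_gb(70e9, 5.5):.1f} GB")       # ~48.1 GB
```

This also shows why 64 GB of RAM is comfortable for a Q5-quantized 70B model, while lower-bit quantizations trade accuracy for a smaller footprint.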
-
I couldn't find anything about running LLMs on those cheap mini PCs (mostly from China). What speeds can they achieve (7B, 13B, etc.)?
Some have Intel processors (e.g. n3350, j4125, n95, n100, n305, 11400H), others AMD (e.g. 4500U, 5500U, 5700U, 5600H, 5800H).
If anyone has one and could run a CPU benchmark and share the results, I'd appreciate it. Thanks in advance.