BitNet 1.58b models reproduced successfully #6384

kalomaze · 2024-03-29T13:05:14Z

kalomaze
Mar 29, 2024

https://huggingface.co/1bitLLM/bitnet_b1_58-3B

Dampfinchen · 2024-03-29T19:51:16Z

Dampfinchen
Mar 29, 2024

great results. But why is it so big? 13 GB seems way too much for a 1.58 Bit 3B model.

3 replies

netrunnereve Mar 30, 2024
Collaborator

AFAIK the weights are stored as {-1, 0, 1} but in f32.

Dampfinchen Mar 31, 2024

I see. Would it be possible to store it in FP16 or even 4 Bit without a noticeable dip in quality? If not, it would take a lot of disk space for 7B models and up. But if that's the compromise of having 1.58 Bit models I think most people will gladly take it.

netrunnereve Mar 31, 2024
Collaborator

Yeah you could store it however you like, even in ternary if you pack it properly. They likely use f32 as all the ML libraries support it and they don't have to write special code for inference. Optimizing that into something that's faster and more compact is our job 😉

EriIaz · 2024-03-29T20:16:04Z

EriIaz
Mar 29, 2024

great results. But why is it so big? 13 GB seems way too much for a 1.58 Bit 3B model.

Isn't it FP32?

0 replies

hrstoyanov · 2024-03-30T18:19:56Z

hrstoyanov
Mar 30, 2024

Are there any plans for efficient non-gpu (AVX) implementation as part of llama.cpp?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

BitNet 1.58b models reproduced successfully #6384

Uh oh!

{{title}}

Uh oh!

Replies: 3 comments 3 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

BitNet 1.58b models reproduced successfully #6384

Uh oh!

kalomaze Mar 29, 2024

Replies: 3 comments · 3 replies

Uh oh!

Dampfinchen Mar 29, 2024

Uh oh!

netrunnereve Mar 30, 2024 Collaborator

Uh oh!

Dampfinchen Mar 31, 2024

Uh oh!

netrunnereve Mar 31, 2024 Collaborator

Uh oh!

EriIaz Mar 29, 2024

Uh oh!

Uh oh!

hrstoyanov Mar 30, 2024

kalomaze
Mar 29, 2024

Replies: 3 comments 3 replies

Dampfinchen
Mar 29, 2024

netrunnereve Mar 30, 2024
Collaborator

netrunnereve Mar 31, 2024
Collaborator

EriIaz
Mar 29, 2024

hrstoyanov
Mar 30, 2024