Replies: 4 comments
-
BTLM-3B-8K Highlights:
-
@TheBloke did you take a look 😄?
-
It's a new model architecture, so there would need to be a GGML implementation first, at https://github.com/ggerganov/ggml - you could raise it there. Even once a GGML implementation is added, llama.cpp is unlikely to support it for now, as it currently only supports Llama models; it may add support for other architectures in future, but not yet. Adding a GGML implementation is not something I can do myself, but if one is released for it, I am happy to release quantisations of it. It will likely be easier to get GPTQ support, but again someone would have to add that, e.g. to AutoGPTQ.
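The architecture gate described above is visible in a model's `config.json`: conversion tools check the declared model type before attempting anything. A minimal sketch of that check (the config values below are illustrative, not copied from the actual BTLM repo):

```python
import json

# Illustrative config.json snippet for a non-Llama model (values are assumptions)
config = json.loads('{"model_type": "btlm", "architectures": ["BTLMLMHeadModel"]}')

# llama.cpp's converter at this time only handled the Llama architecture,
# so any other model_type needs its own GGML implementation first
if config["model_type"] != "llama":
    print(f"unsupported architecture: {config['model_type']}")
```

This is why a GGML implementation has to exist before any quantised files can be produced: the converter has nothing to map an unknown architecture onto.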
-
Looks like the arch is based on Cerebras-GPT, so it should work?
Anyway, I don't see any info about anyone giving it a try. I've always used GGML files converted by other people, so I haven't tried it myself.
Unlike previous models based on Cerebras-GPT, this one looks much more capable for its class, even challenging 7B models.
It uses SlimPajama, a ~600-billion-token cleanup of the RedPajama dataset.
https://www.cerebras.net/blog/btlm-3b-8k-7b-performance-in-a-3-billion-parameter-model/
Previous Cerebras-GPT Discussion