Where are the architectures implemented? #4328
-
Hello, I am very new to LLMs, so please forgive my ignorance. I want to understand how these models are actually implemented. So far, I understand that ggml handles the tensor ops, and that the llama.cpp file contains much of the workings of LLaMA itself. Where are Mistral and the other LLMs implemented? Can they be generated from just the config file? Any help would be greatly appreciated! My goal is to understand how llama.cpp works! Cheers!
Replies: 3 comments 1 reply
-
It's all in llama.cpp; the filename is just historical.
-
See https://github.com/ggerganov/llama.cpp/blob/e4b76bbe316ee50fb17d9ac29e654c0edf830eba/llama.cpp#L5584-L5627