GGML_LLAMAFILE vs GGML_BLAS #10345

bertsons · 2024-11-16T23:26:00Z

I am a bit confused about the difference/relationship between GGML_LLAMAFILE and GGML_BLAS.

The GGML_LLAMAFILE flag causes llamafile/sgemm.cpp to be compiled and linked, which appears to pertain to "TinyBLAS". I don't know much about BLAS, but my intuition is that you can only use one BLAS implementation at a time, e.g., if you're using TinyBLAS, then you can't use OpenBLAS. However, after reading the CMakeLists.txt files, this doesn't appear to be the case.

Can someone enlighten me on this?

Answered by slaren

slaren · 2024-11-17T00:31:50Z

Maintainer

GGML_BLAS can be used with any library that implements the CBLAS interface, such as OpenBLAS, BLIS, MKL, NVHPC, etc. Llamafile/tinyBLAS is a custom matrix multiplication library, but it does not conform to the standard BLAS interface. If you make a build with both enabled, the BLAS library will typically take precedence, although tinyBLAS may still be used with small batch sizes.
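
To make the precedence rule concrete, here is a rough toy model of it in Python. This is not ggml's code: `BuildConfig`, `pick_matmul_path`, and the `MIN_BLAS_BATCH` cutoff are all hypothetical names and values made up for illustration, and the real selection also depends on things like tensor types and which backend is in use.

```python
from dataclasses import dataclass

# Assumption: illustrative cutoff only. The real threshold lives in ggml's
# source and may differ between versions.
MIN_BLAS_BATCH = 32


@dataclass(frozen=True)
class BuildConfig:
    """Hypothetical stand-in for the two CMake options discussed above."""
    ggml_blas: bool       # a CBLAS library (OpenBLAS, BLIS, MKL, ...) is linked
    ggml_llamafile: bool  # llamafile/sgemm.cpp (tinyBLAS) is compiled in


def pick_matmul_path(cfg: BuildConfig, batch_size: int) -> str:
    """Return which matmul path this toy model picks for one call."""
    if cfg.ggml_blas and batch_size >= MIN_BLAS_BATCH:
        return "cblas"      # large batches: the BLAS library takes precedence
    if cfg.ggml_llamafile:
        return "tinyblas"   # small batches, or no BLAS library in the build
    return "ggml-default"   # neither path applies: fall back to built-in kernels


if __name__ == "__main__":
    both = BuildConfig(ggml_blas=True, ggml_llamafile=True)
    for n in (1, 8, 512):
        print(n, pick_matmul_path(both, n))
```

The point of the sketch is that the two options are not mutually exclusive: with both enabled, each matrix multiplication is routed to one path or the other, so a single build can use both.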
