
Llama 4 not working #1994


Open
Kenshiro-28 opened this issue Apr 8, 2025 · 8 comments

Comments

@Kenshiro-28

llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'llama4'
llama_model_load_from_file_impl: failed to load model

Please update to a newer version of llama.cpp:

https://github.com/ggml-org/llama.cpp/releases/tag/b5074
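For anyone verifying this on their own install: a minimal sketch of how the failure surfaces from Python, assuming a hypothetical local GGUF path. On builds whose vendored llama.cpp predates b5074, the constructor raises the "unknown model architecture: 'llama4'" error shown above.

```python
import llama_cpp
from llama_cpp import Llama

# Needs a build whose vendored llama.cpp is at least b5074.
print(llama_cpp.__version__)

try:
    llm = Llama(model_path="./llama-4-model.gguf")  # hypothetical path
except ValueError as exc:
    # Older builds fail here with:
    # "unknown model architecture: 'llama4'"
    print(f"Load failed: {exc}")
```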

@JamePeng

JamePeng commented Apr 8, 2025

My fork has added some Llama 4 updates: https://github.com/JamePeng/llama-cpp-python

@kerlion

kerlion commented Apr 14, 2025

Same issue here. How do I run Llama 4?
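A minimal run sketch, assuming a llama-cpp-python build whose bundled llama.cpp already recognizes the 'llama4' architecture (the GGUF path here is hypothetical):

```python
from llama_cpp import Llama

llm = Llama(
    model_path="./llama-4-scout.Q4_K_M.gguf",  # hypothetical local GGUF
    n_gpu_layers=-1,  # offload all layers to the GPU on a CUDA build
    n_ctx=8192,
)

# Plain text completion once the model loads successfully.
out = llm("Q: Name the planets in the solar system. A:", max_tokens=64, stop=["Q:"])
print(out["choices"][0]["text"])
```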

@AleefBilal

@kerlion
What version of llama-cpp-python are you using?
Can you also give me some details about your platform (OS, etc.)?

@kerlion

kerlion commented Apr 17, 2025

> @kerlion What version of llama-cpp-python are you using? Can you also give me some details about your platform (OS, etc.)?

image: nvidia/cuda:12.2.0-runtime-ubuntu22.04
llama_cpp_python 0.3.8

@kerlion

kerlion commented Apr 17, 2025

I compiled it from source, which got past this error. But I don't know which "chat_format" to use for Llama-4-Scout-17B-16E-Instruct-UD-Q2_K_XL.

@AleefBilal

AleefBilal commented Apr 17, 2025

@kerlion
Great job compiling it from source. Below is a command that might save you the struggle of compiling from source:
CMAKE_ARGS="-DGGML_CUDA=ON -DLLAMA_LLAVA=OFF" pip install llama-cpp-python --force-reinstall --upgrade --no-cache-dir
Also, I wasn't quite able to understand your question about which "chat_format" to use; could you please elaborate?
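For reference on the chat_format question: to my understanding, when chat_format is left at its default of None, llama-cpp-python tries to use the chat template embedded in the GGUF's own metadata, which is usually the right choice for instruct quants like this one. A sketch under that assumption (the path is illustrative):

```python
from llama_cpp import Llama

llm = Llama(
    model_path="./Llama-4-Scout-17B-16E-Instruct-UD-Q2_K_XL.gguf",  # illustrative path
    n_ctx=8192,
    # chat_format is left at its default (None): llama-cpp-python then looks
    # for a tokenizer.chat_template entry in the GGUF metadata and uses it.
)

resp = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Hello! Who are you?"}],
    max_tokens=64,
)
print(resp["choices"][0]["message"]["content"])
```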

@h-haghpanah

Same error with llama_cpp_python 0.3.8:

print_info: file format = GGUF V3 (latest)
print_info: file type = Q4_K - Medium
print_info: file size = 62.90 GiB (5.01 BPW)
llama_model_load: error loading model: error loading model architecture: unknown model architecture: 'llama4'
llama_model_load_from_file_impl: failed to load model

@perronemirko

> My fork has added some Llama 4 updates: https://github.com/JamePeng/llama-cpp-python

Could you please provide the commit number?
