- Do you have an output log from the beginning of the llama-cli run?
- You need to specify --n-gpu-layers 35, for example, for a typical 8B model. Something like this: ../llama.cpp/llama-cli --model models/Meta-Llama-3-8B-Instruct_Q5_K_S.gguf --n-gpu-layers 25 -cnv --interactive-first --simple-io -b 512 -n -1 --ctx_size 0 --temp 0.3 --top_k 10 --multiline-input --repeat_penalty 1.12 -t 6 -r "/n>" --log-disable
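If the binary was built with CUDA, the load log should report how many layers were offloaded. A quick check is to run a short generation without --log-disable and grep for the offload summary (a sketch; the exact "offloaded ... layers to GPU" wording may vary between llama.cpp versions):

    # short one-off run, keep the load log and look for the offload summary line
    ../llama.cpp/llama-cli --model models/Meta-Llama-3-8B-Instruct_Q5_K_S.gguf \
        --n-gpu-layers 25 -p "hello" -n 8 2>&1 | grep -i "offloaded"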
Environment
Steps
- Place the model under models/llama-7b and rename it to ggml-model-q4_0.gguf
- Run ../examples/chat.sh
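For reference, those steps as shell commands (a sketch; the pre-rename filename is a placeholder, and it assumes the model is already converted and quantized to a Q4_0 GGUF file):

    # <downloaded-model>.gguf is a placeholder for whatever GGUF file you start from
    mkdir -p models/llama-7b
    mv <downloaded-model>.gguf models/llama-7b/ggml-model-q4_0.gguf
    ../examples/chat.sh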
Expected and actual
It should utilize the GPU, but the CUDA utilization in Task Manager stays at 0%.
I also have ollama-cuda, which links CUDA statically; it does utilize the GPU.
I have tried the same thing on Linux, and it doesn't work there either.
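One way to cross-check Task Manager is to watch the GPU directly while chat.sh is generating; a sketch using nvidia-smi (assuming the NVIDIA driver tools are on the PATH, which works on both Windows and Linux):

    # run in a second terminal while chat.sh is generating; a CUDA-enabled build
    # should show the llama process with non-zero GPU memory here
    nvidia-smi --query-compute-apps=pid,process_name,used_memory --format=csv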