how to run test-backend-ops in single threaded mode #9920
-
I am trying to debug llamafile_sgemm API and I am invoking just MUL_MAT operations on CPU backend. I would like to know if it is possible to run test-backend-ops in single threaded mode so that debugging becomes easier |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
You would have to modify the call to |
Beta Was this translation helpful? Give feedback.
-
You can set this to 1 and recompile as well |
Beta Was this translation helpful? Give feedback.
You can set this to 1 and recompile as well
https://github.com/ggerganov/llama.cpp/blob/afd9909a6481402844aecefa8a8908afdd7f52f1/ggml/include/ggml.h#L229