Skip to content

Significantly slower response following rename of main to llama-cli. #8148

Answered by brankoprica
brankoprica asked this question in Q&A
Discussion options

You must be logged in to vote

I have not figured out the reason with 100% certainty, but I think it must have something to do with my CMake settings. I seem to have managed to replicate similar performance now by rebuilding with w64devkit.

I tried rebuilding with CMake but it kept yielding the same speed, about 0.5 tokens/s.

New build with w64devkit yields 2.68 tokens/s.

Previously my main.exe file was in build/Release if that is helpful to anyone. I wonder whether the Release config build wasn't working in CMake for some reason. I'll leave it open in case anyone has anything else to add regarding insights, otherwise please feel free to close it.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by brankoprica
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant