How to Analyze Operator Latency Breakdown in llama.cpp? #10839

Closed Answered by max-krasnyansky
Zijie-Tian asked this question in Q&A
Take a look at #9659
It currently supports only the CPU backend, but it will give you the info you're looking for.
I'm planning to update it to the latest master and to add support for the OpenCL backend (and hopefully others later).
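
For readers who just want the general idea of a per-operator latency breakdown, here is a minimal sketch in Python. It is not the code from #9659 and does not use any llama.cpp API: `OpProfiler`, `record`, `timed`, and `breakdown` are hypothetical names made up for illustration, and the op labels (`MUL_MAT`, `SOFT_MAX`) are only example strings. The assumption is simply that you can measure the elapsed time of each op and tag it with the op's name.

```python
import time
from collections import defaultdict


class OpProfiler:
    """Accumulates wall-clock time per operator name.

    Hypothetical helper for illustration only; not part of llama.cpp or ggml.
    """

    def __init__(self):
        self.total_ns = defaultdict(int)  # op name -> accumulated nanoseconds
        self.calls = defaultdict(int)     # op name -> number of invocations

    def record(self, op_name, elapsed_ns):
        """Add one measured invocation of `op_name`."""
        self.total_ns[op_name] += elapsed_ns
        self.calls[op_name] += 1

    def timed(self, op_name, fn, *args, **kwargs):
        """Run `fn`, record how long it took under `op_name`, return its result."""
        start = time.perf_counter_ns()
        try:
            return fn(*args, **kwargs)
        finally:
            self.record(op_name, time.perf_counter_ns() - start)

    def breakdown(self):
        """Return (op, calls, total_ns, share_of_total) rows, slowest op first."""
        grand_total = sum(self.total_ns.values())
        rows = []
        for op, ns in sorted(self.total_ns.items(), key=lambda kv: kv[1], reverse=True):
            share = ns / grand_total if grand_total else 0.0
            rows.append((op, self.calls[op], ns, share))
        return rows


if __name__ == "__main__":
    prof = OpProfiler()
    # Stand-in workloads; in a real profiler these would be the backend's ops.
    prof.timed("MUL_MAT", lambda: sum(i * i for i in range(200_000)))
    prof.timed("SOFT_MAX", lambda: sum(range(50_000)))
    for op, calls, ns, share in prof.breakdown():
        print(f"{op:10s} calls={calls:3d} total={ns / 1e6:8.3f} ms share={share:6.1%}")
```

Grouping by op name and sorting by total time is usually enough to see which operator dominates; a real profiler would also need to handle multi-threaded execution and asynchronous backends, which this sketch ignores.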

Answer selected by Zijie-Tian