What is llama_batch,and why we need this in llama.android? #9024
Unanswered
FranzKafkaYu
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I am reading the code of llama.android,and I found that when this App load a model,it will call
new_batch
method,here is the code:I noticed that the implemention is copy of
llama_batch_init
,details:somebody can tell me what is the purpose of this API,does it affect inference performance?
Beta Was this translation helpful? Give feedback.
All reactions