Has anyone run llama.cpp on a Jetson Orin SoC or other devices? How is the performance? #5059
Unanswered
adamydwang asked this question in Q&A
Probably a bit late, but on a Jetson AGX Orin 64GB I get approximately 280 tokens/s on Llama 2 7B.
I want to know the performance of 7B or 13B models on device chips, especially the first-token latency.
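For anyone who wants to report comparable numbers, here is a minimal sketch of how first-token latency could be timed separately from steady-state generation speed. It assumes only that the runtime exposes generated tokens as a Python iterable; `measure_stream` and its `clock` parameter are hypothetical names for illustration and are not part of llama.cpp.

```python
import time
from typing import Callable, Iterable, Tuple


def measure_stream(
    tokens: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[float, float, int]:
    """Return (first_token_latency_s, decode_tokens_per_s, token_count).

    `tokens` is any iterable that yields tokens as they are generated
    (an assumption; adapt it to whatever streaming interface you use).
    """
    start = clock()          # time at which the request is issued
    first = None             # time at which the first token arrived
    last = start             # time at which the latest token arrived
    count = 0

    for _ in tokens:
        last = clock()
        if first is None:
            first = last
        count += 1

    if first is None:
        raise ValueError("stream produced no tokens")

    # First-token latency: includes prompt processing plus one decode step.
    first_token_latency = first - start

    # Decode rate: tokens after the first, divided by the time they took.
    decode_time = last - first
    if count > 1 and decode_time > 0:
        decode_rate = (count - 1) / decode_time
    else:
        decode_rate = 0.0

    return first_token_latency, decode_rate, count
```

Reporting the two numbers separately matters because first-token latency grows with prompt length, while the decode rate mostly does not, so a single tokens/s figure can hide a slow first token.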