What cause the runtime error nvmlDeviceGetNvLinkRemoteDeviceType_ #6533
HanweiLi-cqu
announced in
Q&A
Replies: 2 comments 1 reply
-
by the way, my environment as follows:
|
Beta Was this translation helpful? Give feedback.
0 replies
-
Do you solve this? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
my code:
when I set the
tensor_parallel_size=8
, get the runtime errors:I thought it might be a problem with memory usage, so I tried to use a different combination of parameters, and the result was as follows:
I'm curious as to why this is the case. I'm a beginner to VLLM, is there any optimization method of VLLM that causes this to happen?
Beta Was this translation helpful? Give feedback.
All reactions