Does vllm support deploying models on vCUDA? #9691
Gaohang0804 asked this question in Q&A

I'm trying to deploy the InternVL2 model with vLLM on k8s. I have succeeded on one A800 80G GPU, but because my model's memory requirement is low, much of the GPU memory is wasted. I'm wondering whether I could deploy my model on virtualized CUDA (vCUDA) and request less GPU memory in one k8s pod.
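On the vLLM side, the relevant knob is `--gpu-memory-utilization`, which caps the fraction of GPU memory vLLM pre-allocates (it defaults to 0.9, which is why a small model still fills most of an A800). Below is a minimal sketch of a pod that claims one full GPU but tells vLLM to use only a quarter of its memory; the pod name, image tag, and exact model ID are my assumptions, not from the thread:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: vllm-internvl2                  # assumed name
spec:
  containers:
  - name: vllm
    image: vllm/vllm-openai:latest      # assumed image tag
    command: ["vllm", "serve", "OpenGVLab/InternVL2-8B",  # assumed model ID
              "--trust-remote-code",
              "--gpu-memory-utilization", "0.25"]         # pre-allocate ~25% of GPU memory
    ports:
    - containerPort: 8000               # vLLM's default serving port
    resources:
      limits:
        nvidia.com/gpu: 1               # the pod still holds the whole physical GPU
```

This only limits what vLLM itself allocates; the pod still occupies the entire physical card, so actually sharing the GPU between pods needs a vCUDA-style device plugin on top.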
Replies: 1 comment
I ran into the same problem: on k8s, deployment works with tencent.com/vcuda-core set to 100, but changing it to 25 gives this error. Have you solved it?
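If I recall the gpu-manager conventions correctly, `tencent.com/vcuda-core` is counted in hundredths of a GPU and `tencent.com/vcuda-memory` in 256 MiB units, so a quarter of an A800 80G would look roughly like the fragment below, swapped into the `resources` block of the pod sketch above; the values are my assumptions. One hedged guess at why 25 fails while 100 works: vLLM sizes its pre-allocation from the memory the device reports, so under a vCUDA cap `--gpu-memory-utilization` likely has to be lowered until the pre-allocation fits inside the slice; the thread itself doesn't confirm this.

```yaml
resources:
  limits:
    tencent.com/vcuda-core: "25"      # 25/100 of one GPU's compute time
    tencent.com/vcuda-memory: "80"    # 80 x 256 MiB = 20 GiB, a quarter of 80 GB
```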