How to use two models in the same inference code #2991
siddhantwaghjale asked in Q&A

I'm trying to run inference with two models in the same code using vLLM, but loading the second model fails with:

`AssertionError: tensor model parallel group is already initialized.`

Any help will be appreciated.
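For context, a minimal sketch of the failing pattern, assuming vLLM's offline `LLM` API (the model names are placeholders):

```python
from vllm import LLM, SamplingParams

# The first model loads and generates fine.
llm_a = LLM(model="facebook/opt-125m")
out = llm_a.generate(["Hello"], SamplingParams(max_tokens=16))
print(out[0].outputs[0].text)

# Constructing a second LLM in the same process tries to initialize the
# distributed (tensor model parallel) state again and raises:
#   AssertionError: tensor model parallel group is already initialized.
llm_b = LLM(model="facebook/opt-350m")
```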
Replies: 1 comment
I think it's not feasible with vLLM currently (please correct me if I'm wrong), but you can try searching for "LLM gateway" on GitHub.
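One way to read that suggestion: run each model in its own process and route requests between them, since each process then initializes its own tensor model parallel group. A minimal sketch, assuming each model is served by its own vLLM OpenAI-compatible server; the model names, ports, and the `complete` helper below are illustrative, not vLLM APIs:

```python
# Assumes two servers were started separately, e.g. with vLLM's
# OpenAI-compatible entrypoint (exact flags may vary by version):
#   python -m vllm.entrypoints.openai.api_server --model facebook/opt-125m --port 8000
#   python -m vllm.entrypoints.openai.api_server --model facebook/opt-350m --port 8001
import requests

# Map a logical name to each backend; names and ports are placeholders.
BACKENDS = {
    "opt-125m": ("http://localhost:8000/v1/completions", "facebook/opt-125m"),
    "opt-350m": ("http://localhost:8001/v1/completions", "facebook/opt-350m"),
}

def complete(backend: str, prompt: str, max_tokens: int = 64) -> str:
    """Forward a completion request to the chosen vLLM server."""
    url, model = BACKENDS[backend]
    resp = requests.post(url, json={
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
    })
    resp.raise_for_status()
    return resp.json()["choices"][0]["text"]

print(complete("opt-125m", "Hello, my name is"))
print(complete("opt-350m", "Hello, my name is"))
```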