Using vLLM or TGI #7688
Closed · prashil1996 announced in Q&A
Hi, I have a question about whether to use vLLM or TGI (https://github.com/huggingface/text-generation-inference).
As of today, vLLM only supports decoder-only models, as listed here (https://docs.vllm.ai/en/latest/models/supported_models.html), while my use case also requires certain encoder-decoder models, for which I want to use TGI.
Questions:
How can I tell whether a model will work with vLLM or with TGI before I actually try to bring it up? Is there a way to know in advance that the model will load successfully?
Can I use any parameter in the model's config.json (one that is present for all models) to decide that a given model will work with vLLM and not TGI, or vice versa? A rough sketch of what I have in mind is below.
FYI: The models I'm looking to host are in Hugging Face format.
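Here is a minimal sketch of the kind of check I mean, assuming the `is_encoder_decoder` flag in config.json (read via transformers' `AutoConfig`) is a reliable signal. I'm not certain it is set for every model, and a decoder-only model would still need to be matched against vLLM's supported-architectures list before serving; the function name `pick_backend` is just for illustration.

```python
# Rough sketch: route a Hugging Face model to vLLM or TGI based on config.json.
# Assumes `is_encoder_decoder` is populated (it defaults to False in
# transformers' PretrainedConfig), which may not hold for every model, and
# that "decoder-only" alone does not guarantee vLLM support.
from transformers import AutoConfig


def pick_backend(model_id: str) -> str:
    """Return "tgi" for encoder-decoder models, "vllm" otherwise."""
    config = AutoConfig.from_pretrained(model_id)
    # Encoder-decoder families (T5, BART, ...) set this flag in config.json.
    if getattr(config, "is_encoder_decoder", False):
        return "tgi"
    # Decoder-only: ideally also compare config.architectures against
    # vLLM's supported-models list before assuming it will load.
    return "vllm"


if __name__ == "__main__":
    print(pick_backend("google/flan-t5-small"))  # expected: tgi
    print(pick_backend("facebook/opt-125m"))     # expected: vllm
```

If that flag turns out not to be reliable, the other field I can think of is `architectures` in config.json, compared against the list vLLM publishes, but I'd like to know if there is a cleaner way.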
@wooyeonlee0 @youkaichao @simon-mo @tmm1 @zhouyuan