Description:
The proposal is to change how GuideLLM runs so that, when a user invokes it, GuideLLM automatically starts a vLLM server and wires itself up with the information needed to run the benchmark. This also covers adding to GuideLLM any pass-through parameters that need to reach vLLM, so that a single GuideLLM command performs a full end-to-end benchmark of a model. This is a UX enhancement.
Acceptance Criteria:
Enable GuideLLM to kick off a vLLM server when GuideLLM is run
Enable GuideLLM to accept the necessary pass-through arguments that need to go to vLLM (a sketch of the intended flow follows this list):
-- model (required)
-- port (optional)
-- TBD
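As a rough illustration of the proposed flow (a sketch only, not existing GuideLLM behavior), the snippet below shows one way GuideLLM could spawn a vLLM OpenAI-compatible server as a child process, forward the model and port pass-through arguments, and wait for the server to become ready before running the benchmark. The helper names, the example model, and the reliance on a /health readiness endpoint are illustrative assumptions.

```python
import subprocess
import sys
import time
import urllib.request


def launch_vllm(model: str, port: int = 8000) -> subprocess.Popen:
    """Start a vLLM OpenAI-compatible server as a child process,
    passing through the model and port arguments."""
    cmd = [
        sys.executable, "-m", "vllm.entrypoints.openai.api_server",
        "--model", model,
        "--port", str(port),
    ]
    return subprocess.Popen(cmd)


def wait_for_server(port: int, timeout: float = 300.0) -> None:
    """Poll the server until it answers on /health or the timeout expires."""
    deadline = time.time() + timeout
    url = f"http://localhost:{port}/health"
    while time.time() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=5) as resp:
                if resp.status == 200:
                    return
        except OSError:
            time.sleep(2)  # server not up yet; retry
    raise TimeoutError(f"vLLM did not become ready on port {port}")


if __name__ == "__main__":
    # Example model name; any model id the local hardware can serve would do.
    server = launch_vllm("facebook/opt-125m", port=8000)
    try:
        wait_for_server(8000)
        # ... GuideLLM would run its benchmark against
        # http://localhost:8000/v1 here ...
    finally:
        server.terminate()
        server.wait()
```

In practice the set of pass-through arguments would grow beyond model and port (hence the TBD above), and shutdown/error handling would need to be more robust than a bare terminate().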
I'm not sure that we should add orchestration of the vLLM server to the scope of GuideLLM, for a few reasons:
Want the tool to stay agnostic of the runtime engine; we may want to add different backends to support other runtime engines (see the sketch after this list).
Don't want the tool to be opinionated about the platform the runtime engine is deployed on. If we wanted to support model deployment, we would likely need to support k8s, KServe, Podman, bare metal, etc.
Want to avoid scope creep and keep GuideLLM focused on its current purpose: running load tests against a pre-deployed model endpoint.
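To make the agnosticism point concrete, here is a minimal hypothetical sketch of what a pluggable backend interface could look like; none of these names are part of GuideLLM's current API. The idea is that GuideLLM only needs an endpoint to benchmark against, regardless of how or where the model was deployed.

```python
from typing import Protocol


class RuntimeBackend(Protocol):
    """Hypothetical interface: anything that can hand GuideLLM an
    endpoint to benchmark against qualifies as a backend."""

    def base_url(self) -> str:
        ...


class OpenAICompatibleBackend:
    """Targets any already-deployed OpenAI-compatible server (vLLM,
    TGI, etc.); the tool never starts or stops the server itself."""

    def __init__(self, url: str) -> None:
        self._url = url

    def base_url(self) -> str:
        return self._url


# The user deploys the model however they like (k8s, KServe, Podman,
# local vllm serve) and simply points the tool at the endpoint.
backend = OpenAICompatibleBackend("http://localhost:8000/v1")
```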
Alternatives:
A separate project focused on simplifying model deployment that supports all the platforms we care about (k8s Deployment, KServe, Podman, local [vllm serve]). We could even try to keep a set of "known-good" model configurations in that repo.
Refer users to existing model deployment guides / mechanisms.