Releases: vllm-project/production-stack

vllm-stack-0.1.6

22 Jul 04:55
3bb6b73

The stack deployment of vLLM

What's Changed

vllm-stack-0.1.5

17 Jun 04:23
0b6a61c

The stack deployment of vLLM

vllm-stack-0.1.4

05 Jun 21:10
6e3c06f

The stack deployment of vLLM

What's Changed

vllm-stack-0.1.3

30 May 06:39
ff7a6c1

The stack deployment of vLLM

Changes made

vllm-stack-0.1.2

29 Apr 19:56
2404918

The stack deployment of vLLM

What's Changed

vllm-stack-0.1.1

19 Mar 18:39
82b47eb

The stack deployment of vLLM

What's Changed

New Contributors

Full Changelog: vllm-stack-0.1.0...vllm-stack-0.1.1

vllm-stack-0.1.0

03 Mar 17:31
fecae77

The stack deployment of vLLM

What's Changed

  • [Feat] add imagePullSecrets option to helm chart #179 by @kalantar
  • [Benchmark] Adding multi-round QA benchmark script #180 by @YuhanLiu11
  • [Feat]: add support for embeddings, rerank and score endpoints #181 by @bufferoverflow
  • [CI/Build]: bump python to 3.12 to be inline with vllm #182 by @bufferoverflow
  • Manually Enable LoRA Adapters using existing Router and vLLM deployment #206 by @wangchen615
  • [Feat] dynamic configuration support for router #207 by @ApostaC
  • [Feat] create kubernetes operator to manage dynamic config file #208 by @rootfs
  • [Document, Feat] basic HPA support and tutorials #209 by @ApostaC
  • [Feat] enable experimental semantic cache in router #210 by @rootfs

New Contributors

vllm-stack-0.0.11

25 Feb 17:48
fb0cb90

The stack deployment of vLLM

What's Changed

New Contributors

Full Changelog: vllm-stack-0.0.10...vllm-stack-0.0.11

vllm-stack-0.0.9

19 Feb 18:07
4c3aeef

The stack deployment of vLLM

What's Changed

  • [Bugfix] Fix indentation issue in Helm Chart PVC by @BaeYeongbin in #148
  • [Tutorial] Deployment on Google GKE by @EaminC in #146
  • Feat: Router observability (Current QPS, router-side queueing delay, etc) Part 1 by @sitloboi2012 in #119
  • [release] Add github sha tag for router image by @gaocegege in #153
  • [Fix] Minor Fixs for Tutorial and Bumped version to 0.0.9 by @Hanchenli in #154

New Contributors

Full Changelog: vllm-stack-0.0.8...vllm-stack-0.0.9

vllm-stack-0.0.10

19 Feb 21:34
ecca068

The stack deployment of vLLM

What's Changed