Repositories list
24 repositories
vllm (Public)
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
vllm-gaudi (Public)
vllm-spyre (Public)
semantic-router (Public): Intelligent Mixture-of-Models Router for Efficient LLM Inference
- Community-maintained hardware plugin for vLLM on Ascend
speculators (Public)
ci-infra (Public)
vllm-project.github.io (Public)
vllm-neuron (Public)
vllm-openvino (Public)
rfcs (Public)
vllm-project.github.io-static (Public archive)
media-kit (Public)
dashboard (Public)