Change the repository type filter
All
Repositories list
24 repositories
llm-compressor
PublicTransformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM- Intelligent Mixture-of-Models Router for Efficient LLM Inference
- Community maintained hardware plugin for vLLM on Ascend
vllm-project.github.io
Publicvllm-openvino
Publicrfcs
Publicmedia-kit
Public