erictanjn

erictanjn

Pinned Loading

aibrix aibrix Public

Forked from vllm-project/aibrix

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go
vllm_batch vllm_batch Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
LMCache/LMCache LMCache/LMCache Public

Supercharge Your LLM with the Fastest KV Cache Layer

Python 4.9k 531
DeepLearning-for-NLP DeepLearning-for-NLP Public

Forked from mindsRiverPonder/DeepLearning-for-NLP

利用pytorch进行各种NLP任务

Jupyter Notebook
kubeflow kubeflow Public

Forked from kubeflow/kubeflow

Machine Learning Toolkit for Kubernetes

TypeScript
sglang_fused_expert sglang_fused_expert Public

Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python