bzhng-development

sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.

bzhng-development/sglang’s past year of commit activity

Python 0 Apache-2.0 2,494 0 0 Updated Jul 30, 2025
TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.

bzhng-development/TensorRT-LLM’s past year of commit activity

C++ 0 Apache-2.0 1,652 0 0 Updated Jul 24, 2025
cccl Public Forked from NVIDIA/cccl
CUDA Core Compute Libraries

bzhng-development/cccl’s past year of commit activity

C++ 0 249 0 0 Updated Jul 24, 2025
TransformerEngine Public Forked from NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

bzhng-development/TransformerEngine’s past year of commit activity

Python 0 Apache-2.0 471 0 0 Updated Jul 24, 2025
vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs

bzhng-development/vllm’s past year of commit activity

Python 0 Apache-2.0 9,142 0 0 Updated Jul 24, 2025
NeMo-Skills Public Forked from NVIDIA/NeMo-Skills
A project to improve skills of large language models

bzhng-development/NeMo-Skills’s past year of commit activity

Python 0 Apache-2.0 89 0 0 Updated Jul 23, 2025
triton Public Forked from triton-lang/triton
Development repository for the Triton language and compiler

bzhng-development/triton’s past year of commit activity

MLIR 0 MIT 2,175 0 0 Updated Jul 23, 2025
recipes Public Forked from vllm-project/recipes
Common recipes to run vLLM

bzhng-development/recipes’s past year of commit activity

0 Apache-2.0 7 0 0 Updated Jul 23, 2025
LMCache Public Forked from LMCache/LMCache
Supercharge Your LLM with the Fastest KV Cache Layer

bzhng-development/LMCache’s past year of commit activity

Python 0 Apache-2.0 420 0 0 Updated Jul 23, 2025
SpecForge Public Forked from sgl-project/SpecForge
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

bzhng-development/SpecForge’s past year of commit activity

Python 0 MIT 32 0 0 Updated Jul 23, 2025

View all repositories

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bzhng-development

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!