Skip to content
Change the repository type filter

All

    Repositories list

    • LightX2V

      Public
      Light Video Generation Inference Framework
      Python
      2237396Updated Aug 5, 2025Aug 5, 2025
    • Wan2.2-Lightning: Speed up wan2.2 model with distillation
      Python
      1162400Updated Aug 5, 2025Aug 5, 2025
    • LightLLM

      Public
      LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
      Python
      2713.4k7821Updated Aug 5, 2025Aug 5, 2025
    • ComfyUI custom node for lightx2v
      Python
      52900Updated Aug 2, 2025Aug 2, 2025
    • [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
      Python
      61528340Updated Aug 1, 2025Aug 1, 2025
    • Token healing implementation in Rust
      Rust
      0401Updated Aug 1, 2025Aug 1, 2025
    • A general suffix automaton implementation in Rust with Python bindings
      Rust
      0702Updated Jul 28, 2025Jul 28, 2025
    • Dockerfile
      2000Updated Jul 24, 2025Jul 24, 2025
    • Cuda
      0100Updated Jul 11, 2025Jul 11, 2025
    • HarmoniCa

      Public
      [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration".
      Python
      04110Updated Jul 10, 2025Jul 10, 2025
    • TFMQ-DM

      Public
      [CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".
      Jupyter Notebook
      410300Updated Jul 10, 2025Jul 10, 2025
    • SCSS
      0000Updated Jul 7, 2025Jul 7, 2025
    • LightTTS

      Public
      Light-tts is a lightweight TTS inference framework optimized for CosyVoice2, enabling fast and scalable speech synthesis in Python.
      Python
      0600Updated Jun 24, 2025Jun 24, 2025
    • Python bindings for general-sam and some utilities
      Python
      0401Updated Jun 17, 2025Jun 17, 2025
    • OmniBal

      Public
      [ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation Balance".
      Python
      32330Updated Jun 16, 2025Jun 16, 2025
    • 0000Updated Apr 28, 2025Apr 28, 2025
    • MQBench

      Public
      Model Quantization Benchmark
      Python
      14382795Updated Apr 20, 2025Apr 20, 2025
    • Fast and memory-efficient exact attention
      Python
      1.9k000Updated Apr 17, 2025Apr 17, 2025
    • Greedily tokenize strings with the longest tokens iteratively.
      Python
      0001Updated Mar 24, 2025Mar 24, 2025
    • verl

      Public
      verl: Volcano Engine Reinforcement Learning for LLMs
      Python
      2k100Updated Mar 17, 2025Mar 17, 2025
    • LLM_QAT

      Public
      Python
      0000Updated Feb 19, 2025Feb 19, 2025
    • HTML
      0000Updated Jan 25, 2025Jan 25, 2025
    • Cuda
      21100Updated Jan 10, 2025Jan 10, 2025
    • EasyLLM

      Public
      Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing usability, it also ensures training efficiency.
      Python
      84800Updated Sep 18, 2024Sep 18, 2024
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      4.5k000Updated Sep 13, 2024Sep 13, 2024
    • OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
      Python
      637100Updated Sep 6, 2024Sep 6, 2024
    • xtuner

      Public
      An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
      Python
      351000Updated Aug 22, 2024Aug 22, 2024
    • InternVL

      Public
      [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
      Python
      674000Updated Aug 16, 2024Aug 16, 2024
    • Python
      01310Updated Jun 16, 2024Jun 16, 2024
    • msbench

      Public
      A tool for model sparse based on torch.fx
      Python
      21300Updated Jun 3, 2024Jun 3, 2024