Skip to content
Change the repository type filter

All

    Repositories list

    • TypeScript
      0001Updated Aug 8, 2025Aug 8, 2025
    • Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
      Go
      13k801Updated Aug 8, 2025Aug 8, 2025
    • LiteGS

      Public
      A refactored codebase for Gaussian Splatting. Fastest, Better, Modular, Pure Python or CUDA Extension
      Python
      915420Updated Aug 7, 2025Aug 7, 2025
    • Shell
      53150Updated Aug 6, 2025Aug 6, 2025
    • torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.
      Python
      31431630Updated Jul 17, 2025Jul 17, 2025
    • kineto

      Public
      HTML
      3100Updated Jul 17, 2025Jul 17, 2025
    • GUI multimodal visual understanding model, GUI datasets DeskVision
      Python
      1200Updated Jul 11, 2025Jul 11, 2025
    • Jupyter Notebook
      0410Updated May 29, 2025May 29, 2025
    • muThrust

      Public
      The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
      C++
      766200Updated May 23, 2025May 23, 2025
    • StableGS

      Public
      0310Updated Apr 17, 2025Apr 17, 2025
    • TurboSplat-Viz is a 3D Gaussian Splatting (GS) renderer implemented using DirectX 12. Leveraging the exceptional performance of Mesh Shaders, DX12GSViewer achieves unparalleled speed improvements.
      C++
      0600Updated Apr 1, 2025Apr 1, 2025
    • Python
      1400Updated Mar 19, 2025Mar 19, 2025
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
      Python
      474500Updated Mar 11, 2025Mar 11, 2025
    • Go
      41600Updated Mar 1, 2025Mar 1, 2025
    • 0800Updated Feb 28, 2025Feb 28, 2025
    • MT-DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      C++
      889600Updated Feb 27, 2025Feb 27, 2025
    • A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
      Python
      300100Updated Feb 27, 2025Feb 27, 2025
    • C++
      01500Updated Feb 26, 2025Feb 26, 2025
    • mutlass

      Public
      MUSA Templates for Linear Algebra Subroutines
      C++
      1.4k3010Updated Feb 26, 2025Feb 26, 2025
    • MooER

      Public
      MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.
      Python
      1521740Updated Jan 8, 2025Jan 8, 2025
    • TurboRAG

      Public
      Python
      117950Updated Nov 25, 2024Nov 25, 2024
    • vllm_musa

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      9.3k5550Updated Oct 28, 2024Oct 28, 2024
    • SimuMax

      Public
      a static analytical model for LLM distributed training
      Python
      11500Updated Oct 18, 2024Oct 18, 2024
    • RetinaGS

      Public
      Python
      72400Updated Oct 17, 2024Oct 17, 2024
    • Repository for OpenCV's extra modules
      C++
      5.8k200Updated Sep 25, 2024Sep 25, 2024
    • opencv

      Public
      Open Source Computer Vision Library
      C++
      56k1900Updated Sep 25, 2024Sep 25, 2024
    • muAlg

      Public
      Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
      Cuda
      459300Updated Sep 13, 2024Sep 13, 2024
    • dynolog

      Public
      Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.
      C++
      01300Updated Aug 7, 2024Aug 7, 2024
    • qtbase

      Public
      Qt Base (Core, Gui, Widgets, Network, ...)
      C++
      1.1k000Updated Jun 20, 2024Jun 20, 2024
    • C++
      123000Updated Jun 20, 2024Jun 20, 2024