    Repositories list

    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      9798.7k13229 Updated Nov 6, 2025
    • DeepSeek-OCR

      Public
      Contexts Optical Compression
      Python
      1.5k20k19123 Updated Oct 25, 2025
    • 3FS

      Public
      A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
      C++
      9589.5k11424 Updated Oct 24, 2025
    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Cuda
      7395.9k395 Updated Oct 15, 2025
    • DeepSeek-V3.2-Exp

      Public
      Python
      69970125 Updated Oct 2, 2025
    • FlashMLA

      Public
      FlashMLA: Efficient Multi-head Latent Attention Kernels
      C++
      89712k505 Updated Sep 30, 2025
    • awesome-deepseek-integration

      Public
      Integrate the DeepSeek API into popular software
      3.8k34k9139 Updated Sep 25, 2025
    • DeepSeek-V3

      Public
      Python
      16k100k2742 Updated Aug 28, 2025
    • DeepSeek-Prover-V2

      Public
      901.2k102 Updated Jul 18, 2025
    • DeepSeek-R1

      Public
      12k91k1027 Updated Jun 27, 2025
    • ESFT

      Public
      Expert Specialized Fine-Tuning
      Python
      26070850 Updated May 22, 2025
    • open-infra-index

      Public
      Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
      2867.9k00 Updated May 15, 2025
    • DreamCraft3D

      Public
      [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
      Python
      3593k340 Updated Apr 22, 2025
    • EPLB

      Public
      Expert Parallelism Load Balancer
      Python
      1951.3k81 Updated Mar 24, 2025
    • profile-data

      Public
      Analyze computation-communication overlap in V3/R1.
      1431.1k110 Updated Mar 21, 2025
    • DualPipe

      Public
      A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
      Python
      3052.9k40 Updated Mar 10, 2025
    • smallpond

      Public
      A lightweight data processing framework built on DuckDB and 3FS.
      Python
      4304.8k226 Updated Mar 5, 2025
    • DeepSeek-VL2

      Public
      DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
      Python
      1.8k5.1k9815 Updated Feb 26, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      2.2k18k15721 Updated Feb 1, 2025
    • DeepSeek-V2

      Public
      DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
      5345k793 Updated Sep 25, 2024
    • DeepSeek-Coder-V2

      Public
      DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
      9926.2k605 Updated Sep 24, 2024
    • DeepSeek-Prover-V1.5

      Public
      Python
      23254080 Updated Aug 16, 2024
    • DeepSeek-Coder

      Public
      DeepSeek Coder: Let the Code Write Itself
      Python
      2.6k22k11923 Updated May 21, 2024
    • DeepSeek-VL

      Public
      DeepSeek-VL: Towards Real-World Vision-Language Understanding
      Python
      5804k422 Updated Apr 24, 2024
    • DeepSeek-Math

      Public
      DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
      Python
      5533k332 Updated Apr 15, 2024
    • awesome-deepseek-coder

      Public
      A curated list of open-source projects related to DeepSeek Coder
      20172300 Updated Apr 3, 2024
    • DeepSeek-LLM

      Public
      DeepSeek LLM: Let there be answers
      Makefile
      1k6.6k382 Updated Feb 4, 2024
    • DeepSeek-MoE

      Public
      DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
      Python
      2941.8k174 Updated Jan 16, 2024