Skip to content
Change the repository type filter

All

    Repositories list

    • Ongoing research project for code&math LLMs
      Python
      11800Updated Jul 4, 2025Jul 4, 2025
    • LAPS

      Public
      Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024
      Python
      11000Updated Jun 26, 2025Jun 26, 2025
    • hpsc-2025

      Public
      Shell
      24301Updated Jun 8, 2025Jun 8, 2025
    • nbd

      Public
      N-Body generator for Hatrix
      C++
      1000Updated Feb 25, 2025Feb 25, 2025
    • BigCodeBench: Benchmarking Code Generation Towards AGI
      Python
      50000Updated Feb 15, 2025Feb 15, 2025
    • cutlass

      Public
      CUDA Templates for Linear Algebra Subroutines
      C++
      1.4k100Updated Dec 4, 2024Dec 4, 2024
    • HTML
      159000Updated Oct 31, 2024Oct 31, 2024
    • Ongoing Research Project for Mixture of Expert models
      Python
      1200Updated Oct 2, 2024Oct 2, 2024
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      3k000Updated Sep 26, 2024Sep 26, 2024
    • nanoGPT

      Public
      The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      7.4k100Updated Sep 21, 2024Sep 21, 2024
    • Optimized primitives for collective multi-GPU communication
      C++
      990030Updated Aug 2, 2024Aug 2, 2024
    • Python
      49000Updated Jul 17, 2024Jul 17, 2024
    • Hatrix

      Public
      C++
      13121Updated Jul 5, 2024Jul 5, 2024
    • hpsc-2024

      Public
      Shell
      401500Updated Jun 12, 2024Jun 12, 2024
    • FRANK

      Public
      C++
      22110Updated May 9, 2024May 9, 2024
    • Python
      0100Updated Apr 30, 2024Apr 30, 2024
    • grok-1

      Public
      Grok open release
      Python
      8.4k000Updated Mar 17, 2024Mar 17, 2024
    • toast-gpt

      Public
      Python
      1000Updated Mar 8, 2024Mar 8, 2024
    • toast-vit

      Public
      Python
      0000Updated Feb 14, 2024Feb 14, 2024
    • Zero Bubble Pipeline Parallelism
      Python
      3k000Updated Feb 13, 2024Feb 13, 2024
    • main: microsoft/Meagtron-DeepSpeed, cpu: 富岳上で動かすstableブランチ
      Python
      1520Updated Feb 2, 2024Feb 2, 2024
    • 2023 ABCI Llama-2 継続学習プロジェクト
      Python
      31400Updated Jan 22, 2024Jan 22, 2024
    • Python
      209000Updated Dec 15, 2023Dec 15, 2023
    • An adaptable federated learning framework with a central server, supporting diverse datasets, models, and optimizers. Facilitates collaborative, yet private, data training with customizable aggregation algorithms.
      Python
      0000Updated Nov 16, 2023Nov 16, 2023
    • m2

      Public
      Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
      Assembly
      42000Updated Nov 2, 2023Nov 2, 2023
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
      Python
      1.1k000Updated Sep 25, 2023Sep 25, 2023
    • Best practice for training LLaMA models in Megatron-LM
      Python
      3k000Updated Sep 4, 2023Sep 4, 2023
    • Ongoing research training transformer language models at scale, including: BERT & GPT-2
      Python
      3k000Updated Aug 30, 2023Aug 30, 2023
    • elses

      Public
      Fortran
      0000Updated Aug 3, 2023Aug 3, 2023
    • A framework for few-shot evaluation of autoregressive language models.
      Python
      2.7k000Updated Jul 31, 2023Jul 31, 2023