Skip to content
Change the repository type filter

All

    Repositories list

    • SymbioticLab website
      TeX
      Apache License 2.0
      19570Updated Jun 9, 2025Jun 9, 2025
    • Venn

      Public
      [MLSys' 25] Resource Management Across Collaborative Learning Jobs
      Python
      MIT License
      0400Updated May 6, 2025May 6, 2025
    • .github

      Public
      1000Updated May 2, 2025May 2, 2025
    • Oobleck

      Public
      A resilient distributed training framework
      Python
      Apache License 2.0
      89530Updated Apr 11, 2024Apr 11, 2024
    • FedScale

      Public
      FedScale is a scalable and extensible open-source federated learning (FL) platform.
      Python
      Apache License 2.0
      121398412Updated Dec 18, 2023Dec 18, 2023
    • gaia-lib

      Public
      Java
      0001Updated Jul 5, 2023Jul 5, 2023
    • A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
      Python
      53500Updated Jan 9, 2023Jan 9, 2023
    • Aequitas

      Public
      Aequitas enables RPC-level QoS in datacenter networks.
      C++
      BSD 3-Clause "New" or "Revised" License
      21600Updated Jul 19, 2022Jul 19, 2022
    • Java
      Apache License 2.0
      0004Updated Jul 6, 2022Jul 6, 2022
    • Justitia

      Public
      Justitia provides RDMA isolation between applications with diverse requirements.
      C
      BSD 3-Clause "New" or "Revised" License
      74020Updated May 25, 2022May 25, 2022
    • Hydra

      Public
      Hydra adds resilience and high availability to remote memory solutions.
      C
      43100Updated Feb 22, 2022Feb 22, 2022
    • Fluid

      Public
      A Generic Resource-Aware Hyperparameter Tuning Execution Engine
      Python
      Apache License 2.0
      31510Updated Jan 8, 2022Jan 8, 2022
    • tensorflow fork with Salus integration
      C++
      Apache License 2.0
      51100Updated Jan 7, 2022Jan 7, 2022
    • Java
      0002Updated Dec 14, 2021Dec 14, 2021
    • Memtrade

      Public
      C
      2400Updated Oct 27, 2021Oct 27, 2021
    • Oort

      Public
      Oort: Efficient Federated Learning via Guided Participant Selection
      Python
      Apache License 2.0
      2612610Updated Oct 27, 2021Oct 27, 2021
    • Kayak

      Public
      Proactive-adaptive arbitration between shipping compute and shipping data
      Rust
      51810Updated Jul 8, 2021Jul 8, 2021
    • gaiasim

      Public
      Emulator/Simulator of Gaia/Terra
      Java
      0000Updated Jun 3, 2021Jun 3, 2021
    • Sol

      Public
      A Federated Execution Engine for Fast Distributed Computation Over Slow Networks
      Scala
      Apache License 2.0
      72600Updated Apr 26, 2021Apr 26, 2021
    • Infiniswap enables unmodified applications to efficiently use disaggregated memory.
      C
      50247152Updated Sep 26, 2020Sep 26, 2020
    • Leap

      Public
      Prefetching and efficient data path for memory disaggregation
      C
      Other
      236740Updated Jul 16, 2020Jul 16, 2020
    • Tiresias

      Public
      Tiresias is a GPU cluster manager for distributed deep learning training.
      Python
      Apache License 2.0
      5015440Updated May 7, 2020May 7, 2020
    • Salus

      Public
      Fine-grained GPU sharing primitives
      Jupyter Notebook
      Apache License 2.0
      1914120Updated Mar 13, 2020Mar 13, 2020
    • Network testing with boost Asio, DPDK and possibly other technologies
      C
      2100Updated Apr 14, 2019Apr 14, 2019
    • papercite

      Public
      Bibtex plugin for wordpress
      PHP
      GNU General Public License v2.0
      47000Updated Jan 2, 2019Jan 2, 2019
    • rdmaMQ

      Public
      C++
      1300Updated Dec 10, 2018Dec 10, 2018
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      C++
      Other
      24k000Updated Sep 19, 2018Sep 19, 2018
    • Testbench for experimenting with Apache Hive at any data scale.
      Python
      193000Updated Jun 30, 2018Jun 30, 2018
    • lipwig

      Public
      A slightly moist lipstick-on-pig clone for Apache Hive
      Python
      19000Updated Mar 9, 2018Mar 9, 2018
    • Big Bench Workload Development
      Shell
      Other
      71400Updated Feb 12, 2018Feb 12, 2018