    Repositories list

    • Experiments to reproduce the results from Optimal Weight Formats
      Jupyter Notebook
      0000 Updated Jul 29, 2025
    • verl-fork

      Public
      verl: Volcano Engine Reinforcement Learning for LLMs
      Python
      2k000 Updated Jul 29, 2025
    • ao-fork

      Public
      PyTorch native quantization and sparsity for training and inference
      Python
      309001 Updated Jul 17, 2025
    • A library for unit scaling in PyTorch
      Jupyter Notebook
      11128110 Updated Jul 11, 2025
    • Jupyter Notebook
      0400 Updated Jul 4, 2025
    • A Python toolbox to compute topological metrics and statistics for Knowledge Graphs
      Jupyter Notebook
      0810 Updated Jun 27, 2025
    • A library to analyze PyTorch traces.
      Python
      65001 Updated Jun 18, 2025
    • Support materials for "On Stochastic Rounding with Few Random Bits", Fitzgibbon and Felix, ARITH 2025
      Python
      7.3k000 Updated Jun 2, 2025
    • Support materials for "On Stochastic Rounding with Few Random Bits", Fitzgibbon and Felix, ARITH 2025
      Python
      7.3k100 Updated Jun 2, 2025
    • minimol

      Public
      MiniMol is a 10M-parameter molecular fingerprinting model pre-trained on >3300 biological and quantum tasks
      Jupyter Notebook
      42030 Updated May 29, 2025
    • Reproducible, flexible LLM evaluations
      Python
      40000 Updated May 21, 2025
    • A PyTorch native library for large model training
      Python
      449002 Updated Apr 8, 2025
    • Packages simple-evals in an installable pip package
      Python
      0000 Updated Mar 26, 2025
    • Bucketed top-k for PyTorch using a priority queue
      Python
      0610 Updated Mar 22, 2025
    • gfloat

      Public
      Generic floating-point types in Python
      Python
      31300 Updated Mar 21, 2025
    • coconut

      Public
      Training Large Language Models to Reason in a Continuous Latent Space
      Jupyter Notebook
      113000 Updated Feb 10, 2025
    • open-r1

      Public
      Fully open reproduction of DeepSeek-R1
      Python
      2.3k000 Updated Feb 6, 2025
    • JAX Scalify: end-to-end scaled arithmetic
      Python
      01671 Updated Oct 30, 2024
    • mess

      Public archive
      MESS: Modern Electronic Structure Simulations
      Python
      22000 Updated Sep 24, 2024
    • Track & Visualisation tool for numerics debugging
      Python
      0620 Updated Sep 20, 2024
    • An experimentation platform for LLM inference optimisation
      Jupyter Notebook
      43200 Updated Sep 19, 2024
    • Fork of pytorch-labs/gpt-fast with SparQ attention and benchmarking
      Python
      2300 Updated Sep 19, 2024
    • LLM inference in C/C++
      C++
      13k200 Updated Sep 16, 2024
    • ml_dtypes

      Public
      A stand-alone implementation of several NumPy dtype extensions used in machine learning.
      C++
      43000 Updated Sep 13, 2024
    • Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.
      Jupyter Notebook
      34600 Updated Jul 17, 2024
    • Graphium fork for Scaling Molecular GNNs project at Graphcore
      Python
      12003 Updated Apr 8, 2024
    • bess-kge

      Public
      A PyTorch library for Knowledge Graph Embedding on Graphcore IPUs implementing the distribution framework BESS
      Jupyter Notebook
      2300 Updated Mar 21, 2024
    • Path-tracer with Neural HDRI for Graphcore IPUs.
      C++
      2300 Updated Mar 12, 2024
    • TessellateIPU: low level Poplar tile programming from Python
      Python
      01341 Updated Mar 12, 2024
    • Poplar implementation of FlashAttention for IPU
      C++
      0300 Updated Mar 12, 2024