    Repositories list

    • bergson

      Public
      Mapping out the "memory" of neural nets with data attribution
      Python
      52411 Updated Aug 8, 2025
    • Python
      0200 Updated Aug 8, 2025
    • A framework for few-shot evaluation of language models.
      Python
      2.6k9.8k458158 Updated Aug 8, 2025
    • djinn

      Public
      Provide a lightweight framework for authoring and validating exploitable verifiable coding problems
      Python
      0100 Updated Aug 8, 2025
    • Tools for understanding how transformer predictions are built layer-by-layer
      Python
      57200 Updated Aug 7, 2025
    • attribute

      Public
      Python
      5901 Updated Aug 6, 2025
    • delphi

      Public
      Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.
      Python
      3920272 Updated Aug 5, 2025
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      332091510 Updated Aug 4, 2025
    • sparsify

      Public
      Sparsify transformers with SAEs and transcoders
      Python
      8260153 Updated Aug 4, 2025
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      7513 Updated Aug 3, 2025
    • Sparsify transformers with cross-layer transcoders
      Python
      821002 Updated Aug 2, 2025
    • Linear probes with attention weighting
      Python
      1500 Updated Aug 2, 2025
    • verifiers

      Public
      Verifiers for LLM Reinforcement Learning
      Python
      234000 Updated Jul 31, 2025
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      4381181 Updated Jul 29, 2025
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      1.1k7.3k6124 Updated Jul 23, 2025
    • Python
      0000 Updated Jul 22, 2025
    • MIDI tokenizers and pre-processing utils.
      Python
      1130 Updated Jul 21, 2025
    • DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      4.5k16801 Updated Jul 21, 2025
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      95200 Updated Jul 9, 2025
    • aria

      Public
      Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)
      Python
      126800 Updated Jul 1, 2025
    • The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      7.3k14910 Updated Jun 27, 2025
    • Problems generated by djinn (exploitably verifiable coding problems)
      0000 Updated Jun 27, 2025
    • Python
      58100 Updated Jun 13, 2025
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      1912.6k143 Updated Jun 9, 2025
    • Investigating goal instability in RL
      Python
      0100 Updated Jun 2, 2025
    • open-r1

      Public
      Fully open reproduction of DeepSeek-R1
      Python
      2.4k400 Updated May 21, 2025
    • POSER

      Public
      Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
      Python
      4200 Updated May 21, 2025
    • tyche

      Public
      Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
      Jupyter Notebook
      0802 Updated May 21, 2025
    • rtopk

      Public
      Cuda
      0100 Updated May 20, 2025
    • wmdp

      Public
      WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.
      Jupyter Notebook
      36000 Updated May 15, 2025