Skip to content
Change the repository type filter

All

    Repositories list

    • A framework for few-shot evaluation of language models.
      Python
      2.7k10k480159Updated Sep 16, 2025Sep 16, 2025
    • bergson

      Public
      Mapping out the "memory" of neural nets with data attribution
      Python
      52612Updated Sep 16, 2025Sep 16, 2025
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      332091510Updated Sep 15, 2025Sep 15, 2025
    • sparsify

      Public
      Sparsify transformers with SAEs and transcoders
      Python
      8362164Updated Sep 15, 2025Sep 15, 2025
    • delphi

      Public
      Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.
      Python
      4421165Updated Sep 15, 2025Sep 15, 2025
    • djinn

      Public
      Provide a lightweight framework for authoring and validating exploitable verifiable coding problems
      Python
      0300Updated Sep 15, 2025Sep 15, 2025
    • Problems generated by djinn (exploitably verifiable coding problems)
      0000Updated Sep 11, 2025Sep 11, 2025
    • Jupyter Notebook
      66100Updated Sep 8, 2025Sep 8, 2025
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      7512Updated Aug 18, 2025Aug 18, 2025
    • Python
      11000Updated Aug 12, 2025Aug 12, 2025
    • Sparsify transformers with cross-layer transcoders
      Python
      831502Updated Aug 12, 2025Aug 12, 2025
    • Tools for understanding how transformer predictions are built layer-by-layer
      Python
      59200Updated Aug 7, 2025Aug 7, 2025
    • attribute

      Public
      Python
      61001Updated Aug 6, 2025Aug 6, 2025
    • Linear probes with attention weighting
      Python
      1600Updated Aug 2, 2025Aug 2, 2025
    • verifiers

      Public
      Verifiers for LLM Reinforcement Learning
      Python
      335000Updated Jul 31, 2025Jul 31, 2025
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      4281381Updated Jul 29, 2025Jul 29, 2025
    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      1.1k7.3k6124Updated Jul 23, 2025Jul 23, 2025
    • Python
      0100Updated Jul 22, 2025Jul 22, 2025
    • MIDI tokenizers and pre-processing utils.
      Python
      3331Updated Jul 21, 2025Jul 21, 2025
    • DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
      Python
      4.6k16801Updated Jul 21, 2025Jul 21, 2025
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      95300Updated Jul 9, 2025Jul 9, 2025
    • aria

      Public
      Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)
      Python
      137000Updated Jul 1, 2025Jul 1, 2025
    • The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      7.5k16010Updated Jun 27, 2025Jun 27, 2025
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      1932.6k143Updated Jun 9, 2025Jun 9, 2025
    • Investigating goal instability in RL
      Python
      0100Updated Jun 2, 2025Jun 2, 2025
    • open-r1

      Public
      Fully open reproduction of DeepSeek-R1
      Python
      2.4k400Updated May 21, 2025May 21, 2025
    • POSER

      Public
      Poser: Unmasking Alignment Faking LLMs by Manipulating Their Internals
      Python
      4200Updated May 21, 2025May 21, 2025
    • tyche

      Public
      Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
      Jupyter Notebook
      0802Updated May 21, 2025May 21, 2025
    • rtopk

      Public
      Cuda
      0100Updated May 20, 2025May 20, 2025
    • wmdp

      Public
      WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.
      Jupyter Notebook
      38000Updated May 15, 2025May 15, 2025