Skip to content
Change the repository type filter

All

    Repositories list

    • OLMost every training recipe you need to perform data interventions with the OLMo family of models.
      Python
      948130Updated Sep 19, 2025Sep 19, 2025
    • AllenAI's post-training codebase
      Python
      4403.2k1629Updated Sep 19, 2025Sep 19, 2025
    • dolma

      Public
      Data and tools for generating and inspecting OLMo pre-training data.
      Python
      1511.3k717Updated Sep 19, 2025Sep 19, 2025
    • OLMo-core

      Public
      PyTorch building blocks for the OLMo ecosystem
      Python
      55292134Updated Sep 19, 2025Sep 19, 2025
    • A simple evaluation of generative language models and safety classifiers.
      Python
      186402Updated Sep 18, 2025Sep 18, 2025
    • rslearn

      Public
      A tool for developing remote sensing datasets and models.
      Python
      641166Updated Sep 18, 2025Sep 18, 2025
    • olmocr

      Public
      Toolkit for linearizing PDFs for LLM datasets/training
      Python
      1k14k218Updated Sep 18, 2025Sep 18, 2025
    • Python
      2121614Updated Sep 18, 2025Sep 18, 2025
    • scispacy

      Public
      A full spaCy pipeline and models for scientific/biomedical documents.
      Python
      2451.9k354Updated Sep 17, 2025Sep 17, 2025
    • ai2thor

      Public
      An open-source platform for Visual AI.
      C#
      2571.5k2685Updated Sep 17, 2025Sep 17, 2025
    • FlexOlmo

      Public
      Code and training scripts for FlexOlmo
      Python
      1098310Updated Sep 17, 2025Sep 17, 2025
    • Code for in-loop evaluation tasks used by the OLMo training team
      Python
      4600Updated Sep 17, 2025Sep 17, 2025
    • nora_lib

      Public
      Python
      0100Updated Sep 16, 2025Sep 16, 2025
    • Python
      2704Updated Sep 16, 2025Sep 16, 2025
    • Fluid Language Model Benchmarking
      Python
      01010Updated Sep 16, 2025Sep 16, 2025
    • Data mapping framework for rust stuff
      Rust
      1502Updated Sep 15, 2025Sep 15, 2025
    • beaker-py

      Public
      A pure-Python Beaker client
      Python
      21715Updated Sep 15, 2025Sep 15, 2025
    • OLMoASR

      Public
      An open-source implementation of Whisper
      Python
      3642865Updated Sep 15, 2025Sep 15, 2025
    • Friends of OLMo and their links.
      2933111Updated Sep 15, 2025Sep 15, 2025
    • Repository for task scaling laws using model ladders
      Python
      0108Updated Sep 15, 2025Sep 15, 2025
    • regmixer

      Public
      Jupyter Notebook
      0602Updated Sep 14, 2025Sep 14, 2025
    • lfmc

      Public
      Live Fuel Moisture Content (LFMC) from satellite data
      Python
      11140Updated Sep 13, 2025Sep 13, 2025
    • Python
      824014Updated Sep 12, 2025Sep 12, 2025
    • molmoact

      Public
      Official Repository for MolmoAct
      Python
      1519110Updated Sep 12, 2025Sep 12, 2025
    • Set up your GitHub Actions workflow with the Beaker command-line client
      Python
      2608Updated Sep 12, 2025Sep 12, 2025
    • Gantry is a CLI that streamlines running experiments in Beaker
      Python
      72722Updated Sep 11, 2025Sep 11, 2025
    • S2AND

      Public
      Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite
      Python
      209540Updated Sep 10, 2025Sep 10, 2025
    • AskOlmo

      Public
      Python
      01200Updated Sep 10, 2025Sep 10, 2025
    • sinonym

      Public
      Format and normalize Chinese names into Western forms
      Python
      1300Updated Sep 10, 2025Sep 10, 2025
    • atlantes

      Public
      Efficient and low latency real-time global-scale GPS trajectory modeling
      Python
      125915Updated Sep 8, 2025Sep 8, 2025