Skip to content
Change the repository type filter

All

    Repositories list

    • Decouple Torch Network-Aware Training on Interlinked Online Nodes
      Python
      0401Updated Aug 13, 2025Aug 13, 2025
    • Python
      2100Updated Aug 6, 2025Aug 6, 2025
    • sensai

      Public
      Sensai serves teacher logits for knowledge distillation.
      Python
      0000Updated Jul 19, 2025Jul 19, 2025
    • organoids

      Public
      Automatic segmentation and analysis of organoids
      Python
      0000Updated Jul 13, 2025Jul 13, 2025
    • swiss army knife of scripts for transforming and processing datasets for machine learning
      Python
      1100Updated Jul 11, 2025Jul 11, 2025
    • mlgroom

      Public
      Grooming ML job queues
      Python
      0010Updated Jul 9, 2025Jul 9, 2025
    • regmix

      Public
      [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)
      Jupyter Notebook
      11000Updated May 28, 2025May 28, 2025
    • Danish Foundation Models landing page
      Python
      0000Updated May 20, 2025May 20, 2025
    • mltiming

      Public
      timing context manager for typical ML training
      Python
      0200Updated May 11, 2025May 11, 2025
    • trl

      Public
      Train transformer language models with reinforcement learning.
      Python
      2.1k000Updated May 2, 2025May 2, 2025
    • .home

      Public
      .home.sh script for persistent UCloud directories
      Shell
      0000Updated Apr 6, 2025Apr 6, 2025
    • A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
      Python
      5.8k000Updated Apr 3, 2025Apr 3, 2025
    • SISO

      Public
      Official implementation of "Single Image Iterative Subject-driven Generation and Editing".
      Python
      5000Updated Mar 31, 2025Mar 31, 2025
    • syntheval

      Public
      Software for evaluating the quality of synthetic data compared with real data.
      Python
      82810Updated Mar 24, 2025Mar 24, 2025
    • aimrun

      Public
      simple interface for integrating aim into MLOps frameworks
      Python
      1001Updated Mar 23, 2025Mar 23, 2025
    • A project for training foundational Danish language model
      Python
      6000Updated Mar 21, 2025Mar 21, 2025
    • OLMo

      Public
      Modeling, training, eval, and inference code for OLMo
      Python
      646100Updated Mar 19, 2025Mar 19, 2025
    • Python
      427000Updated Feb 5, 2025Feb 5, 2025
    • meta library for synthetic data generation
      Python
      0200Updated Jan 14, 2025Jan 14, 2025
    • NCCL Tests
      Cuda
      305000Updated Jan 8, 2025Jan 8, 2025
    • .tmux

      Public
      🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
      Shell
      3.5k000Updated Dec 17, 2024Dec 17, 2024
    • bitlinear

      Public
      BitLinear implementation
      Python
      53311Updated Dec 10, 2024Dec 10, 2024
    • dolma

      Public
      Data and tools for generating and inspecting OLMo pre-training data.
      Python
      146000Updated Dec 10, 2024Dec 10, 2024
    • olmes

      Public
      Reproducible, flexible LLM evaluations
      Python
      42000Updated Dec 9, 2024Dec 9, 2024
    • nanoT5

      Public
      Fast & Simple repository for pre-training and fine-tuning T5-style models
      Python
      74102Updated Dec 4, 2024Dec 4, 2024
    • DeMo

      Public
      DeMo: Decoupled Momentum Optimization
      Python
      9000Updated Dec 2, 2024Dec 2, 2024
    • diffusers

      Public
      🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
      Python
      6.2k000Updated Oct 1, 2024Oct 1, 2024
    • Schedule-Free Optimization in PyTorch
      Python
      72000Updated Sep 24, 2024Sep 24, 2024
    • aim

      Public
      Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
      Python
      351000Updated Aug 3, 2024Aug 3, 2024
    • cramming

      Public
      Cramming the training of a (BERT-type) language model into limited compute.
      Python
      101000Updated Jul 4, 2024Jul 4, 2024