Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      2839902Updated Aug 18, 2025Aug 18, 2025
    • KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
      Python
      59526810Updated Aug 7, 2025Aug 7, 2025
    • SCSS
      16200Updated Jul 29, 2025Jul 29, 2025
    • Samples of good AI generated CUDA kernels
      Python
      98810Updated May 30, 2025May 30, 2025
    • TPT

      Public
      Welcome to TPT, a framework for teaching large language models to solve math problems by learning from (and improving on) their own reasoning traces.
      Python
      4600Updated May 29, 2025May 29, 2025
    • caesar

      Public
      Throughput-oriented multi-turn inference engine for KernelBench [ICML '25]
      Python
      51300Updated May 27, 2025May 27, 2025
    • Archon

      Public
      Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
      Python
      2017730Updated Mar 7, 2025Mar 7, 2025
    • Python
      24120Updated Jan 28, 2025Jan 28, 2025
    • 0000Updated Dec 3, 2024Dec 3, 2024
    • CATS

      Public
      Python
      52820Updated Nov 11, 2024Nov 11, 2024
    • Python
      2510130Updated Sep 25, 2024Sep 25, 2024
    • Python
      1200Updated Jul 31, 2024Jul 31, 2024
    • Jupyter Notebook
      01300Updated Jul 31, 2024Jul 31, 2024
    • hydragen

      Public
      Hydragen: High-Throughput LLM Inference with Shared Prefixes
      Python
      34130Updated May 10, 2024May 10, 2024