Skip to content
Change the repository type filter

All

    Repositories list

    • truss

      Public
      The simplest way to serve AI/ML models in production
      Python
      881k6429Updated Aug 14, 2025Aug 14, 2025
    • Ready-to-use ML training recipes to help you build and deploy models on Baseten.
      0201Updated Aug 13, 2025Aug 13, 2025
    • Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
      Python
      17000Updated Aug 12, 2025Aug 12, 2025
    • Examples of models deployable with Truss
      Python
      481951457Updated Aug 11, 2025Aug 11, 2025
    • Taming Stable Diffusion for Lip Sync!
      Python
      762002Updated Aug 6, 2025Aug 6, 2025
    • A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
      Python
      116103Updated Aug 6, 2025Aug 6, 2025
    • gorilla

      Public
      Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
      Python
      1.2k000Updated Aug 6, 2025Aug 6, 2025
    • harmony

      Public
      Renderer for the harmony response format to be used with gpt-oss
      Rust
      172000Updated Aug 5, 2025Aug 5, 2025
    • A GitHub action to create a pull request for changes to your repository in the actions workspace
      TypeScript
      490002Updated Jul 22, 2025Jul 22, 2025
    • Provides the function of slack notification to GitHub Actions.
      TypeScript
      142002Updated Jul 22, 2025Jul 22, 2025
    • lws

      Public
      LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
      Go
      95002Updated Jul 18, 2025Jul 18, 2025
    • llm-tools

      Public
      Python
      0001Updated Jul 15, 2025Jul 15, 2025
    • Reports junit test results as GitHub Pull Request Check
      TypeScript
      143003Updated Jul 12, 2025Jul 12, 2025
    • Go
      0001Updated Jul 7, 2025Jul 7, 2025
    • Front-End Take Home Challenge
      TypeScript
      12201Updated Jun 30, 2025Jun 30, 2025
    • Build agents powered by open models
      Jupyter Notebook
      1300Updated Jun 12, 2025Jun 12, 2025
    • Workshop materials for AI Engineer World's Fair
      Jupyter Notebook
      11800Updated Jun 3, 2025Jun 3, 2025
    • ✨ A Github Action which sets the base and head SHAs required for `nx affected` commands in CI
      TypeScript
      83001Updated May 15, 2025May 15, 2025
    • :octocat: Github action to retrieve all (added, copied, modified, deleted, renamed, type changed, unmerged, unknown) files and directories.
      TypeScript
      301001Updated May 15, 2025May 15, 2025
    • TypeScript
      0101Updated May 15, 2025May 15, 2025
    • 1100Updated Mar 24, 2025Mar 24, 2025
    • Add Honeycomb Markers to your GitHub Actions workflows.
      Dockerfile
      6000Updated Mar 17, 2025Mar 17, 2025
    • setup-mpi

      Public
      Set up your GitHub Actions workflow to use MPI
      Shell
      5000Updated Mar 17, 2025Mar 17, 2025
    • FlashInfer: Kernel Library for LLM Serving
      Cuda
      436000Updated Feb 6, 2025Feb 6, 2025
    • .github

      Public
      0000Updated Jan 13, 2025Jan 13, 2025
    • Autoscaling components for Kubernetes
      Go
      4.2k003Updated Dec 11, 2024Dec 11, 2024
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      1.1k002Updated Nov 7, 2024Nov 7, 2024
    • Jupyter Notebook
      1200Updated Sep 14, 2024Sep 14, 2024
    • Python
      121900Updated Jun 26, 2024Jun 26, 2024
    • NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
      Go
      371003Updated Apr 19, 2024Apr 19, 2024