    Repositories list

    • vllm-fork

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      9.1k788101 Updated Aug 2, 2025
    • Python
      3613122 Updated Aug 1, 2025
    • Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
      Python
      265400 Updated Aug 1, 2025
    • HCL

      Public
      C++
      61000 Updated Jul 31, 2025
    • C++
      51710 Updated Jul 30, 2025
    • SGLang is a fast serving framework for large language models and vision language models.
      Python
      2.5k000 Updated Jul 30, 2025
    • Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://developer.habana.ai/
      Jupyter Notebook
      506165 Updated Jul 29, 2025
    • Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      25k205 Updated Jul 29, 2025
    • Reference models for Intel(R) Gaudi(R) AI Accelerator
      Jupyter Notebook
      9016713 Updated Jul 25, 2025
    • gohlml

      Public
      HABANA Management Library bindings for Go
      Go
      4301 Updated Jul 23, 2025
    • hccl_demo

      Public
      C++
      202221 Updated Jul 23, 2025
    • slurm

      Public
      Slurm: A Highly Scalable Workload Manager
      C
      721201 Updated Jul 4, 2025
    • NIC drivers (Ethernet, IBverbs and common) for the NIC IP that is inside Intel's data-center GPU
      C
      2007 Updated Jun 19, 2025
    • Intel® Gaudi® Software is an implementation of the runtime and graph compiler for Gaudi3
      C++
      5911 Updated Jun 17, 2025
    • Apptainer: Application containers for Linux
      Go
      155000 Updated Jun 13, 2025
    • Setup and Installation Instructions for Habana binaries, docker image creation
      Python
      162565 Updated May 19, 2025
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      4.5k1303 Updated May 19, 2025
    • Ongoing research training transformer models at scale
      Python
      3k500 Updated May 19, 2025
    • Provides the examples to write and build Habana custom kernels using the HabanaTools
      C++
      252233 Updated Apr 15, 2025
    • The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
      Python
      3.6k1015 Updated Apr 11, 2025
    • 0000 Updated Apr 7, 2025
    • C
      0210 Updated Apr 3, 2025
    • SynapseAI_Core

      Public archive
      SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi
      C
      64220 Updated Feb 3, 2025
    • pyhlml

      Public archive
      Python
      0100 Updated Feb 3, 2025
    • DL1-Workshop

      Public archive
      Jupyter Notebook
      0200 Updated Feb 3, 2025
    • TOWL

      Public
      HTML
      3300 Updated Jan 16, 2025
    • Intel Gaudi's Megatron DeepSpeed Large Language Models for training
      Python
      3k1301 Updated Dec 19, 2024
    • C++
      1200 Updated Dec 9, 2024
    • Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases
      Jupyter Notebook
      61200 Updated Dec 2, 2024
    • AutoGPTQ

      Public
      An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
      Python
      523002 Updated Nov 19, 2024