Skip to content
Change the repository type filter

All

    Repositories list

    • BioNeMo Framework: For building and adapting AI models in drug discovery at scale
      Jupyter Notebook
      925555794Updated Oct 30, 2025Oct 30, 2025
    • The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      4061.5k5122Updated Oct 30, 2025Oct 30, 2025
    • The CUDA target for Numba
      Python
      422079125Updated Oct 30, 2025Oct 30, 2025
    • NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      1361214Updated Oct 30, 2025Oct 30, 2025
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
      Python
      5332.9k22289Updated Oct 30, 2025Oct 30, 2025
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      67359206187Updated Oct 30, 2025Oct 30, 2025
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      2842k1.1k176Updated Oct 30, 2025Oct 30, 2025
    • TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      C++
      1.8k12k738424Updated Oct 30, 2025Oct 30, 2025
    • CUDA Python: Performance meets Productivity
      Python
      2173k17911Updated Oct 30, 2025Oct 30, 2025
    • Ongoing research training transformer models at scale
      Python
      3.2k14k315147Updated Oct 30, 2025Oct 30, 2025
    • cudaqx

      Public
      Accelerated libraries for quantum-classical computing built on CUDA-Q.
      C++
      34632311Updated Oct 30, 2025Oct 30, 2025
    • A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.
      Python
      1881.5k12141Updated Oct 30, 2025Oct 30, 2025
    • RAPIDS Accelerator JNI For Apache Spark
      Cuda
      7551787Updated Oct 30, 2025Oct 30, 2025
    • C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      29483241590Updated Oct 30, 2025Oct 30, 2025
    • Showcase JaxPP with MaxText
      Python
      418203Updated Oct 30, 2025Oct 30, 2025
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      885186917Updated Oct 30, 2025Oct 30, 2025
    • NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4062.4k39659Updated Oct 30, 2025Oct 30, 2025
    • stdexec

      Public
      `std::execution`, the proposed C++ framework for asynchronous and parallel programming.
      C++
      2082.1k11012Updated Oct 30, 2025Oct 30, 2025
    • Documentation repository for NVIDIA Cloud Native Technologies
      PowerShell
      302949Updated Oct 30, 2025Oct 30, 2025
    • nv-one-logger enables tracking of GPU application progress over time and can help to identify overhead from workload and cluster inefficiencies to provide efficiency metrics.
      Python
      32000Updated Oct 30, 2025Oct 30, 2025
    • Starting October 1, 2025, NVIDIA PSIRT will publish an initial set of security bulletins on GitHub in Markdown, CSAF, and CVE formats. Coverage will expand over time, while all bulletins remain available on the Product Security website.
      0500Updated Oct 30, 2025Oct 30, 2025
    • nv-ingest

      Public
      NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
      Python
      2722.8k9735Updated Oct 30, 2025Oct 30, 2025
    • Example dependency patterns for OSS build tools to support
      Starlark
      1100Updated Oct 30, 2025Oct 30, 2025
    • JAX-Toolbox
      Python
      663568038Updated Oct 30, 2025Oct 30, 2025
    • MatX

      Public
      An efficient C++17 GPU numerical computing library with Python-like syntax
      C++
      1081.4k315Updated Oct 30, 2025Oct 30, 2025
    • skyhook

      Public
      A Kubernetes Operator to manage Node OS customizations.
      Go
      32801Updated Oct 30, 2025Oct 30, 2025
    • The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.
      Go
      174131Updated Oct 30, 2025Oct 30, 2025
    • VisRTX

      Public
      NVIDIA OptiX based implementation of ANARI
      C++
      3526590Updated Oct 30, 2025Oct 30, 2025
    • Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
      Python
      4692k4434Updated Oct 30, 2025Oct 30, 2025
    • aistore

      Public
      AIStore: scalable storage for AI applications
      Go
      2221.6k00Updated Oct 30, 2025Oct 30, 2025