Model Cache Utils (MCU)

Model Cache Utils (MCU), formerly the Triton Kernel Development Kit (TKDK), is a suite of tools that streamlines the development workflow for model kernel developers. It helps with optimizing cache usage, monitoring kernel performance, and distributing builds securely. MCU supports Triton and vLLM.

Features

Model Cache Manager (MCM)

Organize, index, and monitor your model kernel caches. This tool reports on cache usage, giving data-driven insight into compilation performance and cache effectiveness. For more information, see the MCM readme.
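To make the idea of a cache usage report concrete, here is a minimal sketch that is not MCM's implementation or API. It walks a cache directory and totals file counts and sizes per top-level entry. The function name `summarize_cache` is hypothetical, and the grouping assumes one sub-directory per cached kernel. The default path reflects Triton's usual cache location, `~/.triton/cache`, which the `TRITON_CACHE_DIR` environment variable overrides.

```python
import os
from pathlib import Path


def summarize_cache(root):
    """Return file count, total bytes, and per-entry sizes for a cache directory.

    Hypothetical helper for illustration only; it is not part of MCM.
    """
    root = Path(root)
    per_entry = {}
    file_count = 0
    for path in root.rglob("*"):
        if not path.is_file():
            continue
        size = path.stat().st_size
        file_count += 1
        # Group by the first path component under root
        # (assumed here: one sub-directory per cached kernel).
        entry = path.relative_to(root).parts[0]
        per_entry[entry] = per_entry.get(entry, 0) + size
    return {
        "files": file_count,
        "bytes": sum(per_entry.values()),
        "per_entry": per_entry,
    }


if __name__ == "__main__":
    # TRITON_CACHE_DIR overrides Triton's default cache location.
    default_dir = Path.home() / ".triton" / "cache"
    cache_dir = Path(os.environ.get("TRITON_CACHE_DIR", str(default_dir)))
    if cache_dir.is_dir():
        print(summarize_cache(cache_dir))
    else:
        print(f"no cache directory at {cache_dir}")
```

A report like this only covers disk usage. MCM's own reports go further, so treat the sketch as a starting point for ad hoc inspection.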

Model Cache Vault (MCV)

Package model/GPU kernel caches into OCI-compliant container images. Secure your caches with cryptographic signing, enabling safe and efficient cache distribution and reuse across environments and teams. For more information, see the MCV readme.

Triton Util

Write cleaner, more intuitive Triton code with high-level abstractions and utilities for loading, storing, and debugging GPU memory.

Triton-util was developed by Umer Adil and generously contributed to MCU.

For more information please see the Triton Util readme.

Getting Started

Clone this repository:

git clone https://github.com/redhat-et/MCU.git
cd MCU

Follow setup instructions for each tool in its respective directory.

Project Structure

MCU/
├── mcm/           # Model Cache Manager
├── mcv/           # Model Cache Vault (OCI packaging and signing)
├── triton_util/   # Triton Utilities
└── README.md      # You're here!

Security & Distribution

Model Cache Vault ensures that your cache packages are:

Packaged using OCI standards
Signed cryptographically so that tampering can be detected
Easily distributable across environments and pipelines
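The integrity property above rests on content digests: OCI images address their content by SHA-256 digest, and a signature then vouches for that digest. The sketch below is not MCV's code and performs no signing. It only illustrates, under the assumption that a cache is a plain directory of files, how a digest comparison reveals a modified cache. The names `directory_digest` and `is_unmodified` are hypothetical.

```python
import hashlib
from pathlib import Path


def directory_digest(root):
    """Return a SHA-256 hex digest over the relative paths and contents under root.

    Hypothetical helper for illustration only; it is not part of MCV.
    """
    root = Path(root)
    h = hashlib.sha256()
    # Sort so the digest does not depend on filesystem iteration order.
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        rel = path.relative_to(root).as_posix().encode("utf-8")
        data = path.read_bytes()
        # Length-prefix each field so that different (path, content)
        # splits cannot produce the same byte stream.
        h.update(len(rel).to_bytes(8, "big"))
        h.update(rel)
        h.update(len(data).to_bytes(8, "big"))
        h.update(data)
    return h.hexdigest()


def is_unmodified(root, expected_digest):
    """Return True if the directory still matches the digest recorded earlier."""
    return directory_digest(root) == expected_digest
```

A digest alone only detects change relative to a trusted reference value. Establishing who produced that value is what the cryptographic signature adds.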

Use Cases

Improve Triton/vLLM kernel cache management
Package and share caches across machines or Kubernetes environments

Contributing

We welcome contributions! If you find bugs, have feature suggestions, or want to contribute code, please open an issue or submit a pull request.

License

Apache License, Version 2.0. See LICENSE for details.
