Toolings and library for Ahead of Time (AOT) Triton compilation and packaging for inference serving.
TBA
This codebase is Apache 2.0 licensed, as found in the LICENSE file.
The overall project is made possible thanks to the joint work from many technical contributors (listed in alphabetical order):
Sijia Chen, Huamin Li, Chloe Liu, Xing Liu, Linjian Ma, Bert Maher, Zhuoran Zhao