ternaryLLM Code for Fast Ternary Large Language Model Inference with Addition-Based Sparse GEMM on Edge Devices Initial contribution: CPU code: Mila and Shien GPU code: Guanshujie