EIC@GaTech

All

49 repositories

LaCache
Public
[ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models.
Python
•
BSD 3-Clause "New" or "Revised" License
•0•9•0•0•Updated Jul 22, 2025Jul 22, 2025
DiffCR
Public
Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers
Python
•
Apache License 2.0
•0•8•1•0•Updated May 19, 2025May 19, 2025
Early-Bird-Diffusion
Public
[CVPR 2025] "Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training" by Lexington Whalen, Zhenbang Du, Haoran You, Chaojian Li, Sixu Li, and Yingyan (Celine) Lin.
Python
•0•2•0•0•Updated May 5, 2025May 5, 2025
LongMamba
Public
A training-free method for extending the context length of SSMs (State Space Models) and hybrid architectures..
Python
•0•10•1•0•Updated Apr 26, 2025Apr 26, 2025
torchshiftadd
Public
An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.
Python
•
Apache License 2.0
•0•13•0•0•Updated Feb 3, 2025Feb 3, 2025
Omni-Recon
Public
[ECCV 2024 Oral] "Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields" by Yonggan Fu, Huaizhi Qu, Zhifan Ye, Chaojian Li, Kevin Zhao, and Yingyan (Celine) Lin.
surface-reconstruction nerf 3d-reconstruction real-time-rendering generalizable-nerf
Python
•
MIT License
•0•8•1•0•Updated Dec 14, 2024Dec 14, 2024
AmoebaLLM
Public
[NeurIPS 2024] "AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment" by Yonggan Fu, Zhongzhi Yu, Junwei Li, Jiayi Qian, Yongan Zhang, Xiangchi Yuan, Dachuan Shi, Roman Yakunin, and Yingyan (Celine) Lin.
language-model efficient-llm-inference
Python
•
MIT License
•3•14•0•0•Updated Dec 13, 2024Dec 13, 2024
ShiftAddLLM
Public
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
Python
•
Apache License 2.0
•16•110•5•0•Updated Oct 15, 2024Oct 15, 2024
mg-verilog
Public
Python
•
MIT License
•9•47•0•0•Updated Oct 8, 2024Oct 8, 2024
LLM4HWDesign_Starting_Toolkit
Public
LLM4HWDesign Starting Toolkit
Python
•4•17•1•0•Updated Oct 4, 2024Oct 4, 2024
ACT
Public
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration
Python
•1•41•2•0•Updated Jun 30, 2024Jun 30, 2024
Edge-LLM
Public
[DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting
Python
•9•63•2•0•Updated Jun 30, 2024Jun 30, 2024
Linearized-LLM
Public
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
Python
•
Apache License 2.0
•3•33•1•0•Updated Jun 12, 2024Jun 12, 2024
Castling-ViT
Public
[CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference
Python
•
Apache License 2.0
•1•30•1•0•Updated Mar 14, 2024Mar 14, 2024
NeRFool
Public
[ICML 2023] "NeRFool: Uncovering the Vulnerability of Generalizable Neural Radiance Fields against Adversarial Perturbations" by Yonggan Fu, Ye Yuan, Souvik Kundu, Shang Wu, Shunyao Zhang, Yingyan (Celine) Lin
adversarial-robustness neural-radiance-fields
Python
•
MIT License
•1•18•0•0•Updated Mar 10, 2024Mar 10, 2024
CPT
Public
[ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vikas Chandra, and Yingyan (Celine) Lin.
pytorch quantization efficient-training low-precision-training
Python
•
MIT License
•6•31•2•1•Updated Mar 2, 2024Mar 2, 2024
ShiftAddViT
Public
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
Python
•
Apache License 2.0
•0•31•1•0•Updated Dec 6, 2023Dec 6, 2023
TinyML2023EIC-Gatech-Open
Public
C
•0•6•0•0•Updated Oct 19, 2023Oct 19, 2023
BNS-GCN
Public
[MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node Sampling" by Cheng Wan, Youjie Li, Ang Li, Nam Sung Kim, Yingyan Lin
sampling graph-convolutional-networks distributed-training graph-neural-networks
Python
•
MIT License
•12•56•0•0•Updated Oct 6, 2023Oct 6, 2023
S3-Router
Public
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing" by Yonggan Fu, Yang Zhang, Kaizhi Qian, Zhifan Ye, Zhongzhi Yu, Cheng-I Lai, Yingyan Lin
self-supervised-learning automated-speech-recognition asr-pruning
Python
•
MIT License
•2•17•1•0•Updated Sep 19, 2023Sep 19, 2023
ViTCoD
Public
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
Python
•
Apache License 2.0
•12•115•2•0•Updated Jun 27, 2023Jun 27, 2023
Hint-Aug
Public
Python
•
MIT License
•0•5•0•0•Updated Jun 25, 2023Jun 25, 2023
HW-NAS-Bench
Public
[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark
Python
•
MIT License
•19•111•2•0•Updated Apr 18, 2023Apr 18, 2023
HALO
Public
The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"
Python
•
MIT License
•0•10•0•0•Updated Mar 22, 2023Mar 22, 2023
PipeGCN
Public
[ICLR 2022] "PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication" by Cheng Wan, Youjie Li, Cameron R. Wolfe, Anastasios Kyrillidis, Nam Sung Kim, Yingyan Lin
graph-convolutional-networks distributed-training graph-neural-networks
Python
•
MIT License
•7•33•0•0•Updated Mar 15, 2023Mar 15, 2023
ViTALiTy
Public
ViTALiTy (HPCA'23) Code Repository
Python
•
Apache License 2.0
•6•23•2•0•Updated Mar 13, 2023Mar 13, 2023
Spline-EB
Public
[TMLR] Max-Affine Spline Insights Into Deep Network Pruning
Python
•
MIT License
•0•1•0•0•Updated Nov 12, 2022Nov 12, 2022
TinyML-Contest-Solution
Public
7•11•0•0•Updated Oct 27, 2022Oct 27, 2022
DNN-Chip-Predictor
Public
[ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architectures
Python
•6•25•1•0•Updated Oct 1, 2022Oct 1, 2022
NASA
Public
[ICCAD 2022] NASA: Neural Architecture Search and Acceleration for Hardware Inspired Hybrid Networks
Python
•0•10•0•0•Updated Sep 22, 2022Sep 22, 2022