- 👋 Hi, I’m @DerrickYLJ
- 👀 I’m interested in ...
- 🌱 I’m currently learning ...
- 💞️ I’m looking to collaborate on ...
- 📫 How to reach me ...
Highlights
- Pro
Pinned Loading
-
TidalDecode
TidalDecode Public[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
-
flexflow/flexflow-train
flexflow/flexflow-train PublicAutomatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
-
Blocking_Waived_Estimation
Blocking_Waived_Estimation PublicLCN 2024, solving worst case delay of relatively complicated network architecture with [1] Trajectory Approach; [2] Network Calculus; [3] Compositional Performance Analysis (CPA); and [4] Flow Aggr…
Python 2
-
-
mit-han-lab/TinyChatEngine
mit-han-lab/TinyChatEngine PublicTinyChatEngine: On-Device LLM Inference Library
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.