Popular repositories
-
LLMSpeculativeSampling (Public, forked from feifeibear/LLMSpeculativeSampling)
Fast inference from large language models via speculative decoding
Python
-
Consistency_LLM (Public, forked from hao-ai-lab/Consistency_LLM)
[ICML 2024] CLLMs: Consistency Large Language Models
Python
-
LookaheadDecoding (Public, forked from hao-ai-lab/LookaheadDecoding)
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Python
-
DuoDecoding (Public, forked from KaiLv69/DuoDecoding)
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting
Python
-
GraphSnapShot (Public, forked from NoakLiu/GraphSnapShot)
GraphSnapShot: Caching Local Structure for Fast Graph Learning [Efficient ML System]
Python