Skip to content

merge paged attention feature and moe feature into llama_fp8_12062024 #370

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 11 commits into
base: llama_fp8_12062024
Choose a base branch
from

Merge remote-tracking branch 'EmbeddedLLM/paged-attn-updated' into yu…

07552f2
Select commit
Loading
Failed to load commit list.
Draft

merge paged attention feature and moe feature into llama_fp8_12062024 #370

Merge remote-tracking branch 'EmbeddedLLM/paged-attn-updated' into yu…
07552f2
Select commit
Loading
Failed to load commit list.