Skip to content

perf : CUDA FA is slower with Gemma models. Is this expected? #10684

Unanswered
ggerganov asked this question in Q&A
Discussion options

You must be logged in to vote

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@ggerganov
Comment options

ggerganov Dec 6, 2024
Maintainer Author

@JohannesGaessler
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
question Further information is requested
2 participants