Help w/ understanding why an old (hacked together) build of koboldcpp has much faster Mixtral prompt processing than mainline? #5227

kalomaze started this conversation in General
Replies: 3 comments, 2 replies

@JohannesGaessler
@kalomaze
2 participants