Fix slow gguf tests #2846

Open: wants to merge 3 commits into base: main

For-rest2005

This pull request adds a new API model, "builtin_gguf". It is derived from abetlen/llama-cpp-python#1983.
However, it does not produce correct results out of the box because of some errors in the llama-cpp-python API. I have posted an issue on the llama-cpp-python project. In that issue, I describe my fix for the specific needs of "builtin_gguf" along with some optimizations to these APIs. You can modify the llama-cpp-python code manually by following the issue. My changes only cover what is needed when running within "builtin_gguf"; they are not a thorough fix for llama-cpp-python, so I have not opened a pull request there. That is left for others to solve.
By the way, this API model is still incomplete. I hope others can help complete it.

CLAassistant commented Mar 26, 2025

CLA assistant check
All committers have signed the CLA.

2 participants