Branching out on model support #175

Open
pcfreak30 opened this issue Feb 19, 2025 · 0 comments

@b4rtaz I am opening this issue because I would like to understand, at a high level, what is required to support other model families.

Two that come to mind are the actual DeepSeek R1 and the Qwen families. But my question is also more abstract: does every effort have to be a case-by-case reverse engineering of the model, or can the process of making a model "distributed" be abstracted and generalized? I'm aware that you have an HF script, but I'm not yet sure of the details of what the conversions are doing.

I just found this project after spending a decent amount of time researching and testing llama's gRPC via LocalAI.

Kudos.
