Where to insert code to deploy to custom accelerator? #6070

Answered by slaren
wilderfield asked this question in Q&A

Take a look at how the OpenCL backend is implemented. If you need to upload the weights to the accelerator, following that approach is currently the easiest way to do it. Ideally, you would write a full backend that implements the ggml-backend interface, but that is not really an option at the moment for backends that can only do matrix multiplication.
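
To make the suggested pattern concrete, here is a minimal sketch of the general idea: upload the weights once, then route only matrix multiplications to the device and fall back to the CPU for everything else. It does not use any real ggml or OpenCL calls. `MatmulAccelerator`, `matmul_kernel`, and `mul_mat` are hypothetical names invented for illustration, and the "device" is simulated with host memory. The actual hooks in the OpenCL backend will differ, so treat this as a shape to look for when reading that code rather than a drop-in implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

// Reference kernel: y[m x n] = x[m x k] * W^T, with W stored row-major as [n x k].
// In a real port, the accelerator's own matmul would replace the device-side call.
static void matmul_kernel(const float * w, const float * x, float * y,
                          std::size_t m, std::size_t n, std::size_t k) {
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t j = 0; j < n; ++j) {
            float sum = 0.0f;
            for (std::size_t p = 0; p < k; ++p) {
                sum += x[i * k + p] * w[j * k + p];
            }
            y[i * n + j] = sum;
        }
    }
}

// Hypothetical stand-in for a device that can only do matrix multiplication.
// Device memory is simulated with a host-side copy of each uploaded weight.
struct MatmulAccelerator {
    std::unordered_map<const float *, std::vector<float>> device_weights;

    // Copy a weight matrix to the "device" once, e.g. at model load time.
    void upload(const float * w, std::size_t rows, std::size_t cols) {
        assert(w != nullptr);
        device_weights[w] = std::vector<float>(w, w + rows * cols);
    }

    // True if this weight matrix has already been uploaded.
    bool has(const float * w) const { return device_weights.count(w) != 0; }

    // Multiply using the device-resident copy of the weights.
    void matmul(const float * w, const float * x, float * y,
                std::size_t m, std::size_t n, std::size_t k) const {
        const std::vector<float> & dw = device_weights.at(w);
        matmul_kernel(dw.data(), x, y, m, n, k);
    }
};

// Dispatch point: offload when the weights live on the device, otherwise
// run on the CPU. Returns true if the accelerator handled the call.
static bool mul_mat(const MatmulAccelerator & acc, const float * w, const float * x,
                    float * y, std::size_t m, std::size_t n, std::size_t k) {
    if (acc.has(w)) {
        acc.matmul(w, x, y, m, n, k);
        return true;
    }
    matmul_kernel(w, x, y, m, n, k);
    return false;
}
```

The design choice here is that the rest of the compute graph stays untouched: only the matmul dispatch checks whether the accelerator owns the weights, which is why this route works for hardware that cannot implement a full backend.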

Answer selected by wilderfield