Where to insert code to deploy to custom accelerator? #6070
-
I have a custom accelerator that can do matrix multiplications, with an associated C++/C API. It accepts bfloat16 inputs and int4 weights. Can someone help me figure out the best place to insert this API? I am thinking ggml-quant.c. Additionally, the accelerator requires that weights be preloaded into off-chip memory, so it would be nice if there were a pass where I could find all the matrix multiplication ops, preload the weight tensors, and cache an association between each op instance and its device buffer. Any guidance or ideas on this would be greatly appreciated!
-
Does it support only matrix multiplications, or can it potentially do all the other ops as well?
Look into the way the OpenCL backend is implemented. If you need to upload the weights to the accelerator, that is currently the easiest way to do it. Ideally, you would create a full backend implementing the ggml-backend interface, but that's not really an option at the moment for backends that can only do matrix multiplication.