There is a discussion about how to add support for fusing operations: #5413. However, I don't think we have a good answer yet for the specific case of |
Hi,
I asked this question on the GGML discussion board with no reply, so I decided to try the llama.cpp one.
I'm working on an NPU backend, and the NPU supports 1D and 2D convolutions natively. AFAIK, GGML implements convolution as a combination of im2col and matmul. I looked into the backend data structure and found graph_plan_create, but it is documented as not being used right now.
Is there a place where I can walk and rewrite the compute graph, so that I can transform im2col + matmul into the corresponding Conv1D and Conv2D operations?
Thanks
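For what it's worth, one pattern backends use is a rewrite pass over the graph's node list before dispatch. The sketch below uses simplified stand-in structs (`tensor`, `cgraph`, the `op` enum, and the `src` slot assignments are illustrative, not ggml's real types or API), just to show the shape of a pass that matches a mul_mat whose source is an im2col and collapses the pair into a single conv node:

```c
#include <assert.h>
#include <stddef.h>

/* Simplified stand-ins for ggml's real types; field names loosely mirror
 * ggml_tensor / ggml_cgraph, but this is an illustrative sketch only. */
enum op { OP_NONE, OP_IM2COL, OP_MUL_MAT, OP_CONV_2D };

struct tensor {
    enum op        op;
    struct tensor *src[2];   /* parent nodes (ggml uses src[GGML_MAX_SRC]) */
};

struct cgraph {
    int            n_nodes;
    struct tensor *nodes[64];
};

/* Walk the graph and rewrite each mul_mat fed by an im2col into a single
 * conv node. Which src slot carries the im2col output is an assumption
 * here; a real pass would check ggml's actual conv lowering. Returns the
 * number of fusions performed. */
int fuse_conv(struct cgraph *g) {
    int fused = 0;
    for (int i = 0; i < g->n_nodes; ++i) {
        struct tensor *mm = g->nodes[i];
        if (mm->op != OP_MUL_MAT) continue;
        struct tensor *im = mm->src[1];
        if (im == NULL || im->op != OP_IM2COL) continue;
        /* Rewrite in place: the mul_mat node becomes a conv node whose
         * sources are the im2col's original kernel and input. */
        mm->op     = OP_CONV_2D;
        mm->src[0] = im->src[0];  /* kernel */
        mm->src[1] = im->src[1];  /* input data */
        im->op     = OP_NONE;     /* dead node; the backend skips these */
        ++fused;
    }
    return fused;
}
```

In a real backend this would run inside the backend's graph_compute callback, since the graph it receives is mutable at that point; the fused node then maps directly onto the NPU's native convolution instead of two separate kernels.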