Quantified accelerated method to speed up inference? #105

pajuhaan · 2024-04-13T08:21:32Z

pajuhaan
Apr 13, 2024

Hi, nice to see something in the C like this. this type of code can definitely bring the execution of AI models closer to hardware like DSPs or microcontrollers.
Seeing this project made me want to ask you this question; is it possible to speed up the inference layer by accepting some errors in weights and calculations, using simple shift registers (with pre-calculated shifts) and adders instead of floating/fixed point multiplier units? This approach could be applied at both the code and hardware levels. Does this idea make sense?
It's a doc about it:
Quantified Accelerated Artificial Neural Network Neurons - PJ.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Quantified accelerated method to speed up inference? #105

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Quantified accelerated method to speed up inference? #105

Uh oh!

pajuhaan Apr 13, 2024

Replies: 0 comments

pajuhaan
Apr 13, 2024