OpenVINO backend's speed is disappointing. #1858

barolo · 2024-02-11T23:38:38Z

I've been testing whisper.cpp with the OpenVINO backend on a GPU [integrated Xe GPU]. While it makes things a bit faster [slightly lower CPU usage with maxed-out threads], it only uses the GPU lightly [~20-40%] and in spaced-out bursts.
Recently I tried HF Whisper with their optimum+OpenVINO GPU backend, and it's the other way around: the GPU is constantly maxed out and the CPU is only lightly used. It's also blazing fast, orders of magnitude faster.

What's the reason behind such a discrepancy in behaviour? Is it HF's chunking?
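For readers unsure what "chunking" means here: the HF speech-recognition pipeline can split long audio into fixed-length, overlapping windows that can then be batched through the model. Below is a minimal sketch of that windowing idea only, not the HF implementation. The function name `chunk_spans`, the 30 s window, and the 5 s overlap on each side are illustrative assumptions, and nothing here measures or explains the GPU usage difference by itself.

```python
def chunk_spans(n_samples, chunk_len, stride):
    """Return (start, end) sample spans of overlapping windows covering the input.

    Hypothetical helper for illustration: consecutive windows overlap by
    2 * stride samples, so each window advances by chunk_len - 2 * stride.
    """
    if chunk_len <= 2 * stride:
        raise ValueError("chunk_len must be larger than 2 * stride")
    step = chunk_len - 2 * stride
    spans = []
    for start in range(0, n_samples, step):
        # Clamp the last window to the end of the audio.
        end = min(start + chunk_len, n_samples)
        spans.append((start, end))
        if end == n_samples:
            break
    return spans


SAMPLE_RATE = 16_000  # Whisper models expect 16 kHz audio

# 70 s of audio, 30 s windows, 5 s overlap on each side (assumed values)
spans = chunk_spans(
    n_samples=70 * SAMPLE_RATE,
    chunk_len=30 * SAMPLE_RATE,
    stride=5 * SAMPLE_RATE,
)
print([(s // SAMPLE_RATE, e // SAMPLE_RATE) for s, e in spans])
# → [(0, 30), (20, 50), (40, 70)]
```

Because the windows are independent of each other, they can be fed to the accelerator as a batch, which is one plausible way a device could be kept busier than with strictly sequential processing.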

pharmacologic · 2025-04-19T17:49:35Z

I just got whisper.cpp+OpenVINO working on my own iGPU (ended up needing the 'legacy' packages for the Intel compute runtime with a 6th-gen i5), and I noticed the same behavior: mostly CPU usage, only slightly lower than without OpenVINO, with spaced-out bursts on the GPU.

So I'm curious about this too.

1 reply

barolo Apr 20, 2025
Author

I think they've pretty much abandoned OpenVINO and are going with ipex+SYCL instead. I tested SYCL acceleration recently and it worked, though it was really annoying to set up.
