Replies: 12 comments 32 replies
-
Hi @Brodvd, Super resolution has been on my mind for a while as something that I definitely want to add when I get some spare cycles to dedicate to it. It would be very helpful if you (and others) could give some feedback about which of the projects listed here work the best (from a quality perspective). I've looked at AudioSR before, but I'm not very familiar with the other projects. Anyway, this kind of research is always very time consuming for me.. as I don't want to spend the time porting some models into this framework if they don't appear to work well for general use. Thanks! |
Beta Was this translation helpful? Give feedback.
-
Ok, also I had problems with Audio SR. however if I find something I’ll tell you here. good job! |
Beta Was this translation helpful? Give feedback.
-
Just curious.... problems getting it to run? Or problems as in, you weren't impressed by the quality of the result? |
Beta Was this translation helpful? Give feedback.
-
however I agree on the quality of the result, the examples in the repository are not very good. |
Beta Was this translation helpful? Give feedback.
-
Well, even if the most "popular" (since has been adopted by @IAHispano too in their Audio Upscaler for example) is @haoheliu's AudioSR, the most interesting approach (even if has a slightly different scope) seems Audio Delossifier by @kroll-software since is trainable, IMHO. |
Beta Was this translation helpful? Give feedback.
-
Hey guys, I see in the AudioSR README (https://github.com/haoheliu/versatile_audio_super_resolution?tab=readme-ov-file#readme), there is a 'Demo & Cloud API' link. Seems like it's possible to use it via this Web UI front end... (although I didn't try it). Perhaps this is easier to try it out on your side? If you guys can help prove that this feature works well, it would be super helpful! As I said before, this part is super time consuming for me.. as I don't want to make the mistake of spending a lot of time porting some algorithm / model that only works well for a few specific samples (useless in the general case). Edit: Sorry, I missed the 'Add a payment method to run this model' link on that replicate page. Nevermind about trying to run it there. |
Beta Was this translation helpful? Give feedback.
-
Bump. Seems that @JacobLinCool setuped an AudioSR @ HF and @nateraw has another @ replicate too. Hope they can help. |
Beta Was this translation helpful? Give feedback.
-
@nateraw absolutly. @MarcoRavich I did a simple test of Audio SR at this link https://huggingface.co/spaces/JacobLinCool/audio-super-resolution , this is what it did: as described in the screen is a track with violins and in the second part the addition of percussion. For the violins Audio SR did a good job but didn’t understand the preponderance of percussion, this may be a borderline case but (for the knowledge I have in the field of super resolution) add a CNN that gets detailed track information and then passes the data to the latent spread of Audio SR could be a good idea? |
Beta Was this translation helpful? Give feedback.
-
Hey @MarcoRavich you know other GitHub Projects that use Deep learning(Wave-Unet, GAN...) for re-create the High-frequences of a audio track that aren't in your archive? |
Beta Was this translation helpful? Give feedback.
-
However @RyanMetcalfeInt8 if you want an advice if you need to add a super resolution audio plugin to the openVINO suite as soon as possible you can temporarily include Audio SR. |
Beta Was this translation helpful? Give feedback.
-
Hi all, just a heads up that we've integrated a super resolution feature into the main branch of this project. Next release (coming next few days) will include it. |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello @RyanMetcalfeInt8 , could you try adding a plugin for super resolution audio using openVINO? There are various models even just on GitHub or demo on Hugging Face, unfortunately with some bugs. @MarcoRavich’s store at the Upscalers section seems to be the best. Thank you! https://github.com/FORARTfe/HyMPS/blob/main/Audio/AI-Enhancing.md#---
Beta Was this translation helpful? Give feedback.
All reactions