[REQ] Delossifier function #356
Replies: 15 comments 41 replies
-
Thanks for the idea. I also see some MP3 restoration models listed here: https://github.com/ZFTurbo/Music-Source-Separation-Training/blob/main/docs/pretrained_models.md#single-stem-models As always, it would be very helpful to get some help from the community to evaluate & validate quality of the projects & available pre-trained models. |
Beta Was this translation helpful? Give feedback.
-
Thanks for the hint and happy new year, @RyanMetcalfeInt8 ! Just added Apollo by @JusperLee under AUDIO \ AI-based \ Restorers:
Anyway I honestly didn't understand exactly how it can work since it relies on music separation nnet, even if:
Hope that active nnet-based "audio restoration" projects' devs (such as @asjad895, @AakashRevankar, @matthewmcq, @shaws34, @bkraad47 and @kroll-software of course) will join this discussion too. If can help I'll try to engage Hydrogenaudio's experts again for qualitative evaluation of restorers' outputs. |
Beta Was this translation helpful? Give feedback.
-
If want to try fast: https://huggingface.co/spaces/patriotyk/Apollo |
Beta Was this translation helpful? Give feedback.
-
@yoongi43 - the Exploiting Time-Frequency Conformers for Music Audio Enhancement author - just noticed about this discussion. |
Beta Was this translation helpful? Give feedback.
-
Just discovered this interesting Audio Super Resolution implementation by @jakeoneijk: Abstract from the relative paper:
Hope that inspires ! |
Beta Was this translation helpful? Give feedback.
-
I think, you all doing a great job pushing this idea further. |
Beta Was this translation helpful? Give feedback.
-
Have someone objectively compared quality of these models (AudioSR, Apollo, FlashSR and maybe something else)? I like both Apollo and FlashSR in terms of quality) |
Beta Was this translation helpful? Give feedback.
-
Bump. Seems that @JusperLee's Apollo is gaining results (and needs optimization on Intel-*PUs)... |
Beta Was this translation helpful? Give feedback.
-
Hi all! I apologize for not contributing much to the thread until now, but I just wrapped up my thesis which I feel has some fairly profound implications for audio restoration. Unfortunately, I did not get the change to directly integrate any of these algorithms into a DL pipeline, but I feel y'all are the type of people who would know what to do with it better than me. TLDR is I developed an optimized algorithm to convert from a DFT spectrum into the set of constituent "true" continuous components with more or less arbitrarily good accuracy. In my thesis I used it to "clean" an STFT and for a pretty novel resampling approach. Code is available here. If you're at all curious to read the actual thesis to get a better idea of the theoretical foundations or to see the results visualized, the pdf is in the folder titled "thesis." Have a great weekend! |
Beta Was this translation helpful? Give feedback.
-
Bump. Just discovered @rajasekarnp1's / @31jay's Neural Audio Upscaler project: models aren't present (but the Project Summary claims "Voice, Music, Ambient and General") and I don't exactly understand inference platform they relies on, but the approach seems interesting. |
Beta Was this translation helpful? Give feedback.
-
Ok, with the help of GH-Copilot I've finally "ported" the latest @kroll-software's AudioDelossifier (and, of course, converted provided models) to run - locally - exploiting OpenVINO. https://github.com/MarcoRavich/OV-AudioDelossifier#readme It's still pretty raw of course, but it seems to infere without errors. I'm waiting for your feedbacks and suggestions to improve it further. |
Beta Was this translation helpful? Give feedback.
-
I really appreciate all your efforts and progress. Some ideas for improvements:
Detlef |
Beta Was this translation helpful? Give feedback.
-
Bump. Thanks to @jarredou's Apollo-Colab-Inference repo I've discovered the Universal model for any lossy files by Lew bringed by @deton24. I've failed to convert it to ONNX format, anyway I'm trying to inference with it as is. Any suggestion @RyanMetcalfeInt8 ? |
Beta Was this translation helpful? Give feedback.
-
Sorry for maybe stupid questions but is it possible to use this on a Nvidia gpu (on a cpu it's soo long) and is there any better delossifier in terms of quality ? |
Beta Was this translation helpful? Give feedback.
-
I'm sorry, but no. I'll write back if they come up.
pon., 7 lip 2025, 17:21 użytkownik Ryan Metcalfe ***@***.***>
napisał:
… Hi @deton24 <https://github.com/deton24> -- Any chance you heard back
from Lew? I've been occasionally checking your GDoc (which is amazing btw),
but didn't see any info there. I guess chances are, you didn't hear
anything -- but I figured I'd check :)
I just really want to release this model with our plugins, as it's pretty
amazing..
—
Reply to this email directly, view it on GitHub
<#356 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AIJ3EHGBPMKOMM5I23NNJXT3HKGBDAVCNFSM6AAAAABUPP3MJCVHI2DSMVQWIX3LMV43URDJONRXK43TNFXW4Q3PNVWWK3TUHMYTGNRYGU2TGOI>
.
You are receiving this because you were mentioned.Message ID:
<intel/openvino-plugins-ai-audacity/repo-discussions/356/comments/13685539
@github.com>
|
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi there,
after the recent cool super resolution feature add, it would be great to have a tool able to "delossify" - as far as possible - compressed audio.
@kroll-software has recently updated their Audio Delossifier and a brunch of pre-trained models (for mp3 delossification) are provided too.
Last but not least, I've just created a simple script to inference on Colab: kroll-software/AudioDelossifier#5 (comment)
Hope that inspires !
Beta Was this translation helpful? Give feedback.
All reactions