
Add audio processing module #99


Open · wants to merge 16 commits into main
Conversation

@ladvoc (Contributor) commented Apr 10, 2025

This PR adds support for the WebRTC audio processing module (APM) and enables acoustic echo cancellation (AEC) for microphone tracks.
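For reference, a minimal construction sketch: the parameter names follow the AudioProcessingModule constructor signature that appears in a stack trace further down this thread, while the flag values and everything around them are illustrative assumptions.

// Sketch only: construct the APM with echo cancellation (AEC) enabled.
// How the module is wired into a microphone track is not shown here.
var apm = new AudioProcessingModule(
    echoCancellationEnabled: true,
    noiseSuppressionEnabled: true,
    highPassFilterEnabled: true,
    gainControllerEnabled: true);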

@ladvoc marked this pull request as ready for review April 25, 2025 23:30
@ladvoc requested a review from theomonnom April 25, 2025 23:30
@holofermes (Contributor) commented Apr 27, 2025

Hey all, I've been keeping an eye on this PR and just gave it a spin. While it works on macOS/Windows, it fails on Android arm64 (I assume it would be the same for the other arm architectures).

I think the latest Android .so is not up to date:

NullReferenceException: Object reference not set to an instance of an object.
 at LiveKit.AudioProcessingModule..ctor (System.Boolean echoCancellationEnabled, System.Boolean noiseSuppressionEnabled, System.Boolean highPassFilterEnabled, System.Boolean gainControllerEnabled) [0x00000] in <00000000000000000000000000000000>:0 
 at LiveKit.RtcAudioSource..ctor (System.Int32 channels, LiveKit.RtcAudioSourceType audioSourceType) [0x00000] in <00000000000000000000000000000000>:0 

I noticed install.py does not have android listed in its platforms, so I manually downloaded and replaced ffi-android-arm64/liblivekit_ffi.so and ran a build with it. That doesn't work either: the build fails to load the .so, which I suspect is why install.py does not grab the android builds.

@ladvoc (Contributor, Author) commented Apr 28, 2025

Hi @holofermes, thank you for reporting this. Android should definitely be included as one of the platforms in install.py. I will look into this and make the necessary changes in a separate PR.

{
while (true)
{
Thread.Sleep(Constants.TASK_DELAY);
@theomonnom (Member) commented Apr 28, 2025

I think we're likely going to have a skew here (this will impact the AEC a lot for long room durations).
Is there a way to process the frames directly as we receive them?


private void OnAudioRead(float[] data, int channels, int sampleRate)
{
_captureBuffer.Write(data, (uint)channels, (uint)sampleRate);
Member


We could directly use ProcessReverseStream here?

@@ -101,78 +103,67 @@ private void Update()
while (true)
{
Thread.Sleep(Constants.TASK_DELAY);
Member


We will also get a skew here, so as soon as we're a bit late, we're going to hear bad-quality input (jittery audio).

It's OK to push faster than realtime; the Rust SDKs will handle it in a high-precision queue.
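A rough sketch of what that could look like, draining every complete frame that is already buffered instead of pacing with Thread.Sleep; ReadDuration and FRAME_DURATION_MS come from this PR, while PushCapturedFrame is a placeholder for whatever call actually hands the frame to the SDK:

// Sketch only: send everything that's ready rather than pacing to realtime;
// the Rust SDK's high-precision queue is expected to smooth out delivery.
while (true)
{
    using var frame = _captureBuffer.ReadDuration(AudioProcessingModule.FRAME_DURATION_MS);
    if (frame == null)
        break;                   // no complete frame buffered right now
    PushCapturedFrame(frame);    // placeholder, not an actual API from this PR
}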

@theomonnom (Member)

I see the TASK_DELAY is 5 ms.
We're sending 10 ms frames; this works because Rust will buffer, so maybe it's fine?

@ladvoc (Contributor, Author) commented Apr 30, 2025

Hi @theomonnom, thank you for your feedback. Yes, there does appear to be a skew for longer room durations. I've moved the calls to the APM methods directly into the audio filter callbacks; however, this seems to introduce some audio artifacts that I haven't been able to explain yet. I think the issue is related to the forward stream being processed before the reverse stream, but I need to do more investigation to confirm.
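For reference, the per-frame ordering AEC generally expects from the WebRTC APM, sketched below; ProcessStream is an assumed name for the forward-stream counterpart of ProcessReverseStream and may differ in this binding:

// Assumption: feed the reverse (playback/far-end) frame to the APM before
// processing the corresponding forward (capture/near-end) frame, so AEC has
// the reference signal it needs to subtract the echo.
_apm.ProcessReverseStream(playbackFrame); // playbackFrame: illustrative frame from the playback path
_apm.ProcessStream(captureFrame);         // assumed method name; captureFrame: illustrative frame from the microphone path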

@theomonnom (Member)

> I've moved the calls to the APM methods directly into the audio filter callbacks; however, this seems to introduce some audio artifacts that I haven't been able to explain yet.

I think it's most likely because this function is too slow:

private void OnAudioRead(float[] data, int channels, int sampleRate)

You could also try to increase the default DSP buffer size in Unity.
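A minimal sketch of raising the DSP buffer size at runtime, assuming a standard Unity setup; the value is illustrative, and the same setting can be changed under Project Settings > Audio:

using UnityEngine;

public static class DspBufferConfig
{
    // Sketch: enlarge Unity's DSP buffer before any audio callbacks run.
    // A larger buffer gives the audio callbacks more headroom per invocation,
    // at the cost of some added output latency.
    [RuntimeInitializeOnLoadMethod(RuntimeInitializeLoadType.BeforeSceneLoad)]
    private static void ConfigureDspBuffer()
    {
        var config = AudioSettings.GetConfiguration();
        config.dspBufferSize = 1024;   // illustrative; defaults vary by platform
        AudioSettings.Reset(config);   // note: Reset reinitializes Unity's audio system
    }
}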

Comment on lines +41 to +51
private void OnAudioRead(float[] data, int channels, int sampleRate)
{
    _captureBuffer.Write(data, (uint)channels, (uint)sampleRate);
    while (true)
    {
        using var frame = _captureBuffer.ReadDuration(AudioProcessingModule.FRAME_DURATION_MS);
        if (frame == null) break;

        _apm.ProcessReverseStream(frame);
    }
}
Member


Maybe this one too?
