Help with visualising FFT - sine wave is visualised as noise #527

aSemy · 2022-12-15T21:25:36Z

aSemy
Dec 15, 2022

Hi 👋

I would really appreciate some help trying to visualise the result of an FFT. I've been trying for a few days now, and I just can't crack the code. All I have ever seen is just noise, like there's no difference between any of the frequencies.

The magnitudes for all frequencies bounce around the same values.

The same output occurs whether I use a microphone, or generated sine wave.

A quick summary

Audio is generated from SineWaveGenerator

SineWaveGenerator<int16_t> sineWave(32000);
GeneratedSoundStream<int16_t> generatedSource(sineWave);

void setup() {
  sineWave.begin(AUDIO_CHANNELS, AUDIO_SAMPLE_RATE, N_FS6);
}

the ESP32 hosts a website with a chart, using code adapted from https://github.com/squix78/esp32-mic-fft to plot the FFT using Chart.js
FFT results are sent to the site using websockets (this isn't very performant, it keeps crashing, but the performance of the site isn't important to me right now.

I've tried connecting a real mic. I've verified that the mic does work using a server that streams the audio via a WAVEncoder.

Complete code

Note: add WiFiCredentials.h with

#define WIFI_SSID "my-wifi-ssid"
#define WIFI_PASS "password123"

I'm using PlatformIO:

[env]
platform = espressif32
board = esp32dev
framework = arduino
monitor_port = /dev/cu.SLAB_USBtoUART
monitor_speed = 115200
upload_port = /dev/cu.SLAB_USBtoUART
build_type = debug
monitor_filters = esp32_exception_decoder
lib_deps =
    links2004/WebSockets@^2.3.7
    Wire
    me-no-dev/AsyncTCP @ ^1.1.1
    https://github.com/me-no-dev/ESPAsyncWebServer.git
    https://github.com/pschatzmann/arduino-audio-tools#main

build_flags =
    -DCONFIG_COMPILER_CXX_EXCEPTIONS
    -DPIO_FRAMEWORK_ESP_IDF_ENABLE_EXCEPTIONS
    -fexceptions
    -Wno-unused-variable
    -Wno-unused-but-set-variable
    -Wno-unused-function
    -Wno-format-extra-args

build_unflags = -fno-exceptions -fcolor-diagnostics

[env:esp32dev]
monitor_filters = esp32_exception_decoder

#include <ESPAsyncWebServer.h>
#include <sstream>
#include "WiFiCredentials.h"
#include "AudioTools.h"
#include "AudioLibs/AudioRealFFT.h"
#include "AudioLibs/MemoryManager.h"
#include <ctime>

#include <cstdint>

static const long DEVICE_SERIAL_BAUD = 115200;
static const int LED_PIN = 2;

static const int AUDIO_CHANNELS = 1;
static const int AUDIO_SAMPLE_RATE = 44100;
static const int AUDIO_BITS_PER_SAMPLE = 16;
static const int AUDIO_FFT_LENGTH = 64;// *must* be power of 2
static const int AUDIO_FFT_BINS = AUDIO_FFT_LENGTH / 2;

static const char webpage[] PROGMEM = /* language=HTML */ R"=====(
<html lang="en">
<!-- Adding a data chart using Chart.js -->
<!--https://github.com/squix78/esp32-mic-fft-->
<head>
  <title>ESP32 Audio Visualiser</title>
  <script src='https://cdnjs.cloudflare.com/ajax/libs/Chart.js/2.5.0/Chart.min.js'></script>
</head>
<body onload="javascript:init()">
<div>
  <canvas id="chart" width="600" height="400"></canvas>
</div>
<!-- Adding a websocket to the client (webpage) -->
<script>
  let webSocket, dataPlot;
  // const maxDataPoints = 200;
  const maxValue = 100000;
  const maxLow = maxValue * 0.5;
  const maxMedium = maxValue * 0.2;
  const maxHigh = maxValue * 0.3;

  function init() {
    webSocket = new WebSocket('ws://' + window.location.hostname + ':80/ws');
    dataPlot = new Chart(document.getElementById("chart"), {
      type: 'bar',
      data: {
        labels: [],
        datasets: [{
          data: [],
          label: "Low",
          backgroundColor: "#D6E9C6"
        },
        {
          data: [],
          label: "Moderate",
          backgroundColor: "#FAEBCC"
        },
        {
          data: [],
          label: "High",
          backgroundColor: "#EBCCD1"
        },
        ]
      },
      options: {
          responsive: false,
          animation: false,
          scales: {
              xAxes: [{ stacked: true }],
              yAxes: [{
                  display: true,
                  stacked: true,
                  ticks: {
                    beginAtZero: true,
                    steps: 100,
                    stepValue: 50,
                    max: maxValue
                  }
              }]
           }
       }
    });
    webSocket.onmessage = function(event) {
      const data = JSON.parse(event.data);
      dataPlot.data.labels = [];
      dataPlot.data.datasets[0].data = [];
      dataPlot.data.datasets[1].data = [];
      dataPlot.data.datasets[2].data = [];

      data.forEach(function(element) {
        dataPlot.data.labels.push(element.bin);
        const lowValue = Math.min(maxLow, element.value);
        dataPlot.data.datasets[0].data.push(lowValue);

        const mediumValue = Math.min(Math.max(0, element.value - lowValue), maxMedium);
        dataPlot.data.datasets[1].data.push(mediumValue);

        const highValue = Math.max(0, element.value - lowValue - mediumValue);
        dataPlot.data.datasets[2].data.push(highValue);
      });
      dataPlot.update();
    }
  }

</script>
</body>
</html>
)=====";


/// Toggle the LED from off->on, or on->off
static void toggleLed() {
  digitalWrite(LED_PIN, digitalRead(LED_PIN) ^ HIGH);
}

SineWaveGenerator<int16_t> sineWave(32000);
GeneratedSoundStream<int16_t> generatedSource(sineWave);

AnalogAudioStream audioSource;
AudioRealFFT audioFFT;

StreamCopy fftCopier(audioFFT, generatedSource);

AsyncWebServer httpServer(80);
AsyncWebSocket wsServer("/ws");


void initWiFi() {
  Serial.print("\nConnecting to WiFi...");
  WiFiClass::mode(WIFI_STA);
  WiFi.begin(WIFI_SSID, WIFI_PASS);
  uint8_t connectAttempts = 0;
  while (WiFiClass::status() != WL_CONNECTED) {
    Serial.print(".");
    delay(500);
    if (connectAttempts++ % 10 == 0) {
      Serial.print(" retrying...");
      WiFi.reconnect();
    }
  }
  Serial.println(" connected!");

  WiFi.setSleep(WIFI_PS_NONE);
}


void initWebServer() {
  Serial.print("Configuring HTTP server...");

  httpServer.on("/", HTTP_GET, [](AsyncWebServerRequest *request) {
    request->send_P(
        200,
        "text/html",
        webpage
    );
  });

  httpServer.addHandler(&wsServer);

  Serial.print(" starting...");
  httpServer.begin();
  Serial.println(" started!");
}


bool wsEnabled = false;

float fftBinFrequencies[AUDIO_FFT_BINS] = {0};

void handleFftResult(AudioFFTBase &fft) {
  if (wsEnabled) {
    String json = "[";
    // skip the first bin, it's the fft total
    for (int bin = 1; bin < fft.size(); bin++) {
      // scale, because the magnitudes are huge
      float mag = fft.magnitude(bin) / 1000000.0f;
      float freq = fft.frequency(bin);

      // send the bin label, and stored magnitude
      if (bin > 1) { json += ", "; }
      json += "{\"bin\":";
      json += "\"" + String((int) freq) + "\"";
      json += ",\"value\":";
      json += String((long) mag);
      json += "}";
    }
    json += "]";
    wsServer.textAll(json.c_str());
    toggleLed();
  }
  yield();
}


[[noreturn]]
void updateWebSocketServerTask(void *) {

  TickType_t xLastWakeTime = xTaskGetTickCount();
  const TickType_t xFrequency = pdMS_TO_TICKS(250);

  while (true) {
    wsServer.cleanupClients();
    wsEnabled = wsServer.count() > 0 && wsServer.availableForWriteAll();
    xTaskDelayUntil(&xLastWakeTime, xFrequency);
    yield();
  }
}


void setup() {
  pinMode(LED_PIN, OUTPUT);

  Serial.begin(DEVICE_SERIAL_BAUD);

  initWiFi();

  configTime(0, 0, "pool.ntp.org");

  initWebServer();

  TaskHandle_t updateWebSocketServerTaskHandle;
  xTaskCreatePinnedToCore(
      updateWebSocketServerTask,
      "websocket task",
      2048,
      nullptr,
      5,
      &updateWebSocketServerTaskHandle,
      tskNO_AFFINITY
  );

//  AudioLogger::instance().begin(Serial, AudioLogger::Info);

  AudioBaseInfo baseAudioConf = AudioBaseInfo();
  baseAudioConf.bits_per_sample = AUDIO_BITS_PER_SAMPLE;
  baseAudioConf.channels = AUDIO_CHANNELS;
  baseAudioConf.sample_rate = AUDIO_SAMPLE_RATE;

  Serial.print("Starting input stream...");
  auto audioSourceConf = audioSource.defaultConfig(RX_MODE);
  audioSourceConf.copyFrom(baseAudioConf);
  audioSourceConf.mode_internal = I2S_MODE_MASTER | I2S_MODE_RX | I2S_MODE_ADC_BUILT_IN;
  audioSourceConf.port_no = I2S_NUM_0;
//  audioSourceConf.channel_format = I2S_CHANNEL_FMT_ONLY_LEFT;
  audioSourceConf.use_apll = false;
  audioSourceConf.buffer_count = 2;
  audioSourceConf.buffer_size = 1024;
  audioSourceConf.setInputPin1(GPIO_NUM_35);
  audioSource.begin(audioSourceConf);
  Serial.println(" Input stream started");

  Serial.print("Starting FFT...");
  auto fftConf = audioFFT.defaultConfig();
  fftConf.copyFrom(baseAudioConf);
  fftConf.length = AUDIO_FFT_LENGTH;
  fftConf.callback = &handleFftResult;
//  fftConf.channel_used = 1;
  fftConf.window_function = new BufferedWindow(new Hamming());
  audioFFT.begin(fftConf);
  Serial.println(" FFT started");

  // compute the upper frequency for each bin
  for (int bin = 1; bin <= AUDIO_FFT_BINS; bin++) {
    // formula from AudioFFT.h
    auto freq = static_cast<float>(bin) * static_cast<float>(fftConf.sample_rate) / static_cast<float>(fftConf.length);
    fftBinFrequencies[bin - 1] = freq;
  }

  sineWave.begin(AUDIO_CHANNELS, AUDIO_SAMPLE_RATE, N_FS6);
}


void loop() {
  if (wsEnabled) {
    fftCopier.copy();
  }

  yield();
}

Answered by pschatzmann

Dec 17, 2022

Hmm, It seems there might be some bug in the window function: If you take it out you will get the expected result. Without it I am getting

4694 6371113 => D8 diff: -5 - time ms 1
4694 4546548 => D8 diff: -5 - time ms 0
4694 13356174 => D8 diff: -5 - time ms 0
4694 15973200 => D8 diff: -5 - time ms 0
4694 11184241 => D8 diff: -5 - time ms 0
4694 1209734 => D8 diff: -5 - time ms 0
4694 9325750 => D8 diff: -5 - time ms 0
4694 15537279 => D8 diff: -5 - time ms 1
4694 14544922 => D8 diff: -5 - time ms 0
4694 6808822 => D8 diff: -5 - time ms 0
4694 4084242 => D8 diff: -5 - time ms 0
4694 13083604 => D8 diff: -5 - time ms 0
4694 16016752 => D8 diff: -5 - time ms 0
4694 11523721 => D8 diff: -5 - …

View full answer

pschatzmann · 2022-12-15T22:04:44Z

pschatzmann
Dec 15, 2022
Maintainer

I can only recommend to solve the problem step by step.

send output to CSV stream to check that the input is working properly
add fft output to display the frequency
only if 2 is working start to work on the server code to display the full FFT

ps. how come that your chart has so many bars, but in your code it is supposed to have 32 only?

1 reply

aSemy Dec 16, 2022
Author

I'm sure that the input is working correctly because I've tested it using the WAV server. The sine wave comes through clearly. There's a little noise from the microphone, but that's expected. (I'm not able to use the CSV stream because the serial plotter isn't able to access the serial port, or if it can, then I'm not able to push new versions.)

I'm quite sure that the FFT isn't producing correct data. I've looked at the WebSocket messages and they match what's displayed on the graph - no consistent frequency is determined.

I've been experimenting with different FFT lengths, the screenshot is from when I tried with a larger length.

My guess is that the FFT isn't receiving enough samples to make a good determination. But if I try and increase the samples, it runs out of memory. I'll try experimenting more tomorrow....

pschatzmann · 2022-12-16T02:49:44Z

pschatzmann
Dec 16, 2022
Maintainer

I guess you're right: Most likely the issue lies in the output handling and you need to decouple the fft output from the rendering of the output:

I would write the output into a QueueFreeRTOS< T > of size 2 or into any synchronized data structure. If the queue is full I would consume the oldest record to make sure the queue contains only 2 records with actual data (to cover the speed difference)
The rendering (including wsServer.textAll) would need to be done in the separate task picking up the data from the queue.

ps.
Measure the fft output speed w/o any output and then add an assert to guarantee that the added output logic has no impact.

0 replies

aSemy · 2022-12-16T22:41:19Z

aSemy
Dec 16, 2022
Author

Here's another simpler experiment. A generated sign wave (frequency set to N_D8) is piped into the FFT. The result of the FFT is printed to serial.

#include <sstream>
#include "WiFiCredentials.h"
#include "AudioTools.h"
#include "AudioLibs/AudioRealFFT.h"
#include "AudioLibs/MemoryManager.h"

static const long DEVICE_SERIAL_BAUD = 115200;
static const int LED_PIN = 2;

static const int AUDIO_CHANNELS = 1;
static const int AUDIO_SAMPLE_RATE = 44100;
static const int AUDIO_BITS_PER_SAMPLE = 16;
static const int AUDIO_FFT_LENGTH = 1024; // *must* be power of 2
static const int AUDIO_FFT_BINS = AUDIO_FFT_LENGTH / 2;


SineWaveGenerator<int16_t> sineWave(32000);
GeneratedSoundStream<int16_t> generatedAudioSource(sineWave);
AudioRealFFT audioFFT;

StreamCopy fftCopier(audioFFT, generatedAudioSource);


// display fft result
void handleFftResult(AudioFFTBase &fft) {
  int diff;
  auto result = fft.result();
  if (result.magnitude > 100) {
    Serial.print(result.frequency);
    Serial.print(" ");
    Serial.print(result.magnitude);
    Serial.print(" => ");
    Serial.print(result.frequencyAsNote(diff));
    Serial.print(" diff: ");
    Serial.print(diff);
    Serial.print(" - time ms ");
    Serial.print(fft.resultTime() - fft.resultTimeBegin());
    Serial.println();
  }
}


void setup() {
  pinMode(LED_PIN, OUTPUT);

  Serial.begin(DEVICE_SERIAL_BAUD);

  sineWave.setFrequency(N_D8);

  AudioBaseInfo baseAudioConf = AudioBaseInfo();
  baseAudioConf.bits_per_sample = AUDIO_BITS_PER_SAMPLE;
  baseAudioConf.channels = AUDIO_CHANNELS;
  baseAudioConf.sample_rate = AUDIO_SAMPLE_RATE;

  auto generatedSourceConf = generatedAudioSource.defaultConfig();
  generatedSourceConf.copyFrom(baseAudioConf);
  generatedAudioSource.begin(generatedSourceConf);

  Serial.print("Starting FFT...");
  auto fftConf = audioFFT.defaultConfig();
  fftConf.copyFrom(baseAudioConf);
  fftConf.length = AUDIO_FFT_LENGTH;
  fftConf.callback = &handleFftResult;
//  fftConf.channel_used = 1;
  fftConf.window_function = new BufferedWindow(new Hamming());
  audioFFT.begin(fftConf);
  Serial.println(" FFT started");
}


void loop() {
  fftCopier.copy();
}

The text output shows B8, but I would expect to see D8

I see ovf, which I think means the magnitudes are larger than the number and are overflowing? That matches what I've seen previously, the magnitudes generated by the FFT are enormous.

There's also occasional F1 notes in there.

9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
43.07 ovf => F1 diff: -1 - time ms 0
9388.48 ovf => B8 diff: 1486 - time ms 0
9388.48 ovf => B8 diff: 1486 - time ms 0
9388.48 ovf => B8 diff: 1486 - time ms 0
9388.48 ovf => B8 diff: 1486 - time ms 0
9388.48 ovf => B8 diff: 1486 - time ms 0
9345.41 ovf => B8 diff: 1443 - time ms 0
21963.87 ovf => B8 diff: 14062 - time ms 0
9388.48 ovf => B8 diff: 1486 - time ms 0
9345.41 ovf => B8 diff: 1443 - time ms 0
9388.48 ovf => B8 diff: 1486 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
43.07 ovf => F1 diff: -1 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
21963.87 ovf => B8 diff: 14062 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9345.41 ovf => B8 diff: 1443 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 1
9388.48 ovf => B8 diff: 1486 - time ms 0

If I change the note

sineWave.setFrequency(N_A0);

then the output changes, but it doesn't match the generated tone.

43.07 ovf => F1 diff: -1 - time ms 1
129.20 ovf => C3 diff: -2 - time ms 1
43.07 ovf => F1 diff: -1 - time ms 1
43.07 ovf => F1 diff: -1 - time ms 1
43.07 ovf => F1 diff: -1 - time ms 1
43.07 ovf => F1 diff: -1 - time ms 1
129.20 ovf => C3 diff: -2 - time ms 0
129.20 ovf => C3 diff: -2 - time ms 0
43.07 ovf => F1 diff: -1 - time ms 0
43.07 ovf => F1 diff: -1 - time ms 0
43.07 ovf => F1 diff: -1 - time ms 0
43.07 ovf => F1 diff: -1 - time ms 1
129.20 ovf => C3 diff: -2 - time ms 0
43.07 ovf => F1 diff: -1 - time ms 0
43.07 ovf => F1 diff: -1 - time ms 1

0 replies

pschatzmann · 2022-12-17T10:53:12Z

pschatzmann
Dec 17, 2022
Maintainer

Hmm, It seems there might be some bug in the window function: If you take it out you will get the expected result. Without it I am getting

4694 6371113 => D8 diff: -5 - time ms 1
4694 4546548 => D8 diff: -5 - time ms 0
4694 13356174 => D8 diff: -5 - time ms 0
4694 15973200 => D8 diff: -5 - time ms 0
4694 11184241 => D8 diff: -5 - time ms 0
4694 1209734 => D8 diff: -5 - time ms 0
4694 9325750 => D8 diff: -5 - time ms 0
4694 15537279 => D8 diff: -5 - time ms 1
4694 14544922 => D8 diff: -5 - time ms 0
4694 6808822 => D8 diff: -5 - time ms 0
4694 4084242 => D8 diff: -5 - time ms 0
4694 13083604 => D8 diff: -5 - time ms 0
4694 16016752 => D8 diff: -5 - time ms 0
4694 11523721 => D8 diff: -5 - time ms 0
4694 1687725 => D8 diff: -5 - time ms 0

0 replies

aSemy · 2022-12-17T11:28:36Z

aSemy
Dec 17, 2022
Author

Yes!!! Removing the windowing produces something a lot more reasonable. Here's a screenshot, but viewing it live shows a sine wave!

I'm not sure why the lower frequencies are maxing out...

Here's an updated code sample.

Changes

used EventSourcing instead of websockets
added delay(100); in the FFT handler to slow it down

#include <ESPAsyncWebServer.h>
#include <sstream>
#include "WiFiCredentials.h"
#include "AudioTools.h"
#include "AudioLibs/AudioRealFFT.h"
#include "AudioLibs/MemoryManager.h"
#include <ctime>

#include <cstdint>

static const long DEVICE_SERIAL_BAUD = 115200;
static const int LED_PIN = 2;

static const int AUDIO_CHANNELS = 1;
static const int AUDIO_SAMPLE_RATE = 44100;
static const int AUDIO_BITS_PER_SAMPLE = 16;
static const int AUDIO_FFT_LENGTH = 64;// *must* be power of 2
static const int AUDIO_FFT_BINS = AUDIO_FFT_LENGTH / 2;

static const char webpage[] PROGMEM = /* language=HTML */ R"=====(<html lang="en">
<!-- Adding a data chart using Chart.js -->
<!--https://github.com/squix78/esp32-mic-fft-->
<head>
  <title>ESP32 Audio Visualiser</title>
  <script src='https://cdnjs.cloudflare.com/ajax/libs/Chart.js/2.5.0/Chart.min.js'></script>
</head>
<body onload="javascript:init()">
<div>
  <canvas id="chart" width="600" height="400"></canvas>
</div>
<!-- Adding a websocket to the client (webpage) -->
<script>
  let dataPlot;
  const maxValue = 100000;
  const maxLow = maxValue * 0.5;
  const maxMedium = maxValue * 0.2;
  const maxHigh = maxValue * 0.3;

  function init() {
    dataPlot = new Chart(document.getElementById("chart"), {
      type: 'bar',
      data: {
        labels: [],
        datasets: [{
          data: [],
          label: "Low",
          backgroundColor: "#D6E9C6"
        },
          {
            data: [],
            label: "Moderate",
            backgroundColor: "#FAEBCC"
          },
          {
            data: [],
            label: "High",
            backgroundColor: "#EBCCD1"
          },
        ]
      },
      options: {
        responsive: false,
        animation: false,
        scales: {
          xAxes: [{stacked: true}],
          yAxes: [{
            display: true,
            stacked: true,
            ticks: {
              beginAtZero: true,
              steps: 100,
              stepValue: 50,
              max: maxValue
            }
          }]
        }
      }
    });
    if (!!window.EventSource) {
      const source = new EventSource('/events');

      source.addEventListener('open', function (e) {
        console.log("Events Connected");
      }, false);

      source.addEventListener('error', function (e) {
        if (e.target.readyState !== EventSource.OPEN) {
          console.log("Events Disconnected");
        }
      }, false);

      source.addEventListener('fft', function (e) {
        const data = JSON.parse(e.data);
        dataPlot.data.labels = [];
        dataPlot.data.datasets[0].data = [];
        dataPlot.data.datasets[1].data = [];
        dataPlot.data.datasets[2].data = [];

        data.forEach(function (element) {
          dataPlot.data.labels.push(element.bin);
          const lowValue = Math.min(maxLow, element.value);
          dataPlot.data.datasets[0].data.push(lowValue);

          const mediumValue = Math.min(Math.max(0, element.value - lowValue), maxMedium);
          dataPlot.data.datasets[1].data.push(mediumValue);

          const highValue = Math.max(0, element.value - lowValue - mediumValue);
          dataPlot.data.datasets[2].data.push(highValue);
        });
        dataPlot.update();
      }, false);
    }
  }

</script>
</body>
</html>
)=====";


/// Toggle the LED from off->on, or on->off
static void toggleLed() {
  digitalWrite(LED_PIN, digitalRead(LED_PIN) ^ HIGH);
}

SineWaveGenerator<int16_t> sineWave(32000);
GeneratedSoundStream<int16_t> generatedSource(sineWave);

AudioRealFFT audioFFT;

StreamCopy fftCopier(audioFFT, generatedSource);

AsyncWebServer httpServer(80);
AsyncEventSource events("/events");


void initWiFi() {
  Serial.print("\nConnecting to WiFi...");
  WiFiClass::mode(WIFI_STA);
  WiFi.begin(WIFI_SSID, WIFI_PASS);
  uint8_t connectAttempts = 1;
  while (WiFiClass::status() != WL_CONNECTED) {
    Serial.print(".");
    delay(500);
    if (connectAttempts++ % 10 == 0) {
      Serial.print(" retrying...");
      WiFi.reconnect();
    }
  }
  Serial.println(" connected!");

  WiFi.setSleep(WIFI_PS_NONE);
}


void initWebServer() {
  Serial.print("Configuring HTTP server...");

  httpServer.on("/", HTTP_GET, [](AsyncWebServerRequest *request) {
    request->send_P(
        200,
        "text/html",
        webpage
    );
  });

  events.onConnect([](AsyncEventSourceClient *client) {
    if (client->lastId()) {
      Serial.printf("Client reconnected! Last message ID: %u\n", client->lastId());
    }
    //send event with message "hello!", id current millis, reconnect delay of 1 second
    client->send("hello!", NULL, millis(), 1000);
  });
  httpServer.addHandler(&events);

  Serial.print(" starting...");
  httpServer.begin();
  Serial.println(" started!");
}

void handleFftResult(AudioFFTBase &fft) {
  String json = "[";
  // skip the first bin, it's the fft total
  for (int bin = 1; bin < fft.size(); bin++) {
    auto mag = (long) fft.magnitude(bin);
    auto freq = (int) fft.frequency(bin);

    // send the bin label, and stored magnitude
    if (bin > 1) { json += ", "; }
    json += "{\"bin\":";
    json += "\"" + String(freq) + "\"";
    json += ",\"value\":";
    json += String(mag);
    json += "}";
  }
  json += "]";
  events.send(json.c_str(), "fft", millis());
  toggleLed();
  delay(100);
  yield();
}

void setup() {
  pinMode(LED_PIN, OUTPUT);

  Serial.begin(DEVICE_SERIAL_BAUD);

  initWiFi();

  configTime(0, 0, "pool.ntp.org");

  initWebServer();

//  AudioLogger::instance().begin(Serial, AudioLogger::Info);

  AudioBaseInfo baseAudioConf = AudioBaseInfo();
  baseAudioConf.bits_per_sample = AUDIO_BITS_PER_SAMPLE;
  baseAudioConf.channels = AUDIO_CHANNELS;
  baseAudioConf.sample_rate = AUDIO_SAMPLE_RATE;

  Serial.print("Starting FFT...");
  auto fftConf = audioFFT.defaultConfig();
  fftConf.copyFrom(baseAudioConf);
  fftConf.length = AUDIO_FFT_LENGTH;
  fftConf.callback = &handleFftResult;
//  fftConf.channel_used = 1;
//  fftConf.window_function = new BufferedWindow(new Hamming());
  audioFFT.begin(fftConf);
  Serial.println(" FFT started");

  sineWave.begin(AUDIO_CHANNELS, AUDIO_SAMPLE_RATE, N_FS6);
}

void loop() {
  fftCopier.copy();

  yield();
}

0 replies

pschatzmann · 2022-12-17T12:31:19Z

pschatzmann
Dec 17, 2022
Maintainer

Not sure what this is the result of the fft gives

0: 60179.2
86.1328: 58527.6
172.266: 57364
258.398: 57039.8
344.531: 57766.3
430.664: 59617.3
516.797: 62507.8
602.93: 66251
689.062: 70675
775.195: 75682.7
861.328: 81438.7
947.461: 88424.5
1033.59: 97716.9
1119.73: 111538
1205.86: 134850
1291.99: 181182
1378.12: 308994
1464.26: 1.84957e+06
1550.39: 381521
1636.52: 157404
1722.66: 92937
1808.79: 63527

0 replies

aSemy · 2022-12-17T12:45:33Z

aSemy
Dec 17, 2022
Author

Here's another example with my MAX9814 kind of working...

I ran the below code and played a 1900Hz tone through my speakers, and held the esp32's mic up against it

While I could see a clear peak at ~1.9KHz, there's also another peak ~20KHz, mirrored on the central frequency. The mirrored peak is more apparent when I scrub through higher frequencies. I'm not sure why this is, maybe because there's no windowing?

Changes:

event messages use a JSON array rather than a JSON object (faster encode/decode)
I manually edited the source code to allow for customisation of audioSourceConf.channel_format = I2S_CHANNEL_FMT_ONLY_LEFT; (see Allow configuration of AnalogDriverESP32 channel_format #526). I'm not sure if it makes a difference.
Because all the magnitudes seem to start at 3000 (shouldn't they start at 0?) I scaled all magnitudes by subtracting the minimum magnitude per FFT result .

#include <ESPAsyncWebServer.h>
#include <sstream>
#include "WiFiCredentials.h"
#include "AudioTools.h"
#include "AudioLibs/AudioRealFFT.h"
#include "AudioLibs/MemoryManager.h"
#include <ctime>

#include <cstdint>

static const long DEVICE_SERIAL_BAUD = 115200;
static const int LED_PIN = 2;

static const int AUDIO_CHANNELS = 1;
static const int AUDIO_SAMPLE_RATE = 44100;
static const int AUDIO_BITS_PER_SAMPLE = 16;
static const int AUDIO_FFT_LENGTH = 256;// *must* be power of 2
static const int AUDIO_FFT_BINS = AUDIO_FFT_LENGTH / 2;

static const char webpage[] PROGMEM = /* language=HTML */ R"=====(<html lang="en">
<!-- Adding a data chart using Chart.js -->
<!--https://github.com/squix78/esp32-mic-fft-->
<head>
  <title>ESP32 Audio Visualiser</title>
  <script src='https://cdnjs.cloudflare.com/ajax/libs/Chart.js/2.5.0/Chart.min.js'></script>
</head>
<body onload="javascript:init()">
<div>
  <canvas id="chart" width="600" height="400"></canvas>
</div>
<!-- Adding a websocket to the client (webpage) -->
<script>
  let dataPlot;
  const maxValue = 100000;
  const maxLow = maxValue * 0.5;
  const maxMedium = maxValue * 0.2;
  const maxHigh = maxValue * 0.3;

  function init() {
    dataPlot = new Chart(document.getElementById("chart"), {
      type: 'bar',
      data: {
        labels: [],
        datasets: [{
          data: [],
          label: "Low",
          backgroundColor: "#D6E9C6"
        },
          {
            data: [],
            label: "Moderate",
            backgroundColor: "#FAEBCC"
          },
          {
            data: [],
            label: "High",
            backgroundColor: "#EBCCD1"
          },
        ]
      },
      options: {
        responsive: false,
        animation: false,
        scales: {
          xAxes: [{stacked: true}],
          yAxes: [{
            display: true,
            stacked: true,
            ticks: {
              beginAtZero: true,
              steps: 100,
              stepValue: 50,
              max: maxValue
            }
          }]
        }
      }
    });
    if (!!window.EventSource) {
      const source = new EventSource('/events');

      source.addEventListener('open', function (e) {
        console.log("Events Connected");
      }, false);

      source.addEventListener('error', function (e) {
        if (e.target.readyState !== EventSource.OPEN) {
          console.log("Events Disconnected");
        }
      }, false);

      source.addEventListener('fft', function (e) {
        dataPlot.data.labels = [];
        dataPlot.data.datasets[0].data = [];
        dataPlot.data.datasets[1].data = [];
        dataPlot.data.datasets[2].data = [];

        const data = JSON.parse(e.data);
        data.forEach(function (element) {
          const frequency = element[0];
          const magnitude = element[1];

          dataPlot.data.labels.push(frequency);

          const lowValue = Math.min(maxLow, magnitude);
          dataPlot.data.datasets[0].data.push(lowValue);

          const mediumValue = Math.min(Math.max(0, magnitude - lowValue), maxMedium);
          dataPlot.data.datasets[1].data.push(mediumValue);

          const highValue = Math.max(0, magnitude - lowValue - mediumValue);
          dataPlot.data.datasets[2].data.push(highValue);
        });
        dataPlot.update();
      }, false);
    }
  }

</script>
</body>
</html>
)=====";

/// Toggle the LED from off->on, or on->off
static void toggleLed() {
  digitalWrite(LED_PIN, digitalRead(LED_PIN) ^ HIGH);
}

AnalogAudioStream audioSource;
AudioRealFFT audioFFT;

StreamCopy fftCopier(audioFFT, audioSource);

AsyncWebServer httpServer(80);
AsyncEventSource events("/events");


void initWiFi() {
  Serial.print("\nConnecting to WiFi...");
  WiFiClass::mode(WIFI_STA);
  WiFi.begin(WIFI_SSID, WIFI_PASS);
  uint8_t connectAttempts = 1;
  while (WiFiClass::status() != WL_CONNECTED) {
    Serial.print(".");
    delay(500);
    if (connectAttempts++ % 10 == 0) {
      Serial.print(" retrying...");
      WiFi.reconnect();
    }
  }
  Serial.println(" connected!");

  WiFi.setSleep(WIFI_PS_NONE);
}


void initWebServer() {
  Serial.print("Configuring HTTP server...");

  httpServer.on("/", HTTP_GET, [](AsyncWebServerRequest *request) {
    request->send_P(
        200,
        "text/html",
        webpage
    );
  });

  events.onConnect([](AsyncEventSourceClient *client) {
    if (client->lastId()) {
      Serial.printf("Client reconnected! Last message ID: %u\n", client->lastId());
    }
    // send event with message "hello!", id current millis, reconnect delay of 1 second
    client->send("hello!", NULL, millis(), 1000);
  });
  httpServer.addHandler(&events);

  Serial.print(" starting...");
  httpServer.begin();
  Serial.println(" started!");
}

void handleFftResult(AudioFFTBase &fft) {
  String json = "[";

  // skip bin 0, it's sum of all fft bins
  const int FIRST_BIN = 1;

  auto minMagnitude = 0;
  auto maxMagnitude = 0;
  for (int bin = FIRST_BIN; bin < fft.size(); bin++) {
    auto mag = (long) fft.magnitude(bin);

    if (minMagnitude == 0 || mag < minMagnitude) {
      minMagnitude = mag;
    } else if (mag > maxMagnitude) {
      maxMagnitude = mag;
    }
  }

  for (int bin = FIRST_BIN; bin < fft.size(); bin++) {
    auto freq = (int) fft.frequency(bin);
    auto mag = ((long) fft.magnitude(bin)) - minMagnitude;

    // send the frequency and magnitude
    if (bin > FIRST_BIN) { json += ", "; }
    json += "[" + String(freq) + "," + String(mag) + "]";
  }

  json += "]";
  events.send(json.c_str(), "fft", millis());
  toggleLed();
  delay(100);
  yield();
}

void setup() {
  pinMode(LED_PIN, OUTPUT);

  Serial.begin(DEVICE_SERIAL_BAUD);

  initWiFi();

  configTime(0, 0, "pool.ntp.org");

  initWebServer();

//  AudioLogger::instance().begin(Serial, AudioLogger::Info);

  AudioBaseInfo baseAudioConf = AudioBaseInfo();
  baseAudioConf.bits_per_sample = AUDIO_BITS_PER_SAMPLE;
  baseAudioConf.channels = AUDIO_CHANNELS;
  baseAudioConf.sample_rate = AUDIO_SAMPLE_RATE;

  Serial.print("Starting input stream...");
  auto audioSourceConf = audioSource.defaultConfig(RX_MODE);
  audioSourceConf.copyFrom(baseAudioConf);
  audioSourceConf.mode_internal = I2S_MODE_MASTER | I2S_MODE_RX | I2S_MODE_ADC_BUILT_IN;
  audioSourceConf.port_no = I2S_NUM_0;
  audioSourceConf.channel_format = I2S_CHANNEL_FMT_ONLY_LEFT;
  audioSourceConf.use_apll = false;
  audioSourceConf.buffer_count = 2;
  audioSourceConf.buffer_size = 1024;
  audioSourceConf.setInputPin1(GPIO_NUM_35);
  audioSource.begin(audioSourceConf);
  Serial.println(" Input stream started");

  Serial.print("Starting FFT...");
  auto fftConf = audioFFT.defaultConfig();
  fftConf.copyFrom(baseAudioConf);
  fftConf.length = AUDIO_FFT_LENGTH;
  fftConf.callback = &handleFftResult;
//  fftConf.channel_used = 1;
//  fftConf.window_function = new BufferedWindow(new Hamming());
  audioFFT.begin(fftConf);
  Serial.println(" FFT started");
}


void loop() {
  fftCopier.copy();
}

0 replies

pschatzmann · 2022-12-17T17:25:57Z

pschatzmann
Dec 17, 2022
Maintainer

I would would have expected to see some increased amounts on any harmonics of the signal.
By the way I corrected the broken window functions and stride with my last commit

0 replies

aSemy · 2023-08-02T11:14:09Z

aSemy
Aug 2, 2023
Author

Thanks @pschatzmann! I've picked my project back up again with the newer version and it's working much better. Thanks for the fixes!

However, I think I've found a minor issue with the FFT frequency computation. The AudioFFTBase.frequency(int bin) function seems to return a frequency that is half of the expected value.

arduino-audio-tools/src/AudioLibs/AudioFFT.h

Lines 195 to 202 in 6303c74

    
           /// Determines the frequency of the indicated bin 
        
           float frequency(int bin){ 
        
               if (bin>=bins){ 
        
                   LOGE("Invalid bin %d", bin); 
        
                   return 0; 
        
               } 
        
               return static_cast<float>(bin) * cfg.sample_rate / cfg.length; 
        
           }

I believe this is because it is using the FFT length. However, there are only length/2 bins.

I can manually compute the frequency as follows:

static const int AUDIO_SAMPLE_RATE = 44100;
static const int AUDIO_FFT_LENGTH =  4096; // ~10Hz per bin resolution

void handleFftResult(AudioFFTBase &fft) {
  // skip bin 0, it's sum of all fft bins
  const int FIRST_BIN = 1;

  auto minMagnitude = fft.magnitude(FIRST_BIN);
  auto maxMagnitude = minMagnitude;
  int binWithMaxMagnitude = 0;

  for (int bin = FIRST_BIN; bin < AUDIO_FFT_BINS; bin++) {
    auto mag = fft.magnitude(bin);
    if (mag > maxMagnitude) {
      maxMagnitude = mag;
      binWithMaxMagnitude = bin;
    }
  }

  //float frequencyWithMaxMagnitude = fft.frequency(binWithMaxMagnitude);
  float frequencyWithMaxMagnitude = (2.f * static_cast<float>(binWithMaxMagnitude) * static_cast<float>(AUDIO_SAMPLE_RATE)) / static_cast<float>(AUDIO_FFT_LENGTH);

  Serial.print(" binWithMaxMagnitude:");
  Serial.print(binWithMaxMagnitude);
  Serial.print(", frequencyWithMaxMagnitude:");
  Serial.println(frequencyWithMaxMagnitude);
}

(I'm not sure if the static_cast<>s are needed - I'm still hacking around...)

This seems to work as expected.

I could be completely wrong though and AudioFFTBase.frequency(int bin) is returning the correct value, and I've got some other bug in my code!

2 replies

pschatzmann Aug 2, 2023
Maintainer

Since you did not provide your full code, it's difficult to say what you are doing wrong. The following example was working perfectly for me and I never had any problem in getting the wrong frequency: https://www.pschatzmann.ch/home/2022/11/06/freertos-addons-a-c-api-for-freertos/..

I suggest that you have a look at the implementation of the result() method.

aSemy Aug 13, 2023
Author

Thanks for the pointers! I must have made a mistake, or possibly I needed to clean my project to make sure I had the right dependencies. Either way, I can now visualise the FFT result!

I've got 3 ESP32 devices that analyse the audio, and I plot the results in Grafana.

Thanks so much for your help and library @pschatzmann!

Uh oh!

Help with visualising FFT - sine wave is visualised as noise #527

Uh oh!

aSemy Dec 15, 2022

Complete code

Replies: 9 comments · 3 replies

Uh oh!

Uh oh!

pschatzmann Dec 15, 2022 Maintainer

Uh oh!

aSemy Dec 16, 2022 Author

Uh oh!

Uh oh!

pschatzmann Dec 16, 2022 Maintainer

Uh oh!

aSemy Dec 16, 2022 Author

Uh oh!

Uh oh!

pschatzmann Dec 17, 2022 Maintainer

Uh oh!

Uh oh!

aSemy Dec 17, 2022 Author

Uh oh!

pschatzmann Dec 17, 2022 Maintainer

Uh oh!

aSemy Dec 17, 2022 Author

Uh oh!

pschatzmann Dec 17, 2022 Maintainer

Uh oh!

aSemy Aug 2, 2023 Author

Uh oh!

Uh oh!

pschatzmann Aug 2, 2023 Maintainer

Uh oh!

aSemy Aug 13, 2023 Author

aSemy
Dec 15, 2022

Replies: 9 comments 3 replies

pschatzmann
Dec 15, 2022
Maintainer

aSemy Dec 16, 2022
Author

pschatzmann
Dec 16, 2022
Maintainer

aSemy
Dec 16, 2022
Author

pschatzmann
Dec 17, 2022
Maintainer

aSemy
Dec 17, 2022
Author

pschatzmann
Dec 17, 2022
Maintainer

aSemy
Dec 17, 2022
Author

pschatzmann
Dec 17, 2022
Maintainer

aSemy
Aug 2, 2023
Author

pschatzmann Aug 2, 2023
Maintainer

aSemy Aug 13, 2023
Author