Convert from FFT back to audio?

BennyHacker · April 17, 2021, 10:48pm

Hello forum. I have used the Processing Sound library to analyze an audio file and get its spectrum as an array of floats.

Is there a well-understood way to convert this array back to audio?

Edit: I should clarify that I have an array of FFT values, and I am trying to turn this array back into an audio sample.

micuat · April 18, 2021, 9:01am

Hi! The inverse FFT is almost the same as the FFT, and you can usually reuse the same algorithm with a bit of tweaking to recover the data:
https://www.dsprelated.com/showarticle/800.php
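
To make the "same algorithm with a bit of tweaking" idea concrete, here is a minimal sketch, assuming you have the full complex spectrum (real and imaginary parts). It uses a naive O(N^2) DFT rather than a fast FFT to keep it short, and the names `InverseViaForward`, `dft`, and `idft` are made up for this illustration; they are not part of the Processing Sound library. The tweak shown is one common choice: conjugate the spectrum, run the forward transform, conjugate again, and divide by N.

```java
// Sketch: inverse DFT computed by reusing the forward DFT (naive, not a fast FFT).
final class InverseViaForward {
    // Forward DFT: X[k] = sum over t of x[t] * exp(-2*pi*i*k*t/N). Returns {real, imag}.
    static double[][] dft(double[] re, double[] im) {
        int n = re.length;
        double[] outRe = new double[n];
        double[] outIm = new double[n];
        for (int k = 0; k < n; k++) {
            for (int t = 0; t < n; t++) {
                double angle = -2.0 * Math.PI * k * t / n;
                double c = Math.cos(angle);
                double s = Math.sin(angle);
                // Complex multiply (re + i*im) * (c + i*s)
                outRe[k] += re[t] * c - im[t] * s;
                outIm[k] += re[t] * s + im[t] * c;
            }
        }
        return new double[][] { outRe, outIm };
    }

    // Inverse DFT: conjugate the input, run the forward DFT, conjugate again, divide by N.
    static double[][] idft(double[] re, double[] im) {
        int n = re.length;
        double[] negIm = new double[n];
        for (int i = 0; i < n; i++) {
            negIm[i] = -im[i];
        }
        double[][] y = dft(re, negIm);
        for (int i = 0; i < n; i++) {
            y[0][i] = y[0][i] / n;
            y[1][i] = -y[1][i] / n;
        }
        return y;
    }

    public static void main(String[] args) {
        double[] x = { 1.0, 0.5, -0.25, 0.0 };
        double[][] spectrum = dft(x, new double[x.length]);
        double[][] back = idft(spectrum[0], spectrum[1]);
        for (double v : back[0]) {
            System.out.println(v); // approximately the original samples
        }
    }
}
```

The round trip only works because both the real and imaginary parts are kept.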

However, there are two problems. First, I’m assuming that your array is the spectrum from the FFT object. That array has already lost the phase information: the article works with real and imaginary numbers, which are the “raw” output of an FFT, but Processing only exposes the magnitude, which is abs(real + i * imag), or mag(real, imag) if you think of it as a vector.
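
A small sketch of why the magnitude alone is not enough: two different complex bins can share the same magnitude and differ only in phase. The class and method names here (`MagnitudeVsPhase`, `magnitude`, `phase`) are hypothetical helpers written in plain Java, not Processing API calls.

```java
// Sketch: the magnitude keeps the "length" of a complex bin and throws away its angle.
final class MagnitudeVsPhase {
    // Same idea as mag(real, imag): length of the vector (real, imag).
    static double magnitude(double re, double im) {
        return Math.hypot(re, im);
    }

    // The phase is the angle of that vector; this is the part the magnitude discards.
    static double phase(double re, double im) {
        return Math.atan2(im, re);
    }

    public static void main(String[] args) {
        // Two different bins: 3 + 4i and -4 + 3i
        System.out.println(magnitude(3, 4) + " " + phase(3, 4));
        System.out.println(magnitude(-4, 3) + " " + phase(-4, 3));
        // Both magnitudes come out the same, but the phases differ,
        // so the magnitude alone cannot tell these two bins apart.
    }
}
```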

The other problem is that an FFT is usually a “snapshot” of a short stretch of time. If you compute the FFT of a whole song of a few minutes, for example, you get one huge array (as many complex values as the song has samples, so twice as many numbers once you count the real and imaginary parts). That is not convenient because it no longer represents time (if you visualize it, you get the same stationary spectrum bars for the whole song). What people usually do is calculate a spectrogram: an FFT of each small chunk of the song, so each array represents the frequency content of that short duration. That is what Processing does (which is why it can animate). Unless you have every snapshot of the FFT arrays (an array of arrays), you cannot recover the original song.
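
To illustrate the "array of arrays" idea, here is a rough sketch of how a long waveform might be cut into short chunks before each chunk gets its own FFT. The names `Framing`, `frames`, `frameSize`, and `hop` are assumptions for this example only, and dropping the leftover samples at the end is just one simple choice.

```java
import java.util.Arrays;

// Sketch: split a waveform into short frames; each frame would get its own FFT.
final class Framing {
    // Frames start every `hop` samples and are `frameSize` long.
    // Trailing samples that do not fill a whole frame are dropped.
    static double[][] frames(double[] samples, int frameSize, int hop) {
        if (samples.length < frameSize) {
            return new double[0][];
        }
        int count = 1 + (samples.length - frameSize) / hop;
        double[][] out = new double[count][];
        for (int i = 0; i < count; i++) {
            int start = i * hop;
            out[i] = Arrays.copyOfRange(samples, start, start + frameSize);
        }
        return out;
    }

    public static void main(String[] args) {
        double[] samples = new double[10];
        for (int i = 0; i < samples.length; i++) {
            samples[i] = i;
        }
        // Frame size 4 with hop 2 gives overlapping frames starting at 0, 2, 4, 6.
        for (double[] frame : frames(samples, 4, 2)) {
            System.out.println(Arrays.toString(frame));
        }
    }
}
```

With `hop` equal to `frameSize` the chunks do not overlap; a smaller `hop` gives overlapping chunks.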

But I’m curious whether you can simply generate a waveform from the spectrum. The array you have probably contains the amplitude of each frequency. For example, if the data is [0.9, 0.5, 0.1, 0.2] and you already know that the values correspond to 0, 4, 8, and 16 Hz, then the original waveform (without phase) is

  sin(t *  0 * TWO_PI) * 0.9
+ sin(t *  4 * TWO_PI) * 0.5
+ sin(t *  8 * TWO_PI) * 0.1
+ sin(t * 16 * TWO_PI) * 0.2

and it would be interesting to generate sound this way and hear what it sounds like!
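
Here is a minimal sketch of that sum of sines in plain Java, assuming you already know which frequency each amplitude belongs to. The names `AdditiveSynth` and `synthesize`, and the sample rate of 64, are made up for this example. Two things to keep in mind: the 0 Hz term is always zero because sin(0) = 0, so the 0.9 contributes nothing here (a cosine would be needed to get a constant offset), and frequencies as low as 4 to 16 Hz sit below the usual range of human hearing, so real audio would use larger values.

```java
// Sketch: rebuild a waveform as a sum of sines, ignoring phase.
final class AdditiveSynth {
    // out[n] = sum over i of amps[i] * sin(2*pi * freqsHz[i] * t), with t = n / sampleRate.
    static double[] synthesize(double[] amps, double[] freqsHz, double sampleRate, int numSamples) {
        double[] out = new double[numSamples];
        for (int n = 0; n < numSamples; n++) {
            double t = n / sampleRate; // time in seconds
            for (int i = 0; i < amps.length; i++) {
                out[n] += amps[i] * Math.sin(2.0 * Math.PI * freqsHz[i] * t);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        double[] amps = { 0.9, 0.5, 0.1, 0.2 };
        double[] freqsHz = { 0, 4, 8, 16 };
        // One second of signal at an assumed 64 samples per second.
        double[] wave = synthesize(amps, freqsHz, 64.0, 64);
        for (int n = 0; n < 8; n++) {
            System.out.println(wave[n]);
        }
    }
}
```

The resulting array holds raw sample values; playing it back still needs whatever audio output your environment provides.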

BennyHacker · April 18, 2021, 3:30pm

Thank you for the detailed response! I am still just learning this so I will probably try exploring these other methods of using audio data before I go ahead and use the FFT array.

And yes, I am using an array of amplitudes, so perhaps I will use that equation and see what results I can get!
