The present invention relates to limiting the peaks in signals and in particular to limiting the magnitude of FFT bins to avoid harmonic distortion.
Known limiting utilizes filter banks and hard clipping to suppress problematic frequency bands created when increasing the playback level. The known approach has very limited resolution, and due to hard limiting of the signal, significant noise is often introduced.
The present invention addresses the above and other needs by providing a volume extension method including limiting the magnitude of Fast Fourier Transform (FFT) frequency bins which allows increases to the perceived level of audio content without causing distortion. Soft limit and smoothing is applied to each FFT bin is to prevent or reduce distortion while allowing maximizing output volume. Frequency resolution is significantly improved compared to volume extension methods utilizing filterbanks and hard limiting, and distortion is reduced because no hard limiting occurs.
In accordance with one aspect of the invention, there is provided a method for signal processing audio signals. The method includes: receiving a time domain input signal; window the input signal; performing an FFT on the windowed signal to obtain FFT bin data; limiting the FFT bin data; performing an inverse FFT on the limited FFT bin data to obtain limited time domain data; assembling a limited signal from the limited time domain data; and driving an acoustic transducer with the limited signal to generate sound waves.
In accordance with a further aspect of the invention, there is provided a method for signal processing audio signals including applying a Hanning window to the input signal before performing an FFT.
In accordance with a further aspect of the invention, there is provided a method for signal processing audio signals including soft limiting FFT bin magnitudes. The magnitudes are mapped into limited magnitude values using an equation to reduce distortion.
In accordance with a further aspect of the invention, there is provided a method for signal processing audio signals including smoothing limited FFT bin magnitudes. After limiting individual FFT bin magnitudes, the frame to frame change in magnitude is further limited to reduce distortion. For example, a one pole filter may be applied after limiting.
The above and other aspects, features and advantages of the present invention will be more apparent from the following more particular description thereof, presented in conjunction with the following drawings wherein:
Corresponding reference characters indicate corresponding components throughout the several views of the drawings.
The following description is of the best mode presently contemplated for carrying out the invention. This description is not to be taken in a limiting sense, but is made merely for the purpose of describing one or more preferred embodiments of the invention. The scope of the invention should be determined with reference to the claims.
A functional block diagram of an audio signal processing system 10 including frequency domain limiting according to the present invention is shown in
A method for intelligent Fast Fourier Transform (FFT) frequency bin limiting according to the present invention is shown in
The limiting of step 106 may be a hard limiting which sets all values above the limit to the limit value, or a soft limiting as shown in
The inverse FFT data is preferably assembled by multiplying each frame (of size N) of inverse FFT data by the Hanning window, and adding the results to an accumulator (of size N). The first N/4 values in the accumulator are outputted as the first N/4 samples of the limited digital signal. After the first N/4 values are outputted, the data in the accumulator is shifted to the left by N/4 cells and the N/4 cells at the right end of the accumulator are set to zero. Then, the next frame of inverse FFT data is multiplied by the Hanning window and added to the existing N values in the accumulator, and the first N/4 values in the accumulator are again outputted as the second N/4 samples of the limited digital signal. This process is repeated as each frame of inverse FFT data is generated, windowed, and added to the accumulator to obtain a weighted average. The result is, weighted, and shifted by 0, N/4, 2N/4, and 3N/4, portions (of length N/4) of each FFT output frames, are summed together and outputted to obtain N/4 samples of the limited digital signal.
The FFT bin based signal limiting according to the present invention has application to music, television, movies, cell phones, and generally to any audio system where a desire exists to provide a high audio level without losing sound quality. Any audio system including FFT bin based signal limiting is intended to come within the scope of the present invention.
The digital signal sample rate is preferably 48K. The window length is generally matched to the FFT length N and a longer length FFT increases the frequency resolution of the FFT. Preferably, N is a power of two, and at least 256, and may be as high as 2048 in software based systems, but is constrained by the processing environment. Implementations in DSPs are limited by existing processors to about 512.
The FFT output is conjugate symmetric, meaning that the first half of the output is equal to the reversed order conjugate of the second half. Thus, if the FFT produces N complex values, N/2 of the values describe the spectrum of the signal.
While the invention herein disclosed has been described by means of specific embodiments and applications thereof, numerous modifications and variations could be made thereto by those skilled in the art without departing from the scope of the invention set forth in the claims.
Number | Name | Date | Kind |
---|---|---|---|
6604071 | Cox et al. | Aug 2003 | B1 |
7013011 | Weeks | Mar 2006 | B1 |
20030179860 | Sun et al. | Sep 2003 | A1 |
20050149339 | Tanaka et al. | Jul 2005 | A1 |
20070083365 | Shmunk | Apr 2007 | A1 |
20090299742 | Toman | Dec 2009 | A1 |
20120179456 | Ryu et al. | Jul 2012 | A1 |