1. Technical Field
A system and methods for extending the frequency bandwidth of harmonic signals are provided.
2. Prior Art
All communication systems, especially wireless communication systems, suffer bandwidth limitations. The quality and intelligibility of speech signals transmitted in such systems must be balanced against the limited bandwidth available to the system. In wireless telephone networks, for example, the bandwidth is typically set according to the minimum bandwidth necessary for successful communication. The lowest frequency important to understanding a vowel is about 200 Hz and the highest frequency vowel formant is about 3,000 Hz. Most consonants however are broadband, usually having energies in frequencies below about 3,400 Hz. Accordingly, most wireless speech communication systems are optimized to pass between 300 and 3,400 Hz.
A typical passband 10 for a speech communication system is shown in
In many cases, the quality of band-limited signals can be improved by reintroducing the harmonic components of signals that have been lost because they lie outside of the system's passband. In some systems, such as that disclosed in a co-pending U.S. patent application Ser. No. 11/110,556, entitled “System for Improving Speech Quality and Intelligibility,” the entire disclosure of which is incorporated herein by reference, higher frequency components of speech signals are transposed or compressed into lower frequency ranges that are within the system's passband. In this case the compressed speech signals retain much of the information from the higher frequency ranges that are outside the passband and which would otherwise be lost if the signal were not compressed. This step alone significantly improves the quality and intelligibility of band-limited speech signals. Nonetheless, such frequency compressed signals experience further significant quality and intelligibility improvements if they are re-expanded after they have been transmitted over the narrowband communication channel and harmonics have been reintroduced at higher frequencies.
Presently, several techniques exist for extending the frequency range of harmonic signals for both speech and music. In many cases extending the harmonic signal content may be described as “excitation signal generation.” These techniques can be broadly grouped into two categories: frequency shifting methods; and nonlinear distortion methods.
Frequency shifting methods involve some form of spectral copying, transposition, or folding, in order to introduce a replica of lower frequency harmonics at higher frequencies. Many of these methods use a fixed copying scheme, which can result in the improper placement of the high-frequency harmonics. In many cases, the re-introduced high frequency harmonics will not be placed accurately at each multiple of the fundamental pitch frequency. Some spectral copying methods use a pitch estimate to insure the proper placement of transposed harmonics. However, performance of these methods can become severely degraded if the pitch estimate is inaccurate. This is often the case with signals having a low SNR.
The second category of harmonic extension methods involves creating harmonic distortion so that harmonics are introduced across the full frequency spectrum. These methods employ a time domain non-linear transformation such as a squared function x2(n), cubic function x3(n), or full-wave rectification |x(n)|, to introduce harmonic distortion. These methods are usually followed by spectral envelope estimation techniques, such as linear prediction, which are used to ensure that the final wideband excitation signal is spectrally flat.
The main advantage of non-linear transformation methods over spectral copying or folding methods is that harmonics are generated at multiples of the fundamental frequency without requiring the use of a pitch estimation algorithm. However, the main disadvantage of these techniques is that the new harmonics can contain aliasing artifacts in the higher frequencies. Also, because it is a time domain approach, it is difficult to control the bandwidth of the generated harmonics. New harmonics are generated across all frequencies instead of only the frequency range of interest.
A system and methods are provided for extending the harmonics of band limited harmonic signals. Harmonic content of a band-limited harmonic signal is extended to frequencies outside the signal's passband by performing a non-linear transformation on the complex spectrum of the band limited signal in the frequency domain. This non-linear transformation may be accomplished by a linear convolution of the complex spectrum with itself. A system for extending the frequency bandwidth of a harmonic signal includes a signal processor with a forward transform module for transforming a time domain signal into the frequency domain, a non-linear transform module for performing the non-linear transformation on the complex spectrum of the harmonic signal, and a reverse transform module for transforming the extended spectrum of the harmonic signal back into the time domain. In many applications, it may be desirable to combine the original band-limited signal with all or some spectral portion of the spectrally-extended harmonic signal (e.g. to obtain a final speech or music signal with improved quality or intelligibility). This can be accomplished using a variety of techniques as described in the co-pending U.S. patent application Ser. No. 11/110,556, entitled “System for Improving Speech Quality and Intelligibility”.
According to an embodiment of the invention a method of extending the harmonics of a band-limited harmonic signal is provided. The method calls for transforming the band-limited harmonic signal from the time domain into the frequency domain. The transformation produces a complex spectrum of the band-limited harmonic signal. Once the complex spectrum has been obtained, a non-linear transformation is performed on the complex spectrum. The non-linear transformation may include a linear convolution of the complex spectrum with itself. The non-linear transformation extends the harmonic content of the complex spectrum to frequencies outside the limited frequency band of the original band-limited harmonic signal. Finally, an inverse transform is performed on the extended complex spectrum, transforming the complex spectrum back into the time domain.
According to another embodiment, a harmonic extension method is provided. This method includes receiving a band-limited harmonic signal. By definition, the band-limited harmonic signal includes significant signal energies at regular frequency intervals within the limited frequency band of the band-limited signal. The signal's pass band is defined by a passband lower frequency limit and a passband upper frequency limit. The band-limited harmonic signal is transformed from the time domain into the frequency domain. The time domain to frequency domain transform produces a complex spectrum representing the frequency content of the received signal. In order to add harmonic content to frequencies outside the narrow frequency band of the original signal, a non-linear transformation is performed on the complex spectrum of the received band-limited harmonic signal. The harmonically extended spectrum is then transformed back into the time domain.
A system for extending the harmonics of a band limited harmonic signal is also provided. The system includes a device for receiving a band-limited harmonic signal, such as microphone, a wireless telephone hand set, an audio system, or any other device or system capable of receiving an harmonic signal. The system further includes a signal processor for processing a signal received by the receiving device. The signal processor includes a forward transform module for transforming the received band-limited harmonic signal from the time domain into the frequency domain. The forward transform module generates a complex spectrum representing the frequency content of the band-limited signal. A non-linear transformation module is provided by the signal processor for performing a non-linear transformation of the complex spectrum of the band-limited signal in the frequency domain. The non-linear transformation creates an extended spectrum that includes harmonics at frequencies outside the original frequency band of the received signal. Finally, the signal processor includes a reverse transform module for transforming the harmonically extended spectrum of the band-limited harmonic signal back into the time domain.
Other systems, methods, features and advantages of the invention will be, or will become, apparent to one with skill in the art upon examination of the following figures and detailed description. It is intended that all such additional systems, methods, features and advantages be included within this description, be within the scope of the invention, and be protected by the following claims.
The present invention relates to a system and methods for extending the frequency bandwidth of harmonic signals. The system and methods may be employed to reintroduce both high and low frequency harmonics of band-limited signals, as well as restore mid-band harmonics that may have been masked by background noise. The system and methods for extending the frequency bandwidth of harmonic signals may be advantageously employed to enhance the quality or intelligibility of harmonic signals such as speech or music signals. Further, the system and methods may provide an optimal mechanism for extending the bandwidth of compressed signals according to the co-pending patent application entitled “System For Improving Speech Quality and Intelligibility,” as mentioned in the Background section, the teaching of which is incorporated into the present disclosure.
The present system and methods for extending the bandwidth of harmonic signals operate in a manner similar to the method described in the Background section of introducing harmonics by creating harmonic distortion in the time domain. However, whereas past efforts of introducing or reintroducing harmonic content through harmonic distortion rely on a non-linear transformation of the band-limited signal in the time domain, the system and methods disclosed herein rely on a transformation of the band-limited signal in the frequency domain instead. As noted in the background of the invention, a non-linear transformation in the time domain may be accomplished by squaring the original time domain signal x(n) as represented by the equation.
y(n)=x2(n) (1)
where n denotes the time index and y is the transformed output signal which includes the harmonic distortion that produces harmonics at a wider bandwidth. Squaring the time sampled signal in the time domain is equivalent to performing circular or cyclic convolution of the signal's complex spectrum with itself in the frequency domain. However, circular convolution in the frequency domain suffers from the same defects as squaring the time domain signal, namely aliasing artifacts as shown in
Y(k)=X(k)*X(k) k=O . . . N/2 (2)
where * denotes a linear convolution operation, k is the frequency index, and N is the length of the FFT employed to transform the time domain signal into the frequency domain. Note that the Figures showing frequency spectra in this document were generated using a digital signal sampled at 11 kHz transformed using a 256-pt FFT on hanning-windowed time segments, with 50% overlap. Other sampling rates, windowing functions or FFT sizes may also be used for this invention.
It is also clear that the aliasing artifacts 26 which are prevalent in the higher frequencies of the spectrum 16 when a non-linear transform is performed in the time domain are not present in the spectrum 28 resulting from a linear convolution in the frequency domain. An additional advantage of performing linear convolution of the complex spectrum of the harmonic signal with itself in the frequency domain is that it is easier to control the bandwidth of the generated harmonics. For example, in
In the preceding example, the harmonic range of the spectrum 10 of the original band-limited signal was extended from approximately 3500 Hz to 5,500 Hz. Performing a non-linear transformation in the frequency domain may also be used to extend harmonics from higher frequency ranges to lower frequency ranges, although a slightly different approach must be employed. In order to introduce harmonics in frequencies below the frequencies of the harmonics of the original band-limited signal, the mirrored complex conjugate of the original complex spectrum of the band-limited signal is used. The original complex spectrum of the band-limited signal is convolved with a mirrored complex conjugate version of itself. This operation may be expressed mathematically as
Y′(k)=X(k)*conj(X(N/2−k)) k=0 . . . N/2, (3)
where conj represents the complex conjugate of the complex spectrum. The final output Y(k), the spectrum including lower frequency harmonics, is obtained by again mirroring and taking the complex conjugate of the result of the linear convolution Y′(k). This may be expressed as
Y(k)=conj(Y′(N/2−k)) k=0 . . . N/2. (4)
As was the case when extending harmonics to higher frequencies, the filtered output Y(k) need only be calculated for points k where harmonic extension of the signal are desired.
Using the system and methods of extending harmonic information in band-limited signals discussed herein, it may be desirable to linearly convolve only portions of the original complex spectrum with itself. For example, for phone-band speech signals, the most significant harmonic energies are usually contained within the frequency range between 0-2.5 KHz. Therefore, in order to reduce the computational load on the system, it may be desirable to use only the portion of the original complex spectrum for the convolution where most of the harmonic energy resides. The linear convolution equation (2) described above may be altered as
Y(k)=X(k1)*X(k2) k1=0 . . . M1 (5)
k2=0. . . M2
where
Limiting the portions of the complex spectrum which are used in performing the linear convolution may also be effective when a portion of the original harmonic signal has been corrupted by noise. Generation of additional harmonics may be more effective when portions of the spectrum having the low signal-to-noise ratio (SNR) are excluded from the linear convolution.
As mentioned above, excluding a portion of the original complex spectrum from the linear convolution may be beneficial where portions of the original signal are corrupted by noise. Another alternative for excluding low SNR portions of the spectrum is to use a weighted convolution approach. In some cases it may be advantageous to exclude or suppress portions of a spectrum prior to performing the linear convolution. This may be accomplished by multiplying the spectrum by one or more weighting factors before performing the convolution. In this case, Equation (4) may be re-written as
Y(k)=[G1(k1)X(k1)]*[G2 (k2)X(k2)] k1=0 . . . M1 (6
k2=0. . . M2
where M1, M2<N/2 and G1 and G2 are weighting factor vectors. Appropriate values for G1 and G2 may be for example between 0 and 1. In a particular implementation for suppressing background noise and generating extended harmonics in a speech signal corrupted by background noise, G1 and G2 may correspond to Weiner filter coefficients estimated from SNR characteristics of the original spectrum of the input speech signal.
The weighted convolution procedure embodied in Equation (6) may also be employed to recover or enhance in-band harmonics that have been completely or partially masked by noise. For example
Based on the above discussion, a flowchart describing a method for extending the bandwidth of a band-limited harmonic signal is shown in
The harmonic extender 206 further includes a non-linear transform module 210. The harmonic generation module 210 performs a non-linear transformation on the complex spectrum of the received audio signal which is output from forward transform module 208. The non-linear transformation may include linear convolution of the complex spectrum of the transformed signal with itself; linear convolution of the complex spectrum or a portion of the complex spectrum with a portion of itself; convolution of the complex spectrum with the mirrored complex conjugate of itself; convolution of a first weighted version of the complex spectrum with a second weighted version of the complex spectrum; or some other non-linear frequency domain transformation that will generate the desired harmonics.
Once the additional harmonics have been generated, an inverse, or reverse transform module 212 transforms the harmonically extended spectrum back into the time domain. The reverse transform module 212 may employ an inverse FFT algorithm, an Inverse Discrete Fourier Transform (IDFT); an Inverse Digital Cosine Transform (IDCT); a digital filter bank; or a wavelet transform or some other mechanism for transforming the complex spectrum of the harmonically extended signal back into the time domain. Preferably, the reverse transform module 212 will employ the inverse of the transform employed by the forward transform module 208. The reverse transform module 212 outputs a time domain signal 214 which includes harmonics in frequencies outside the limited frequency band of the original signal.
While various embodiments of the invention have been described, it will be apparent to those of ordinary skill in the art that many more embodiments and implementations are possible within the scope of the invention. Accordingly, the invention is not to be restricted except in light of the attached claims and their equivalents.