The present application is related to and claims the benefit under 35 U.S.C. §119(a) of a Korean patent application filed on May 29, 2012 in the Korean Intellectual Property Office and assigned Serial No. 10-2012-0056871, the entire disclosure of which is hereby incorporated by reference.
The present disclosure relates to a method and an apparatus for processing an audio signal, and more particularly, to a method of processing an audio signal which adjusts a gain by separating the audio signal into a harmonic signal and a percussion signal, and an apparatus thereof.
A dynamic range controller (DRC) dynamically adjusts magnitude of an output according to magnitude of an input, and refers to an audio gain controller for increasing an output of a speaker.
In general, audio gain control according to the related art is achieved using one DRC.
Meanwhile, recent pop music generally includes a percussion signal output from a percussion instrument. The percussion signal has predetermined energy in a time domain and is intermittently generated as compared with a continuously input harmonic signal. The percussion signal has a spike format in a time spectrum. That is, since the percussion signal is interposed between continuous harmonic signals, the percussion signal can be bounced in the middle of a time spectrum. Such a characteristic of the percussion signal causes a problem when an output level is maximized. In order to prevent the audio signal from being distorted, a maximum value of the audio signal can be amplified to a predetermined level. An amplifiable level is limited due to a percussion signal having the spike format. That is, although an average of the output level is extremely less than a maximum value, it is impossible to increase a gain longer to maintain a format of the percussion signal.
To address the above-discussed deficiencies, a method and an apparatus for processing an audio signal improve controlling a gain and adopt a reproducing volume level to a current environment by separating the audio signal into a harmonic signal and a percussion signal to adjust a gain.
The present disclosure further provides an apparatus and a method for processing an audio signal having a high output level without non-linear distortion by separately processing the audio signal into a harmonic signal and a percussion signal.
Objects of the embodiments may not be limited to the above. Other objects that are not described may be clearly comprehended to those of skill in the art to which the embodiment pertains through the following description. It should be understood that objects and advantages of the present disclosure may be realized by means described in claims and a combination thereof.
In accordance with embodiments of the present disclosure, a method of processing an audio signal includes: separating the audio signal into a first harmonic signal and a first percussion signal; adjusting gains of the first harmonic signal and the first percussion signal to generate a second harmonic signal and a second percussion signal; and adding the second harmonic signal and the second percussion signal.
In accordance with embodiments of the present disclosure, an apparatus for processing an audio signal includes: a harmonic/percussion separator separating the audio signal into a first harmonic signal and a first percussion signal; a first DRC adjusting a gain of the harmonic signal to generate a second harmonic signal; a second DRC adjusting a gain of the percussion signal to generate a second percussion signal; and an adder adding the first harmonic signal and the second percussion signal.
Before undertaking the DETAILED DESCRIPTION below, it may be advantageous to set forth definitions of certain words and phrases used throughout this patent document: the terms “include” and “comprise,” as well as derivatives thereof, mean inclusion without limitation; the term “or,” is inclusive, meaning and/or; the phrases “associated with” and “associated therewith,” as well as derivatives thereof, may mean to include, be included within, interconnect with, contain, be contained within, connect to or with, couple to or with, be communicable with, cooperate with, interleave, juxtapose, be proximate to, be bound to or with, have, have a property of, or the like; and the term “controller” means any device, system or part thereof that controls at least one operation, such a device may be implemented in hardware, firmware or software, or some combination of at least two of the same. It should be noted that the functionality associated with any particular controller may be centralized or distributed, whether locally or remotely. Definitions for certain words and phrases are provided throughout this patent document, those of ordinary skill in the art should understand that in many, if not most instances, such definitions apply to prior, as well as future uses of such defined words and phrases.
For a more complete understanding of the present disclosure and its advantages, reference is now made to the following description taken in conjunction with the accompanying drawings, in which like reference numerals represent like parts:
a through 10, discussed below, and the various embodiments used to describe the principles of the present disclosure in this patent document are by way of illustration only and should not be construed in any way to limit the scope of the disclosure. Those skilled in the art will understand that the principles of the present disclosure may be implemented in any suitably arranged audio signal processing device. Exemplary embodiments of the present disclosure are described with reference to the accompanying drawings in detail. The same reference numbers are used throughout the drawings to refer to the same or like parts. Detailed descriptions of well-known functions and structures incorporated herein may be omitted to avoid confusing the subject matter of the present disclosure.
In the present disclosure, separated harmonic signal and percussion signal are defined as follows. First, the percussion signal refers to an audio signal by an instrument called percussion or a percussion instrument outputting a sound by beating. Since the percussion signal is generally used to keep time with music, the percussion signal has energy distribution in which a spike format is intermittently generated in a time spectrum. The harmonic signal refers to a periodic component composed of at least two sounds. Energy of the harmonic signal has a predetermined value according to a time as compared with the percussion signal and represents an energy distribution of a spike format in a frequency spectrum.
Hereinafter, the apparatus for processing an audio signal according to embodiments of the present disclosure will be described with reference to
Referring to
The harmonic/percussion separator 210 separates an audio signal into a harmonic signal and a first percussion signal. The harmonic/percussion separator 210 will be described in detail with reference to
The spectrogram separator 215 creates the harmonic spectrogram and the spectrogram using the spectrogram of the audio signal from the spectrogram generator 213. In detail, the harmonic spectrogram generator 216 generates the harmonic spectrogram using the spectrogram, and the percussion spectrogram generator 217 generates the percussion spectrogram using the spectrogram.
The harmonic spectrogram generator 216 slices the spectrogram received from the spectrogram generator 213 for each preset frequency to generate at least one frequency frame, and performs median filtering with respect to the respective frequency frames. The median filter has a characteristic which substitutes a median value in a window centering around one point for a value of the one point. If the harmonic spectrogram generator 216 performs median filtering with respect to a frequency frame being an audio signal of a specific frequency band according to a time, a percussion component spiked in a time axis spectrum is suppressed. The harmonic spectrogram generator 216 combines at least one frequency frame in which a percussion component is suppressed by the filtering with each other to generate the harmonic spectrogram.
The percussion spectrogram generator 217 slices the spectrogram received from the spectrogram generator 213 for each preset time to generate at least one time frame, and performs median filtering with respect to the respective time frames. If the percussion spectrogram generator 217 performs median filtering with respect to a time frame being a frequency spectrum at a specific time, a harmonic component spiked in the frequency spectrum is suppressed. The percussion spectrogram generator 217 combines at least one time frame in which a harmonic component is suppressed by the filtering with each other to generate the percussion spectrogram.
The converter 219 converts the harmonic spectrogram and the percussion spectrogram output from the spectrogram separator 215 into a first harmonic signal and a first percussion signal which are signals in a time domain, respectively.
The first harmonic signal and the first percussion signal separated by the harmonic/percussion separator 210 are input to the first DRC 230 and the second DRC 250, respectively. The first DRC 230 adjusts a gain of the first harmonic signal to generate a second harmonic signal. The first DRC 230 adjusts the gain of the first harmonic signal so that the second harmonic signal does not exceed a first threshold. The second DRC 250 adjusts the gain of the first percussion signal to generate a second percussion signal. The second DRC 250 adjusts the gain of the first percussion signal so that the second percussion signal does not exceed a second threshold.
The first threshold and the second threshold can be 0 dB for maximizing an output level, and can have different values. The first DRC 230 and the second DRC 250 have a DRC structure, the same structure, or different structures. That is, the first DRC 230 and the second DRC 250 are independent of each other. When the first DRC 230 and the second DRC 250 have the same structure, the first DRC 230 and the second DRC 250 can have different parameters (time constants, gains) as necessary.
In certain embodiments, the apparatus 200 for processing an audio signal according to the present disclosure further includes the gain controller 290 as a separate module. The gain controller 290 simultaneously controls gains adjusted by respective DRCS 230, 250 which are independently operated. That is, the gain controller 290 controls both of the gains of the first harmonic signal and the first percussion signal so that a difference between the gains of the first harmonic signal and the first percussion signal is less than a preset value. That is, the gain controller 290 simultaneously controls the gains of the first harmonic signal and the first percussion signal in order to prevent mismatch of an extent capable of being recognized between signals as a gain difference of both channels is increased.
That is, a gain control module is included inside the first DRC 230 and the second DRC 250 and independently adjust gains of the DRCs. The gain control module adjusts gains of the second harmonic signal and the second percussion signal so that the second harmonic signal matches with the second percussion signal in consideration of the gain of each DRC at an outside of the DRC in the same scheme as that of the gain controller 290 of
The first DRC 230 and the second DRC 250 adjust gains or time constants of the first harmonic signal and the percussion signal, the adder 270 adds the second harmonic signal and the second percussion signal having the adjusted gains to each other, and outputs the addition result. Using the foregoing scheme, the harmonic signal and the percussion signal are separated from the audio signal, gains of the harmonic signal and the percussion signal are individually adjusted, and the adjusted gains of the harmonic signal and the percussion signal are added to each other, and the addition result is output, so that a total volume of an output signal can be increased without non-linear distortion of some signals.
The following is a description of outputs from respective modules in the apparatus for processing an audio signal according to the present disclosure with reference to
Hereinafter, a method of acquiring a harmonic spectrogram and a percussion spectrogram from a spectrogram of an audio signal will be described with reference to
Hereinafter, the method of processing an audio signal according to the embodiment of the present disclosure will be described with reference to
Referring to
Referring to
In order to generate the spectrogram of the audio signal, the apparatus 200 for processing an audio signal performs a DFT or an FFT on an input audio signal according to a time to acquire a frequency spectrum of the audio signal according to the time. In certain embodiments, the apparatus 200 for processing an audio signal uses a filter bank to acquire a frequency component. The filter bank is a filter group that divides a frequency band of the audio signal by a plurality of band pass filters. The apparatus 200 for processing an audio signal extracts frequency components of respective bands by passing the audio signal through the filter bank and adds the extracted frequency components to generate a frequency spectrum according to a time as a spectrogram.
In block 730, the apparatus 200 for processing an audio signal acquires a harmonic spectrogram and a percussion spectrogram using the spectrogram generated in block 710. More particularly, the apparatus 200 for processing an audio signal generates the harmonic spectrogram and the percussion spectrogram using the spectrogram, respectively.
Referring to
Referring to
In block 750, after acquiring the harmonic spectrogram and the percussion spectrogram from the spectrogram of the audio signal, the apparatus 200 for processing an audio signal converts the harmonic spectrogram and the percussion spectrogram into a first harmonic signal and a first percussion signal, which are signals in a time domain, respectively. The apparatus 200 for processing an audio signal separates the audio signal into the harmonic signal and the percussion signal in steps 710 to 750.
Referring back to
The apparatus 200 for processing an audio signal controls both of the gains of the first harmonic signal and the first percussion signal so that a difference between the gains of the first harmonic signal and the first percussion signal is less than a preset value. That is, the apparatus 200 for processing an audio signal simultaneously controls the gains of the first harmonic signal and the first percussion signal in order to prevent mismatch of an extent capable of being recognized between signals as a gain difference of both channels is increased. That is, the apparatus 200 for processing an audio signal inputs the first harmonic signal and the first percussion signal to different channels so that respective channels can independently adjust the gains, and control gains of both channels using a common control module. Upon use of the common control module, the apparatus 200 for processing an audio signal adjusts the gains so that the second harmonic signal matches with the second percussion signal.
After separating the harmonic signal and the percussion signal, the apparatus 200 for processing an audio signal of the present disclosure individually adjusts parameters such as a time constant as well as the gains of respective signals. Upon adjusting the time constant, formats of respective signals can be changed. For example, when increasing the time constant, the signal becomes gentle. When reducing the time constant, the signal changes to a sharp format.
As described above, if the gains or time constants of the first harmonic signal and the first percussion signal are adjusted, the apparatus 200 for processing an audio signal adds the second harmonic signal and the second percussion signal to output an addition result (block 900). Using the foregoing scheme, the harmonic signal and the percussion signal are separated from the audio signal, gains of the harmonic signal and the percussion signal are individually adjusted, and the adjusted gains of the harmonic signal and the percussion signal are added to each other, and the addition result is output, so that a total volume of an output signal can be increased without non-linear distortion of some signals.
As described above, the present disclosure can generally improve apparatuses for controlling a gain and adopt a reproducing volume level to a current environment by separating the audio signal into a harmonic signal and a percussion signal to adjust a gain.
The present disclosure further provides an audio signal having a high output level without non-linear distortion and improve excitation performance of an embedded speaker.
Although the present disclosure has been described with an exemplary embodiment, various changes and modifications may be suggested to one skilled in the art. It is intended that the present disclosure encompass such changes and modifications as fall within the scope of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2012-0056871 | May 2012 | KR | national |