This application claims the benefit of Korean Patent Application No. 10-2005-0064507, filed on Jul. 15, 2005, in the Korean Intellectual Property Office, the disclosure of which is incorporated herein in its entirety by reference.
1. Field of the Invention
The present general inventive concept relates to an audio signal coding and/or decoding system, and more particularly, to a method and apparatus to extract an important spectral component of an audio signal and a method and apparatus to code and decode a low bit-rate audio signal using the same.
2. Description of the Related Art
“MPEG (Moving Picture Experts Group) audio” is an ISO/IEC standard for high-quality high-performance stereo coding. The MPEG audio is standardized together with moving picture coding in accordance with ISO/IEC SC29/WG11 of MPEG. For the MPEG audio, sub-band coding (band division coding) based on 32 bands and modified discrete cosine transform (MDCT) are used for compression, and in particularly, a high performance compression is performed by using psychopathic characteristics. The MPEG audio can implement a high quality of sound compared to a conventional compression coding scheme.
In order to compress audio signals with a high performance, the MPEG audio utilizes a “perceptual coding” compression scheme in which detailed low sensitive information is eliminated by using sensitive characteristics of human beings sensing audible signals, to reduce a code amount of the audio signals.
In addition, in the MPEG audio, a minimum audible limit and a masking property of a silent period are mainly used for the perceptual coding using an auditory psychopathic characteristic. The minimum audible limit of a silent period is a minimum level of sound which can be perceived by auditory sense. The minimum audible limit is related to a limit of noise which can be perceived by the auditory sense in the silent period. The minimum audible limit varies according to frequencies of sound. At some frequencies, sound higher than the minimum audible limit may be audible, but at other frequencies, sound lower than the minimum audible limit may not be audible. In addition, a sensing limit of a specific sound may varies greatly according to other sounds which are heard together with the specific sound. This is called “masking effect.” A width of a frequency at which the masking effect occurs is called a critical band. In order to efficiently use the auditory psychopathic characteristics such as the critical band, it is important to decompose the sound signal into spectral components. For the reason, the band is divided into 32 sub-bands, and then, the sub-band coding is performed. In addition, in the MPEG audio, filter banks are used to eliminate aliasing noises of the 32 sub-bands.
The MPEG audio includes bit allocation and quantization using filter banks and a psychoacoustic model. Coefficients generated from the MDCT are allocated with optimal quantization bits and compressed by using a psychoacoustic model 2. The psychoacoustic model 2 for allocating the optimal bits evaluates the masking effect based on FFT by using spreading functions. Therefore, a relatively large amount of complexity is required.
In general, for the compression of the audio signals with a low bit-rate (32 kbps or less), the number of bits which can be allocated to the signals is insufficient for quantization of all spectral components of the audio signal and lossless coding thereof. Therefore, there is a need for extraction of perceptively importance spectral components (ISCs) and quantization and lossless coding thereof.
The present general inventive concept provides a method and apparatus to extract an important spectral component from an audio signal to compress the audio signal with a low bit-rate.
The present general inventive concept also provides a low bit-rate audio signal coding method and apparatus using a method and apparatus to extract an important spectral component from an audio signal.
The present general inventive concept also provides a low bit-rate audio signal decoding method and apparatus to decode a low bit-rate audio signal coded by the low bit-rate audio signal coding method and apparatus
Additional aspects and advantages of the present general inventive concept will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the general inventive concept.
The foregoing and/or other aspects and advantages of the present general inventive concept may be achieved by providing a method of extracting important spectral components (ISCs) of audio signals, the method comprising calculating perceptual importance including a signal-to-mask ratio (SMR) value of transformed spectral audio signals by using a psychoacoustic model, selecting the spectral audio signals having a masking threshold value smaller than that of the spectral audio signals using the SMR value as first ISCs, and extracting a spectral peak from the spectral audio signals selected as the first ISCs according to a predetermined weighting factor to select second ISCs. The weighting factor may be obtained by using a predetermined number of spectrum values near a frequency of a current signal of which weighting factor is to be obtained.
The method may further include obtaining SNRs (signal-to-noise ratios) for frequency bands and selecting spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR as the ISCs.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing a method of extracting ISCs (important spectral components) of audio signals, the method comprising calculating perceptual importance including an SMR (signal-to-mask ratio) value of transformed spectral audio signals by using a psychoacoustic model, selecting the spectral audio signals having a masking threshold value smaller than that of the spectral audio signals using the SMR as first ISCs, and obtaining SNRs for frequency bands among the spectral audio signals selected as the first ISCs to select the spectral audio signals having spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR using the SNRs as another ISCs.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing a low bit-rate audio signal coding method comprising calculating perceptual importance including an SMR (signal-to-mask ratio) value of spectral audio signals by using a psychoacoustic model, selecting the spectral audio signals having a masking threshold value smaller than that of the spectral audio signals using the SMR value as first ISCs, extracting a spectral peak from the audio signals selected as the first ISCs according to a predetermined weighting factor, and selecting the spectral audio signals having a frequency of the spectral peak as a second ISC, and performing quantization and lossless coding on the spectral audio signals having the second ISC. The extracting of the spectral peak may comprise obtaining SNRs (signal-to-noise ratios) for frequency bands and selecting spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR using the SNRs as third ISCs. The low bit-rate audio signal coding method may further comprise transforming a temporal audio signal into the spectral audio signal by using MDCT (modified discrete cosine transform) and MDST (modified discrete sine transform) to generate the spectral audio signal. The performing of quantization of the ISC audio signal may comprise performing grouping the audio signals into a plurality of groups so as to minimize additional information according to a used bit amount and a quantization error, determining a quantization step size according to an SMR (signal-to-mask ratio) and data distribution of a dynamic range of the groups, and quantizing the audio signal by using one or more predetermined quantizers for the groups. The quantizers may be determined by using values normalized with a maximum value of the group and the quantization step size. The quantization may be a Max-Lloyd quantization.
The performing of the lossless coding of the quantized signal may comprise performing context arithmetic coding. The performing of the context arithmetic coding may comprise representing the spectral components constituting frames with spectral indexes indicating the presence of the ISCs, and selecting a stochastic model according to a correlation to a previous frame and distribution of neighboring ISCs to perform the lossless coding on quantization values of the audio signal, and additional information including the quantizer information, the quantization step, the grouping information, and the spectral index value.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing a low bit-rate audio signal coding method comprising calculating perceptual importance including an SMR (signal-to-mask ratio) value of spectral audio signals by using a psychoacoustic model, selecting the spectral audio signals having a making threshold value smaller than that of the spectral audio signals using the SMR value as first ISCs, obtaining SNRs for frequency bands among the spectral audio signals selected as the first ISCs and selecting spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR using the SNRs as another ISCs, and performing quantization and lossless coding on the spectral audio signals having the another ISCs.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing an apparatus to extract an audio signal ISC (important spectral component), the apparatus comprising a psychoacoustic modeling unit which calculates perceptual importance including an SMR (signal-to-mask ratio) value of transformed spectral audio signals by using a psychoacoustic model, a first ISC selection unit which selects the spectral audio signals having a masking threshold value smaller than that of the spectral audio signals using the SMR as first ISCs, and a second ISC selection unit which extracts a spectral peak from the spectral audio signals selected as the first ISCs according to a predetermined weighting factor and selecting second ISCs. The weighting factor in the second ISC selection unit may be obtained by using a predetermined number of spectrum values near a frequency of a current signal of which weighting factor is to be obtained. The apparatus may further comprise a third ISC selection unit which obtains SNRs (signal-to-noise ratios) for frequency bands and selects spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR using the SNRs as third ISCs.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing an apparatus to extract an important spectral component (ISC) from an audio signal, the apparatus comprising a psychoacoustic modeling unit which calculates perceptual importance including an SMR (signal-to-mask ratio) value of transformed spectral audio signals by using a psychoacoustic model, a first ISC selection unit which selects the spectral audio signals having a masking threshold value smaller than that of the spectral audio signals using the SMR as first ISCs, and another ISC selection unit which obtains SNRs for frequency bands among the audio signals selected as the first ISCs and selects spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR using the SNRs as another ISCs.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing a low bit-rate audio signal coding extracting apparatus comprising a psychoacoustic modeling unit which calculates perceptual importance including an SMR (signal-to-mask ratio) value of transformed spectral audio signals by using a psychoacoustic model, a first ISC (important spectral component) selection unit which selects the spectral audio signals having a masking threshold value smaller than that of the spectral audio signals using the SMR as first ISCs, a second ISC selection unit which extracts a spectral peak from the spectral audio signals selected as the first ISCs according to a predetermined weighting factor and selecting second ISCs, a quantizer which quantizes the spectral audio signal having the second ISCs, and a lossless coder which performs lossless coding on the quantized signal.
The low bit-rate audio signal coding apparatus may further comprise a third ISC selection unit which obtains SNRs (signal-to-noise ratios) for frequency bands and selects spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR using the SNRs as third ISCs.
The low bit-rate audio signal coding apparatus may further comprise a T/F transformation unit which transforms a temporal audio signal into the spectral audio signal by using MDCT (modified discrete cosine transform) and MDST (modified discrete sine transform).
The quantizer may comprise a grouping unit which performs grouping the spectral audio signals into a plurality of groups so as to minimize additional information according to a used bit amount and a quantization error, a quantization step size determination unit which determines a quantization step size according to an SMR (signal-to-mask ratio) and data distribution (dynamic range) of groups, and a group quantizer which quantizes the audio signal by using predetermined quantizers for the groups. The quantization of the group quantizer may be a Max-Lloyd quantization, and the lossless coding of the lossless coder may be context arithmetic coding.
The lossless coder may comprise an indexing unit which represents the spectral components constituting frames with spectral indexes indicating the presence of the ISCs, and a stochastic model lossless coder which selects a stochastic model according to a correlation to a previous frame and distribution of neighboring ISCs and performs the lossless coding on quantization values of the audio signal, and additional information including the quantizer information, the quantization step size, the grouping information, and the spectral index value.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing a low bit-rate audio signal coding apparatus comprising a psychoacoustic modeling unit which calculates perceptual importance including an SMR (signal-to-mask ratio) value of transformed spectral audio signals by using a psychoacoustic model, a first ISC (important spectral component) selection unit which selects the spectral audio signals having a masking threshold value smaller than that of the spectral audio signals using the perceptual importance as first ISCs, another selection unit which obtains SNRs for frequency bands among the audio signals selected as the ISCs and selects spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR using the SNRs as another ISCs, a quantizer which quantizes the spectral audio signal having the another ISCs, and a lossless coder which performs lossless coding on the quantized signal.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing a low bit-rate audio signal decoding method comprising restoring index information indicating the presence of ISCs (importance spectral components), quantizer information, a quantization step size, ISC grouping information, and audio signal quantization values, performing inverse quantization with reference to the restored quantizer information, quantization step size, and grouping information, and transforming the inversely-quantized values to temporal signals.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing a low bit-rate audio signal decoding apparatus comprising a lossless decoder which extracts stochastic model information for frames and restores index information indicating the presence of ISCs (importance spectral components), quantizer information, a quantization step size, ISC grouping information, and audio signal quantization values by using the stochastic model information, an inverse quantizer which performs inverse quantization with reference to the restored quantizer information, quantization step size, and grouping information, and an F/T transformation unit which transforms the inversely-quantized values to temporal signals.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing a computer-readable medium having embodied thereon a computer program to perform a method comprising calculating perceptual importance including an SMR (signal-to-mask ratio) value of transformed spectral audio signals according to a psychoacoustic model, selecting spectral signals having a masking threshold value smaller than that of the spectral audio signals using the perceptual importance as one or more first important spectral components (ISCs), and extracting a spectral peak from the audio signals selected as the one or more first ISCs according to a predetermined weighting factor to select one or more second ISCs to be used to code the spectral audio signal.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing a computer-readable medium having embodied thereon a computer program to perform a method comprising restoring index information indicating the presence of importance spectral components (ISCs), quantizer information, a quantization step size, ISC grouping information, and audio signal quantization values with respect to an audio signal, performing inverse quantization on the audio signal according to the restored quantizer information, quantization step size, and grouping information, and transforming the inversely-quantized signals to temporal signals.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing audio signal coding and/or decoding system, comprising a coder to select spectral audio signals having one or more important spectral components (ISCs) according to a signal-to-mask ratio (SMR) value and one of a weighing factor and a signal-to-noise ratio (SNR) of a frequency band, and to code the spectral audio signals according to information on the selected ISCs, and a decoder to decode the coded spectral audio signals according to the information.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing an audio signal coding and/or decoding system, comprising a coder to select spectral audio signals having one or more important spectral components (ISCs) according to a signal-to-mask ratio (SMR) value and one of a weighing factor and a signal-to-noise ratio (SNR) of a frequency band, and to code the spectral audio signals according to information on the selected ISCs.
The foregoing and/or other aspects and advantages of the present general inventive concept may also be achieved by providing an audio signal coding and/or decoding system comprising a decoder to decode the coded spectral audio signals according to information on ISCs. The ISC may be obtained according to a signal-to-mask ratio (SMR) value and one of a weighing factor and signal-to-noise ratios (SNRs) of frequency bands of spectral audio signals.
These and/or other aspects and advantages of the present general inventive concept will become apparent and more readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
Reference will now be made in detail to the embodiments of the present general inventive concept, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the like elements throughout. The embodiments are described below in order to explain the present general inventive concept by referring to the figures.
The psychoacoustic modeling unit 100 calculates a signal-to-mask ratio (SMR) value for a transformed spectral audio signal transformed according to psychoacoustic characteristics. The spectral audio signal input to the psychoacoustic modeling unit 100 is generated by using a modified discrete cosine transform (MDCT) and a modified discrete sine transform (MDST) instead of a discrete Fourier transform (DFT). Since the MDCT and the MDST represent real and imaginary parts of the audio signal, respectively, phase information of the audio signal can be represented. Therefore, a problem of mis-match between the DFT and the MDCT can be solved. The problem of the mis-match occurs when coefficients of the MDCT is quantized by using a temporal audio signal which is subject to the DFT.
The ISC selection unit 150 selects the ISC from the audio signal by using the SMR value. The ISC selection unit 150 includes first, second, and third ISC selectors 152, 154, and 156 to select one or more first, second, and third ISCs, respectively. The one or more first, second, and/or third ISCs can be referred to as the ISCs.
The first ISC selector 152 selects the one or more spectral signals having a masking threshold value smaller than that of the spectral audio signal as one or more first important spectral components (ISCs) by using the SMR value calculated by the psychoacoustic modeling unit 100.
The second ISC selector 154 selects the one or more second ISCs by extracting a spectral peak from the audio signals selected as the one or more first ISCs in the first ISC selector 152 according to a predetermined weighting factor.
The spectral peak is searched among the one or more first ISCs. The spectral peak is determined based on a size of a signal. The size of the signal is defined by the root of the square of a real part plus the square of an imaginary part of a signal subjected to transformation of the MDCT and MDST. The weighting factor of the signal is obtained by using a spectrum value near the signal. The weight factor in the second ISC selector 154 is obtained by using a predetermined number of spectrum values near a frequency of a current signal of which weighting factor is to be obtained. The weighting factor may be obtained by using Equation 1.
Here, |SCk| denotes a size of the current signal of which weighting factor is to be obtained, and |SCi| and |SCj| denotes sizes of signals near the current signal. In addition, Ien denotes the number of signals near the current signal.
The second ISCs are selected based on the peak value and the weighting factor of the signal. For example, a product of the peak value and the weighting factor is compared to a predetermined threshold value to select only values larger than the threshold value as the second ISCs.
The third ISC selector 156 performs signal to noise ratio (SNR) equalization on the audio signal. That is, spectral components of the audio signal are divided into frequency bands, and SNRs for frequency bands are obtained, and spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR are selected as the one or more third ISCs. Such an operation is performed in order to prevent the ISCs from concentrating on a specific frequency band. In other words, dominant peaks are selected among the frequency bands having a low SNR, so that the SNRs of the frequency bands are approximately equalized over the entire frequency bands. As a result, the SNR values of the frequency bands having the low SNR increase, so that the SNR values of the entire frequency bands are approximately equalized.
The first, second, and third ISC selectors 152, 154, and 156 constituting the ISC selection unit 150 may selectively used to extract the audio signal having the perceptively important spectral components (ISCs). For example, only the first and second ISC selector 152 and 154 may be used. However, only the first and third ISC selectors 152 and 156 may be used. Otherwise, all the first to third selectors 152, 154, and 156 may be used. Accordingly, the first, second, and/or third ISCs can be extracted from the audio signal to be used as the ISCs so that the audio signal is compressed using the extracted ISCs in quantization of all spectral components of the audio signal and/or lossless coding thereof.
Spectral peaks are extracted from the audio signals selected as the first ISCs according to a predetermined weighting factor and selected as the second ISCs (operation 240). The weighting factor can be obtained by using spectrum values of predetermined frequencies near a frequency of a current signal of which weighting factor is to be obtained. Operation 240 may be the same as the operation of the aforementioned second ISC selector 154 of
The third ISCs for frequencies (or frequency bands) are selected by performing SNR equalization (operation 260). That is, the spectral components of the audio signal are divided into frequency bands, SNRs for frequency bands are obtained, and the spectral components of which peak values are larger than a predetermined value among the frequency bands having a low SNR are selected as the third ISCs. The first, second, and/or third ISCs may be collectively referred to as the ISCs. As described above, such an operation is performed in order to prevent the ISCs from concentrating on a specific frequency band. In other words, dominant peaks are selected among the frequency bands having the low SNR, so that the SNRs of the frequency bands are approximately equalized over the entire bands. As a result, the SNR values of the frequency bands having the low SNR increase, so that the SNR values of the entire bands are approximately equalized.
On the other hand, the ISC extraction in operations 220 to 260 may be selectively used. For example, only the operations 200 and 200 may be used to extract the ISCs. However, only the operations 200 and 260 may be used to extract the ISCs. Otherwise, all the operations 200, 240, and 260 may be used to extract the ISCs.
Referring to
The ISC extractor 420 extracts the audio signal having the ISC from the spectral audio signal. The ISC extractor 420 may be the same as the audio signal ISC extraction apparatus of
The quantizer 440 quantizes the audio signal of the ISC. As shown in
The grouping unit 442 performs grouping so as to minimize additional information according to a used bit amount and a quantization error. The quantization for the selected ISCs is performed as follows. Firstly, the grouping is performed on the selected ISCs so as to minimize the additional information according to a rate-distortion. The Rate-Distortion represents a relation between the used bit amount and the quantization error. The used bit amount and the quantization error can be traded off. That is, if the used bit amount increases, the quantization error decreases.
On the contrary, if the used bit amount decreases, the quantization error increases. The selected ISCs are grouped, and costs of the groups are calculated. The grouping is performed so as to lower the costs.
The groups may be formed to be uniform, and may be merged so as to reduce the costs of the frequency bands. In addition, the cost is obtained by adding bit numbers required for the groups and additional information on the bit numbers as shown in Equation 2.
cost=qbit+additional information [bit number] Equation 2
Here, qbit denotes the bit number required for each group, and the additional information includes a scale factor, quantization information, and the like.
When the grouping is completed, the quantization step size determining unit 444 determines a quantization step size according to the SMRs and data distributions (dynamic ranges) of the groups. In addition, the ISCs constituting the group are normalized with a maximum value of the ISCs.
The quantizer 446 quantizes the audio signals of the groups. The quantizer 446 is determined by using values normalized with the maximum value of the ISCs of the group and the quantization step size.
It is possible that the quantization may be Max-Lloyd quantization.
The lossless coder 460 performs the lossless coding on the quantized signal. As illustrated in
The indexing unit 462 generates one or more spectral indexes to represent the spectral components constituting each frame. The spectral indexes indicate the presence of the ISCs. The spectral information of the ISCs is coded by using the context arithmetic coding. More specifically, the spectral components constituting each frame are set by the spectral index representing the selection of the ISCs. The spectral index may be a signal having 0 or 1 to represent the presence or absence of the ISCs.
The stochastic model lossless coder 464 selects a stochastic model according to a correlation to a previous frame and distribution of neighboring ISCs and performs the lossless coding on the quantization values of the audio signal and additional information including the quantizer information, the quantization step size, and the grouping information and the spectral index value. Next, bit packing is performed on the coded value.
Referring to
After the ISCs are extracted, the ISC quantization is performed (operation 760). Detailed operations of the ISC quantization are illustrated in
After the grouping, a quantization step size is determined according to the SMRs and data distributions (dynamic ranges) of the groups (operation 764). In addition, the ISCs constituting the group are normalized with a maximum value of the ISCs.
Next, the quantizer is determined by using the values normalized with the maximum value of the group and the quantization step size.
It is possible that the quantization is Max-Lloyd quantization.
Referring back to
The lossless decoder 900 extracts stochastic model information of the groups and restores index information indicating the presence of the ISCs, quantizer information, a quantization step size, ISC grouping information, and audio signal quantization values for the groups by using the stochastic model information.
The inverse quantizer 920 performs inverse quantization with reference to the restored quantizer information, quantization step size, and grouping information.
The F/T transformation unit 940 transforms the inversely-quantized values to temporal signals.
Firstly, stochastic model information for frames is extracted by the lossless decoder 900 (operation 1000). Next, index information indicating the presence of the ISCs, quantizer information, a quantization step size, ISC grouping information, and audio signal quantization values are restored by using the stochastic model information (operation 1020). Next, the quantization values are inversely-quantized according to the restored quantizer information, quantization step size, and grouping information by the inverse quantizer 920 (operation 1040). After the inverse quantization, the inversely-quantized values are transformed to temporal signals by the F/T transformation unit 940 (operation 1060).
According to an method and apparatus to extract an audio signal having an ISC and a low bit-rate audio signal coding/decoding method and apparatus using the same, it is possible to efficiently code perceptual important spectral components so as to obtain high sound quality at a low bit-rate. In addition, it is possible to extract the perceptual important component by using a psychoacoustic model, to perform coding without phase information, and to efficiently represent a spectral signal at a low bit-rate. In addition, the present embodiment can be employed in all the applications requiring a low bit-rate audio coding scheme and in a next generation audio scheme.
The present general inventive concept can also be embodied as computer readable codes on a computer readable recording medium. The computer readable recording medium is any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, optical data storage devices, and carrier waves (such as data transmission through the Internet). The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion. Also, functional programs, codes, and code segments for accomplishing the present invention can be easily construed by programmers skilled in the art to which the present invention pertains.
Although a few embodiments of the present general inventive concept have been shown and described, it will be appreciated by those skilled in the art that changes may be made in these embodiments without departing from the principles and spirit of the general inventive concept, the scope of which is defined in the appended claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
10-2005-0064507 | Jul 2005 | KR | national |
Number | Name | Date | Kind |
---|---|---|---|
5285498 | Johnston | Feb 1994 | A |
5537510 | Kim | Jul 1996 | A |
5625743 | Fiocca | Apr 1997 | A |
5649053 | Kim | Jul 1997 | A |
5684922 | Miyakawa et al. | Nov 1997 | A |
5706009 | Kondo et al. | Jan 1998 | A |
5706392 | Goldberg et al. | Jan 1998 | A |
5721806 | Lee | Feb 1998 | A |
5790759 | Chen | Aug 1998 | A |
5886276 | Levine et al. | Mar 1999 | A |
5950156 | Ueno et al. | Sep 1999 | A |
5956674 | Smyth et al. | Sep 1999 | A |
5974380 | Smyth et al. | Oct 1999 | A |
5987407 | Wu et al. | Nov 1999 | A |
6023674 | Mekuria | Feb 2000 | A |
6092041 | Pan et al. | Jul 2000 | A |
6266644 | Levine | Jul 2001 | B1 |
6298322 | Lindemann | Oct 2001 | B1 |
6308150 | Neo et al. | Oct 2001 | B1 |
6330531 | Su | Dec 2001 | B1 |
6351730 | Chen | Feb 2002 | B2 |
6771777 | Gbur et al. | Aug 2004 | B1 |
7398204 | Najaf-Zadeh et al. | Jul 2008 | B2 |
7640157 | Prakash et al. | Dec 2009 | B2 |
20020116179 | Watanabe | Aug 2002 | A1 |
20020176353 | Atlas et al. | Nov 2002 | A1 |
20030233234 | Truman et al. | Dec 2003 | A1 |
20040088160 | Manu | May 2004 | A1 |
20050015249 | Mehrotra et al. | Jan 2005 | A1 |
20060069555 | Konda et al. | Mar 2006 | A1 |
Number | Date | Country |
---|---|---|
07-183818 | Jul 1995 | JP |
08-256064 | Oct 1996 | JP |
09-214346 | Aug 1997 | JP |
10-301594 | Nov 1998 | JP |
2000505266 | Apr 2000 | JP |
2001-282290 | Oct 2001 | JP |
2003-177797 | Jun 2003 | JP |
1994-1115 | Feb 1994 | KR |
2002-77959 | Oct 2002 | KR |
0039790 | Jul 2000 | WO |
03107328 | Dec 2003 | WO |
Entry |
---|
PCT Search Report dated Oct. 12, 2006 issued in PCT/KR2006/002775. |
Korean Office Action dated Aug. 20, 2007 issued in KR 2005-64507. |
Purnhagen H. et al., “Sinusoidal Coding Using Loudness-Based Component Selection”, In: Acoustics Speech, and Signal Processing, 2002, Proceedings. (ICASSP '02), IEEE International Conference on May 13, 2002, pp. 1817-1820. |
Korean Office Action dated Sep. 25, 2006 issued in KR 2005-64507. |
“Exploiting Time and Frequency Masking in Consistent Sinusoidal Analysis-Synthesis,” Renat et al., IEEE, 2000 (pp. 901-904). |
“Spectrum Estimation and Harmonic Analysis,” David J. Thomson, Proceedings of IEEE, 1982.9 (pp. 1055-1096). |
Extended European Search Report issued Jan. 26, 2010 in EP Application No. 06823588.6. |
Van Schijndel N H et al: “Towards a better balance in sinusoidal plus stochastic representation” Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on. New Paltz, NY USA Oct. 19-22, 2003, Piscataway, NJ, USA, IEEE, Oct. 19, 2003, pp. 197-200, XP010697936 ISBN: 978-0-7803-7850-6. |
Mark Fek et al: “Joint Speech and Audio Coding Combining Sinusoidal Modeling and Wavelet Packets” Eurospeech 2001, vol. 4, Sep. 3, 2001, Sep. 7, 2001 pp. 2311-2314, XP007004852 * section 3: “Sinusoidal Modeling” *. |
Purnhagen H et al: “Sinusoidal coding using loudness-based component selection” 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. (ICASSP). Orlando, FL, May 13-17, 2002; [IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)], New York, NY : IEEE, US, vol. 2, May 13, 2002, pp. II-1817, XP010804249. |
Najafzadch H et al: “Perceptual bit allocation for low rate coding of narrowband audio” Acoustics, Speech, and Signal Processing, 2000. ICSASSP ' 00. Proceedings. 2000 IEEE International Conference on Jun. 5-9, 2000, Piscataway, NJ, USA, IEEE, vol. 2, Jun. 5, 2000, pp. 893-896, XP010504867 ISBN: 978-0-7803-6293-2. |
Chinese Office Action issued Aug. 11, 2010 in CN Application No. 200680025920.2. |
Yanfei Ma, Study of MPEG Audio Layer III Coding Algorithm and Its Hardware Implementation, Xidian University, Master's Degree Thesis, Thesis Submitted Jan. 15, 2005. |
Japanese Office Action 2008-521328 issued Jan. 11, 2011. |
CN Office Action issued Aug. 9, 2011 in CN Patent Application No. 200680025920.2. |
“Exploiting Time and Frequency Masking in Consistent Sinusoidal Analysis-Synthesis”, Renat Vafin et al. IEEE international COnference on Acoustics, Speech, and Signal Processing, 2000, vol. 2, Jun. 9, 2000. |
Chinese Office Action Issued on Apr. 13, 2012 in CN Patent Application No. 200680025920.2. |
European Office Action Issued on May 10, 2012 in EP Patent Application No. 06823588.6. |
“Speech Enhancement Using a Constrained Iterative Sinusoidal Model”, Jesper Jensen et al., IEEE Transactions on Speech and Audio Processing, vol. 9, No. 7; Oct. 2001, pp. 731-740. |
Extended European Search Report dated Nov. 22, 2012 issued in EP Application No. 12003918.5. |
Venna T S et al: “A 6KBPS to 85KBPS scalable audio coder” Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceeding S. 2000 IEEE International Conference on Jun. 5-9, 2000, Piscataway, NJ, USA, IEEE, vol. 2, Jun. 5, 2000, pp. 877-880, XP010504863, ISBN: 978-0-7803-6293-2 *p. 879, left-hand column* *figures 2,3* *Sec. 3: “Decoder Overview”*. |
Japanese Office Action dated Jun. 25, 2013 issued in JP Application No. 2012-118574. |
Number | Date | Country | |
---|---|---|---|
20070016404 A1 | Jan 2007 | US |