The invention relates to a method and a unit for substracting quantization noise from a PCM signal.
Pulse code modulation (PCM) is a widely used technique to represent e.g. audio signals in the digital domain. Linear PCM uses evenly distributed quantization levels whereas the logarithmic PCM uses a distribution where small-signal amplitudes are quantized using more closely spaced levels than large-signal amplitudes. An example to linear PCM is the CD-audio, which uses 16 bits to represent uniformly distributed signal levels.
where |s| is the magnitude of the input, |y| is the magnitude of the output and μ is a parameter that is selected to give the desired compression characteristic.
In
wherein the meaning of y and s is as mentioned above and A is constant. The prior art μ-law and A-law coders are standardized by the Comité Consultatif International de Téléfonique et Télégraphique CCITT as the G.711 64 kbit/s PCM. The quantizer characteristics of the G.711 are disclosed in: N. S. Jayant and P. Noll, “Digital Coding of Waveforms”, Englewood Cliffs N.J.: Prentice-Hall, 1984.
The quantized sample values of a signal may be modeled mathematically as
s*[n]=s[n]+q[n], (3)
wherein S*[n] represents the quantized value of an original unquantized sample value s[n] and q[n] represents the quantization error resulting from said quantization. The quantization error can be interpreted as undesired additive noise decreasing the quality of the original sample values s[n].
Thus, it is the object of the invention to provide a method and a unit for substracting said quantization noise from a PCM signal.
This object is solved by a method comprising the steps of:
wherein
It is the essential idea of the invention to calculate said quantization noise levels Bq for each frame according to the claimed formula and to use any suitable (background) noise substracting method known in the art—perhaps after carrying out slight modifications—to substract the quantization noise from the PCM signal.
Throughout the whole description * indicates the content of quantization noise n the co-ordinated quantity or signal.
The window-function may be chosen arbitrarily
The claimed formula requires the minimum quantization level S*min[n] as well as on the maximum quantization level S*max[n] of the sample values s*[n] of the PCM signal.
According to a first advantageous embodiment of the invention the minimum quantization level S*min[n] as well as the maximum quantization level S*max[n] are known from the quantization method applied for generating the PCM signal.
According to a second embodiment said quantization levels S*min[n] and S*max[n] are not known and must therefore be predicted according to the formulas claimed in claim 3.
Advantageously neither in the first nor in the second embodiment any additional information must transmitted together with the PCM sample values s*[n] to enable the conduction of the claimed method. This is because in the first embodiment the quantization levels and in the second embodiment reference levels i can be assumed to be known (see the last two paragraphs above) when the method for substracting the quantization noise according to the invention is carried out. Because of that advantageously bandwidth is saved.
Preferably, the noise substraction is based on a background noise substracting method and system as described in the non-prepublished European Patent application having the application number EP 01201304.1 of the same applicant. The application of said system is described later as preferred embodiment. The whole content of said patent application shall be regarded as part of the disclosure of the present application.
Finally, the above-identified object is solved by a noise substracting unit according to the subject matter of claim 9. The advantages of said unit correspond to the advantages mentioned above referring to the claimed method.
Preferably, the claimed unit and method are applied at the decoder side, i.e. just before the PCM signal is decoded again. However, nevertheless it might also be applied at the encoder side.
The description is accompanied by five figures, wherein
In the following a preferred embodiment of the invention will be described by referring to
First of all said unit comprises a quantization noise level calculating unit 100 for calculating a quantization noise level Bq for each frame of said PCM signal under consideration of both a minimum and a maximum quantization noise level S*min[n], S*max[n]. More specifically, the quantization noise level Bq is constant for each frame of said PCM signal and calculated individually for a specific frame according to the following equation:
wherein
represents the mean square value of the quantization error, i.e. the energy of the quantization noise in the current frame.
The frame size is typically 2 ms.
According to a first embodiment said quantization noise levels S*min[n], S*max[n] are assumed to be known.
However, according to a second embodiment of the invention these quantization levels S*min[n], S*max[n] are not assumed to be known, e.g. because the original signals might have been quantized with different quantizers and there may have been extra scalings applied on the samples before or after coding. In these cases the quantization levels have to be predicted.
To predict the quantization levels, the entire original signal, hereinafter also referred to as waveform, is scanned to receive the sample values s*[n] of the quantized waveform. Subsequently, a check is made, if all possible representation levels i predefined due the quantization method applied to the waveform are represented (used) by at least one said sample values s*[n]. For a 16-bit linear PCM quantization method there are 216=65536 possible representation levels i and the result of said check is described in a so called usage table u[i] with 65536 values. The values of said usage table u[i] are either 1 (if there is at least one sample value of the whole waveform corresponding to the current representation level i) or 0 (if there is no sample value of the whole waveform corresponding to the current representation level i). Expressed mathematically
In formula (5) L is the waveform length, i.e. the number of samples of the whole waveform, and N is the number of bits used by the linear PCM quantization; in the above example N=16. This operation can be simplified by initializing all values of u[i] with 0 and using formula (6):
u[s*[n]]=1, 0≦n<L (6)
Now equations (5) or (6) are used to find special representation levels imin[n] and imax[n] required for predicting the desired quantization levels. More specifically, imin[n] is defined as that representation level which is—startet from i[n]—the next smaller non-zero representation level for which u[n]=1 and imax[n] is defined as that representation level which is—started from i[n]—the next bigger non-zero representation level for which u[n]=1.
With the help of said co-ordinated representation levels the desired quantization levels are predicted or calculated according to the following equations:
s*min=i[n]−(i[n]−imin[n])/2 (7)
S*max=i[n]+(imax[n]−i[n])/2 (8)
wherein
In
u[−3]=u[−1]=u[1]=u[2]=u[3]=1; and
u[−4]=u[−2]=u[0]=0
Now, from said usage array the reference levels imin[n] and imax[n] are calculated e.g. for n=2 with s*[2]=i[2]=1—according to the definition given above—to:
imax[2]=2 and imin[2]=−1
and the desired quantization levels result—according to equations (7) and (8) to:
s*min=1−(1−(−1))/2=0
S*max=1+(2−1)/2=1,5
Special care has to be taken when i[n] co-ordinated to s*[n] is the first or last non-zero value in the usage-array u. One option is to allocate two extra array values u[−2N−1−] and u[2N−1], set them both to 1 and always continue the search until a non-zero value is found.
Another question is what to do when some valid quantization levels aren't used in the waveform. Especially waveforms shorter than 1 second will result in a usage-array with many unused (but valid) quantization levels. Since this usually will give a wrong result for the quantization step-size at boundaries (when the waveform is at its maximum or minimum value), the effect on the measured level of quantization noise will be small. Although an algorithm could be used to predict the unused valid quantization levels, the added value of the extra complexity is not worth the performance gain. In practice, it seems to be enough to assume that the used signal levels in the waveform are the only valid levels.
After the quantization noise level Bq has been calculated either with the known or with the predicted quantization levels s*min and s*max it is input into a background noise substracting unit BNS 200 which is also part of said quantization noise substracting unit.
SNR[k]=|S*[k]|/Bq, (9)
wherein Bq is noise level representing the level of quantization noise in said PCM signal.
Said signal-to-noise ratio SNR[k] is input into a filter update unit 230 for calculating a filter magnitude F[k] according to a predefined filter algorithm based on at least one filter update parameter. Said filter magnitude F[k] is input into a calculation unit 240 for calculating an output spectrum Sb[k] by multiplying both the real part R{S*[k]} and the imaginary part I{S*[k]} of the spectrum S*[k] with said filter magnitude F{k}. The filter update parameter and thus also the filter magnitude F[k] are preferably adjusted such that output spectrum Sb[k] is at least substantially free of quantization noise.
Because said background noise substracting unit 200 is embodied to carry out the substraction of the quantization noise in the frequency domain, the quantization noise substracting unit further comprises several units for computing the spectrum S*[k] of the PCM signal to be input into said background noise substracting unit 200 and for transforming the spectrum Sb[k] output by said unit 200 back into a time domain signal. These units are described in the following paragraphs.
Referring back to
The spectrum Sb[k] output by said background substracting unit 200 is input into a re-transforming unit 500 for being transformed back into a signal sb[n] in the time domain. In the case that the spectrum S*[k] has been calculated by using FFT the re-transformation is done by using the inverse FFT.
The quantization noise substracting unit may further comprise a second window block 600 for generating a weighted output signal sbw[n] by weighting the signal sb[n] received after the transformation with a second window W2[n] and an overlap and add unit 700 for individually calculating a final output signal Ŝwb[n] for each frame of the PCM-signal from said weighted output signal sbw[n] such that the transition between two successive output frames is smoothed. The first and the second window W1 and W2 are preferably identical. They are independent of the window-function w[n] used for calculating the quantization noise level Bq. However, it is preferred to use the same function for w[n] as for W1.
The provision of the overlap and add unit 700 only improves the quality of the output signal but it is not absolutely necessary for carrying out the quantization noise substraction; consequently, it might be left out. The same applies to the first and the second window block 300 and 600 which are both only provided to improve the operation said overlap and add unit 700.
More specifically,
Number | Date | Country | Kind |
---|---|---|---|
02075312 | Jan 2002 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB02/05711 | 12/23/2002 | WO | 00 | 7/22/2004 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO03/063160 | 7/31/2003 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5636295 | Kim | Jun 1997 | A |
5774842 | Nishio et al. | Jun 1998 | A |
Number | Date | Country | |
---|---|---|---|
20050117656 A1 | Jun 2005 | US |