This application is the United States National Phase under 35 U.S.C. §371 of PCT International Patent Application No. PCT/EP2009/008853, filed on Dec. 10, 2009, and claiming priority to Austrian application no. A1982/2008, filed on Dec. 19, 2008.
Embodiments of the invention relate to a method and means for the scalable improvement of the quality of a signal encoding method.
To reduce the data rates necessary in digital communications systems, the audio signals being transmitted are compressed by means of encoding methods and then decompressed after the transmission.
An encoding method of this kind, which is used for the transmission of a voice signal in a frequency range from 300 to 3400 Hz at a data rate of 8 kbit/s, is known, for example, from ITU-T-Recommendation G.729.
For higher quality transmission, an expanded frequency range from 50 Hz up to 7000 Hz is known. For example, ITU-T-Recommendation G.722.EV describes a broadband method known as the Voice-Codec for this purpose.
This method uses Subband-Adaptive Differential Pulse Code Modulation (SB-ADPCM) for encoding audio signals.
To further increase the quality of the transmitted audio signal, a scalable encoding method is needed.
On the one hand, this scalability will give the receiver downstream compatibility with conventional decoding methods, and on the other hand, it offers the possibility, in the event of limited data transmission capacities in the transmission channel, of easily adapting the data rate and the size of transmitted data frames on both the sending and receiving sides.
Embodiments presented herein provide methods for scalable improvement of the quality of an encoding method according to the Subband-Adaptive Differential Pulse Code principle.
Embodiments may further provide a method for scalable improvement of the quality of an encoding method according to IT-U-Recommendation G.722 with the following method steps: a digital error signal, derived from an input signal to be encoded and a prognosis signal, is compared in sections to a number of M*LN different reference signals in an iterative process having a number of repeated steps depending on the scope of the expansion, and the reference signal having a minimum error signal with respect to a prescribed error criterion is derived there from the reference signals c(n) are each made up of equidistant Dirac impulses δ(n) according to
wherein off=[0 . . . M−1] indicates the distance of the first pulse from the beginning of the comparison segment, αpε{α0, α1, . . . , αL-1} indicates the amplitude value, M the distance between two individual pulses, N the number of pulses, and L the number of different levels {acute over (α)}.
The information about the reference signal with the minimum error signal is transmitted.
Here it is preferable for an expanded error signal eH1(n) to be determined as the error criterion according to eH1(n)=eH−c(n) and for an error value to be determined over the time period of the comparison segment as per
and then be used to determine the minimum error signal.
It is also preferable to have an arrangement for implementing the method according to the invention, in which—in addition to a conventional encoder (ADPCM) operating according to the Subband Adaptive Differential Pulse Code principle according to IT-U Recommendation G.722—means are provided for the creation of reference signals which have, for each step of the expansion, a signal generator EHDS1, . . . EHDSS to generate the reference signals c(n) and a control unit CB 1, . . . CB S.
The figures show:
Embodiments will now be discussed with reference to the figures.
The reference signal according to
The mathematical definition of a reference signal is as follows:
By varying the parameters of the amplitude value α with L different values and with the offset off=[0 . . . M−1], a group with the quantity M·LN of different reference signals is produced.
The comparison of reference signals c(n) obtained in this manner according to the invention is explained in greater detail based on
According to the invention, the reference signals c(n) are compared, over a preset time segment known as a frame, to a digital error signal eH which was determined in a conventional encoding process according to IT-U Recommendation G.722 from an input signal for encoding and a prognosis signal.
Thus, according to
eH1(n)=eH−c(n), an expanded error signal eH1(n) is obtained for which an error value is determined over the time period of the comparison segment according to
By means of control unit CB 1, . . . CB S, the reference signal c(n) with the smallest error value En is now determined, and the information about this signal is transmitted as supplemental information IH1min, . . . IHSmin and is used in the receiver to decode the payload signal.
In practice, the following parameters have proven valuable for generating the reference signal c(n).
The starting point is a sampling rate of 8 KHz and thus a sampling interval duration of 125 μsec. The duration of one comparison segment amounts to 5 msec, and the possible quantity of amplitude values L for the Dirac pulses amounts to 2. The number of Dirac pulses in one comparison segment amounts to N=5. The interval between every 2 Dirac pulses amounts to M=8 sampling intervals.
The process described above for comparing the reference signals c(n) with the digital error signal eH is now repeated iteratively as a function of the selected scaling, which is illustrated in
For the first repetition step this means that the reference signals c(n) are compared with the expanded first error signal eH1(n), and from this an expanded second error signal EH2(n) is produced. This process is typically repeated four times.
An important advantage herein is that not all information contained in the received signal actually also has to be evaluated. For example, it is possible that a receiver with only one conventional Core Decoder will receive a signal which also contains the supplemental information IH1min, . . . IHSmin, but does not use it to obtain the audio signal.
This possibility is called downstream compatibility.
However, in the case of a receiver which contains the invented expansion stages EDS1, EDS2, . . . EDSS for decoding the supplemental information IH1min, . . . IHSmin, the full quality of the signal is decoded, provided no limitation is imposed for other reasons.
Number | Date | Country | Kind |
---|---|---|---|
A 1982/2008 | Dec 2008 | AT | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2009/008853 | 12/10/2009 | WO | 00 | 10/5/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/069513 | 6/24/2010 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20040054529 | Sung et al. | Mar 2004 | A1 |
Number | Date | Country |
---|---|---|
3115859 | Mar 1982 | DE |
69124034 | Jul 1997 | DE |
Entry |
---|
Written Opinion of the International Searching Authority for PCT/EP2009/008853 dated Mar. 26, 2010 (Form PCT/ISA/237) (German Translation). |
Written Opinion of the International Searching Authority for PCT/EP2009/008853 dated Mar. 26, 2010 (Form PCT/ISA/237) (English Translation). |
International Preliminary Report on Patentability for PCT/EP2009/008853 dated Jun. 21, 2011 (Form PCT/IB/373, PCT/ISA/237) (German Translation). |
International Preliminary Report on Patentability for PCT/EP2009/008853 dated Jun. 21, 2011 (Form PCT/IB/373, PCT/ISA/237) (English Translation). |
International Search Report of PCT/EP2009/008853 dated Mar. 26, 2010 (English). |
International Search Report of PCT/EP2009/008853 dated Mar. 26, 2010 (German). |
“7 kHz Audio-Coding within 64 kbits/s; G.722 (11/88)” ITU—Standard in Force (I), International Telecommunication Union, Geneva, CH, No. G.722 (Nov. 25, 1988). |
“Reduced Rate Ultra Low Delay Audio Coder Using Multistage Vector Quatization” T. V. Sreenivas, et al., Signals, System and Computers 2007, 2007 Association, Conference on IEEE, Piscataway, NJ, US. |
Number | Date | Country | |
---|---|---|---|
20120014474 A1 | Jan 2012 | US |