1. Field of the Invention
The present invention relates to performing a fixed codebook search of an enhanced variable-rate Codec (EVRC).
2. Background of the Related Art
The IS-127 EVRC was adopted as an 8 kbps voice encoder standard of TIA/EIA in 1996 and is being considered for use as a standard encoder in CDMA 2000. The IS-127 EVRC, which has been used in CDMA digital cellular systems, is a high performance voice encoder which provides toll quality second to 13 kbps Qualcomm code excited linear prediction (QCELP) used in PCS communications.
The EVRC has three data rates, namely a maximum data rate (Rate1, 8 kbps), an intermediate data rate (Rate1/2, 4 kbps), and a minimum data rate (Rate1/8, 1 kbps). Its encoding process includes adaptive and fixed codebook searches for linear prediction and excitation signal quantization. Of these, the fixed codebook search has the highest computational complexity and accounts for at least 40% of the whole encoding process.
More specifically, when voice information is input, an analyzer extracts a linear predictive coefficient (LPC), a pitch element (adaptive codebook search), and an energy, namely a residual element (fixed codebook search). The fixed codebook search of the EVRC is based on algebraic code-excited linear prediction (ACELP). The maximum data rate (Rate1) incurs the highest computational complexity during the fixed codebook search.
One sub frame is randomly divided into five tracks T0, T1, T2, T3 and T4, each having eleven pulse positions. The eleven pulse positions (0, 5, 10, . . . , 50), (1, 6, 11, . . . , 51), (2, 7, 12, . . . , 52), (3, 8, 13, . . . , 53) and (4, 9, 14, . . . , 54) of the five tracks are randomly set up and searched, and thus tracks including two pulses and tracks including one pulse exist among the five tracks. That is, the five tracks T0, T1, T2, T3 and T4 are combined to form double-pulse tracks, each including two pulses, and single-pulse tracks, each including one pulse.
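The interleaved layout of the five tracks can be sketched as follows. This is a minimal illustration, assuming that track Ti holds the positions 5n+i for n = 0 to 10, as the listed position sets suggest; the names `NUM_TRACKS`, `POSITIONS_PER_TRACK` and `track_positions` are hypothetical and do not come from the specification.

```python
NUM_TRACKS = 5            # tracks T0..T4 (hypothetical constant name)
POSITIONS_PER_TRACK = 11  # eleven candidate pulse positions per track


def track_positions(track):
    """Return the candidate pulse positions of track T<track>.

    Assumes position 5*n + track for n = 0..10, matching the listed
    sets (0, 5, ..., 50) through (4, 9, ..., 54).
    """
    if not 0 <= track < NUM_TRACKS:
        raise ValueError("track must be in 0..4")
    return [NUM_TRACKS * n + track for n in range(POSITIONS_PER_TRACK)]


if __name__ == "__main__":
    for t in range(NUM_TRACKS):
        print(f"T{t}:", track_positions(t))
```

Each track therefore covers every fifth sample of the sub frame, and the five tracks together cover positions 0 to 54 without overlap.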
More specifically, when the track configuration codeword is ‘00’, a double-pulse per track order is T0-T1-T2 and a single-pulse per track order is T3-T4 in the five tracks. When the track configuration codeword is ‘01’, the double-pulse per track order is T1-T2-T3 and the single-pulse per track order is T4-T0. When the track configuration codeword is ‘10’, the double-pulse per track order is T2-T3-T4 and the single-pulse per track order is T0-T1. And, when the track configuration codeword is ‘11’, the double-pulse per track order is T3-T4-T0 and the single-pulse per track order is T1-T2.
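The four track orders listed above follow a cyclic pattern, which the sketch below makes explicit. It is an illustration only; the function name `track_configuration` is hypothetical, and the modulo-5 rule is inferred from the four listed cases rather than stated in the text.

```python
def track_configuration(q):
    """Map a track configuration codeword q (0..3, i.e. '00'..'11')
    to (double_pulse_tracks, single_pulse_tracks) as track indices.

    The cyclic rule is inferred from the four cases listed in the text.
    """
    if not 0 <= q <= 3:
        raise ValueError("q must be in 0..3")
    double_tracks = [(q + k) % 5 for k in range(3)]     # three double-pulse tracks
    single_tracks = [(q + 3) % 5, (q + 4) % 5]          # two single-pulse tracks
    return double_tracks, single_tracks


if __name__ == "__main__":
    for q in range(4):
        double_tracks, single_tracks = track_configuration(q)
        print(format(q, "02b"), double_tracks, single_tracks)
```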
In the single-pulse track, one of T3-T4, T4-T0, T0-T1 and T1-T2 is selected, encoded using a 2-bit (P6, P7) codeword, and transmitted to a receiving end. In the double-pulse track, two pulse positions and signs are each encoded using an 8-bit codeword (P0, P1), (P2, P3) and (P4, P5). Accordingly, a total of 35 bits {=2+(7+2)+(8×3)} are necessary for the encoding process of the algebraic codebook.
The EVRC fixed codebook is an algebraic codebook which has advantages in storage performance and computational complexity. The structure of the EVRC fixed codebook is based on an interleaved single-pulse permutation (ISPP) design. The codebook search is a process for searching a codebook factor and a codebook gain that minimize a weighted mean square error between an original signal and a combined signal, and is performed in sub frame units.
In an initial step of the method, a vector dot product (d)[N×1] and an autocorrelation function (φ)[N×N] are calculated using the fixed codebook target signal and the impulse response matrix (S301). That is, the vector d is calculated by multiplying the transpose of the impulse response matrix H by the fixed codebook target signal xw, and the autocorrelation function φ is calculated by multiplying the transpose of the impulse response matrix H by H itself.
Next, a pulse sign (±1) is determined at the pulse positions existing in each track (S302). The pulse sign is determined in advance according to sign information of a reference signal which is a weighted sum of the target signal x(n) of a residual domain and the vector dot product d.
Finally, after the pulse sign is determined, an optimal pulse position is searched for each codeword using the vector dot product d, which is the backward-filtered signal, and the autocorrelation function φ (S303). This procedure is repeated to search the pulse positions. That is, the optimal pulse for each codeword 00, 01, 10 and 11 is searched by using the calculated vector dot product, the autocorrelation function, and the pulse sign determined at every pulse position.
The codebook search is identical to the process for searching a code vector Ck maximizing a search standard Tk as represented by Formula 1:
Here, the vector dot product (d = H^t xw) is a backward filtered signal obtained by passing the given target signal (xw)[N×1] through the weighted combined filter H[N×N], the autocorrelation function (φ = H^t H) is an impulse response correlation matrix of the weighted combined filter, and k is the index of the candidate code vector.
The vector dot product (d)[N×1] and the autocorrelation function (φ)[N×N] are calculated before the codebook search, and their computational complexity is proportional to the square of the length of the sub frame.
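For orientation, the search criterion used by standard ACELP coders has the form shown below. It is offered as an assumed reading of Formula 1, built only from the quantities d, φ and Ck defined here, and not as a verbatim copy of the formula.

```latex
T_k = \frac{\left(d^{t} C_k\right)^{2}}{C_k^{t}\,\phi\,C_k}
```

The numerator is the squared correlation between the backward filtered signal and the code vector, and the denominator is the energy of the code vector after filtering.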
In the EVRC, the pulse sign (±1) is predetermined in each position of the tracks to simplify the codebook search for determining the optimal codebook vector. The optimal pulse position is then obtained based on Formula 1.
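The evaluation of the search standard for one candidate code vector can be sketched as follows. The sketch assumes the standard ACELP ratio of squared correlation to energy, which is an assumption about Formula 1, and it represents a code vector as a list of (position, sign) pairs; the name `search_criterion` is hypothetical.

```python
def search_criterion(d, phi, pulses):
    """Evaluate T_k = (d^t c)^2 / (c^t phi c) for a sparse code vector.

    d      : backward filtered signal, a list of N numbers
    phi    : N x N correlation matrix as a list of lists
    pulses : list of (position, sign) pairs, sign being +1 or -1
    The ratio form is an assumed reading of Formula 1.
    """
    # Correlation term: only the pulse positions contribute.
    correlation = sum(sign * d[pos] for pos, sign in pulses)
    # Energy term: double sum over all pairs of pulses.
    energy = sum(
        sign_a * sign_b * phi[pos_a][pos_b]
        for pos_a, sign_a in pulses
        for pos_b, sign_b in pulses
    )
    if energy <= 0:
        raise ValueError("code vector energy must be positive")
    return correlation * correlation / energy


if __name__ == "__main__":
    d = [1.0, 2.0, 3.0]
    phi = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    print(search_criterion(d, phi, [(1, +1), (2, -1)]))
```

Because the code vector has only a few nonzero pulses, both sums run over the pulses rather than over the whole sub frame, which is the source of the algebraic codebook's efficiency.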
In the second step, the backward filtered target vector dot product d and the autocorrelation function φ are calculated using the fixed codebook object signal xw and the impulse response matrix H of the first step as represented by Formula 2 (S402):
$$d = H^{t} x_w, \qquad \phi = H^{t} H \tag{2}$$
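The two products of Formula 2 can be sketched in plain Python as follows. The sketch treats H as a generic N×N matrix given as a list of rows and makes no assumption about its internal structure; the function names `backward_filtered_target` and `impulse_response_correlation` are hypothetical.

```python
def transpose(matrix):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(column) for column in zip(*matrix)]


def backward_filtered_target(H, xw):
    """Compute d = H^t xw."""
    return [sum(h * x for h, x in zip(row, xw)) for row in transpose(H)]


def impulse_response_correlation(H):
    """Compute phi = H^t H."""
    columns = transpose(H)  # row i of H^t is column i of H
    return [
        [sum(a * b for a, b in zip(col_i, col_j)) for col_j in columns]
        for col_i in columns
    ]


if __name__ == "__main__":
    H = [[1.0, 0.0], [2.0, 1.0]]
    xw = [3.0, 4.0]
    print(backward_filtered_target(H, xw))
    print(impulse_response_correlation(H))
```

The double loop in `impulse_response_correlation` visits every pair of columns, which reflects the statement that the cost of this precomputation grows with the square of the sub frame length.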
In the third step, the pulse sign (±1) is determined by using the vector dot product d of the second step (S403).
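A minimal sketch of the sign determination follows. It assumes the sign at each position is taken from d alone, with zero mapped to +1; the reference signal described earlier also weights in the residual-domain target signal, whose weights are not given here, so that part is omitted. The name `pulse_signs` is hypothetical.

```python
def pulse_signs(d):
    """Return +1 or -1 for every position, following the sign of d there.

    Assumption: only d is used and d == 0 maps to +1. The weighted
    reference signal mentioned in the text is not modelled.
    """
    return [1 if value >= 0 else -1 for value in d]


if __name__ == "__main__":
    print(pulse_signs([0.5, -2.0, 0.0]))
```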
In the four given track configuration codewords (jth=0, 1, 2, 3) of
After the pulse searches are done in each codeword order, when the search codeword jth exceeds 3 (“11”), the codeword order jth having the greatest codebook gain, namely the codeword Ck maximizing the search standard Tk in Formula 1, is selected in the fourth step (S408). When the codeword is selected, the pulse position, pulse sign and codebook gain of the corresponding track configuration codeword are determined as the optimal fixed codebook parameters (S409). That is, in the fourth step, the pulse position, pulse sign (±1) and codebook gain (scale) of the track configuration codeword c calculated in the third step are determined as the optimal fixed codebook parameters.
The process for obtaining the fixed codebook object signal xw and the impulse response matrix H through LPC analysis and residual signal correction and adaptive codebook search processes has been generally performed and therefore a detailed explanation is omitted. Also generally performed is the process for selecting the track configuration codeword that maximizes the search standard Tk in Formula 1 by doing pulse searches on the pulse positions of the tracks T0, T1, T2, T3 and T4 of
In the conventional fixed codebook search performed at the maximum data rate, the track configuration codeword searches of
An object of the invention is to solve at least the above problems and/or disadvantages and to provide at least the advantages described hereinafter.
Accordingly, one object of the present invention is to solve the foregoing problems by providing a method for searching a codebook which can reduce computational complexity of residual signal correction and fixed codebook search by, firstly, searching a track configuration codeword and, then, searching a pulse position of the searched codeword.
Another object of the present invention is to provide a method for searching a codebook which obtains each track energy and determines a value minimizing a sum of the two track energies as a track configuration codeword.
The foregoing and other objects and advantages are realized by providing a method for searching a codebook which calculates each track energy by using an energy formula including a vector dot product, arranges and selects codewords in ascending order of track energy, and searches and selects an optimal pulse for the single-pulse and double-pulse tracks of the selected codeword.
According to the present invention, the method for searching the codebook calculates each track energy in the fixed codebook search and determines in advance, as the track configuration codeword, the value minimizing the sum of the two track energies, so that the track configuration codeword search and the pulse position search are performed separately. This simplifies the fixed codebook search process and reduces computational complexity without degrading the combined voice.
Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objects and advantages of the invention may be realized and attained as particularly pointed out in the appended claims.
The invention will be described in detail with reference to the following drawings in which like reference numerals refer to like elements wherein:
The following detailed description is directed to a method for searching a codebook according to a preferred embodiment of the invention with reference to the accompanying drawings.
Referring to
A pulse sign si is determined by the vector dot product and the fixed codebook target signal (S502). Each track energy is calculated using the vector dot product d, and the track configuration codeword q whose single-pulse track pair has the minimum energy among the calculated energies is selected (S503). The track configuration codeword determination is performed separately from the pulse position search.
In accordance with the present invention, the pulse implies a signal element and a size of the track energy is dependent upon the number of pulses. That is to say, the track configuration codewords of
Accordingly, in order to determine the track configuration codeword, the energies E(i) distributed in each track i are calculated using the previously-determined vector dot product before the codebook search is performed. This is represented by Formula 3:
In the above formula, i represents a track and n is pulse position 0 to 10. The track distribution energies determine the track configuration codewords (q=00, 01, 10, 11).
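A form consistent with these index definitions is given below. It is an assumed reading of Formula 3, not a verbatim copy, and it takes the energy of track i to be the sum of the squared vector dot product over the eleven positions 5n+i of that track.

```latex
E(i) = \sum_{n=0}^{10} d^{2}(5n + i), \qquad 0 \le i \le 4
```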
An optimal pulse is searched by searching the pulse positions of
The fixed codebook target signal xw and the impulse response matrix H are obtained through the LPC analysis, residual signal correction and adaptive codebook search processes, and the vector dot product (d = H^t xw) and the autocorrelation function (φ = H^t H) are respectively calculated using the fixed codebook target signal xw and the impulse response matrix H (S601).
The pulse sign si is determined according to the vector dot product and the fixed codebook target signal (S602 and S603).
The pulse sign (±1) is determined at the pulse positions of each track (S603). The pulse sign is determined in advance according to sign information of a reference signal which is a weighted sum of the target signal x(n) of a residual domain and the vector dot product d. That is, the pulse sign si is determined according to the vector dot product d and the fixed codebook target signal (S603), each track energy is calculated using the vector dot product d, and the track configuration codeword q whose single-pulse track pair has the minimum energy among the calculated energies is selected. The track configuration codeword determination is performed separately from the pulse position search. That is, the track configuration codewords of
Accordingly, in order to determine the track configuration codeword, the energies E(i) distributed in each track may be calculated using the previously-determined vector dot product before the codebook search (S604).
The energies E(i) distributed in each track are preferably calculated using Formula 3. The track distribution energy E(i) may be obtained by squaring the value of the vector dot product d at each pulse position of the track T0, T1, T2, T3 or T4 and summing the squared values over all pulse positions of that track.
In applying Formula 3, E(0) is the track distribution energy which is a sum of the energies of the whole positions existing in the first track T0, E(1) is the track distribution energy which is a sum of the energies of the whole positions existing in the second track T1, E(2) is the track distribution energy which is a sum of the energies of the whole positions existing in the third track T2, E(3) is the track distribution energy which is a sum of the energies of the whole positions existing in the fourth track T3, and E(4) is the track distribution energy which is a sum of the energies of the whole positions existing in the fifth track T4.
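The five track distribution energies can be sketched as follows. The sketch assumes that E(i) is the sum of squared values of d over positions 5n+i, n = 0 to 10, which is an interpretation of Formula 3; the name `track_energies` is hypothetical.

```python
def track_energies(d):
    """Return [E(0), ..., E(4)] for a backward filtered signal d.

    Assumes E(i) = sum over n = 0..10 of d[5*n + i] ** 2, an
    interpretation of Formula 3. d must cover positions 0..54.
    """
    if len(d) < 55:
        raise ValueError("d must cover positions 0..54")
    return [sum(d[5 * n + i] ** 2 for n in range(11)) for i in range(5)]


if __name__ == "__main__":
    print(track_energies([1.0] * 55))
```

The computation needs one pass over d, so its cost is small compared with a pulse position search over all four track configurations.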
The track configuration codewords {E(3),E(4)},{E(4),E(0)},{E(0),E(1)} and {E(1),E(2)} are determined using the respective track distribution energies. To this end, energies ε(j) for the single-pulse track pairs of each track configuration codeword are calculated rather than energies for the double-pulse track pairs, which have a high value. The energy for a single-pulse track pair is obtained by adding the two track distribution energies (S605). The energies ε(j) for the single-pulse track pairs are compared with one another, and the single-pulse track pair having the minimum energy determines the track configuration codeword jth (S606). In addition, the pulse positions of the single-pulse tracks and the double-pulse tracks are searched only for the selected track configuration codeword jth (S607).
Here, selection of the minimum energy value implies selection of few pulses. More specifically, the respective track distribution energies are calculated, the energies {E(3)+E(4)},{E(4)+E(0)},{E(0)+E(1)} and {E(1)+E(2)} for the single-pulse track pairs are formed by using the track distribution energies, and the minimum value of the energies for the single-pulse track pairs is searched to select the track configuration codeword.
The energies ε(j) for the single-pulse track pairs are preferably calculated using the track distribution energies E(i) represented by Formula 4:
$$\varepsilon(j) = E\big((j+3)\,\%\,5\big) + E\big((j+4)\,\%\,5\big), \qquad 0 \le j \le 3 \tag{4}$$
Here, % represents a modulo operation.
Substituting j = 0 to 3 into Formula 4 gives the four sums of the energies for the single-pulse track pairs.
The minimum value of the sum of the energies ε(j) for each single-pulse track pair is searched among the four energies ε(0), ε(1), ε(2) and ε(3) for the single-pulse track pairs, and its track configuration codeword order jth is obtained.
When the minimum value of the sum of the energies ε(j) for each single-pulse track pair is {E(3)+E(4)}, the track configuration codeword jth is determined as q=0(“00”), when it is {E(4)+E(0)}, the track configuration codeword jth is determined as q=1(“01”), when it is {E(0)+E(1)}, the track configuration codeword jth is determined as q=2(“10”), and when it is {E(1)+E(2)}, the track configuration codeword jth is determined as q=3(“11”).
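The selection of the track configuration codeword from the five track energies can be sketched as follows. The pair energies follow Formula 4 directly; the name `select_track_configuration` and the choice of the lowest index when two pair energies are equal are assumptions, since tie handling is not described.

```python
def select_track_configuration(E):
    """Pick the track configuration codeword from track energies E(0..4).

    Returns (q, codeword_bits, eps), where eps[j] = E[(j+3) % 5] + E[(j+4) % 5]
    as in Formula 4 and q is the index of the smallest eps[j].
    Assumption: ties are resolved in favour of the lowest j.
    """
    if len(E) != 5:
        raise ValueError("E must hold the five track energies E(0)..E(4)")
    eps = [E[(j + 3) % 5] + E[(j + 4) % 5] for j in range(4)]
    q = min(range(4), key=lambda j: eps[j])
    return q, format(q, "02b"), eps


if __name__ == "__main__":
    print(select_track_configuration([5.0, 1.0, 1.0, 4.0, 6.0]))
```

With the sample energies in the usage line, the pair T1-T2 has the smallest sum, so the sketch returns q = 3, which corresponds to the codeword "11" and single-pulse tracks T1 and T2.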
The single-pulse track and the double-pulse track as shown in
The energies of each track of
The minimum value of the calculated energies has few pulses (signal elements), and thus the minimum energy is selected and arranged as the single-pulse track pair (S704).
The track configuration codeword order jth is obtained by comparing the minimum values of the sums of the energies ε(j) of each single-pulse track pair.
The pulse searches are done on the single/double-pulse tracks of the codeword of the selected track, thereby searching/selecting the optimal pulse position.
The foregoing embodiments and advantages are merely exemplary and are not to be construed as limiting the present invention. The present teaching can be readily applied to other types of apparatuses. The description of the present invention is intended to be illustrative, and not to limit the scope of the claims. Many alternatives, modifications, and variations will be apparent to those skilled in the art. In the claims, means-plus-function clauses are intended to cover the structures described herein as performing the recited function and not only structural equivalents but also equivalent structures.
Number | Date | Country | Kind |
---|---|---|---|
2001-65278 | Oct 2001 | KR | national |
Number | Date | Country | |
---|---|---|---|
20030078771 A1 | Apr 2003 | US |