The present invention relates to a vector quantization apparatus, vector dequantization apparatus, and vector quantization and dequantization methods for vector-quantization of LSP (Line Spectral Pairs) parameters. In particular, the present invention relates to a vector quantization apparatus, vector dequantization apparatus, and vector quantization and dequantization method for vector-quantization of LSP parameters that are used in a speech encoding and decoding apparatus that transmits speech signals in fields of, particularly, a packet communication system represented by Internet communication and a mobile communication system.
In the field of digital wireless communication, packet communication represented by Internet communication and speech storage, speech signal encoding and decoding techniques are essential for effective use of the channel capacity of radio waves and storage media. In particular, a CELP speech encoding and decoding technique is a mainstream technique (for example, see non-patent document 1).
A CELP speech encoding apparatus encodes input speech based on pre-stored speech models. To be more specific, the CELP (Code Excited Linear Prediction) speech encoding apparatus separates a digital speech signal into frames of regular time intervals, for example, frames of approximately 10 to 20 ms, performs a linear prediction analysis of a speech signal on a per frame basis, finds the linear prediction coefficients (“LPC's”) and linear prediction residual vector, and encodes the linear prediction coefficients and linear prediction residual vector individually. As a method of encoding linear prediction coefficients, a CELP speech encoding apparatus generally adopts a method of converting the linear prediction coefficients into LSP parameters and encoding these LSP parameters. As a method of encoding LSP parameters, a CELP speech encoding apparatus oftentimes performs vector quantization of LSP parameters. As a method of vector quantization, to reduce the amount of calculations in vector quantization, multistage vector quantization is used oftentimes (e.g., see Non-Patent Document 2). Multistage vector quantization is a method of quantizing a vector to be quantized, over a plurality of stages. For example, by further quantizing early-stage quantization error in the later stages, it is possible to improve the accuracy of vector quantization.
However, although the above-noted multistage vector quantization is an efficient method to enable more LSP vectors and code vectors to be matched with a smaller amount of calculations, matching in each stage is performed with respect to one candidate, and, consequently, the performance is not sufficient. To improve performance, although it is possible to adopt a method of leaving a plurality of candidates in the first stage and performing a search using these candidates in the second stage, there is a problem that the amount of calculations increases.
It is therefore an object of the present invention to provide a vector quantization apparatus, vector dequantization apparatus, and vector quantization and dequantization methods that can perform vector quantization in the later stages based on the vector quantization result in the early stage in multistage vector quantization and improve quantization accuracy with a smaller amount of calculations and lower bit rate.
The vector quantization apparatus of the present invention employs a configuration having: a first quantization section that comprises a first codebook, quantizes an input vector to generate a first code and first quantization vector; a quantization residual generating section that generates a residual between the vector and the first quantization vector, as a quantization residual vector; an additive factor selecting section that comprises an additive factor codebook and selects an additive factor vector associated with the first code from the additive factor codebook; an additive residual generating section that generates a residual between the first quantization vector and the additive factor vector, as an additive residual vector; and a second quantization section that comprises a second codebook, quantizes the additive residual vector to generate a second code.
The vector dequantization apparatus of the present invention employs a configuration having: a first dequantization section that comprises a first codebook, dequantizes a first code acquired from a received quantized vector code to generate a first quantization vector; a second dequantization section that comprises a second codebook, dequantizes a second code acquired from the quantized vector code to generate a quantization additive residual vector; an additive factor selecting section that comprises an additive factor codebook and selects an additive factor vector associated with the first code from the additive factor codebook; a quantization residual generating section that generates a quantization residual vector by adding the quantization additive residual vector and the additive factor vector; and a quantization vector generating section that generates a quantized vector by adding the first quantization vector and the quantization residual vector.
According to the present invention, a vector is quantized in a plurality of stages, and the vector space of code vectors to be used for quantization in the later stages is adjusted adaptively based on a quantization result in the early stage, so that it is possible to improve quantization accuracy with a smaller amount of calculations and lower bit rate.
Embodiments of the present invention will be explained below in detail with reference to the accompanying drawings. Further, an example will be explained where an LSP vector quantization apparatus, LSP vector dequantization apparatus, and quantization method and dequantization methods are used as the vector quantization apparatus, vector dequantization apparatus, and quantization and dequantization methods according to the present invention.
Further, a vector of LSP (Line Spectral Pairs) parameters is abbreviated to “LSP vector.” Further, in the present invention, a factor (vector) to move the centroid that is the center of a code vector space by addition or subtraction with respect to all code vectors forming a codebook, is referred to as “additive factor.” Further, actually, it is more likely to subtract an additive factor vector from the vector of the quantization target for use, rather than add an additive factor vector to a code vector for use. Further, this residual (vector) that is a result of subtracting an additive factor vector from the vector of the quantization target is referred to as “additive residual.”
LSP vector quantization apparatus 100 is provided with first quantization section 101, quantization residual generating section 102, additive factor selecting section 103, additive residual generating section 104, second quantization section 105 and multiplexing section 106.
First quantization section 101 incorporates a first codebook comprised of a plurality of first code vectors, and quantizes an inputted LSP vector using the built-in first codebook, calculates the first quantization vector and first code, outputs the first code to additive factor selecting section 103 and multiplexing section 106, and outputs the first quantization vector to quantization residual generating section 102.
Quantization residual generating section 102 calculates the residual between the input LSP vector and the first quantization vector received as input from first quantization section 101, and outputs the calculated residual to additive residual generating section 104 as a quantization residual vector.
Additive factor selecting section 103 incorporates an additive factor codebook comprised of a plurality of additive factor code vectors, and, based on the first code received as input from first quantization section 101, selects one additive factor code vector from the additive factor codebook. Additive factor selecting section 103 outputs the selected additive factor code vector to additive residual generating section 104 as an additive factor vector.
Additive residual generating section 104 calculates the residual between the quantization residual vector received as input from quantization residual generating section 102 and the additive factor vector received as input from additive factor selection 103, and outputs the calculated residual to second quantization section 105 as an additive residual vector.
Second quantization section 105 incorporates a second codebook comprised of a plurality of second code vectors, and quantizes the additive residual vector received as input from additive residual generating section 104 using the built-in second codebook and outputs the resulting second code to multiplexing section 106.
Multiplexing section 106 multiplexes the first code received as input from first quantization section 101 and the second code received as input from second quantization section 105, and outputs the resulting code as a quantized vector code.
The operations of LSP vector quantization apparatus 100 will be explained below with an example case where the order of the LSP vector of the quantization target is R. Further, an LSP vector is expressed as LSP(i) (i=0, 1, . . . , R-1).
First quantization section 101 calculates the square error between an inputted LSP vector LSP(i) (i=0, 1, . . . , R-1) and each first code vector CODE_P(m)(i) (m=0, 1, . . . , M-1, i=0, 1, . . . , R-1) forming the built-in first codebook, according to following equation 1.
Here, m is the index of each first code vector forming the first codebook, and M is the total number of the first code vectors forming the first codebook. First quantization section 101 outputs the value m_min of m where the square error Err_P(m) is minimum amongst the resulting square errors Err_P(m) (m=0, 1, . . . , M-1) associated with the M first code vectors, to additive factor selecting section 103 and multiplexing section 106 as a first code. Further, the first code vector CODE_P(m
Quantization residual generating section 102 calculates the residual ERR(i) (i=0, 1, . . . , R-1) between the input LSP vector LSP(i) (i=0, 1, . . . , R-1) and the first quantization vector CODE_P(m
ERR(i)=LSP(i)−CODE—P(m
Quantization residual generating section 102 outputs the calculated ERR(i) (i=0, 1, . . . , R-1) to additive residual generating section 104 as a quantization residual vector.
Additive factor selecting section 103 selects the additive factor code vector ADD_F(m
Additive residual generating section 104 calculates the residual A_ERR(i) (i=0, 1, . . . , R-1) between the quantization residual vector ERR(i) (i=0, 1, . . . , R-1) received as input from quantization residual generating section 102 and the additive factor vector ADD_F(m
A_ERR(i)=ERR(i)−ADD—F(m
Additive residual generating section 104 outputs the calculated A_ERR(i) (i=0, 1, . . . , R-1) to second quantization section 105 as an additive residual vector.
Second quantization section 105 calculates the square error Err_F(n) (n=0, 1, . . . , N−1) between the additive residual vector A_ERR(i) (i=0, 1, . . . , R-1) received as input from additive residual generating section 104 and each second code vector CODE_F(n)(i) (i=0, 1, . . . , R-1, n=0, 1, . . . , N-1) forming the built-in second codebook, according to following equation 4.
Here, n is the index of each second code vector forming the second codebook, and N is the total number of second code vectors forming the second codebook. Second quantization section 105 outputs the value n_min of n where the square error Err_F(n) is minimum amongst N square errors Err_F(n) (n=0, 1, . . . , N-1) calculated, to multiplexing section 106 as a second code. That is, second quantization section 105 selects the second code vector from the second codebook, where the similarity is maximized between the second code vector and the additive residual vector.
Multiplexing section 106 multiplexes the first code m_min received as input from first quantization section 101 and the second code n_min received as input from second quantization section 105, and transmits the resulting quantized vector code to LSP vector dequantization apparatus 150.
The first codebook, additive factor codebook and second codebook used in LSP vector quantization apparatus 100 are prepared and created in advance by learning, and the method of learning these codebooks will be explained.
To prepare the first codebook included in first quantization section 101 by learning, first, a large number of, for example, V LSP vectors are prepared from a large amount of learning speech data. Next, M first code vectors CODE_P(m)(i) (m=0, 1, . . . , M-1, i=0, 1, . . . , R-1) are calculated using these V LSP vectors according to learning algorithms such as the LBG (Linde Buzo Gray) algorithm, to generate the first codebook.
To prepare the additive factor codebook included in additive factor selecting section 103 by learning, first, a large number of, for example, V′ LSP vectors are prepared from a large amount of learning speech data. Next, for one arbitrary LSP vector amongst prepared V′ LSP vectors, for example, for LSP(V′S)(i) (where v′s is an integer of 0≦v′s≦V′−1), the index ms of the first code vector CODE_P(ms)(i) (where ms is an integer of 0≦ms≦M-1) minimizing the square error between LSP(v′s)(i) and the first code vector CODE_P(ms)(i), is selected as the first code m_min from the first codebook, according to above equation 1. By repeating the same processing, the first codes m_min associated with all LSP vectors LSP(v′)(i) (where v′s is an integer of 0≦v′s≦V′−1) are calculated and stored. Next, more than one LSP vector LSP(v′s)(i) are extracted where the index ms of one arbitrary first code vector of first code vectors in the first codebook, for example, CODE_P(ms)(i) (where ms is an integer of 0≦ms≦M-1), is used as the first code. Next, in each of more than one extracted LSP vector LSP(v′s)(i), the residual vector ERR(i) (i=0, 1, . . . , R-1) is calculated which is the residual between the LSP vector LSP(v′s)(i) and the first code vector CODE_P(ms)(i), according to above equation 2. Next, the center (centroid) vector of more than one calculated residual vector ERR(i) (1=0, 1, . . . , R-1) is found and used as the additive factor code vector ADD_F(ms)(i) (i=0, 1, . . . , R-1) associated with the index ms. By repeating the same processing, additive factor code vectors ADD_F(m)(i) (m=0, 1, . . . , M-1, i=0, 1, . . . , R-1) associated with the indexes m of all first code vectors CODE_P(m)(i) are calculated to generate the additive factor codebook.
In other words, more than one LSP vector are extracted in association with the same first code acquired by performing first vector quantization using the first codebook comprised of M first code vectors and V′ LSP vectors. Next, a plurality of residual vectors are calculated by subtracting the first code vector associated with the first code from each of the extracted LSP vectors, the center (centroid) of the plurality of calculated residual vectors are found, and this centroid vector is used as an additive factor code vector. Thus, all additive factor code vectors associated with the index m (m=0, 1, . . . , M-1) of each first code vector in the first codebook are found to generate the additive factor codebook.
Thus, by calculating the first codebook and the additive factor codebook, the second codebook used in second quantization section 105 can be calculated by learning, using the calculated first codebook and additive factor codebook. To be more specific, first, the first codebook and the additive factor codebook are created as described above, and a large number of, for example, V LSP vectors are calculated from a large amount of learning speech data. Next, first vector quantization is performed for V calculated LSP vectors. For example, for the vs-th (0≦vs≦V-1) LSP vector LSP(vs)(i) (i=0, 1, . . . , R-1), the first code m_min is calculated according to equation 1. Next, according to equation 2, the residual vector ERR(i) (i=0, 1, . . . , R-1) is acquired which is the residual between the LSP vector LSP(vs)(i) (i=0, 1, . . . , R-1) and the first code vector CODE_P(m
LSP vector dequantization apparatus 150 is provided with code demultiplexing section 151, second dequantization section 152, additive factor selecting section 153, quantization residual generating section 154, first dequantization section 155 and quantized LSP vector generating section 156. Further, second dequantization section 152 has the same second codebook as in second quantization section 105. Further, additive factor selecting section 153 has the same additive factor codebook as in additive factor selecting section 103. Further, first dequantization section 155 has the same first codebook as in first quantization section 101.
Code demultiplexing section 151 performs demultiplexing processing for the quantization vector code transmitted from LSP vector quantization apparatus 100 into the first code and the second code. Code demultiplexing section 151 outputs the first code to additive factor selecting section 153 and first dequantization section 155, outputs the second code to second dequantization section 152.
Second dequantization section 152 dequantizes the second code received as input from code demultiplexing section 151 using the built-in second codebook, and outputs the resulting second code vector to quantization residual generating section 154 as a quantization additive residual vector.
Additive factor selecting section 153 selects one additive factor code vector from the built-in additive factor codebook based on the first code received as input from code demultiplexing section 151, and outputs the selected additive factor code vector to quantization residual generating section 154 as an additive factor vector.
Quantization residual generating section 154 outputs to quantized LSP vector generating section 156 a quantization residual vector acquired by adding the quantization additive residual vector received as input from second dequantization section 152 and the additive factor vector received as input from additive factor selecting section 153.
First dequantization section 155 dequantizes the first code received as input from code demultiplexing section 151 using the built-in first codebook and outputs the resulting first quantization vector to quantized LSP vector generating section 156.
Quantized LSP vector generating section 156 outputs a quantized LSP vector acquired by adding the first quantization vector received as input from first dequantization section 155 and the quantization residual vector received as input from quantization residual generating section 154.
The operations of LSP vector dequantization apparatus 150 will be explained below.
Code demultiplexing section 151 performs demultiplexing processing for the inputted quantized vector code into the first code m_min and the second code n_min, and outputs the first code m_min to additive factor selecting section 153 and first dequantization section 155, and outputs the second code n_min to second dequantization section 152.
Second dequantization section 152 selects the second code vector CODE_F(n
Q
—
A_ERR(i)−CODE—F(n
Additive factor selecting section 153 selects the additive factor code vector ADD_F(m
Quantization residual generating section 154 adds the quantization additive residual vector Q_A_ERR(i) (i=0, 1, . . . , R-1) received as input from second dequantization section 152 and the additive factor vector ADD_F(m
Q_ERR(i)=Q—A_ERR(i)+ADD—F(m
First dequantization section 155 selects the first code vector CODE_P(m
Quantized LSP vector generating section 156 adds the quantization residual vector Q_ERR(i) (i=0, 1, . . . , R-1) received as input from quantization residual generating section 154 and the first quantization vector CODE_P(m
Q_LSP(i)=Q_ERR(i)+CODE—P(m
As described above, according to the present embodiment, an LSP vector quantization apparatus that quantizes first quantization and second quantization in two stages adjusts the vector space of second code vectors for the second quantization adaptively, using an additive factor associated with the quantization result of the first quantization, so that it is possible to improve the accuracy of LSP vector quantization with a smaller amount of calculations and lower bit rate.
Further, although an example case has been explained with the present embodiment where the first code vectors forming a first codebook and the additive factor code vectors forming an additive factor codebook are associated in one-to-one association, the present invention is not limited to this, and the first code vectors in the first codebook and the additive factor code vectors in the additive factor codebook can be associated in N-to-one association (N is an integer equal to or greater than 2).
Further, although an example case has been explained with the present embodiment where the first code vectors forming a first codebook and the additive factor code vectors forming an additive factor codebook are associated in one-to-one association, the present invention is not limited to this, and the first code vectors forming the first codebook and the additive factor code vectors forming the additive factor codebook can be associated in one-to-N association (N is an integer equal to or greater than 2). In this case, in more than two additive factor code vectors associated with the first code, it may be preferable to select the additive factor code vector minimizing the square error Err_F(n) (n=0, 1, . . . , N-1) calculated by equation 4. In this case, the LSP vector quantization apparatus needs to report information as to which additive factor vector is selected, to the LSP vector dequantization apparatus. For example, when the number of additive factor code vectors associated with the first code is 2x, it may be preferable to report information as to which additive factor code vector is selected from 2x additive factor code vectors, to the LSP dequantization apparatus by sending X-bits information.
Further, although an example case has been explained with the present embodiment where vector quantization is performed for an LSP vector in two stages, the present invention is not limited to this, and it is equally possible to perform vector quantization in three stages or more.
Further, although an example case has been explained with the present embodiment where vector quantization is performed for an LSP vector in two stages, the present invention is not limited to this, and it is equally possible to perform vector quantization while using split vector quantization. For example, when vector quantization is performed for an additive residual vector in the second stage, it is possible to split the additive residual vector into a plurality of vectors and perform vector quantization for the plurality of split vectors. In this case, it may be preferable to prepare different codebooks according to the order of the split vectors.
Further, although an example case has been explained with the present embodiment where an LSP vector is used as the quantization target, the quantization target is not limited to this, and vectors other than an LSP vector are equally possible.
Further, although a case has been described with the present embodiment where LSP vector dequantization apparatus 150 decodes a quantized vector code transmitted from LSP vector quantization apparatus 100, the present invention is not limited to this, and it is needless to say that, when encoded data in a form that can be decoded by LSP vector dequantization apparatus 150 is transmitted as a quantized vector code, vector dequantization apparatus 150 can receive and decode encoded data even if the encoded data is not transmitted from LSP vector quantization apparatus 100.
Scaling factor selecting section 201 incorporates a scaling factor table comprised of a plurality of scaling factors and selects one scaling factor associated with a first code received as input from first quantization section 101, from the built-in scaling factor table. Scaling factor selecting section 201 outputs the selected scaling factor to second quantization section 205.
Second quantization section 205 multiplies second code vectors by the scaling factor received as input from scaling factor selecting section 201, quantizes an additive residual vector received as input from additive residual generating section 104 using the second codebook multiplied by the scaling factor, and outputs the resulting second code to multiplexing section 106.
To be more specific, scaling factor selecting section 201 and second quantization section 205 having the above-noted configuration will perform the following operations.
Scaling factor selecting section 201 selects the scaling factor AMP(m
Second quantization section 205 calculates the square error Err_F(n) (n=0, 1, . . . , N-1) between vectors acquired by multiplying second code vectors CODE_F(n)(i) (i=0, 1, . . . , R-1, n=0, 1, . . . , N-1) forming the built-in second codebook by the scaling factor AMP(m
Here, n is the index of each second code vector forming the second codebook, and N is the total number of the second code vectors forming the second codebook. In N square errors Err_F(n) (n=0, 1, . . . , N-1) calculated, second quantization section 205 outputs the value n_min of n where the square error Err_F(n) is minimum, as the second code, to multiplexing section 106.
The scaling factor table used in scaling factor selecting section 201 is prepared and created in advance by learning, and the method of learning the scaling factor table will be explained.
To prepare the scaling factor table included in scaling factor selecting section 201 by learning, as shown in Embodiment 1, after the first codebook, additive factor codebook and second codebook are prepared by learning, a large number of, for example, V̂ LSP vectors acquired from a large amount of speech data for learning are prepared. Next, the first codes associated with the prepared V̂ LSP vectors are prepared. For example, for LSP(v̂s)(i) (where v̂s is an integer of 0≦v̂s≦v̂s−1), the index ms of the first code vector CODE_P(ms)(i) (where ms is an integer of 0≦ms≦M-1) minimizing the square error between LSP(v̂s)(i) and the first code vector is selected as the first code m_min from the first codebook, according to above equation 1. By repeating the same processing, the first codes m_min associated with all LSP vectors LSP(v̂s)(i) (where v̂s is an integer of 0≦v̂s≦v̂s−1) are calculated and stored. Next, more than one LSP vector LSP(v̂s)(i) are extracted where the index ms of an arbitrary first code vector forming the first codebook, for example, CODE_P(ms)(i) (where ms is an integer of 0≦ms≦M-1), is used as the first code m_min. Next, in each of more than one extracted LSP vector LSP(v̂s)(i), the residual vector ERR(i) (i=0, 1, . . . , R-1) which is the residual between the LSP vector LSP(v̂s)(i) and the first code vector CODE_P(ms)(i), is calculated according to above equation 2. Next, the additive residual vector A_ERR(i) (i=0, 1, . . . , R-1) which is the residual between the residual vector ERR(i) (i=0, 1, . . . , R-1) and the additive factor ADD_F(m
By repeating the same processing for more than one LSP vector LSP(v̂s)(i) where the index ms is used as the first code m_min, the second code n_min associated with each of LSP vectors LSP(v̂s)(i) are calculated and stored. Next, AMP(m
Here, W(ms) is the total number of LSP vectors LSP(v̂s)(i) where the index ms is used as the first code m_min. A_ERR(w)(i) (i=0, 1, . . . , R-1, w=0, 1, . . . , W(ms)−1) is the additive residual vector acquired from the LSP vector LSP(v̂s)(i) where the index ms is used as the code m_min. CODE_F(n
Thus, the scaling factor AMP(m
Scaling factor selecting section 251 incorporates the same scaling factor table as in scaling factor selecting section 201 of LSP vector quantization 200, and selects a scaling factor associated with the first code received as input from code demultiplexing section 151, from the built-in scaling factor table, and outputs the selected scaling factor to second dequantization section 252.
Second dequantization section 252 dequantizes the second code received as input from code demultiplexing section 151 using the built-in second codebook, multiplies the resulting second code vector by the scaling factor received as input from scaling factor selecting section 251 and outputs the second code vector multiplied by the scaling factor to quantization residual generating section 154.
To be more specific, scaling factor selecting section 251 and second dequantization section 252 having the above-noted configuration will perform the following operations.
Scaling factor selecting section 251 selects the scaling factor AMP(m
Second dequantization section 252 selects the second code vector CODE_F(n
Q
—
A_ERR(i)=AMP(m
As described above, according to the present embodiment, the LSP vector quantization apparatus that performs first quantization and second quantization adjusts the vector space of second code vectors for second quantization further adaptively, using a scaling factor associated with the quantization result in the first quantization, so that it is possible to further improve the accuracy of LSP vector quantization with a lower amount of calculations and lower bit rate.
With the present embodiment, multistage vector quantization in two stages is performed for an LSP vector, and, furthermore, in the vector quantization in the second stage, the LSP vector is split into two and quantized using the vector quantization result in the first stage.
In
Vector split section 301 splits a quantization residual vector received as input from quantization residual generating section 102 into two to generate two split vectors. Vector split section 301 outputs a split vector with a lower order associated with a lower frequency domain, as a first split vector, to first additive residual generating section 303, and outputs a split vector with a higher order associated with a higher frequency domain, as a second split vector, to second additive residual generating section 307.
First additive factor selecting section 302 incorporates a first additive factor codebook comprised of a plurality of first additive factor code vectors, and selects one additive factor code vector from the first additive factor codebook based on the first code received as input from first quantization section 101. First additive factor selecting section 302 outputs the selected first additive factor code vector to first additive residual generating section 303 as a first additive factor vector.
First additive residual generating section 303 calculates the residual between the first split vector received as input from vector split section 301 and the first additive factor vector received as input from first additive factor selecting section 302, and outputs the calculated residual to second quantization section 305 as a first additive residual vector.
Scaling factor selecting section 304 incorporates a scaling factor table comprised of a plurality of scaling factors, and selects one scaling factor from the scaling factor table based on the first code received as input from first quantization section 101. Scaling factor selecting section 304 outputs the selected scaling factor to second quantization section 305 and third quantization section 308.
Second quantization section 305 incorporates a first split codebook comprised of a plurality of first split code vectors, and multiplies each first code vector by the scaling factor received as input from scaling factor selecting section 304. Further, second quantization section 305 quantizes the first additive residual vector received as input from first additive residual generating section 303, using the first split codebook multiplied by the scaling factor, and outputs the resulting second code to second additive factor selecting section 306 and multiplexing section 309.
Second additive factor selecting section 306 incorporates a second additive factor codebook comprised of a plurality of second additive factor code vectors, and selects one second additive factor code vector from the second additive factor codebook based on the second code received as input from second quantization section 305. Second additive factor selecting section 306 outputs the selected second additive factor code vector to second additive residual generating section 307 as a second additive factor vector.
Second additive residual generating section 307 calculates the residual between the second split vector received as input from vector split section 301 and the second additive factor vector received as input from second additive factor selecting section 306, and outputs the residual to third quantization section 308 as a second additive residual vector.
Third quantization section 308 incorporates a second split codebook comprised of a plurality of second split code vectors, and multiplies each second split code vector by the scaling factor received as input from scaling factor selecting section 304. Further, third quantization section 308 quantizes the second additive residual vector received as input from second additive residual generating section 307, using the second split codebook multiplied by the scaling factor, and outputs the resulting third code to multiplexing section 309.
Multiplexing section 309 multiplexes the first code received as input from first quantization section 101, the second code received as input from second quantization section 305 and the third code received as input from third quantization section 308, and outputs the resulting code as a quantized vector code.
To be more specific, vector split section 301, first additive factor selecting section 302, first additive residual generating section 303, scaling factor selecting section 304, second quantization section 305, second additive factor selecting section 306, second additive residual generating section 307, third quantization section 308 and multiplexing section 309 having the above-noted configuration performs the following operations.
Vector split section 301 splits the quantization residual vector ERR(i) (i=0, 1, . . . , R-1) received as input from quantization residual generating section 102 into the first split vector of order R_P and the second split vector of order R_F, according to following equation 11.
ERR—P(i)=ERR(i) (i=0, . . . , R—P-1)
ERR—F(i)=ERR(i+R—P) (i=0, . . . , R—F-1) (Equation 11)
Here, the total of R_P and R_F is R, and therefore the relationship R_P+R_F=R holds. Vector split section 301 outputs ERR_P(i) (i=0, 1, . . . , R_P-1) to first additive residual generating section 303 as a first split vector and outputs ERR_F(i) (i=0, 1, . . . , R_F-1) to second additive residual generating section 307 as a second split vector.
First additive factor selecting section 302 selects the first additive factor code vector ADD_F_P(m
First additive residual generating section 303 calculates the residual A_ERR_P(i) (i=0, 1, . . . , R_P-1) between the first split vector ERR_P(i) (i=0, 1, . . . , R_P-1) received as input from vector split section 301 and the first additive factor vector ADD_F_P(n
A_ERR—P(i)=ERR—P(i)−ADD—F—P(m
First additive residual generating section 303 outputs the calculated A_ERR_P(i) (i=0, 1, . . . , R_P-1) to second quantization section 305 as a first additive residual vector.
Scaling factor selecting section 304 selects the scaling factor AMP(m
Second quantization section 305 multiplies first split code vectors CODE_F_P(n)(i) (i=0, 1, . . . , R_P-1, n=0, 1, . . . , N-1) forming the built-in first split codebook by the scaling factor AMP(m
Here, n is the index of each first split code vector forming the first split codebook, and N is the total number of first split code vectors forming the first split codebook. Second quantization section 305 outputs the value n min of n where the square error Err_F_P(n) is minimum amongst the N square errors Err_F_P(n) (n=0, 1, . . . , N-1) calculated, to second additive factor selecting section 306 and multiplexing section 309 as a second code.
Second additive factor selecting section 306 selects the second additive factor code vector ADD_F_F(n
Second additive residual generating section 307 calculates the residual A_ERR_F(i) (i=0, 1, . . . , R_F-1) between the second split vector ERR_F(i) (i=0, 1, . . . , R_F-1) received as input from vector split section 301 and the second additive factor vector ADD_F_F(n
A_ERR—F(i)=ERR—F(i)−ADD—F—F(n
Second additive residual generating section 307 outputs the calculated A_ERR_F(i) (i=0, 1, . . . , R_F-1) to third quantization section 308 as a second additive residual vector.
Third quantization section 308 multiplies second split code vectors CODE_F_F(o)(i) (i=0, 1, . . . , R_F-1, o=0, 1, . . . , O-1) forming the built-in second split codebook by the scaling factor AMP(m
Here, o is the index of each second split cod e vector forming the second split codebook, and O is the total number of the second split code vectors forming the second split codebook. In O square errors Err_F_F(o) (o=0, 1, . . . , O-1) calculated, third quantization section 308 outputs the value o_min of o where the square error Err_F_F(o) is minimum, to multiplexing section 309 as a second code.
Multiplexing section 309 multiplexes the first code m_min received as input from first quantization section 101, the second code n_min received as input from second quantization section 305 and the third code o_min received as input from third quantization section 308, and transmits the resulting quantized vector code to LSP vector dequantization apparatus 350.
LSP vector dequantization apparatus 350 is provided with first dequantization section 155, quantized LSP vector generating section 156, code demultiplexing section 351, scaling factor selecting section 352, second dequantization section 353, third dequantization section 354, first additive factor selecting section 355, first quantization split vector generating section 356, second additive factor selecting section 357, second quantization split vector generating section 358 and vector combining section 359. Here, first dequantization section 155 and quantized LSP vector generating section 156 are the same as first dequantization section 155 and quantized LSP vector generating section 156 in Embodiment 2, and therefore explanation will be omitted. Further, scaling factor selecting section 352 has the same scaling factor table as in scaling factor selecting section 304 of LSP vector quantization apparatus 300. Further, second dequantization section 353 has the same first split codebook as in second quantization section 305 of LSP vector quantization section 300. Further, third dequantization 354 has the same second split codebook as in third quantization section 308 of LSP vector quantization section 300. Further, first additive factor selecting section 355 has the same first additive factor codebook as in first additive factor selecting section 302 of LSP vector quantization apparatus 300. Further, second additive factor selecting section 357 has the same second additive factor codebook as in second additive factor selecting section 306 of LSP vector quantization apparatus 300.
Code demultiplexing section 351 performs dequantization processing for the quantized vector code transmitted from LSP vector quantization apparatus 300 into the first code, second code and third code. Code demultiplexing section 351 outputs the first code to scaling factor selecting section 352, first additive factor selecting section 355 and first dequantization section 155, outputs the second code to second dequantization section 353 and second additive factor selecting section 357, and outputs the third code to third dequantization section 354.
Scaling factor selecting section 352 selects one scaling factor from the built-in scaling factor table based on the first code received as input from code demultiplexing section 351, and outputs the selected scaling factor to second dequantization section 353 and third dequantization section 354.
Second dequantization section 353 dequantizes the second code received as input from code demultiplexing section 351 using the built-in first split codebook to acquire a first split code vector. Second dequantization section 353 multiplies the acquired first split code vector by the scaling factor received as input from scaling factor selecting section 352 and outputs the first split code vector multiplied by the scaling factor to first quantization split vector generating section 356 as a first quantization additive residual vector.
Third dequantization section 354 dequantizes the third code received as input from code demultiplexing section 351 using the built-in second split codebook to acquire second split code vector. Third dequantization section 354 multiplies the acquired second split code vector by the scaling factor received as input from scaling factor selecting section 352 and outputs the second split code vector multiplied the scaling factor to second quantization split vector generating section 358 as a second quantization additive residual vector.
First additive factor selecting section 355 selects one first additive factor code vector from the built-in first additive factor codebook based on the first code received as input from code demultiplexing section 351, and outputs the selected first additive factor code vector to first quantization split vector generating section 356 as a first additive factor vector.
First quantization split vector generating section 356 outputs to vector combining section 359 the first quantization split vector acquired by adding the first quantization additive residual vector received as input from second dequantization section 353 and the first additive factor vector received as input from first additive factor selecting section 355.
Second additive factor selecting section 357 selects one second additive factor code vector from the built-in second additive factor codebook based on the second code received as input from code demultiplexing section 351, and outputs the selected second additive factor code vector to second quantization split vector generating section 358 as a second additive factor vector.
Second quantization split vector generating section 358 outputs to vector combining section 359 the second quantization split vector acquired by adding the second quantization additive residual vector received as input from third dequantization section 354 and the second additive factor vector received as input from second additive factor selecting section 357.
Vector combining section 359 combines the first quantization split vector received as input from first quantization split vector generating section 356 and the second quantization split vector received as input from second quantization split vector generating section 358, and outputs the resulting quantization residual vector to quantized LSP vector generating section 156.
To be more specific, code demultiplexing section 351, scaling factor selecting section 352, second dequantization section 353, third dequantization section 354, first additive factor selecting section 355, first quantization split vector generating section 356, second additive factor selecting section 357, second quantization split vector generating section 358 and vector combining section 359 employing the above-noted configurations will perform the following operations.
Code demultiplexing section 351 performs demultiplexing processing for the quantized vector code transmitted from LSP vector quantization apparatus 300 into the first code m_min, second code n min and third code o_min, outputs the first code m_min to scaling factor selecting section 352, first additive factor selecting section 355 and first dequantization section 155, outputs the second code n_min to second dequantization section 353 and second additive factor selecting section 357, and outputs the third code o_min to third dequantization section 354.
Scaling factor selecting section 352 selects the scaling factor AMP(m
Second dequantization section 353 selects the first split code vector CODE_F_P(n
Q
—
A_ERR—P(i)=AMP(m
Third dequantization section 354 selects the second split code vector CODE_F_F(o
Q
—
A_ERR—F(i)=AMP(m
First additive factor selecting section 355 selects the first additive factor code vector ADD_F_P(m
First quantization split vector generating section 356 adds the first quantization additive residual vector Q_A_ERR_P(i) (i=0, 1, . . . , R_P-1) received as input from second dequantization section 353 and the first additive factor vector ADD_F_P(m
Q_ERR—P(i)=Q—A_ERR—P(i)+ADD—F—P(m
Second additive factor selecting section 357 selects the second additive factor code vector ADD_F_F(n
Second quantization split vector generating section 358 adds the second quantization additive residual vector Q_A_ERR_F(i) (i=0, 1, . . . , R_F-1) received as input from third dequantization section 354 and the second additive factor vector ADD_F_F(n
Q_ERR—F(i)=Q—A_ERR—F(i)+ADD—F—F(n
Vector combining section 359 combines the first quantization split vector Q_ERR_P(i) (i=0, 1, . . . , R_P-1) received as input from first quantization split vector generating section 356 and the second quantization split vector Q_ERR_F(i) (i=0, 1, . . . , R_F-1) received as input from second quantization split vector generating section 358, according to following equation 20, and outputs the resulting quantization residual vector Q_ERR(i) (i=0, 1, . . . , R-1) to quantized LSP vector generating section 156.
Q_ERR(i)=Q_ERR—P(i) (i=0, . . . , R—P-1)
Q_ERR(i+R—P)=Q_ERR—F(i) (i=0, . . . , R—F-1) (Equation 20)
As described above, according to the present embodiment, the LSP vector quantization apparatus, which performs quantization in two stages of the first quantization and second quantization, splits into two and quantizes a vector in the second quantization, and, in this split vector quantization, adaptively adjusts the vector space of code vectors for quantization of one split vector based on the quantization result of the other split vector. Therefore, it is possible to further improve the accuracy of LSP vector quantization with a smaller amount of calculations and lower bit rate.
Further, although a case has been described with the present embodiment where LSP vector quantization apparatus splits into two and quantizes a vector in the second quantization, the present invention is not limited to this, and it is equally possible to split into three or more and quantize a vector in the second quantization. In this case, the quantization accuracy increases when correlation is higher between split vectors acquired by splitting the quantization target.
CELP encoding apparatus 400 is provided with preprocessing section 401, LSP analysis section 402, LSP vector quantization section 403, synthesis filter 404, adder 405, adaptive excitation codebook 406, quantization gain generating section 407, fixed excitation codebook 408, multipliers 409 and 410, adder 411, perceptual weighting section 412, parameter determination section 413 and multiplexing section 414, where LSP vector quantization section 403 is formed with LSP vector quantization apparatus 100 according to Embodiment 1, LSP vector quantization apparatus 200 according to Embodiment 2 or LSP vector quantization apparatus 300 according to Embodiment 3. Further, CELP encoding apparatus 400 divides input speech or audio signal in units of a plurality of samples, and performs encoding every frame comprised of the plurality of samples.
Preprocessing section 401 processes an input speech or audio signal by performing highpass filtering processing to remove the DC components, waveform shaping processing or preemphasis processing for improved performance in subsequent encoding processing, and outputs the signal Xin resulted from these processing, to LSP analysis section 402 and adder 405.
LSP analysis section 402 performs a linear predictive analysis using Xin received as input from preprocessing section 401, converts the resulting LPC into an LSP vector and outputs the LSP vector to LSP vector quantization section 403.
LSP vector quantization section 403 quantizes the LSP vector received as input from LSP analysis section 402. Further, LSP vector quantization section 403 outputs the resulting quantized LSP vector to synthesis filter 404 and outputs the quantized LSP code (L) to multiplexing section 414.
Synthesis filter 404 outputs to adder 405 a synthesis signal generated by performing synthesis processing with respect to excitation received as input from adder 411, which will be described later, using filter coefficients based on the quantized LSP vector received as input from LSP vector quantization section 403.
Adder 405 calculates an error signal by inverting the polarity of the synthesis signal received as input from synthesis filter 404 and adding the result to the signal Xin received as input from preprocessing section 401, and outputs the error signal to perceptual weighting section 412.
Adaptive excitation codebook 406 that stores excitations received as input from adder 411 in the past in a buffer, and cuts out one frame of samples in the buffer from the cut position specified by the adaptive excitation lag code (A) received as input from parameter determination section 413, and outputs the one frame of samples to multiplier 409 as an adaptive excitation vector. Here, adaptive excitation codebook 406 updates content of the buffer every time an excitation is received as input from adder 411.
Quantization gain generating section 407 determines a quantized adaptive excitation gain and quantized fixed excitation gain by the quantized excitation gain code (G) received as input from parameter determination section 413, and outputs the quantized adaptive excitation gain and quantized fixed excitation gain to multiplier 409 and multiplier 410, respectively.
Fixed excitation codebook 408 outputs a vector having the form specified by the fixed excitation vector code (F) received as input from parameter determination section 413, to multiplier 410 as a fixed excitation vector.
Multiplier 409 multiplies the adaptive excitation vector received as input from adaptive excitation codebook 406 by the quantized adaptive excitation gain received as input from quantization gain generating section 407, and outputs the result to adder 411.
Multiplier 410 multiplies the fixed excitation vector received as input from fixed excitation codebook 408 by the quantized fixed excitation gain received as input from quantization gain generating section 407, and outputs the result to adder 411.
Adder 411 adds the adaptive excitation vector after gain multiplication received as input from multiplier 409 and the fixed excitation vector after gain multiplication received as input from multiplier 410, and outputs the addition result to synthesis filter 404 and adaptive excitation codebook 406 as an excitation. Here, the excitation inputted in adaptive excitation codebook 406 is stored in the buffer of adaptive excitation codebook 406.
Perceptual weighting section 412 performs perceptual weighting processing for the error signal received as input from adder 405, and outputs the result to parameter determination section 413 as coding distortion.
Parameter determination section 413 selects the adaptive excitation lag that minimizes the coding distortion received as input from perceptual weighting section 412, from adaptive excitation codebook 406, and outputs the adaptive excitation lag code (A) indicating the selection result to adaptive excitation codebook 406 and multiplexing section 414. Here, an adaptive excitation lag is the parameter for indicating the position in which an adaptive excitation vector is cut out. Further, parameter determination section 413 selects the fixed excitation vector that minimizes the coding distortion received as input from perceptual weighting section 412, from fixed excitation codebook 408, and outputs the fixed excitation vector code (F) indicating the selection result to fixed excitation codebook 408 and multiplexing section 414. Further, parameter determination section 413 selects the quantized adaptive excitation gain and quantized fixed excitation gain that minimize the coding distortion received as input from perceptual weighting section 412, from quantized adaptive gain generating section 407, and outputs the quantized adaptive excitation gain code (G) indicating the selection result to quantized gain generating section 407 and multiplexing section 414.
Multiplexing section 414 outputs encoding information acquired by multiplexing the quantized LSP code (L) received as input from LSP vector quantization section 403, and the adaptive excitation lag code (A), fixed excitation vector code (F) and quantization excitation gain code (G) received as input from parameter determination section 413.
CELP decoding apparatus 450 is provided with demultiplexing section 451, LSP vector dequantization section 452, adaptive excitation codebook 453, quantization gain generating section 454, fixed excitation codebook 455, multipliers 456 and 457, adder 458, synthesis filter 459 and post-processing section 460. Here, LSP vector dequantization section 452 is formed with LSP vector dequantization apparatus 150 according to Embodiment 1, LSP vector dequantization apparatus 250 according to Embodiment 2 or LSP vector dequantization apparatus 350 according to Embodiment 3.
Demultiplexing section 451 performs demultiplexing processing for the encoding information transmitted from CELP encoding apparatus 400 and acquires quantized LSP code (L), adaptive excitation lag code (A), quantized excitation gain code (G) and fixed excitation vector code (F). Demultiplexing section 451 outputs the quantized LSP code (L) to LSP vector dequantization section 452, the adaptive excitation lag code (A) to adaptive excitation codebook 453, the quantized excitation gain code (G) to quantization gain generating section 454, and the fixed excitation vector code (F) to fixed excitation codebook 455.
LSP vector dequantization section 452 decodes a quantized LSP vector from the quantized LSP code (L) received as input from demultiplexing section 451, and outputs the quantized LSP vector to synthesis filter 459.
Adaptive excitation codebook 453 cuts out one frame of samples in the buffer from the cut position specified by the adaptive excitation lag code (A) received as input from demultiplexing section 451, and outputs the cut vector to multiplier 456 as an adaptive excitation vector. Here, adaptive excitation codebook 453 updates the buffer every time an excitation is received as input from adder 458.
Quantization gain generating section 454 decodes the quantized adaptive excitation gain and quantized fixed excitation gain specified by the quantized excitation gain code (G) received as input from demultiplexing section 451, and outputs the quantized adaptive excitation gain to multiplier 456 and the quantized fixed excitation gain to multiplier 457.
Fixed excitation codebook 455 generates the fixed excitation vector specified by the fixed excitation vector code (F) received as input from demultiplexing section 451, and outputs the fixed excitation vector to multiplier 457.
Multiplier 456 multiplies the adaptive excitation vector received as input from adaptive excitation codebook 453 by the quantized adaptive excitation gain received as input from quantization gain generating section 454, and outputs the result to adder 458.
Multiplier 457 multiplies the fixed excitation vector received as input from fixed excitation codebook 455 by the quantized fixed excitation gain received as input from quantization gain generating section 454, and outputs the result to adder 458.
Adder 458 generates an excitation by adding the adaptive excitation vector after gain multiplication received as input from multiplier 456 and the fixed excitation vector after gain multiplication received as input from multiplier 457, and outputs this generated excitation to synthesis filter 459 and adaptive excitation codebook 453. Here, the excitation inputted in adaptive excitation codebook 453 is stored in the buffer of adaptive excitation codebook 453.
Synthesis filter 459 performs synthesis processing using the excitation received as input from adder 458 and the filter coefficients decoded in LSP vector dequantization section 452, and outputs the generated synthesis signal to post-processing section 460.
Post-processing section 460 processes the synthesis signal received as input from synthesis filter 459 by performing processing to improve the subjective quality of speech, such as format enhancement and pitch enhancement, and processing to improve the subjective quality of stationary noise, and outputs the resulting speech signal.
As described above, according to the present invention, by adaptively adjusting the vector space of second code vectors for second quantization using an additive factor and scaling factor associated with the quantization result in first quantization, and by applying an LSP vector quantization apparatus that performs multi-stage quantization processing to an CELP encoding apparatus, it is possible to improve the accuracy of speech encoding with a smaller amount of calculations and lower bit rate.
As described above, embodiments of the present invention have been explained above.
Further, LSP can be referred to as LSF (Line Spectral Frequency), and it is possible to read LSP as LSF. Further, when ISP (Immittance. Spectrum Pairs) is quantized as a spectrum parameter instead of LSP, it is possible to read LSP as ISP and utilize an ISP quantization/dequantization apparatus in the present embodiments.
The vector quantization apparatus, vector dequantization apparatus and vector quantization and dequantization methods of the present invention are not limited to the embodiments described above, and can be implemented with various changes.
For example, although a case has been described with the above-described embodiments where a speech signal is targeted in the vector quantization apparatus, vector dequantization apparatus and vector quantization and dequantization methods, it is equally possible to apply the present invention to an audio signal and such.
The vector quantization apparatus and vector dequantization apparatus according to the present invention can be mounted on a communication terminal apparatus in a mobile communication system that transmits speech, audio and such, so that it is possible to provide a communication terminal apparatus having the same operational effect as above.
Although a case has been described with the above embodiments as an example where the present invention is implemented with hardware, the present invention can be implemented with software. For example, by describing the vector quantization method and vector dequantization method according, to the present invention in a programming language, storing this program in a memory and making the information processing section execute this program, it is possible to implement the same function as the vector quantization apparatus and vector dequantization apparatus according to the present invention.
Furthermore, each function block employed in the description of each of the aforementioned embodiments may typically be implemented as an LSI constituted by an integrated circuit. These may be individual chips or partially or totally contained on a single chip.
“LSI” is adopted here but this may also be referred to as “IC,” “system LSI,” “super LSI,” or “ultra LSI” depending on differing extents of integration.
Further, the method of circuit integration is not limited to LSI's, and implementation using dedicated circuitry or general purpose processors is also possible. After LSI manufacture, utilization of an FPGA (Field Programmable Gate Array), or a reconfigurable processor where connections and settings of circuit cells in an LSI can be reconfigured is also possible.
Further, if integrated circuit technology comes out to replace LSI's as a result of the advancement of semiconductor technology or a derivative other technology, it is naturally also possible to carry out function block integration using this technology. Application of biotechnology is also possible.
The disclosure of Japanese Patent Application No. 2006-283097, filed on Oct. 17, 2006, including the specification, drawings and abstract, is included herein by reference in its entirety.
The vector quantization apparatus, vector dequantization apparatus and vector quantization and dequantization methods according to the present invention are applicable for, for example, speech coding and speech decoding.
Number | Date | Country | Kind |
---|---|---|---|
2006-283097 | Oct 2006 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2007/070178 | 10/16/2007 | WO | 00 | 4/16/2009 |