The present disclosure relates to an encoding apparatus, a decoding apparatus, an encoding method, and a decoding method.
Multi-rate lattice vector quantization is one of quantization methods in encoding audio or voice (for example, encoding an excitation signal) (see, for example, NPL 1). Multi-rate lattice vector quantization may be applied to, for example, split vector quantization (which is called, for example, split multi-rate lattice vector quantization or split multi-rate lattice vector quantization). Also, split multi-rate lattice vector quantization may be applied to, for example, algebraic vector quantization (AVQ).
PTL 1
NPL 1
There is room for study in a method of reducing the number of bits for encoding in multi-rate lattice vector quantization.
Non-limiting embodiments of the present disclosure facilitate providing an encoding apparatus, a decoding apparatus, an encoding method, and a decoding method that reduce the number of bits for encoding in vector quantization.
An encoding apparatus according to one embodiment of the present disclosure includes: quantization circuitry, which, in operation, generates quantization parameters including information on a codebook for vector quantization; and control circuitry, which, in operation, determines which one of first encoding of the information for a targeted sub-vector and second encoding of (based on) a difference between the number of bits allocated to the vector quantization and the number of bits of the quantization parameters is to be to executed, in accordance with the number of bits available for encoding a sub-vector in the vector quantization.
It should be noted that general or specific embodiments may be implemented as a system, an apparatus, a method, an integrated circuit, a computer program, a storage medium, or any selective combination thereof.
According to an embodiment of the present disclosure, it is possible to reduce the number of bits for encoding in multi-rate lattice vector quantization.
Additional benefits and advantages of the disclosed embodiments will become apparent from the specification and drawings. The benefits and/or advantages may be individually obtained by the various embodiments and features of the specification and drawings, which need not all be provided in order to obtain one or more of such benefits and/or advantages.
Hereinafter, embodiments of the present disclosure will be described in detail with reference to the accompanying drawings.
For example, in split multi-rate lattice vector quantization, a signal in a time domain or a frequency domain (or a spectral domain) may be split into a plurality of sub-vectors (SVs; also referred to as sub-bands or sub-blocks), and each of the plurality of split sub-vectors may be subjected to multi-rate lattice vector quantization.
For example, as shown in
For example, in each of the codebooks Q0, Q2, Q3, Q4, Q5, . . . , Qn shown in
The codebook Q0 may include one vector (for example, a zero vector or a null vector). A zero vector, for example, means that a quantization value of a vector is zero. Thus, in the codebook Q0, a codevector index does not need to be defined, and the number of bits used for a codevector index may be zero. In the codebook Q0, for example, one bit may be used for a codebook indicator.
For example, an encoding apparatus (encoder) may collectively encode a plurality of sub-vectors (for example, eight SVs in NPL 1) by using the codebooks shown in
For example, PTL 1 suggests a method of reducing bits in split multi-rate lattice vector quantization for eight SVs in an example. For example, of eight SVs, in accordance with the number of bits used for seven SVs, a codebook indicator (codebook index) used for the remaining one SV is estimated in accordance with the following expression (1) (see, for example, PTL 1).
In the expression (1), ch′fix denotes an estimated value of the number of bits used by a codebook indicator for one SV (for example, sub-vector number i=Pfix), Bitsavailable denotes the total number of bits available for encoding eight SVs, and ΣBitscbvi denotes the sum of the numbers of bits used in encoding for the other seven sub-vectors vi (i≠Pfix) different from the sub-vector number i=Pfix (for example, the total number of bits used in
In PTL 1, the encoding apparatus, for example, quantizes (or encodes) a difference between the estimated value cb′fix of the number of bits used by the codebook indicator shown in the expression (1) and the number of bits of an actual codebook indicator for one SV (for example, i=Pfix), and transfers the differential information to the decoding apparatus. For example, as the codebook number n used for one SV increases, information content (for example, the number of bits) of the above-described differential information reduces with respect to the codebook indicator, and the number of bits for encoding is reduced.
In PTL 1, for example, there is a case where differential information (in other words, encoding target) is a negative number (for example, −1), and a quantization level or code for a negative number is used, so encoding (or quantization) can be more complicated.
Also, when one identified SV is encoded by using the codebook Q0 (for example, codebook indicator “0”) or the codebook Q2 (for example, codebook indicator “1”) under special conditions, there is a possibility that it is not possible to reduce the number of bits for encoding.
Here, a special case may be, for example, a case where, of the total number of bits available for encoding, there is no bit not used for encoding and all the bits are used for encoding. In this case, for example, in
When, for example, reduction of bits of an SV with a larger number of bits used for encoding of a plurality of SVs is focused, there is a possibility that it is not possible to reduce the number of bits for encoding when there occurs an SV of which the number of bits used for encoding is zero (for example, an SV that is not encoded due to an insufficient number of bits available). An SV of which the number of bits used for encoding is zero tends to be, for example, a high-range SV of the plurality of SVs (for example, the sixth, seventh, or eighth SV of the eight SVs).
In an embodiment of the present disclosure, a method of reducing the number of bits for encoding, used to encode (in other words, variable-length code) a codebook indicator in multi-rate lattice vector quantization (lattice VQ (LVQ)) applied to split vector quantization (for example, split VQ (SVQ)) will be described.
Hereinafter, in an example, enhanced voice services (EVS) codec in which multi-rate lattice vector quantization is used as AVQ will be described. Here, an example in which AVQ is used for vector quantization of discrete cosine transform (DCT) coefficients will be described. The configuration is not limited to quantization and encoding of DCT coefficients (in other words, frequency domain). For example, AVQ (or multi-rate lattice vector quantization) is also applicable to vector quantization in a time domain.
Hereinafter, in an example, a case where the number of sub-vectors split in AVQ is set to eight (for example, SV1 to SV8) will be described. The number of sub-vectors split is not limited to eight and may be another number.
Multiplier 101 may, for example, multiply an adaptive codebook vector v(n) input from an adaptive codebook, by an adaptive codebook gain (or pitch gain) gp, and output the multiplied result to subtracter 102.
Subtracter 102 may, for example, subtract an adaptive codebook vector multiplied by the adaptive codebook gain and input from multiplier 101, from a linear prediction residual signal r(n) that is an encoding target in ACELP encoding, and determine (for example, calculate) an excitation residual signal qin(n). The excitation residual signal qin(n) may be, for example, calculated in accordance with the following expression (2). Subtracter 102 may output the excitation residual signal qin(n) to de-emphasizer 103.
q
in(n)=r(n)−gp·v(n) (2)
De-emphasizer 103 is, for example, a de-emphasis filter Fp(z), and may execute de-emphasis processing on the excitation residual signal qin(n) input from subtracter 102. De-emphasizer 103 may output an excitation residual signal qin,d(n) subjected to de-emphasis processing, to DCT section 104.
DCT section 104 may, for example, convert the excitation residual signal qin,d(n) input from de-emphasizer 103, to DCT coefficients, and output the DCT coefficients to AVQ encoder 105. A method of converting a signal in a time domain to a signal in a frequency domain is not limited to DCT processing and may be another method, such as discrete cosine transform (DFT) and modified discrete cosine transform (MDCT).
AVQ encoder 105 may perform split lattice vector quantization (or AVQ encoding) on the DCT coefficients of the excitation residual signal qin,d(n) input from DCT section 104.
For example, AVQ encoder 105 may split the DCT coefficients into a plurality of sub-vectors (SVs) and quantize the plurality of sub-vectors respectively to generate quantization parameters including codebook numbers (codebook indicators or codebook indices) and codevector indices that indicate any one of a plurality of codevectors included in the codebooks.
AVQ encoder 105 may, for example, in each sub-frame of encoding, determine the number of bits or bit budget allocated to AVQ (AVQ bit budget) in accordance with the sum of the number of bits fixed (or a predetermined number of bits or a fixed bit-budget) and the number of floating bits input from floating bits manager 106 (for example, the number of bits available additionally) (floating bit-budget), and output the number of bits or bit budget allocated to AVQ to code converter 109. AVQ encoder 105 may, for example, output information on the number of floating bits (floating bit-budget) updated in accordance with an excess number of bits after AVQ to floating bits manager 106.
AVQ encoder 105 may, for example, output a global gain code of the quantization parameters obtained through quantization to multiplexer 110. AVQ encoder 105 may, for example, output codebook indicators (codebook numbers) of the sub-vectors, codevector indices of the sub-vectors, and the number of bits allocated to AVQ (AVQ bit budget) to code converter 109. AVQ encoder 105 may, for example, output DCT coefficients of the quantized excitation residual signal qin,d(n) to inverse DCT section 107.
Floating bits manager 106 may hold (or manage) information on the number of bits available in an encoding frame in accordance with information on the number of floating bits input from AVQ encoder 105. For example, floating bits manager 106 may output the number of bits to be held as the number of floating bits to AVQ encoder 105 in AVQ encoding in a subsequent sub-frame.
Inverse DCT section 107 may output a quantized excitation residual signal qd(n) by performing inverse DCT conversion of the DCT coefficients of qin,d(n) input from AVQ encoder 105.
Sub-vector identifier 108 may, for example, identify a dominant sub-vector from among a plurality of sub-vectors in accordance with an input adaptive codebook vector v(n). Sub-vector identifier 108 may output information on the position of the dominant sub-vector (for example, dominant sub-vector information) to code converter 109. For example, here, since a target for quantization or encoding in AVQ encoder 105 is DCT coefficients, sub-vector identifier 108 may convert the adaptive codebook vector v(n) to DCT coefficients and identify the position (for frequency) of a sub-vector having the highest energy in a DCT coefficient domain (or a frequency domain) of the adaptive codebook vector v(n). When a target for quantization or encoding in AVQ encoder 105 is a signal in a time domain, sub-vector identifier 108 does not need to convert the adaptive codebook vector v(n) to DCT coefficients.
Sub-vector identifier 108 may be, for example, a memory that outputs information on the position of a predetermined specific sub-vector to code converter 109 regardless of the adaptive codebook vector v(n). In this case, since the position of the specific sub-vector is fixed, the position of the specific sub-vector may be written in a program when, for example, an embodiment of the present disclosure is implemented by the program of software. For example, sub-vector identifier 108 may set the third sub-vector or the last sub-vector of the plurality of (for example, eight) sub-vectors for the specific sub-vector. The specific sub-vector is not limited to the third sub-vector or the last sub-vector and may be a sub-vector in another position in order. For example, the position of the specific sub-vector may be set in a position in which there is a higher probability (frequency) that the codebook number further increases (for example, the highest position) and that is investigated experimentally or statistically.
Code converter 109 (codebook indications conversion) may, for example, convert encoding information of the codebook number of a specific sub-vector (for example, code conversion-targeted sub-vector) in accordance with codebook numbers and codevector indices of the plurality of sub-vectors, input from AVQ encoder 105, the number of bits allocated to AVQ of one sub-frame (AVQ bit-budget), and the dominant sub-vector information input from sub-vector identifier 108.
When, for example, the number of sub-vectors is eight, code converter 109 may output encoding information including codebook indicators (codebook indices) and codevector indices of the eight sub-vectors or output encoding information including codebook indicators of the seven sub-vectors, an indicator on the number of unused bits (referred to as, for example, an unused-bit indicator), and codevector indices of the eighth sub-vectors to multiplexer 110.
Multiplexer 110 may multiplex the global gain input from AVQ encoder 105 and the encoding information input from code converter 109 and output multiplexed bit stream information (for example, AVQ code).
Next, an example of the operations of encoding apparatus 100 will be described.
Code converter 109 may, for example, select a code conversion-targeted sub-vector (referred to as, for example, a targeted sub-vector) in accordance with the dominant sub-vector information (for example, information indicating the sub-vector identified as a dominant sub-vector) input from sub-vector identifier 108 and the number of bits allocated to AVQ in one sub-frame (the number of bits allocated for vector quantization) input from AVQ encoder 105.
In
Code converter 109, for example, determines whether the AVQ bit-budget exceeds a threshold Threshold (S102). For example, 85 [bit/subframe] may be set for the threshold or another value may be set for the threshold. The threshold may be, for example, set experimentally or statistically.
When the AVQ bit-budget exceeds the threshold (Yes in S102), code converter 109 may select a sub-vector identified by the dominant sub-vector information as a code conversion-targeted sub-vector from among the plurality of sub-vectors (S103).
On the other hand, when the AVQ bit-budget is less than or equal to the threshold (No in S102), code converter 109 may, for example, set the last sub-vector (for example, the eighth sub-vector SV8) for the code conversion-targeted sub-vector from among the plurality of sub-vectors (S104).
Code converter 109 may, for example, apply code conversion (described later) to the selected code conversion-targeted sub-vector.
An example of selection of a code conversion-targeted sub-vector has been described above.
Selection of a code conversion-targeted sub-vector may be performed not by code converter 109 but by sub-vector identifier 108. In this case, information on the AVQ bit-budget may be input to sub-vector identifier 108. For example, sub-vector identifier 108 may output dominant sub-vector information on the sub-vector selected for the code conversion-targeted sub-vector to code converter 109.
Next, an example of code conversion by code converter 109 will be described.
For example, code converter 109 may perform the processes of the following step 1 to step 3 in accordance with codebook indicators of the plurality of sub-vectors, input from AVQ encoder 105, and the selected code conversion-targeted sub-vector.
Code converter 109, for example, sets the codebook indicators of the other sub-vectors at positions (for example, (N−1) sub-vectors) different from the code conversion-targeted sub-vector, of the plurality of (for example, N) codebook indicators for codes (or encoding codes). Code converter 109 may, for example, calculate the total sum of the number of bits used by the codebook indicators and the number of bits used by the codevector indices in the (N−1) sub-vectors.
Code converter 109 may, for example, calculate the number of bits available for the codebook indicator of the code conversion-targeted sub-vector. For example, code converter 109 may calculate the number of bits available for encoding the codebook indicator of the code conversion-targeted sub-vector by subtracting the total sum of the number of bits used for encoding the (N−1) sub-vectors, calculated in (Step 1), from the total number of bits available for AVQ encoding (AVQ bit-budget).
Code converter 109 may, for example, encode the number of bits not used for encoding (referred to as, for example, the number of unused bits) by calculating the number of unused bits, of the number of bits available for encoding the code conversion-targeted sub-vector, calculated in (Step 2). For example, code converter 109 may calculate the number of unused bits by subtracting the sum of the number of bits used by the codebook indicator and the number of bits used by the codevector index of the code conversion-targeted sub-vector from the number of bits available, calculated in (Step 2).
Code converter 109 may, for example, output the codebook indicator (encoding code) obtained through (Step 1) to (Step 3) and the information obtained by encoding the number of unused bits (for example, referred to as an unused-bit indicator or an unused-bit encoding code) to multiplexer 110.
Next, an example of the operations of code converter 109 will be described.
In
When, for example, the code conversion-targeted sub-vector SVd=SV3, Group1 may include SV1, SV2, SV4, and SV5, and Group2 may include SV6 to SV8, and SV3. When, for example, the code conversion-targeted sub-vector SVd=SV8, Group1 may include SV1 to SV5, and Group2 may include SV6 to SV83. SVd is not limited to SV3 or SV8.
Code converter 109 may, for example, encode sequentially the codebook indicators (codebook indices) and codevector indices of the sub-vectors classified as Group1 (S202). Code converter 109 may, for example, output the encoding information (the codebook indicators and the codevector indices) of the sub-vectors included in Group1 to multiplexer 110. Code converter 109 may, for example, calculate the number of bits used for encoding Group1 (for example, indicated by BITSgroup1) (S202). Code converter 109 may, for example, determine the number of bits available for encoding the sub-vectors classified as Group2 (for example, BITSgroup2) in accordance with the following expression (3) (S202).
(BITSgroup2)=(AVQbit-budget)−(BITSgroup1) (3)
Code converter 109 may, for example, determine whether BITSgroup2 exceeds a threshold Threshold1 (S203).
When BITSgroup2 exceeds the threshold Threshold1 (Yes in S203), code converter 109 may proceed to the process shown in
On the other hand, when BITSgroup2 is less than or equal to the threshold Threshold1 (No in S203), code converter 109 may determine the order in which the sub-vectors in Group2 are encoded as follows, and encode the sub-vectors in the determined order (S204).
Code converter 109 outputs encoding information including the codebook indicators and the codevector indices of the sub-vectors of Group2 to multiplexer 110 and ends the code conversion.
In
When SVd is SV6, each of Group2 and BITSgroup2 does not need to be changed (or updated).
Subsequently, code converter 109 may, for example, determine whether BITSgroup2 exceeds a threshold Threshold2 (S206).
When BITSgroup2 exceeds the threshold Threshold2 (Yes in S206), code converter 109 may proceed to the process shown in
On the other hand, when BITSgroup2 is less than or equal to the threshold Threshold2 (No in S206), code converter 109 may determine the order in which the sub-vectors in Group2 are encoded as follows, and encode the sub-vectors in the determined order (S207).
Code converter 109 outputs encoding information including the codebook indicators and the codevector indices of the sub-vectors of Group2 to multiplexer 110 and ends the code conversion.
In
When SVd is SV7, each of Group2 and BITSgroup2 does not need to be changed (or updated).
Subsequently, code converter 109 may, for example, determine whether BITSgroup2 exceeds a threshold Threshold3 (S209).
When BITSgroup2 is less than or equal to the threshold Threshold3 (No in S209), code converter 109 may determine the order in which the sub-vectors in Group2 are encoded as follows, and encode the sub-vectors in the determined order (S210).
Code converter 109 outputs encoding information including the codebook indicators and the codevector indices of the sub-vectors of Group2 to multiplexer 110 and ends the code conversion.
On the other hand, when BITSgroup2 is less than or equal to the threshold Threshold3 (Yes in S209), code converter 109 may determine the order in which the sub-vectors in Group2 are encoded as follows, and encode the sub-vectors in the determined order (S211).
Code converter 109 outputs encoding information including the codebook indicator and the codevector index of SV8 and the indicator of the number of unused bits to multiplexer 110 and ends the code conversion. In other words, code converter 109 may output encoding information of the number of unused bits to multiplexer 110 instead of encoding information of the codebook indicator of SVd.
When SVd is SV8, code converter 109 may, for example, output any one of the codebook indicator and the indicator of the number of unused bits and the codevector index, of SV8 to multiplexer 110. Any one of the codebook indicator and the number of unused bits may be determined in advance.
In this way, code converter 109 may determine which one of encoding of the codebook number of SVd and encoding of the number of unused bits is to be executed, in accordance with the number of bits available for encoding Group2. For example, code converter 109 may determine to encode the codebook number (in other words, output the codebook indicator) when the number of bits available for encoding Group2 is less than or equal to the threshold (for example, Threshold1, Threshold2, or Threshold3) and determine to encode the number of unused bits (in other words, output the unused-bit indicator) when the number of bits available for encoding Group2 exceeds the threshold.
Here, in
For example, an average number of bits allocated per sub-vector in accordance with the number of bits fixedly allocated to AVQ is indicated by BITSsv.
In the above-described example, for example, Threshold 1 may be set to 4×BITSsv when SVd is any one of SV1 to SV5 and may be set to 3×BITSsv when SVd is any one of SV6 to SV8.
For example, Threshold2 may be set to 3×BITSsv when SVd is any one of SV1 to SV6 and may be set to 2×BITSsv when SVd is any one of SV7 to SV8.
For example, Threshold3 may be set to 2×BITSsv when SVd is any one of SV1 to SV7 and may be set to BITSsv when SVd is SV8.
In this way, the number of bits obtained by multiplying the number of SVs classified as Group2, by BITSsv may be set for a threshold.
Next, an example of encoding of the number of unused bits by code converter 109 (for example the process of S211 in
In
When SV8 is not zero (No in S301), code converter 109 may set the number of bits obtained by subtracting the number of bits used to encode the codebook number of SVd from the number of bits available for encoding the codebook number of SVd for the number of unused bits (S302).
Even when SV8 is not zero, there is a case where the last digit (for example, stop bit 0) of the codebook indicator of the encoded result of SV8 in AVQ encoding is abandoned. In this case, the number of bits used to encode SVd reduces by one bit. Insufficiency of one bit may be covered by abandoning stop bit 0 in encoding SVd or insufficiency of one bit may be covered by the number of bits saved by applying an embodiment of the present disclosure.
When, for example, the encoded result of SV8 is zero and stop bit 0 is abandoned, there is a possibility that SV7 or SV6 is also abandoned (for example, SV7 or SV6 may be encoded by zero bits). In this way, when two or more SVs are abandoned, one bit (for example, codebook indicator 0) is allocated to each of the two or more SVs, so the number of bits available for encoding SVd can reduce by two or more bits. Such a case can be, for example, detected in accordance with whether the number of bits available for encoding SVd is an odd number of bits as the number of bits used to encode SV (for example, whether the number of bits available is different from multiples of five).
As described above, in encoding the number of unused bits, the order of encoding SVd is set (in other words, replaced) to the last position in the order of sub-vectors included in Group2. Here, in
When SV8 (or an SV before SV8) is encoded with zero bits, all the bits allocated to AVQ encoding are used, so the number of unused bits is zero.
For this reason, when there is a difference between the number of bits available for encoding SVd (for example, a remaining number of bits) and the number of bits used to encode SVd, this difference can correspond to the number of SVs encoded with zero bits. For example, as shown in
For example, code converter 109 may update the number of bits available for encoding SVd to a multiple of five by adding a waste number of bits to the number of bits available for encoding SVd (for example, a remaining number of bits) (S303). Code converter 109 may, for example, determine that the number of unused bits is zero.
In this way, code converter 109 may, for example, calculate the number of unused bits in accordance with the number of bits available for encoding an original SVd (in other words, when the order of encoding SVd is not changed) when there occur wasted bits by calculating the number of bits of the wasted bits and adding the wasted bits to the number of bits available for encoding SVd. In other words, code converter 109 may update the number of bits available for encoding SVd in accordance with the number of consecutive sub-vectors of which the quantization parameters are a null vector, of the sub-vectors different from SVd in Group2.
Subsequently, code converter 109 may encode the number of unused bits (for example, convert the number of unused bits to an indicator) obtained through the process of S302 or the process of S303 (S304).
In an example, when the number of bits available is 13 bits and the code (indicator) of unused bits is “10”, the number of unused bits can be identified as three bits in order for the number of bits except unused bits to be a multiple of five if there is no wasted bit.
It is possible to determine whether there is a wasted bit by using, for example, whether SV8 to be decoded is a zero vector, a remainder obtained by dividing the number of bits available by five (for example, a remainder divided by five or modulo five) (hereinafter, also referred to as, for example, a remaining number of bits) and the number of consecutive decoded SVs that become zero vectors, including SV8. For example, when SV8 is not a zero vector, no wasted bit occurs. For example, a case where there occur wasted bits is a case where the number of unused bits is zero. The number of wasted bits can be identified by, for example, the number of consecutive decoded SVs that become zero vectors, including SV8, and a remainder of the number of bits available, divided by five (for example, a remaining number of bits).
The number of unused bits in a case where there occur wasted bits may be, for example, determined as follows.
In an example, when the number of bits available for encoding SVd is 12 bits and decoded values of three consecutive SVs, including SV8, are zero vectors (null vectors), three bits (for example, 5−(12 modulo 5)=3) used to encode three zero vectors (for example, SV6 to SV8) at the time of changing the order of encoding SVd are wasted bits. Thus, 15 bits obtained by adding three bits to 12 bits are the number of bits available for encoding SVd. Thus, the number of bits available for encoding SVd is equal to the number of bits used to encode SVd (for example, a multiple of five), and the number of unused bits is zero.
In another example, a case where the number of bits available for encoding SVd is 13 bits, decoded values of three consecutive SVs (for example, SV6 to SV8), including SV8, are zero vectors (null vectors), SV6 is encoded with one bit, and SV7 and SV8 are encoded with zero bits (for example, unencoded zero vectors) will be described. In this case, of three bits used to encode three zero vectors (encode SV6 to SV8) at the time of changing the order of encoding SVd, at least two bits (for example, 5−(13 modulo 5)=2) are wasted bits. Thus, 15 bits obtained by adding two bits to 13 bits are the number of bits available for encoding SVd. Thus, the number of bits available for encoding SVd is equal to the number of bits used to encode SVd (for example, a multiple of five), and the number of unused bits is zero.
When, for example, the number of unused bits is zero, the number of bits used for encoding is one as shown in
In this way, even when there occur wasted bits, encoding apparatus 100 can encode the number of unused bits by suppressing an increase in the number of bits used for encoding.
Since the number of unused bits in a case where it is determined that there occur wasted bits is zero, code converter 109 may, for example, change (or update) the number of bits available for encoding SVd and finally determine the number of unused bits to zero without calculating a difference from the number of bits used to encode SVd. When, for example, the codebook number of SVd is decoded by dividing the number of bits available for encoding SVd by five, the number of bits available for encoding SVd may be changed as described above. Alternatively, the decoding procedure in which the codebook number is decoded by adding one to the codebook number obtained by dividing the number of bits available for encoding SVd, by five may be changed.
An example in which the code of the number of unused bits shown in
In decoding apparatus 200, a bit stream sent from encoding apparatus 100 is input to separator 201.
Separator 201 may, for example, separate a global gain code, codevector indices, codebook indicators (codebook indices), and an unused-bit indicator (an unused-bit code or an unused-bit index) from an AVQ code included in the input bit stream. Separator 201 may, for example, output the global gain code to AVQ decoder 204, and output the codebook indicators, the codevector indices, and the unused-bit indicator to code converter 203.
Sub-vector identifier 202 may, for example, identify a dominant sub-vector from among a plurality of sub-vectors in accordance with an input adaptive codebook vector v(n). Sub-vector identifier 202 may, for example, output information on the position of the dominant sub-vector (for example, dominant sub-vector information) to code converter 203. For example, here, since a target for quantization or encoding in encoding apparatus 100 (for example, AVQ encoder 105) are DCT coefficients, sub-vector identifier 202 may convert the adaptive codebook vector v(n) to DCT coefficients and identify the position (for frequency) of a sub-vector having the highest energy in a DCT coefficient domain of the adaptive codebook vector v(n). When a target for quantization or encoding in encoding apparatus 100 is a signal in a time domain, sub-vector identifier 202 does not need to convert the adaptive codebook vector v(n) to DCT coefficients.
Code converter 203 may, for example, calculate the codebook indicator of the sub-vector at a specific position (for example, code conversion-targeted sub-vector) in accordance with information input from separator 201 and information input from sub-vector identifier 202. For example, code converter 203 may calculate the codebook indicator of the code conversion-targeted sub-vector in accordance with the codebook indicators, the codevector indices, and the unused-bit indicator, input from separator 201, and the number of bits available for AVQ (for example, AVQ bit-budget) input from AVQ decoder 204. Code converter 203 may output the codebook indicator and the codevector index of each sub-vector to AVQ decoder 204.
For example, code converter 203 may execute the processes of the following step 4 to step 7.
Code converter 203, for example, decodes the codebook indicators of the other sub-vectors different from the code conversion-targeted sub-vector in accordance with the codebook indicators. Code converter 203 may, for example, calculate the number of bits used to encode sub-vectors different from the code conversion-targeted sub-vector (for example, the sum of the number of bits used by the codebook indicators and the number of bits used by the codevectors) in accordance with the decoded codebook indicators.
Code converter 203 may, for example, decode the number of unused bits in accordance with the unused-bit indicator.
Code converter 203 may, for example, calculate the number of bits for encoding the code conversion-targeted sub-vector in accordance with the number of bits for encoding the sub-vectors, calculated in (Step 4), and the number of unused bits, decoded in (Step 5).
Code converter 203 may, for example, calculate (or decode) the codebook indicator of the code conversion-targeted sub-vector in accordance with the number of bits for encoding the sub-vector calculated in (Step 6).
AVQ decoder 204 may, for example, decode (or inversely quantize) quantized DCT coefficients in accordance with the global gain code input from the separator, the codebook indicators and the codevector indices of the SVs, input from code converter 203, and the number of floating bits, input from floating bits manager 205, and output the decoded (or inversely quantized) quantized DCT coefficients to inverse DCT section 206. AVQ decoder 204 may, for example, determine (for example, calculate) the number of bits allocated to AVQ in accordance with the sum of the fixed number of bits (or the predetermined number of bits) and the number of floating bits, input from floating bits manager 205, and output the determined (or calculated) number of bits to code converter 203. AVQ decoder 204 may, for example, output information on the number of floating bits updated in accordance with an excess number of bits after AVQ decoding to floating bits manager 205.
Floating bits manager 205 may hold (or manage) information on the number of bits available in a decoding frame in accordance with information on the number of floating bits input from AVQ decoder 204. For example, floating bits manager 205 may output the number of bits to be held as the number of floating bits to AVQ decoder 204 in AVQ decoding in a subsequent sub-frame.
Inverse DCT section 206 may, for example, convert decoded quantized DCT coefficients input from AVQ decoder 204 to a signal in a time domain by means of inverse DCT transform, and output the signal in a time domain as a decoding excitation residual signal qd(n).
As described above, in the present embodiment, encoding apparatus 100 determines which one of encoding of the codebook number of the code conversion-targeted sub-vector and encoding which is based on a difference between the number of bits allocated to vector quantization and the number of bits of quantization parameters (for example, encoding of the number of unused bits) is to be executed, in accordance with the number of bits available for encoding sub-vectors in vector quantization.
In this way, by switching between encoding of the codebook number and encoding of the number of unused bits in accordance with the number of bits available for encoding, it is possible to perform encoding according to the number of bits available for encoding in multi-rate lattice vector quantization, so it is possible to reduce the number of bits for encoding. Thus, according to the present embodiment, it is possible to reduce the number of bits for encoding in multi-rate lattice vector quantization.
According to the present embodiment, even when the code conversion-targeted sub-vector for an unused-bit indicator is a sub-vector different from the last sub-vector (for example, SV8) (in other words, when the order of encoding is changed), encoding apparatus 100 can accurately determine the number of unused bits according to the number of wasted bits that can occur as a result of a change of the order of encoding. When, for example, there occur wasted bits, encoding apparatus 100 can accurately estimate the number of unused bits according to the number of wasted bits. Therefore, it is possible to suppress an error due to an insufficient number of bits for encoding or reduction in codebook number (for example, reduction in the accuracy of codebook) and to suppress a decrease in encoding performance.
For example, even in a case of bit allocation with which no wasted bit occurs even when the order of encoding of SVs is changed (for example, in a case of bit allocation with which there is no SV that is encoded with zero bits), but when wasted bits can occur (for example, when an SV that is encoded with zero bits can be present), encoding apparatus 100 may rearrange the allocation of encoding bits such that wasted bits occur. In an example, a case where, in Group2 including SVd, SV6, SV7, and SV8, SV6, SV7, and SV8 are encoded with zero (one bit), SVd is encoded with 10 bits (codebook number 2), and the number of bits available in encoding SVd is 13 bits will be described. In this case, since SV6 to SV8 are zero vectors, encoding apparatus 100 may set bit allocation for SV6 to SV8 to zero bits and allocate three bits allocated to SV6 to SV8 may be allocated (in other words, may be transferred) to bits for encoding SVd. When there occur wasted bits (for example, allocation of zero bits is possible), the number of unused bits is zero, and the total of the remaining number of bits and the number of consecutive zero vectors is limited to a multiple of five, so sub-vectors to which allocation of zero bits is allowed are SV7 and SV8. Therefore, the number of bits that can be transferred to bits for encoding SVd is two. In this case, encoding apparatus 100, for example, sets the number of bits available for SVd to 15 obtained by adding two bits to 13 bits and encodes SVd with codebook number 3 (for example, 15 bits). In this case, the number of unused bits is zero. In this way, by rearranging bit allocation, it is possible to reliably determine the presence or absence of wasted bits at a decoding side and to improve the accuracy of encoding SVd.
According to the present embodiment, decoding apparatus 200, for example, can identify encoding information on the code conversion-targeted sub-vector in accordance with parameters such as the number of bits used to encode and decoded information (for example, codebook indicators of sub-vectors other than the code conversion-targeted sub-vector). Thus, for example, a signal for switching between encoding of codebook indicators and encoding of the number of unused bits (for example, a flag or switching-specific control information) does not need to be provided from encoding apparatus 100 to decoding apparatus 200.
In
Sub-vector identifier 301 does not need to execute any signal processing or may, for example, does not explicitly include any constituent element for identifying (designating) a sub-vector at a predetermined specific position. In
In encoding apparatus 300 shown in
Next, an example of the operations of encoding apparatus 300, different from Embodiment 1, will be described.
A method of selecting a code conversion-targeted sub-vector may be similar to the method shown in
In
Subsequently, encoding apparatus 300, for example, encodes quantization parameters of the sub-vectors included in Group1 (for example, SV1 and SV2) and outputs encoding information (for example, the codebook indicators and the codevector indices) (S402). Encoding apparatus 300 may, for example, determine the number of bits used (or consumed) to encode Group1 and determine the number of bits available for encoding the sub-vectors (for example, SV3 to SV8).
Subsequently, encoding apparatus 300, for example, determines whether the number of bits available for encoding the sub-vectors of Group2 is greater than or equal to the threshold Threhold1 (S403). For example, encoding apparatus 300 proceeds to a process shown in
Here, when the sub-vector selected as a code conversion target is the third sub-vector (for example, SV3), Threshold1 may be set to 30 bits. When, for example, of the sub-vectors SV1 to SV8, SV3 is set to the sub-vector to be encoded at the end (in other words, when the order of encoding is changed), one bit can be used to encode each of the sub-vectors SV4 to SV8 behind SV3 (for example, up to five bits for the five sub-vectors) can be used to encode each sub-vector when a decoded result of each of the sub-vectors SV4 to SV8 is zero (for example, a null vector). On the other hand, in a case of AVQ encoding (in other words, when the order of encoding is not changed), there is a possibility that SV4 to SV8 behind SV3 each are encoded with zero bits. If SV4 to SV8 are encoded with zero bits through AVQ encoding, five bits in total are used to encode SV4 to SV8 in an encoding method that performs code conversion according to the present embodiment, so the effect of reducing bits greater than or equal to five bits is expected in encoding SV3. For example, referring to
When the position of the specific sub-vector is different from that of SV3, the sub-bands included in Group2 and the threshold Threshold1 may be set according to the specific sub-vector. When, for example, SV4 is set for the specific sub-vector, SV4 to SV8 may be included in Group2, and 25 bits may be set for Threshold1.
In
In this way, when, for example, the number of bits available for encoding Group2 is less than Threshold1 (No in S403), encoding apparatus 300 does not need to perform code conversion (in other words, change the order of encoding) of the sub-vector SV3 in encoding Group2. In other words, when the number of bits available for encoding Group2 is less than Threshold1, encoding apparatus 300 may set (or change or update) the code conversion-targeted sub-vector from SV3 to the last sub-vector SV8 in Group2. Through this process, for example, it is possible to suppress occurrence of an insufficient number of bits in encoding the code conversion-targeted sub-vector.
Subsequently, encoding apparatus 300 may, for example, determine whether the number of bits available for encoding SV8 is less than the threshold Threshold2 or whether the number of bits available for encoding SV8 exceeds the threshold Threshold3 (S405).
When the number of bits available for encoding SV8 is less than Threshold2 or exceeds Threshold3 (Yes in S405), encoding apparatus 300 may, for example, encode SV8 by using an encoding method for AVQ (a method of encoding a codebook number), output encoding information (for example, the codebook indicator and the codevector index), and end the code conversion (S406). In this way, when the number of bits available for encoding SV8 is less than Threshold2 or exceeds Threshold3, the number of bits for encoding resulting from encoding the number of unused bits is not reduced, so encoding apparatus 300 may encode the codebook number without encoding the number of unused bits.
On the other hand, when the number of bits available for encoding SV8 is greater than or equal to Threshold2 and does not exceed Threshold3 (No in S405), encoding apparatus 300 may encode the number of unused bits instead of encoding the codebook number, output encoding information (for example, the unused-bit indicator and the codevector index), and end the code conversion (S407).
Here, for example, Threshold2 may be set to nine bits, and Threshold3 may be set to 80 bits. The reason why Threshold2 is set to nine bits is because, when, for example, the number of bits available for encoding SV8 is less than 10 bits, the number of bits used to encode the codebook number is also one bit in AVQ encoding, and the effect of reducing the number of bits resulting from code conversion is not obtained.
For example, Threshold3 may be set experimentally or empirically. For example, as the number of bits available for encoding SV8 increases, the number of unused bits is easier to increase, so Threshold3 may be set in order to avoid an increase in the number of unused bits. A case where the number of unused bits is large can be, for example, a case where information content is small as in the case where, for example, an encoding target is silence. Therefore, even when the effect of reducing bits is not obtained, there is no problem in terms of the quality of encoding. For this reason, for example, the number of bits that is large to a certain degree may be empirically set for Threshold3.
In
Subsequently, encoding apparatus 300 may, for example, encode the sub-vectors one by one in order of SV4, SV5, SV6, SV7, and SV8 and output encoding information (for example, the codebook indicators and the codevector indices) (S409). Encoding apparatus 300 may, for example, determine the number of bits used to encode the sub-vectors and determine (in other words, update) the number of bits available for encoding the remaining sub-vectors in Group2 (S409).
Subsequently, encoding apparatus 300, for example, determines whether the number of bits available for encoding the remaining sub-vectors in Group2 is greater than or equal to Threshold1 (S410).
When the number of bits available for encoding Group2 is less than Threshold1 (No in S410), encoding apparatus 300 may, for example, proceed to the process of S404 in
On the other hand, when the number of bits available for encoding Group2 is greater than or equal to Threshold1 (Yes in S410), encoding apparatus 300, for example, determines whether the sub-vector to be encoded next is SV3 (S411). When the sub-vector to be encoded next is not SV3 (code conversion-targeted sub-vector) (No in S411), encoding apparatus 300 may, for example, proceed to the process of S409 and encode the next sub-vector. Encoding apparatus 300 may, for example, repeat the processes of S409 to S411 and sequentially encode SV4, SV5, SV6, SV7, and SV8.
When the sub-vector to be encoded next is SV3 (Yes in S411), encoding apparatus 300, for example, proceeds to the process of S412 shown in
In
When the number of bits available for encoding SV3 exceeds Threshold3 (Yes in S412), encoding apparatus 300 may, for example, encode SV3 in accordance with AVQ encoding without code conversion, output encoding information (for example, the codebook indicator and the codevector index), and end the code conversion (S413). In this way, when the number of bits available for encoding SV3 exceeds Threshold3, the number of unused bits increases, and the number of bits of the unused-bit indicator is easier to increase, so encoding apparatus 300 may encode the codebook number.
On the other hand, when the number of bits available for encoding SV3 is less than or equal to Threshold3 (No in S412), encoding apparatus 300 may, for example, encode the number of unused bits instead of encoding the codebook number of SV3, output encoding information (for example, the codebook indicator and the codevector index), and end the code conversion (S414).
Next, another example of the operations of encoding apparatus 300 will be described.
The example of operations shown in
In
Subsequently, encoding apparatus 300 may, for example, determine whether the number of bits available for encoding SV8 is less than the threshold Threshold2 or whether the number of bits available for encoding SV8 exceeds the threshold Threshold3 (S502).
When the number of bits available for encoding SV8 is less than Threshold2 or exceeds Threshold3 (Yes in S502), encoding apparatus 300 may, for example, encode SV8 by using an encoding method for AVQ (for encoding a codebook number), output encoding information (for example, the codebook indicator and the codevector index), and end the code conversion (S503). In this way, when the number of bits available for encoding SV8 is less than Threshold2 or exceeds Threshold3, the number of bits for encoding resulting from encoding the number of unused bits is not reduced, so encoding apparatus 300 may encode the codebook number without encoding the number of unused bits.
On the other hand, when the number of bits available for encoding SV8 is greater than or equal to Threshold2 and does not exceed Threshold3 (No in S502), encoding apparatus 300 may encode the number of unused bits instead of encoding the codebook number, output encoding information (for example, the unused-bit indicator and the codevector index), and end the code conversion (S504).
In
Next, an example of encoding of the number of unused bits in the process of S407 in
In
On the other hand, when the code conversion-targeted sub-vector is not SV8 (for example, in a case of SV3) (No in S601), the order of encoding the sub-vectors is changed, so there is a possibility that the number of bits used to encode the sub-vectors after the order of encoding is changed is different from the number of bits in a case where encoding (for example, AVQ encoding) is performed without changing the order of encoding. In other words, there is a possibility that bits to be used wastefully (for example, wasted bits) occur as a result of changing the order of encoding. When, for example, the number of bits available for encoding becomes zero before encoding of the sub-vectors halfway and, therefore, the subsequent sub-vectors (for example, the consecutive sub-vectors including SV8) forcibly become zero vectors without being encoded (in other words, encoded with zero bits), wasted bits can occur.
In the present embodiment, when the code conversion-targeted sub-vector is not SV8 (No in S601), encoding apparatus 300, for example, counts the number of consecutive sub-vectors (hereinafter, referred to as NCNV) of which the quantization parameter is a null vector (referred to as an all-zero vector or a zero vector) in sub-vectors subjected to AVQ encoding, checks whether SV8 (for example, the last sub-vector) is included in the sub-vectors that are null vectors (S602). In other words, NCNV may indicate the number of consecutive sub-vectors that are null vectors including SV8.
Encoding apparatus 300 may, for example, calculate the remaining number of bits (hereinafter, referred to as RB) (S602). The remaining number of bits RB may be, for example, calculated by (Number of bits available for encoding the code conversion-targeted sub-vector)%5. Here, “%” denotes modulo operation.
Subsequently, encoding apparatus 300 may, for example, determine whether there is a possibility that wasted bits occur as a result of changing the order of encoding the sub-vectors (S603). Encoding apparatus 300 may, for example, determine whether there is a possibility that wasted bits occur in accordance with NCNV and RB (an example of determination will be described later). When there is no possibility that wasted bits occur (No in S603), encoding apparatus 300, for example, proceeds to the process of S605.
On the other hand, when there is a possibility that wasted bits occur (Yes in S603), encoding apparatus 300 may, for example, update the number of bits available for encoding the code conversion-targeted sub-vector (for example, SVd) (S604). For example, encoding apparatus 300 may add (5-RB) bits to the number of bits available for encoding the code conversion-targeted sub-vector. In other words, encoding apparatus 300 may, for example, increase the number of bits available for encoding the code conversion-targeted sub-vector by the number of wasted bits (for example, the number of bits that is possibly used wastefully). Encoding apparatus 300 may, for example, update the remaining number of bits RB to zero (S604).
A value “5” used to calculate the number of bits to be added (for example, 5-RB) and to calculate a remaining number of bits RB is an example. For example, the value may be defined in accordance with the proportion (for example, ⅕) of the number of bits allocated to a codebook with respect to the entire number of bits used to encode the plurality of sub-vectors or a value that is a multiple of the number of bits used to encode sub-vectors.
Subsequently, encoding apparatus 300, for example, determines whether the remaining number of bits RB is four (S605). When RB is not four (No in S605), encoding apparatus 300 may, for example, proceed to the process of S607, and may perform determination of the number of unused bits.
When RB is four (Yes in S605), encoding apparatus 300 may, for example, increase by one bit the number of bits available for encoding the code conversion-targeted sub-vector (S606).
Subsequently, encoding apparatus 300 may, for example, determine the number of unused bits (S607). For example, encoding apparatus 300 may calculate a difference between the number of bits available for the code conversion-targeted sub-vector and the number of bits used to encode the code conversion-targeted sub-vector (for example, the number of bits unused) as the number of unused bits.
Encoding apparatus 300 may, for example, encode the calculated number of unused bits (S608).
Next, an example of a method of determining whether there is a possibility that wasted bits occur as a result of changing the order of encoding sub-vectors in the process of S603 in
For example, encoding apparatus 300 may determine that there is a possibility that wasted bits occur when at least the following Condition 1 and Condition 2 are satisfied.
For example, for Condition 1, when SV8 is not a null vector, all the plurality of sub-vectors (for example, SV1 to SV8) is encoded in AVQ encoding, so the number of bits used for encoding remains unchanged even when the order of encoding of the sub-vectors is changed or not changed. Thus, when SV8 is not a null vector, no wasted bit can occur. In other words, when SV8 is a null vector, there is a possibility that wasted bits occur.
For example, for Condition 2, wasted bits occur only when, for example, the number of unused bits (for example, the number of bits to be unused in AVQ encoding) is zero.
Here, for example, the remaining number of bits RB corresponds to an excess number of bits due to a reduction (or insufficiency) of the number of bits available for encoding SVd as a result of a wasteful use of bits caused by changing the order of encoding. For example, the number of bits NCNV corresponds to the number of bits (wasted bits) wastefully used as a result of changing the order of encoding.
Thus, RB+NCNV can be a value greater than or equal to five. While, for example, the number of bits used to encode the sub-vectors is a multiple of five (for example, 5n), the last one bit (stop bit) of the codebook indicator can be, for example, omitted, so RB+NCNV can be a value greater than or equal to four. In this way, in Condition 2, when the number of unused bits is zero, RB+NCNV can be greater than or equal to four. In other words, when RB+NCNV is greater than or equal to four, unused bits are zero, and there is a possibility that wasted bits occur; whereas, when RB+NCNV is less than four, unused bits are present, so no wasted bit can occur.
Condition 2 may be set as follows.
Here, it is also applicable that ECBI=(INT)((Number of bits available for SVd)/5). The function (INT)(X) may be a function that returns a value obtained by discarding all the digits to the right of the decimal point of X.
Next, an example of operations of an AVQ decoder according to the present embodiment will be described.
In
Sub-vector identifier 401 does not need to execute any signal processing or, for example, does not need to explicitly include any constituent element for identifying (designating) a sub-vector at a predetermined specific position. In
In decoding apparatus 400 shown in
Next, an example of the operations of decoding apparatus 400, different from Embodiment 2, will be described.
A method of selecting a code conversion-targeted sub-vector may be similar to the method shown in
In the description of
In
Subsequently, decoding apparatus 400, for example, determines whether the number of bits remaining as a bit stream for the sub-vectors in Group2 is greater than or equal to the threshold Threshold1 (S702). For example, decoding apparatus 400 proceeds to a process shown in
In
In this way, when, for example, the number of bits remaining as a bit stream for the sub-vectors in Group2 is less than Threshold1 (No in S702), decoding apparatus 400 may determine that code conversion (in other words, changing the order of encoding) of the code conversion-targeted sub-vector SV3 is not performed in encoding apparatus 100.
Subsequently, decoding apparatus 400 may, for example, determine whether the number of bits remaining as a bit stream for SV8 is less than the threshold Threshold2 or exceeds the threshold Threshold3 (S704).
When the number of bits remaining as a bit stream for SV8 is less than Threshold2 or exceeds Threshold3 (Yes in S704), decoding apparatus 400 may, for example, determine that SV8 is encoded by the AVQ encoding method (the method of encoding the codebook number), decode SV8, output decoded information (for example, the codebook number and the codevector index), and end the decoding (S705).
On the other hand, when the number of bits remaining as a bit stream for SV8 is greater than or equal to Threshold2 and does not exceed Threshold3 (No in S704), decoding apparatus 400, for example, determines that the number of unused bits is encoded instead of the codebook number of SV8, and decodes the number of unused bits and the codevector index (S706). Decoding apparatus 400 may, for example, determine the codebook number of SV8 in accordance with the number of bits remaining as a bit stream for SV8 and the number of unused bits decoded (S706). An example of a method of determining a codebook number will be described later. Decoding apparatus 400 may output the determined decoded information (for example, the codebook number and the codevector index) of SV8, and end the decoding.
In
Subsequently, decoding apparatus 400 may, for example, decode the sub-vectors one by one in order of SV4, SV5, SV6, SV7, and SV8, and output decoded information (for example, the codebook numbers and the codevector indices) (S708). Decoding apparatus 400 may, for example, determine the number of bits of a bit stream used to decode the sub-vectors, and determine the number of bits of a bit stream for the remaining sub-vectors in Group2 (S708).
Subsequently, decoding apparatus 400, for example, determines whether the number of bits of a bit stream for the remaining sub-vectors in Group2 is greater than or equal to Threshold1 (S709).
When the number of bits of a bit stream for the remaining sub-vectors in Group2 is less than Threshold1 (No in S709), decoding apparatus 400 may, for example, proceed to the process of S703 in
On the other hand, when the number of bits of a bit stream for the remaining sub-vectors in Group2 is greater than or equal to Threshold1 (Yes in S709), decoding apparatus 400, for example, determines whether the sub-vector to be decoded next is SV3 (S710). When the sub-vector to be decoded next is not SV3 (code conversion-targeted sub-vector) (No in S710), decoding apparatus 400 may, for example, proceed to the process of S708, and perform decoding of the next sub-vector. Decoding apparatus 400 may, for example, repeat the processes of S708 to S710, and sequentially decode SV4, SV5, SV6, SV7, and SV8.
When the sub-vector to be decoded next is SV3 (Yes in S710), decoding apparatus 400, for example, proceeds to the process of S711 shown in
In
When the number of bits remaining as a bit stream for SV3 exceeds Threshold3 (Yes in S711), decoding apparatus 400 may, for example, decode SV3 in accordance with the AVQ encoding method without code conversion, output decoded information (for example, the codebook number and the codevector index), and end the decoding (S712).
On the other hand, when the number of bits remaining as a bit stream for SV3 is less than or equal to Threshold3 (No in S711), decoding apparatus 400 may, for example, decode an unused-bit indicator instead of the codebook number of SV3, and decode the codevector index (S713). Decoding apparatus 400 may, for example, determine the codebook number of SV3 in accordance with the number of bits remaining as a bit stream for SV3 and decoded unused bits information (S713). Decoding apparatus 400 may, for example, output the codebook number and the codevector index of SV3, and end the decoding. An example of a method of determining a codebook number will be described later.
Next, another example of the operations of decoding apparatus 400 will be described.
For example, the process of
In
Subsequently, decoding apparatus 400 may, for example, determine whether the number of bits remaining as a bit stream for SV8 is less than the threshold Threshold2 or exceeds the threshold Threshold3 (S802).
When the number of bits remaining as a bit stream for SV8 is less than Threshold2 or exceeds Threshold3 (Yes in S802), decoding apparatus 400 may, for example, determine that SV8 is encoded by the AVQ encoding method (the method of encoding the codebook number), decode SV8, output decoded information (for example, the codebook number and the codevector index), and end the decoding (S803).
On the other hand, when the number of bits remaining as a bit stream for SV8 is greater than or equal to Threshold2 and does not exceed Threshold3 (No in S802), decoding apparatus 400, for example, determines that the number of unused bits is encoded instead of the codebook number of SV8, and decodes the number of unused bits and the codevector index (S804). Decoding apparatus 400 may, for example, determine the codebook number of SV8 in accordance with the number of bits remaining as a bit stream for SV8 and the number of unused bits decoded (S804). An example of a method of determining a codebook number will be described later. Decoding apparatus 400 may output the determined decoded information (for example, the codebook number and the codevector index) of SV8, and end the decoding.
Next, an example of decoding of SVd (for example, SV3 or SV8) in the process of S706 in
In
On the other hand, when the code conversion-targeted sub-vector is not SV8 (here, the code conversion-targeted sub-vector is, for example, SV3) (No in S901), decoding apparatus 400 may, for example, count the number (for example, NCNV) of consecutive sub-vectors (including SV8) of which the quantization parameter is a null vector (zero vector) among the decoded sub-vectors (S902).
Decoding apparatus 400 may, for example, calculate the remaining number of bits (for example, RB) (S902). The remaining number of bits RB may be, for example, calculated by (Number of bits remaining as a code bit stream for the code conversion-targeted sub-vector (for example, SV3))%5. Here, “%” denotes modulo operation.
Subsequently, decoding apparatus 400 may, for example, determine whether to update the number of bits remaining as a bit stream for SVd (for example, SV3) in accordance with NCNV and RB (S903). In other words, decoding apparatus 400 may, for example, determine whether there is a possibility that wasted bits have occurred as a result of changing the order of encoding the sub-vectors. A determining method in S903 may be similar to the determining method in encoding apparatus 300.
When there is no possibility that wasted bits have occurred (No in S903), decoding apparatus 400, for example, proceeds to the process of S905.
On the other hand, when there is a possibility that wasted bits have occurred (Yes in S903), decoding apparatus 400 may, for example, update the number of bits remaining as a bit stream for the code conversion-targeted sub-vector (for example, SVd) (S904). For example, decoding apparatus 400 may add (5-RB) bits to the number of bits remaining as a bit stream for the code conversion-targeted sub-vector. In other words, decoding apparatus 400 may, for example, increase the number of bits remaining as a bit stream for SVd by the number of wasted bits (for example, the number of bits that are possibly used wastefully). Decoding apparatus 400 may, for example, update the remaining number of bits RB to zero (S904).
Subsequently, decoding apparatus 400, for example, determines whether the remaining number of bits RB is four (S905). When RB is not four (No in S905), decoding apparatus 400 may, for example, proceed to the process of S907, and determine the code length of SVd obtained by AVQ encoding in accordance with the number of unused bits (an example will be described later).
When RB is four (Yes in S905), decoding apparatus 400 may, for example, increase by one bit the number of bits remaining as a bit stream for the code conversion-targeted sub-vector (S906).
Subsequently, decoding apparatus 400 may, for example, determine the code length of SVd obtained by AVQ encoding in accordance with unused bits information (S907). For example, decoding apparatus 400 may subtract the number of unused bits to be decoded, from the number of bits remaining as a bit stream for the code conversion-targeted sub-vector and calculate the code length of SVd (for example, the number of bits of a code (bit stream) obtained through AVQ encoding).
In an example, as shown in
(Code length of SV8)=(INT((Number of bits remaining as encoded bit stream for SV8)−((Number of bits in FIGS.10)−1)×5)/5)+1)×5
When, for example, (Number of bits remaining as encoded bit stream for SV8)=13 bits and (Code of the number of unused bits)=10, the code length of SV8 is (INT((13−5)/5)+1)×5=10 bits. When (Code of the number of unused bits)=10, the number of unused bits to be decoded is any one of one to five in accordance with
Subsequently, decoding apparatus 400 may, for example, decode the codebook number and the codevector index of SVd in accordance with the code length of SVd (S908). In an example, when the code length of SV8 is 10 bits, decoding apparatus 400 may decode the codebook number=2 in accordance with
As described above, in the present embodiment, encoding apparatus 300 determines which one of encoding of the 130 of the code conversion-targeted sub-vector and encoding which is based on a difference between the number of bits allocated to vector quantization and the number of bits of quantization parameters (for example, encoding of the number of unused bits) is to be executed, in accordance with the number of bits available for encoding sub-vectors in vector quantization.
In this way, by switching between encoding of the codebook number and encoding of the number of unused bits in accordance with the number of bits available for encoding, it is possible to perform encoding according to the number of bits available for encoding in multi-rate lattice vector quantization, so it is possible to reduce the number of bits for encoding. Thus, according to the present embodiment, it is possible to reduce the number of bits for encoding in multi-rate lattice vector quantization.
According to the present embodiment, even when the code conversion-targeted sub-vector for an unused-bit indicator is a sub-vector different from the last sub-vector (for example, SV8) (in other words, when the order of encoding is changed), encoding apparatus 300 can accurately determine the number of unused bits according to the number of wasted bits that can occur as a result of a change of the order of encoding.
According to the present embodiment, decoding apparatus 400, for example, can identify encoding information on the code conversion-targeted sub-vector in accordance with parameters such as the number of bits used to encode and decoded information (for example, codebook indicators of sub-vectors different from the code conversion-targeted sub-vector). Thus, for example, a signal for switching between encoding of codebook indicators and encoding of the number of unused bits (for example, a flag or switching-specific control information) does not need to be provided from encoding apparatus 300 to decoding apparatus 400.
The embodiments of the present disclosure have been described above.
In an embodiment of the present disclosure, a codebook list is not limited to an example shown in
For example, in
In the above-described embodiments, a case where the number of sub-vectors into which an input signal S(f) is split is eight has been described. The number of sub-vectors into which an input signal S(f) is split is not limited to eight.
In the above-described embodiments, in an example, a case where an input signal is split into a plurality of sub-vectors in a frequency domain has been described; however, the configuration is not limited thereto. An input signal may be split into a plurality of sub-vectors in a time domain. In a case of a time domain, for example, a specific sub-vector (in an example, the third sub-vector or the last sub-vector from the first) of a plurality of sub-vectors arranged in the time domain may be set for the above-described code conversion-targeted sub-vector. In this way, in an embodiment of the present disclosure, when an input signal is split into sub-vectors with a constant length in any one of a frequency domain and a time domain, a sub-vector at any position (for example, a sub-vector at a specific position (for example, a third position) or the last sub-vector) of sub-vectors arranged may be set for the code conversion-targeted sub-vector SVd.
In the above-described embodiments, vector quantization is not limited to AVQ and may be another method.
The present disclosure can be realized by software, hardware, or software in cooperation with hardware. Each functional block used in the description of each embodiment described above can be partly or entirely realized by an LSI such as an integrated circuit, and each process described in the each embodiment may be controlled partly or entirely by the same LSI or a combination of LSIs. The LSI may be individually formed as chips, or one chip may be formed so as to include a part or all of the functional blocks. The LSI may include a data input and output coupled thereto. The LSI here may be referred to as an IC, a system LSI, a super LSI, or an ultra LSI depending on a difference in the degree of integration. However, the technique of implementing an integrated circuit is not limited to the LSI and may be realized by using a dedicated circuit, a general-purpose processor, or a special-purpose processor. In addition, a FPGA (Field Programmable Gate Array) that can be programmed after the manufacture of the LSI or a reconfigurable processor in which the connections and the settings of circuit cells disposed inside the LSI can be reconfigured may be used. The present disclosure can be realized as digital processing or analogue processing. If future integrated circuit technology replaces LSIs as a result of the advancement of semiconductor technology or other derivative technology, the functional blocks could be integrated using the future integrated circuit technology. Biotechnology can also be applied.
The present disclosure can be realized by any kind of apparatus, device or system having a function of communication, which is referred to as a communication apparatus. The communication apparatus may comprise a transceiver and processing/control circuitry. The transceiver may comprise and/or function as a receiver and a transmitter. The transceiver, as the transmitter and receiver, may include an RF (radio frequency) module including amplifiers, RF modulators/demodulators and the like, and one or more antennas. Some non-limiting examples of such a communication apparatus include a phone (e.g, cellular (cell) phone, smart phone), a tablet, a personal computer (PC) (e.g, laptop, desktop, netbook), a camera (e.g, digital still/video camera), a digital player (digital audio/video player), a wearable device (e.g, wearable camera, smart watch, tracking device), a game console, a digital book reader, a telehealth/telemedicine (remote health and medicine) device, and a vehicle providing communication functionality (e.g., automotive, airplane, ship), and various combinations thereof.
The communication apparatus is not limited to be portable or movable, and may also include any kind of apparatus, device or system being non-portable or stationary, such as a smart home device (e.g, an appliance, lighting, smart meter, control panel), a vending machine, and any other “things” in a network of an “Internet of Things (IOT)”.
In addition to data communication via cellular systems, wireless LAN systems, communication satellite systems and/or the like, communication includes data communication via a combination of these systems.
Further, the communication apparatuses include devices, such as controllers and sensors, to be connected to or linked to a communication device that executes a communication function described in the present disclosure. Controllers or sensors are included, for example, each of which is configured to generate a control signal and/or a data signal used by the communication device that executes the communication functions of the communication apparatuses.
Further, the communication apparatuses include infrastructure equipment which performs communication with the above-mentioned non-limiting apparatuses of various kinds or which controls these non-limiting apparatuses of various kinds, such as base stations, access points, apparatuses of any other kinds, devices, and systems.
An encoding apparatus according to one embodiment of the present disclosure includes: quantization circuitry, which, in operation, generates quantization parameters including information on a codebook for vector quantization; and control circuitry, which, in operation, determines which one of first encoding of the information for a targeted sub-vector and second encoding of a difference between the number of bits allocated to the vector quantization and the number of bits of the quantization parameters is to be executed, in accordance with the number of bits available for encoding a sub-vector in the vector quantization.
In the encoding apparatus according to one embodiment of the present disclosure, the control circuitry classifies a plurality of the sub-vectors into a first group not including the targeted sub-vector and a second group including the targeted sub-vector, encodes the quantization parameter of each sub-vector included in the first group, and determines which one of the first encoding and the second encoding is to be executed, in accordance with the number of bits available in the second group.
In the encoding apparatus according to one embodiment of the present disclosure, the control circuitry sets the targeted sub-vector to a sub-vector having a highest frequency or a last sub-vector in a time domain in the second group when the number of bits available in the second group is less than a first threshold.
In the encoding apparatus according to one embodiment of the present disclosure, the control circuitry determines to execute the first encoding when the number of bits available for the targeted sub-vector is less than a second threshold or exceeds a third threshold, and determines to execute the second encoding when the number of bits available for the targeted sub-vector is greater than or equal to the second threshold and less than or equal to the third threshold.
In the encoding apparatus according to one embodiment of the present disclosure, the control circuitry determines to execute the first encoding when the number of bits available for the targeted sub-vector exceeds a threshold and determines to execute the second encoding when the number of bits available for the targeted sub-vector is less than the threshold.
In the encoding apparatus according to one embodiment of the present disclosure, the control circuitry sets a position of the targeted sub-vector in encoding order to a last one of the sub-vectors included in the second group in the second encoding.
In the encoding apparatus according to one embodiment of the present disclosure, the control circuitry updates the number of bits available in accordance with the number of consecutive sub-vectors of which the quantization parameter indicates a null vector, of the sub-vectors different from the targeted sub-vector in the second group.
In the encoding apparatus according to one embodiment of the present disclosure, the control circuitry adds a value obtained by subtracting a remainder of the number of bits available, divided by five, from five, to the number of bits available.
In the encoding apparatus according to one embodiment of the present disclosure, the targeted sub-vector is a third sub-vector in ascending order in a frequency domain or a third sub-vector in order starting with an earliest in a time domain, among eight sub-vectors.
In the encoding apparatus according to one embodiment of the present disclosure, the targeted sub-vector is a sub-vector having a highest frequency or a last sub-vector in a time domain, among a plurality of the sub-vectors.
In the encoding apparatus according to one embodiment of the present disclosure, the targeted sub-vector is a sub-vector of which energy of an adaptive codebook vector is a highest, among a plurality of the sub-vectors.
In the encoding apparatus according to one embodiment of the present disclosure, in information resulting from encoding candidates for the second number of bits, the number of bits for encoding information for at least one of the candidates having a higher probability of occurrence is smaller.
A decoding apparatus according to one embodiment of the present disclosure includes: control circuitry, which, in operation, determines which one of first decoding of first information on a codebook for vector quantization for a targeted sub-vector and second decoding of a difference between the number of bits allocated to the vector quantization and the number of bits of quantization parameters including the first information is to be executed, in accordance with the number of bits available for encoding a sub-vector in the vector quantization: and inverse quantization circuitry, which, in operation, performs inverse vector quantization in accordance with a result of any one of the first decoding and the second decoding.
An encoding method according to one embodiment of the present disclosure includes: by an encoding apparatus, generating quantization parameters including information on a codebook for vector quantization: and determining which one of first encoding of the information for a targeted sub-vector and second encoding of a difference between the number of bits allocated to the vector quantization and the number of bits of the quantization parameters is to be executed, in accordance with the number of bits available for encoding a sub-vector in the vector quantization.
A decoding method according to one embodiment of the present disclosure includes: by a decoding apparatus, determining which one of first decoding of first information on a codebook for vector quantization for a targeted sub-vector and second decoding of a difference between the number of bits allocated to the vector quantization and the number of bits of quantization parameters including the first information is to be executed, in accordance with the number of bits available for encoding a sub-vector in the vector quantization; and performing inverse vector quantization in accordance with a result of any one of the first decoding and the second decoding.
The disclosure of U.S. Provisional Patent Application No. 63/164,942, filed Mar. 23, 2021 and Japanese Patent Application No. 2021-118130, filed Jul. 16, 2021, each including the specification, drawings, and abstract, is incorporated herein by reference in its entirety.
An embodiment of the present disclosure is useful in encoding systems and the like.
Number | Date | Country | Kind |
---|---|---|---|
2021-118130 | Jul 2021 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2021/042063 | 11/16/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63164942 | Mar 2021 | US |