The present invention relates to a coding method, a decoding method, a coder, and a decoder.
In the vector coding technology, residual signals subsequent adaptive filtering generally undergo quantization coding by using algebraic codebooks. After the information about the position and the sign of the optimum algebraic codebook pulse on the track is searched out, the corresponding index value is calculated out through coding so that the decoder can reconstruct a pulse order according to the index value. One of the main objectives of researching and developing the algebraic codebook pulse coding method is to minimize the bits required by the coding index value on the precondition of ensuring lossless reconstruction.
The Extended Adaptive Multi-Rate Wideband (AMR_WB+) coding method is an algebraic codebook pulse coding method in the conventional art. Depending on the coding rate, one to N pulses may be encoded on each track. With the increase of coding pulses, the bits required for encoding such an amount of pulses also increase. For example, for a track with M=2m positions, encoding one pulse on the track requires m+1 bits, and encoding six pulses on the track requires 6 m−2 bits. In the process of developing the present invention, the inventor finds that in the algebraic pulse coding in the conventional art, a recursion-like coding method is applied to break down a coding pulse with many pulses into several coding pulses with fewer pulses, thus making the coding process rather complex. Meanwhile, with the increase of coding pulses on the track, the redundancy of the coding index accrues, thus tending to cause waste of coding bits.
A coding method, a decoding method, a coder, and a decoder capable of saving coding bits effectively are disclosed in an embodiment of the present invention.
A coding method is disclosed according to an embodiment of the present invention. The coding method includes: (1) obtaining a pulse distribution, on a track, of pulses to be encoded on the track; (2) determining a distribution identifier for identifying the pulse distribution according to the pulse distribution; and (3) generating a coding index including the distribution identifier.
A decoding method is disclosed according to an embodiment of the present invention. The decoding method includes: (1) receiving a coding index; (2) obtaining a distribution identifier from the coding index, where the distribution identifier is configured to identify a pulse distribution, on a track, of pulses encoded on the track; (3) determining the pulse distribution, on the track, of all the pulses encoded on the track, according to the distribution identifier; and (4) reconstructing a pulse order on the track according to the pulse distribution.
A coder is disclosed according to an embodiment of the present invention. The coder includes: (1) a pulse distribution obtaining unit, adapted to obtain a pulse distribution, on a track, of pulses to be encoded on the track; (2) a distribution identifier determining unit, adapted to determine a distribution identifier for identifying the pulse distribution according to the pulse distribution obtained by the pulse distribution obtaining unit; and (3) a coding index generating unit, adapted to generate a coding index including the distribution identifier determined by the distribution identifier determining unit.
A decoder is disclosed according to an embodiment of the present invention. The decoder includes: (1) a coding index receiving unit, adapted to receive a coding index; (2) a distribution identifier extracting unit, adapted to obtain a distribution identifier from the coding index received by the coding index receiving unit, where the distribution identifier is configured to identify a pulse distribution, on a track, of pulses encoded on the track; (3) a pulse distribution determining unit, adapted to determine the pulse distribution on the track, of all the pulses encoded on the track, according to the distribution identifier obtained by the distribution identifier extracting unit; and (4) a pulse order reconstructing unit, adapted to reconstruct a pulse order on the track, according to the pulse distribution determined by the pulse distribution determining unit.
In the embodiments of the present invention, the coding index may carry a distribution identifier for identifying the pulse distribution, and break down a coding pulse with many pulses into several coding pulses with fewer pulses. In this way, a coding index includes less information, and therefore, the coding index requires fewer bits, thus simplifying the coding process, reducing coding redundancy, and saving coding bits.
The methods and the apparatuses under the present invention are detailed below.
A coding method is disclosed in the first embodiment of the present invention. As shown in
A1: Statistics about the positions of the pulses to be encoded on a track are collected to obtain the distribution of positions of pulses on the track.
The total quantity of pulses to be encoded on the same track generally depends on the code rate. In this embodiment, pulse_num represents the total quantity of pulses to be encoded on the same track, and it is assumed that pulse_num= and a pulse distribution vector Q() indicates how each position of the pulse is distributed on the track, and Q()={q(0), q(1), . . . , q(−1)}, where q(h) is a serial number of the position for the (h+1)th pulse on the track, hε[0, −1], q(h)ε[0, M−1], and M represents the total quantity of positions on the track, for example, M=8, M=16, and so on.
Besides, a pulse to be encoded may carry a sign, namely, a positive sign or a negative sign. In this case, the pulse sign information of each pulse needs to be obtained at the time of collecting statistics about the pulses to be encoded on the track. In this embodiment, the pulse sign information of each pulse is represented by a pulse sign vector, namely, SS()={ss(0), ss(1), . . . , ss(−1)}, where ss(h) represents the pulse sign for the (h+1)th pulse, and is known as a sign index of the q(h) pulse. The pulse sign represented by ss(h) may be a positive value or a negative value. A simple coding mode is generally applied, namely, ss(h)=0 represents a positive pulse and ss(h)=1 represents a negative pulse. Nevertheless, for the pulses to be encoded, pulse signs are not a mandatory feature. As specifically required, a pulse may have only the position feature and the quantity feature. In this case, it is not necessary to collect statistics about the pulse sign information.
Evidently, a one-to-one corresponding relation may exist between Q() and SS().
After the parameters such as Q() and SS() of the pulses to be encoded are obtained through statistics, the parameters may be encoded into indices, and a corresponding relation is established between the parameter and the index so that the decoder can recover a parameter according to the corresponding index. In the present invention, a corresponding relation may be expressed in two modes. One is a calculation relation denoted by an algebraic mode, where the coder performs forward calculation for the parameter to obtain the index, and the decoder performs reverse calculation for the index to obtain the parameter; and the other is a query relation denoted by a mapping mode, where a mapping table that correlates the parameter with the index needs to be stored in both the coder and the decoder. A corresponding relation may be selected among the foregoing two corresponding relations according to the characteristics of the parameter. Generally, when the data quantity is large, the corresponding relation denoted by a calculation relation is preferred because it saves the storage space of the coder and the decoder.
A2: The distribution index (also referred to as distribution identifier) I4 is determined. The I4 may be calculated in this way: All possible distributions of the positions of all the pulses on the track are permuted in a set order, supposing that the current quantity of pulses is , and the permuting number in the permutation serves as a distribution index I4 indicative of the distribution.
The “set order” may be understood as an order of all possible Q() values determined by the coder and the decoder according to the same sequencing calculation rule.
The total quantity of possible values of the pulse distribution vector Q() is WQ()=CPPTN, where PPT=M+−1, and C refers to calculating the combination function. Each I4 corresponds to a pulse distribution in the WQ().
Generally, the WQ() is a large data quantity. Therefore, a calculation relation is preferred as a corresponding relation with the distribution index I4. Nevertheless, it is also practicable to express the corresponding relation through a query relation. Evidently, WQ() is the total quantity of all possible values of I4. If the value of I4 starts from 0, I4ε[0, WQ()−1].
A3: A coding index, namely, Index(), is generated. The Index() includes information about the distribution index I4.
The I4 may be placed into the coding index in any mode identifiable to the decoder, for example, by placing the I4 into the positions that start from a set position of the coding index, which is the simplest mode.
Nevertheless, in the case that the pulse being encoded includes a sign, the Index() also needs to carry information about the sign index, namely, ss(h), of each pulse. The pulse sign vector SS() may be simply placed as a field with a length of into a fixed position of the coding index, for example, at the end of the coding index.
To sum up, a mode of constructing the Index() may be:
Index()=I4×+ss(0)×+ss(1)×+ . . . +ss(N−1).
It is easily understandable that the mode of constructing a coding index described above is only an example of this embodiment. In practice, it is easy to derive other modes of constructing a coding index structure from the basic information about the coding index structure, for example, by swapping or recombining the index positions. The mode of constructing a coding index does not constitute any limitation to the embodiments of the present invention.
Examples are given below in order to further facilitate the understanding of the mode of constructing a coding index in the first embodiment of the present invention, supposing that the total quantity of positions on the track is M=16.
=5 pulses with signs are encoded.
The coding index, namely, Index(5), occupies 19 bits in total. That is, Index(5)ε[0, 219−1]. The coding value range of the Index(5) in
Five sign indices, namely, ss(0)˜ss(4), occupy five bits at the end.
In
=4 pulses with signs are encoded. The structure of the coding index is as follows:
The coding index, Index(4), occupies 16 bits in total. That is, Index(4)ε[0, 216−1].
Four sign indices, namely, ss(0)˜ss(3), occupy four bits at the end.
A space of 12 bits is available to the I4. Therefore, the coding space length available to the I4 is 212=4096, which is enough because WQ(4)=C16+4−14=3876.
=3 pulses with signs are encoded. The structure of the coding index is as follows:
The coding index, Index(3), occupies 13 bits in total. That is, Index(3)ε[0, 213−1].
Three sign indices, namely, ss(0)˜ss(2), occupy three bits at the end.
A space of 10 bits is available to the I4. Therefore, the coding space length available to the I4 is 210=1024, which is enough because WQ(3)=C16+3−13=816.
A coding method is provided in the second embodiment. A method for calculating a distribution index I4 is provided in this embodiment, thus making it easy to determine the corresponding relation between I4 and the distribution of pulses on the track through algebraic calculation, where the distribution is Q()={q(0), q(1), . . . , q(−1)}.
The following Q() sequencing calculation rule is provided in this embodiment.
The Q() varies with the value combination included in it. Therefore, serial numbers of the positions included in Q() may be permuted, supposing:
q(0)≦q(1)≦ . . . ≦q(−1), or q(0)≧q(1)≧ . . . ≧q(−1),
If they are ordered from a smaller value to a greater value and the ordered Q() are numbered, with the starting serial number being 0, then:
The foregoing formula may be interpreted as follows:
It should be noted that the foregoing formula is only an exemplary calculation relation between I4 and Q(). Depending on the same sequencing rule, the calculation relation may also be expressed in other algebraic modes equivalently. If a different sequencing rule is applied, similar calculation relations may also be designed. The mode of expressing the calculation relation does not constitute any limitation to the embodiments of the present invention.
To make the foregoing I4 calculation method clearer, a relative position vector of pulses is assumed: XX()={xx(1), xx(2), . . . , xx()}. The following one-to-one corresponding relation exists between XX() and Q():
xx(1)=q(0); and
xx(i)=q(i−1)−q(i−2).
Given below is an example. Supposing M=16 and =3 (M is the total quantity of positions on the track), the tree structure is shown in
I4(3)=C183−C18-q(0)3+C17-q(0)2−C17-q(1)2C16-q(1)1−C16-q(2)1.
If the value of is different, the corresponding tree structure is similar, and the formula for calculating the I4 can be deduced and is not repeated here any further.
A method for obtaining a distribution index I4 through a calculation relation is disclosed in this embodiment. Because the data quantity occupied by the I4 in the coding index is large, the calculation method in this embodiment minimizes the storage load of the coder and the decoder. The I4 is encoded continuously in a strict one-to-one relation with Q(), thus making the best of the coding bits and avoiding waste.
A coding method is disclosed in the third embodiment. The third embodiment differs from the first embodiment in that: The third embodiment regards the coding process in the first embodiment as a first coding mode, a coding mode is selected among options of the first coding mode first, and then pulses are encoded in the selected coding mode. As shown in
B1: The total quantity () of pulses to be encoded on the same track is determined.
The value of generally depends on the coding rate.
B2: A coding mode is selected according to the value of . Coding modes include a first coding mode. Depending on the selection result, the process proceeds to step B3 or step B4.
The coding mode described in the first embodiment is called a first coding mode in this embodiment. Optional coding modes include not only the first coding mode, but also other coding modes such as AMR_WB+ in the conventional art. A second coding mode, which is optional, is disclosed in this embodiment.
The coding mode may depend on the determined value. For example, for some values, the first coding mode is applied; and for other values, the second coding mode is applied. Researches reveal that the first coding mode is preferred when the value of is 3, 4, or 5.
B3: The result of selecting the coding mode is judged. If it is determined that the first coding mode is selected, the pulses are encoded in the first coding mode.
The specific coding process is similar to the description in the first embodiment, namely, steps A1, A2, and A3 in the first embodiment.
B4. The result of selecting the coding mode is judged. If it is determined that the second coding mode is selected, the pulses are encoded in the second coding mode. The second coding mode may include the following steps.
B41: Statistics about the positions of the pulses to be encoded on a track are collected to obtain the quantity of positions with a pulse, pulse distribution of the positions with a pulse on the track, and the quantity of pulses in each position with a pulse.
Similar to step A1 in the first embodiment, a pulse position vector, namely, P(N)={p(0), p(1), . . . , p(N−1)}, represents the distribution of the positions with a pulse on the track; a position sign vector, namely, S(N)={s(0), s(1), . . . , s(N−1)}, represents the pulse sign information of each position with a pulse; and the quantity of the positions with a pulse is obtained. In this embodiment, a pulse quantity vector, namely, SU(N)={su(0), su(1), . . . , su(N−1)}, represents the quantity of pulses in each position with a pulse, where su(n) represents the quantity of pulses in the p(n) position. Evidently, su(0)+su(1)+ . . . +su(N−1)=
Evidently, in this embodiment, a one-to-one corresponding relation exists between P(N), SU(N), and S(N).
After the parameters such as N, P(N), SU(N), and S(N) of the pulses to be encoded are obtained through statistics, the parameters need to be encoded into indices, and a corresponding relation is established between the parameter and the index so that the decoder can recover a parameter according to the corresponding index.
B42: The first index I1 is determined according to the quantity (namely, pos_num=N) of positions with a pulse. The first index I1 corresponds to all possible distributions of the positions with a pulse on the track when the pos_num is the same.
The pos_num value (N) fluctuates mildly. Therefore, the corresponding relation with the first index I1 may be expressed by either a calculation relation or a query relation. At the time of establishing a corresponding relation between pos_num and I1, this corresponding relation may be assumed as a one-to-one corresponding relation. Nevertheless, when the pos_num has other values, the index of other parameters requires fewer bits. Such pos_num values may use one I1 jointly, and are distinguished through an extra flag bit.
The pos_num value (N) decides the total quantity of all possible P(N) values, and the total quantity is W(N)=CMN, where C refers to calculating the combination function. Therefore, one I1 corresponds to W(N) possible P(N), where W(N) is a natural number.
B43: The second index I2 is determined according to the distribution of the positions with a pulse, where the distribution is expressed by P(N). The second index I2 indicates the instance of distribution corresponding to the distribution of the current positions with a pulse among all possible distributions corresponding to the first index I1.
The total quantity of all possible P(N) values is W(N)=CMN. The W(N) is a large data quantity. Therefore, a calculation relation is preferred as a corresponding relation with the second index I2. Nevertheless, it is also practicable to express the corresponding relation through a query relation. Evidently, W(N) is the total quantity of all possible values of I2. If the value of I2 starts from 0, 12ε[0, W(N)−1].
B44: The third index I3 is determined according to SU(N) which represents the quantity of pulses in each position with a pulse.
The SU(N) is a vector whose dimension is the same as the dimension of P(N), but is limited to su(0)+su(1)+ . . . +su(N−1)=, where the value of generally ranges from 1 to 6. Therefore, the corresponding relation with the third index I3 may be expressed by either a calculation relation or a query relation. Moreover, in view of the vector form, the query relation is preferred in the case of high dimensions, and the calculation relation is preferred in the case of low dimensions because it makes the design easier. It should be noted that in some extreme circumstances, for example, if N=1 or N=, the SU(N) has only one possible value, which does not need to be indicated by a specific I3, and the I3 may be regarded as any value that does not affect the final coding index.
B45: A coding index, namely, Index(), is generated. The Index() includes information about the first index I1, the second index I2, and the third index I3.
The I1, I2, and I3 may be placed into the coding index in any mode identifiable to the decoder, for example, by placing them into a fixed field separately, which is the simplest mode. When the total quantity (pulse_num) of pulses to be encoded on the same track is constant, the pos_num value (N) indicated by I1 decides the range of I2 and I3, namely, decides the quantity of coding bits required by I2 and I3. Therefore, the coding index is constructed in the following mode:
(1) The first index I1 is used as a start value, and the information about other indices is overlaid. A value of I1 corresponds to an independent value range of the coding index. In this way, the decoder can determine the pos_num value (N) directly according to the value range of the coding index.
(2) Further, in the value range of the I1 (generally corresponding to a certain field length), the I2 and the I3 may be placed in any mode identifiable to the decoder, for example, by placing them separately, which is the simplest mode. Generally, neither I2 nor I3 can be expressed as 2n (n is an integer number). Therefore, in order to save coding bits, I2 and I3 may be combined in the following way and placed into the specified value range of I1:
I23=I3×W(N)+I2=I3×CMN+I2.
where the coding of both I2 and I3 starts from 0, I2ε[0, CMN−1], and I3ε[0, Class(N)−1], where Class(N) is the total quantity of possible values of SU(N); evidently, such a mode is equivalent to dividing the value range of I1 into Class(N) portions, where the length of each portion is W(N), and each portion corresponds to a distribution, namely, a SU(N) value.
(3) Nevertheless, in the case that the pulse being encoded includes a sign, the Index() needs also to carry information about the sign index, namely, s(n), of each pulse. The position sign vector S(N) may be simply placed as a field with a length of N into a fixed position of the coding index, for example, at the end of the coding index.
To sum up, a mode of constructing the Index() may be:
Index()=I1+I23×+s(0)×+s(1)×+ . . . +s(−1).
It is easily understandable that the mode of constructing a coding index described above is only an example of this embodiment. In practice, it is easy to derive other modes of constructing a coding index structure from the basic information about the coding index structure, for example, by swapping or recombining the index positions. The mode of constructing a coding index does not constitute any limitation to the embodiments of the present invention.
For any quantity of pulses to be encoded, the coding logics provided in the second coding mode may be applied uniformly, thus avoiding increase of the coding index redundancy of the recursive mode applied in AMR_WB+, and ensuring a high utilization ratio of the coding bits. Meanwhile, it is not necessary to encode multiple pulses in the same position separately. Instead, the positions of pulses are merged before coding, thus saving coding bits. With the increase of the pulses to be encoded on the track, the probability of overlaying pulse positions also increases, and the merits of the embodiments of the prevent invention are more noticeable.
Examples are given below in order to further facilitate the understanding of the mode of constructing a coding index in the second coding mode. Supposing that the total quantity of positions on the track is M=16 and the quantity of positions with a pulse is pos_num, and the pos_num is in a one-to-one corresponding relation with the first index I1:
=6 Pulses with Signs are Encoded.
The coding index, namely, Index(6), occupies 21 bits in total. That is, Index(6)ε[0, 221−1].
(1) When six pulses are in one position, N=1, W(1)=16, I2(1)ε[0, 15], SU(1)={6}, Class(1)=1, and I3(1)=0,
(2) When six pulses are in two positions, N=2, W(2)=120, I2(2)ε[0, 119], SU(2)={5, 1}, {4, 2}, {3, 3}, {2, 4}, {1, 5}; Class(2)=5, and I3(2)ε[0, 4],
(3) When six pulses are in three positions, N=3W(3)=560, I2(3)ε[0, 559], SU(3)={4, 1, 1}, {1, 4, 1}, {1, 1, 4}, {3, 2, 1}, {3, 1, 2}, {2, 3, 1}, {2, 1, 3}, {1, 3, 2}, {1, 2, 3}, {2, 2, 2}; Class(3)=10, and I3(3)ε[0, 9],
(4) When six pulses are in four positions, N=4, W(4)=1820, I2(4)ε[0, 1819], SU(4)={3, 1, 1, 1}, {1, 3, 1, 1}, {1, 1, 3, 1}, {1, 1, 1, 3}, {2, 2, 1, 1}, {2, 1, 2, 1}, {2, 1, 1, 2}, {1, 2, 2, 1}, {1, 2, 1, 2}, {1, 1, 2, 2}; Class(4)=10, and I3(4)ε[0, 9],
(5) When six pulses are in five positions, N=5, W(5)=4368, I2(5)ε[0, 4367], SU(5)={2, 1, 1, 1, 1}, {1, 2, 1, 1, 1}, {1, 1, 2, 1, 1}, {1, 1, 1, 2, 1}, {1, 1, 1, 1, 2}; Class(5)=5, and I3(5)ε[0, 4],
(6) When six pulses are in six positions, N=6, W(6)=8008, I2(6)ε[0, 8007], SU(6)={1, 1, 1, 1, 1, 1}, Class(6)=1, and I3(6)=0,
=5 pulses with signs are encoded.
The coding index, Index(5), occupies 19 bits in total. That is, Index(5)ε[0, 219−1].
The detailed analysis on
=4 pulses with signs are encoded.
The coding index, Index(4), occupies 16 bits in total. That is, Index(4)ε[0, 216−1]. The figure shows the quantity of bits occupied by different portions of Index(4) when the pos_num value varies. The I1(N) is determined in a mapping mode:
=3 pulses with signs are encoded.
The coding index, Index(3), occupies 13 bits in total. That is, Index(3)ε[0, 213−1].
The figure shows the quantity of bits occupied by different portions of Index(3) when the pos_num value varies. The I1(N) is determined in a mapping mode:
=2 pulses with signs are encoded.
The coding index, Index(2), occupies 9 bits in total. That is, Index(2)ε[0, 29−1]. The figure shows the quantity of bits occupied by different portions of Index(2) when the pos_num value varies. The I1(N) is determined in a mapping mode:
=1 pulse with a sign is encoded.
The coding index, Index(1), occupies 5 bits in total. That is, Index(1)ε[0, 25−1]. Considering N≡1, the Index(1) includes only index I23(1)=I2(1) and s(0) which is a sign index of p(0).
A coding method is disclosed in the fourth embodiment. More specifically, a method for calculating the second index I2 in the second coding mode is provided in this embodiment, thus making it easy to determine the corresponding relation between I2 and the distribution of the positions with a pulse on a track through algebraic calculation, where the distribution is P(N)={p(0), p(1), . . . , p(N−1)}.
In this embodiment, the method of calculating I2 is: All possible P(N) values are permuted in a set order, where N is the quantity of the positions with a pulse corresponding to the first index I1; the permuting number in the permutation serves as a second index I2 indicative of the distribution.
The “set order” may be understood as an order of all possible P(N) values determined by the coder and the decoder according to the same sequencing calculation rule. The following sequencing calculation rule is provided in this embodiment:
The P(N) varies with the value combination included in it. Therefore, serial numbers of the positions included in P(N) may be permuted, supposing:
Supposing p(0)<p(1)< . . . <p(N−1), p(0)ε[0, M−N], p(n)ε[p(n−1)+1, M−N+n], where M is the total quantity of positions on the track. All possible values of P(N) are ordered from a smaller value to a greater value or from a greater value to a smaller value after the values in each dimensions of the P(N) are compared.
If they are ordered from a smaller value to a greater value and the ordered P(N) values are numbered, with the starting serial number being 0, then:
The foregoing formula may be interpreted as follows:
It should be noted that the foregoing formula is only an exemplary calculation relating 14 to Q(N). Depending on the same sequencing rule, the calculation relation may also be expressed in other algebraic modes equivalently. If a different sequencing rule is applied, similar calculation relations may also be designed. The mode of expressing the calculation relation does not constitute any limitation to the embodiments of the present invention.
To make the foregoing I2 calculation method clearer, a relative position vector of pulses is assumed: X(N)={x(1), x(2), . . . , x(N)}. The following one-to-one corresponding relation exists between X(N) and P(N):
x(1)=p(0); and
x(i)=p(i−1)−p(i−2).
where x(i) represents a relative position relation between the ith position with a pulse and the (i−1)th position with a pulse, iε[1, N]. The X(N) can construct an N-layer tree that includes all possible values of P(N). The depth of the tree is N+1, and the sub-node on the ith layer represents the relative position value x(i) of ith position with pulse. The values of x(i) are arranged from left to right and from a smaller value to a greater value. The end nodes are encoded from left to right at the bottom (namely, end nodes) of the tree. Each path from an end node to a root node corresponds to a value of X(N). Therefore, the code of each end node is the second index I2 indicative of the corresponding P(N) value.
In the examples given below, it is assumed that the total quantity of positions on the track is M=16.
The quantity of the positions with a pulse, namely, pos_num, is n=2, and
The quantity of the positions with a pulse, namely, pos_num, is n=3, and
When the value of N is 4, 5, or 6, the corresponding tree structure is similar, and the formula for calculating the I2 can be deduced and is not repeated here any further.
A method for obtaining a second index I2 through a calculation relation is disclosed in this embodiment. Because the data quantity occupied by the I2 in the coding index is large, the calculation method in this embodiment minimizes the storage load of the coder and the decoder. The I2 is encoded continuously in a strict one-to-one relation with P(N), thus making the best of the coding bits and avoiding waste.
The merits of the coding index construction mode in the first coding mode and the second coding mode are given below. In theory, on the precondition that the total quantity (pulse_num) of the pulses to be encoded on the same track is constant, the quantity of all possible permutations of all pulses on the track is the minimum value range of the coding index, and the corresponding quantity of coding bits is a theoretic lower limit. When the quantity of permutations is 2n (n is an integer), the theoretic lower limit of the quantity of coding bits is an integer; when the quantity of permutations is not 2n (n is an integer), the theoretic lower limit of the quantity of coding bits is a decimal fraction. In this case, certain coding redundancy exists. When the total quantity of positions on the track is M=16, with different values of pulse_num, a comparison is made between the theoretic lower limit of the quantity of coding bits, and the quantity of coding bits required in the AMR_WB+ coding mode, and the quantity of bits required by the coding index construction mode in the first coding mode and the second coding mode, as shown in Table 1:
Table 1 reveals that: The coding index construction mode of the second coding mode reaches the theoretic lower limit when the theoretic lower limit is an integer, and reaches 1 plus the integer part of the theoretic lower limit when the theoretic lower limit is a decimal fraction. When is 3, 4, or 5, the first coding mode has a coding bit length equal to that of the second coding mode. In the case of high code rates, both of such coding modes provide a coding efficiency higher than that of the AMR_WB+, namely, can save more bits.
With respect to calculation complexity, by using all the test orders in the reference codes of the AVS-M mobile audio standard as test objects, a comparison of operation time is made between the AMR_WB+, the first coding mode, and the second coding mode (all sample spaces are traversed, including the coding process and the decoding process, the first coding mode is the calculation mode provided in the second embodiment, the second coding mode is the calculation mode provided in the fourth embodiment, and the decoding mode is the corresponding mode provided in the subsequent embodiments), as shown in Table 2:
Table 2 reveals that: The first coding mode involves lower operation complexity in most circumstances, and the operation complexity of the second coding mode is equivalent to that of the AMR_WB+. Table 1 and Table 2 reveal that: By using the first coding mode and the second coding mode, the low calculation complexity of the first coding mode is exerted when is 3, 4, or 5, and the low coding bit length of the second coding mode is exerted when is another value.
A coding method is disclosed in the fifth embodiment of the present invention. As shown in
C1: Statistics about the pulses to be encoded on a track are collected according to positions, to obtain the quantity of positions with a pulse, pulse distribution of the positions with a pulse is distributed on the track, and the quantity of pulses in each position with a pulse.
The description about step C1 is similar to the description about step B41 in the third embodiment, and is not repeated here any further.
C2: The first index I1 is determined according to the quantity (namely, pos_num=N) of the positions with a pulse. The first index I1 corresponds to all possible distributions of the positions with a pulse on the track when the pos_num is the same.
The description about step C2 is similar to the description about step B42 in the third embodiment, and is not repeated here any further.
C3: The second index I2 is determined according to the distribution of the pulse positions on the track, where the distribution is expressed by P(N). The second index I2 indicates the instance of distribution corresponding to the distribution of the current position with a pulse among all possible distributions corresponding to the first index I1.
The description about step C3 is similar to the description about step B43 in the third embodiment, and is not repeated here any further.
C4: The third index I3 is determined according to SU(N) which represents the quantity of pulses in each position with a pulse.
The description about step C4 is similar to the description about step B44 in the third embodiment, and is not repeated here any further.
C5: A coding index, namely, Index(), is generated. The Index() includes information about the first index I1, the second index I2, and the third index I3.
The description about step C5 is similar to the description about step B45 in the third embodiment, and is not repeated here any further.
The relevant description about the fifth embodiment is similar to the description about the third embodiment (including the examples), and is not repeated here any further.
A coding method is disclosed in the sixth embodiment. In this embodiment, the coding logics identical to those of the fifth embodiment are applied. Specifically, a method for calculating the second index I2 is provided in this embodiment, thus making it easy to determine the corresponding relation between I2 and the distribution of the positions with a pulse on a track through algebraic calculation, where the distribution is P(N)={p(0), p(1), . . . , p(N−1)}. The detailed description is similar to that of the fourth embodiment, and is not repeated here any further.
The decoding method disclosed herein is detailed below.
A decoding method is provided in the seventh embodiment. The decoding method provided in this embodiment decodes the coding index obtained according to the coding method in the first embodiment. The decoding process is the inverse of the coding process. As shown in
D1: A coding index Index() is received.
D2: The distribution index I4 is extracted from the Index().
The process of extracting the distribution index I4 from the Index() may be the inverse of the process of placing the I4 into the Index() at the time of coding. For example, if the I4 is placed into a fixed field, the I4 may be extracted from the field directly.
If the coded pulse is a pulse with a sign, the sign index ss(h) corresponding to each pulse needs to be extracted from the Index(). The total quantity of bits varies with the code rate. Therefore, the decoder may determine the total quantity of pulses encoded on the same track, namely, pulse_num=, directly according to the length (quantity of bits) of the coding index, and then extract the corresponding quantity of sign indices ss(h) from the Index() according to . According to the structure of the Index() provided in the first embodiment, the sign indices are located at the end of the Index(), and therefore, each ss(h) may be extracted from the Index() directly.
D3: The distribution of each position of the pulses on the track, which is expressed as Q(), is determined according to the distribution index I4.
The decoding of the I4 is the inverse of encoding the I4. If the I4 is obtained through a calculation relation in the coding process, the same calculation relation may be applied in the decoding process to perform an inverse operation; if the I4 is obtained through a query relation in the coding process, the same corresponding relation may be queried in the decoding process.
D4: The pulse order on the track is reconstructed according to the Q(), which represents the distribution of each position of the pulses on the track.
If the pulse includes a sign, at the time of reconstructing the pulse order on the track, the positive or negative feature of the pulse sign of each pulse needs to be recovered according to the pulse sign information carried in each sign index ss(h).
A decoding method is disclosed in the eighth embodiment. The decoding logics applied in this embodiment are the same as those applied in the seventh embodiment. The eighth embodiment discloses a calculation method for decoding the distribution index I4 obtained through the coding method in the second embodiment. This calculation method at the decoder is the inverse of the method for calculating the I4 in the second embodiment.
If the I4 is obtained through
in the coding process, the following calculation process is applied at the decoder:
A decoding method is provided in the ninth embodiment. The decoding method provided in this embodiment decodes the coding index obtained according to the coding method in the third embodiment. The decoding process is the inverse of the coding process. As shown in
E1: The total quantity () of pulses encoded on the same track by the received coding index Index() is determined.
The decoder may determine the total quantity of pulses encoded on the same track, namely, pulse_num=, directly according to the length (quantity of bits) of the coding index. Nevertheless, the decoder may also obtain the value corresponding to the coding index in a mode agreed with the encoder (for example, by exchanging information mutually before receiving the coding index). This embodiment does not specify the mode of obtaining the .
E2: A decoding mode is selected according to the value of . Decoding modes include a first decoding mode. Depending on the selection result, the process proceeds to step E3 or step E4.
The decoding mode described in the seventh embodiment is called a first decoding mode in this embodiment. Optional decoding modes include not only the first decoding mode, but also other decoding modes. Each optional decoding mode needs to correspond to the coding mode provided at the encoder. This embodiment provides a second decoding mode corresponding to the second coding mode described above.
In order to ensure consistency between the coding mode and the decoding mode, the decoder needs to select a decoding mode by using the corresponding rule applied at the coder.
E3: The result of selecting the decoding mode is analyzed. If it is determined that the first decoding mode is selected, the pulses are decoded in the first coding mode. The step of extracting the distribution index from the coding index is performed.
For the specific decoding process, the description in the seventh embodiment serves as a reference.
E4: The result of selecting the decoding mode is analyzed. If it is determined that the second decoding mode is selected, the pulses are decoded in the second decoding mode. The second decoding mode may include the following steps:
E41: The first index I1 is extracted from the Index(). The quantity of pulse positions, namely, pos_num, is determined according to the I1.
The total quantity of bits varies with the code rate. Therefore, the decoder may determine the total quantity of pulses encoded on the same track, namely, pulse_num=, directly according to the length (quantity of bits) of the coding index.
The process of extracting the information about each index from the Index() may be the inverse of the process of combining the indices into an Index() at the coder. For example, if each index is placed into a fixed field separately, the index may be extracted from the field directly.
If the Index() is a structure that uses the I1 as a starting value and overlays other indices as described in the third embodiment, it is appropriate to extract the I1 first, and then determine the positions of other indices in the Index() according to the pos_num value () corresponding to the I1. In this case, considering that one I1 corresponds to an independent value range of the Index(), the decoder may judge the value range of the Index() among several set independent value ranges, and determine the first index I1 according to the starting value of such a value range.
E42: The second index I2 and the third index I3 are extracted from the Index().
Like I1, the I2 and the I3 are also extracted in a process contrary to the process of combining the I2 and the I3 into the Index(), and can be extracted directly if they are placed independently at the coder. If the I2 and the I3 are combined and overlaid in the coding process as described in the third embodiment, they can be separated in the following steps:
(1) The combination value of I2 and I3, namely, I23, is extracted from the Index().
The position of I23 in the Index() may be indicated by the N value determined by the I1.
(2) The I2 and the I3 are separated in this way: I2=I23% W(N) and I3=Int[I23/W(N)]. W(N) is the total quantity of all possible P(N) in the case of pos_num=N, and W(N)=CMN, where M is the total quantity of positions on the track, % refers to taking the remainder and Int refers to taking the integer.
E43: If the coded pulse is a pulse with a sign, the sign index s(n) corresponding to each position with a pulse needs to be extracted from the Index().
According to the structure of the Index() provided in the third embodiment, the N sign indices are located at the end of the Index(). Therefore, each s(n) may be separated from the Index() directly after the N value indicated by the I1 is obtained.
E44: The distribution of each position with a pulse on the track in the case of pos_num=N is determined according to the second index I2, where the distribution is expressed as P(N).
The decoding of the I2 is the inverse of encoding the I2. If the I2 is obtained through a calculation relation in the coding process, the same calculation relation may be applied in the decoding process to perform an inverse operation. If the I2 is obtained through a query relation in the coding process, the same corresponding relation may be queried in the decoding process.
E45: The SU(N), which represents the quantity of pulses in each position with a pulse, is determined according to the third index I3. The rule of decoding the I3 is similar to the rule of decoding the I2.
E46: The pulse order on the track is reconstructed according to the P(N) and the SU(N), where: P(N) represents distribution the positions with a pulse on the track, and SU(N) represents the quantity of pulses in each position with a pulse.
If the pulse includes a sign, at the time of reconstructing the pulse order on the track, the positive or negative feature of the pulse sign of each position with a pulse needs to be recovered according to the pulse sign information carried in each sign index s(n).
A decoding method is disclosed in the tenth embodiment. The decoding logics applied in this embodiment are the same as those applied in the ninth embodiment. The tenth embodiment discloses a calculation method in the second decoding mode for decoding the second index I2 obtained through the coding method in the fourth embodiment. This calculation method at the decoder is the inverse of the method for calculating the I2 in the fourth embodiment.
If the I2 is obtained through
in the coding process, the following calculation process is applied at the decoder:\
(1) CM-1N-1, . . . , and CM-y0N-1 are subtracted from I2 one by one.
R(y0)=I2−CM-1N-1− . . . −CM-y0N-1,
until the I2 remainder R(y0) changes from a positive number to a negative number, where M is the total quantity of positions on the track, N is the quantity of the positions with a pulse, y0ε[1, M−N+1], and C refers to calculating the combination function. The p(0), namely, the serial number of the first position with a pulse on the track, is recorded, where p(0)=y0−1.
(2) If N>1, CM-p(0)-1N-2, . . . , and CM-p(0)-y1N-2 are further subtracted from R[p(0)] one by one until the R[p(0)] remainder R1(x1) changes from a positive number to a negative number. The p(1), namely, the serial number of the second position with a pulse on the track, is recorded, where p(1)=y1−1.
(3) By analogy, CM-p(0)- . . . -p(n-1)-1N-n-2, . . . , and CM-p(0)- . . . -p(n-1)-ynN-n-2 are further subtracted from R(n−1)[p(n−1)] one by one until the R(n−1)[p(n−1)] remainder Rn(yn) changes from a positive number to a negative number, where n≦N−1. The p(n), namely, the serial number of the n+1 pulse position on the track, is recorded, where p(n)=yn−1.
(4) The decoding of the I2 is completed, and P(N)={p(0), p(1), . . . , p(N−1)} is obtained.
A decoding method is provided in the eleventh embodiment. The decoding method provided in this embodiment decodes the coding index obtained according to the coding method in the fifth embodiment. The decoding process is the inverse of the coding process. As shown in
F1: The coding index Index() is received, and the first index I1 is extracted from the Index(). The quantity of positions with a pulse, namely pos_num, is determined according to the I1.
The description about step F1 is similar to that of step E41 in the ninth embodiment.
F2: The second index I2 and the third index I3 are extracted from the Index().
The description about step F2 is similar to that of step E42 in the ninth embodiment.
F3: If the coded pulse is a pulse with a sign, the sign index s(n) corresponding to each position with a pulse needs to be extracted from the Index().
The description about step F3 is similar to that of step E43 in the ninth embodiment.
F4: The distribution of each position with a pulse on the track in the case of pos_num=N is determined according to the second index I2, where the distribution is expressed as P(N).
The description about step F4 is similar to that of step E44 in the ninth embodiment.
F5: The SU(N), which represents the quantity of pulses in each position with a pulse, is determined according to the third index I3. The rule of decoding the I3 is similar to the rule of decoding the I2. The description about step F5 is similar to that of step E45 in the ninth embodiment.
F6: The pulse order on the track is reconstructed according to the P(N) and the SU(N), where P(N) represents the distribution of each position with a pulse on the track, and SU(N) represents the quantity of pulses in each position with a pulse.
The description about step F6 is similar to that of step E46 in the ninth embodiment.
A decoding method is disclosed in the twelfth embodiment. The decoding logics applied in this embodiment are the same as those applied in the eleventh embodiment. The eighth embodiment discloses a calculation method for decoding the second index I2 obtained through the coding method in the sixth embodiment. This calculation method at the decoder is the inverse of the method for calculating the I2 in the sixth embodiment. The detailed description is similar to that of the fourth embodiment, and is not repeated here any further.
It is understandable to those skilled in the art that all or part of the steps of the foregoing embodiments may be implemented through software, hardware, or both thereof.
The embodiments of the present invention may further include a computer-readable storage medium for bearing or storing instructions readable or executable by a computer, or for storing data instructions. The program may be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, and compact disk. When being executed, the program generated out of the instructions stored in the storage medium may cover part or all of the steps in any embodiment of the present invention.
The coder and the decoder under the present invention are detailed below.
A coder is disclosed according to an embodiment of the present invention. The coder may include: (1) a pulse distribution obtaining unit, adapted to obtain the pulse distribution, on a track, of all the pulses to be encoded on the track; (2) a distribution identifier determining unit, adapted to determine a distribution identifier for identifying the pulse distribution, according to the pulse distribution obtained by the pulse distribution obtaining unit; and (3) a coding index generating unit, adapted to generate a coding index that includes the distribution identifier determined by the distribution identifier determining unit.
The pulse distribution obtained by the pulse distribution obtaining unit may include the information about the distribution of positions of pulses on the track.
The distribution identifier determining unit may include: (1) a comparing unit, adapted to compare the pulse distribution with all possible distributions of the pulse positions on the track; and (2) an obtaining unit, adapted to obtain a distribution identifier corresponding to the pulse distribution compared by the comparing unit, wherein each possible distribution of the pulse positions corresponds to a distribution identifier.
The pulse distribution may include: quantity of positions with a pulse, distribution of the positions with a pulse on the track, and quantity of pulses in each position with a pulse.
The distribution identifier may carry information about the first index, the second index, and the third index, where: (1) the first index is adapted to identify the information about all possible distributions of the positions with a pulse on the track when the quantity of the positions with a pulse is the same; (2) the second index is adapted to identify the instance of distribution corresponding to the current position with a pulse among all possible distributions corresponding to the first index; and (3) the third index is adapted to identify the information about the quantity of pulses in each position with a pulse.
The distribution identifier determining unit may include: (1) a first determining unit, adapted to determine the first index according to the quantity of positions with a pulse; (2) a second determining unit, adapted to determine the second index according to the distribution of the positions with a pulse on the track; and (3) a third determining unit, adapted to determine the third index according to the quantity of pulses in each position with a pulse.
The coder may further include: a permuting unit, adapted to: permute all the possible distributions of the positions of the pulses on the track in a set order with respect to the current quantity of pulses before the comparing unit compares the pulse distribution with the information about all possible distributions of the positions with a pulse on the track, or before the second determining unit determines the second index according to the distribution of the positions of pulses on the track, where the permuting number in the permutation serves as a distribution index indicative of the distribution.
The pulse distribution obtaining unit may also obtain the pulse sign information indicative of the positive and negative features of the pulse when obtaining the pulse distribution about how all the pulses to be encoded on the track are distributed on the track. The distribution identifier determining unit may also determine the pulse sign identifier corresponding to the pulse sign information when determining the distribution identifier. The coding index generated by the coding index generating unit may also include the pulse sign identifier corresponding to each pulse.
A coder is disclosed according to an embodiment of the present invention. The coder may include: (1) a pulse sum determining unit, adapted to determine the total quantity of pulses to be encoded on a track; (2) a coding mode selecting unit, adapted to select a coding mode according to the total quantity of pulses determined by the pulse sum determining unit; and (3) a coding unit, adapted to perform coding in the coding mode selected by the coding mode selecting unit.
The coding unit may include: (1) a pulse distribution obtaining unit, adapted to obtain pulse distribution about how all the pulses to be encoded on a track are distributed on the track; (2) a distribution identifier determining unit, adapted to determine a distribution identifier for identifying the pulse distribution according to the pulse distribution obtained by the pulse distribution obtaining unit; and (3) a coding index generating unit, adapted to generate a coding index that includes the distribution identifier determined by the distribution identifier determining unit.
The pulse distribution may include the information about the distribution of the positions of pulses on the track.
The distribution identifier determining unit may include: (1) a comparing unit, adapted to compare the pulse distribution with the information about all possible distributions of the positions of the pulses on the track; and (2) an obtaining unit, adapted to obtain a distribution identifier corresponding to the pulse distribution compared by the comparing unit, where the information about each possible distribution corresponds to a distribution identifier.
The coder may further include a permuting unit, adapted to: permute all possible distributions of the positions of the pulses on the track in a set order with respect to the current quantity of pulses before the comparing unit compares the pulse distribution with the information about all possible distributions of the positions of the pulses on the track, where the permuting number in the permutation serves as a distribution index indicative of the distribution.
The pulse distribution may include: quantity of positions with a pulse, distribution of the positions with a pulse on the track, and quantity of pulses on each position with a pulse.
The distribution identifier may carry information about the first index, the second index, and the third index, where: (1) the first index is adapted to identify the information about all possible distributions of the positions with a pulse on the track when the quantity of the positions with a pulse is the same; (2) the second index is adapted to identify the instance of distribution corresponding to the current position with a pulse among all possible distributions corresponding to the first index; and (3) the third index is adapted to identify the information about the quantity of pulses in each position with a pulse.
The distribution identifier determining unit may include: (1) a first determining unit, adapted to determine the first index according to the quantity of the positions with a pulse; (2) a second determining unit, adapted to determine the second index according to the distribution of the positions with a pulse on the track; and (3) a third determining unit, adapted to determine the third index according to the quantity of pulses in each position with a pulse.
The coder may further include a permuting unit, adapted to: permute all possible distributions of the positions with a pulse on the track in a set order with respect to the current quantity of pulses before the second determining unit determines the second index according to the distribution of positions of the pulses on the track, where the permuting number in the permutation serves as a distribution index indicative of the distribution.
The pulse distribution obtaining unit may also obtain the pulse sign information indicative of the positive and negative features of the pulse when obtaining the pulse distribution about how all the pulses to be encoded on the track are distributed on the track. The distribution identifier determining unit may also determine the pulse sign identifier corresponding to the pulse sign information when determining the distribution identifier. The coding index generated by the coding index generating unit may also include the pulse sign identifier corresponding to each pulse.
A decoder is disclosed according to an embodiment of the present invention. The decoder may include: (1) a coding index receiving unit, adapted to receive a coding index; (2) a distribution identifier extracting unit, adapted to obtain a distribution identifier from the coding index received by the coding index receiving unit, wherein the distribution identifier is configured to identify the pulse distribution, on a track, of all the pulses to be encoded on the track; (3) a pulse distribution determining unit, adapted to determine the pulse distribution, on a track, of all the pulses to be encoded on the track, according to the distribution identifier obtained by the distribution identifier extracting unit; and (4) a pulse order reconstructing unit, adapted to reconstruct the pulse order on the track according to the pulse distribution determined by the pulse distribution determining unit.
The pulse distribution may include the information about the distribution of positions of pulses on the track.
The pulse distribution determining unit may include:
The distribution identifier may carry information about the first index, the second index, and the third index; where
The pulse distribution may include: quantity of positions with a pulse, distribution of positions with a pulse on the track, and quantity of pulses on each position with a pulse.
The distribution identifier extracting unit may include:
The pulse distribution determining unit includes:
The distribution identifier extracting unit may also extract the pulse sign identifier indicative of the positive and negative features of the pulse from the coding index when extracting the distribution identifier from the coding index. The pulse distribution determining unit may also determine the corresponding pulse sign information according to the pulse sign identifier when determining the pulse distribution according to the distribution identifier. The pulse order reconstructing unit may recover the positive or negative feature of the pulse according to the pulse sign information when reconstructing the pulse order on the track.
A decoder is disclosed according to an embodiment of the present invention. The decoder may include: (1) a coding index receiving unit, adapted to receive a coding index; (2) a pulse sum determining unit, adapted to determine the total quantity of pulses encoded on the track with respect to the coding index received by the coding index receiving unit; (3) a decoding mode selecting unit, adapted to select a decoding mode according to the total quantity of pulses determined by the pulse sum determining unit; and (4) a decoding unit, adapted to perform decoding in the decoding mode selected by the decoding mode selecting unit.
The decoding unit may include: (1) a distribution identifier extracting unit, adapted to extract the distribution identifier from the coding index received by the coding index receiving unit, where the distribution identifier identifies the pulse distribution about how all the pulses to be encoded on a track are distributed on the track; (2) a pulse distribution determining unit, adapted to determine the pulse distribution about how all the pulses to be encoded on a track are distributed on the track according to the distribution identifier extracted by the distribution identifier extracting unit; and (3) a pulse order reconstructing unit, adapted to reconstruct the pulse order on the track according to the pulse distribution determined by the pulse distribution determining unit.
The pulse distribution may include the information about the distribution of the positions of pulses on the track.
The pulse distribution determining unit may include: (1) a comparing unit, adapted to compare the distribution identifier with the distribution identifier corresponding to all possible distributions of the positions of the pulses on the track; and (2) an obtaining unit, adapted to obtain pulse distribution corresponding to the distribution identifier compared by the comparing unit, where each distribution identifier corresponds to the information about a possible instance of distribution.
The distribution identifier may carry information about the first index, the second index, and the third index, where: (1) the first index is adapted to identify the information about all possible distributions of the positions with a pulse on the track when the quantity of the positions with a pulse is the same; (2) the second index is adapted to identify the instance of distribution corresponding to the current position with a pulse among all possible distributions corresponding to the first index; and (3) the third index is adapted to identify the information about the quantity of pulses in each position with a pulse.
The pulse distribution may include: quantity of positions with a pulse, distribution of positions with a pulse on the track, and quantity of pulses on each position with a pulse.
The distribution identifier extracting unit may include: (1) a first extracting unit, adapted to extract the first index from the coding index; and (2) a second extracting unit, adapted to extract the second index and the third index from the coding index.
The pulse distribution determining unit may include: (1) a first determining unit, adapted to determine the quantity of positions with a pulse according to the first index; (2) a second determining unit, adapted to determine the distribution of positions with a pulse on the track according to the second index with respect to the quantity of positions with a pulse corresponding to the first index; and (3) a third determining unit, adapted to determine the quantity of pulses in each position with a pulse according to the third index.
The distribution identifier extracting unit may also extract the pulse sign identifier indicative of the positive and negative features of the pulse from the coding index when extracting the distribution identifier from the coding index. The pulse distribution determining unit may also determine the corresponding pulse sign information according to the pulse sign identifier when determining the pulse distribution according to the distribution identifier. The pulse order reconstructing unit may recover the positive or negative feature of the pulse according to the pulse sign information when reconstructing the pulse order on the track.
The coder and the decoder under the present invention are detailed below by reference to accompanying drawings.
A coder 10 is disclosed in the thirteenth embodiment of the present invention. As shown in
The coding apparatus disclosed in this embodiment is applicable to the coding methods disclosed in the first embodiment and the second embodiment.
The fourteenth embodiment provides a coder 20. As shown in
The coding selecting unit 23 is adapted to: determine the total quantity () of pulses to be encoded on the same track, and select a coding mode according to , the total quantity. In this embodiment, optional coding modes include a first coding mode and a second coding mode. Depending on the result of selecting the coding mode, the first coding module 21 is triggered to perform coding if the first coding mode is selected; or the second coding module 22 is triggered to perform coding if the second coding mode is selected.
The first coding module 21 includes a first statistic unit 211, a distribution index unit 212, and an index generating unit 213. The logical structure of such units is the same as that of the counterpart units in the 13th embodiment.
The second coding module 22 includes a second statistic unit 221, an index calculating unit 222, and an index combining unit 223.
The second statistic unit 221 is adapted to: collect the statistics of the pulses to be encoded on a track according to positions; and output the quantity (N) of positions with a pulse, the P(N) and the SU(N), where P(N) represents the distribution of each position with a pulse on the track, and SU(N) represents the quantity of pulses in each position with a pulse. When collecting the statistics of the pulse with a sign, the second statistic unit 221 also outputs the corresponding pulse sign information S(N) according to the positive or negative feature of the pulse sign of each position with a pulse.
The index calculating unit 222 includes: a first index unit 2221, a second index unit 2222, a third index unit 2223, and an index combining unit 223.
The first index unit 2221 is adapted to output the first index I1 according to the quantity (N) of the positions with a pulse. The first index I1 corresponds to all possible distributions of the positions with a pulse on the track when N is the same.
The second index unit 2222 is adapted to output the second index I2 according to distribution of the positions with a pulse on the track, where the distribution is expressed by P(N). The second index I2 indicates the instance of distribution corresponding to the distribution of the current position with a pulse among all possible distributions corresponding to the first index.
The third index unit 2223 is adapted to output the third index according to the quantity of pulses in each position with a pulse, namely, according to SU(N).
The index combining unit 223 is adapted to combine the information about the first index, the second index, and the third index to generate a coding index. If the pulse to be encoded includes a sign, the index combining unit 223 further combines the sign index information S(N) corresponding to each position with a pulse into the coding index, where the sign index indicates the pulse sign information of the position with a pulse corresponding to the sign index.
If the coding index structure is provided in the second coding mode in the third embodiment, the index combining unit 223 for coding may include: (1) a first combining unit 2231, adapted to output the second index and the third index combined into I23, namely, I23=I3×W(N)+I2, where W(N) represents the total quantity of all possible distributions of the positions with a pulse on the track, and N represents the quantity of positions with a pulse corresponding to the first index; and (2) a second combining unit 2232, adapted to: overlay the output of the first combining unit 2231 with information about other indices, and output the coding index Index().
The coding apparatus disclosed in this embodiment is applicable to the coding methods disclosed in the third embodiment and the fourth embodiment.
A coder 30 is disclosed in the fifteenth embodiment. As shown in
When the pulses are encoded according to the coding index structure provided in the fifth embodiment, the index combining unit 33 may include: (1) a first combining unit 331, adapted to output the second index and the third index combined into I23, namely, I23=I3×W(N)+I2, where W(N) represents the total quantity of all possible distributions of the positions with a pulse on the track, and N represents the quantity of positions with a pulse corresponding to the first index; and (2) a second combining unit 332, adapted to: overlay the output of the first combining unit 331 with information about other indices, and output the coding index Index().
The coding apparatus disclosed in this embodiment is applicable to the coding methods disclosed in the fifth embodiment and the sixth embodiment.
A decoder 40 is disclosed in the sixteenth embodiment. As shown in
If the pulse to be decoded includes a sign, the decoder needs to further include a sign extracting unit 45, adapted to extract the sign index SS() corresponding to each pulse from the Index() received by the receiving unit 31 according to the total quantity () of pulses to be encoded on the same track, where the sign index indicates the pulse sign information of the pulse corresponding to the sign index.
In this case, the distribution reconstructing unit 44 further recovers the positive or negative feature of the pulse sign of each pulse according to the pulse sign information indicated by the SS() extracted by the sign extracting unit 45.
The decoding apparatus disclosed in this embodiment is applicable to the decoding methods disclosed in the seventh embodiment and the eighth embodiment.
The seventeenth embodiment provides a decoder 50. As shown in
The decoding selecting unit 53 is adapted to: determine the total quantity () of pulses encoded on the same track by the received coding index Index(), and select a decoding mode according to N the total quantity. Optional decoding modes in this embodiment include a first decoding mode and a second decoding mode. Depending on the result of selecting the decoding mode, the first decoding module 51 is triggered to perform decoding if the first decoding mode is selected; or the second decoding module 52 is triggered to perform decoding if the second decoding mode is selected.
The first decoding module 51 includes a distribution extracting unit 512, a distribution decoding unit 513, a distribution reconstructing unit 514, and a sign extracting unit 515. The logical structure of such units is the same as that of the counterpart units in the 16th embodiment.
The second decoding module 52 includes: (1) a first extracting unit 521, adapted to: receive the coding index Index(), extract the first index I1 from the Index (), and determine the quantity () of positions with a pulse according to the I1; and (2) a second extracting unit 522, adapted to extract the second index I2 and the third index I3 from the coding index Index().
If the coding index structure is provided in the second coding mode in the third embodiment, the second extracting unit 522 for decoding may include: (1) a separating subunit 5221, adapted to extract the combination value I23 of the second index and the third index from the coding index; (2) a resolving subunit 5222, adapted to separate and output the second index I2 and the third index I3 in the following way:
I2=I23% W(N),I3=Int[I23/W(N)],
where W(N) represents the total quantity of all possible distributions of the positions with a pulse on the track, N represents the quantity of the positions with a pulse corresponding to the first index, % refers to taking the remainder, and Int refers to taking the integer; (a) a first decoding unit 523, adapted to determine the P(N) according to the second index I2 with respect to the quantity of the positions with a pulse corresponding to the I1, where P(N) represents the distribution of the positions with a pulse on the track; (b) a second decoding unit 524, adapted to determine the SU(N) according to the third index I3, where SU(N) represents the quantity of pulses in each position with a pulse; and (3) a pulse reconstructing unit 525, adapted to reconstruct the pulse order on the track according to the P(N) and the SU(N), where: P(N) represents distribution of the positions with a pulse on the track, and SU(N) represents the quantity of pulses in each position with a pulse.
If the pulse to be decoded includes a sign, the decoder needs to further include a third extracting unit 526, adapted to extract the sign index s(n) corresponding to each position with a pulse from the Index() according to the quantity (N) of the positions with a pulse, where the sign index indicates the pulse sign information of the position with a pulse corresponding to the sign index.
In this case, the pulse reconstructing unit 525 may include: (1) a first reconstructing unit 5251, adapted to recover the positive or negative feature of the pulse sign of each position with a pulse according to the P(N) and the s(n), where P(N) represents distribution of the positions with a pulse on the track, and s(n) represents the sign index corresponding to each position with a pulse; and (2) a second reconstructing unit 5252, adapted to reconstruct the pulse order on the track according to the distribution of the positions with a pulse and signs output by the first reconstructing unit 5251, and according to the SU(N) which represents the quantity of pulses in each position with a pulse.
The decoding apparatus disclosed in this embodiment is applicable to the decoding methods disclosed in the ninth embodiment and the 10th embodiment.
A decoder 60 is disclosed in the eighteenth embodiment. As shown in
In the case of decoding the coding index structure provided in the fifth embodiment, the second extracting unit 62 may include: (1) a separating subunit 621, adapted to extract the combination value I23 of the second index and the third index from the coding index; (2) a resolving subunit 622, adapted to separate and output the second index I2 and the third index I3 in the following way:
I2=I23% W(N),I3=Int[I23/W(N)],
where W(N) represents the total quantity of all possible distributions of the positions with a pulse on the track, N represents the quantity of positions with a pulse corresponding to the first index, % refers to taking the remainder, and Int refers to taking the integer; (a) a first decoding unit 63, adapted to determine the P(N) according to the second index I2 with respect to the quantity of the positions with a pulse corresponding to the I1, where P(N) represents the distribution of the positions with a pulse on the track; (b) a second decoding unit 64, adapted to determine the SU(N) according to the third index I3, where SU(N) represents the quantity of pulses in each position with a pulse; and (3) a pulse reconstructing unit 65, adapted to reconstruct the pulse order on the track according to the P(N) and the SU(N), where: P(N) represents distribution of the positions with a pulse on the track, and SU(N) represents the quantity of pulses in each position with a pulse.
If the pulse to be decoded includes a sign, the decoder needs to further include a third extracting unit 66, adapted to extract the sign index s(n) corresponding to each position with a pulse from the Index() according to the quantity (N) of the positions with a pulse, where the sign index indicates the pulse sign information of the position with a pulse corresponding to the sign index.
In this case, the pulse reconstructing unit 65 may include: (1) a first reconstructing unit 651, adapted to recover the positive or negative feature of the pulse sign of each position with a pulse according to the P(N) and the s(n), where: P(N) represents distribution of the positions with a pulse on the track, and s(n) represents the sign index corresponding to each position with a pulse; and (2) a second reconstructing unit 652, adapted to reconstruct the pulse order on the track according to the distribution of the positions with a pulse and signs output by the first reconstructing unit 651, and according to the SU(N) which represents the quantity of pulses in each position with a pulse.
The decoding apparatus disclosed in this embodiment is applicable to the decoding methods disclosed in the eleventh embodiment and the twelfth embodiment.
In order to further clarify the foregoing embodiments, coding and decoding examples are given below, where the coding is based on the coding method in the third embodiment (the first coding mode is based on the calculation method in the second embodiment, and the second coding mode is based on the calculation method in the fourth embodiment), and the decoding is based on the decoding method in the ninth embodiment (the first decoding mode is based on the calculation method in the eighth embodiment, and the second decoding mode is based on the calculation method in the 10th embodiment), supposing that the selection condition of the first coding/decoding mode is: =3, 4, 5; and the total quantity of positions on the track is M=16.
Coding and decoding for pulse search results.
A. Coding
B. Decoding
Coding and decoding for pulse search results.
A. Coding
B. Decoding
The foregoing embodiments reveal that: The pulses to be encoded are ordered according to the distribution of the positions of the pulses on the track before coding, thus simplifying the calculation; because the coding is performed according to the order, all pulse distributions correspond to continuous coding, thus minimizing the coding redundancy and saving the coding bits. Further, the first coding/decoding mode is integrated with the second coding/decoding mode under the present invention. Therefore, the merits of the two coding modes with different N values complement each other, and the merits are more noticeable.
More coding and decoding examples are given below, where the coding is based on the coding method in the second embodiment and the decoding is based on the decoding method in the fourth embodiment, supposing that the total quantity of positions on the track is M=16.
Coding and decoding for pulse search results.
A. Coding
B. Decoding
Coding and decoding for pulse search results.
A. Coding
B. Decoding
According to P(2)={2, 4}, SU(2)={2, 4}, s(0)=0, and s(1)=0, it is determined that 2 positive pulses exist in position 2; and 4 positive pulses exist in position 4. The decoding process is completed.
The foregoing embodiments reveal that: The pulses to be encoded are combined according to positions, and the quantity of positions with a pulse, distribution of the positions with a pulse on the track, and the quantity of pulses in each position with a pulse are encoded. To any quantity of pulses to be encoded, the coding method under the present invention is uniformly applicable, thus avoiding increase of the coding index redundancy caused in the recursive mode, and ensuring a high utilization ratio of the coding bits. Meanwhile, it is not necessary to encode multiple pulses in the same position separately. Instead, the positions of pulses are merged before coding, thus saving coding bits. With the increase of the pulses to be encoded on the track, the probability of overlaying pulse positions also increases, and the merits of the embodiments of the prevent invention are more noticeable.
Detailed above are a coding method, a decoding, a coder, and a decoder under the present invention. Although the invention is described through some exemplary embodiments, the invention is not limited to such embodiments. It is apparent that those skilled in the art can make various modifications and variations to the invention without departing from the spirit and scope of the invention. The invention is intended to cover the modifications and variations provided that they fall in the scope of protection defined by the following claims or their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
2007 1 0103023 | Apr 2007 | CN | national |
2007 1 0153952 | Sep 2007 | CN | national |
This application is a continuation of U.S. patent application Ser. No. 14/617,585, filed on Feb. 9, 2015, which is a continuation of U.S. patent application Ser. No. 13/622,207, filed on Sep. 18, 2012, now U.S. Pat. No. 8,988,256. The U.S. Pat. No. 8,988,256 is a continuation of U.S. patent application Ser. No. 12/607,723, filed on Oct. 28, 2009, now U.S. Pat. No. 8,294,602, which is a continuation of International Patent Application No. PCT/CN2008/070841, filed on Apr. 29, 2008. The International Patent Application No. PCT/CN2008/070841 claims priority to Chinese Patent Application No. 200710103023.5, filed on Apr. 29, 2007, and Chinese Patent Application No. 200710153952.7, filed on Sep. 15, 2007. The aforementioned patent applications are hereby incorporated by reference in their entireties.
Number | Name | Date | Kind |
---|---|---|---|
6236960 | Peng | May 2001 | B1 |
6847929 | Bernard | Jan 2005 | B2 |
8294602 | Ma | Oct 2012 | B2 |
8988256 | Ma | Mar 2015 | B2 |
9225354 | Ma | Dec 2015 | B2 |
20020111799 | Bernard | Aug 2002 | A1 |
20050065785 | Bessette | Mar 2005 | A1 |
20060116872 | Byun | Jun 2006 | A1 |
20070124138 | Lamblin et al. | May 2007 | A1 |
20070124381 | Zurko | May 2007 | A1 |
20090248406 | Zhang | Oct 2009 | A1 |
20100049511 | Ma | Feb 2010 | A1 |
20130021177 | Ma | Jan 2013 | A1 |
Number | Date | Country |
---|---|---|
1395724 | Feb 2003 | CN |
1811917 | Aug 2006 | CN |
1890713 | Jan 2007 | CN |
101295506 | Oct 2008 | CN |
H11296195 | Oct 1999 | JP |
2003506764 | Feb 2003 | JP |
2004120623 | Apr 2004 | JP |
2005062453 | Mar 2005 | JP |
2007515676 | Jun 2007 | JP |
2008533522 | Aug 2008 | JP |
5221642 | Jun 2013 | JP |
2005066936 | Jul 2005 | WO |
2006096099 | Sep 2006 | WO |
Entry |
---|
Andy C. Hung et al: “Error resilient pyramid vector quantization for image compression”, Standford University, XP010145906, 1994, total 6 pages. |
Udar Mittal et al: “Coding unconstrained FCB excitation using combinatorial and huffman codes”, IEEE workshop proceedings, XP010647236, Oct. 2002, total 3 pages. |
“Digital cellular telecommunications system (Phase 2+); Universal Mobile Telecommunications System (UMTS); Mandatory Speech Codec speech processing functions AMR Wideband speech codec; Transcoding functions (3GPP TS 26.190 version 6.0.0 Release 6)”, ETSI TS 126 190 V6.0.0, 3rd Generation Partnership Project, XP014027744, Dec. 2004, total 56 pages. |
“3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Speech codec speech processing functions; Adaptive Multi-Rate—Wideband (AMR-WB) speech codec;Transcoding functions (Release 6)”, 3GPP TS 26.190 V6.0.0, 3rd Generation Partnership Project, Dec. 2004, total 54 pages. |
D. Guerchi et al :“Multi-Track Codebook in Low-Rate Celp Coding”, Industrial Electronics, IEEE International Symposium on Industrial Electronics, Jul. 2006, total 6 pages. |
“3rd Generation Partnership Project; Technical Specification Group Service and System Aspects; Audio codec processing functions; Extended Adaptive Multi-Rate—Wideband (AMR-WB+) codec; Transcoding functions (Release 7)”, 3GPP TS 26.290 V7.0.0, 3rd Generation Partnership Project, Mar. 2007, total 85 pages. |
Number | Date | Country | |
---|---|---|---|
20160105198 A1 | Apr 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 14617585 | Feb 2015 | US |
Child | 14974171 | US | |
Parent | 13622207 | Sep 2012 | US |
Child | 14617585 | US | |
Parent | 12607723 | Oct 2009 | US |
Child | 13622207 | US | |
Parent | PCT/CN2008/070841 | Apr 2008 | US |
Child | 12607723 | US |