The present invention relates to the encoding/decoding of digital signals such as audio, video, and more generally multimedia signals, for their storage or transmission. It proposes in particulars solution to the problems raised by the encoding and decoding of permutation codes.
Typically, the present invention also applies to the counterpart of the source encoding: the channel encoding or “modulation”.
Compression encoding/decoding of digital signals in the sense of the invention can be very useful for the quantization of the transform coefficients of speech and/or audio signal frequency encoders.
Vector Quantization
One very widely-used digital signal compression solution is vector quantization. Vector quantization represents an input vector by a vector of the same dimension selected from a finite set. A quantizer with M levels (or “code vectors”) is a non-bijective application of the set of input vectors, generally the n-dimensional real euclidian space n or a subset of n in a finite subset Y of n with M separate elements: Y={y0, y1, . . . yM−1}.
Y is called reproduction alphabet or “dictionary” or even “directory”, and its code vector elements are called “code words” (or “output points” or even “representatives”).
The bit rate for each dimension r of the quantizer (or its “resolution”) is defined by the relation:
In vector quantization, a block of n samples is treated as an n-dimensional vector. According to the theory of source encoding when the dimension becomes very large, the performance of the vector quantization approaches the source distortion bit rate limit. The vector quantizer dictionaries can be designed using statistical methods such as the Generalized Lloyd Algorithm (GLA), based on the necessary conditions of optimality of a vector quantizer. The duly obtained statistical vector quantizers have no structure, which makes their exploration costly in terms of calculation and storage resources, because the complexity of both encoding and storage is proportional to n2nr.
Referring to
In modulation, the three operations OP1, OP2 and OP3 of
The exponential increase in the complexity, according to the dimension, the vectors and the bit rate, limits the use of non-structural vector quantizers with small dimensions and/or low bit rates to be able to use them in real time. For a non-structured vector quantizer, the search for the closest neighbor (operation OP1) requires an exhaustive search among all the elements of the dictionary to select the dictionary element that minimizes a distance measurement between itself and the input vector. The latter two operations (indexing OP2 and reverse operation OP3) are generally performed by simple readings of tables, that are nevertheless costly in memory space. To overcome the size and dimension constraints, several variants of the basic vector quantization have been studied. They attempt to remedy the absence of dictionary structure and so reduce the complexity, but at the cost of quality. However, the performance/complexity trade-off is enhanced, which makes it possible to increase the range of the resolutions and/or dimensions to which the vector quantization can be applied cost-effectively. Many structured vector quantizer schemes have been proposed, and, in particular, the vector quantizer that implements a “permutation code” described below.
Permutation Codes
In the “permutation code” vector quantizer, the code vectors are obtained by permutations of the components of the first code vector (in lexicographic order) called “leader” (or even “leader vector”). These components take their values from an alphabet A={a0, a1, . . . , aq−1} of size q (q-ary alphabet A such that ai≠aj for I≠j). The components ai are real (or whole) numbers. The weight wi (where i is an index ranging from 0 to q−1) is the number of repetitions of the letter ai of the alphabet. The weights are positive integer numbers such that
By convention, the values of the alphabet satisfy a0>a1> . . . >aq−1. The n components of the leader go in descending order from the position 0 to the position (n−1). The leader vector y0 is therefore a vector of dimension n of the form:
It will be understood that a different order of the components could be chosen, for example a0<a1< . . . <aq−1.
The leader vector is called “signed leader” and the permutation code is said to be “of type I”. The other code vectors are obtained by permutations of the components of y0. The total number M of permutations is:
There is another type of permutation code (type II). The leader vector again has the same form as previously, but its components must be positive (a0>a1> . . . >aq−1≧0). The other code vectors are also obtained by to permutations of the components of y0 by assigning them all the possible sign combinations. The total number M of permutations is:
with h=n if aq−1>0 and h=n−wq−1 otherwise.
In this case, the leader vector is also called absolute leader.
The “permutation code” vector quantizer has been extended to the compounding (or union) of permutation codes and recently, this permutation code union structure has been generalized to variable dimension and resolution vector quantization (in document WO-04/00219 in the name of the Applicant). The permutation codes are not only used in statistical vector quantization. They are also found in algebraic vector quantization, which uses highly structured dictionaries, deriving from regular arrays of dots or error correcting codes. The permutation codes are also employed in modulation.
The use of the permutation code structure allows for the development of optimal and rapid nearest neighbor, search algorithms (operation OP1 in
There are several ways of enumerating the permutations. The Schalkwijk algorithm is one of these ways:
“An algorithm for source coding”, by Schalkwijk J. P. M, in IEEE Trans. on Information Theory, vol. IT-18, No 3, pp 395-399, May 1972.
Using combinatorial analysis, these techniques make it possible to index a code vector of a permutation code (operation OP2) and also perform the reverse index decoding operation (operation OP3). Among the permutation indexing algorithms, the commonly-used Schalkwijk algorithm is reviewed below, for example in the standards:
This involves ordering and indexing all the possible permutations of the components of the vector y=(y0, y1 . . . , yn−1). The order is lexicographic and the index is here called “rank”. Calculating the rank of the vector y involves calculating the rank of the vector D=(d0, d1 . . . , dn−1) associated with y, and such that dk has an index value d if, and only if, yk=ad.
For example, a vector y, of dimension n=8, has the following components:
The alphabet with q=3 letters (components of different values) is given by A={4, 2, 0} with a0=4, a1=2 and a2=0.
The vector y then has associated with it the vector D=(0, 1, 0, 2, 2, 0, 0, 1), the components of which are simply the indices of the q letters of the alphabet A.
The ranks of y and D are the same, but the definition of the vector D reduces the operation to the calculation of the rank of a sequence D having its values in a set {0, 1, . . . , q−1} (comprising the same number of elements as the alphabet {a0, a1, . . . , aq−1}).
The weights of the vectors y and D are the same since the occurrences of their respective components are the same. The intermediate weight (wk0, wk1, . . . , wkq−1) is also defined as the weight of the vector of components (yk, yk+1 . . . , yn−1) and which therefore corresponds to vector y truncated to leave the positions k to n−1. Thus:
where δ(x, y) is the Kronecker operator (δ(x, y)=1 if x=y and 0 otherwise).
The following applies: (w00, w01, . . . , w0q−1)=(w0, w1, . . . , wq−1).
The rank t of the vector y can be calculated, by combinatorial analysis, by the formula:
This formula can be simplified as follows:
It is this latter formulation that is often used and that will therefore be adopted here. Hereinafter, Ikd
Decoding the Rank (Operation OP3): Determining the Permutation from its Index
Decoding the rank t entails retrieving the vector D=(d0, d1, . . . , dn−1) associated with y. One method of sequentially searching for the dk is described below. The component d0 is determined first, followed by the component d1, . . . to the component dn−1.
Determination of d0:
d0 is found by using the formula comprising the inequalities:
The terms (n−1)!, t,
and the values w0d for d=0, . . . , q−1 are known. To find the value of d0, d0=0 is used as the starting point, and d0 is successively incremented by 1 until the formula (8) is satisfied. By writing the formula (8) with the partial ranks, the value d0 is such that:
Determination of d1:
To find d1, the following relation is used:
The values of w1d for d=0, . . . , q−1 are deduced from those of w0d as follows:
We come back to the same problem as the determination of the component d0.
To find the value of d1, d1=0 is used as the starting point and d1 is successively incremented by 1 until the inequality (9) is satisfied (I1d
Determination of the Other dk:
The calculation of the subsequent dk is deduced from the cases described above. To find the value of dk, dk=0 is used as the starting point and dk is successively incremented by 1 until the inequality
is satisfied.
Once the vector D=(d0, . . . , dn−1) is decoded, the vector y is deduced from it by simple alphabet transposition.
The indexing of a permutation code and the reverse operation are complex operations. The algorithms used apply combinatorial analysis. The indexing of the combinations and the reverse operation require divisions of factorial products.
Complexity of Division
Despite the advances made in integrated circuitry and signal processing processors, division remains a complex operation. Typically, the division of an integer number represented on 16 bits by an integer number represented on 16 bits costs 18 times more than their multiplication. The weight of the division of a 32-bit integer number by a 16-bit integer number is 32 whereas the weight of their multiplication is 5.
Framing of the Variables
The cost of division is not the only problem. The framing of the variables is another, as the table 1 below illustrates.
Only the factorials of the integer numbers less than or equal to 8 can be represented on whole words of 16 bits. For numbers greater than 12, the representation of the factorials on whole words of 32 bits is no longer possible.
Furthermore, the complexity of the operations also increases with the number of bits used to represent the variables. Thus, the division by a 16-bit integer of a 32-bit integer is almost twice as complex (weight 32) as the division of a 16-bit integer (weight 18).
Solutions for reducing the complexity of the operation OP1 have been proposed. The problem of the complexity of the operations OP2 and OP3 is not very well covered. However, notably in the Applicant's TDAC coder/decoder, or in the 3GPP-AMR-WB+ encoder, simplifications have been made to the encoding and decoding algorithms based on the Schalkwijk formula.
Simplifications of the Enumeration of the Permutations and of the Reverse Operation (Operations OP2 and OP3)
The calculation of the rank t of a permutation and the reverse operation have been accelerated by virtue of simplifications described below made to the calculation of the n terms Ikd
The first two are used to reduce the complexity in both encoding and decoding. The third is used in the encoding and the last two in the decoding.
The encoding technique is illustrated by
Referring to
The subsequent general preliminary step EP-3 is a calculation of the vector D=(d0, . . . , dn−1) as follows:
Referring to
The step CA-1 is an initialization of the rank t at 0, of the variable P at 1 (denominator used in the calculation of the rank in the step CA-13) and of the q weights w0, . . . , wq−1 at 0.
Here, the value of k is decremented from n−1 to 0 (step CA-2 for initialization of k at n−1, end test CA-14 and decrementation CA-15 otherwise). The benefit of such a selection is described later. The components dk of the vector D obtained in the preliminary step EP-3 are then used, and, for a current k, d=dk is set (step CA-3). The associated weight wd (wd=wd+1 in the step CA-4) is updated to estimate the term P (P=P×wd in the step CA-5). The sum S used on the numerator in the calculation of the rank corresponding to the step CA-13 is initialized at 0 (step CA-6) and a loop is then run on the index i of the weights wi (end test CA-9 and incrementation CA-10 otherwise, to d−1) to update the sum S (S=S+wi in the step CA-8). Before calculating the rank t in the step CA-13, a check is made to ensure that the sum S is not zero (test CA-11). The benefit of this implementation is described later.
The calculation of the rank t (step CA-13) involves the factorial term (n−k−1)! as follows:
t=t+(S/P)(n−k−1)!
Instead of calculating the term (n−k−1)! on each update of the rank t, preference is given to pre-recording these values in memory and using a simple memory access (step CA-12) to obtain the current value of (n−k−1)!
Thus, some of the advantages of the processing illustrated in
Storage of the Factorials
To avoid calculating the terms (n−1−k)! and wki! in real time, the values of the n+1 factorials (0!, l!, . . . , n!) are pre-calculated and stored. If the dimension n varies in a limited way (n≦nmax), the values 0!, l!, . . . , nmax! are precalculated and stored.
Test on the Cumulation of the Intermediate Weight to Avoid Division
There is no point in calculating
if the term
is zero. Now, this term is often zero, particularly for the last positions (k close to n−1). By adding a test as to the zero nature of this term (test CA-11 of
Reversal of the Loop on the Positions in Encoding
The weights wkd (with d=0, 1, . . . , q−1) are deduced from the weights wk+1d by incrementing wk+1d
with only an incrementation and a multiplication for each iteration, or:
It is also possible to process the last two positions separately (k=n−1 and k=n−2). In effect,
therefore In−1d
and (n−1−k)!=1!, therefore
therefore In−2d
Other advantageous implementation details, described below, can also be provided for.
Elimination of a Division in Decoding
In order to avoid divisions in decoding when searching for d0, the inequality (8) can be reformulated in the form:
Similarly, the divisions are eliminated from the search for d1 by reformulating the inequality (9) in the form:
It should be noted that, while it is thus possible to eliminate the divisions on searching for the dk (0≦k≦n−1), it is still necessary to perform (n−1) divisions to calculate the Ikd
Test on the Weights on Decoding
In the last positions, for certain values of d, wkd=0 (for the w0d components of value d occupying positions preceding the position k). It is therefore pointless calculating the terms of the inequalities (8) and (9) for these values of d.
The problem of the framing of the variables has been tackled in the Applicant's TDAC encoder.
A first solution was to distinguish the processing operations on the dimensions greater than 12 from those on the smaller dimensions. For the small dimensions (n<12), the calculations are performed on unsigned integers on 32 bits. For the greater dimensions, double-precision floating variables are used at the cost of an increase in the calculation complexity (floating double precision operations are more expensive than their integer precision equivalents) and in the required memory capacity.
Furthermore, if the maximum precision is limited to unsigned 32-bit integers (implementation by a fixed-point processor), the integer factorials greater than 12 cannot be prestored directly and the vectors of dimension greater than 12 must be coded separately. To resolve this problem, a more sophisticated solution uses a pseudo-floating-point representation by mantissa and exponent of the factorials n! in the form 2j×r. This decomposition is detailed in table 2 below. The storage of n! (for n less than or equal to 16) is reduced to storing r with a precision of 30 bits maximum as well as the exponent j which corresponds to a simple bit offset.
Thus, the techniques of the prior art, for the most part, do not solve the problem of the framing of the limited precision, in particular fixed point, variables. The implementation in the TDAC encoder, although it settles the framing problem, does not avoid the costly divisions of two integers. Furthermore, for high dimensions, the intermediate calculations (for example, the numerator and the denominator of the partial ranks Ikd
The enumeration techniques other than the Schalkwijk technique suffer from the same problems. Given that they also use combinatorial analysis, they entail the calculation of products of factorials and their divisions.
The present invention aims to improve the situation.
To this end, it firstly proposes a method of encoding/decoding digital signals, using permutation codes involving a calculation of combinatorial expressions, in which these combinatorial expressions are represented by prime factor power decompositions, and determined by a memory reading of pre-recorded representations of decompositions of selected integer numbers.
The present invention then provides an effective solution to the problems associated with both the indexing of a permutation code and the reverse operation. It at the same time resolves the two problems that are the framing of the variables and the divisions.
In effect, in an advantageous implementation, the pre-recorded representations comprise values representative of exponents, respectively stored correlated with values representative of successive prime numbers, for each of said selected integers.
The problem associated with the framing of the variables in the prior art is thus already resolved.
This problem of framing of the variables is all the more acute when it comes to manipulating factorial terms.
In an advantageous implementation, to manipulate the combinatorial expressions when they comprise integer factorial values, the pre-recorded representations comprise at least representations of decompositions of factorial values.
This implementation then makes it possible to lift the constraint of framing of the variables and, from that, push back the limits usually set as to the dimension n of the permutation codes concerned.
According to another advantageous characteristic, at least one of said combinatorial expressions comprises a quotient of an integer number numerator by an integer denominator, and this quotient is represented by a prime factor power decomposition, of which each power is a difference of exponents respectively associated with the numerator and the denominator and assigned to one and the same prime number.
The problem associated with the calculation of divisions of the prior art is thus resolved, by replacing this calculation with a simple subtraction calculation.
In a first embodiment, provision is made for a memory addressing to retrieve a pre-recorded decomposition of one of the abovementioned selected integers. To this end, the pre-recorded representation of a selected integer number is stored in an addressable memory, an addressing of said memory giving a succession of exponents to be assigned to respective prime numbers to recompose the selected integer number.
Preferably, the pre-recorded representation of a selected integer number will be stored in the form of a succession of addresses, each giving, for a prime number, an exponent to be assigned to this prime number to recompose the selected integer number.
This implementation according to the first embodiment will be designated hereinafter “exploded representation of the decompositions”.
As a variant, in a second embodiment, the pre-recorded representations are stored in the form of words comprising a succession of groups of bits, each group having:
Preferably, the prime factor powers are then determined by successively applying at least one partial mask to the word of bits, with successive offsets according to the weights of the bits and readings of the remaining bits.
This implementation according to the second embodiment will hereinafter be designated “compact representation of the decompositions”.
The same method process for the calculation of a combinatorial expression, can generally be conducted according to the following steps:
Regarding the calculation of a product to be performed recurrently and involving a new term on each recurrence, it may be advantageous to temporarily store the decomposition of a product calculation performed for a prior recurrence. Thus, if the method comprises a recurrent step for calculation of a product involving on each recurrence a term being multiplied with a product determined on a prior recurrence:
Similarly, if the method comprises a recurrent step for calculation of a division involving on each recurrence a term dividing a quotient determined on a prior recurrence:
This temporary storage of the intermediate decompositions of the recurrently calculated products and/or quotients is particularly advantageous in determining recurrent partial ranks, the cumulation of which is representative of a rank of a permutation.
Thus, in an advantageous implementation of the invention, the permutation codes involve the calculation of a quantity representative of a rank of a permutation comprising a cumulation of partial ranks, each partial rank then corresponding to one of said combinatorial expressions.
The calculation of the rank of a permutation can then be used, when encoding vector-quantized digital signals, to index the permutations of the components of a leader vector (operation OP2 in
Similarly, when decoding vector-quantized digital signals, the estimation of a rank of a permutation takes place given that, from a given value of a permutation rank:
In an exemplary implementation, this condition of proximity is satisfied if the given value of the rank can be bracketed by cumulations of partial ranks up to the partial rank associated with the presumed component, on the one hand, and up to the partial rank associated with a component corresponding to an incrementation of the presumed component, on the other hand.
This condition of proximity can therefore correspond to a general formulation of the inequalities (8) described above in the case of a Schalkwijk enumeration.
Thus, the present invention can advantageously be applied to the source encoding/decoding with vector quantization in the sense of
However, the encoding/decoding can also be of the channel modulation encoding/decoding type, in the sense of
The calculation of a partial rank involves terms (in a product or a quotient) which, as a general rule as will be seen later, remain less than or equal to the maximum dimension n of the permutation codes. Thus, in a advantageous implementation, the selected integer numbers with pre-recorded decompositions comprise at least:
In a particular optional implementation, the selected integer numbers can also comprise the value 0.
Thus, if the permutation code uses a Schalkwijk enumeration, a partial rank Ikd
represents a product for an integer index i ranging from 0 to m,
represents a sum for an index i ranging from 0 to m,
In this case, the selected integer numbers with pre-recorded decompositions and that are then to be identified in the expression of the partial rank Ikd
each term wki being between 0 and n, and
each between 1 and n−1, for all the integers k between 0 and n−1.
Still in the particular case of the Schalkwijk enumeration, the temporary storage of the intermediate decompositions is advantageously applied as to follows: the sum of the exponents in the decomposition of the term
is temporarily stored in memory for a preceding index k, to add to it or subtract from it the exponents of the decomposition of a term wki for a current index k.
Other characteristics and advantages of the invention will become apparent from studying the detailed description below, and the appended drawings in which, in addition to
As a reminder, and in particular with reference to
These operations will be designated thus with reference to the source encoding/decoding with vector quantization. It should be remembered that these operations can also be conducted in channel encoding/decoding, in modulation.
To illustrate straight away the principle of the invention, the prime number power factorization is described below.
The decomposition of a non-zero positive integer number K, in prime number powers, is expressed thus:
pi being an ith prime number (p0=1, p1=2, p2=3, p3=5, p4=7, p5=11, p6=13, p7=17, etc.).
The exponent of pi is denoted eKi in the decomposition of the integer number K and mK denotes the index of the greatest prime factor involved in the decomposition of K with non-zero exponent.
For example, the number K=120 (or 5!) is expressed:
120=1·23·31·51 and mk=3 here since the greatest factor “5” is of index 3 (p3=5). The following therefore applies: e5!0=1, e5!1=3, e5!2=1 and e5!3=1.
In practice, the number “1” being the neutral element of the multiplication, p0 can be eliminated from the decomposition, i.e.:
Of course, K=0 cannot be decomposed in prime factor powers.
The decomposition in products of prime number powers of the positive integers less than or equal to 16 is given in table 3a and the decomposition of their factorials is given in table 3b. This decomposition involves six prime numbers (2, 3, 5, 7, 11 and 13). Since the columns are indexed by the prime number pi and the rows by n, the cell of table 3a (respectively 3b) at the intersection of column pi and row n is the exponent eni (respectively en!i) of the prime number pi in the prime number power product decomposition of the number n (respectively n!).
For any positive integer number n>1, the number mn! of prime factors of n! is such that: pm
As table 3a shows, many exponents of the decomposition of a number n are zero. In the last column of table 3a, the number m′n of non-zero exponents in the decomposition of n has been noted. The absence of decomposition (and therefore of exponents) for n=0 is indicated in the row n=0 of table 3a by the “-” symbol.
There follows a description of the application of such a decomposition to the calculation of a partial rank of a permutation code, in the case of the Schalkwijk formula initially, then in the general case.
It should be remembered that the partial rank, denoted Ikd
such that three terms can be decomposed in prime number powers. These are the terms:
From the exponents of the decompositions of (n−1−k)!, of Pk and of Sk, the exponents of the decompositions of Ikd
In effect, the exponent
of a prime factor pi in the decomposition of Ikd
is equal to the sum of the exponents of pi of the first two terms (numerator of Ikd
Thus, with reference to
according to the relation (10) hereinabove (step G-3), this exponent being specific to the decomposition of the intermediate rank Ikd
Finally, the rank t of the permutation is obtained, in encoding in the step G-7, or, in decoding (broken lines in
In the general case, and independently of the Schalkwijk enumeration, if a partial rank t′ (t′>0) of a permutation is in the form of a numerator of Nt′ terms νj (1≦j≦Nt′) and of a denominator of Dt′ terms ρj (1≦j≦Dt′), such that:
then the exponents et′i of the decomposition of the partial rank t′ are determined from the intermediate decompositions that are the decompositions of the Nt′νj and of the Dt′ρj, which is expressed:
The decomposition in prime number factors will also be used hereinafter for a formulation in products of quotients of integer numbers of a partial rank t′.
Also as a general rule, if
To return to the particular case of the Schalkwijk enumeration, to then calculate a partial rank Ikd
the partial rank Ikd
It should be indicated here that the terms (n−1−k)! and Pk are strictly positive integers but the term Sk can be zero and therefore not decomposable. In this case, the partial rank Ikd
More generally, if
It will then be remembered that the prime factor factorization of the terms that make up a partial rank makes it possible to eliminate the divisions by replacing them with multiplications of prime factor powers, with, in particular, simple additions and subtractions of the exponents associated with these prime numbers.
Thus, in the sense of the present invention, the following steps are provided, from a limited number of prime factor decompositions of integer numbers, stored in memory (hereinafter called “basic decompositions”):
appearing in the rank of a permutation (hereinafter called “intermediate decompositions”), in the sense of the step G-2 in
in the sense of the step G-3 in
in the sense of the step G-4 in
Of course, the basic decompositions to be stored are preferably the subject of an advantageous selection. In a preferred but non-limiting implementation, the basic decompositions to be stored will be selected according to the maximum dimension of the permutation codes concerned (this maximum dimension being denoted n). Thus, the basic decompositions are preferably:
It is then possible to identify a basic decomposition by a number m giving:
Examples of this implementation will be described later with reference to tables 4a to 4d, in the context of a so-called “exploded” representation of the decompositions. It should be pointed out that a so-called “compact” representation, described in detail later, consists in storing a single word, the bits of which give all the exponents involved in a decomposition.
It is then possible to define different sets of basic decompositions, and the procedures for representing and storing these basic decompositions.
Moreover, the selection of the terms for which the intermediate decompositions are determined, and the determination proper of these intermediate decompositions, are the subject of advantageous implementations that will be described later. The decomposition of a partial rank and the calculation of a partial rank from its decomposition are also the subject of advantageous implementations described later.
There now follows a description of the selection of the basic decompositions to be stored.
Generally, and independently of the technique for enumerating a permutation code of dimension n, the calculation of the rank of a permutation uses the integer numbers l (0≦l≦n) and above all their factorials l! (0≦l≦n). In a preferred implementation, the basic decompositions are the decompositions of the factorials of l! (0≦l≦n) and of l (1≦l≦n) where n is the maximum dimension of the permutation codes concerned, as indicated hereinabove. (2n+1) basic decompositions are therefore provided in this preferred implementation.
Nevertheless, other implementations are possible.
For example, only (n+1) basic decompositions, namely those of l (1≦l≦n) and of 0! need be provided. Thus, if a decomposition of l! (l>0) is necessary to the calculation of a partial rank, it is calculated in the step for determining the intermediate decompositions from the 1 basic decompositions of j (1≦j≦l) with
Conversely, only the (n+1) decompositions of l! (0≦l≦n) need be provided. If a decomposition of l (l>0) is necessary to the calculation of a partial rank, it is calculated in the step for determining the intermediate decompositions from the two basic decompositions of base of l! and (l−1)! and from the relation:
e
l
i
=e
l!
i
−e
(l-1)!
i
It will therefore be understood that the selection of the set of basic decompositions can advantageously result in a trade-off between minimizing the memory needed to store the representations of these basic decompositions and minimizing the complexity of the step for determining the intermediate decompositions.
Below is a description of a representation of the decompositions in the sense of the invention.
As indicated above, a decomposition (whatever the partial rank, intermediate or basic) is defined by a number m giving the number of prime factors to be considered, these m prime factors and their respective exponents. Different solutions are proposed below for representing the decompositions and storing the data for the basic decompositions.
Exploded Representation of the Exponents
Representation of the Factorials l! (0≦l≦n)
The number ml! of prime factors involved in the decomposition of the value l! increases with the number l. A first solution for representing the decomposition of l! (0≦l≦n) consists in storing for each value of l (0≦l≦n) the number ml! and the ml! exponents of the powers of pi (1≦i≦ml!). It will be noted that the ml! exponents of l! are non-zero.
In a more advantageous variant, the set of the basic decompositions shares the same number mn! of prime factors and mn! exponents are stored for each basic decomposition, the exponents of the basic decomposition of l! of index greater than ml! being zero. This solution makes it possible to make use of a table of the exponents by providing a regular addressing of this table. However, such an implementation requires a considerable memory size. This table comprises mn!×(n+1) values and the exponent el!i is stored at the address (mn!, l+(i−1)) of this table, in which the notation (x, y) targets the cell of this table on the row x and in the column y. Of course, it will be understood that other conventions can be considered. Thus, instead of considering a two-dimensional table with m columns and N lines with therefore comprises m×N cells (or elements), it is possible to consider a one-dimensional table having m×N cells, the cell at the address (x, y) of the two-dimensional table then being located at the address m×x+y of the one-dimensional table. The exponent el!i stored at the address (l,(i−1)) of the two-dimensional table is then stored at the address (mn!×l+(i−1)) of the one-dimensional table. For example, the exponents of the decompositions of the factorials of the numbers 0 to 8 can be stored in the two-dimensional table comprising 36 cells, consisting of 4 columns (columns pi=2, 3, 5, 7) of table 3b and 9 rows (rows n=0, . . . 8). These same exponents can be stored in a single-dimensional table Dl! with 36 cells, given hereinbelow (appendix A-11). The cell at the address (x, y) of the first table being equal to the cell at the address Dl!: 4×x+y.
In addition, provision can be made to store the (n+1) values of ml! to be able to reduce the calculations of the intermediate decompositions using the basic decomposition of l!.
Representation of the Integers l (1≦l≦n)
To represent the basic decomposition l (1≦l≦n), several solutions can also be provided for. A first solution is to store, for each value 1, the number ml and the ml exponents of the powers of pi (1≦i≦ml) of l. In a variant, it may be preferred to store as many exponents as for l! (ml! or mn!) exponents. The basic decompositions of l and l! then share the same number m.
In another variant, the fact that the number m′l of non-zero exponents of the decomposition of l is low can be exploited. For example, it was apparent in table 3a that this number m′l was at most 2 (for l≦16). Thus, it is possible to store only this number and the corresponding values pi or the indices i.
However, it is also necessary to provide for storing the indices i of these prime factors of non-zero power because they are no longer implicitly recognized by the address of the corresponding exponent in the table.
Representation of a Decomposition Other than a Basic Decomposition
The representation of an intermediate decomposition depends on the representation of the basic decompositions from which it is determined. Similarly, the representation of a decomposition of a partial rank depends on the representation of the intermediate decompositions from which it is determined.
Storage of the Basic Decompositions
Typically, four possible storage solutions can be illustrated by way of example by tables 4a to 4d hereinbelow for a permutation code of dimension 8 (n=8) in which four (m8!=4) prime numbers (2, 3, 5 and 7) are considered. These examples can be applied to the 3GPP AMR-WB+ encoder (standards [3GPPTS26.273] and [3GPPTS26.304]). This encoder uses an algebraic vector quantization, the dictionary of which is a union of the permutation codes of the Gosset array RE8 of dimension 8.
The first three solutions (tables 4a-4c) represent and store the basic decompositions of l! in the same way. In effect, a storage of ml! and of the ml! exponents of the powers of pi (1≦i≦ml!) of l! is provided. They differ in the representation and the storage of the basic decompositions of l. Table 4a shows a first solution targeting the storage of ml and of the ml exponents of the powers of pi (1≦i≦ml!) of l. Table 4b shows a second solution targeting the storage of the ml! exponents of the powers of pi (1≦i≦ml!) of l.
Table 4c below shows a third solution targeting the storage of the number m′l of non-zero exponents of the powers of pi of l, of the corresponding indices i and their exponents. In the table represented, for greater clarity, it is the prime factors pi that are indicated.
In a fourth solution (illustrated by table 4d below), the set of the basic decompositions is represented by the number mn! and, for each basic decomposition (l or l!), mn! exponents are stored. Table 4d is extracted from the four columns (pi=2, 3, 5 and 7) and the 9 rows (n=0 to 8) of tables 3a and 3b given previously.
In the TDAC encoder which uses a statistical vector quantization with variable dimension and resolution, of maximum dimension 15, six (m15!=6) prime numbers are considered: 2, 3, 5, 7, 11 and 13. The 6 columns (pi=2, 3, 5, 7, 11 and 13) and the 16 rows (n=0 to 15) of tables 3a and 3b can then illustrate the storage of the set of the basic representations for the fourth solution.
Compact Representation of the Exponents
There follows a description of another advantageous solution that minimizes the storage and that consists in compactly representing the exponents of a basic decomposition on a limited number of words. In this variant of representation of the basic decompositions, the intermediate decompositions and those of the partial ranks are also represented compactly. Advantageously, this solution also minimizes the complexity of the determination of these decompositions, as will be seen.
Compact Representation of the Decompositions
A search is conducted to determine, for each prime factor pi, an upper limit βi of the maximum value of its exponent in the numerator of the partial ranks. This limit gives the maximum number of possible values of the exponent of pi, namely βi+1. By using bni denote the integer number of bits to represent the value (βi+1) in binary fashion, we obtain:
in which ┌x┐ denotes the integer immediately greater than or equal to x (┌x┐−1<x≦┌x┐).
The exponents of the power factor decomposition of a term K involved in a partial rank t′ can be represented compactly by a word eK of Bn bits
this word eK being such that:
The notation “<<B” represents a left shift of B bits.
It will be noted that if the number n is great, it may be that Bn is greater than the number of bits B0 used to represent integers (16, 32 or 40 bits). In this case, the exponents of the prime factor decomposition of an integer K involved in t′ are represented in the form of M whole words eK(m), 0≦m<M (with, of course, M>1).
Advantageously, the M words can be formed as follows:
in which Bni
in which (Bni
Of course, other variants can be provided. For example, one variant consists in separately storing the exponent of p1 and applying the above processing operation from the exponent of p2.
Determination of the Upper Limit
The limits βi can be determined in a number of ways. Using the information on the permutation code (size q of the alphabet, weight wi 0≦i<q), the maximum value of each exponent of the numerator of a partial rank can be explicitly determined. If a number of permutation codes (possibly of different dimensions) are used, the greatest of the maximum values is preferably chosen for each exponent.
The present invention advantageously proposes a general processing operation to determine the upper limit in the context of the Schalkwijk enumeration. The processing operation uses no a priori information on the permutation codes used other than the maximum dimension. It simply uses the relation:
to then choose
This very general processing is particularly suitable when a large variety of permutation codes is used.
Table 5a gives the upper limits of the maximum values of the exponents in the numerators of Ikd
Table 6a (respectively 6b) gives the compact representation of the exponents of l and l! for the dimension n equal to 8 (respectively 15).
As a purely illustrative example, let us try to determine the decomposition of the integer l=12 using table 6b.
Preferably, in table 6b, since the maximum dimension of the codes is n=15, the exponent of “2” is represented on 4 bits, that of “3” on 3 bits, and the other prime factors 5, 7, 11, 13 on 2 bits. In the table, in the column l=12, its compact exponent e12=18 can be read.
Relying on the reading of the table below, the binary representation of 18 (=16+2) on B15=15 bits is: 000 0000 0001 0010, i.e. 00 00 00 00 001 0010 by grouping together the bits associated with one and the same prime number.
The 4 low order bits (weights i=0 to 3) are the exponent of the prime factor 2, i.e.: 0010=2, which means that 2 is the exponent to be assigned to the prime number 2.
The next 3 bits (weights i=4 to 6) are the exponent of the prime factor 3, i.e.: 001=1, which means that 1 is the exponent to be assigned to the prime number 3.
The next 2 bits (weights i=7 to 8) are the exponent of the prime factor 5, i.e.: 00=0
The next 2 bits (weights i=9 to 10) are the exponent of the prime factor 7, i.e.: 00=0
The next 2 bits (weights i=7 to 12) are the exponent of the prime factor 11, i.e.: 00 =0
The next 2 bits (weights i=13 to 14) are the exponent of the prime factor 13, i.e.: 00=0
The extraction procedure entails masking the high order bits to recover the exponent of the prime factor contained in the low order bits, then shifting the compact exponent of the number of bits recovered to switch to the exponent of the next prime factor.
Thus, in dimension 15, there are 6 exponents to be extracted, beginning with the exponent of 2.
The binary representation of the exponent of 2 corresponds to the 4 lower order bits of 18, i.e. 0 0 1 0 which corresponds to 2. To recover them, the high order bits of 18 are masked with 15 (denoted 18 & 15), which is equivalent to: 24−1=1 1 1 1.
The result obtained is e12=18 & (2<<4−1)=2, which means that 2 is the exponent to be assigned to the prime number 2.
Then 18 is shifted 4 bits to the right, giving: 000 0000 0001=1
The binary representation of the exponent of 3 corresponds to the three low order bits of 1, i.e. 0 0 1 (=1). To recover them, the high order bits of 1 are to masked by 7 (denoted 1 & 7 and with the value 23−1=1 1 1).
The result obtained is e112=1 & (2<<3−1)=1, which means that 1 is the exponent to be assigned to the prime number 3.
Then 1 is shifted 2 bits to the right, which then gives: 0000 0000=0 for all the other high order bits.
It will therefore be recalled that the powers of l=12 are:
Upper Limit for the Denominator
It is assumed here that, for each prime factor, its exponent in the denominator of a partial rank t′ is less than or equal to its exponent in the numerator of t′. Such is the case if t′ is strictly positive because et′i=(e(Num(t′)i−eDen(t′)i)≧0, therefore e(Num(t′)i≧eDen(t′)i.
In practice, with the Schalkwijk formula, and if q>1, the value e(n−1)!i is an upper limit of the maximum value β′i of the exponent of the denominator Pk (Pk≦(n−1)! if q>1).
It is therefore sufficient to check the inequality e(n−1)!i<2b
In the other cases, it is possible to search explicitly for β′i and calculate bni using the maximum of βi and β′i.
In the case where q=1, it will be understood that a single code word of known rank (t=0) is involved in the permutation code and it is therefore useless a priori to perform the rank calculations and the corresponding reverse operations. However, if there is no desire to deal with this particular case to separately, provision can still be made to calculate the value bni by the maximum of βi and en!i. Table 7 below illustrates this case for n=16.
There now follows a brief description of the memory capacities required to store the basic decompositions.
Independently of the solution chosen for the representation of the basic decompositions, the basic decompositions are stored in tables and use can then be made of an addressing of these tables in rank encoding and decoding operations. Although the decomposition of 0 is not possible (and, moreover, not used), it may be preferable to store “dummy” exponents for the decomposition of 0 (for example 0s or 1s), to simplify the address calculation. Table 8 below summarizes the memory size needed to store the data relating to the basic decompositions for the five solutions described for these two cases (storage or non-storage of the dummy decomposition of 0).
In the fifth solution, the storage (+mn!) of the numbers of bits bni is taken into account. However, in practice, rather than read them from a memory, the latter are “hardwired” (their value being set in a calculation program without being declared as variable), as will be seen in the embodiments below. It therefore appeared pointless to store them in practice.
Table 9 gives the memory needed to store the data relating to the exponents of the decomposition of these five solutions for nmax=8 and 15 (with dummy storage of 0).
There now follows a description of the storage of the prime factor powers.
Outside the basic decompositions, the present invention uses the prime factor powers to calculate the partial rank from its decomposition. It is possible, from a table of these prime factors, to calculate their powers in real time (“on line”). Preferably, the powers of the prime numbers other than 2 are precalculated and stored and only the powers of 2 are calculated in real time. The table 10a below gives the powers of 3, 5 and 7 needed for a permutation code of dimension 8 (like those used in the AMR-WB+ encoder). Table 10b gives the powers of 3, 5, 7, 11 and 13 needed for the permutation codes of maximum dimension 15 (like those used in the TDAC encoder).
Here again, it is possible to store only the necessary number of powers for each prime factor. As a variant, if it is preferred to have only a single table of regularly addressable powers, provision can be made to store, for each prime factor, as many values as the number of powers of p2 that are needed (p2=3). For the unused powers, it is, of course, possible to use a storage of dummy values such as 1s or 0s.
There now follows a description of the calculation of the rank of a permutation to perform an encoding using the invention.
There are several variants depending on the set of basic decompositions selected and their representation. For conciseness, the explanation of the possible implementations below is limited to the case of the preferred embodiment for the set of basic decompositions, with decompositions of the factorials of l! and l.
Below, the solution of exploded representation of the exponents with mn! exponents for each basic decomposition, which is the most general case, is explained first. Variants of exploded representation of the exponents are then described. Finally, the solution of compact representation of the exponents of the basic decompositions is explained, together with a few of its variants. It will then become apparent that the invention is perfectly applicable to a processing operation for encoding the rank of a permutation.
The Schalkwijk algorithm is reviewed below, as an example of enumeration processing.
Exploded Representation of the Exponents of the Decomposition
Let n be the maximum dimension of the permutation codes used, and mn! be the number of prime factors involved in the decomposition of the quantity n!
A description of a first embodiment of an encoding that uses an exploded representation of the exponents of the decomposition is given below.
Here, the exponents of the basic decompositions of l and l! are preferably stored according to the “fourth” solution of table 4d hereinabove, with dummy storage for l=0 in two single-dimensional tables respectively denoted Dl and Dl! having mn!×(n+1) cells. As mentioned hereinabove, it would also be possible to consider two-dimensional tables having mn! columns and (n+1) rows. The exponents of l (respectively l!) being stored regularly (each on mn! values), the operations for reading the exponents of a basic decomposition necessitate an address calculation in the table Dl(respectively Dl!). To read the exponents of the decomposition of l! (respectively l), it is necessary to point to the address (l×mn!) of the table Dl! (respectively Dl), and so target the address of the exponent el!1(respectively el1), the exponent el!2 (respectively el2) being at the next address (l×mn!+1) and more generally the exponent el!i (respectively eli) being at the address (l×mn!+i−1). As mentioned hereinabove, in two-dimensional tables, the exponent el!i(or eli) is at the address ((l; (i−1)) (column (i−1) and row l).
It should be noted that, if provision is not made for dummy storage for l=0, the address calculation in the table Dl of the n basic decompositions of l (l>0) is: (l−1)×mn!.
Reference is made to
In the step C-1, the variable dk is read. The step C-2 consists of an update of the cell dk of the table w: w[dk]=w[dk]+1.
The step C-3 is an update of the exponents of the decomposition of Pk (table P), with, in particular:
Thus, to implement the step C-31, the first exponent of the basic decomposition w[dk] in the table Dl, denoted ew[d
In parallel, in the step C-4, Sk is calculated from the usual relation
The step C-5 is a test on the value of Sk. If Sk is zero (Y arrow), which means that the partial rank Ikd
In parallel, the step C-7 consists in reading the mn! exponents e(n−1−k)!i of the basic decomposition (n−1−k)! in the table Dl! at the address mn!×(n−1−k). It will be noted that the step C-7 is performed if the sum Sk is not zero (N output of the test C-5) so as to avoid the unnecessary reading in the table Dl! if, in any case, the partial rank Ikd
In the step C-8, the results of the different readings of the tables can be grouped together to calculate first of all the mn! exponents of the decomposition of the partial rank Ikd
Finally, in the step C-9, the partial rank Ikd
It will be recalled that the term w[dk] is a weight that is necessarily less than or equal to the maximum dimension n of the permutation code concerned. Similarly, the sum Sk of such weights remains less than the maximum dimension n and the same applies, of course, for (n−1−k). The decompositions of w[dk], Sk and (n−1−k)! are indeed listed in the tables of decompositions of the integers or of the integer factorials ranging up to the maximum dimension n, such as the table 4d. From the decomposition w[dk] listed in a table and the decomposition of Pk−1 determined on the preceding loop (k−1) and kept in memory, the decomposition of Pk is determined.
As a purely illustrative example of the partial rank calculation steps of
In this example, it is assumed that, at a preceding position k=3, the table of the weights w is {1, 1, 0, 3} and therefore P3=1! 1 0! 3!=6. The table P of the exponents of the decomposition of P3 (=21×31×50×70) is therefore {1, 1, 0, 0}.
In the position k=2, for the step C-1, it is assumed that d2=2 has been read. In this example, in the step C-2, the cell w[2] is updated by incrementing it by 1 (w[2]=0+1=1).
In the step C-31, the four exponents of the decomposition of 1 (=w[2]) are read, namely 0, 0, 0, 0 (see table 4d, 6th to 9th columns and 3rd row l=1).
Then (step C-32), the table P is updated, so P={1, 1, 0, 0} is obtained.
In the step C-4, Sk is calculated: Sk=w[0]+w[1]=1+1=2. Sk is therefore not zero (test C-5).
In the step C-9, the partial rank Ikd
Referring again to
The step C-11 then aims to decrement the variable k (k=k−1) and the step C-12 is a test on the value of k to decide whether to continue the processing. Thus, if k≧0 (Y arrow at the output of the test C-12), the steps of the processing operation are reiterated from the first step C-1, with a value of k decremented by one unit. Otherwise (N arrow at the output of the test C-12), the processing is terminated in the end step C-13 (“END”).
Thus, it will be understood that the above step C-9 calculates a partial rank from its decomposition determined in the step C-8, itself determined from three intermediate decompositions:
The determination of two of them, (n−1−k)! and Sk, performed in the steps C-6 and C-7, involves a simple read in the respective tables of the basic decompositions Dl and Dl!. The determination of the third intermediate decomposition (that of Pk) can also be performed simply from the q basic decompositions of w[d]! read in the table Dl! according to a relation of the type
The step C-3 presents a more advantageous variant of determination of this intermediate decomposition. The intermediate decomposition of Pk can in effect be determined from a basic decomposition read in the table Dl and from another intermediate decomposition (that of Pk+1) calculated for another partial rank (Ik+1d
Hereinabove, the calculation of the rank was performed by a loop from the last position (k=n−1) to the first position (k=0). However, the invention also applies, of course, to a loop from the first to the last position. All that is needed is to change the initialization phase and adapt the steps C-2 and C-3 and their order. To this end, the table of the weights w can be initialized with the q weights w0d. Then, for d ranging from 0 to q−1, the mn! exponents of the decomposition of w[d]! are read in the table Dl!, to calculate by cumulative addition, the mn! values of the table of the intermediate decomposition (exponents of the decomposition of P0).
The step C-3 then takes place after the step C-10 and before the step C-2. The intermediate decomposition of P is updated by subtracting from P[i] the exponent ew[d
It is worth briefly noting that, for the permutation codes of variable dimension n, rather than performing mn
More generally, the implementation represented in
Thus, in a first variant, each basic decomposition (of l or l!) also contains the number ml!. The reading of the number ml! (0≦l≦n) offers advantages. In effect, the steps C-3 and C-6 to C-10 are no longer carried out mn! times each, but only:
of the step C-8.
If, furthermore, the values ml have been stored, all that is then needed is to perform:
In another variant of the encoding, if in addition, the storage of the exponents of the basic decomposition of/is used according to the third solution (table 4c hereinabove), the step C-3 can be performed for mw[d
In a third variant, instead of decomposing a partial rank into three terms (two in the numerator and one in the denominator), it is decomposed into two terms, one of which is a quotient. Thus, the partial rank Ikd
This quotient can be updated by the following relation:
Thus, rather than determine the decomposition of Rk from the q+1 basic decompositions (those of (n−1−k)! and of the q w[dk]! read in the table Dl!), an intermediate decomposition of Rk is determined from the intermediate decomposition of Rk+1 and from the basic decompositions of (n−1−k) and of w[dk] (these two basic decompositions being read in the table Dl), which is expressed:
Compared to the preceding variants, instead of determining and storing the intermediate decomposition of the denominator of Ikd
According to the storage option, this update can be done with mn! additions and subtractions or m(n−1−k)! additions (respectively mw[d
According to the storage option, mn! additions or ms
It will be noted that this ratio Rk is not necessarily an integer, which means that the exponents R[i] can be negative. In this variant, the decomposition of the factorials in the encoding (therefore table Dl!) is no longer useful, such that a simple set of (n+1) basic decompositions of the integers l (l≦n) can be used by storing only the table Dl.
Compact Representation of the Exponents of the Decomposition
There now follows a description of a second embodiment of an encoding, based on a compact representation of the exponents of the decomposition.
The exponents of the basic decompositions are represented compactly and no longer in exploded form as described hereinabove with reference to the first embodiment. For conciseness, only the case where the compact representation of the exponents is contained in a single word is explained. These words are therefore stored as described hereinabove, with storage of a dummy word for l=0 in the two tables respectively denoted D′l and D′l! with (n+1) cells. The calculation of the address in these two tables of a word containing the exponents of a basic decomposition is direct because, to read the word of the decomposition of l! (respectively l), all that is needed is to point to the address l of the table D′l! (respectively D′l).
It will be noted that, without the storage of a dummy word for l=0, the word corresponding to a basic decomposition l (with l>0) is at the address (l−1) in table D′l.
The word eP will be updated in each position (step CC-3 below).
Reference is now made to
In the step CC-1, the variable dk is read. The step CC-2 consists of an update of the variable w: w[dk]=w[dk]+1.
The step CC-3 is an update of the word ep with, in particular:
In parallel, in the step CC-4, the sum Sk is calculated:
The following step CC-5 is a test on the value of Sk. If the sum Sk is zero (N arrow), the index k is directly decremented. Otherwise (Y arrow at the output of the test CC-5), the processing continues with a reading of the word eS
In parallel (and advantageously depending on the result of the test CC-5), the word e(n−1−k)! containing the compact representation of the exponents of (n−1−k)! is read in the table D′l!, in the step CC-7.
In the step CC-8, the results obtained from the various steps CC-3, CC-6, CC-7 are grouped together to calculate the word
containing the compact representation of the decomposition of the partial rank Ikd
The step CC-9 targets the extraction of the mn! exponents
of the decomposition of the partial rank Ikd
by appropriate offset of the bits of the word as described previously. To this end, a loop is provided on an index i ranging from 1 to mn! (initialization CC-91 of i at 1, test CC-93 on the value of i and incrementation CC-94 until i reaches the value of mn!). Each loop of i then has applied to it instructions CC-92 of the type:
It will be recalled that the notations “<<b” and “>>b” respectively designate a left shift and a right shift of b bits. Moreover, the notation “&” designates the bit-by-bit “AND” logic operator. The instruction i′1) consists in recovering the bni low order bits of
In practice, the masks corresponding to the instruction “((1<<bni)−1)” are hardwired.
In other words, a mask ((1<<bni)−1) of the high order bits is applied at the outset (for the loop index i equal to 1) so as first of all to recover only the bn1 low order bits of
giving the exponent
associated with the first prime factor p1.
are shifted by bn1 “to the right” to recover the highest order bits giving the exponents associated with the next prime factor p2 (mask ((1<<bn2)−1) of the high order bits,
is then extracted,
The following step CC-10 consists in calculating the partial rank Ikd
Then, the partial rank Ikd
The value of the index k is decremented in the next step CC-12 (k=k−1) and, before recommencing the steps CC-4, CC-1, CC-7 and subsequent steps with this decremented value, a check is carried out in the test CC-13 to see if the value of k has not reached −1 (k<0), in which case the processing is terminated (step CC-14).
Thus, independently of the representation of the decompositions, the present invention makes it possible to effectively calculate partial ranks. The step CC-10 targets the calculation of a partial rank from its decomposition determined in the preceding steps CC-8 and CC-9. Three intermediate decompositions (of the terms (n−1−k)!, Sk and Pk) are used. The determination of two of them ((n−1−k)! and Sk) performed in the steps CC-6 and CC-7 consists in a simple reading of their compact representation in the tables D′l! and D′l. The determination of the third intermediate decomposition (Pk)) performed in the step CC-3 also requires a reading of the table D′l followed by an update of the compact representation of this intermediate decomposition by addition of the compact representation of the basic decomposition which has been read.
As indicated previously with reference to the first embodiment, the storage of the values ml! (0≦l≦n) makes it possible to reduce the complexity of the steps CC-9 and CC-10. The loop for extraction of the exponents of the decomposition of a partial rank Ikd
There now follows a description of the decoding of the rank of a permutation by using the invention.
Here again, there are several variants depending on the solution for representation of the basic decompositions (exploded or compact). There follows a description of a first embodiment of the decoding, similar to the first embodiment described hereinabove for the encoding using the exploded representation of the decompositions and their storage according to the fourth solution relating to table 4d given above. It will become apparent that the invention is perfectly applicable to a decoding of the rank of a permutation taking the Schalkwijk algorithm as the example.
The decoding using an exploded representation of the exponents of the decomposition preferably begins with an initialization of the data as follows.
Reference is now made to
The first step D-1 consists in a reading of the mn! exponents e(n—1−k)!i of the decomposition of (n−1−k)! in the table Dl!
The next step D-2 sets the values dk=0 and Ikd
There then follows a search for the first value of dk of the alphabet such that w[dk]≠0. For this purpose, a test D-3 is provided to check whether w[dk]=0, in which case (Y arrow) the value of dk is incremented (dk=dk+1) and the test D-3 is reiterated until a non-zero w[dk] value is found. If this value has been found (N arrow for w[dk]≠0), the next step is a test D-5 on the value of the rank t. If the rank is zero (Y arrow at the output of the test D-5), applying the following steps is pointless until the exponents of Pk are updated (step D-18). If the rank is not zero (N arrow at the output of the test D-5), the processing continues with the subsequent steps D-6 and D-7 in which there are respectively set Sk=0 and an intermediate value Ik at Ik=Ikd
The next step D-8 is a calculation for updating the sum Sk=Sk+w[dk]. It is followed by the reading of the table Dl (step D-9) of the mn! exponents eS
The step D-10 targets the calculation of the mn! exponents
of the decomposition of the partial rank, from the relation:
with 1≦i≦mn!. The exponents P[i], initialized as described above, are updated in the step D-18 which will be described later, at the end of the current loop and for a next loop.
The step D-11 targets the calculation of the partial rank:
The next three steps target a test on the value of the total rank t, by comparing it to the value of the partial rank. To do this, in the step D-12, the value of dk is incremented (dk=dk+1) and the test D-13 is as follows: t−Ikd
If the inequality is satisfied (Y arrow), the steps D-7 to D-13 are repeated with the new incremented value of dk. Otherwise (N arrow), the processing continues with the step D-14 for decrementing the value of dk (dk=dk−1) to return to the value of dk before the step D-12. For this value of dk, the partial rank Ikd
Before looping back to the first step D-1, a check is carried out to ensure that the n components have not all been processed. For this, the test D-21 on the value of k is provided, by comparing it to n (k<n). As long as the index k has not reached the value n (Y arrow at the output of the test D-21), the processing is recommenced at the step D-1 for a next value of k. Otherwise (N arrow at the output of the test D-21), the processing is terminated at the end step D-22.
It should be remembered that the step D-11 targets the calculation of a partial rank by using its decomposition determined in the step D-10 from three intermediate decompositions of the respective terms (n−1−k)!, Sk and Pk. The determinations of two of them ((n−1−k)! and Sk) performed in the steps D-1 and D-9 consist of a simple reading in the respective tables Dl! and Dl. The determination of the third intermediate decomposition (Pk)) performed in the step D-18 is also performed by a reading of the table Dl (step D-17) followed by an update by exponents of this intermediate decomposition by subtraction of the exponents of the basic decomposition that has been read (step D-18). The initialization described hereinabove of this intermediate decomposition requires q readings of the table Dl! followed by an update by exponents of this intermediate decomposition by addition of the exponents of the q basic decompositions read.
As for the encoding described hereinabove, the processing of
A variant involving the use of the exponents of the ratio Rk (as described previously) is particularly interesting. In effect, in the decoding processing operation described hereinabove with reference to
are calculated for several values of d. For each value of d tested, the other variants require, for each exponent, a subtraction and an addition
However, the variant using the exponents of the ratio Rk requires only an addition
because only the sum Sk and its exponents eS
There now follows a description, with reference to
First, the data is initialized as follows.
The letters “DC” in the notations of the steps DC-n of
The step DC-1 consists in reading the word e(n−1−k)! containing the compact representation of the mn! exponents of the term (n−1−k)! in the table D′l!
The steps DC-2 to DC-8 are similar to the steps D-2 to D-8 described hereinabove with reference to
On the other hand, in the step DC-9, the word eS
The general step DC-11 globally consists in an extraction of the exponents of the partial rank Ikd
described hereinbelow (step DC-112), comparison (test DC-113) of the loop index i with the value mn!, with incrementation of the index i (step DC-114) until the value mn! has been reached);
(step DC-112):
by masking of the high order bits of
by the mask ((1<<bni)−1):
This general step DC-11 is similar to the general step CC-9 in
The steps DC-12 to DC-17 are, themselves similar to the steps D-11 to D-16 described hereinabove with reference to
The updating of the exponents of Pk (table P) in the general step DC-18 is done, on the other hand, in the step DC-181, by reading the word ew[d
Then, the steps DC-19 to DC-22 are similar to the steps D-19 to D-22 in
There now follows a description of the various advantages offered by the variants explained hereinabove.
The variants of the first embodiment with exploded representation, using the tables of mn! (and/or ml or m′l), comprise fewer addition/subtraction operations than the main embodiment that only uses the table of the values mn!.
In this case, the saving in complexity is significant above all for the last positions (that is, when m(n−k)!, m′l, or ml are less than mn!). Nevertheless, this saving in complexity is accompanied by an increase in the complexity of the memory reading steps (steps C-31, C-6 and C-7). While there are fewer values to be read, the address calculation is, on the other hand, more complicated.
An interesting trade-off then involves regularly storing the basic decompositions (with mn! exponents) to facilitate the addressing of the tables Dl and Dl! and then store the values ml! in a table Dm with (n+1) cells. The values ml should then be stored to effectively reduce the number of additions/subtractions. However, this measure is necessarily accompanied by a reading of the values mS
Moreover, the benefits provided by the compact representation compared to the exploded representation are as follows:
also requires only a single addition and a single subtraction,
However, the compact representation requires an extraction of the exponents of the partial rank Ikd
(steps CC-9 and DC-11).
However, this operation is not necessarily a drawback for the calculation of the partial rank from its prime factor decomposition, as will be seen hereinbelow.
There now follows a description of the advantages of such variants for the calculation of the partial rank from its prime factor decomposition.
The complexity of the step for calculating the product of the prime factor powers, in the sense of the steps C-9 and CC-10 (respectively D-11 and DC-12) in the encoding (respectively in the decoding) increases significantly with the number of factors, even though it remains far less complex than the division in the sense of the prior art. Now, in practice, many of the exponents of the decomposition of a partial rank are zero and therefore the corresponding powers are also at 1. Often, all the exponents are zero or only the first exponents are not zero. It is therefore useful to be able to detect and keep only the non-zero exponent powers. In the detailed representation, this detection can be performed only by mn! tests or m(n−1−k)! tests (one for each prime factor).
Advantageously, the compact representation makes it possible to test with a single test whether all the exponents are zero
and, in this case, the rank t′=1. Furthermore, the detection of the high order bit of et′ makes it possible to obtain the index of the greatest non-zero exponent prime factor in the rank t′ and reduce the number of repetitions of the loop of the step CC-9 (respectively DC-11) in the encoding (respectively in the decoding).
It should nevertheless be noted that the detection of the non-zero exponents, in detailed representation as in compact representation, increases the complexity. If all the exponents are non-zero, the complexity of the multiplication of the powers of the prime factors remains the same and this complexity is then compounded by that of the procedure for detecting non-zero exponents.
Thus, in a first variant, the detection of the zero exponents can be conducted only if the possible number of prime factors becomes great (k much less than n) and the complexity of the multiplication of their powers is greater than the complexity of the detection procedure. For this, different loops according to the positions can be provided, even though this implementation is applied at the cost of an increase in the lines of instruction.
It is also possible to combine the exploded and compact representations. For the last positions (the value ml! being low), the calculation of the intermediate decompositions entails few operations. The use of an exploded representation is then favored, this representation not requiring the extraction of the exponents of a partial rank. On the other hand, for the first positions, the use of a compact representation is more favored.
There now follows a description of a few exemplary implementations in existing encoders/decoders.
3GPP AMR-WB+ Encoder
The 3GPP AMR-WB+ encoder (standard [3GPPTS26.304]) uses an algebraic vector quantization for which the dictionary is a union of the permutation codes of the Gosset network RE8 of dimension 8.
The TCX technique corresponds to a predicting encoding by transform. More specifically, it involves an FFT transform encoding method applied after perceptual weighting filtering. In the standard [3GPPTS26.304], the FFT spectrum obtained is subdivided into subbands (or subvectors) of dimension n=8 and these subvectors are separately encoded. The quantization of the subvectors uses the regular network of points RE8. The quantization dictionaries in dimension 8 consist of a union of permutation codes of type I obtained from the network of points RE8.
In the TCX encoder according to the standard [3GPPTS26.304], each permutation code corresponds to a given signed leader vector in dimension n=8. The quantization index of a point of the network RE8 is calculated by a formula of the type:
index=cardinality offset+permutation rank
The rank is calculated by the Schalkwijk formula whereas the cardinality offset is tabulated. Nevertheless, these signed leaders are represented via their absolute leaders in order to optimize the storage and the search in the permutation codes. The list of the associated absolute leaders can be found in the reference:
“Low-complexity multi-rate lattice vector quantization with application to wideband TCX speech coding at 32 kbit/s” by Ragot S., Bessette B., Lefebvre R., in Proc. ICASSP, vol. 1, May 2004 pp. 501-4.
To illustrate the different variants of the invention, three exemplary implementations are described below. The first two exemplary implementations relate to the calculation of the rank of a permutation (encoding), one using an exploded representation of the decompositions and the other a compact representation.
In these exemplary implementations hereinbelow and the corresponding appendices, the tables R and P are indexed from R[0] to R[mn!−1] and from P[0] to P[mn!−1] (and not from 1 to mn! as described by way of example hereinabove), but with no particular impact on the processing for the calculation of the rank.
In this embodiment, an exploded representation of the basic decompositions is used.
Their exponents are stored in two tables with 36 cells (=(8+1)×4). These are the tables given in appendix A-11 and denoted Dl [36] (containing the exponents of the decompositions of the integers l (0≦l≦8) therefore with storage of a dummy decomposition for 0) and Dl![36] (containing the exponents of the decompositions of their factorials).
The three tables of the powers of 3, 5 and 7 are also stored:
In this embodiment, the decomposition of a partial rank is determined from two intermediate decompositions, one being the basic decomposition of the integer Sk and the other being the intermediate decomposition of the quotient:
As indicated above, rather than determine the intermediate decomposition of Rk from basic decompositions corresponding to the (q+1) basic decompositions of (7−k)! and (wki!)i=0, . . . , q−1, this intermediate decomposition is determined from the intermediate decomposition of Rk+1 and two basic decompositions of (7−k) and wkd(k). The four exponents of this intermediate decomposition are stored in a table R.
The invention applies the knowledge of m(7−k)! and of the maximums of mw
The corresponding processing is given in appendix A-12. It will be noted that the loop on the positions is exploded. It will also be noted that the exponent of the prime factor pi of the quotient is stored in the cell R[i−1] of the table R with 4 cells.
In a variant with the 3GPP AMR-WB+ encoder, the basic decompositions are compactly represented. The words containing their exponents are stored in two tables with 9 cells (=(8+1)). Referring to appendix A-21, the table D′l contains the words for the decompositions of the integers l (0≦l≦8) (therefore with dummy storage of the decomposition for l=0) and the table D′l! contains the words for the decompositions of their factorials.
The powers of 3, 5 and 7 are also stored in a table Pow[12] with 12 cells (with dummy storage of 0 for the unused powers).
The decomposition of a partial rank is determined from three intermediate decompositions, two being the basic decompositions of the integer Sk and of the factorial (7−k)! and the third being an intermediate decomposition of the denominator of the partial rank:
As indicated previously, rather than determining the intermediate decomposition of Pk from the q basic decompositions of (wki!)i=0, . . . , q−1, this decomposition is determined from the intermediate decomposition of Pk+, and from the basic decomposition of wkd(k). The compact word containing the four exponents of this intermediate decomposition is denoted “eP” in appendix A-22. Also, “el” denotes the compact word containing the four exponents of the decomposition of a partial rank.
Here again, the knowledge of m(7−k)! is applied to extract only m(7−k)! exponents of the compact word representing the decomposition of a partial rank.
The corresponding processing is the subject of appendix A-22. Here again, the loop on the positions is exploded.
The third exemplary implementation deals with the decoding of the rank of a permutation, in 3GPP AMR-WB+ encoding.
An exploded representation of the basic decompositions is preferably used, as in the first exemplary embodiment, and a decomposition of the partial rank in three terms is preferably used as in the second exemplary embodiment. The loop on the positions is not however exploded.
As indicated previously, rather than determine the intermediate decomposition of Pk from basic decompositions, it is determined from an intermediate decomposition of Pk−1 and from the basic decomposition of wkd
The corresponding processing is the subject of appendix A-3. It will be noted that the exponent of the prime factor pi of the quotient (respectively of the product) is stored in the cell R[i−1] (respectively P[i−1]) of the table R (respectively P) with four cells.
Thus, the first example hereinabove uses a decomposition of the partial rank in two terms (including a quotient), the other two examples use a decomposition in three terms (two for the numerator and one for the denominator). The mode for the decoding uses only m8!(=4) terms whereas the two modes for the encoding use a separate processing of the positions to use the ml! or ml terms that are not read but “hardwired” in the algorithm by exploding the loop on the eight positions of a point of the Gosset network.
Exemplary Implementation for the TDAC Encoder
A final exemplary implementation concerns the Applicant's TDAC perceptual frequency encoder used to encode digital audio signals sampled at 16 kHz (wideband), the principle of which is described hereinbelow.
The TDAC encoder uses a statistical vector quantization with variable dimension and resolution and maximum dimension 15.
In the case of the permutation codes of the regular network of points RE8 of dimension 8, the invention essentially makes it possible to reduce the complexity. However, in the case of the TDAC encoder, which uses permutation codes of dimensions greater than 12, the invention proves highly advantageous because it allows not only for a reduction in complexity but also for an implementation of the encoder on fixed point processors, the maximum precision of which is limited to unsigned 32-bit integers. Without the invention, such an implementation would be extremely complex.
The principle of this encoder is as follows.
An audio signal that is limited in bandwidth to 7 kHz and sampled at 16 kHz is subdivided into frames of 320 samples (20 ms). A modified discrete cosine transform (“MDCT”) is applied to input signal blocks of 640 samples with an overlap of 50% (which corresponds to a refreshing of the MDCT analysis every 20 ms). The spectrum is limited to 7225 Hz by setting the last 31 coefficients to zero (then only the first 289 coefficients being different from 0). A masking curve is determined from this spectrum and all the masked coefficients are set to zero. The spectrum is divided into 32 bands of unequal widths. Any masked bands are determined according to the transform coefficients of the signals. For each band of the spectrum, the energy of the MDCT coefficients is calculated (to evaluate scale factors). The 32 scale factors constitute the spectral envelope of the signal which is then quantized, encoded and transmitted in the frame. The dynamic allocation of the bits is based on a masking curve for each band calculated from the dequantized version of the spectral envelope, so as to obtain a compatibility between the binary allocation of the encoder and of the decoder. The standardized MDCT coefficients in each band are then quantized by vector quantizers using dictionaries that are nested size wise, the dictionaries consisting of a union of type II permutation codes. Finally, the tone and voicing information and the spectral envelope and the coded coefficients are multiplexed and transmitted in frame.
The exemplary implementation for the calculation of the rank of a permutation (encoding) here uses a compact representation of the decompositions. The dimension of the permutation codes used being variable, the loop on the positions is not exploded. This embodiment illustrates a method of detecting non-zero exponents of the decomposition of a partial rank.
Here, the basic decompositions are compactly represented. The words containing their exponents are stored in two tables with 16 cells (=(15+1)). In appendix B-1, the table D′l contains the words for the decompositions of the integers l (0≦l≦15) and the table D′l! contains the words for the decompositions of their factorials.
The powers of 3 are also stored in a table with eight cells (denoted Pow3) and the powers of 5, 7, 11 and 13 in a table (denoted Pow) with twenty cells (with dummy storage of 0 for the unused powers).
The corresponding processing is retranscribed in appendix B-2.
Of course, the present invention is not limited to the embodiments described hereinabove by way of example; it embraces other variants.
To the knowledge of the Applicant, the present invention involves the first use of the prime factor power decompositions in the permutation codes. This use is however particularly advantageous when combinatorial expression calculations are provided, as in the vector quantization with permutation codes. Thus, generally, the present invention targets this use of the prime factor power decompositions for any combinatorial expression, even different from a rank of a permutation, in, encoding/decoding by one or more permutation codes.
The present invention is advantageously applicable in speech signal encoding/decoding, for example in telephony terminals, notably cellular. However, it applies to the encoding/decoding of signals of any other type, notably of image or video signals and encoded modulation.
The present invention also targets a computer program designed to be stored in memory of a digital signal encoding/decoding device, using permutation codes. This program then comprises instructions for implementing the steps of the method in the sense of the invention. Typically,
The present invention also targets a digital signal encoding/decoding device, using permutation codes, and comprising, with reference to
These means MEM, PROC can be designed:
Of course, the prerecorded representations in the memory MEM can be in the form of address contents (exploded representation) or in the form of words of bits (compact representation).
Number | Date | Country | Kind |
---|---|---|---|
0601435 | Feb 2006 | FR | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/FR2007/050778 | 2/13/2007 | WO | 00 | 8/15/2008 |