The present invention is concerned with coding and/or decoding in the context of wireless communications apparatus. It is applicable to, but not exclusive to, mobile telephones, base stations, WLAN with multiple antennas, vocoders and other data compression processors, and positioning systems requiring incremental refinement of positional information (such as for GPS applications).
Vector quantisation is a family of source coding (or data compression) techniques whereby an input sequence of symbols is represented by a codeword drawn from a given codebook according to some minimum-distortion criterion.
In some applications, the most important information about the input vector that has to be accurately represented is its “direction”. The concept of vector direction is associated with the concept of vector space spanned by a given vector; that is, all vectors that differ by a scaling coefficient are characterised by the same direction.
For example, if v is an M-dimensional complex vector and α≠0 is an arbitrary complex scalar, then v and αv have the same direction. Direction quantisation is obtained by using vector quantisation codebooks formed by unit vectors distributed on the M-dimensional complex sphere, for example. This explains the name of spherical vector quantisation. One typical distortion measure used in spherical vector quantisation is the chordal distance, which, for two vectors u and v is defined as
where <u, v>=vHu denotes the inner product and ∥u∥ the Euclidean norm.
Thus, the two linearly dependent vectors, v and αv, have zero chordal distance, whereas two orthogonal vectors have maximum chordal distance, equal to 1. A spherical vector quantiser selects the codebook representative, cεC, for an input vector v, with minimum chord distance, following the rule
In many vector quantisation applications, it is desirable to provide a description of input vectors that can be considered as a refinement of previous descriptions. For example, a message may be described with a particular amount of distortion and then a decision can be taken thereafter that the message needs to be specified more accurately. Then, when an addendum to the original message is sent, this refinement is presumed to be as efficient as if the stricter requirement had been used for the initial description.
In these multi-stage quantisation schemes, the goal is to operate as close as possible to the rate-distortion bound after each refinement stage. Another example where it may be desirable to add information on previous descriptions is in the encoding of correlated sources, to reduce the amount of redundant information. Instead of quantising the samples of a correlated vector process independently, some form of differential or hierarchical encoding can be applied to these samples.
Two example constructions that have been also used for spherical vector quantisation will now be described.
1) Hierarchical Codebook Construction.
Two (or more) spherical codebooks are designed with a “nested” structure. The finer codebook is built as a collection of several smaller codebooks, one for each of the quantisation regions of the coarse codebook. Each of these finer constituent codebooks has code-vectors distributed in one particular quantisation region of the coarse codebook. The two-stage coding principle is clear: as long as the source vector is confined within one quantisation region of the coarse codebook, the description can be refined by using the finer codebook, otherwise the description is updated from the coarse codebook.
2) Differential Encoding.
In this case, quantisation regions close in chordal distance are assigned to codewords close in Hamming distance. For a given sample input vector the difference between the actual quantisation index and the previous index is encoded. Therefore, if the vector process is correlated sufficiently, only the least significant bits of the code message need to be communicated to the decoder. The most significant bits will be sent only when the source has experienced an abrupt change.
Spherical vector quantisation (SVQ) is described, for example, in D. Love, R. Heath, and T. Strohmer, “Grassmannian beamforming for multiple-input multiple-output wireless systems,” IEEE Trans. Inform. Theory, vol. 49, no. 10, pp. 2735-2747, Oct. 2003. A hierarchical codebook design method for SVQ has recently been proposed in J. Kim, W. Zirwas, and M. Haardt, “Efficient feedback via subspace-based channel quantization for distributed antenna systems with temporally correlated channels,” EURASIP J Advances in Signal Processing, 2008.
However, the above references do not address refinement of directional quantisation of a vector source. This finds application for example in the encoding of channel state information for feedback and other source coding problems. Ordinary techniques for directional vector quantisation generally perform a single-stage quantisation and achieve a fixed resolution, whereas in many situations it is desirable to dynamically change the accuracy of the vector representation during the encoding process. In this case, it is more efficient to add information to previous descriptions of a source rather than performing a new quantisation with a finer codebook.
One other problem not addressed by the prior art is the provision of directional quantisation of a correlated vector source with adaptive resolution and constant rate.
The problem can be best illustrated by an example. In this example, it is desired to describe the “direction” of a continuous-time correlated vector source, by sampling and quantising the source at fixed time instants. The codebook index transition rate over time or, in other words, the rate at which the quantisation index changes over time, depends on the sampling rate and on the codebook size, which also determines the resolution of the vector quantiser. If the codebook provides the same code index over several consecutive quantisation events, there is clearly a waste of resources. This redundancy could be removed, for example, by reducing the sampling rate while keeping the same codebook size, by enlarging the codebook for the same sampling rate, or by adjusting both parameters.
There are applications where it is desirable 1) to keep the sampling rate and the message size at each quantisation event fixed, and 2) to operate with very few pre-defined codebooks. This is the case, for example, for mobile stations (MS) having to report periodically their channel directions to a base station (BS). In such a case, the BS may want to avoid negotiating the feedback rate every so often with each individual MS.
These problems can also be addressed by hierarchical codebook construction or differential encoding techniques. However, designing these nested codebooks according to some optimality criteria is generally difficult and the construction is unique for a given channel vector dimension. In fact, all codebooks in the hierarchy have to have the same dimensionality: if the same codec has to operate with different vector dimensions, different sets of codebooks must be provided for each input dimension.
An aspect of the present invention provides an SVQ scheme using “off-the-shelf” and scalable codebooks. In particular, in such aspect of the present invention, the codebook dimension is reduced by one at each refinement stage, which further reduces the encoding complexity, and the codebooks are designed separately. Besides, the same codebook of dimension n−1, defined for the first refinement stage of a dimension-n vector source, can be used for the initial quantisation stage of an (n−1)-dimensional vector.
An aspect of the invention provides a coder architecture that provides a representation of an input vector direction with incremental accuracy by using a set of vector codebooks of decreasing dimensions.
An aspect of the invention provides a decoder architecture that provides a representation of an input vector direction with incremental accuracy by using a set of vector codebooks of decreasing dimensions.
An aspect of the invention provides an encoding procedure for constructing a basis set of orthogonal reference vectors from a set of codebooks of decreasing dimensions, such that a truncated representation of a vector by the first n components in the new reference system, introduces a chordal distortion bounded above by the product of maximum chordal distances associated with the first n codebooks.
An aspect of the invention provides an encoding procedure for describing with incremental accuracy the direction of vectors of different dimensions, by using the same set of quantisation codebooks.
It will be appreciated that the nature of the invention lends itself to implementation by computer means, such as involving a computer program product. This product can be introduced as computer executable code, such as on a storage medium or a signal carrier, or as a hardware device such as an FPGA, a DSP or an ASIC.
Other aspects of the invention will become apparent from the following description of a specific embodiment of the invention, with reference to the accompanying drawings, in which:
The main structural difference between the embodiment of the present invention described herein and other methods for directional vector quantisation is that, in other techniques, the directional information can be refined by designing tree-structured codebooks, wherein each refinement codebook, apart from the initial one, can be partitioned into as many sub-sets as the number of quantisation regions of the coarse codebook one level up the tree structure. Therefore, all the codebooks in the hierarchy have the same dimensionality, and can be used for quantisation of just one particular vector dimension. The reader will recognise that the dimension of a codebook denotes the size of its vector components, and its size indicates its cardinality, that is the number of elements in the codebook.
Conversely, in this described embodiment of the invention, each refinement codebook has one dimension less than the parent codebook and is designed independently of the others. Therefore, the embodiment can use readily available codebooks, each of which is good for directional quantisation in its dimension, and can use the same codebooks for quantising vectors of different dimensions with various refinement depths.
Three main structural blocks can be identified in
A directional vector quantisation operation is applied to the reduced error vector, er(k−1), of dimension M−k+1. The distortion measure, d(er(k−1), c1(k)), used in this vector quantisation, denotes chordal distance, as defined in equation (1). This operation is not carried out in the M-th stage, as the error vector er(m−1) reduces to a scalar quantity.
An error reduction operation reduces by one the dimension of the error vector. The error vector is defined as the “chord” between the input vector er(k−1) in the directional quantisation operation and the output quantisation vector ci
It should be noted that the unitary matrix involved in the reduction operation is uniquely identified by the codec parameters, namely the codebook vectors, the error components to be nulled and the fact that it consists of a single plane-rotation. Therefore the matrix quantities Rk can be pre-computed and stored at both the encoder and decoder. It should be noted that there exist an infinite number of other unitary matrices that perform the same error reduction operation, therefore different assumptions could be made to fix one particular transformation. Any such criterion to select Rk out of its possibilities is assumed to be known at the both encoder and decoder (this does not apply if the matrices are pre-computed and stored), and does not change the nature of the encoding technique.
Calculation of the reconstruction coefficient yields the combination coefficients required for perfect reconstruction of the vector v direction. This operation is carried out from the second stage onwards, as one coefficient is required in order to combine two directional vector indications. The fundamental reconstruction step can be easily derived from the definition of the “chordal error” vector, which reads:
e
(k)
=e
(k−1)
−<e
(k−1)
, c
i
(k)
>c
i
(k),
where the notation of
ê
r
(k−1)
=c
i
(k)+αkêr(k). (2)
It should be noted that the directional information contained in the error vectors involved in the two expressions above is maintained, although they differ by a complex multiplication coefficient.
In another implementation, a different reconstruction formula may be used, which maintains the directions of the vectors involved. The general formula is obtained from (2) by a multiplication by a complex scalar coefficient, βk, and reads:
ê
r
(k−1)′=βkc(k)+αkêr(k)′.
Another structural property of the construction is that, say {circumflex over (v)}m the reconstructed vector using m refinement stages. Then, if the reconstruction coefficients, αk, k=1, . . . , m−1, are known exactly at the decoder, the final distortion is the product of distortions associated with the quantisation stages (it should be noted that the distortion measure is a positive number between 0 and 1), i.e.
If the reconstruction coefficients are quantised, it can be shown that under very mild constraints on the accuracy of the scalar quantisers, distortion reduction can be guaranteed at each refinement stage, that is:
d({circumflex over (v)}m, v)<d({circumflex over (v)}m−1,v)< . . . <d({circumflex over (v)}1, v).
Error augmentation, which is the reverse operation of the error reduction step carried out by the encoder; and
Error reconstruction, which implements the reconstruction formula (2) in a recursive fashion.
It should be noted that the decoder need not know the full encoded message up to the maximum refinement depth, in order to reconstruct the vector direction. In fact, reconstruction of the source vector can take place after each refinement message is received, generating increasingly accurate estimates of the vector direction.
One application for which embodiments of the present invention are envisaged to offer clear benefits is in the encoding of channel state information (CSI) for feedback purposes. Taking as an example a cellular or WLAN system, where the base station (BS) uses M antennas for transmission, where, to fix ideas, M can be 2, 3 or 4, and there is a multiplicity of mobile stations (MS) that have to report periodically their channel signature to the BS. This channel signature may consist of a complex M−dimensional vector, represented by vector v in
In a conventional system a single codebook is defined for each possible number of BS transmit antennas in use, i.e. C(1)⊂C4, C(2)⊂C3, C(3)⊂C2 in the present example. Each MS is obliged to report a feedback message every τf seconds, which consists of the codebook index, i1, resulting from a minimum chordal distance quantisation of the vector of channel measurements with codebook C(4−M+1). If a particular MS reports the same codebook index, on average, for two consecutive time intervals, this means that the channel coherence time is roughly 2τf and the feedback can be made more resource efficient in at least two ways: by either defining a finer codebook for any possible dimension M, or by reducing the feedback interval. Either solution can be impractical, as each requires introduction of new codebooks and/or negotiation of different feedback report rates for each MS.
The present embodiment of the invention provides an attractive solution to the problem, without the drawbacks of new codebook definitions and/or variable feedback report rates: the MS can report index i1 at times 0,2τf, 4τf, . . . , i.e. using a coarser time interval, τc=2τf, whereas a refinement message is sent on odd times τf, 3τf, 5τf, . . . . Refinement is achieved by using codebook C(4−M+1), for M=4, 3, while no codebook is required in the case M=2. The refinement message consists, in general, of an index i2 and a quantised reconstruction coefficient, Q1(α1), which can be encoded together in such a way, for example, that the coded message size is the same in all feedback reports.
In the exemplary network of
By means of the bus 1306, user operable input devices 1318 are in communication with the processor 1302. The user operable input devices 1318 comprise any means by which an input action can be interpreted and converted into data signals, for example, DIP switches. Audio/video output devices 1316 are further connected to the general-purpose bus 1306, for the output of information to a user. Audio/video output devices 1316 include any device capable of presenting information to a user, for example, status LEDs.
Let v=[−0.0624 −0.1168 −0.9912]T be the vector for which representation of direction is desired. Without loss of generality, the vector norm has been normalised to 1. Two codebooks are defined for directional quantisation in 3 and 2 dimensions, respectively, each consisting of 4 code-vectors:
At each refinement stage, it is assumed that the error vector component to be nulled is the last—this assumption will appear clear later in the example. Any other component can be chosen for each refinement stage, provided that coder and decoder share the same information.
Codebook C(1)={c0(1), . . . , c3(1)} is used for quantising v with minimum chordal distance.
Hence, the first quantisation operation yields the index
The selected code-vector representative is marked in bold (y1) in
Besides, as noted above, these matrices can be pre-computed and stored beforehand, as they depend solely on the codebook vectors and on the error component to be nulled.
This transformation is also such that R1Te1=[er(1)T 0]T, where the last component of the error vector has been nulled, according to the initial assumption. Therefore, a new directional quantisation can be applied to the reduced error vector er(1) by using codebook C(2)={c0(2), . . . , c3(2)}. The second stage quantisation index is given by
Besides this, a combination coefficient must be calculated. Firstly, some essential notation regarding the following two inner products will be explained:
where cos θ1, cos θ2 ε [0,1], while φ1,φ2 ε {−π, 0}, in the real case (φ1,φ2 ε [−π, π) in the complex case). The combination coefficient of interest is given by
α1=ej(φ
Therefore, the direction of e(2) is exactly described by the unit vector R2[1 0]T, and the degenerate third-stage quantisation vector is given by the scalar c3=1. The “inner product” associated with this pseudo quantisation stage is given by
and the combination coefficient reads
α2=ej(φ
The three orthogonal vectors (V) are depicted in
such that
are described as the two truncated representations. In the old set of coordinates the three vectors read
{circumflex over (v)}1=C{circumflex over (v)}1*=[−0.1266 0.5723 0.8102]T,
{circumflex over (v)}2=C{circumflex over (v)}2*=[0.2039 0.2243 1.1077]T,
{circumflex over (v)}3=C{circumflex over (v)}3*=[0.0724 0.1355 1.1499]T.
The three vectors are plotted in
d(v,{circumflex over (v)}1)=sin θ1=0.5069,
d(v,{circumflex over (v)}2)=sin θ1 sin θ2=0.1415,
d(v,{circumflex over (v)}3)=sin θ1 sin θ2 sin θ3=0.
The three distortion functions are separable functions of the distortions introduced by each directional quantisation stage. This property allows the codebooks to be designed C(1), C(2) separately for minimum chordal distortion in R3, R2 respectively.
Several different message formats can be used to encode the above vector decomposition, such that reconstruction of vectors {circumflex over (v)}1, {circumflex over (v)}2, or {circumflex over (v)}3 at the decoder is possible, depending on the refinement depth. The reference axes (i.e. quantisation directions) can be represented exactly by a codebook index, while the last possible basis direction is fully determined by the previous stages. For the scalar coefficients it is necessary to provide a finite-bit representation. For this reason it is convenient to adopt the representation of the combination coefficients given in (3)-(4). The floating point representation of the coded message, for each stages, which allows perfect reconstruction of the vector v direction, after three stages, is given by
i1|i2,α1|α2.
It can be shown that, by representing the “phase” of the combination coefficients (i.e. their sign in the real case) with 1 bit for the real case and 2 bits for the complex case, refinement can be guaranteed, even if no bits are used for representing the amplitude of the same coefficients. The strategy consists in setting a fixed threshold for each combination coefficients: if the amplitude of the coefficient is smaller than the pre-defined threshold, then the encoder stops the refinement process (i.e. the coefficient is replaced by zero), if it is larger or equal, the refinement stage is carried out and the amplitude of the coefficient is quantised with the requested number of bits (which can be zero). These threshold values are intended as parameters of the codec, which are known to both the encoder and decoder, and can be optimised for minimum average distortion, for any particular codec configuration. Besides, they can be fixed or periodically adjusted.
As an example, it is assumed that it is desired to expend 2 bits for the representation of each combination coefficient in the above case. The thresholds for an isotropic input vector distribution and a uniform quantiser are optimised, which yields the values ε1=0.3848, ε2=01296.
The maximum values of the coefficients, which determine the boundaries of the uniform quantisers, can be determined from the maximum distortions of codebooks C(1), C(2). It can be noted that C(1) corresponds to a cube in R3, therefore its maximum chordal distortion, which follows from basic geometrical properties, is dmax(1)=√{square root over (2/3)}. The maximum chordal distortion of codebook C(2) is also trivially found to be dmax(2)=sin(π/8). The two coefficients α1, α2 are then bounded by
The characteristics of the two uniform quantisers are drawn in
It can be shown that, if |α1|<ε1, it cannot be guaranteed in general that the reconstruction {circumflex over (v)}2 has a smaller distortion than {circumflex over (v)}1. Similarly, if |α2|<ε2, the reconstruction {circumflex over (v)}3 does not necessarily have lower distortion than {circumflex over (v)}2. Therefore, when using the quantisers of
In a different implementation, it may be decided to refine the vector description up to a given depth in all cases, by slightly modifying the characteristics of the scalar quantisers as shown in
In the particular example herein, the two pairs of quantisers give out the same result, because both coefficients exceed the threshold, as can be seen from (3)-(4). The coded message in the three possible refinement stages, then reads
i1|i2,Q(α1)|Q(α2). (5)
In terms of number of bits, the first message in this example consists of a 2-bit index, the first refinement stage (second message) requires 4 bits in total (2-bit index and 2 bits for the reconstruction coefficient), while the last message (second refinement stage) contains 2 bits for the coefficient. By adjusting the codebook sizes and the resolution of the scalar quantisers, it is possible to have coded messages with the same size for all refinement stages.
The decoder can reconstruct the three estimates of the direction of v, for the three possible encoding stages, from the coded message (5), by using the following formulas
As noted above, the transformation matrices R1, R2 are uniquely identified by vectors ci
d(v,{circumflex over (v)}1)=0.5069,
d(v,{circumflex over (v)}2)=0.2024,
d(v,{circumflex over (v)}3)=0.1517.
In order to evaluate how efficient the encoding scheme is in refining the description of a vector direction, the distortion-rate curve for an isotropic input distribution in R4 is compared with the distortion-rate performance of a single-stage minimum chord-distance quantisation.
A 2-bit vector codebook is used for the first quantisation stage and a 1-bit vector codebook for the first two refinement stages. In the third and last possible refinement stage the 1-bit vector codebook degenerates in a 1-bit scalar quantisation of the reconstruction coefficient tan θ2 cos θ3. The real combination coefficients (0-bit quantisation) for the first two refinement stages are not encoded, while one bit is used for the phase coefficient, which is enough to encode the phase information exactly in the real case.
Therefore, each of the four possible quantisation stages generates a 2-bit coded message. For the single-stage directional vector quantisation, the best known Grassmannian line packings are used.
Further variations, modifications and additional features will be apparent to the skilled person considering the above disclosure and no statement above is intended to limit the scope of protection sought for the invention, which is to be determined by reference to the appended claims, interpreted in the light of but not specifically limited to, the above description of specific embodiments.
Number | Date | Country | Kind |
---|---|---|---|
0812032.1 | Jul 2008 | GB | national |
This application is a division of and claims the benefit of priority under 35 U.S.C. §120 from U.S. Ser. No. 12/495,042 filed Jun. 30, 2009, the entire contents of which are incorporated herein by reference. U.S. Ser. No. 12/495,042 claims the benefit of priority under 35 U.S.C. §119 from United Kingdom Application No. 0812032.1 filed Jul. 1, 2008.
Number | Date | Country | |
---|---|---|---|
Parent | 12495042 | Jun 2009 | US |
Child | 13610974 | US |