Modem communication systems employing digital systems for providing voice communications, unlike many analog systems, are required to quantify speech objects for transmission and reception. Techniques of Vector Quantization are commonly used to send voice parameters by sending the index representing a finite number of parameters, which reduces the effective bandwidth required to communicate. The reduction of bandwidth is especially attractive on bandwidth constrained channels. Vector quantization is the process of grouping source outputs together and encoding them as a single block. The block of source values can be viewed as a vector, hence the name vector quantization. The input source vector is then compared to a set of reference vectors called a codebook. The vector that minimizes some suitable distortion measure is selected as the quantized vector. The rate reduction occurs as the result of sending the codebook index instead of the quantized reference vector over the channel. The vector quantization of speech parameters has been a widely studied topic in current research. At low rate of quantization, efficient quantization of the parameters using as few bits as possible is essential. Using suitable codebook structure, both the memory and computational complexity can be reduced. However when bit-errors occur within the transmitted vector, an incorrect decoded vector is received resulting in audible distortion in the re-constructed speech. For example, a channel limited to only 3 kHz currently requires very low bit-rates in order to maintain intelligible speech.
The codebook created with the LBG codebook design algorithm results in an ordering that is dependent on the training data and choices made to seed the initial conditions. The gain codebook order that was trained using the LBG algorithm was further randomized using the random function available in the C programming language.
Thus there is a need to improve the bit-error tolerance performance of low-rate vocoders that use Vector Quantization (VQ) in order to reduce the effective bit-rate necessary to send intelligible speech over a bandwidth constrained channel. Likewise, as codebooks increase in size, it becomes a difficult computational task to order the codebooks using current computer techniques, thus there is a need to reduce the computational complexity of ordering codebooks to improve bit-error tolerance performance.
Therefore it is an object of the disclosed subject matter to overcome these and other problems in the art and present a novel system and method for improving the bit-error tolerance of vector quantization codebooks when using a parametric speech model over a bandwidth constrained channel.
It is also an object of the disclosed subject matter to present a novel method to overcome the computational load of a complete solution of locating the optimal codebook ordering that maps vectors with similar Euclidean distance with vector indices with similar Hamming distance. The invention results in a technique that allows ordering of large codebooks such that the distortion of single and many double bit-errors resulting in vectors that have less audible distortion as compared to random ordering.
It is further an object of the disclosed subject matter to present a novel method for improving bit error tolerance of vector quantization codebooks. Embodiments include sorting the codebook vectors based on Euclidian distance from the origin thereby creating an ordered set of codebook vectors and assigning codewords to the codebook vectors in order of their hamming weight and value. A first distortion sum is calculated for all possible single bit errors and a first pair of successive codewords are swapped, and a second distortion sum for all possible single bit errors is calculated. Embodiments of the disclosed subject matter maintain the swapped vectors if the second distortion sum is less than the first distortion sum; thereby creating an improved bit error tolerance codebook.
It is still another object of the disclosed subject matter to present a novel method of transmitting intelligible speech over a bandwidth constrained channel. An embodiment of the method relates quantized vectors of speech to code words, where the quantized vectors approximate in Euclidean distance are assigned to code words approximate in hamming distance; thereby creating an index. Embodiments also encode the speech object by quantizing the speech object and selecting its corresponding codeword from the index and transmitting the codeword over the bandwidth constrained channel for decoding by a receiver using the same index, thereby allowing the transmission of intelligible speech over the bandwidth constrained channel.
Is yet another object of the disclosed subject matter to present a system for vector quantization reordering an LBG codebook to enable communication over bandwidth constrained channels. Embodiments of the system include a processor operably connected to an electronic memory and hard disk drive storage, the hard disk storage containing a computation program; wherein the processor reorders the LBG code book by reassigning quantized vectors close in Euclidian distance to indices close in hamming distance. Embodiments also include an input device operably connected to the hard drive for entering the LBG codebook; and an output operably connected to the processor for storing the reordered codebook.
It is an additional object of the disclosed subject matter to present a novel improvement for a method in a communication system operating over a bandwidth constrained communication channel, of transmitting quantized vectors by transmitting indices corresponding to the quantized vectors. Embodiments of the improvement comprises the step of corresponding quantized vectors close in Euclidean distance to indices close in hamming distance.
These and many other objects and advantages of the present invention will be readily apparent to one skilled in the art to which the invention pertains from a perusal or the claims, the appended drawings, and the following detailed description of the preferred embodiments.
The subject matter of the disclosure will be described with reference to the following drawings:
Embodiments of the disclosed subject matter orders or maps codebook vectors such that they are more immune to channel errors which induce subsequent voice distortion. The decoded vector with channel errors is correlated with the transmitted vector when using the ordered gain codebook. The embodiments of the disclosed subject matter assign (correlate or match) vectors close (or approximate) in Euclidian distance to codewords (indices) close (or approximate) in hamming distance. The hamming distance between two words is the number of corresponding bits which differ between two words (codewords). This distance is independent of the order in which the corresponding bit occur. For example the codewords 0001, 0100 and 1000 are all the same hamming distance from 0000. This reassignment effectively reorders a codebook containing vectors and indices into a new codebook that has its vectors and indices ordered to increase the bit error tolerance of voice signals transmitted using the codebook.
An embodiment for reordering a codebook according to the disclosed subject matter is shown in
The codebook is then sorted in the sort codebook block 702. Block 702 orders the codebook vectors based on their distance from the origin. The codebook vectors are sorted from closest to the origin to farthest. This initial sorting is a precursor that conditions the ordered vectors to reduce the complexity and computational load on the final sorting.
In the embodiment of
In block 711 for successive codewords the vectors are swapped, such that the vector assigned to codeword v(n) is reassigned to codeword v(j) and the vector originally assigned to codeword v(j) is likewise reassigned to codeword v(n).
After swapping vectors, a second distortion sum of the total distance error between the vectors for all possible single bit errors, or double bit errors, is again calculated in block 712, in the same manner as the first distortion sum, this sum D(k), however now includes the effects of the swapped vectors. The sums are then compared in block 713, if the second sum is less than the first sum D(k−1), then the second sum D(k) represents a more favorable assignment of codewords and vectors from the perspective of minimizing distortion cause by single bit errors and the swapped vectors are maintained and D(k−1) is replaced with D(k). If the swap is not advantageous then the vectors are swapped back, again if the first distortion sum includes double bit error, the second sum must likewise include theses double bit error possibilities as well.
The process continues with the next successive codewords until the vectors swapped, or subsequently unswapped, are the last two in the codebook, then difference D(new)−D(old) (D(new)−D(old)=D(m)−D(m−1)) is compared in block 717 to a predetermined value P, if the difference is less than P the process is complete however if the difference is not less than P then D(m−1) is equated to D(m) and the process begins again at block 704 where m is incremented by one.
An exemplary algorithm representing an embodiment of the process described in
Block 701
Initialization: Given the codebook size N and vector length L, the following parameters are computed:
Q=log 2(N)
m=0
n=0
D(old)=MAX FLOAT VALUE
P=0.001
where Q is the length of the codebook index in bits, m, n, and j are counters, and D(k) is the sum of all single bit-error distortion for the current codebook for the kth vector swap
Block 702
Presorting the Codebook Y={y(i); i=, . . . , N−1}{y(i);i=0, . . . , N−1}:
r(0)={if min (dist(0,y(i))) n0=i; a11}{r(0) then is the closest codebook vector to the all zero vector}
r(1)={if min (dist(0,y(i))) n1=i; i<>n0}{r(1) is the second closest to the all zero vector, and so on}
↓
r(N−1)={if min (dist(0,y(i)))nN−1=i;<>n0, n1, . . . , nN−2}
The resulting sorted codebook output from block 702 is a group of N vectors, R={r(i); i=0, . . . , N−1}.
Block 703
Block 704
Increment value of m by one:
m=m+1
Block 710
Compute Sum of all single bit-error distortion:
Block 711
Swap Candidate codebook vectors:
Block 712
Compute sum of all single bit-error distortion D(K) where v(n) and v(j) are swapped.
Block 713, 714 and 715
If D(k)<D(k−1) then D(k−1)=D(k) otherwise undo vector swap
Block 716
If (j==CBSIZE) then (n=n+1, j=j+1)
if(n<(CBSIZE−1) and (j<CBSIZE) then goto block 711)
Block 717
If D(New)−D(old)<P then {D(old)=D(new) and goto block 704}
Block 718
Process complete.
An embodiment of the disclosed subject matter in which the previously described process can be implemented is illustrated in
While preferred embodiments of the present invention have been described, it is to be understood that the embodiments described are illustrative only and that the scope of the invention is to be defined solely by the appended claims when accorded a full range of equivalence, many variations and modifications naturally occurring to those of skill in the art from a perusal thereof.
Number | Name | Date | Kind |
---|---|---|---|
4791654 | De Marca et al. | Dec 1988 | A |
6453287 | Unno et al. | Sep 2002 | B1 |
Number | Date | Country |
---|---|---|
0 294 012 | Jun 1997 | EP |
Number | Date | Country | |
---|---|---|---|
20040153318 A1 | Aug 2004 | US |