This application claims the benefit under 35 U.S.C. § 119(a) of Korean Patent Application No. 2005-3595, filed on Jan. 14, 2005, in the Korean Intellectual Property Office, the entire disclosure of which is hereby incorporated by reference.
1. Field of the Invention
The present invention relates to voice signal processing. More particularly, the present invention relates to an apparatus and method for converting the data rate of digital voice packets.
2. Description of the Related Art
A variable-rate voice coder supports various data rates for creating voice packets. An exemplary variable-rate voice coder is a Code Excited Linear Prediction (CELP) encoder such as the Qualcomm-Code Excited Linear Prediction (QCELP) and Enhanced Variable Rate Codec (EVRC), both of which are used in Code Division Multiple Access (CDMA) systems. A variable-rate voice coder compresses or decompresses a voice packet rate according to the characteristics of an input voice signal or a system-requested data rate. QCELP and EVRC can generate voice packets at a rate of ½, ¼ or ⅛ a full rate.
CELP-based voice coders such as QCELP or EVRC are the type of voice coders used most often. This coding technology encodes only non-redundant information by comparing the current input voice signal with the previous input voice signal. The coded information is a Linear Prediction Coefficient (LPC) filter parameter, a pitch filter parameter, and a codebook parameter. An LPC filter is known as a Formant filter.
A digital communication system needs a packet rate converter for converting a high-rate compressed voice packet into a low-rate packet to be able to decrease the rate of the high-rate compressed voice packet when requested to do so by the system. On the other hand, since a low-rate compressed voice packet can be sent through a communication path supporting a high rate, no packet rate converter is required in this case.
The PCM-level packet rate converter 110 illustrated in
Accordingly, there is a need for an improved apparatus and method for converting a packet rate such that good voice quality and a low degree of complexity are ensured without any algorithmic delay.
Exemplary embodiments of the present invention address at least the above problems and/or disadvantages and provide at least the advantages described below. Accordingly, an aspect of the present invention is to provide an apparatus and method for converting a packet rate such that good voice quality and a low degree of complexity are ensured without any algorithmic delay.
According to one aspect of an exemplary embodiment of the present invention, in a method of converting the rate of a voice packet, the rate of at least one first element of an input voice packet compressed at a first rate is converted into a second rate at a PCM level, and the rate of a second element of the input voice packet is converted to the second rate at a parameter level. An output voice packet compressed at the second rate is generated by combining the first element and second elements of the second rate.
According to another aspect of an exemplary embodiment of the present invention, in a method of converting the rate of a CELP voice packet, an LPC filter parameter of an input voice packet compressed at a first rate is dequantized and quantized at a second rate. Thus, the LPC filter parameter at the second rate is output. A pitch filter parameter of the input voice packet is dequantized and then quantized at the second rate. Thus, the pitch filter parameter at the second rate is output. A codebook parameter of the input voice packet is decoded and then compressed at the second rate. Thus, the codebook parameter at the second rate is output. The LPC filter parameter, pitch filter parameter, and codebook parameter at the second rate form an output voice packet at the second rate.
Other aspects, advantages, and salient features of the invention will become apparent to those skilled in the art from the following detailed description, which, taken in conjunction with the annexed drawings, discloses exemplary embodiments of the invention.
The above and other objects, features, and advantages of certain embodiments of the present invention will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:
Throughout the drawings, the same drawing reference numerals will be understood to refer to the same elements, features, and structures.
The matters defined in the description such as a detailed construction and elements are provided to assist in a comprehensive understanding of the embodiments of the invention and are merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted for clarity and conciseness.
If a packet rate is changed, there is little difference in the decoding and coding, when compared to the codebook parameter 14, except for the number of quantized bits used and a rate per frame with regard to the LPC filter parameter 10 and the pitch parameter 12. Therefore, parameter-level packet rate conversion of the LPC filter parameter 10 and the pitch parameter 12 has no influence on performance.
Accordingly, in accordance with an exemplary embodiment of the present invention, the rate of the LPC filter parameter 10 and the pitch parameter 12 is converted at a parameter level and the codebook parameter 14 is converted at a PCM level. The combined use of the parameter-level packet rate conversion and the PCM-level packet rate conversion fulfills the desired performance with respect to algorithmic delay, complexity and voice quality.
Referring to
The components of the hybrid packet rate converter 210 will be described in more detail.
The dequantizer 412 dequantizes the pitch bit stream of the input voice packet compressed at the first rate. The time-based converter 414 functions to match the number of pitch filter parameter values per frame at the first rate to the second rate. If the same number of pitch filter parameter values per frame are used at the first and second rates, the time-based converter 414 passes the pitch filter parameter values through a bypass 402. If more pitch filter parameter values per frame are used at the first rate than at the second rate, the time-based converter 414 decimates the pitch filter parameter values through a decimator 404. The quantizer 416 quantizes the time-based converted pitch filter parameter values at the second rate and outputs a pitch bit stream at the second rate.
The codebook target signal synthesizer 560 receives a codebook index CBi1 and a codebook gain CBg1 as the codebook parameter of the input voice packet compressed at the first rate. The rate 1 codebook 522 outputs a first code vector corresponding to the codebook gain CBg1. The codebook gain multiplier 524 multiplies the first code vector by the codebook gain CBg1. For the input of the product, the LPC filter 526 generates the codebook target signal s1(n) using the LPC filter parameter of the second rate ao1 - aoN. Here, n is a frame index.
The rate 2 codebook 512 outputs a second code vector corresponding to a rate 2 codebook index CBi2 under the control of the minimum error detector 518. The codebook gain multiplier 514, multiplies the second code vector by the rate 2 codebook index CBi2. For the input of the product, the LPC filter 516 generates the candidate signal s2(n) using the LPC filter parameter of the second rate ao1 - aoN.
Finally, the combiner 520 generates the error signal e(n) by subtracting the candidate signal s2(n) from the target signal s1(n). The minimum error detector 518 searches for a combination of CBi2 and CBg2 that minimizes the error signal e(n) in the rate 2 codebook 512 by controlling the above operation. The final rate 2 codebook index and codebook gain CBi2 and CBg2 become the codebook parameter compressed at the second rate.
Referring to
Referring to
In accordance with the exemplary embodiments of the present invention as described above, the elements of a compressed voice packet are rate-converted separately at a parameter level and at a PCM level. Therefore, algorithmic delay is reduced, while ensuring good voice quality and relatively low degree of complexity.
While the invention has been shown and described with reference to certain embodiments thereof, it will be understood by those skilled in the art that various changes in, form and details maybe made therein without, departing from the spirit and scope of the invention as defined by the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
2005-3595 | Jan 2005 | KR | national |