The present invention relates to data processing and more particularly to speech processing in a wireless device.
Wireless devices or mobile stations such as cellular handsets and other wireless systems transmit and receive representations of speech waveforms. A physical layer of a cellular handset typically includes circuitry for performing two major functions, namely encoding and decoding. This circuitry includes a channel codec for performing channel encoding and decoding functions and a vocoder for performing voice encoding and decoding functions. The vocoder performs source encoding and decoding on speech waveforms. Source coding removes redundancy from the waveform and reduces the bandwidth (or equivalently the bit-rate) in order to transmit the waveform in real-time. The channel codec increases redundancy in the transmitted signal to enhance the robustness of the transmitted signal. Synchronizing these two functions allows the system to operate properly.
A number of different wireless protocols exist. One common protocol is referred to as global system for mobile communications (GSM). In a GSM system, the vocoder operates on blocks of speech data that are 20 milliseconds (ms) in duration. The channel codec transmits and receives data every 4.615 ms. Since the speech encoder (i.e., vocoder) serves as a data source to the channel encoder/modulator (i.e., channel codec) and the speech decoder (i.e., vocoder) serves as the data sink for the channel demodulator/decoder (i.e., channel codec), the vocoder and channel codec should be maintained in synchronization.
Further, the speech encoder should deliver data to the channel encoder with sufficient time margin to complete channel encoding and modulation operations before the time at which the data are transmitted over the air. Further complicating the issue are limits on the round-trip delay of the overall communications link. Hence, the vocoder cannot deliver the data too early lest the delay budget (such as that set forth by the European Telecommunications Standards Institute (ETSI)) be violated, and cannot deliver data too late lest the data be discarded. As a practical matter, the later the vocoder delivers data to the channel codec, the harder a digital signal processor (DSP) must work to complete all signal processing on schedule, thus creating a greater system load.
Adaptive multi-rate (AMR) vocoders have been introduced recently in certain cellular communication standards, such as GSM and WCDMA. AMR vocoders support multiple source rates, and compared to other vocoders, provide some technical advantages. These advantages include more effective discontinuous transmission (DTX) because of an in-band signaling mechanism, which allows for powering down a transmitter when a user of a cellular phone is not speaking. In such manner, prolonged battery life and reduced average bit rate, leading to increased network capacity is provided. AMR also allows for error concealment.
In a system supporting AMR, the bit rate of network communications can be controlled by the radio access network depending upon air interface loading and the quality of speech conditions. To handle such different bit rates, the network will send configuration messages to a cellular phone to control its transmission at a selected bit rate. During an AMR voice call, the network may send a message to the mobile station to change the AMR configuration (e.g., source rate). Since both the channel codec and vocoder use this information, careful synchronization is needed between the codec and vocoder during AMR configuration changes.
Accordingly, methods and apparatus to maintain synchronization between channel codec and vocoder would improve performance of a mobile station.
In one embodiment, the present invention includes a method for maintaining a vocoder and channel codec in substantial synchronization. The method may include receiving a configuration message that includes rate information and an indicator corresponding to an effective radio block for application of the rate information at a mobile station, coding a current radio block via a speech encoder and a channel encoder, configuring the speech encoder and the channel encoder with the rate information after performing the coding, and then coding the effective radio block with the rate information. In other embodiments, a different priority of configuration activities may be implemented to maintain the vocoder and channel codec in substantial synchronization.
Other embodiments may be implemented in an apparatus, such as an integrated circuit (IC). The IC may include a vocoder to encode speech blocks and a channel encoder coupled to the vocoder to channel encode the encoded speech blocks. In various embodiments, configuration information may be applied to the vocoder and channel codec to maintain them in substantial synchronization. More specifically, the configuration information may be applied according to a priority to maintain the synchronization, while at the same time performing coding and decoding operations on outgoing and incoming radio blocks in accordance with rate information, for example, indicated by a network. As an example, the IC may take the form of a digital signal processor.
Embodiments of the present invention may be implemented in appropriate hardware, firmware, and software. To that end, a method may be implemented in hardware, software and/or firmware to synchronize a channel codec and vocoder, e.g., of a wireless device. The method may perform various functions including receiving a configuration message from a network at a wireless device and configuring the channel codec and vocoder based upon the configuration message according to a priority so that the channel codec and the vocoder operate in synchronization. For example, the priority may cause the device to schedule encoding configuration prior to decoding configuration. Furthermore, the priority may further schedule the encoding configuration before transmission of an effective radio block for the new configuration, and schedule the decoding configuration after decoding of a radio block immediately prior to the effective radio block.
In one embodiment, a system in accordance with an embodiment of the present invention may be a wireless device such as a cellular telephone handset, personal digital assistant (PDA) or other mobile device. Such a system may include a transceiver, as well as digital circuitry. The digital circuitry may include circuitry such as an IC that includes at least some of the above-described hardware, as well as control logic to implement the above-described methods.
Referring to
While shown as including a number of particular components in the embodiment of
DSP 10 may be adapted to perform various signal processing functions on audio data. In an uplink direction, DSP 10 may receive incoming voice information, for example, from a microphone 5 of the handset and process the voice information for an uplink transmission. This incoming audio data may be converted from an analog signal into a digital format using a codec 20 formed of an analog-to-digital converter (ADC) 18 and a digital-to-analog converter (DAC) 22, although only ADC 18 is used in the uplink direction. In some embodiments, the analog voice information may be sampled at 8,000 samples per second or 8 kHz. The digitized sampled data may be stored in a temporary storage medium (not shown in
The audio samples may be collected and stored in the buffer until a complete data frame is stored. While the size of such a data frame may vary, in embodiments used in a time division multiple access (TDMA) system, a data frame (also referred to as a “speech frame”) may correspond to 20 ms of real-time speech (e.g., corresponding to 160 speech samples). In various embodiments, the input buffer may hold 20 ms or more of speech data from ADC 18. As will be described further below, an output buffer (not shown in
The buffered data samples may be provided to an audio processor 30a for further processing, such as equalization, volume control, fading, echo suppression, echo cancellation, noise suppression, automatic gain control (AGC), and the like. From front-end processor 30a, data is provided to a vocoder 35 for encoding and compression. As shown in
In the downlink direction, incoming RF signals may be received by antenna 70 and provided to RF circuitry 60 for conversion to baseband signals. The transmission chain then occurs in reverse such that the modulated baseband signals are coupled through modem 50, channel decoder 45b of codec 40, vocoder 35 (and more specifically speech decoder 42b), audio processor 30b, and DAC 22 (via a buffer, in some embodiments) to obtain analog audio data that is coupled to, for example, a speaker 8 of the handset.
For purposes of further illustration, the discussion is with respect to a representative GSM/GPRS/EDGE/TDMA system (generally a “GSM system”). However, other protocols may implement the methods and apparatus disclosed herein, particularly where shared configuration information is to be updated in both voice and channel codecs.
A GSM system makes use of a TDMA technique, in which each frequency channel is further subdivided into eight different time slots numbered from 0 to 7. Referring now to
A 26-multiframe is used as a traffic channel frame structure for the representative system. Referring now to
In a GSM system, a speech frame is 20 msec while a radio block is 4 TDMA frames, which is 4*4.615=18.46 msec. Data output from a speech codec is to be transmitted during the next radio block, and every three radio blocks, the TDMA frame or radio block boundary and the speech frame boundaries are aligned.
Consider a first radio block N, e.g., frame 0 to frame 3 (T0-T3) of frame structure 90 of
During an AMR voice call, the network may send a message to the mobile station to request the mobile station to change its AMR configuration. Typically, the configuration message includes rate information (e.g., a new transmission rate) and an identification of a given radio block (corresponding to a transmission time) at which the rate information is to take effect. In order to apply a new AMR configuration, a software routine may be implemented. In one embodiment the routine, AMR configure (AMRCFG), may be used to apply AMR configuration for uplink and downlink. For ease of discussion, configuration for the uplink side may be also referred to herein as ACU and configuration for the downlink side may be also referred to herein as ACD.
In various embodiments, configuration information may be applied according to task execution priorities within the cellular transmission cycle. For example, in one embodiment a task execution priority may proceed as follows: channel encoding may have a highest priority; configuration (i.e., in uplink and downlink directions) may have a next priority; channel decoding may have a next lower priority; and finally, speech encoding/decoding may have a lowest priority. While described with this particular priority in one embodiment, it is to be understood that other priorities may be present in other embodiments. Furthermore, in different embodiments task executions may be moved relative to a given execution priority for multiple radio blocks. That is, in some embodiments tasks for a first block may be performed before tasks (including configuration/reconfiguration) for a subsequent radio block.
Consider now an example where a network sends a configuration message, e.g., an AMR configuration message to a cellular phone. Furthermore, assume that the message instructs the phone to apply the new configuration information to a next radio block N+1. At a frame interrupt for the first TDMA frame of a current radio block N, a DSP kernel schedules channel encoding for block N and ACU. Accordingly, channel encoding for block N and reconfiguration of the speech encoder and channel encoder may be performed during transmission of block N. At the frame interrupt for the first TDMA frame of radio block N+1, the DSP kernel schedules channel encoding for block N+1 (implicitly, speech encoding for block N+1 occurs prior to the channel encoding). Because of the task execution priority, the execution sequence for this example may occur as follows:
In various embodiments, ACU may be performed such that the new configuration is applied to the speech encoder first and then to the channel encoder so that the speech encoder and channel encoder are synchronized.
Referring now to
Still referring to
Next, the current radio block (e.g., radio block N) may be channel encoded using the channel encoder (block 120). As described above, this channel encoding occurs prior to transmission of radio block N. Furthermore, in some embodiments the channel encoding may be scheduled to occur upon receipt of a frame interrupt signal received just prior to the beginning of a transmission period of radio frame N.
To maintain synchronization between speech encoder and channel encoder, the new configuration information may be scheduled to be applied to the speech encoder at a predetermined time after receipt of this frame interrupt signal. That is, to maintain synchronization, the channel encoding of radio block N should occur using the prior configuration information. Furthermore, the channel encoding may be executed with a higher priority than reconfiguration.
Accordingly, at the predetermined time, the new configuration information may be applied to the speech encoder (block 130). For example, in some embodiments the new configuration information may be applied at a predetermined time after receipt of the first frame interrupt signal for radio block N. However the new configuration information may be applied to the speech encoder at various other time instants. Preferably, however, the new configuration information is applied after channel encoding radio block N.
Still referring to
Finally, radio block N+1 may be encoded using the new configuration information in the speech encoder and channel encoder (block 150). In some embodiments, the speech encoding for radio block N+1 may begin during transmission of radio block N. For example, this speech encoding may begin within a third or fourth TDMA frame of radio block N.
In the downlink side, configuration/reconfiguration of a channel decoder and speech decoder may be performed prior to decoding activities for the block in which configuration information is to be effected. Thus, configuration/reconfiguration may be performed after decoding of a previous block (e.g., block N) and during transmission of the radio block in which the new configuration information is to be effected (e.g., a block N+1), but prior to decoding operations for the block. Then, after receiving the last burst in block N+1, the DSP kernel may schedule configuration events, along with channel decoding and speech decoding. Because of the task execution priority, the execution sequence for this example may occur as follows:
Referring now to
As shown in
Thereafter, the new configuration information may be applied to the speech decoder (block 230). For example, in some embodiments reconfiguration of the speech decoder may occur immediately following reconfiguration of the channel decoder. Finally, radio block N+1 may be decoded using the new configuration information in the channel decoder and speech decoder (block 240). In various embodiments, channel decoding may occur for the radio block upon receipt of the last TDMA frame (i.e., T7) for radio block N+1. Upon completion of the channel decoding, speech decoding may then be performed. While described with this particular process in the embodiment of
Referring now to
Still referring to
Still referring to
Still referring to
Using the embodiments of the present invention, synchronization between channel codec and vocoder may be optimally maintained. Such synchronization may further be maintained during configuration, for example, AMR configuration. Accordingly, embodiments of the present invention may be used to maintain the vocoder and channel codec in substantial synchronization. As used herein, the term “substantial” synchronization means that vocoder and channel codec may be synchronized to a level at which adverse performance effects are avoided.
The methods described herein may be implemented in software, firmware, and/or hardware. A software implementation may include an article in the form of a machine-readable storage medium onto which there are stored instructions and data that form a software program to perform such methods. As an example, a DSP may include instructions or may be programmed with instructions stored in a storage medium to perform channel codec-vocoder synchronization in accordance with an embodiment of the present invention.
Referring now to
Incoming RF signals are provided to a transceiver 310 which may be a single chip transceiver including both RF components and baseband components. Transceiver 310 may be formed using a complementary metal-oxide-semiconductor (CMOS) process, in some embodiments. As shown in
In some embodiments, transceiver 310 may correspond to ASIC 15 of
After processing signals received from RF transceiver 312, baseband processor 314 may provide such signals to various locations within system 300 including, for example, an application processor 320 and a memory 330. Application processor 320 may be a microprocessor, such as a central processing unit (CPU) to control operation of system 300 and further handle processing of application programs, such as personal information management (PIM) programs, email programs, downloaded games, and the like. Memory 330 may include different memory components, such as a flash memory and a read only memory (ROM), although the scope of the present invention is not so limited. Additionally, a display 340 is shown coupled to application processor 320 to provide display of information associated with telephone calls and application programs, for example. Furthermore, a keypad 350 may be present in system 300 to receive user input.
While the present invention has been described with respect to a limited number of embodiments, those skilled in the art will appreciate numerous modifications and variations therefrom. It is intended that the appended claims cover all such modifications and variations as fall within the true spirit and scope of this present invention.