Method and apparatus for asymmetric communication of compressed speech

Information

  • Patent Grant
  • 6324515
  • Patent Number
    6,324,515
  • Date Filed
    Tuesday, June 2, 1998
    26 years ago
  • Date Issued
    Tuesday, November 27, 2001
    23 years ago
Abstract
This invention relates to a method and an apparatus for processing digital audio signals that may reduce the signal degradation occurring when the signal is exchanged between two communication terminals equipped with vocoders in a communication network. The solution proposed by this invention is to provide a communication terminal with a vocoder including a decoder section provided with a plurality of decoding units. A switch activates a selected one of the decoding units in dependence of the format of the compressed audio signal data frames received from a remote communication terminal. This system allows the communication terminal to support a number of different speech compression formals. In order to achieve simplicity and low cost, the communication terminal is provided with a single encoding unit. This results in an asymmetric arrangement where the communication terminal has a large number of decoding units than encoding units. The great majority of the speech compression algorithms deployed in wireless and Internet telephony standards have the property that their speech decoders are of far less computational complexity than their respective speech encoder. Therefore, a low-cost terminal can be produced which supports a low complexity speech encoder unit and a variety of speech decoder units.
Description




FIELD OF THE INVENTION




This invention relates to a method and an apparatus for processing digitized voice signals in a communications environment that can be of a wireless nature.




BACKGROUND OF THE INVENTION




In recent years, the telecommunications industry has witnessed the proliferation of a variety of digital vocoders in order to meet bandwidth demands of different wireline, and wireless communication systems. The name “vocoder” stems from the fact that its applications are specific to the encoding and decoding of voice signals primarily. Vocoders are usually integrated in mobile telephones and in base stations of the communication network. They are also found on computer sound cards and used for Internet telephony. They provide high compression of a digitized voice signal as well as the reverse transformation while maintaining acceptable speech quality.




For the purposes of this specification, the term “vocoder” is defined as a speech coding device that includes two main sections, namely an encoder section and a decoder section. The encoder section receives a speech signal in digitized form, such as PCM (pulse code modulation) samples, calculates speech parameters (compressed form of speech) and transmits those parameters on a communication channel. The decoder section receives the compressed form speech parameters and then synthesizes the speech signal. In a specific example, the synthesis operation produces PCM samples. The purpose of the decoder section, therefore, is to effect a transformation that is the reverse of the encoding operation, namely transforming compressed speech frames into an uncompressed speech signal.




The main advantage of compressing speech is that it uses less of the limited channel bandwidth for transmission. The main disadvantage is a reduction in speech quality.




The rapid growth in the diversity of networks and the number of users of such networks is increasing the number of instances where two vocoders are placed in tandem to serve a single connection. Tandem connections of low bit-rate vocoders are known to cause additional distortions and reduce the quality of the speech signal. One example of such a scenario in a wireless context is a wireless-to-wireless link.




In such a case, a first encoder section is used to compress the speech signal generated by the first wireless user. The compressed speech frames generated by the encoder section are transmitted to a base station serving the local wireless terminal and they are then decompressed (converted to PCM format samples) by the decoder section of the local vocoder. The resulting PCM samples arrive at the remote base station serving the second wireless terminal, over the digital trunk of the telephone network. At the remote base station, the PCM samples are compressed by the encoder section of the local vocoder. The compressed speech signal is then transmitted to the second wireless terminal. A the decoder section at the second wireless terminal decompresses the received compressed speech frames to synthesize the original speech signal from the first wireless terminal.




This method of transmitting the speech signals introduces degradation in the speech quality. This is due to the successive compression/decompression cycles of the signal. A possible solution to this problem is to bypass the decoder section of the first base station and the encoder section of the second base station. With this arrangement, compressed speech frames are directly transmitted from one wireless terminal to the other wireless terminal, rather than being converted to PCM samples and transmitted in PCM form through the PSTN network. However, this solution is only feasible when the two mobile terminals, hence the base stations serving them, employ identical vocoders. If the two terminals involved in a connection utilize different vocoders, it is no longer feasible to bypass the intermediate decompression/compression stages. The “bypass” approach is described in the international application serial number PCT/CA95/00704 dated Dec. 13, 1995. The contents of this disclosure are incorporated herein by reference.




Another possible solution is to equip each wireless terminal with a plurality of encoders and decoders selectively used in dependence of the encoders and decoders provided at the remote wireless terminal. At call set-up time, through a combination of in-band and out-of-band signaling and a negotiation protocol, the most suitable common encoders and decoders are selected by each terminal. Unfortunately, this solution may not always be practical since the provisioning of a number of encoders and decoders implies higher costs. In effect, the complexity, and therefore the cost, of encoders is relatively higher than the cost of decoders. This solution hence requires the use of very powerful digital signal processing devices and consumes a great amount of memory capacity.




Thus, there exists a need in the industry for a device and a method capable of improving the voice quality during connections that may include tandemed vocoders, that can be implemented at a relatively low cost.




OBJECTIVES AND SUMMARY OF THE INVENTION




An object of the invention is to provide a novel vocoder that is capable of processing compressed speech frames in a variety of formats.




Another object of the invention is to provide a method for reducing audio signal degradation when an audio signal is transmitted between two communication terminals.




As embodied and broadly described herein, the invention provides a vocoder for processing audio signals, comprising:




a first input for receiving an audio signal;




a second input for receiving compressed audio signal frames;




an encoding section including at least one encoder unit, said encoder unit being coupled to said first input for receiving the audio signal and generating a succession of compressed audio signal frames;




a decoding section including:




a) a group of decoding units, each decoding unit being capable of receiving compressed audio signal frames and generating an audio signal;




b) a switch capable of acquiring a plurality of decoder unit selection positions, in each decoder unit selection position said switch directing the compressed audio signal frames received at said second input to a selected one of said decoder units of said decoding section.




In this specification, the term “wireless terminal” is intended to include both mobile terminals and fixed wireless terminals. The term “wireless terminal” is part of a larger family of terminals referred to herein as “speech compression terminals”. Speech compression terminals comprise vocoders that are capable of converting speech from a digitized format to a compressed format and vice versa. Other examples of speech compression terminals are those used for Internet telecommunications, Integrated Services Digital Network (ISDN) telecommunications, etc.




The expression “audio signal frame” or “audio data frame” will refer to a group of bits organized in a certain structure that conveys some audio information for a segment of a predefined length. Typically, an audio signal frame when representing a segment of audio signal in compressed form will include a coefficients segment and an excitation segment. The audio signal frame may also include additional elements that may be necessary for the intended application.




The expressions “first format”, “second format”, etc. when used to describe the audio signal in compressed form in the format of the encoder section of a given vocoder, refers to signals in compressed form that are, generally speaking, not compatible with each other, although they may share a common basic structure. For example, such signals may be divided into a coefficient segment and an excitation segment. Thus, a vocoder operating with signals under the first format will not, generally speaking, be capable of processing signals expressed under any other format than the first format.




In a most preferred embodiment, the vocoder in accordance with the present invention is provided with an encoder section having a single encoder unit while the decoder section has a plurality of decoder units. This asymmetric vocoder configuration manifests a higher compatibility with other type of vocoders while being relatively unexpensive. The majority of the speech compression/decompression algorithms that are in use in wireless and Internet telephony applications have the property that the speech decompression part of the algorithm requires far less computational complexity than the respective speech compression part. Therefore, a low-cost speech compression terminal can be produced which supports a decoder section having a plurality of decoding units that can process compressed audio frames of different formats.




This solution requires a less powerful digital signal processing device than the previously proposed solutions both in terms of the processing capability and memory. This approach will also allow the extension of the life of existing speech compression terminals; that is, these speech compression terminals, equipped with slower digital signal processors, can be retrofitted by merely updating their Read Only Memory (ROM) containing the processor instructions implementing the compression/decompression algorithm.




By providing the decoding section of the vocoder with a plurality of decoder units, each unit capable of processing a different format of compressed audio signal frames, the vocoder can decompress all audio signal frames in a format other than the format of its encoder section. This feature is particularly useful in speech compression communication terminals, such as a wireless terminal, where a variety of speech encoding/decoding formats exist. Theoretically, the present invention allows to develop a speech compression terminal that could be fully compatible with every terminal existing today by providing the decoder section of the vocoder with a decoder unit for each available compressed audio data frame format.




The selection of the decoder unit to be used during the operation of the vocoder can be made in different ways. One possibility, which is particularly suitable for use in wireless terminals, is to rely on a control signal issued by the base station. Each base station communicates with its associated wireless terminal to instruct the terminal to activate a specific decoder unit of the decoder section. The instructions are communicated through any suitable control signal that is received by the decoder unit switch. When the switch receives the signal it activates the designated decoder unit so the communication can take place. If the speech encoder of one terminal has a matching decoder in the other, that decoder is invoked, eliminating the need for decompression and compression of speech in the base stations for the given direction of transmission.




Exchange of voice data then begins and the base stations enter a handshaking procedure with one another to determine if compressed audio signal frames issued by the other terminal are of the same format. In the affirmative, the base stations proceed to establish the bypass mode, where their vocoders are placed off-line, such that the compressed audio signal frames of each terminal are transported through both base stations without encoding/decoding. Decoding occurs only at the communication terminals.




Another possibility is to allow the base stations and the wireless terminals to operate independently from one another. This variant requires the wireless terminal to have the capability of recognizing the incoming compressed audio data frames so as to determine their format and issue accordingly a control signal to the decoder unit switch so the appropriate decoder unit can be enabled. In a specific example, each compressed audio data frame may be provided with a field containing a combination of bits designating the encoder format that has been used to generate the frame. A wireless terminal receiving the frame can read this tag and dynamically set the decoder unit switch to the appropriate position. In this case, each base stations involved in the call also reads this field and proceeds to establish the bypass mode if the bit combination indicated designates a format that is supported by the wireless terminal associated with the base station.




The present invention also allows the establishment of asymmetric communication which is an advantage particularly in circumstances where asymmetry exists in the transmission medium. One example is the case of a wireless transmission where the channel characteristics in the forward and reverse directions may be different. If, for example, the forward link suffers from higher levels of interference compared to the reverse link, it is possible to select in the forward link an encoder unit/decoder unit pair operating at a lower bit-rate, thereby allowing the allocation of a higher bandwidth to the forward channel vocoder. This will allow an increase in the power of forward error correction in the forward link. Note that in this scenario, the encoder section of each vocoder has more than one encoder unit so as to allow an encoding format selection to be made.




Another example is the case of Internet telephony, where congestion in a given direction may necessitate the selection of a lower bit-rate encoder unit in that direction, in order to trade-off speech quality for a lower rate of packet loss.




Yet another example is when voice and data are simultaneously transferred. In this case a lower bit-rate encoder unit may be advantageously used in one direction for the duration of data transfer.




As embodied and broadly described herein, the invention also provides a method for processing audio information, said method comprising the steps of:




receiving at a first input an audio signal;




processing said audio signal by an encoder section to generate a succession of compressed audio data frames;




providing a decoder section having a plurality of decoder units, each decoder unit being capable of receiving compressed audio signal frames and generating an audio signal;




providing a switch capable of acquiring a plurality of decoder unit selection positions, in each decoder unit selection position said switch directing data containing audio information to a selected one of said decoder units;




receiving a control signal indicative of a decoder unit to be enabled for processing incoming compressed audio signal frames;




setting said switch at a position in accordance with said control signal.




As embodied and broadly described herein, the invention further provides a method for configuring two compressed speech communication terminals for permitting establishment of a call session between the communication terminals, each compressed speech communication terminal comprising a vocoder that includes:




an encoder section for generating compressed audio data frames;




a decoder section, said decoder section including a plurality of decoder units;




said method comprising the step of enabling a selected decoder unit in each communication terminal that supports the compressed audio data frames generated by the encoder section of the other communication terminal.




As embodied and broadly described herein, the invention further provides a base station for use in a communication network, said base station being capable of supporting a call session involving a wireless terminal that exchanges data with said base station through an air interface, said base station including:




an encoder section for receiving an audio signal and for generating compressed audio data frames for transmission to the wireless terminal over the air interface, said encoder section being capable of selectively acquiring an operative mode and a bypass mode, in said operative mode an audio signal input to said base station being processed by said encoder section to generate a succession of compressed audio data frames, in said bypass mode said encoder section being disabled whereby data input to said base station being transmitted without processing by said encoder section to the wireless terminal over said air interface;




a control module capable of recognizing signaling information indicative of an encoder format used to generate compressed audio data frames forwarded to said base station from a source other than the wireless terminal, said control module being responsive to said signaling information for:




a) causing said encoder section to proceed to establish said bypass mode;




b) enabling issuance of a control signal over the air interface toward the wireless terminal instructing the terminal to activate a decoder unit compatible with the encoder format indicated in the signaling information.











BRIEF DESCRIPTION OF THE DRAWINGS





FIG. 1

is a block diagram providing a simplified illustration of a wireless telecommunication network;





FIG. 2

is a block diagram showing two asymmetric vocoders in accordance with an embodiment of the invention, and which are part of compressed speech terminals engaged in a call session;





FIG. 3

is a block diagram showing two base stations in accordance with an embodiment of the invention;





FIG. 4

is a flowchart of a method for effecting decoder unit selection, during a communication between two compressed speech terminal, in accordance with an embodiment of the invention; and





FIG. 5

is a block diagram of a signal processing device built in accordance with an embodiment of the invention and that can be used to implement the function of the vocoders described in FIG.


2


.











DESCRIPTION OF A PREFERRED EMBODIMENT





FIG. 1

is a block diagram providing a simplified view of a wireless telecommunications network.

FIG. 1

shows a wireless terminal


100


communicating over a wireless link


105


(through an air interface) to a Base Station A (BS A)


110


. Wireless link


105


is used to transfer voice data as well as signal and control data. This involves several channels as will be described in detail later. BS A


110


communicates with BS B


140


through the PSTN


120


via physical links


115


and


135


which carry voice data as well as signal and control data. Finally, BS B communicates with wireless terminal


140


over wireless link


135


, which is identical to wireless link


105


in this particular embodiment of the invention.




When a call is made to or from a wireless terminal


100


or


140


, four radio channels are involved in each of the links


105


and


135


between wireless terminals


100


and


140


and their respective base stations


110


and


130


. The channel used to communicate voice data from the BC to the wireless terminal is called Forward Voice Channel (FVC). The channel used to communicate voice data from the wireless terminal to the BS is called the Reverse Voice Channel (RVC). Two other channels carry the handshaking information required to establish communications with wireless terminals. They are the Forward Control Channel (FCC) and the Reverse Control Channel (RCC). Among other things, the FCC and RCC are used to broadcast the Mobile Identification Number (MIN) and the wireless terminal's capabilities. These capabilities would include, for example, the transmission power, the available encoders and decoders, etc. The FCC and the RCC are example of out-of-band communication means for signal and control data. An alternative method of communicating signal and control data would be in-band signaling and involves sending the control information on reserved bits in a compressed audio data frame. This process consists of utilizing certain bits from certain speech samples to transmit signaling information. The location of the signaling bits and the bit robbing rate are selected to reduce the perceptual effect of the bit substitution, such that the audible signal is not significantly affected. The receiver of the compressed audio data frames (either the base station or the wireless terminal, depending upon the direction of the transmission) knows the location of the signaling bits in the compressed audio data frames and it is thus capable of decoding the message.





FIG. 2

is a block diagram showing two asymmetric vocoders in accordance with an embodiment of the invention. Two similar asymmetric vocoders


200


and


220


are shown. For the purpose of this example, assume that each vocoder resides in a wireless terminal. The remaining components of the wireless terminals have not been shown because they are not necessary in the context of this description. The vocoders have the same components except that the encoder unit and decoder unit formats may vary. In order to avoid redundancy, only one of the vocoders will be described in detail here.




The asymmetric vocoder


200


has an encoder section


221


. The encoder section is comprised of a single encoder unit


222


of format A. The encoder section


221


includes an input


230


that receives an audio signal, say in PCM format, and converts that signal into a succession of compressed audio data frames of format A that are issued from output


240


. The vocoder also includes a decoder section


223


that includes a group of decoder units


224


,


226


and


228


of various formats. The purpose of the decoder section is to perform the reverse transformation that is, convert compressed audio data frames received at input


249


to an audio signal such as expressed in PCM format, an output


235


. The actual decoding operation is effected by a decoder unit of the group of decoder units that supports the format of the compressed audio data frames received at the input


249


. A switch functional block


232


, selects the actual decoder unit to be activated to perform the decoding operation of the particular format of compressed audio data frames received at the input


249


.




A signal and control functional block


230


is provided to issue a command signal to the switch


232


to adopt a particular decoder unit selection position. The signal and control functional block


230


is designed to perform a high level function and would normally be shared with other components of the wireless terminal. In other words, the signal and control functional block


230


will not be dedicated to the vocoder but could service the entire wireless terminal. One function that the signal and control functional block


230


performs that is related to the operation of the vocoder


200


is the determination of the position the switch


232


should take based on the format of the compressed audio data frames received at the input


249


. Two different possibilities exist in this regard.




Under the first possibility, the format may be explicitly communicated to the signal and control functional block


230


, either by using out-of-band signaling or in-band signaling. Such explicit communication can originate from any node in the communication network that is capable of remotely configuring the vocoder


200


during call set-up. In a specific example, the base station that supports the wireless terminal will direct the latter to activate a specific one of the decoder units. The base station issues the appropriate control signal explicitly stating the decoder unit to be enabled. The control signal is directed to the signal and control functional block


230


that uses this control signal to set the switch


232


to the appropriate position.




Under a second possibility, the configuration of the vocoder


200


is effected locally without the need of receiving an external command. The local configuration operation involves recognizing the specific format of the compressed audio data frames that arrive at the input


249


and activate the corresponding decoder unit. In a specific example, this can be made by observing each compressed audio data frame (or alternatively one frame in a group of frames) to determine its format. This involves looking for a characteristic information that conveys the particular format of the frame. One possibility is to look for a combination of bits in a field of the frame that signals the particular format of the frame. Once the frame format has been determined, the signal and control functional block


230


will issue a control signal to the switch to


232


so the latter can adopt the appropriate position and enable the decoder unit that is compatible with the frame's format.




The links


242


and


247


carry voice signals and are dotted lines to illustrate that many links and many network components of various types may be involved. The same applies for the signal and control link


250


. In this particular embodiment link


250


is an example of an out-of-band communication means for signal and control data.




The apparatus illustrated at

FIG. 5

can be used to implement the function of the vocoder


200


whose operation is detailed above in connection with FIG.


2


. The apparatus comprises input signal lines


230


and


249


, signal output lines


240


and


235


, a signal input/output line


250


, a processor


514


and a memory


516


. The memory


516


is used for storing instructions for the operation of the processor


514


and also for storing the data used by the processor


514


in executing those instruction. A bus


518


is provided for the exchange of information between the memory


516


and the processor


514


. The instructions stored in the memory


516


allow the apparatus to implement the functional blocks depicted in the diagram at FIG.


2


. Those functional blocks can be viewed as individual program elements or modules that process the data at one of the inputs and issue processed data at the appropriate output.




Under this mode of construction, the encoder unit and the decoder units are actually program elements that are invoked when an encoding/decoding operation is to be performed. The switch functional block


232


is the portion of the overall program that controls which program element corresponding to a particular decoder unit is to be utilized during a given call session.




Other forms of implementation are possible. The encoder unit and the encoder units may be formed by individual circuits, such as microcircuits on a chip. The switch


232


can be any sort of mechanism that can selectively direct the signal to be processed to the appropriate circuit.





FIG. 3

is a block diagram showing two base stations (BS) in accordance with a preferred embodiment of the invention. BS A


110


and BS B


130


shown here are identical. In order to avoid redundancy, only BS A will be described in detail here.




BS A


110


has a processor


300


, a memory


305


, various databases


310


containing information necessary for the operation of the base station, a encoder unit


315


, a decoder unit


320


and a data bus


325


. In the example shown, the encoder unit


315


and the decoder unit


320


are shown as separate components that can be in the form of micro circuits realized on a chip. This form of implementation is shown merely as a possible example and not with the intent to limit the scope of the invention to this particular configuration. The encoder unit and the decoder unit can also be realized through software that is executed by the processor


300


. The processor


300


executes the programs stored in memory


305


and is responsible for controlling all information entering or leaving the base station


110


as well as controlling all information circulating inside BS A


110


on data bus


325


. In addition to storing programs, the memory


305


may act as a buffer for storing data temporarily. The databases


310


may hold information regarding the terminal capabilities or data of the same type.




The encoder unit


315


will convert speech from PCM format to compressed form. The decoder


320


converts compressed speech to PCM format. As stated before, the inputs and outputs to BS A


110


may be a wireless link


105


to a wireless terminal


100


and a second link


115


to the PSTN


120


.





FIG. 4

is a flowchart of a method for establishing decoder unit selection, during a communication between two terminals, in accordance with an embodiment of the invention. In this example, the wireless terminal


100


requests establishment of a call session with wireless terminal


140


. At step


400


, the procedure is initiated by establishing the FCC and the RCC between the calling terminal


100


and BS A


110


as described earlier. At step


402


, data is sent on the RCC from a wireless terminal


100


to inform BS A


110


of, among other things, the capabilities of the calling terminal


100


, the calling number and the called number. The calling terminal


100


capabilities may include the list of encoder units and decoder units available.




The information sent from the wireless terminal


100


to the base station


110


regarding the capabilities of the wireless terminal may be explicit or non explicit. The explicit form of communication involves sending on a control channel the list of all the encoder and decoder units available in the terminal or it may involve simply sending the formats that are being supported by this terminal. Thus, the wireless terminal may send a string of bits that designate the format in which speech is encoded and also the list of the formats that can be handled by the decoder section. The non explicit form of communication involves sending a unique code that can be used by the base station


110


to determine the capabilities of the wireless terminal. This form of a construction may involve the transmission of fewer bits. A further requirement is a look up table in the memory of the base station


110


that contains the list of all of the possible wireless terminal identification codes and each code is associated with the encoding and decoding formats that are supported by that particular wireless terminal. When a wireless terminal sends to the base station its identification code, the base station consults the look up table and determines what are the exact capabilities of that terminal. That information can then be used by the base station to determine whether data packets in compressed form that are received from the base station associated with the called wireless terminal can be sent directly to the calling wireless terminal. This will be discussed in greater detail later.




BS A


110


then requests a connection to the called terminal


140


through the PSTN


120


(step


404


). The request for connection involves sending information regarding the capabilities of the the calling terminal


100


. This information may be sent either over the signaling network of the PSTN or through in-band signaling by using the bit stealing protocol mentioned earlier. At step


406


, the calling terminal


100


capabilities are thus communicated to BS B


130


. At step


408


, the FCC and RCC are established between the called terminal


140


and BS B


130


. The called terminal


140


capabilities are then communicated to BS B on RCC (step


410


). This is effected in a manner similar to the process described in connection with wireless terminal


100


and its associated base station


110


. At this point BS B


130


is aware of the capabilities of both terminals involved in the communication link.




The next step of the process is to communicate to BS A


110


what are the capabilities of the called terminal


140


. This is effected (step


412


) by constructing a message at BS B


130


that contains the relevant information and that is sent to BS A


110


. In a specific example, this message can take the form of an acknowledgement message where BS B


130


advises BS A


110


that the original message (step


406


) has been properly received. At this point, both BS A


110


and BS B


130


have the required information to determine the best encoder/decoder combination in each of the terminals


100


and


140


in order to obtain acceptable speech quality. The following specific example will illustrate the process in greater detail. Assume that the calling terminal


100


has an encoder unit of the format A, and has a decoder section provided with four decoder units, there being a decoder unit for format A, a decoder unit for format B, a decoder unit for format C and a decoder unit for format D. As to the called terminal, (


140


) it has an encoder unit of the format D, and has a decoder section provided with four decoder units, there being a decoder unit for format A, a decoder unit for format B, a decoder unit for format C and a decoder unit for format D. The action that each base station takes can be determined through logic principles with the aid of a simple look-up table. The principles are the following ones:




A) If the remote wireless terminal has a decoder unit of a format supporting the format of the encoder unit for the local wireless terminal, then the decoder section of the base station proceeds to establish the bypass mode, thus the compressed audio signal frames received from the local wireless terminal are transmitted without decoding to the base station supporting the remote wireless terminal;




B) If the local wireless terminal has a decoder unit that can support the encoder format of the remote terminal then the encoder unit of the local base station proceeds to establish the bypass mode, whereby compressed audio frames transmitted from the remote wireless terminal are simply re-transmitted to the local wireless terminal without re-encoding, and the base station signals the local wireless terminal to activate the appropriate decoder unit.




Note that steps A and B of this process do not need to be executed simultaneously such that compressed audio data frames generated by the calling wireless terminal


100


are transmitted throughout the entire communication path, including both base stations without decoding. At the same time, data traveling on the reverse direction may be processed differently. For instance, if the calling wireless terminal


100


does not possess a decoder unit that supports the format of the encoder unit of the called wireless terminal


140


, the base stations then process data from the called terminal to the calling terminal in the default mode where compressed audio data frames received at BS B


130


are decoded into PCM samples, the PCM samples transmitted over the communication network to the BS A


110


and there they are encoded into a format that is supported by the decoder unit of the calling terminal


100


. The compressed audio data frames are then sent over the wireless link to the calling wireless terminal


100


. This mode of communication avoids vocoder tandeming only on one side of the communication path which, objectively is not optimal, however, it demonstrates the flexibility of the system and the usefulness of the asymetry of the vocoder.




For the actual decision making process a lookup table could be used, as mentioned earlier. Such lookup table can be constructed to map the capabilities of the terminals involved in the call session to a set of actions to be taken by a base station. The following is a possible representation of such a look up table.















COMMUNICATION TERMINAL TYPE















0




1




2



















COMMUNICA-




0




Action set 0




Action set 1




Action set 2






TION TERMINAL




1




Action set 3




Action set 4




Action set 5






TYPE




2




Action set 6




Action set 7




Action set 8














In the above table the communication terminal type is a code that uniquely identifies the capabilities of the wireless terminal. For instance, the codes may mean the following:

















WIRELESS




ENCODER




DECODER






TERMINAL CODE




UNIT FORMAT




UNIT FORMATS











0




Format A




Formats A, B, C,








D, E and F






1




Format B




Formats A, B, C,








D, E and F






2




Format C




Formats A, B, C,








D, E and F














Thus, taking the example of BS A


110


, when the calling wireless terminal has been determined to be of type 0 and the called wireless terminal to be of type 0 then the action set 0 is taken by the base station. The action set 0 implies the following:




1) Start handshaking procedure toward setting the encoder unit and the decoder unit of the base station to the bypass mode;




2) Issue a signal to the calling wireless terminal


100


to activate the decoder unit corresponding to format A.




The other action sets do not need to be described because they can be easily derived from the above description.




Referring back to

FIG. 4

the steps


414


-


428


summarize the process for configuring the base stations and the wireless terminals, as described above. More specifically, now that both BS A


110


and BS B


130


know the capabilities of terminals


100


and


140


, they can instruct their respective terminals to select a specific decoder (steps


414


and


416


). At step


418


, a Forward Voice Channel (FVC) and a Reverse Voice Channel (RVC) are established between the base stations and their respective terminals. At step


420


, voice data is exchanged between all entities.




At steps


422


and


424


, BS A


110


and BS B


130


begin the handshaking process to determine if bypass establishment is possible. If it is, the processor


350


will bypass the decoder


370


and encoder


365


of BS B


130


and the processor


300


will bypass the encoder


315


and the decoder


320


of BS A


110


(step


426


). If bypass is not possible, the base stations acquire the default mode (step


428


) where each base station encodes/decodes the compressed audio signal frames without any bypass.




The above description of a preferred embodiment of the present invention should not be read in a limitative manner as refinements and variations are possible without departing from the spirit of the invention. The scope of the invention is defined in the appended claims and their equivalents.



Claims
  • 1. A vocoder for processing audio signals, comprising:a first input for receiving an audio signal; a second input for receiving compressed audio signal frames; an encoding section including at least one encoder unit, said encoder unit being coupled to said first input for receiving the audio signal and generating a succession of compressed audio signal frames; a decoding section including: a) a group of decoder units, each decoder unit being capable of receiving compressed audio signal frames and generating an audio signal, said vocoder comprising more decoder units than encoder units; b) a switch capable of acquiring a plurality of decoder unit selection positions, in each decoder unit selection position said switch directing the compressed audio signal frames received at said second input to a selected one of said decoder units of said decoding section.
  • 2. A vocoder as defined in claim 1, wherein the decoder units are capable of processing compressed audio data frames of different formats.
  • 3. A vocoder as defined in claim 2, wherein said vocoder includes a single encoder unit.
  • 4. A vocoder as defined in claim 3, wherein said switch is responsive to a control signal to acquire a selected decoder unit selection position.
  • 5. A compressed speech communication terminal comprising the vocoder defined in claim 1.
  • 6. A method for processing audio information, said method comprising the steps of:receiving at a first input an audio signal; processing said audio signal by an encoder section to generate a succession of compressed audio data frames, said encoder section including at least one encoder unit; providing a decoder section having a plurality of decoder units, each decoder unit being capable of receiving compressed audio signal frames and generating an audio signal, said decoder section including more decoder units than encoder units; providing a switch capable of acquiring a plurality of decoder unit selection positions, in each decoder unit selection position said switch directing data containing audio information to a selected one of said decoder units; receiving a control signal indicative of a decode unit to be enabled for processing incoming compressed audio signal frames; setting said switch at a position in accordance with said control signal.
  • 7. A method as defined in claim 6, wherein said control signal is issued over an air interface.
  • 8. A method as defined in claim 7, wherein said control signal is generated by a base station of a telecommunication network.
  • 9. A method as defined in claim 8, wherein the decoder units are capable of processing compressed audio data frames of different formats.
  • 10. A method for configuring two compressed speech communication terminals for permitting establishment of a call session between the communication terminals, each compressed speech communication terminal comprising a vocoder that includes:an encoder section for generating compressed audio data frames; a decoder section, said decoder section including a plurality of decoder units; at least one of said two compressed speech communication terminals further including a decoder section having more decoder units than encoder units; said method comprising the step of enabling a selected decoder unit in each communication terminal that supports the compressed audio data frames generated by the encoder section of the other communication terminal.
  • 11. A method as defined in claim 10, wherein each compressed speech communication terminal includes a switch capable of acquiring a plurality of decoder unit selection positions, in each decoder unit selection position said switch directing compressed audio signal frames received from the other compressed speech communication terminal to a selected one of said decoder units of said decoding section, said method comprising the steps of forwarding to the switch of each compressed switch communication terminal a control signal for setting each switch to a selected decoder unit selection position whereby each compressed speech communication terminal is capable of supporting the compressed audio data frames generated by the encoder section of the other compressed speech communication terminal.
  • 12. A method as defined in claim 11, wherein the decoder units are capable of processing compressed audio data frames of different formats.
US Referenced Citations (6)
Number Name Date Kind
5182746 Hurlbut et al. Jan 1993
5455579 Bennett el al. Oct 1995
5479445 Klokler et al. Dec 1995
5742910 Gallant et al. Apr 1998
5799091 Lodenius Aug 1998
6078809 Proctor Jun 2000
Foreign Referenced Citations (2)
Number Date Country
PCTCA9500704 Dec 1995 CA
9163309 A Jun 1997 JP