Claims
- 1. A multi-channel audio encoder, comprising:a frame grabber that applies an audio window to each channel of a multi-channel audio signal sampled at a sampling rate to produce respective sequences of audio frames; a plurality of filters that split the channels' data frames into respective pluralities of frequency subbands over a baseband frequency range, said frequency subbands each comprising a sequence of subband frames that have at least one subframe of audio data per subband frame; an analyzer that generates an estimated error signal from which it computes a prediction gain for each subframe; a plurality of subband encoders that code the audio data in the respective channels' frequency subbands into encoded subband signals, each subband encoder comprising: a plurality of adaptive differential pulse code modulation (ADPCM) coders that encode the lower frequency subbands, said ADPCM coders' prediction capabilities being disabled when their respective prediction gains are less than a threshold gain thereby forming an adaptive pulse code modulation (APCM) coder that encodes the lower frequency bands; and a plurality of vector quantizers (VQ) that encode the higher frequency bands; and a multiplexer that packs and multiplexes the encoded subband signals into an output frame for each successive data frame thereby forming a data stream at a transmission rate.
- 2. The multi-channel audio encoder of claim 1, wherein the vector quantizer codes all of the subbands above a threshold frequency.
- 3. The multi-channel audio encoder of claim 1, wherein said baseband frequency range has a maximum frequency, further comprising:a prefilter that splits each of said audio frames into a baseband signal and a high sampling rate signal at frequencies in the baseband frequency range and above the maximum frequency, respectively; and a high sampling rate encoder that encodes the audio channels' high sampling rate signals into respective encoded high sampling rate signals, said multiplexer packing the channels' encoded high sampling rate signals into the respective output frames so that the baseband and high sampling rate portions of the multi-channel audio signal are independently decodable.
- 4. The multi-channel audio encoder of claim 1, wherein the multi-channel audio signal is encoded at a target bit rate, further comprising:a global bit manager (GBM) that computes a psychoacoustic signal-to-mask ratio (SMR) for each subframe, and when the ADPCM coder is enabled modifies the SMRs by respective fractions of their associated prediction gains, allocates bits to satisfy each SMR, computes the allocated bit rate over all subbands, and adjusts the individual allocations such that the actual bit rate approximates the target bit rate.
- 5. The multi-channel audio encoder of claim 4, wherein the subband encoder splits each subframe into a plurality of sub-subframes, further comprising:an analyzer that detects transients in the estimated error signal when the ADPCM coder is enabled and in the audio data when the APCM coder is enabled in each sub-subframe, generates a transient code that indicates whether there is a transient in any sub-subframe other than the first and in which sub-subframe the transient occurs, and when a transient is detected generates a pre-transient scale factor for those sub-subframes before the transient and a post-transient scale factor for those sub-subframes including and after the transient and otherwise generates a uniform scale factor for the subframe, said ADPCM and APCM coders using said pre-transien t, post-transient and uniform scale factors to scale the error signal and audio data, respectively, prior to coding to reduce coding error in the sub-subframes corresponding to the pre-transient scale factors.
RELATED APPLICATION
This application is a divisional of application Ser. No. 08/642,254 filed May 2, 1996 entitled MULTI-CHANNEL PREDICTIVE SUBBAND AUDIO CODER USING PSYCHOACOUSTIC ADAPTIVE BIT ALLOCATION IN FREQUENCY, TIME AND OVER THE MULTIPLE CHANNELS. which is hereby incorporated by reference and which is itself a continuation-in-part of provisional application Serial No. 60/007,896 filed Dec. 1, 1995.
US Referenced Citations (7)
Foreign Referenced Citations (3)
Number |
Date |
Country |
0365265 |
Oct 1989 |
EP |
0411998 |
Jul 1990 |
EP |
0520444 |
Jun 1992 |
EP |
Non-Patent Literature Citations (1)
Entry |
“AES Recommended Practice for Digital Audio Engineering—Serial Multichannel Audio Digital Interface (MADI)”, AES Journel of the Audio Engineering Society, 39, No. 5, pp. 371-377 (May 1991). |
Provisional Applications (1)
|
Number |
Date |
Country |
|
60/007896 |
Dec 1995 |
US |