Claims
- 1. Process for reducing data for transmission or storage of digital audio signals from several interdependent channels, in which blocks of samples of signals in the respective channels are transformed from time domain into a frequency domain representation, whereby a value is determined for each of a plurality of frequency components in each channel, and the values determined for the respective frequency components are coded, taking into account a masking threshold determined by means of a psychoacoustic model, said process comprising the steps of:
- determining a first data rate necessary for separate coding of signals in each of the respective channels, said first data rate being determined collectively for all frequency components of signals in said respective channels;
- determining a second data rate necessary for joint coding of said signals in the respective channels, said second data rate being determined collectively for all frequency components of signals in said respective channels;
- comparing said first and second data rates;
- performing joint coding of said signals for all frequency components of the respective channels so long as the data rate necessary for joint coding of said signals does not exceed the data rate necessary for separate coding by a predetermined threshold value; and
- performing separate coding of said signals for all frequency components of the respective channels, when the data rate necessary for joint coding of said signals exceeds the data rate necessary for separate coding by at least said predetermined value.
- 2. Process according to claim 1 wherein for the comparison of the data rates necessary for separate and joint coding, an estimator is formed, which indicates a number of bits required to code each frequency component, whereby for a predetermined coding process the interference caused by the latter is kept below the masking threshold.
- 3. Process according to claim 2 wherein an estimator SF (Ki, Kj, Kk . . . ) of the necessary data rate for separate coding of signals in the respective channels is formed by addition of estimators SF (Ki), SF (Kj) . . . for signals from the respective channels Ki, Kj.
- 4. Process according to claim 3 wherein joint coding takes place by formation of linear combinations of input signals.
- 5. Process according to claim 2 wherein an estimator SF (Mijk . . . ) of the necessary data rates for the joint coding of signals from channels Ki, Kj, Kk . . . is formed by addition of estimators SF (Mi), SF (Mj), SF (Mk) . . . , in which Mi is the ith matrixed channel.
- 6. Process according to claim 5 wherein signals of several channels Ki, Kj, Kk . . . are jointly coded if the following condition is fulfilled, SF (Mijk . . . ) < C1 SF (Ki, Kj, Kk . . . ) + C2, in which C1 and C2 are predeterminable constants.
- 7. Process according to claim 6 wherein the constant C1 has a value between 1 and 2.
- 8. Process according to claim 6 wherein the constant C2 is zero.
- 9. Process according to claim 2 wherein perceptual entropy of the audio signal is used as the estimator.
- 10. Process according to claim 1 wherein a permitted maximum interference for the decoded signal in the channels is predetermined, the particular masking threshold for said maximum interference in the channels being calculated and used for determination of the estimator for the joint coding.
- 11. Process according to claim 1 wherein said step of determining a first data rate comprises:
- determining an estimated data rate for each respective channel, for separate coding of all frequency components therein; and
- adding said estimated data rates to form said first data rate.
- 12. Process according to claim 1 wherein said step of determining a second data rate comprises:
- combining said signals of said respective channels to form a plurality of combined signals based thereon;
- determining an estimated data rate for each respective combined signal for coding of all frequency components therein; and
- adding said estimated data rates to form said second data rate.
- 13. Process according to claim 12 wherein said combined signals comprise at least a middle and a side signal.
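The decision recited in claims 1, 6, 11, 12 and 13 can be illustrated with a minimal Python sketch for the two-channel case. It is not the patented implementation. The function names (`estimate_bits`, `mid_side`, `choose_coding`) are hypothetical. The estimator is a simple stand-in (half the base-2 logarithm of the signal-to-threshold energy ratio per component), not the perceptual entropy of claim 9. Taking the smaller of the two channel thresholds as the joint masking threshold is likewise an assumption made only for illustration, as is the default C1 = 1.5, an arbitrary value inside the range given in claim 7.

```python
import math


def estimate_bits(values, thresholds):
    """Stand-in estimator SF: bits needed over all frequency components.

    Assumption: each component whose energy exceeds its masking threshold
    costs 0.5 * log2(energy / threshold) bits; masked components cost nothing.
    """
    total = 0.0
    for x, t in zip(values, thresholds):
        energy = x * x
        if energy > t:
            total += 0.5 * math.log2(energy / t)
    return total


def mid_side(left, right):
    """Form middle and side signals as linear combinations (claims 4 and 13)."""
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side


def choose_coding(left, right, thr_left, thr_right, c1=1.5, c2=0.0):
    """Pick one coding mode for all frequency components of the block."""
    # First data rate: sum of the per-channel estimates (claim 11).
    sf_separate = estimate_bits(left, thr_left) + estimate_bits(right, thr_right)

    # Second data rate: sum of the estimates for the combined signals (claim 12).
    mid, side = mid_side(left, right)
    # Assumption: use the stricter of the two thresholds for both combined signals.
    thr_joint = [min(a, b) for a, b in zip(thr_left, thr_right)]
    sf_joint = estimate_bits(mid, thr_joint) + estimate_bits(side, thr_joint)

    # Condition of claim 6: SF(M) < C1 * SF(K) + C2.
    mode = "joint" if sf_joint < c1 * sf_separate + c2 else "separate"
    return mode, sf_separate, sf_joint


if __name__ == "__main__":
    # Identical channels: the side signal vanishes, so joint coding is cheaper.
    print(choose_coding([8, 4, 2, 1], [8, 4, 2, 1], [1.0] * 4, [1.0] * 4))
    # Disjoint spectra: matrixing spreads energy into both combined signals.
    print(choose_coding([1024, 0], [0, 1024], [1.0] * 2, [1.0] * 2))
```

In this sketch the mode is chosen once per block from the summed estimates, mirroring the requirement that both data rates be determined collectively for all frequency components rather than band by band.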
Priority Claims (1)
Number | Date | Country | Kind
42 17 276.4 | May 1992 | DEX |
Parent Case Info
This application is a continuation of application Ser. No. 08/338,618, filed as PCT/DE93/00448 May 18, 1993.
US Referenced Citations (3)
Number | Name | Date | Kind
4942607 | Schroder et al. | Jul 1990 |
5014318 | Schott et al. | May 1991 |
5285498 | Johnston | Feb 1994 |
Non-Patent Literature Citations (2)
Entry |
"Perceptual Transform Coding of Wideband Stereo Signals," James D. Johnston, ICASSP-89, vol. 3, 1989, pp. 1993-1996. |
"Transform Coding of Audio Signals Using Perceptual Noise Criteria," James D. Johnston, IEEE Journal on Selected Areas in Communications, vol. 6, No. 2, Feb. 1988, pp. 314-323. |
Continuations (1)
 | Number | Date | Country
Parent | 338618 | Feb 1995 |