Transmitter, encoding system and method employing use of a bit need determiner for subband coding a digital signal

Information

  • Patent Grant
  • 5365553
  • Patent Number
    5,365,553
  • Date Filed
    Wednesday, October 27, 1993
  • Date Issued
    Tuesday, November 15, 1994
Abstract
Transmitter, encoding system and method for subband coding a digital signal. The encoding system includes a splitter for dividing the digital signal into subband signals SB.sub.1, . . . , SB.sub.M ; a quantizer unit for quantizing time-equivalent q sample signal blocks of the subband signals; a bit need determiner and a bit allocator. The bit need determiner determines a bit need b.sub.m which corresponds to the number of bits by which the q samples in a time-equivalent signal block in a subband signal SB.sub.m should be represented, where 1.ltoreq.m.ltoreq.M. The bit allocator allocates n.sub.m bits to each of the q samples of the time-equivalent signal block of subband signal SB.sub.m on the basis of the bit need b.sub.m and an available bit quantity B, n.sub.m being the number of bits by which the q samples in the time-equivalent signal block of subband signal SB.sub.m will actually be represented, where 1.ltoreq.m.ltoreq.M.
Description

BACKGROUND OF THE INVENTION
The present invention relates to an encoding system for subband coding of a wideband digital signal, for example, a digital signal having a specific sampling frequency F.sub.s. The encoding system comprises: (a) a splitter which divides the bandwidth of the wideband digital signal into M successive subbands which augment with frequency, and generates, in response to the wideband digital signal, M subband signals with sampling frequency reduction, each of the subband signals being associated with one of the subbands; (b) a quantizer unit for quantizing block-by-block the respective subband signals, a subband signal SB.sub.m of the subband signals being composed of successive signal blocks, each signal block comprising q samples, the q samples in a quantized signal block of subband signal SB.sub.m each being represented by n.sub.m bits; (c) a bit need determiner for determining bit needs for corresponding (i.e., time-equivalent) signal blocks of the subband signals, a signal block of subband signal SB.sub.m having a bit need b.sub.m which is related to the number of bits by which the q samples in that signal block should be represented; and (d) a bit allocator for allocating bits from an available quantity of bits B to the samples in the time-equivalent signal blocks of the subband signals in response to the bit needs determined by the bit need determiner, such allocation establishing the value of n.sub.m ; where 1.ltoreq.m.ltoreq.M. The encoding system may further comprise a formatting circuit for assembling the quantized samples of the time-equivalent signal blocks to form an output signal having successive frames and including scale factor information in each frame, which scale factor information comprises x-bit words, an x-bit word representing a scale factor associated with the samples in a signal block. An encoding system of the aforementioned type is known from U.S. Pat. No. 4,896,362.
SUMMARY OF THE INVENTION
The invention specifically relates to a bit need determiner for determining the bit needs b.sub.1 to b.sub.M for corresponding (i.e., time-equivalent) signal blocks in the subbands 1 to M on the basis of the output signals (i.e., the subband signals SB.sub.1 to SB.sub.M) of the splitter.
The invention therefore has for an object to derive the bit needs by implementing a novel method.
For this purpose, the encoding system according to the invention is characterized in that the bit need determiner is arranged:
for estimating the power v.sub.m in a signal block of subband signal SB.sub.m in a subband m for corresponding signal blocks of the subband signals in the subbands;
for determining the sample SF.sub.m having the maximum absolute value in the signal block;
for calculating the magnitude w.sub.m according to the formula
$$w_m = \sum_{i=1}^{M} d_{mi} v_i + w_{r.m}$$
for calculating b.sub.m according to the formula
$$b_m = K_1 \cdot \log_2\left(K_2 \sqrt{SF_m^2 / w_m} + K_3\right)$$
wherein d.sub.mi is a matrix coefficient in an M.times.M matrix [D], this matrix coefficient denoting the coefficient by which the power v.sub.i in the subband i is multiplied to calculate the value of the masked power in the subband m as a result of the signal in the block in the subband i; w.sub.r.m is a measure for the masking threshold in the subband m; and K.sub.1, K.sub.2 and K.sub.3 are constants.
It should be observed in this context that the bit needs for the corresponding signal blocks in the various subbands have already been determined in the prior art. However, different algorithms have been used and different assumptions have been made prior hereto.
For example, in the article entitled "Low bit rate coding of high quality audio signal. An introduction to the MASCAM system", by G. Theile et al., published in the EBU Technical Review, No. 230, in August, 1988, a signal-to-noise ratio is determined per subband. This signal-to-noise ratio, expressed in dB, and divided by 6, then yields the bit need in a subband.
The coefficients K.sub.1 and K.sub.2 are preferably selected to be equal to 1 and 1/.sqroot.3, respectively. K.sub.3 has a wider range of possibilities, as this constant has less influence on the eventual result of the encoding. For example, one may take the value of 1 for K.sub.3, or omit K.sub.3 altogether.
For the calculation of the bit needs, a logarithmic representation for the various magnitudes is preferably used. This is advantageous in that a relatively small word length (number of bits by which the various magnitudes are represented) will be sufficient for the various magnitudes, whereas still a sufficient relative accuracy for the bit needs can be realized. This implies that the electronics for the realization of the bit need determiner may be of simpler structure.





BRIEF DESCRIPTION OF THE DRAWINGS
The invention will be further explained in the following descriptions with reference to a number of exemplary embodiments, in which:
FIG. 1 shows an encoding system according to the invention;
FIG. 1a shows the corresponding (i.e., time-equivalent) signal blocks in the subband signals SB.sub.1 to SB.sub.M, each signal block comprising q samples;
FIG. 2 shows the quantization to a three-bit binary representation;
FIG. 3 shows the positions of the bit needs b.sub.1, b.sub.2, . . . , along a value axis;
FIG. 4 shows the method of determining the bit needs b.sub.1, . . . , b.sub.M ;
FIG. 5 shows the method of allocating bits to the signal blocks of the subband signals in the subbands;
FIG. 6 shows the initial bit allocation;
FIG. 7 shows the correction Table to be used for number additions utilizing a logarithmic representation of the numbers;
FIG. 8 shows a functional block diagram of the bit need determiner;
FIG. 9 shows a functional block diagram of the bit allocator;
FIG. 10 shows the use of the encoding system of FIG. 1 in a transmitter in the form of a recording arrangement for recording the quantized subband signals on a magnetic record carrier;
FIGS. 11, 12a, 12b, and 13 show the different allocation stages in dependence on the power value v.sub.i ; and
FIG. 14 shows a functional block diagram of the unit for generating the control signals required in the different allocation stages.





DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
The description of the system in the drawings will relate to the description of the subband encoding of a single digital signal. That is to say, subband encoding of a mono audio signal, or of only the right or left signal portion of a stereo audio signal. This means that in each subband there is only one subband signal present. At the end of this description is an explanation of how the bit need may be determined in case of subband encoding of a stereo signal, it being understood that there are two subband signals in each subband in that situation.
FIG. 1 shows an encoding system according to the invention. A wideband digital signal, such as a digitally sampled audio signal having a bandwidth of approximately 20 kHz, is applied to input terminal 1. Input 1 is supplied with the samples of the digital audio signal having, for example, a sampling frequency of 44 kHz, each sample being, for example, 16 bits. The signal is applied to a splitter 2 which comprises M signal splitter filters. The splitter 2 distributes the digital signal over M subbands by means of the M filters, that is to say, a low-pass filter LP, M-2 band-pass filters BP and a high-pass filter HP, for example. M is, for example, equal to 32. The sampling frequency of the M subband signals is reduced in the blocks referenced 9. In those blocks, the sampling frequency is reduced by a factor M. The signals thus obtained are available at the outputs 3.1, 3.2, . . . , 3.M. At the output 3.1, a signal SB.sub.1 is available in the lowest subband 1. At the output 3.2, a signal SB.sub.2 is available in the lowest but one subband 2. At the output 3.M, a signal SB.sub.M is available in the highest subband M. The subband signals SB.sub.1 to SB.sub.M at the outputs 3.1 to 3.M each include successive samples, each sample being expressed in numbers of 16 bits or over, for example, up to 24 bits. In the exemplary embodiment of the invention under discussion, the subbands 1 to M all have the same width. This is not a necessity, however.
In a publication by M. A. Krasner, entitled "The Critical Band Coder--Digital Encoding of Speech Signals Based on the Perceptual Requirements of the Auditory System" in IEEE ICASSP 80, Vol. 1, pp. 327-331, Apr. 9-11, 1980, a subdivision into a plurality of subbands is provided whose bandwidths approximately correspond to the bandwidths of the critical bands of the human auditory system in the respective frequency areas.
The operation of the splitter 2 will not further be explained since the operation of such a device has already been described extensively hereinbefore. For this purpose, reference is made to the above-mentioned Krasner publication and U.S. Pat. Nos. 4,896,362 and 5,214,678 which are incorporated herein by reference.
The samples in each subband signal are grouped as successive signal blocks of q successive samples, for example q may be equal to 12, see FIG. 1a, and are applied to corresponding quantizers Q.sub.1 to Q.sub.M. In a quantizer Q.sub.m, the samples of a signal block are quantized to obtain quantized samples, each quantized sample having a number of bits n.sub.m which is smaller than 16.
FIG. 2 shows quantization using a 3-bit binary representation. During the process of quantization, the q samples in each of the signal blocks (groups of the subband signals) are each normalized and then quantized using a number of bits, 3 in the example of FIG. 2. Normalization is performed by dividing the amplitudes of the q samples by the amplitude of the sample having the largest absolute value in the signal block. The amplitude of the sample having the largest absolute value in a signal block of a subband signal SB.sub.m yields the scale factor SF.sub.m for that signal block (see copending U.S. patent application Ser. No. 07/997,158 filed Dec. 21, 1992 and incorporated herein by reference). Subsequently, the amplitudes of the normalized samples, which are now situated in an amplitude range from -1 to +1, are quantized according to FIG. 2.
This implies that normalized samples in the amplitude range between -1 and -0.71 are quantized with the 3-bit number 000, samples in the amplitude range from -0.71 to -0.42 are quantized with 001, samples in the amplitude range from -0.42 to -0.14 are quantized with 010, samples in the amplitude range from -0.14 to 0.14 are quantized with 011, samples in the amplitude range from 0.14 to 0.42 are quantized with 100, samples in the amplitude range from 0.42 to 0.71 are quantized with 101 and samples in the amplitude range from 0.71 to 1.00 are quantized with 110. In copending application Ser. No. 07/997,158, mentioned above, three-bit quantization is extensively discussed (see FIGS. 24, 25 and 26 and the relevant descriptions in that document).
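The normalization and seven-level quantization described above can be sketched as follows. This is a minimal illustration, not the circuit of FIG. 2: the function name `quantize_3bit` is hypothetical, the seven intervals are assumed to be of equal width 2/7 (which matches the rounded boundaries 0.14, 0.42 and 0.71), and the handling of an all-zero block is an assumption.

```python
import math

def quantize_3bit(samples):
    """Normalize a signal block by its scale factor and map each sample to a code 0..6.

    Hypothetical helper: assumes seven equal intervals of width 2/7 on [-1, +1].
    """
    scale_factor = max(abs(s) for s in samples)   # SF_m: largest absolute amplitude
    if scale_factor == 0:
        # Assumption: an all-zero block maps to the middle code 011.
        return scale_factor, [3] * len(samples)
    codes = []
    for s in samples:
        x = s / scale_factor                      # normalized amplitude in [-1, +1]
        level = math.floor((x + 1.0) * 3.5)       # index of the interval containing x
        codes.append(min(level, 6))               # x == +1 belongs to the top interval
    return scale_factor, codes

sf, codes = quantize_3bit([-8.0, -4.0, 0.0, 1.6, 4.0, 8.0])
print(sf, [format(c, "03b") for c in codes])
```

Only seven of the eight possible 3-bit codes are used, so the code 111 remains free.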
The quantized samples in the subband signals SB.sub.1 to SB.sub.M are thereafter available at the respective outputs 4.1 to 4.M. See FIG. 1.
Furthermore, the outputs 3.1 to 3.M of splitter 2 are coupled to the respective inputs 5.1 to 5.M of a bit need determining circuit 6. The bit need determining circuit 6 determines the bit need b.sub.m for the corresponding (i.e., time-equivalent) signal blocks of q samples of the subband signals SB.sub.1 to SB.sub.M. The bit need b.sub.m is a number related to the number of bits with which each of the q samples in a signal block in a subband signal should be quantized.
The bit needs b.sub.1 to b.sub.M, derived by the bit need determining circuit 6, are applied to a bit allocation circuit 7. The bit allocation circuit 7 determines the actual number of bits n.sub.1 to n.sub.M with which the q samples of the corresponding signal blocks in the subband signals SB.sub.1 to SB.sub.M are to be quantized. Control signals corresponding to the numbers n.sub.1 to n.sub.M are applied to the respective quantizers Q.sub.1 to Q.sub.M through lines 8.1 to 8.M, so that the quantizers can quantize the samples with the correct number of bits.
The following will provide an explanation of the operation of the bit need determining circuit 6 and the bit allocation circuit 7. The bit needs for the time-equivalent signal blocks of q samples in each of the subband signals SB.sub.1 to SB.sub.M are derived from estimates of the power v.sub.m and the scale factor SF.sub.m of the signal block in the subband signal SB.sub.m.
The power v.sub.m may, for example, be estimated by means of the following formula:
$$v_m = \frac{1}{q} \sum_{i=1}^{q} s_i^2 \qquad \text{EQ. (1)}$$
where s.sub.i is the amplitude of the i.sup.th sample in the q-sample signal block of the subband signal SB.sub.m, the scale factor SF.sub.m being equal to the amplitude of the sample in the signal block having the largest absolute value, as has already been observed hereinbefore. It should be observed in this context that the power v.sub.m in a signal block of the subband signal SB.sub.m might also be estimated by assuming that v.sub.m is equal to the squared scale factor SF.sub.m.
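As an illustration of the two estimates mentioned above, the following sketch computes the scale factor of a block and its power, assuming the power estimate is the mean of the squared sample amplitudes. The function name is hypothetical.

```python
def block_power_and_scale_factor(block):
    """Return (v_m, SF_m) for one q-sample signal block.

    Assumption: v_m is estimated as the mean of the squared sample amplitudes.
    """
    q = len(block)
    power = sum(s * s for s in block) / q          # v_m
    scale_factor = max(abs(s) for s in block)      # SF_m: largest absolute amplitude
    return power, scale_factor

v_m, sf_m = block_power_and_scale_factor([3.0, -4.0, 0.0, 1.0])
print(v_m, sf_m, sf_m ** 2)   # mean-square power, scale factor, and the cruder estimate SF_m squared
```

The cruder estimate SF.sub.m squared never falls below the mean-square value, since no sample in the block exceeds the scale factor in magnitude.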
For all corresponding signal blocks in the subband signals SB.sub.1 to SB.sub.M, the power v.sub.m and the scale factor SF.sub.m are determined in this fashion. The powers are ordered as a vector {v}. By multiplying the vector {v} by an M.times.M matrix [D], one will obtain a vector {w} from the following formula:
{w}=[D]{v}+{w.sub.r }EQ. (2)
In this formula, [D] is a matrix whose coefficients d.sub.ij denote the coefficient by which the power v.sub.j of the q-sample signal block of the subband signal SB.sub.j is to be multiplied to calculate the masked power in the subband i for subband signal SB.sub.i due to the signal in the q-sample signal block of the subband signal SB.sub.j, and w.sub.r.i is the coefficient in the vector {w.sub.r } which denotes the masking threshold in the subband i for subband signal SB.sub.i. Thus, w.sub.r.m bears a relation to the maximum power of a signal in a subband m in which that signal will not be audible.
The vector {w}, therefore, has coefficients w.sub.i which are estimates of the masked quantizing noise in each subband i for subband signal SB.sub.i. Quantizing noise in a subband i for subband signal SB.sub.i which has a power of less than w.sub.i is thus inaudible. The coefficients d.sub.ij of the matrix [D] can be calculated according to the literature; see Philips Journal of Research, Vol. 44, 329-343, 1989, incorporated herein by reference. The bit needs b.sub.1 to b.sub.M can be calculated from the following formula:
$$b_m = K_1 \cdot \log_2\left(\sqrt{\frac{SF_m^2}{3 w_m}} + 1\right) \qquad \text{EQ. (3)}$$
More generally, the following may hold:
$$b_m = K_1 \cdot \log_2\left(K_2 \sqrt{SF_m^2 / w_m} + K_3\right) \qquad \text{EQ. (4)}$$
The former formula, EQ. (3), can simply be derived from the latter formula, EQ. (4), by assuming K.sub.2 =1/.sqroot.3 and K.sub.3 =1. K.sub.1, K.sub.2 and K.sub.3 are constants, for which it holds that K.sub.1 is preferably equal to approximately 1 and K.sub.2 is preferably equal to approximately 1/.sqroot.3. K.sub.3 has a wider range of possibilities. It may be assumed that K.sub.3 will be less than 10, where K.sub.3 will, for example, preferably be taken to be equal to 1 or may be neglected. In the latter case, moreover, the calculation is simpler to implement.
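The calculation of the bit needs from the powers and scale factors might be sketched as below. The form assumed here for the general formula is b_m = K1 * log2(K2 * sqrt(SF_m^2 / w_m) + K3), with w_m obtained row by row from EQ. (2). The function name, the example matrix and the threshold values are purely illustrative and are not taken from the cited literature.

```python
import math

def bit_needs(v, sf, D, w_r, k1=1.0, k2=1.0 / math.sqrt(3.0), k3=1.0):
    """Compute bit needs b_1..b_M (assumed form of the general formula).

    v   : powers v_1..v_M of the time-equivalent signal blocks
    sf  : scale factors SF_1..SF_M
    D   : M x M masking matrix (list of rows), hypothetical values in the example
    w_r : masking thresholds w_r.1..w_r.M, assumed positive enough that w_m > 0
    """
    M = len(v)
    needs = []
    for m in range(M):
        # Row m of {w} = [D]{v} + {w_r}
        w_m = sum(D[m][i] * v[i] for i in range(M)) + w_r[m]
        needs.append(k1 * math.log2(k2 * math.sqrt(sf[m] ** 2 / w_m) + k3))
    return needs

# Illustrative two-subband example with K3 neglected (K3 = 0).
b = bit_needs(v=[4.0, 1.0], sf=[6.0, 1.0],
              D=[[0.0, 0.5], [0.25, 0.0]], w_r=[2.5, 0.0], k3=0.0)
print(b)
```

In this example the second bit need is negative and non-integer, which the formula permits.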
The bit needs b.sub.1 to b.sub.M are obtained in this fashion, and they are situated in a specific amplitude range. They may be negative and non-integers. A bit need b.sub.m bears a relation to the number of bits with which the samples in the q-sample signal block of a subband signal SB.sub.m should be quantized, so that it holds that if b.sub.m1 for subband signal SB.sub.m1 is greater than b.sub.m2 for the subband signal SB.sub.m2, the number of bits with which the q samples in a signal block of the subband signal SB.sub.m1 should be quantized will have to be greater than the number of bits with which the q samples of the time-equivalent signal block of the subband signal SB.sub.m2 should be quantized.
This is shown in qualitative terms with reference to FIG. 3. In FIG. 3, seven bit needs b.sub.1 to b.sub.5, b.sub.max and b.sub.min are plotted along a value axis. b.sub.max is the bit need having the maximum value and b.sub.min is the bit need having the minimum value. It will be noticed that b.sub.min, b.sub.2 and b.sub.5 are negative and that, furthermore, the following holds: b.sub.min <b.sub.5 <b.sub.2 <b.sub.4 <b.sub.1 <b.sub.3 <b.sub.max. In terms of quality, it may now be assumed that q samples in a signal block of subband signal SB.sub.m, with b.sub.m =b.sub.min, should be quantized with the minimum number of bits and the q samples in a signal block of the subband signal SB.sub.m, with b.sub.m =b.sub.max, with the maximum number of bits.
FIG. 4 shows a flow chart of the program of operation of the bit need determining circuit 6 for determining the bit needs b.sub.1 to b.sub.M for time-equivalent q-sample signal blocks of the subband signals SB.sub.1 to SB.sub.M. In this case, only a single q-sample signal block of a subband signal is considered. For each successive q-sample signal block of that subband signal, and the signal blocks of the other subband signals corresponding with the aforesaid signal block (in case of a time-parallel subband signal supply), the operation represented in FIG. 4 will be performed once again.
The operation starts at block 10. First, the running variable m is set to 1 (block 12). Then, the q samples S.sub.1, . . . , S.sub.q in a signal block of a subband signal SB.sub.m are input (block 14) and the power v.sub.m is calculated (block 16). Also the scale factor SF.sub.m (block 18) is determined.
The blocks 14, 16 and 18 are repeated for all corresponding signal blocks of the subband signals via the loop through blocks 20 and 22. If the values v.sub.m and SF.sub.m have been determined for all corresponding signal blocks, the matrix calculation will be performed in order to obtain the vector {w} (block 24).
Subsequently, m is again set to 1 (block 26) and the bit need (b.sub.m) is determined (block 28) for all corresponding signal blocks of the subband signals via the loop through the blocks 30 and 32 after which the operation is terminated (block 34). The bit needs b.sub.m are determined in block 28 according to the formula stated previously in which the constants K.sub.1, K.sub.2 and K.sub.3 are equal to the values of 1, 1/.sqroot.3 and 0, respectively.
The program of FIG. 4 shows the time-consecutive calculations of the coefficients v.sub.1 to v.sub.M in the vector {v}, compare the loop through block 22 in the program, and the method shows the time-consecutive calculations of the bit needs b.sub.1 to b.sub.M, compare the loop through block 32. This is a very suitable method, more specifically, if the corresponding signal blocks having the samples s.sub.1 to s.sub.q for the subband signals SB.sub.1, SB.sub.2, . . . , SB.sub.M-1, SB.sub.M, are applied serially.
If the signal blocks are applied in parallel, the calculation of the coefficients v.sub.1 to v.sub.M could be performed in parallel for all subband signals, and, thus, the loop through block 22 is avoided. Likewise, the bit needs b.sub.1 to b.sub.M may be calculated in parallel, and this will render the loop through block 32 redundant.
The operation of the bit allocation circuit 7 will now be explained. The flow chart of FIG. 5 for a program will be used for this purpose. The program determines for time-equivalent q-sample signal blocks in the subband signals SB.sub.1 to SB.sub.M the values n.sub.1 to n.sub.M from the bit needs b.sub.1 to b.sub.M. Here too, it is a matter of a single signal block having q samples of a subband signal. For directly successive q-sample signal blocks in that subband signal and the time-equivalent signal blocks in the other subband signals, the method of FIG. 5 will be carried out again.
It is now assumed that, after quantization, B.sub.0 bits are available for transmitting the overall information connected with the M signal blocks of q samples of, for example, 24 bits each. Assuming that after quantization, R bits are available per sample, averaged over the subbands, it holds that B.sub.0 is equal to the largest integer smaller than M.q.R.
In copending application Ser. No. 07/997,158, it is shown that not only are the quantized samples transmitted, but also the scale factors SF.sub.1 to SF.sub.M (scale factor information) and the bit allocation information (that is to say, information which bears a relation to the number of bits with which the samples in a signal block in a subband signal are to be quantized, i.e., the values n.sub.1 to n.sub.M). The bit allocation information is then represented by y=4 bits for each n.sub.m. Thus, this implies that, actually, only B=B.sub.0 -y.M bits are available for the transmission of M signal blocks of quantized subband signals and the scale factor information.
Copending application Ser. No. 07/997,158 further describes that the y-bit number (y=4) 0000 in the bit allocation information denotes that no samples are transmitted in the relevant subband. In that case, no scale factor information for that subband will be transmitted either. The scale factor information for a subband is represented by means of an x-bit number (x=6).
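The bit budget described above reduces to two lines of arithmetic. The sketch below assumes that B.sub.0 is obtained by rounding M.q.R down to an integer; the function name and the value R = 2.5 bits per sample are hypothetical and serve only as an example.

```python
import math

def available_bits(M=32, q=12, R=2.5, y=4):
    """Return (B0, B) for one group of M time-equivalent signal blocks.

    Assumptions: B0 is M*q*R rounded down; R = 2.5 is an example value only.
    """
    b0 = math.floor(M * q * R)   # bits available for all information of the M blocks
    b = b0 - y * M               # bits left after y allocation bits per subband
    return b0, b

print(available_bits())
```

With these example values, 128 of the 960 bits are spent on bit allocation information before any sample or scale factor is coded.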
The method of bit allocation is as follows. The method starts at block 40, FIG. 5. First, all numbers n.sub.m are set to zero. Then an initial bit allocation is performed at block 44. (This initial bit allocation will be explained later). Then, the maximum bit need is determined. This is the bit need b.sub.j. In the example of FIG. 3, this would be b.sub.max. Next, it is considered whether n.sub.j is greater than or equal to a certain value n.sub.max (block 48). In the present example, n.sub.max is equal to 16. This means that the quantized samples can only be represented by binary numbers with a maximum of 16 bits.
If n.sub.j is greater than or equal to n.sub.max, the q-sample signal block of the subband signal SB.sub.j will be excluded from the allocation of further bits. For this purpose, the bit need b.sub.j is made equal to a so-called "flag value" (block 66). The flag value is represented in FIG. 3 and is a value smaller than the minimum bit need b.sub.min. If c.sub.1, in the block 56 to be discussed hereinbelow, is greater than unity, n.sub.j might be greater than n.sub.max. In that case, n.sub.j will be made equal to n.sub.max at block 66.
If n.sub.j is equal to zero (block 50), the program will proceed through the blocks 52 and 54. At block 54, a.sub.1 bits are initially allocated to the signal block of subband signal SB.sub.j. This means n.sub.j=a.sub.1. The total number B of available bits now decreases by a.sub.1.q+x. The q quantized samples of the signal block in the subband signal SB.sub.j are represented each by a.sub.1 bits and, in addition, an x-bit-long scale factor SF.sub.j is to be added. Furthermore, in block 54, the bit need b.sub.j is decreased by a value a.sub.2. If n.sub.j is unequal to zero, the program will proceed through block 56. The number of bits n.sub.j is now increased by c.sub.1. The total number B of available bits now decreases by c.sub.1.q, due to the fact that the q quantized samples of a signal block are now represented by an additional number of c.sub.1 bits.
Naturally, bit allocation only takes place if there are still sufficient bits available. Therefore, block 52 is present. If there are insufficient bits available the program will proceed through block 66 at which the relevant bit need b.sub.j is again made equal to the flag value. The signal block in the subband concerned is then excluded from further bit allocation.
As long as there are bit needs that have values greater than the flag value (block 58) and as long as there are still sufficient bits available (block 60), the program will return through circuit 62 to block 46 for a next calculation of the maximum bit need. If all bit needs b.sub.m are smaller than or equal to the flag value, the program will stop. The program will also stop if there are insufficient bits to be allocated (block 60).
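The loop of FIG. 5 described above might be sketched as follows. This is an illustrative reading, not the program of the flow chart itself: the function name, the default values of a1, a2, c1 and c2, the decrement c2 of the bit need at block 56, the use of minus infinity as the flag value, and the interpretation of "sufficient bits" at block 60 as at least c1*q bits are all assumptions. The initial allocation of block 44 is omitted.

```python
def allocate_bits(needs, B, q=12, x=6, a1=2, a2=2, c1=1, c2=1, n_max=16):
    """Greedy bit allocation sketch; returns (n, remaining B)."""
    FLAG = float("-inf")              # assumed flag value, below any real bit need
    b = list(needs)
    n = [0] * len(b)                  # all n_m start at zero
    while any(bm > FLAG for bm in b):                     # block 58
        j = max(range(len(b)), key=lambda m: b[m])        # block 46: maximum bit need
        if n[j] >= n_max:                                 # block 48
            n[j] = n_max                                  # clip a possible overshoot
            b[j] = FLAG                                   # block 66: exclude the block
        elif n[j] == 0:                                   # block 50
            cost = a1 * q + x                             # q samples plus one scale factor
            if cost > B:                                  # block 52: insufficient bits
                b[j] = FLAG
            else:                                         # block 54: first allocation
                n[j] = a1
                B -= cost
                b[j] -= a2
        else:                                             # block 56: further allocation
            n[j] += c1
            B -= c1 * q
            b[j] -= c2                                    # assumed decrement of the bit need
        if B < c1 * q:                                    # block 60: stop when bits run out
            break
    return n, B

print(allocate_bits([3.0, 1.0, -2.0], B=100))
```

In this example the first allocation to a block costs 30 bits (2 bits for each of 12 samples plus a 6-bit scale factor), and each later increment costs 12 bits.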
The method is characterized in that when a first bit allocation is performed (block 54), the number of allocated bits (a.sub.1) is greater than the number of bits of one or more subsequent allocations (block 56) (c.sub.1), worded differently a.sub.1 >c.sub.1. Furthermore, it holds that a.sub.2 is greater than or equal to unity. Preferably, a.sub.1 is equal to a.sub.2 and c.sub.1 equal to c.sub.2. a.sub.1, a.sub.2, c.sub.1 and c.sub.2 are numbers greater than zero. a.sub.1 and c.sub.1 are preferably integers. But, this is not a necessity. An example may be shown for this purpose.
It is assumed that one wishes to quantize the q samples in a signal block in five levels. For this purpose, 3 bits are needed per sample. However, this is not an efficient encoding because a subdivision into seven levels of 3 bits is possible.
If, however, three samples are combined, these three samples each with five signal levels will present 125 options. These 125 options may be represented by means of a 7-bit binary number. Thus, no more than 7/3 bits per sample. n.sub.m would in that case be equal to 7/3. This will provide a more efficient encoding.
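The grouping of three five-level samples into one 7-bit word can be illustrated as below. The base-5 packing order and the function names are assumptions made for the sake of the example; the text above only requires that the 125 combinations be mapped onto a 7-bit number.

```python
def pack_three(levels, base=5):
    """Combine three quantization levels (each 0..base-1) into one integer code."""
    a, b, c = levels
    return (a * base + b) * base + c      # 0..124 for base 5, which fits in 7 bits

def unpack_three(code, base=5):
    """Recover the three levels from a packed code."""
    code, c = divmod(code, base)
    a, b = divmod(code, base)
    return a, b, c

code = pack_three((2, 0, 3))
print(format(code, "07b"), unpack_three(code))
```

The largest code, obtained for the levels (4, 4, 4), is 124, which is below 2 to the power 7 = 128, so 7 bits suffice for three samples instead of 9.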
It has been indicated hereinbefore that when the quantized samples are transmitted, both scale factor information and bit allocation information are co-transmitted. The scale factor information then has the form of x-bit words, of which each x-bit word denotes a scale factor SF.sub.m that belongs to the q samples in a signal block of the subband signal SB.sub.m. The bit allocation information then has the form of y-bit words, of which each y-bit word denotes a number of bits n.sub.m by which each sample in a signal block of the subband signal SB.sub.m is represented. This is described in copending application Ser. No. 07/997,158.
If in the bit need determining circuit 6 only the scale factors SF.sub.m are used for calculating the powers v.sub.m, that is to say, because v.sub.m is assumed to be equal to the square of SF.sub.m, the bit allocation information need not be co-transmitted. On the receiver side, the bit needs b.sub.1 to b.sub.M can be derived from the transmitted scale factors SF.sub.m and, on the basis of these needs, the magnitudes n.sub.1 to n.sub.M, while implementing the calculation method as discussed hereinbefore. Thus, the receiver likewise comprises a bit need determiner which derives the powers v.sub.m from the scale factors SF.sub.m and derives from these powers the bit needs b.sub.m, and also includes a bit allocator which is capable of calculating the magnitudes n.sub.1 to n.sub.M on the basis of the bit needs b.sub.m and the available bit quantity, which in this case is equal to B.sub.0. Since, as observed hereinbefore, B=B.sub.0 -y.M, the latter method is advantageous in that more bits can be allocated to the subbands.
It may sometimes be necessary to pre-allocate a number of bits to a signal block of a subband signal SB.sub.m, for example, in the case where there are signal blocks which must be quantized with more than zero bits irrespective of their bit need. The reason for this is that the signal blocks (namely subsequent signal blocks of one subband signal) must not be switched on or off in an unqualified manner. This would produce audible effects.
It may sometimes also be necessary or useful to exclude a signal block in advance from bit allocation.
For these purposes, block 44 in the program of FIG. 5 is inserted. FIG. 6 shows an elaboration of block 44. In FIG. 6, the first two signal blocks are the signal blocks of the subband signals SB.sub.k and SB.sub.l, to which a number of bits A.sub.k0 and A.sub.l0, respectively, have been pre-allocated. This implies that n.sub.k =A.sub.k0 and n.sub.l =A.sub.l0. From the bit needs b.sub.k and b.sub.l, the respective values A.sub.k1 and A.sub.l1 are subtracted, and the remaining number of bits B is reduced by A.sub.k0.q+x and A.sub.l0.q+x, respectively. Actually, for A.sub.k0 and A.sub.l0 the same holds as for a.sub.1. Preferably, A.sub.k0 =A.sub.l0 =a.sub.1. For A.sub.k1 and A.sub.l1 the same holds as for a.sub.2. Preferably, A.sub.k1 =A.sub.l1 =a.sub.2. The signal blocks of the subband signals SB.sub.k and SB.sub.l may naturally be allocated more bits, as required, at block 56 of the method presented in FIG. 5.
Furthermore, at block 44 of FIG. 6, it is shown that the signal block of the subband signal SB.sub.f is excluded from bit allocation. For this purpose, the bit need b.sub.f for this signal block is made equal to the flag value.
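The initial allocation of block 44 might be sketched as follows, taking the preferred case in which every pre-allocated block receives a.sub.1 bits per sample and has a.sub.2 subtracted from its bit need. It is assumed here that each pre-allocated block costs a1*q bits for the samples plus x bits for its scale factor, as in block 54 of FIG. 5. The function name, the argument names and the use of minus infinity as the flag value are hypothetical.

```python
def initial_allocation(needs, B, preallocated, excluded, q=12, x=6, a1=2, a2=2):
    """Sketch of block 44: pre-allocate bits to some blocks and exclude others.

    preallocated : indices of subbands that must receive bits irrespective of need
    excluded     : indices of subbands that must receive no bits at all
    Returns (n, b, B) to be passed on to the main allocation loop.
    """
    FLAG = float("-inf")          # assumed flag value, below any real bit need
    b = list(needs)
    n = [0] * len(b)
    for k in preallocated:
        n[k] = a1                 # n_k = A_k0 (here a1)
        b[k] -= a2                # bit need reduced by A_k1 (here a2)
        B -= a1 * q + x           # assumed cost: samples plus one scale factor
    for f in excluded:
        b[f] = FLAG               # excluded from any further allocation
    return n, b, B

print(initial_allocation([3.0, 1.0, -2.0, 0.5], B=100, preallocated=[1], excluded=[3]))
```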
FIGS. 11, 12 and 13 represent the situations in which there is initial bit allocation, no initial bit allocation or no bit allocation to the subbands. The Figures show the consecutive time intervals .increment.T in which a group of M corresponding signal blocks of the M subband signals are processed. In each time interval the power v.sub.i (t) and the magnitude w.sub.i (t) are determined for each subband signal SB.sub.i block. If v.sub.i (t) is greater than w.sub.i (t), there will be initial bit allocation to the subband signal SB.sub.i block. As will be evident from FIG. 11, this holds for periods of time situated before t=t.sub.1.
FIG. 14 is a block diagram of a circuit by means of which, on the basis of the magnitudes v.sub.i and w.sub.i, control signals may be derived which denote whether initial bit allocation is to take place, in which case the output of the SR flip-flop 140 is "high" or "logic 1"; whether no bit allocation is to take place, in which case the output of the SR flip-flop 141 is "high"; or whether no initial bit allocation is to take place, in which case the output of a counter 142 is "high". In the latter case, bits may still be allocated to the subband signal block in question, but the allocation then takes place at block 54 and/or at block 56 according to the method of FIG. 5. These control signals may thus be applied to block 44 in FIG. 6 and denote what functions are to be performed in this block.
At the instant t=t.sub.1, FIG. 11, v.sub.i (t) becomes smaller than w.sub.i (t). The output 144 of the comparator 143 now becomes "low" whereas the output 145 of this comparator becomes "high". Through the OR-gate 147, this "high" signal is applied to the AND-gate 148 so that clock pulses are passed to the AND-gate 149 at a rate f equal to 1/.increment.T. Since a "high" signal is applied to the other input of the AND-gate 149 through the inverter 150, the clock pulses are passed to the input 151. The counter 142 now counts down from the initial position 5 (decimal) under the influence of the clock pulses. Since the output of the counter 142 remains "low", the position of the flip-flop 140 does not change so that the initial bit allocation is maintained.
One time interval later, v.sub.i (t) is again larger than w.sub.i (t). The output 144 of the comparator 143 becomes "high" again, which implies that the rising edge is fed to the set input of the counter 142 through the OR-gate 152. The count of counter 142 is reset to 5 (decimal). At the instant t.sub.2, FIG. 11, v.sub.i (t) again becomes smaller than w.sub.i (t). Now, v.sub.i (t) remains smaller than w.sub.i (t) for a sufficiently long period of time to make it possible for the counter 142 to count back until the 0 count (decimal) is reached. This is reached at the instant t=t.sub.3, FIG. 11. At that moment, the output of the counter 142 becomes "high". The flip-flop 140 is reset. Through inverter 150 and AND-gate 149 the counting operation of counter 142 is blocked so that it retains the 0 count.
Initial bits are no longer allocated to this subband signal block. At the instant t=t.sub.4, FIG. 11, v.sub.i (t) again becomes larger than w.sub.i (t). The counter 142 is reset to the count 5 and, in addition, the flip-flop 140 is set in a manner so that initial bits are again allocated.
FIG. 12a represents a situation in which v.sub.i (t), prior to the instant at which the counter 142 is reset to zero, already becomes smaller than a specific threshold value v.sub.thr. At the instant t=t.sub.5, the output 145 of the comparator 143 becomes "low" again and the output 146 "high". Since the inverter 153 applies a "high" signal to one input of the AND-gate 154, the "high" signal is conveyed to the AND-gate 148 through the AND-gate 154 and the OR-gate 147. The counter 142 continues to count. The phase of the initial bit allocation is thus maintained until the count 0 (decimal) is reached. The output of the counter 142 now briefly rises. This entails that flip-flop 141 is set through the AND-gate 155. Through the AND-gate 156 and the OR-gate 152 the "high" signal of the flip-flop 141 is applied to the set input of the counter 142, which immediately afterwards jumps to count 5 (decimal). In addition, the further down-counting of the counter 142 is blocked because the inverter 153 applies a "low" signal to the one input of the AND-gate 154. From instant t.sub.6 onwards there is no bit allocation whatsoever to the relevant subband signal block.
FIG. 12b represents the situation in which v.sub.i (t) has remained in the range between v.sub.thr and w.sub.i (t) sufficiently long so that the phase of "no initial bit allocation" has commenced. At the instant t.sub.7, v.sub.i becomes smaller than v.sub.thr. At that moment, the output 145 becomes "low" and the output 146 "high". The flip-flop 141 is then set through the AND-gate 155 and the counter 142 is reset to the count 5 through the AND-gate 156 and the OR-gate 152. The output of the counter 142 thus becomes "low" and the output of the flip-flop 141 "high". There is no bit allocation.
FIG. 13 shows a situation in which v.sub.i (t) increases again. At the instant t.sub.8, v.sub.i (t) becomes larger than v.sub.thr. The output 145 becomes "high" so that the counter 142 may count down. One time interval later v.sub.i (t) is again smaller than v.sub.thr. The output 146 becomes "high" again so that the counter is reset to the count of 5 through the AND-gate 156 and the OR-gate 152. If v.sub.i (t) is greater than v.sub.thr for a sufficiently long period of time, the counter 142 can count down to zero. At t=t.sub.8, the output of counter 142 becomes "high". Through the AND-gate 159, to which a "high" signal is applied through the inverter 158, the flip-flop 141 is reset, so that at this moment the phase of "no bit allocation" is terminated and changed into the phase of "no initial bit allocation".
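The switching between the three phases described with reference to FIGS. 11 to 14 may be summarized as a small state machine. The following Python sketch is illustrative only: it models the described behaviour, not the gate-level circuit of FIG. 14. The names `Phase`, `HOLD`, `step` and `run` are hypothetical, the treatment of exactly equal values is an assumption, and it is assumed that the phase of "no bit allocation" is left solely on the basis of v.sub.i exceeding v.sub.thr.

```python
from enum import Enum


class Phase(Enum):
    INITIAL = "initial bit allocation"
    NO_INITIAL = "no initial bit allocation"
    NONE = "no bit allocation"


HOLD = 5  # start value of the down counter, as given in the text


def step(phase, count, v, w, v_thr):
    """Advance one time interval and return the new (phase, count)."""
    if phase is Phase.INITIAL:
        if v > w:
            return Phase.INITIAL, HOLD          # counter is set back to 5
        count -= 1                              # v below w: count down
        if count > 0:
            return Phase.INITIAL, count         # phase is maintained
        if v < v_thr:
            return Phase.NONE, HOLD             # situation of FIG. 12a
        return Phase.NO_INITIAL, 0              # situation of FIG. 11
    if phase is Phase.NO_INITIAL:
        if v > w:
            return Phase.INITIAL, HOLD          # initial allocation resumes
        if v < v_thr:
            return Phase.NONE, HOLD             # situation of FIG. 12b
        return Phase.NO_INITIAL, 0
    # Phase.NONE: v must stay at or above v_thr for HOLD intervals (FIG. 13)
    if v < v_thr:
        return Phase.NONE, HOLD
    count -= 1
    if count > 0:
        return Phase.NONE, count
    return Phase.NO_INITIAL, 0


def run(values, w, v_thr):
    """Return the phase after each interval for a sequence of powers v."""
    phase, count = Phase.INITIAL, HOLD
    phases = []
    for v in values:
        phase, count = step(phase, count, v, w, v_thr)
        phases.append(phase)
    return phases
```

In this sketch a single interval in which v.sub.i drops below w.sub.i does not end the initial bit allocation; only five consecutive such intervals do, which mirrors the hold-off behaviour of the counter 142.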
In the following, a simplified calculation of the bit need b.sub.m will be explained. In this calculation, a logarithmic representation is used for the various magnitudes which play a part in the calculation. This is possible because with respect to the calculation of the bit needs b.sub.1 to b.sub.M the concern is relative, not absolute, precision of the bit needs.
In logarithmic representation, a number g is approximated by g=r.sup.k, where r is a fixed base greater than unity and the power k is selected to be an integer. The number g is approximated in the best possible way by a correct choice of k. The integer k is used as a representation of g. In the calculation for the bit need b.sub.m, there are both multiplications of two numbers and additions of two numbers. Multiplications in the logarithmic representation correspond with the addition of the powers. That is to say, if g.sub.1 =r.sup.(k.sub.1) and g.sub.2 =r.sup.(k.sub.2), the logarithmic representation of the product g.sub.1 g.sub.2 will be equal to k.sub.1 +k.sub.2.
For the logarithmic representation of the addition of these two numbers g.sub.1 and g.sub.2, the following holds. Assuming that g.sub.1 >g.sub.2, it holds that g.sub.1 +g.sub.2 =r.sup.(k.sub.1 +T(k.sub.1 -k.sub.2)). The logarithmic representation for g.sub.1 +g.sub.2 is thus equal to k.sub.1 +T(k.sub.1 -k.sub.2), where T(k.sub.1 -k.sub.2) is a correction factor in the form of an integer which may be derived from a Table. FIG. 7 shows a Table of this type for r=2.sup.1/16. The value for r equalling 2.sup.1/16 can be obtained from an accuracy analysis of the bit needs b.sub.m.
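By way of illustration, multiplication and addition in the logarithmic representation may be sketched in Python as follows. The function names are hypothetical, and the correction term is computed here by rounding 16 times the base-2 logarithm of 1+r.sup.-d to the nearest integer; this is an assumption about how a Table such as that of FIG. 7 could be filled, not a reproduction of its contents.

```python
import math

STEPS_PER_OCTAVE = 16  # r = 2**(1/16), the base given in the text


def to_log(g):
    """Integer k such that r**k best approximates the positive number g."""
    return round(STEPS_PER_OCTAVE * math.log2(g))


def correction(d):
    """Correction term T(d) for a non-negative difference d = k1 - k2.

    Assumed form: the integer nearest to log_r(1 + r**(-d)).
    """
    return round(STEPS_PER_OCTAVE * math.log2(1 + 2 ** (-d / STEPS_PER_OCTAVE)))


def log_mul(k1, k2):
    """Representation of g1 * g2: the powers are simply added."""
    return k1 + k2


def log_add(k1, k2):
    """Representation of g1 + g2: larger power plus a looked-up correction."""
    if k1 < k2:
        k1, k2 = k2, k1
    return k1 + correction(k1 - k2)
```

For two equal numbers the correction is 16, that is, one doubling; for numbers that differ greatly the correction vanishes, so that only a restricted number of Table entries is non-zero.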
It may further be inferred that the calculation of the bit need b.sub.m in a logarithmic representation with a properly selected base r, in lieu of the customary calculations in a linear fixed-point representation, considerably reduces the word widths of the numbers. In addition, no multiplier-accumulator is necessary for calculating the vector {w}, but only a simple accumulator and a Table having a restricted number of entries. The Table of FIG. 7, for example, narrows down to a ROM having contents smaller than 0.5 kbit. The series of numbers stored in the ROM is relatively small. In addition, these numbers are arranged in a specific order. Therefore, it is possible to reduce the look-up Table even more at the cost of some logic.
It should be observed in this context that the logarithmic representation for the addition of two numbers as described hereinbefore is known per se by the name of Zech logarithm and described in the F. J. MacWilliams et al. publication entitled "The Theory of Error Correcting Codes" (North Holland Publishing Co. 1983), compare, more specifically, chapter 3, section 4, page 191.
The bit need determining circuit 6 and the bit allocation circuit 7 may be realized as software. However, hardware designs are also possible. For example, FIG. 8 shows a hardware design of the bit need determining circuit 6.
FIG. 8 shows the corresponding signal blocks of the subband signals SB.sub.1 to SB.sub.M, which are serially applied to input 70. The first sample s.sub.1 of the subband signal SB.sub.1 is applied first and the last sample s.sub.q of the subband signal SB.sub.M is applied last.
In the largest sample determining unit 71, the largest sample, i.e., SF.sub.m, is determined for each signal block, which value is then stored in a memory 72. In a squaring unit 73, the samples are squared and thereafter applied to an input of an adder 74. The output of the adder 74 is coupled to an input of a memory 75. The output of that memory 75 is coupled both to a second input of the adder 74 and an input of a divider 76. The elements referenced 74, 75 and 76 determine the magnitude v.sub.m for each signal block, compare block 16 in FIG. 4. To this end, the first sample s.sub.1 of a signal block 3 of the subband signal SB.sub.m is squared in the squaring unit 73 and, in adder 74, added to the value stored in memory 75, which value is momentarily zero, and thereafter stored in the memory 75. Subsequently, the second sample s.sub.2 is squared, added to the value stored in memory 75 and then stored in that memory. This is continued until the last sample s.sub.q of the signal block of the subband signal SB.sub.m is squared and added to the value stored in memory 75. The sum thus obtained in the memory 75 is equal to $\sum_{j=1}^{q} s_j^{2}$ which then, after a division by q in the divider 76, is stored as a coefficient v.sub.m in memory 77. Similar calculations are made for the corresponding signal blocks of the further subband signals until all coefficients of the vector {v} have been stored in the memory 77. The bit need determining circuit 6 further includes a memory 78 for storing the matrix coefficients d.sub.mi of the matrix [D] and a memory 79 for storing the coefficients w.sub.r.m of the vector {w.sub.r }. Outputs of the memories 77 and 78 are coupled to inputs of the multiplier 80. An output of the multiplier 80 is coupled to a first input of an adder 81 whose one output is coupled to the input of a memory 82. The output of the memory 82 is coupled both to a second input of the adder 81 and to a first input of an adder 83.
The elements referenced 80, 81 and 82 are intended to perform the matrix multiplication [D]{v}. During this operation, the value d.sub.m1 from memory 78 is multiplied by the value v.sub.1 from memory 77, and the result is added by adder 81 to the value present in memory 82 at that moment, which is zero, and then stored in memory 82. After this, d.sub.m2 is multiplied by v.sub.2, and the result is added to the value stored in memory 82. This is continued until d.sub.mM is multiplied by v.sub.M, and the result is added to the value stored in memory 82. At that moment, the value $\sum_{i=1}^{M} d_{mi} v_i$ is stored in memory 82. In adder 83, the value w.sub.r.m stored in memory 79 is added to this result. The value w.sub.m thus obtained is stored in memory 84. This procedure is reiterated for the corresponding signal blocks in the further subbands until all coefficients of the vector {w} are stored in memory 84.
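The accumulate-and-divide operation of elements 73 to 76 and the multiply-accumulate operation of elements 80 to 83 may, purely as a software illustration, be sketched in Python as follows. The function names and the numerical values in the usage below are hypothetical; the sketch uses ordinary floating-point arithmetic rather than the logarithmic representation, and it stops at the vector {w} without attempting the subsequent bit need calculation.

```python
def scale_factor(block):
    """Largest absolute sample value in a signal block (unit 71)."""
    return max(abs(s) for s in block)


def block_power(block):
    """Mean of the squared samples of a signal block (elements 73 to 76)."""
    total = 0.0
    for s in block:
        total += s * s          # squaring unit 73 and adder 74 / memory 75
    return total / len(block)   # divider 76: division by q


def masking_magnitudes(d, v, w_r):
    """Return w with w[m] = sum_i d[m][i] * v[i] + w_r[m] (elements 80 to 83)."""
    w = []
    for m in range(len(v)):
        acc = 0.0
        for i in range(len(v)):
            acc += d[m][i] * v[i]   # multiplier 80 and adder 81 / memory 82
        w.append(acc + w_r[m])      # adder 83 adds the threshold term
    return w
```

A usage example with M=2 and q=2:

```python
blocks = [[2.0, -2.0], [1.0, 1.0]]
v = [block_power(blk) for blk in blocks]          # [4.0, 1.0]
sf = [scale_factor(blk) for blk in blocks]        # [2.0, 1.0]
w = masking_magnitudes([[0.0, 0.5], [0.25, 0.0]], v, [1.0, 2.0])  # [1.5, 3.0]
```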
Subsequently, for each subband signal SB.sub.m block, the magnitudes SF.sub.m and w.sub.m are read from the memories 72 and 84 and applied to the calculation unit which eventually determines the bit need b.sub.m. This bit need is stored in a memory 86. This calculation is also performed for the further time-equivalent blocks of the subband signals until all bit needs b.sub.1 to b.sub.M are stored in the memory 86.
The procedure may be reiterated for a successive series of M signal blocks. Also, the arrangement shown in FIG. 8 utilizes the fact that there is serial information supply. If the signal blocks were supplied in parallel, the calculation could largely be performed in parallel. This means, for example, that the circuit comprising the elements 71, 73, 74, 75 and 76 could occur M times in the arrangement. The circuit comprising elements 80, 81, 82 and 83 could then likewise occur M times.
FIG. 9 shows a block diagram of an embodiment of the bit allocation circuit 7. The bit allocation circuit comprises a memory 90 in which the number of bits B still to be allocated has been stored, a memory 91 in which the values n.sub.1 to n.sub.M are stored and a memory 92 in which the bit needs b.sub.1 to b.sub.M have been stored. This memory 92 could correspond with memory 86 of FIG. 8. At the beginning of an allocation cycle, the initial value for B is stored in memory 90, which value is available at terminal 94. Furthermore, the initial values for the bit needs b.sub.1 to b.sub.M have been stored in memory 92, whereas memory 91 stores all zeros fed to terminal 93 by means of reset signals.
Subsequently, detector 95 determines the maximum value of the bit needs stored in memory 92. This may, for example, be realized by successively reading out all bit needs b.sub.1 to b.sub.M at the output 96 and applying these bit needs through line 97 to the input 98 of the detector 95. At the output 99, the detector 95 provides the index of the maximum bit need b.sub.j. This index j is used as an address for addressing, through line 100, the locations in the memories 91 and 92 in which the values are stored for n.sub.j and b.sub.j, respectively, so that these values are available at the respective outputs 101 and 96. Output 101 is coupled to an input of an n.sub.j =0 detector 102. If the detector 102 detects n.sub.j =0, it provides at output 103 a control signal which is applied to control signal inputs of controllable switches S.sub.1, S.sub.2 and S.sub.3. These switches then assume different positions from the ones shown in FIG. 9. This results in a subtractor 105 subtracting the value a.sub.1.q+x from the value B available at output 106 of memory 90, and this new value is again applied to input 107 of this memory through line 104 so that the new value is stored in memory 90. Furthermore, through lines 108 and 109 the value a.sub.1, which is available at terminal 110, is applied to the input 111 of memory 91. Switch S.sub.4 then has the position shown in FIG. 9, and a.sub.1 is stored in memory 91 as a new value for n.sub.j. In the subtractor 112, the value a.sub.2 is subtracted from the value b.sub.j available at the output 96 of memory 92. The value, thus obtained, is applied to input 115 through lines 113 and 114, while switch S.sub.5 has the position shown in FIG. 9, so that the new value for b.sub.j can be stored in the memory location b.sub.j in memory 92. The sequence described hereinbefore corresponds to the method as denoted in block 54 in FIG. 5.
If the detector 102 detects that n.sub.j is unequal to zero, no (or a different) control signal is generated. Switches S.sub.1, S.sub.2 and S.sub.3 then have the positions shown in FIG. 9. The value c.sub.1.q is now subtracted from the value B stored in memory 90, and the result, thus obtained, is stored again in memory 90. In the adder 117, the value c.sub.1 is added to the value n.sub.j, which is read from the memory 91 through output 101. Again, through lines 108 and 109, the new value for n.sub.j is applied to input 111 of memory 91 to be stored in the memory 91. Furthermore, in subtractor 112, the value c.sub.2 is subtracted from the value b.sub.j present at the output 96, and the value, thus obtained, is applied to input 115 through lines 113 and 114 in order to be stored in memory 92. This sequence corresponds with block 56 of the method shown in FIG. 5.
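A single pass through the two sequences just described (blocks 54 and 56 of FIG. 5) may be sketched in Python as follows. This is a minimal illustration only: the function name and the parameter name `overhead` (the term added to a.sub.1.q in the first subtraction) are hypothetical, the handling of n.sub.max, of the flag value and of an exhausted bit quantity B is deliberately left out, and the numerical values in the usage below are arbitrary.

```python
def allocation_step(B, n, b, q, a1, a2, c1, c2, overhead):
    """Perform one allocation step and return the updated (B, n, b).

    B: bits still available; n: bits per sample allocated so far;
    b: current bit needs. The input lists are not modified.
    """
    n, b = list(n), list(b)
    # Detector 95: index j of the largest bit need (first one on a tie,
    # which is an assumption of this sketch).
    j = max(range(len(b)), key=lambda i: b[i])
    if n[j] == 0:
        # First allocation to this signal block (block 54).
        B -= a1 * q + overhead
        n[j] = a1
        b[j] -= a2
    else:
        # Further allocation to this signal block (block 56).
        B -= c1 * q
        n[j] += c1
        b[j] -= c2
    return B, n, b
```

```python
state = (100, [0, 0], [5.0, 3.0])
for _ in range(3):
    state = allocation_step(*state, q=12, a1=2, a2=2, c1=1, c2=1, overhead=6)
# state is now (28, [3, 2], [2.0, 1.0])
```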
In the method of FIG. 5, there is further shown a block 48 for making a decision as to whether n.sub.j .gtoreq.n.sub.max. If it is, b.sub.j is made equal to the flag value (block 66 in FIG. 5), and n.sub.j is made equal to n.sub.max (should it turn out to be necessary). In the circuit of FIG. 9, this has been taken into account by means of the n.sub.j .gtoreq.n.sub.max detector 118. If detector 118 detects a situation in which n.sub.j .gtoreq.n.sub.max it generates at its output 119 a control signal which is applied to the control input of the controllable switch S.sub.4 and, through an OR-gate 120, to the control input of the controllable switch S.sub.5, which switches then assume positions different from the ones shown in FIG. 9. The value n.sub.max, applied to the terminal 121, is now applied to the input 111 of memory 91. n.sub.max is then stored in the memory location for n.sub.j in memory 91. Accordingly, the flag value, block 122, is applied to input 115 so that the flag value is stored in the memory location for b.sub.j in memory 92.
It will be evident that there is a central control unit (not shown) which detects the output signal of detector 118 and, on detection of this signal, applies only load pulses to the memories 91 and 92 for storing there n.sub.max and the flag value. No load pulse is then applied to memory 90, since the value B in the memory is to remain unchanged.
Furthermore, the flag value is allocated to b.sub.j if both n.sub.j is equal to zero and B.gtoreq.a.sub.1.q+x (see the blocks 50, 52 and 66 in FIG. 5). Thus, the circuit of FIG. 9 includes the detector 123 and the AND-gate 124. At the occurrence of detection signals of both detector 102 and detector 123, switch S.sub.5 is again set to the position opposite to the one shown in FIG. 9, and the flag value b.sub.f is stored in memory 92 in location j. In this case, the central processor will generate a load pulse only for memory 92 and no load pulses for memories 90 and 91.
It will be self-evident that the initial bit allocation as described with reference to FIG. 6 may also be implemented here, for example, controlled by the necessary control and address signals from the central controller. This will not be explained any further because after the above explanation it may be assumed to be within the grasp of those skilled in the art.
FIG. 10 shows the use of the encoding system as described hereinbefore, in a transmitter, especially a transmitter in the form of a recording arrangement for recording the quantized subband signals in one or more tracks on a magnetic record carrier. The section referenced 130 is the encoding system of FIG. 1, (i.e., a subband coder) discussed hereinbefore, which produces the quantized subband signals at the outputs 4.1 to 4.M. The section referenced 131 is a formatting circuit which assembles the quantized subband signals into an encoded output digital signal which is available at output 132. This encoded output digital signal comprises successive frames of which the format is extensively discussed in copending patent application Ser. No. 07/997,158. The structure of the formatting circuit 131 is also explained in that document.
The section referenced 133 is a code word converter which renders the encoded output digital signal suitable for recording on a record carrier, for example, a magnetic record carrier 134. The converter 133 comprises an 8-to-10 converter. In a converter of this type, 8-bit data words in the serial information stream are converted to 10-bit code words. Furthermore, interleaving may take place. The object of this is to enable error correction of the received information on the receiver side (when reproducing the data on the record carrier).
The output signal of converter 133 is applied to a recorder 135 by which the signal is recorded in one or more longitudinal tracks on the record carrier 134. The recorder 135 comprises one or more recording heads 136.
For a further explanation of the arrangement of FIG. 10, copending U.S. patent application Ser. No. 07/669,136 filed Mar. 13, 1991 should be referred to, which is also incorporated herein by reference.
It should further be observed that the invention is not restricted only to the depicted exemplary embodiments. Various modifications of the embodiments described are feasible without departing from the scope of the invention as defined in the claims.
In the foregoing the bit need determination and bit allocation have been described for a number of M subband signals, when there is a single subband signal (for example, a mono signal) in each subband. However, the invention may also be applied to a system for subband encoding of a stereo signal. This means that there will be two subband signals in each subband, that is to say, a left and a right subband signal. Two alternative ways of subband encoding of a stereo signal will be briefly discussed hereinbelow.
A first option is to process the left and right subband signals separately in the manner described above. The M subband signals SB.sub.1 to SB.sub.M as discussed above are then, for example, the M left subband signals. The procedure discussed hereinbefore is then carried out for these left subband signals. In the bit need determining circuit 6, first the bit needs b.sub.1l to b.sub.Ml are determined. Thereafter, the numbers of bits to be allocated, i.e., n.sub.1l to n.sub.Ml, are determined in the bit allocation circuit 7. In the procedure discussed above and explained with reference to FIG. 5, the value B was used for the bit allocation, B being equal to the number of available bits. It will be obvious that in the present case just half this number of available bits B is used for determining n.sub.1l to n.sub.Ml. The other half of the number of available bits will then be used for bit allocation to the right subband signals.
The arrangement for the stereo signal subband encoding according to the first option actually comprises twice the arrangement shown in FIG. 1. The second section of the arrangement thus comprises a second splitter, such as the splitter 2, for generating the M right subband signals. Furthermore, another bit need determiner is present, such as circuit 6, which determines the bit needs b.sub.1r to b.sub.Mr, and another bit allocator, such as circuit 7, which derives therefrom the numbers of allocated bits n.sub.1r to n.sub.Mr. Also for this purpose, half the actual number of available bits is available.
According to a second option for subband encoding of a stereo signal, the bit needs b.sub.1l to b.sub.Ml and b.sub.1r to b.sub.Mr are derived in the same manner as in the first option. In contradistinction to the first option, however, in which the bit allocation for the left and right subband signals was performed separately, in the second option, the 2M bit needs b.sub.1l to b.sub.Ml and b.sub.1r to b.sub.Mr are applied to a bit allocator such as circuit 7, which then naturally has 2M inputs. In this unit the 2M numbers n.sub.1l to n.sub.Ml and n.sub.1r to n.sub.Mr are derived in a manner similar to the manner described above with reference to FIG. 5 on the basis of the actually available number of bits. For this purpose, the bit allocator has 2M outputs.
It should further be observed that when a stereo signal is encoded, 2M values for the bit allocation information are concerned, represented each by y bits. This means that for the bit allocation procedure for a stereo signal no more than B=B.sub.0 -2.y.M bits are available.
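The bit budget for the stereo case may be illustrated by the following short Python sketch. The function names are hypothetical, the figures used in the usage below are arbitrary, and the use of integer halving for the first option is an assumption made only so that the sketch is complete.

```python
def stereo_bit_budget(b0, y, m):
    """Bits left for allocation: B = B0 - 2*y*M.

    b0: bits before deducting the bit allocation information;
    y: bits per allocation information value; m: number of subbands M.
    """
    b = b0 - 2 * y * m
    if b <= 0:
        raise ValueError("no bits left for allocation")
    return b


def split_per_channel(b):
    """First option: each of the left and right channels receives half of B."""
    half = b // 2
    return half, half
```

```python
B = stereo_bit_budget(3072, 4, 32)      # 3072 - 2*4*32 = 2816
left, right = split_per_channel(B)      # (1408, 1408)
```

In the second option the whole of B would instead be offered to a single allocation procedure operating on all 2M bit needs.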
Claims
  • 1. An encoding system for encoding a digital signal having a specific sampling frequency and bandwidth, comprising:
  • splitter means for dividing the bandwidth of the digital signal into M successive subbands, and generating, in response to the digital signal, M subband signals having reduced sampling frequencies, each of the subband signals being associated with one of the subbands;
  • quantizing means for quantizing time-equivalent signal blocks of the subband signals, a subband signal SB.sub.m of the subband signals having successive signal blocks which each contain q samples of that subband signal, each sample in a signal block of subband signal SB.sub.m having an amplitude and being quantized by n.sub.m bits, where n.sub.m may vary for different signal blocks of subband signal SB.sub.m ;
  • bit need determining means for determining bit needs for the time-equivalent signal blocks, said bit need determining means comprising:
  • (a) means for estimating power within the time-equivalent signal blocks, the signal block of subband signal SB.sub.m having a power v.sub.m ;
  • (b) means for determining scale factors for the time-equivalent signal blocks, a scale factor SF.sub.m for the signal block of subband signal SB.sub.m being determined from a sample therein having a maximum absolute amplitude value;
  • (c) means for determining masking magnitudes for the time-equivalent signal blocks, the signal block of subband signal SB.sub.m having a masking magnitude w.sub.m which is determined in accordance with the following relationship: ##EQU8## where d.sub.mi v.sub.i denotes masked power in the signal block of subband signal SB.sub.m as a result of power v.sub.i in a time-equivalent signal block of a subband signal SB.sub.i of the subband signals, d.sub.mi denotes a matrix coefficient in an M.times.M matrix by which the power v.sub.i is multiplied to determine the masked power in the signal block of subband signal SB.sub.m as a result of the time-equivalent signal block of subband signal SB.sub.i, and w.sub.r.m denotes a masking threshold in the signal block of subband signal SB.sub.m ; and
  • (d) means for determining the following relationship for the time-equivalent signal blocks: ##EQU9## where K.sub.1, K.sub.2 and K.sub.3 are constants; and b.sub.m is a bit need for the signal block of subband signal SB.sub.m corresponding to the number of bits by which the q samples in that signal block should be represented, and b.sub.m may vary for different signal blocks of the subband signal SB.sub.m ; and
  • bit allocation means for allocating bits to the time-equivalent signal blocks from an available number of bits B, n.sub.m bits being allocated to each of the q samples of the signal block of subband signal SB.sub.m in accordance with at least the bit need, b.sub.m, for that signal block;
  • wherein M, m and i are integers such that 1.ltoreq.m.ltoreq.M and 1.ltoreq.i.ltoreq.M; q and B are integers, where q is greater than unity and B is greater than zero; and b.sub.m, n.sub.m, v.sub.m, v.sub.i, SF.sub.m, w.sub.m, d.sub.mi and w.sub.r.m are variables, where n.sub.m and SF.sub.m are greater than or equal to zero.
  • 2. The encoding system as claimed in claim 1, wherein K.sub.1 =1, K.sub.2 =1/.sqroot.3 and K.sub.3 is preferably equal to either 1 or zero.
  • 3. The encoding system as claimed in claim 2, wherein said means for estimating power estimates the power v.sub.m in the signal block of subband signal SB.sub.m according to the following relationship: ##EQU10## where s.sub.j is the amplitude of a jth sample in that signal block, j being an integer such that 1.ltoreq.j.ltoreq.q and s.sub.j being a variable.
  • 4. The encoding system as claimed in claim 1, wherein said means for estimating power estimates the power v.sub.m in the signal block of subband signal SB.sub.m according to the following relationship: ##EQU11## where s.sub.j is the amplitude of a jth sample in that signal block, j being an integer such that 1.ltoreq.j.ltoreq.q and s.sub.j being a variable.
  • 5. The encoding system as claimed in claim 1, wherein said bit need determining means utilizes a logarithmic representation for d.sub.mi, v.sub.i, w.sub.m and w.sub.r.m when determining the masking magnitude w.sub.m.
  • 6. The encoding system as claimed in claim 5, wherein K.sub.1 =1, K.sub.2 =1/.sqroot.3 and K.sub.3 is preferably equal to either 1 or zero.
  • 7. The encoding system as claimed in claim 6, wherein said means for estimating power estimates the power v.sub.m in the signal block of subband signal SB.sub.m according to the following relationship: ##EQU12## where s.sub.j is the amplitude of a jth sample in that signal block, j being an integer such that 1.ltoreq.j.ltoreq.q and s.sub.j being a variable.
  • 8. The encoding system as claimed in claim 5, wherein said means for estimating power estimates the power v.sub.m in the signal block of subband signal SB.sub.m according to the following relationship: ##EQU13## where s.sub.j is the amplitude of a jth sample in that signal block, j being an integer such that 1.ltoreq.j.ltoreq.q and s.sub.j being a variable.
  • 9. The encoding system as claimed in claim 5, wherein said bit need determining means further comprises means for adding and multiplying logarithmically represented values.
  • 10. The encoding system as claimed in claim 1, further comprising signal formatting means for assembling into a frame of an output digital signal having successive frames the q samples from the time-equivalent signal blocks of the subband signals which have been quantized by said quantizing means, scale factor information being included in the frame in the form of x-bit words representing the scale factors associated with the time-equivalent signal blocks for which the q samples are included in the frame.
  • 11. A transmitter, comprising the encoding system of claim 1.
  • 12. The transmitter as claimed in claim 11, further comprising signal formatting means for assembling into a frame of an output digital signal having successive frames the q samples from the time-equivalent signal blocks of the subband signals which have been quantized by said quantizing means, scale factor information being included in the frame in the form of x-bit words representing the scale factors associated with the time-equivalent signal blocks for which the q samples are included in the frame.
  • 13. The transmitter as claimed in claim 12, further comprising a recording means for recording the output digital signal in a track on a record carrier.
  • 14. The transmitter as claimed in claim 13, wherein the record carrier is a magnetic record carrier.
  • 15. A method of encoding a digital signal having a specific sampling frequency and bandwidth, comprising:
  • dividing the bandwidth of the digital signal into M successive subbands, and generating, in response to the digital signal, M subband signals having reduced sampling frequencies, each of the subband signals being associated with one of the subbands;
  • quantizing time-equivalent signal blocks of the subband signals, a subband signal SB.sub.m of the subband signals having successive signal blocks which each contain q samples of that subband signal, each sample in a signal block of subband signal SB.sub.m having an amplitude and being quantized by n.sub.m bits, where n.sub.m may vary for different signal blocks of subband signal SB.sub.m ;
  • wherein in order to quantize the time-equivalent signal blocks, the following steps are performed:
  • determining bit needs for the time-equivalent signal blocks by:
  • (a) estimating power within the time-equivalent signal blocks, the signal block of subband signal SB.sub.m having a power v.sub.m ;
  • (b) determining scale factors for the time-equivalent signal blocks, a scale factor SF.sub.m for the signal block of subband signal SB.sub.m being determined from a sample therein which has a maximum absolute amplitude value;
  • (c) determining masking magnitudes for the time-equivalent signal blocks, the signal block of subband signal SB.sub.m having a masking magnitude w.sub.m which is determined in accordance with the following relationship: ##EQU14## where d.sub.mi v.sub.i denotes masked power in the signal block of subband signal SB.sub.m as a result of power v.sub.i in a time-equivalent signal block of a subband signal SB.sub.i of the subband signals, d.sub.mi denotes a matrix coefficient in an M.times.M matrix by which the power v.sub.i is multiplied to determine the masked power in the signal block of subband signal SB.sub.m as a result of the time-equivalent signal block of subband signal SB.sub.i, and w.sub.r.m denotes a masking threshold in the signal block of subband signal SB.sub.m ; and
  • (d) determining the following relationship for the time-equivalent signal blocks of the subband signals: ##EQU15## where K.sub.1, K.sub.2 and K.sub.3 are constants; and b.sub.m is a bit need for the signal block of subband signal SB.sub.m corresponding to the number of bits by which the q samples in that signal block should be represented, and b.sub.m may vary for different signal blocks of subband signal SB.sub.m ; and
  • allocating bits to the time-equivalent signal blocks from an available number of bits B, n.sub.m bits being allocated to each of the q samples of the signal block of subband signal SB.sub.m in accordance with the bit need b.sub.m for that signal block;
  • wherein M, m and i are integers such that 1.ltoreq.m.ltoreq.M and 1.ltoreq.i.ltoreq.M; q and B are integers, where q is greater than unity and B is greater than zero; and b.sub.m, n.sub.m, v.sub.m, v.sub.i, SF.sub.m, w.sub.m, d.sub.mi and w.sub.r.m are variables, where n.sub.m and SF.sub.m are greater than or equal to zero.
  • 16. The method as claimed in claim 15, wherein in determining the masking magnitude w.sub.m a logarithmic representation is used for d.sub.mi, v.sub.i, w.sub.m and w.sub.r.m.
  • 17. The method as claimed in claim 15, wherein K.sub.1 =1, K.sub.2 =1/.sqroot.3 and K.sub.3 is preferably equal to either 1 or zero.
  • 18. The method as claimed in claim 11, wherein the power v.sub.m in the signal block of subband signal SB.sub.m is estimated in accordance with the following relationship: ##EQU16## where s.sub.j is the amplitude of a jth sample in that signal block, j being an integer such that 1.ltoreq.j.ltoreq.q and s.sub.j being a variable.
  • 19. The method as claimed in claim 15, further comprising the step of assembling into a frame of an output digital signal having successive frames the q samples from the time-equivalent signal blocks of the subband signals which have been quantized, scale factor information being included in the frame in the form of x-bit words representing the scale factors associated with the time-equivalent signal blocks for which the q samples are included in the frame.
  • 20. The method as claimed in claim 19, further comprising the step of recording the output digital signal in a track on a record carrier.
  • 21. The method as claimed in claim 20, wherein the record carrier is a magnetic record carrier.
  • 22. An encoding system for encoding a digital signal, comprising:
  • means for dividing the digital signal into a plurality of subband signals, each of the subband signals having a plurality of signal blocks, each containing q samples of that subband signal, where q is a positive integer, which are successive in time, each of the signal blocks of a subband signal being time-equivalent with a corresponding signal block of each of the other subband signals, corresponding signal blocks of the subband signals constituting time-equivalent signal blocks;
  • means for quantizing each of the q samples of each of the time-equivalent signal blocks with n.sub.m bits, where n.sub.m is a variable greater than or equal to zero which may vary for the time-equivalent signal blocks and/or different signal blocks within the same subband signal and m is a positive integer denoting which one of the subband signals a signal block comes from;
  • bit determining means for determining a bit need b.sub.m for each of the time-equivalent signal blocks, where b.sub.m is a variable which may vary for the time-equivalent signal blocks and/or different signal blocks within the same subband signal, the bit need b.sub.m for each of the time-equivalent signal blocks corresponding to the number of bits by which the q samples in that signal block should be represented and being determined on the basis of a scale factor for that signal block, a linear combination of each masked power in that signal block resulting from power in each of the time-equivalent signal blocks and a masking threshold for that signal block; and
  • means for allocating, from an available number of bits B, where B is a positive integer, the n.sub.m bits to each of the q samples of each of the time-equivalent signal blocks in accordance with the bit need b.sub.m for each of the time-equivalent signal blocks.
  • 23. The encoding system as claimed in claim 22, wherein said bit need determining means comprises:
  • estimation means for estimating the power within each of the time-equivalent signal blocks;
  • first determining means for determining the scale factor for each of the time-equivalent signal blocks;
  • second determining means for determining a masking magnitude for each of the time-equivalent signal blocks, the masking magnitude for a signal block being determined based on the masked power in the signal block as a result of the power in each of the time-equivalent signal blocks and the masking threshold for the signal block; and
  • third determining means for determining the bit need b.sub.m for each of the time-equivalent signal blocks.
  • 24. The encoding system as claimed in claim 23, wherein said estimation means estimates the power in a signal block in accordance with the following relationship: ##EQU17## where v.sub.m is a variable denoting the power in a signal block and s.sub.j is a variable denoting the amplitude of a jth sample in the signal block, j being an integer such that 1.ltoreq.j.ltoreq.q.
  • 25. The encoding system as claimed in claim 23, wherein said first determining means determines the scale factor for a signal block from a sample therein having a maximum absolute amplitude value.
  • 26. The encoding system as claimed in claim 23, wherein said second determining means determines the masking magnitude for a signal block in accordance with the following relationship: ##EQU18## where w.sub.m is a variable denoting the masking magnitude for the signal block, M is a positive integer equal to the number of subband signals, each of which is denoted by i, which is a positive integer such that 1.ltoreq.i.ltoreq.M, d.sub.mi v.sub.i is the masked power in the signal block as a result of power v.sub.i, where v.sub.i is a variable, in one of the time-equivalent signal blocks, which time-equivalent signal block is from subband signal i, d.sub.mi is a variable denoting a matrix coefficient in an M.times.M matrix by which the power v.sub.i is multiplied to determine the masked power in the signal block as a result of the time-equivalent signal block from subband signal i, and w.sub.r.m is a variable denoting the masking threshold in the signal block.
  • 27. The encoding system as claimed in claim 23, wherein said third determining means determines the bit need b.sub.m for a signal block in accordance with the following relationship: ##EQU19## where K.sub.1, K.sub.2 and K.sub.3 are constants, SF.sub.m is a variable denoting the scale factor for the signal block and w.sub.m is a variable denoting the masking magnitude for the signal block.
  • 28. The encoding system as claimed in claim 22, further comprising signal formatting means for assembling into a frame of an output digital signal having successive frames the q samples from the time-equivalent signal blocks which have been quantized, scale factor information being included in the frame in the form of x-bit words representing the scale factors associated with the time-equivalent signal blocks for which the q samples are included in the frame.
  • 29. A transmitter, comprising the encoding system of claim 28.
  • 30. The transmitter as claimed in claim 29, further comprising a recording means for recording the output signal in a track of a record carrier.
  • 31. A transmitter, comprising the encoding system of claim 22.
  • 32. The encoding system as claimed in claim 22, wherein the subband signals have reduced sampling frequencies as compared to the digital signal.
  • 33. A method for encoding a digital signal, comprising:
  • dividing the digital signal into a plurality of subband signals, each of the subband signals having a plurality of signal blocks, each containing q samples of that subband signal, where q is a positive integer, which are successive in time, each of the signal blocks of a subband signal being time-equivalent with a corresponding signal block of each of the other subband signals, corresponding signal blocks of the subband signals constituting time-equivalent signal blocks; and
  • quantizing each of the q samples of each of the time-equivalent signal blocks with n.sub.m bits, where n.sub.m is a variable greater than or equal to zero which may vary for the time-equivalent signal blocks and/or different signal blocks within the same subband signal and m is a positive integer denoting which one of the subband signals a signal block comes from;
  • wherein in order to quantize each of the time-equivalent signal blocks, the following additional steps are performed:
  • determining a bit need b.sub.m for each of the time-equivalent signal blocks, where b.sub.m is a variable which may vary for the time-equivalent signal blocks and/or different signal blocks within the same subband signal, the bit need b.sub.m for each of the time-equivalent signal blocks corresponding to the number of bits by which the q samples in that signal block should be represented and being determined on the basis of a scale factor for that signal block, a linear combination of each masked power in that signal block resulting from power in each of the time-equivalent signal blocks and a masking threshold for that signal block; and
  • allocating, from an available number of bits B, where B is a positive integer, the n.sub.m bits to each of the q samples of each of the time-equivalent signal blocks in accordance with the bit need b.sub.m for each of the time-equivalent signal blocks.
  • 34. The method as claimed in claim 33, wherein determining the bit need b.sub.m for each of the time-equivalent signal blocks includes:
  • estimating the power within each of the time-equivalent signal blocks;
  • determining the scale factor for each of the time-equivalent signal blocks; and
  • determining a masking magnitude for each of the time-equivalent signal blocks, the masking magnitude for a signal block being determined based on the masked power in the signal block as a result of the power in each of the time-equivalent signal blocks and the masking threshold for the signal block.
  • 35. The method as claimed in claim 34, wherein the power in a signal block is estimated in accordance with the following relationship: ##EQU20## where v.sub.m is a variable denoting the power in a signal block and s.sub.j is a variable denoting the amplitude of a jth sample in the signal block, j being an integer such that 1.ltoreq.j.ltoreq.q.
  • 36. The method as claimed in claim 34, wherein the scale factor for a signal block is determined from a sample therein having a maximum absolute amplitude value.
  • 37. The method as claimed in claim 34, wherein the masking magnitude for a signal block is determined in accordance with the following relationship: ##EQU21## where w.sub.m is a variable denoting the masking magnitude for the signal block, M is a positive integer equal to the number of subband signals, each of which is denoted by i, which is a positive integer such that 1.ltoreq.i.ltoreq.M, d.sub.mi v.sub.i is the masked power in the signal block as a result of power v.sub.i, where v.sub.i is a variable, in one of the time-equivalent signal blocks, which time-equivalent signal block is from subband signal i, d.sub.mi is a variable denoting a matrix coefficient in an M.times.M matrix by which the power v.sub.i is multiplied to determine the masked power in the signal block as a result of the time-equivalent signal block from subband signal i, and w.sub.r.m is a variable denoting the masking threshold in the signal block.
  • 38. The method as claimed in claim 34, further comprising assembling into a frame of an output digital signal having successive frames the q samples from the time-equivalent signal blocks which have been quantized, scale factor information being included in the frame in the form of x-bit words representing the scale factors associated with the time-equivalent signal blocks for which the q samples are included in the frame.
  • 39. The method as claimed in claim 33, wherein the bit need b.sub.m for a signal block is determined in accordance with the following relationship: ##EQU22## where K.sub.1, K.sub.2 and K.sub.3 are constants, SF.sub.m is a variable denoting the scale factor for the signal block and w.sub.m is a variable denoting a masking magnitude for the signal block, the masking magnitude being a function of each masked power in the signal block resulting from power in each of the time-equivalent signal blocks and the masking threshold for the signal block.
  • 40. A bit need determining device for determining bit needs for time-equivalent signal blocks of subband signals, the device comprising:
  • means for estimating power within each of the time-equivalent signal blocks;
  • means for determining a scale factor for each of the time-equivalent signal blocks;
  • means for determining a masking magnitude for each of the time-equivalent signal blocks, the masking magnitude for a time-equivalent signal block being determined based on a linear combination of each masked power in the time-equivalent signal block resulting from the power in each of the time-equivalent signal blocks and a masking threshold for the time-equivalent signal block; and
  • means for determining a bit need b.sub.m for each of the time-equivalent signal blocks, the bit need b.sub.m for a time-equivalent signal block being determined based upon the scale factor and the masking magnitude for the time-equivalent signal block.
  • 41. A bit need determining device for determining bit needs for time-equivalent signal blocks of M subband signals, each of the time-equivalent signal blocks having q samples, where q is a positive integer, the device comprising:
  • means for estimating power within the time-equivalent signal blocks, the power within a time-equivalent signal block being denoted v.sub.m, where v.sub.m is a variable, and m is a positive integer, such that 1.ltoreq.m.ltoreq.M, denoting which one of the subband signals the time-equivalent signal block comes from;
  • means for determining scale factors for the time-equivalent signal blocks, a scale factor SF.sub.m, where SF.sub.m is a variable greater than or equal to zero, for a time-equivalent signal block being determined from a sample therein having a maximum absolute amplitude value;
  • means for determining masking magnitudes for the time-equivalent signal blocks, a time-equivalent signal block having a masking magnitude w.sub.m, where w.sub.m is a variable, which is determined in accordance with the following relationship: ##EQU23## where i is a positive integer, such that 1.ltoreq.i.ltoreq.M, denoting one of the subband signals, d.sub.mi v.sub.i denotes masked power in the time-equivalent signal block as a result of power v.sub.i, where v.sub.i is a variable, in one of the time-equivalent signal blocks, which is from subband signal i, d.sub.mi is a variable denoting a matrix coefficient in an M.times.M matrix by which the power v.sub.i is multiplied to determine the masked power in the time-equivalent signal block as a result of the one of the time-equivalent signal blocks from subband signal i, and w.sub.r.m is a variable denoting the masking threshold in the signal block; and
  • means for determining the bit need b.sub.m for the time-equivalent signal block, the bit need b.sub.m for a time-equivalent signal block being determined in accordance with the following relationship: ##EQU24## where K.sub.1, K.sub.2 and K.sub.3 are constants.
  • 42. The bit need determining device as claimed in claim 41, wherein K.sub.1 =1, K.sub.2 =1/.sqroot.3 and K.sub.3 is preferably equal to either 1 or zero.
  • 43. The bit need determining device as claimed in claim 41, wherein said means for estimating power estimates the power v.sub.m in the time-equivalent signal block according to the following relationship: ##EQU25## where s.sub.j is a variable denoting the amplitude of a jth sample in the time-equivalent signal block, j being an integer such that 1.ltoreq.j.ltoreq.q.
  • 44. The bit need determining device as claimed in claim 41, wherein said bit need determining means utilizes a logarithmic representation for the values of d.sub.mi, v.sub.i, w.sub.m and w.sub.r.m when determining the masking magnitude for the time-equivalent signal block.
  • 45. The bit need determining device as claimed in claim 44, further comprising means for adding and multiplying the logarithmically represented values.
  • 46. A transmitter, comprising the bit need determining device claimed in claim 41.
  • 47. A method for determining bit needs for time-equivalent signal blocks of subband signals, the method comprising:
  • estimating power within each of the time-equivalent signal blocks;
  • determining a scale factor for each of the time-equivalent signal blocks;
  • determining a masking magnitude for each of the time-equivalent signal blocks, the masking magnitude for a time-equivalent signal block being determined based on each masked power in the time-equivalent signal block resulting from the power in each of the time-equivalent signal blocks and a masking threshold for the time-equivalent signal block; and
  • determining a bit need b.sub.m for each of the time-equivalent signal blocks, the bit need b.sub.m for a time-equivalent signal block being determined based upon the scale factor and the masking magnitude for the time-equivalent signal block.
  • 48. A method for determining bit needs for time-equivalent signal blocks of M subband signals, each of the time-equivalent signal blocks having q samples, where q is a positive integer, the method comprising:
  • estimating power within the time-equivalent signal blocks, the power within a time-equivalent signal block being denoted v.sub.m, where v.sub.m is a variable, and m is a positive integer, such that 1.ltoreq.m.ltoreq.M, denoting which one of the subband signals the time-equivalent signal block comes from;
  • determining scale factors for the time-equivalent signal blocks, a scale factor SF.sub.m, where SF.sub.m is a variable greater than or equal to zero, for a time-equivalent signal block being determined from a sample therein having a maximum absolute amplitude value;
  • determining masking magnitudes for the time-equivalent signal blocks, a time-equivalent signal block having a masking magnitude w.sub.m, where w.sub.m is a variable, which is determined in accordance with the following relationship: ##EQU26## where i is a positive integer, such that 1.ltoreq.i.ltoreq.M, denoting one of the subband signals, d.sub.mi v.sub.i denotes masked power in the time-equivalent signal block as a result of power v.sub.i, where v.sub.i is a variable, in one of the time-equivalent signal blocks, which is from subband signal i, d.sub.mi is a variable denoting a matrix coefficient in an M.times.M matrix by which the power v.sub.i is multiplied to determine the masked power in the time-equivalent signal block as a result of the one of the time-equivalent signal blocks from subband signal i, and w.sub.r.m is a variable denoting the masking threshold in the signal block; and
  • determining the bit need b.sub.m for the time-equivalent signal block, the bit need b.sub.m for a time-equivalent signal block being determined in accordance with the following relationship: ##EQU27## where K.sub.1, K.sub.2 and K.sub.3 are constants.
  • 49. The method as claimed in claim 48, wherein K.sub.1 =1, K.sub.2 =1/.sqroot.3 and K.sub.3 is preferably equal to either 1 or zero.
  • 50. The method as claimed in claim 48, wherein the power v.sub.m in the time-equivalent signal block is estimated according to the following relationship: ##EQU28## where s.sub.j is variable denoting the amplitude of a jth sample in the time-equivalent signal block, j being an integer such that 1.ltoreq.j.ltoreq.q.
  • 51. The method as claimed in claim 48, wherein a logarithmic representation for the values of d.sub.mi, v.sub.i, w.sub.m and w.sub.r.m is utilized in determining the masking magnitude for the time-equivalent signal block.
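The bit need determination recited in the claims above can be illustrated with a short sketch. The equation images (##EQU14## through ##EQU28##) are not reproduced in this text, so the formulas below are assumptions inferred from the claim language: the power is taken as the mean of the squared samples, the masking magnitude as the sum of the masked powers d.sub.mi v.sub.i plus the masking threshold w.sub.r.m, and the bit need as K.sub.1 times the base-2 logarithm of K.sub.2 times the square root of SF.sub.m squared over w.sub.m, plus K.sub.3. All function names are hypothetical, and the scale factor is simplified to the maximum absolute sample value rather than an x-bit word. Bit allocation from the available quantity B is not covered.

```python
import math
from typing import List, Sequence


def block_power(samples: Sequence[float]) -> float:
    """Assumed power estimate v_m: mean of the squared sample amplitudes s_j."""
    return sum(s * s for s in samples) / len(samples)


def scale_factor(samples: Sequence[float]) -> float:
    """Simplified scale factor SF_m: the maximum absolute amplitude in the block."""
    return max(abs(s) for s in samples)


def masking_magnitudes(
    powers: Sequence[float],
    d: Sequence[Sequence[float]],
    thresholds: Sequence[float],
) -> List[float]:
    """Assumed masking magnitude w_m = sum_i d[m][i] * v_i + w_r.m.

    d is the M x M matrix of coefficients d_mi, thresholds holds w_r.m.
    """
    M = len(powers)
    return [
        sum(d[m][i] * powers[i] for i in range(M)) + thresholds[m]
        for m in range(M)
    ]


def bit_need(
    sf: float,
    w: float,
    k1: float = 1.0,
    k2: float = 1.0 / math.sqrt(3.0),
    k3: float = 1.0,
) -> float:
    """Assumed bit need b_m = K1 * log2(K2 * sqrt(SF_m^2 / w_m) + K3).

    Requires w > 0. With K3 = 0 and SF_m = 0 the logarithm is undefined,
    so the default here is K3 = 1.
    """
    return k1 * math.log2(k2 * math.sqrt(sf * sf / w) + k3)


def bit_needs(
    blocks: Sequence[Sequence[float]],
    d: Sequence[Sequence[float]],
    thresholds: Sequence[float],
) -> List[float]:
    """Bit needs b_1..b_M for one set of time-equivalent signal blocks."""
    powers = [block_power(b) for b in blocks]
    w = masking_magnitudes(powers, d, thresholds)
    return [bit_need(scale_factor(b), w[m]) for m, b in enumerate(blocks)]
```

With K.sub.1 = 1 and K.sub.2 = 1/.sqroot.3, the values given in claims 17, 42 and 49, a block whose samples are all zero has SF.sub.m = 0 and therefore a bit need of zero when K.sub.3 = 1.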
Parent Case Info

This is a continuation of application Ser. No. 07/694,324, filed May 1, 1991 and now abandoned, which is a continuation of application Ser. No. 07/621,693, filed Nov. 30, 1990 and now abandoned.

US Referenced Citations (14)
Number Name Date Kind
4048443 Crochiere et al. Sep 1977
4455649 Esteban et al. Jun 1984
4535472 Tomcik Aug 1985
4811398 Copperi et al. Mar 1989
4896362 Veldhuis et al. Jan 1990
4912763 Galand et al. Mar 1990
4942607 Schroder et al. Jul 1990
4972484 Theile et al. Nov 1990
5105463 Veldhuis et al. Apr 1992
5109417 Fielder et al. Apr 1992
5115240 Fugiwara et al. May 1992
5142656 Fielder et al. Aug 1992
5214678 Rault et al. May 1993
5230038 Fielder et al. Jul 1993
Foreign Referenced Citations (1)
Number Date Country
3440613 Oct 1986 DEX
Non-Patent Literature Citations (6)
Entry
EBU Techn. Review, No. 230, Aug. 1988, G. Theile et al, "Low Bit Rate Coding of High-Quality Audio Signals. An Introduction to the Mascam System".
Philips Journal of Research 44, 329-343, 1989, R. N. J. Veldhuis et al. "Subband Coding of Digital Audio Signals".
IEEE ICASSP 80, vol. 1, pp. 327-331, Apr. 9-11, 1980, M. A. Krasner "The Critical Band Coder . . . Digital Encoding of Speech Signals Based on the Perceptual Requirements of the Auditory System".
F. J. MacWilliams et al, "The Theory of Error Correcting Codes", North Holland Publishing Comp. 1983.
"Digital Communication", Bernard Sklar, Prentice Hall pp. 72-73, 1988.
Concise Explanation of the Relevance of German Patent No. 34 40 613 C1.
Continuations (2)
Number Date Country
Parent 694324 May 1991
Parent 621693 Nov 1990