The present invention relates to methods and apparatuses for frame-wise coding and decoding of digital signals and associated signal processing, programs therefor and a recording medium having recorded thereon the programs.
Frame-wise processing of digital signals of speech, image or the like frequently involves processing which extends over frames, such as prediction or filtering. The use of samples of preceding and succeeding frames increases the continuity of reconstructed speech or image and the compression coding efficiency thereof. In packet communications, however, samples of the preceding and succeeding frames may sometimes be unavailable, and in some cases it is required that processing be started from only a specified frame. In these cases the continuity of reconstructed speech or image and the compression coding efficiency decrease.
A description will be given first, with reference to
A digital signal of a first sampling frequency from an input terminal 11 is divided by a frame dividing part 12 on a frame-by-frame basis, for example, every 1024 samples, and the digital signal for each frame is converted by a down-sampling part 13 from the first sampling frequency to a lower second sampling frequency. In this case, a high-frequency component is removed by low-pass filtering so as not to generate an aliasing signal by the sampling at the second sampling frequency.
The digital signal of the second sampling frequency is subjected to irreversible or reversible compression coding in a coding part 14, from which it is output as a main code Im. The main code Im is decoded by a local signal decoding part 15, and the decoded local signal of the second sampling frequency is converted by an up-sampling part 16 to a local signal of the first sampling frequency. Naturally enough, interpolation processing is performed in this instance. An error in the time domain between the local signal of the first sampling frequency and the branched digital signal of the first sampling frequency from the frame dividing part 12 is calculated in an error calculating part 17.
The error signal thus produced is provided to a prediction error signal generating part 51, wherein a prediction error signal of the error signal is generated.
The prediction error signal is provided to a compression coding part 18, wherein bits of its bit sequence are rearranged, and from which they are output intact as an error code Pe or after being subjected to reversible (Lossless) compression coding. The main code Im from the coding part 14 and the error code Pe are combined in a combining part 19, from which the combined output is provided in packetized form at an output terminal 21.
For the above-mentioned rearrangement of bit sequence and reversible compression coding, refer to, for example, JP Application Kokai Publication No. 2001-144847 Gazette (pages 6 to 8, FIG. 3), and for the packetizing, refer to, for example, T. Moriya and four others, “Sampling Rate Scalable Lossless Audio Coding,” 2002 IEEE Speech Coding Workshop Proceedings 2002, October.
In a decoder 30 the code from an input terminal 31 is separated by a separating part 32 into the main code Im and the error code Pe, and the main code Im is irreversibly or reversibly decoded into a decoded signal of the second sampling frequency by decoding that corresponds to coding in the coding part 14 of the coder 10. The decoded signal of the second sampling frequency is up-sampled in an up-sampling part 34, by which it is converted to a decoded signal of the first sampling frequency. Naturally enough, interpolation processing is performed to raise the sampling frequency in this instance.
The separated error code Pe is decoded in a decoding part 35 to reconstruct the prediction error signal. A concrete configuration of the decoding part 35 and its processing are described, for example, in the above-mentioned official gazette. The sampling frequency of the reconstructed prediction error signal is the first sampling frequency.
The prediction error signal is subjected to prediction synthesis in a prediction synthesis part 63, by which the error signal is reconstructed. The prediction synthesis part 63 corresponds in configuration to the prediction error signal generating part 51 of the coder 10.
The sampling frequency of the reconstructed error signal is the first sampling frequency, and the error signal and the decoded signal of the first sampling frequency, provided from the up-sampling part 34, are added together in an adding part 36 to reconstruct the digital signal, which is supplied to a frame combining part 37. The frame combining part 37 concatenates such digital signals sequentially reconstructed frame by frame and provides the concatenated signal to an output terminal 38.
In each of the up-sampling parts 16 and 34 in
As a result, the 0-value samples inserted into the solid-line sample sequence of the decoded signal, such as shown in
In such FIR filtering, each sample x(n) (where n=0, . . . , L−1) in the frame consisting of L samples as shown in
Accordingly, the first output sample y(0) of the current frame is dependent on T samples x(−T) to x(−1) of the immediately preceding frame. Similarly, the last output sample y(L−1) of the current frame is dependent on T values x(L) to x(L+T−1) of the immediately succeeding frame. The multiplying parts 221 to 22m in
In such a coding/decoding system as shown in
In the prediction error generating part 51 of the coder 10 in
In such autoregressive prediction processing, a sample at a point p preceding each sample x(n) (where n=0, . . . , L−1) in the frame consisting of L samples as shown in
In the above [*] represents rounding of the value *, for example, by omitting fractions. Accordingly, the first prediction error signal y(0) of the current frame is dependent on p input samples x(−p) to x(−1) of the immediately preceding frame. Incidentally, no rounding is required in the coding that allows distortion. The rounding may be done during calculation.
In the prediction synthesis part 63 of the decoder 30 in
In such autoregressive prediction synthesis, the prediction value is obtained by convoluting a sample at a point p preceding each input sample y(n) (where n=0, . . . , L−1) in a frame consisting of L samples as shown in
Accordingly, the first prediction synthesis sample x(0) of the current frame is dependent on p prediction synthesis samples x(−p) to x(−1) of the immediately preceding frame.
As described above, autoregressive prediction processing and prediction synthesis processing require input samples of the preceding frame and prediction synthesis samples of the preceding frame; in such a coding/decoding system as shown in
In JP Application Kokai Publication No. 2000-307654 there is proposed a scheme by which, in a conventional voice packet transmission system in which a speech signal is transmitted in packet form only during a speech-active duration but no packet transmission takes during a silent duration and at the receiving side a pseudo background noise is inserted in the silent duration, discontinuity of level between the speech-active duration and the silent duration is corrected to thereby prevent a conversation from starting or ending with a feeling of unnaturalness. According, to this scheme, at the receiving side an interpolation frame is inserted between a decoded speech frame of the speech-active duration and a pseudo background noise frame; in the case of using a hybrid coding system, filter coefficients or noise codebook index of the speech-active duration is used as the interpolation frame, and the gain coefficient used is one that takes an intermediate value of the background noise gain.
With the scheme set forth in the above-mentioned Japanese Application Kokai Publication No. 2000-307654, the speech signal is transmitted only during the speech-active duration, and the beginning and end of the speech-active duration are processed in the state in which preceding and succeeding frames do not exist originally.
In the processing for each frame, in the case of using a scheme that enhances the continuity, quality and coding efficiency of the reconstructed signal by processing the current frame through utilization of samples preceding and succeeding the current frame, it is desirable that degradation of the continuity, quality and coding efficiency be suppressed even if preceding and succeeding frames are unavailable at the receiving side (at the decoding side), or that even if only one frame is processed independently of other frames, the continuity, quality and efficiency can be provided ay substantially the same level as in the case where the preceding and succeeding frames are present. Such signal processing according to the present invention is applicable not only to part of coding processing for transmission or storage of a digital signal by coding it on a frame-by-frame basis and to part of decoding of a received code or code read out of a storage unit but also generally to frame-wise digital signal processing intended to provided enhanced quality and efficiency by utilization of samples of preceding and succeeding frames as well.
In other words, an object of the present invention it to provide a digital signal processing method, processor and program which, in the frame-wise processing of a digital signal by use of samples of its current frame alone, make it possible to achieve performance (continuity, quality, efficiency, etc.) substantially equal to that obtainable with the use of samples of preceding or/and succeeding frames as well.
A method for processing a digital signal on a frame-wise basis according to the invention of claim 1, comprises the steps of:
(a) modifying a sample sequence of a frame neighboring its first sample and/or a sample sequence of said frame neighboring its last sample in accordance with a consecutive-sample sequence consisting of consecutive samples forming part of said frame, thereby forming a modified sample sequence; and
(b) processing a series of sample sequence of said frame over said modified sample sequence.
The digital signal processing method according to the invention of claim 2 is a modification of the method of claim 1, wherein said step (a) includes a step of concatenating an alternative sample sequence, formed by using said series of sample sequences, to the front of the first sample of said frame and/or to the back of the last sample of said frame, thereby forming said modified sample sequence.
The digital signal processing method according to the invention claim 3 is a modification of the method of claim 2, wherein said step (a) includes a step of providing said alternative sample sequence by reversing the order of arrangement of samples of said consecutive-sample sequence.
The digital signal processing method according to the invention of claim 4 is a modification of the method of any one of claims 1, 2 and 3, wherein said step (a) of modifying a partial sample sequence in said frame containing the first sample and/or partial sample sequence in said frame containing the last sample by a calculation with said consecutive-sample sequence, thereby forming said modified sample sequence.
The digital signal processing method according to the invention of claim 5 is a modification of the method of claim 4, wherein said step (a) includes a step of concatenating a predetermined fixed sample sequence to the front of the first sample of said frame and/or to the back of said last sample.
The digital signal processing method according to the invention of claim 8 is a modification of the method of claim 2 or 3, which further comprises a step of providing, as a part of a code for the digital signal of said frame, auxiliary information indicating any one of a plurality of methods for using said consecutive-sample sequence as said alternative sample sequence and/or indicating the position of said consecutive-sample sequence
The digital signal processing method according to the invention of claim 9 is a modification of the method of claim 1, wherein:
said step (a) includes: a step of retrieving a sample sequence similar to a leading sample sequence or rear-end sample sequence of said frame and using said similar sample sequence as said consecutive-sample sequence; and a step of multiplying said similar sample sequence by a gain and the multiplied output is subtracted from said leading or rear-end sample sequence to form said modified sample sequence;
said step (b) a step of performing said processing to calculate a prediction error of the digital signal of said frame; and a step of providing, as a part of a code of said frame, auxiliary information indicating the position of said similar sample sequence in the frame and said gain.
The digital signal processing method according to the invention of claim 10 is a modification of the method of claim 1, wherein said step (a) includes the steps of:
(a-1) reconstructing the sample sequence of said frame by autoregressive prediction synthesis from a prediction error signal obtained from a code, and replicating said consecutive-sample sequence at the position in said frame specified by auxiliary information provided as part of said code; and
(a-2) multiplying said replicated sample sequence by a gain in said auxiliary information and adding the multiplied output to the first or last sample sequence of said frame to provide said modified sample sequence.
A digital signal processing method according to the invention of claim 11 is a method that performs filter or prediction processing of a digital signal on a frame-wise basis, the method comprising the step of:
(a) processing said digital signal by use of a tap number of prediction order dependent only on usable samples in a frame without using samples preceding a first sample of said frame and/or samples succeeding a last sample of said frame.
The digital signal processing method according to the invention of claim 15 is a modification of the method of claim 14, wherein said autoregressive linear prediction error generation processing is an operation using PARCOR coefficients.
A digital signal processing method according to the invention of claim 16 is a method that is used in frame-wise coding of an original digital signal and performs processing by use of samples of a frame preceding or/and succeeding the frame concerned, the method comprising the step of:
coding the first sample sequence of the frame concerned or the last sample sequence of said preceding frame separately of coding of said frame concerned, and providing auxiliary information as part of the code of said frame concerned.
A digital signal processing method according to the invention of claim 19 is a method that is used in frame-wise decoding of an encoded code of an original digital signal and performs processing by use of samples of a frame preceding or/and succeeding the frame concerned, the method comprising the step of:
(a) decoding an auxiliary code of said frame to obtain a first sample sequence of said frame or the last sample sequence of the preceding frame; and
(b) processing, for said frame, said first or last sample sequence as a decoded sample sequence at the end of the preceding frame.
A digital signal processor according to the invention of claim 22 is a processor for processing a digital signal on a frame-wise basis, the processor comprising:
means for forming a modified sample sequence by modifying a sample sequence of a frame neighboring its first sample and/or a sample sequence of said frame neighboring its last sample by using a consecutive-sample sequence consisting of consecutive samples forming part of said frame; and
means for processing said digital signal over said modified sample sequence.
The digital signal processor according to the invention of claim 23 is a modification of the processor of claim 22, wherein:
said modified sample sequence forming means includes: means for generating, as an alternative sample sequence, a consecutive-sample sequence consisting of consecutive samples forming part of the frame; and means for concatenating said alternative sample to at least one of the front of the first sample of the digital signal of the frame concerned and the back of the last sample of said digital signal of said frame; and
said processing includes means for performing linear coupling of the digital signal having concatenated hereto said alternative sample sequence.
The digital signal processor according to the invention of claim 24 is a modification of the processor of claim 22, wherein:
said modified sample sequence forming means includes: means selecting a consecutive-sample sequence, which consists of consecutive samples forming part of said frame, similar to the first or last sample sequence of the frame; means for multiplying said selected consecutive-sample sequence by a gain; and means for subtracting said gain-multiplied consecutive-sample sequence from the first or last sample sequence of said frame; and
said processing means includes: means for generating a prediction error of the digital signal of said subtracted frame by autoregressive prediction; and means for providing, as a part of code of the current frame, auxiliary information indicating the position of said consecutive-sample sequence in said frame and said gain.
The digital signal processor according to the invention of claim 25 is a modification of the processor of claim 22, which further comprises:
means for reconstructing a sample sequence of one frame by autoregressive synthesis filter on the basis of a prediction error signal obtained from a code; means for extracting the consecutive-sample sequence from said reconstructed sample sequence on the basis of position signal in auxiliary information used as a part of a code of said frame; means for multiplying said extracted consecutive-sample sequence by a gain contained in said auxiliary information; means for forming said modified sample sequence by adding said gain-multiplied consecutive-sample sequence to the first or last sample sequence of said reconstructed sample sequence; and
said processing means is means for performing autoregressive prediction synthesis for the digital signal over said modified sample sequence.
A readable recording medium, which has recorded a computer-executable program for implementing said digital signal processing method according to the present invention, is also included in the present invention.
According to the inventions of claims 1 and 22, the digital signal processing is performed extending over a modified sample sequence, by which it is possible to suppress discontinuity of a reconstructed signal due to a sharp change of the first or last sample of the current frame and hence improve the quality of the reconstructed signal.
According to the inventions of claims 2 and 23, an alternative sample sequence consisting of samples of only the current frame is concatenated to the frame, by which it is possible to achieve processing equivalent to digital signal processing that extends over the preceding and succeeding frames.
According to the invention of claim 3, the alternative sample sequence is formed by reversing the order of arrangement of the sample of a sample sequence, by which it is possible to increase the symmetry at the head and end of the frame, providing for increased continuity.
According to the invention of claim 4, a sample sequence in the current frame is used as high-reliability data, by which the first or last sample sequence of the frame can be modified through calculation.
According to the invention of claim 5, the digital signal processing can be simplified by using a fixed sample sequence as the alternative sample sequence.
According to the invention of claim 8, the optimum alternative sequence generating method is selected, and/or information on the position of the sample sequence used is sent to the receiving side, enabling it to achieve reconstruction with less distortion.
According to the inventions of claims 9 and 24, by modifying a sample sequence of the frame neighboring its first or last sample by using a sample sequence similar to the lading or rear-end sample sequence of the frame, it is possible to flatten the leading portion or rear-end portion of the signal and hence provide increased continuity.
According to the inventions of claims 10 and 25, at the decoding side a sample sequence of the position specified by auxiliary information to modify the first or last sample sequence by a specified gain, by which it is possible to implement processing that corresponds to the processing t the transmitting side.
According to the invention of claim 11, by performing digital signal processing while changing the tap number or prediction order according to the number of usable samples at each sample position in the frame, processing can be concluded within the frame.
According to the invention of claim 15, the use of the PARCOR coefficient permits reduction of the computational complexity involved.
According to the invention of claim 16, the first or last sample sequence of the frame is prepared separately as auxiliary information, which can be used as an alternative sample sequence immediately at the occurrence of a frame dropout at the receiving.
According to the invention of claim 19, the first sample sequence of the frame or the last sample sequence of the preceding frame, received as auxiliary information, is used as an alternative sample sequence, by which it is possible to facilitate random access to the frame.
First Mode of Working
In the first mode of working of the present invention, as shown in FIGS. 5A and 5B, a sample sequence ΔS consisting of consecutive samples which form part of a digital signal (a sample sequence) SFC of one frame, for example, stored in a buffer 100, that is, the sample sequence ΔS in the buffer 1100, is read out intact by an alternative sample sequence generating part 110, which outputs the sample sequence ΔS intact, or processes it as required, to provide an alternative sample sequence AS, then the alternative sample sequence AS is provided to a sample sequence concatenating part 120, wherein it is concatenated to the front of the lead sample of the current frame FC in the buffer 100 and the back of the last sample of the current frame FC, respectively, and the resulting concatenated sample sequence PS (=AS+SFC+AS, hereinafter referred to as a processed sample sequence) is provided to a linear coupling part 130, such as an FIR filter, wherein it is subjected to linear coupling. Of course, the alternative sample sequences AS need not always to be pre-concatenated directly to the current frame in the buffer 100 to form a series of processed sample sequences, but instead the alternative sample sequence AS to be concatenated to the current frame FC may be stored in the buffer 100 independently of the current-frame sample sequence so that they are read out in a sequential order AS−SFC−AS.
As indicated by the broken lines in
In the linear coupling part 130 samples of the preceding and succeeding frames are required, but a sample sequence consisting of samples forming part of the current frame is replicated and used as an alternative sample sequence in place of the required sample sequence of the preceding or succeeding frame; by this scheme, a processed digital signal (a sample sequence) SOU of one frame can be obtained with only the current-frame sample sequence SFC without using samples of the preceding and succeeding frames. In this instance, since the alternative sample sequence is formed by samples forming part of the current-frame sample sequence SFC, the continuity, quality and coding efficiency of the reconstructed signal become higher than in the case where the alternative sample sequences concatenated to the front and back of the current frame are processed as zeros.
A description will be given of Embodiment 1 in which the first mode of working is applied to the FIR filtering shown in
In the buffer 100 in
By the reading part 141, T samples x(L−T−1) to x(L−2) preceding the last sample x(L−1) are read out of the buffer 100 as the sample sequence ΔS′ consisting of consecutive samples forming part of the current frame, then the sample sequence ΔS is rearranged in a reverse order in a reverse arrangement part 142, from which the samples x(L−2), x(L−3), . . . , x(L−T−1) are provided as an alternative sample sequence AS′, and the alternative sample sequence AS′ is stored by the writing part 143 in the buffer 100 so that it is concatenated to the last sample x(L−1) of the current frame.
Thereafter, a sequence of processed samples n=−1 to n=L+T−1, that is, x(−T), . . . , x(−1), x(0), x(1), . . . , x(L−2), x(L−1), x(L), . . . , x(L+T−1), is read out by the reading part 141 from the buffer 100 and supplied to an FIR filter 150. The filter provides its filtered output y(0), . . . , y(L−1). In this example, the alternative sample sequence AS consists of the forward samples in the frame FC arranged symmetrically with respect to the first sample x(0), and the alternative sample sequence AS′ similarly consists of the samples in the frame FC arranged symmetrically with respect to the last sample x(L−1). In the forward and rearward end portions of the filter output, signal waveforms are symmetrical about the first and last samples x(0) and x(L−1), respectively, and hence frequency characteristics in front of and behind each of the first and the last samples bear similarity to each other; therefore, it is possible to obtain filter outputs y(0), . . . , y(L−1) which are smaller in variations of their frequency characteristics than in the case of the alternative sample sequences AS and AS′ being used and consequently smaller in errors than in the case where the preceding and succeeding frames are present.
Incidentally, in a windowing part 144 indicated by the broken line in
As regards the alternative sample sequence AS′, the sample sequence ΔS′ prior to the reverse arrangement may be multiplied by the window function ω(n).
The configuration of
For example, as shown in
A description will be given of Embodiment 2 in which the first mode of working of the invention is applied to the
That is, as shown in
Also in Embodiment 2, the samples can be read out one by one and fed to the FIR filter 150. For example, as parenthesized in step S2 of
As described above, according to Embodiments 1 and 2, it is possible to perform, by use of the sample sequence SFC of one frame, the digital processing that requires samples which form part of each of the preceding and succeeding frames—this provides enhanced signal continuity, quality and coding efficiency.
Embodiment 3 of the first mode of working of the invention provides auxiliary information representing either predetermined various alternative sample sequence generating methods or the most desirable alternative sample generating method by changing the position of taking out the sample sequence ΔS (or ΔS, ΔS′), or/and auxiliary information indicating the position where to take out the sample sequence ΔS. This embodiment is applied to, for example, the coding/decoding system shown in
The following is a list of examples of possible alternative sample sequence generating methods.
1. In
2. In
3. In
4. In
5. In
6. In
7. In
8. In
9. In Embodiment 1: no window function used;
10: In Embodiment 1: window function used;
11. In
12. In
13. In
14. In
15. In
16. In
17. In
18. In
Since methods 9 and 10 are contained in methods 6 and 8, respectively, methods 9, 10 and methods 6, 8 are not selected at the same time. In general, methods 1 to 4 generate favorable alternative pulse sequences than do methods 11 to 14, and hence they are not selected at the same time. Similarly, methods 5 to 8 and methods 15 to i 8 are not selected at the same time. Accordingly, a plurality of kinds of methods is predetermined as methods 1, . . . , M which includes, for example, one or more of methods 1 to 8 or one of more of methods 1 o 4 and either one of methods 9 and 10. Only one of methods 1 to 8 may sometimes be selected.
These predetermined generating methods are prestored in a generation method storage part 160 in
The select control part 170 calculates, in a similarity calculating part 171, calculates similarity between the candidate alternative sample sequence in the current frame FC and the corresponding sample sequence in the preceding frame FB or succeeding frame FF. In the similarity calculating part 171, as shown, for example, in
If the input candidate alternative sample is the sample sequence AS corresponding to that of the preceding frame, it is stored in a register 174, and the square error between the sample sequence AS and the sample sequence x(−T), . . . , x(−1) stored in the register 172 is calculated in a distortion calculating part 175. If the input candidate alternative sample is the sample sequence AS′ corresponding to that of the succeeding frame, it is stored in a register 176, and the square error between the sample sequence AS′ and the sample sequence x(L), . . . , x(L+T−1) stored in the register 173 is calculated in the distortion calculating part 175.
It can be said that the smaller the calculated square error (or weighted square error) is, the smaller the distortion of the candidate alternative sample sequence, that is, the greater its similarity to the corresponding to the last sample sequence of the preceding frame or the first sample sequence of the succeeding frame. The similarity may also be judged on the basis of the inner product (or cosine) of the vectors of each sample sequence and the vector of the corresponding sample sequence in such a manner that the similarity increases with an increase in the value of the inner product. In any of methods 1 to 8, the positions τ1 and τ2 are changed, for example, to τ=0, L−1, and the sample sequences at the position where the similarity is maximum is used as the candidate alternative sample sequences of the maximum similarity by that method. In the case of selecting two or more of methods 1 to 8, candidate alternative sample sequences of the maximum similarity are selected among those of the maximum similarity by the respective methods.
The alternative sample sequences AS and AS′ of the maximum similarity among the alternative sample sequences thus obtained by the respective methods are concatenated to the front and back of the sample sequence SFC of the current frame FC, thereafter being provided to the FIR filter 150. And information AIAS indicating the method used for generating the adopted alternative sample sequences AS and AS′, in the case of using methods 1 to 8, auxiliary information AI composed of information AIP indicating the position τ (or τ1 and τ2) of the taken-out sample sequence ΔS (or this taken-out sample sequence and ΔS′), and in the case of using only one of methods 1 to 8, only information AIP, is generated in an auxiliary information generating part 180, and if necessary, the auxiliary information AI is encoded in an auxiliary information coding part 190 into an auxiliary code CAI. The auxiliary information AI or auxiliary code CAI is transmitted or stored after being added to part of the current frame FC generated in the coder 10 shown in
In Embodiments 1 and 2, when τ (or τ1, τ2) is fixed, a pre-notification to that effect is provided to the decoding side, no auxiliary information is required.
A description will be given, with reference to
In the first place, the parameter m indicating the generating method is initialized at 1 (S1), then the method m is read out of the storage part 160 and set in the alternative sample sequence generating part 110 (S2), and the alternative sample sequences (candidates) AS and AS′ (S3). The similarity Em between the alternative sample sequences AS, AS′ and the preceding and succeeding frame sample sequences is obtained (S4), then a check is made to see if the similarity Em is higher than the maximum similarity EM until then (S5), and if so, EM is updated with Em (S6), after which the alternative sample sequence AS (or this sample sequence and AS′) prestored in the memory 177 (
When Em is not greater than EM in step S5, and after step S7, a check is made to see if m=M (S8), and if so, m is incremented by one in step S9, followed by a return to step S3 to proceed to the generation of the alternative sample sequence by the next method. If m=M in step S8, the alternative sample sequence AS (or AS and AS′) stored at that time is concatenated to the front and back of the sample sequence SFC of the current frame FC (S10), then the combined sample sequence is subjected to FIR filtering (S11), and the information AIAS indicting the method of generating the adopted alternative sample sequence or/and the auxiliary information AI indicating the position information AIP are generated (S12).
In the methods 1 to 8 for changing the position τ or τ1, τ2, the alternative sample sequence of the greatest similarity can be generated by the same steps as those S1 to S9 shown in
As described above, the most desirable alternative sample sequence is generated from the sample sequence SFC of the current frame FC and the auxiliary information AI is output as part of the code of the frame FC; therefore, in the case where digital signal processing for decoding the code of this frame requires samples of the preceding (past) and succeeding (future) frames (for example, the up-sampling part 34 of the decoder 30 in
This embodiment is applied to one portion of coding of a digital signal, for instance; a sample sequence similar to the leading portion (the leading sample sequence) in a frame is taken out therefrom, then similar sample sequence is multiplied by a gain (including a gain 1), and the gain-multiplied similar sample sequence is subtracted from the leading sample sequence is subjected to autoregressive prediction to generate a prediction error signal, thereby preventing the prediction efficiency from impairment by discontinuity. Incidentally, the smaller the prediction error, the high the prediction efficiency.
Embodiment 4 is applied, for example, to the prediction error generating part 51 in the coder 10 in
The digital signal (sample sequence) SFC={x(0), . . . , x(L−1)} of one frame FC to be processed is prestored in the buffer 100 in
For n=0, . . . , p−1: v(n)=x(n)−u(n)′
For n=p, . . . , L−1: v(n)=x(n)
The sample sequence x(n+τ), . . . , x(n+τ+p−1) may be multiplied by the gain β before it is shifted to the front position in the frame to form the sample sequence u(n)′.
An alternative sample sequence v(−p, . . . , v(−1) consisting of p (number of prediction orders) is concatenated to the front of the lead sample v(0) in an alternative sample sequence concatenating part 240 as shown in
The sample sequence v(−p), . . . , v(L−1) with the alternative sample concatenated thereto is input to the prediction error generating part 5, which generates a prediction error signal y(0), . . . , y(L−1) by autoregressive prediction (S5).
The position τ of the similar sample sequence x(n+τ), . . . , x(n+τ+p−1) and the gain β are determined such that, for example, the power of the prediction error signal y(0), . . . , y(L−1) becomes minimum. In this instance, τ and β are determined using the power of the prediction error signal from y(0) to y(2p) because once the calculation of the prediction value comes to use p samples subsequent to v(p) the prediction error power is not related to the part in the in the current frame from where the similar sample sequence x(n+τ), . . . , x(n+τ+p−1) is derived. The method of this determination is the same as the alternative sample sequence AS determining method described previously with reference to
A prediction error signal for the sample sequence v(−p), . . . , v(L−1) generated using τ and β determined as described above is generated, and the auxiliary information AI indicating τ and β used therefor is generated in an auxiliary information generating part 270 (S6), and if necessary, the auxiliary information AI is coded by an auxiliary information coding part 280 into a code CAI. The auxiliary information AI or code CAI is added to a part of a code of the input digital signal of the frame FC encoded by the coder.
In the above, the value of τ may preferably be greater than the prediction order p, and it is advisable to determine τ such that the sum, ΔU+τ, of the length ΔU of the similar sample sequence u(n) and τ is smaller than L−1, that is, x(τ+ΔU) falls within the scope of the frame FC concerned. The length ΔU of the similar sample sequence u(n) needs only to be equal to or smaller than τ and is not related to the prediction order p; it may be equal to or smaller or larger than p but may preferably be equal to or greater than p/2. Moreover, the front position of the similar sample sequence u(n) need not always be aligned with the front position in the frame FC, that is, u(n) may be set with n=3, . . . , 3+ΔU, for instance. The gain β, by which the similar sample sequence u(n) is multiplied, may be assigned a weight depending on the sample, that is, the sample sequence u(n) may be multiplied by a predetermined window function ω(n), in which case the auxiliary information needs only to indicate τ.
The embodiment of the prediction synthesis processing method corresponding to Embodiment 4 will be described as Embodiment 5. This prediction synthesis processing method is used in the decoding of the code of the digital signal encoded frame by frame, for example, in the prediction synthesis part 63 in the decoder 30 shown in
For example, in the buffer 100 there is stored a sample sequence y(0), y(L−1) of the current frame FC of the digital signal (a prediction error signal) to be subjected to prediction synthesis by the autoregressive prediction scheme, and the sample sequence y(0), . . . , y(L−1) is read out by a read/write part 310.
On the other hand, an alternative sample sequence AS={v(−p), . . . , v(−1)} of the length p equal to the prediction order p is generated in an alternative sample sequence generating part 320 (S 1). The alternative sample sequence used in this case is a predetermined sample sequence consisting of samples 0, . . . , 0, fixed values d, . . . , d, or other predetermined sample sequence. The samples of the alternative sample sequence v(−p), . . . , v(−1) are sequentially fed to the prediction synthesis part 63 with the lead sample v(−p) at the head, as substitutes for the last p samples of the prediction error signal of the frame immediately preceding the current frame FC, to the prediction synthesis part 63 (S2), after which the samples of the sample sequence y(0), . . . , y(L−1) to be subjected to prediction synthesis are sequentially fed to the prediction synthesis part 63 with the lead sample at the head, and prediction synthesis processing is carried out to generate a prediction synthesis signal v(n) (where n=0, . . . , L−1) (S3). The prediction synthesis signal v(n)′ thus obtained is temporarily stored in the buffer 100.
The auxiliary decoding part 330 decodes the auxiliary code CAI forming part of the code of the current frame FC to obtain auxiliary information, from which τ and β are obtained (S4). The auxiliary decoding part 320 may sometimes be supplied with the auxiliary information itself. In a sample sequence acquiring part 340, τ is used to replicate from the synthesis signal (sample) sequence a sample sequence v(τ), . . . , v(τ+p) consisting of a predetermined number p of consecutive samples in this case, that is, the prediction synthesis signal sequence v(n) is obtained intact as the replicated sample sequence v(τ), . . . , v(τ+p) (S5), then this sample sequence is so shifted as to bring its forefront to the front position of the frame FC to provide the sample sequence u(n), which is multiplied by the gain β from the auxiliary information in a gain multiplying part 350 to generate a corrected sample sequence u(n)′=βu(n) (S6).
This corrected sample sequence u(n)′ is added to the prediction synthesis sample (signal) sequence v(n) to provide a normal prediction synthesis signal x(n) (where n=0, . . . , L−1) (S7). The prediction synthesis sample sequence x(n) is as follows:
n=0, . . . , p−1: x(n)=v(n)+u(n)′
n=p, . . . , L−1: x(n)=v(n)
A control part 370 of the processing part 300 controls the respective parts to perform their processing.
In the way described above, a prediction synthesis signal of excellent continuity and quality can be obtained from only the frame FC. Since Embodiment 5 corresponds to Embodiment 4, the length ΔU of the corrected sample sequence u(n)′ is not limited specifically to p, that is, it is not related to the prediction order but predetermined; and the position of the lead sample of the corrected sample sequence u(n)′ need not be the same as the position of the lead sample v(0) of the synthesis signal v(n) but this is also predetermined. Moreover, in some cases the gain β is not contained in the auxiliary information and it is weighted by a predetermined window function ω(m) for each sample u(n).
Second Mode of Working
In the second mode of working of the present invention, the digital signal of the frame concerned is processed using a filter tap number or prediction order dependent only on usable samples (in the frame concerned), instead of using the samples x(1), x(2), . . . preceding (past) the lead sample of the frame concerned or the samples x(L), x(L+1), . . . succeeding the last sample x(L−1) of the frame concerned.
A description will be given of Embodiment 6 in which the second mode of working is applied to the case of making the autoregressive prediction. With reference to
A prediction coefficient estimating part 53 pre-calculates a 1st-order prediction coefficient {α(1)1}, a 2nd-order prediction coefficient {α(2)1, α(2)2}, . . . , a pth-order prediction coefficient {α(p)1, . . . , α(p)p}, using the samples x(0), . . . , x(L−1) of the current frame in the buffer.
The lead sample x(0) of the current frame FC is output intact as the prediction error signal y(0).
With respect to the next sample x(1), the product of the 1st-order prediction coefficient α(1)1, from the prediction coefficient estimating part 53 and x(0) is calculated in a multiplying part M1 to obtain a prediction value, and the prediction value is subtracted from x(1) to obtain the prediction error signal y(1).
Upon input of the next sample x(2), a convolution, α(2)1x(1)+α(2)2x(0), of the 2nd-order prediction coefficients α(2)1, α(2)2 from the prediction coefficient estimating part 53 and x(0), x(1) is performed in a multiplying part M2 to obtain a prediction value, and this prediction value is subtracted from x(2) to obtain the prediction error signal y(2).
Thereafter, upon each input of a sample a convolution is carried out between a prediction coefficient of the prediction order increased one by one through use of all preceding samples until then and the preceding samples to obtain a prediction value, and the prediction value is subtracted from the input sample at that time to obtain a prediction error signal.
That is, at the coding side (at the transmitting side), despite the presence of the frame FB preceding the current frame FC, no sample of the preceding frame is used; for the first (n=0) sample x(0) of the current frame FC, no linear prediction is made, and hence the prediction value y(0)=x(0) output. For the second to pth samples x(1) to x(p−1), convolutions are carried out between the samples x(0) to x(n) (where n=1, . . . , p−1) and nth-order prediction coefficients α(n)1, . . . , α(n)n to obtain prediction values x(n)′. For the samples subsequent to the (p+1)th sample, p samples x(n−p), . . . , x(n−1) (where n=p+1, p+2, . . . , L−1) are convoluted with pth-order prediction coefficients α(p)1, . . . , α(p)p to obtain prediction values x(n)′. In other words, the prediction values are obtained by the same scheme as used in the past. Incidentally, the pth-order prediction coefficients α(p)1 . . . , α(p)p in step S7 may be calculated in step S0 indicated by the broken-line block, and in step S4 the nth-order prediction coefficients α(n)1, . . . , α(n)n may be calculated from the pth-order prediction coefficients. Alternatively, in the course of calculating the pth-order prediction coefficients in step S0 the nth-order (where n=1, . . . , p−1) prediction coefficients may be calculated, respectively. The pth-order prediction coefficients are coded and set as auxiliary information to the receiving side.
An example of the procedure described above is shown in
A check is made to see if n is p (S6), and if not, then the procedure returns to step S3, and if n=p, then the pth-order prediction coefficients α(p)1, . . . , α(p)p are calculated from all the samples x(0), . . . , x(L−1) (S7), then a convolution is carried out between the prediction coefficients and the immediately preceding p past samples x(n−p), . . . , x(n−1) to obtain a prediction value, and the prediction value is subtracted from the current sample x(n) to obtain the prediction error signal y(n) (S8). In other words, Eq. (2) is calculated. A check is made to see if processing of all required samples is completed (S9), and if not, then n is incremented by one and the procedure returns to step S8 (S10); if completed, the processing ends.
Since the pth-order linear prediction requires past p samples, the prediction for the leading samples x(0), . . . , x(p−1) of the current frame calls for rear-end samples of the preceding frame, but as in Embodiment 6, by sequentially increasing the prediction order progressively from 0 to p−1 for the samples of sample numbers n=0 to n=p−1 and by performing the pth-order prediction for the samples after the sample number n=p, (consequently, by performing the prediction without using samples of the preceding frame), it is possible to reduce discontinuity of the prediction signal between the preceding and current frames.
Upon input of the next prediction error signal y(2), a convolution is conducted in the multiplying part M2 between the 2nd-order prediction coefficients α(2)1, α(2)2 from the prediction coefficient decoding part 66 and the prediction error signal y(0), y(1) to obtain a prediction value, which is added to y(2) to obtain a synthesis signal x(2). Thereafter, upon input of y(n) until n=p, y(0), . . . , y(n−1) are convoluted with the nth-order prediction coefficients α(n)1, . . . , α(n)n by the following calculation to obtain a prediction value:
The prediction value is added to y(n) to generate a prediction synthesis signal x(n). After n=p, as is the case with the prior art, the immediately preceding n prediction error signals y(n-p), . . . , y(n−1) are convoluted with the pth order prediction coefficient by Eq. (3) to obtain a prediction value, which is added to y(n) to obtain a prediction synthesis signal x(n). In this prediction synthesis, too, by setting the prediction coefficients to the values shown in the
In the linear prediction coefficients, an ith coefficient α(q)i of an order q takes a different value in accordance with the value of the order q. Accordingly, in Embodiment 7 described above, it is necessary that the prediction coefficient values by which the past samples are multiplied in the multiplying parts 241, . . . , 24p be changed for each input of the sample x(n) in such a manner that, for example, in
On the other hand, in PARCOR coefficients an ith coefficient remains unchanged even if the value of the order q changes. That is, PARCOR coefficients k1, k2, . . . , kp do not depend on the order. It is well-known that the PARCOR coefficient and the linear prediction coefficient are reversibly transformed to each other. Accordingly, it is possible to calculate the PARCOR coefficients k1, k2, . . . , kp from the input sample, the 1st-order prediction coefficient α(1)1 from the coefficient k1, and the 2nd-order prediction coefficients α(2)1, α(2)2 from the coefficients k1, k2; thereafter, (p−1)th-order prediction coefficients α(p−1)1, . . . , α(p−1)p−1 can similarly be obtained from the coefficients k1, . . . , kp−1. This calculation can be expressed as follows:
For i=1: α(1)1=k1
For i=2, . . . , p; α(i)i=−k1
α(i)j=α(i−1)j−kiα(i−1)i−j, j=1, . . . , i−1
This calculation can be conducted in a shorter time and hence more effectively than in the case of calculating {α(1)1}, {α(2)1, α(2)2}, {α(3)1, α(3)2, α(3)3}, . . . . , {α(p−1)1, α(p−1)2, . . . , α(p−1)p−1} by linear prediction for the sample number n=1, . . . , p−1 as described previously with reference to Embodiment 7.
Then Embodiment 8 uses the linear prediction coefficients α1, . . . , αp that are calculated from the PARCOR coefficients in the prediction coefficient determining part 53 in
The prediction coefficient determining part 53 calculates pth-order PARCOR coefficients k1, k2, . . . , kp by linear prediction analysis from all the sample SFC={x(0), . . . , x(L−1)} of the current frame, which coefficients are separately coded and sent as the auxiliary information CA.
For the input sample x(0), the prediction coefficient determining part 53 outputs it intact as y(0).
Upon input of x(1), the prediction coefficient determining part 53 calculates α(1)1 from k1 and sets it in the corresponding multiplier, from which is output a 1st-order prediction error y(1)=x(1)−[α(1)1x(0)].
Upon input of x(2), the prediction coefficient determining part 53 calculates 2nd-order prediction coefficients α(2)1, α(2)2 from k1 and k2, and sets them in the corresponding multiplier, from which is output a 2nd-order prediction error y(2)=x(2)−[α(2)1x(0)+α(2)2x(1)].
Upon input of x(3), the prediction coefficient determining part 53 calculates 3rd-order prediction coefficients α(3)1, α(3)2, α(3)3 from k1 k2 and k3, and sets them in the corresponding multiplier, from which is output a 3rd-order prediction error y(3)=x(3)−[α(3)1x(0)+α(3)2x(1)+α(3)3x(2)].
Similarly, until the sample x(p) is reached, the prediction order is increased in a sequential order, and thereafter pth-order prediction coefficients α(p)1, . . . , α(p)p are used.
In Embodiment 8 the invention has been described as being applied to the case of using, as the prediction error generating part 51, the autoregressive linear predictor shown in
In such pth-order PARCOR filtering that uses the PARCOR coefficient k, too, it is possible to reduce the discontinuity of the prediction error signals of the preceding and current frame by sequentially increasing the prediction order from 0 to p−1 for the sample numbers n=0 to n=p−1 and performing the pth-order prediction after the sample number n=p.
The prediction error samples y(n) are sequentially input to the adder 27Ap of the first stage (j=p) and are processed using the preset PARCOR coefficients k1, . . . , kp, by which the prediction synthesis signal sample x(n) are provided at the output of the adder 27A1 of the last stage (J=1). In this embodiment that performs the prediction synthesis using the PARCOR filter, too, the PARCOR coefficients k1, . . . , kp may be those shown in
A description will be given below of the procedure for performing the
The first sample x(0) is used intact as the prediction error signal sample y(0).
y(0)←x(0)
Upon input of the second sample x(1), the error signal y(1) is calculated by the 1st-order prediction alone.
y(1)←x(1)−k1x(0)
x(0)←x(0)−k1x(1)
Upon input of the third sample x(2), the prediction error signal y(2) is obtained by the following calculation. But x(1) is used to calculate y(3) in the next step.
t1←x(2)−k1x(1)
y(2)←t1−k2x(0)
x(0)←x(0)−k2t1
x(1)←x(1)−k1x(2)
Upon input of the fourth sample x(3), y(3) is obtained by the following calculation. But x(1) and x(2) are used to calculate y(4) in the next step.
t1←x(3)−k1x(2)
t2←t1−k2x(1)
y(3)←t2−k3x(0)
x(0)←x(0)−k3t2
x(1)←x(1)−k2t1
x(2)←x(2)−k1x(3)
Thereafter similar calculations are conducted. In this way, prediction processing can be started with the samples of the current frame. Furthermore, until p+1 samples x(n) are input, the k parameter remains unchanged, and another parameter is newly calculated and the order is incremented by one; once p coefficients are determined, the coefficients need only to be updated one by one upon each input of sample.
Similarly, prediction synthesis processing by the PARCOR filter shown in
As the first synthesis sample x(0) the input prediction error sample y(0) is used intact.
x(0)←y(0)
The second prediction synthesis sample x(1) is synthesized only by a 1st-order prediction.
x(1)←y(1)+k1x(0)
x(0)←x(0)−k1x(1)
The third prediction synthesis sample x(2) is obtained by the following calculation. But x(0) and x(1) are used to calculate x(3) in the next step, and they are not output.
t1←y(2)+k2x(0)
x(2)←t1+k1x(1)
x(0)←x(0)−k2t1
x(1)←x(1)−k1x(2)
x(3) is obtained by the following calculation. But x(0), x(1) and x(2) are used to calculate x(4) in the next step, and they are not output.
t2←x(3)+k3x(0)
t1←t2+k2x(1)
x(3)←t1−k1x(2)
x(0)←x(0)−k3t2
x(1)←x(1)−k2t1
x(2)←x(2)−k1x(3)
Thereafter similar calculations are carried out.
In the PARCOR filter of
In the PARCOR filter of
The PARCOR filter of
Embodiment 9 described above shows the case in which the autoregressive linear prediction filter processing does not use samples of the past frame but instead sequentially increases the order of linear prediction from the starting sample of the current frame to a predetermined number of samples; Embodiment 10 described below does not use samples of the past frame, either, in FIR filter processing and sequentially increases the tap number.
As described above, in the
Step 1: Initialize the sample number n and a variable t to zeros.
Step S2: Perform a convolution for the input sample by the following calculation to output the y(n).
Step S3: Increment t and n by one, respectively.
Step S4: Make a check to see if n=T, and if not, return to step S2 and perform steps S2, S3 and S4. As a result, a convolution is carried out with the tap number increased with an increase of n.
Step S5: If n=T, perform convolution by the following calculation to output y(n).
Step S6: Increment n by one.
Step S7: Make a check to see if n=L−T, and if so, return to step S5 and perform steps S5, S6 and S7 again. As a result, filtering is repeatedly carried out with a tap number 2T+1 until n=L−T is reached.
Step S8: If n=L−T, perform a convolution by the following calculation to output y(n).
Step S9: Make a check to see if n=L−1, and if not, end filtering.
Step 10: If not n=L−T, increment n by one and decrement T by one, then return to step S8 and perform step S8 and S9 again. As a result, filtering is carried out with the tap number gradually decreased with an increase of n toward the rear end of the frame.
Embodiment 11 utilizes the scheme of gradually increasing the prediction order by Embodiment 10 without using the alternative sample sequence in Embodiment 4. This embodiment will be described below with reference to
As depicted in
As described previously in respect of
For n=0, . . . , p−1: v(n)=x(n)−u(n)′
For n=p, . . . , L−1: v(n)=x(n)
After multiplication of the sample sequence x(n+τ), . . . , x(n+τ+p−1) the multiplied sample sequence may be displaced to the front position in the frame to form the sample sequence u(n)′.
The sample sequence v(0), . . . ., v(L−1) is input to the prediction error generating part 51, wherein it is subjected to the autoregressive prediction, described previously with reference to
The position τ and the gain β of the similar sample sequence x(n+τ), . . . , x(n+τ+p−1) are determined under the control of the selection/determination control part 260 as described previously with reference to Embodiment 4.
A prediction error signal is generated for the sample sequence v(p), v(L−1) generated using the τ and β determined as described above (S4), then the auxiliary information AI indicating the τ and β used at that time is generated in the auxiliary information generating pat 270, and if necessary, the auxiliary information AI is coded into the code CAI in the auxiliary information coding part 28. The auxiliary information AI or code CAI is added to as part of the encoding code of the input digital signal of the frame FC by the coder.
In the above, the value τ may preferably be larger than the prediction order p, and the value τ needs only to be determined such that the sum, ΔU+τ, of the length ΔU of the similar sample sequence u(n) and τ is equal to or smaller than L−1, that is, x(τ+ΔU) falls within the range of the current frame FC. The length ΔU of the similar sample sequence u(n) needs only to be equal to or smaller than τ, is not related to the prediction order p and may be equal to or smaller or larger than p, but it may preferably be equal to or greater than p/2. The front position of the similar sample sequence u(n) need not be brought into agreement with the front position in the frame FC, that is, the sample sequence u(n) may be shifted to such a position that n=3, . . . , 3+ΔU, for instance. The gain β for multiplying the similar sample sequence u(n) may also be weighted in dependence on the sample, that is, the sample sequence u(n) may be multiplied by a predetermined window function ω(n), in which the auxiliary information is enough to indicate τ alone.
A description will be given, with reference to
The example of the functional configuration of
The sample sequence y(0), . . . , y(L−1) of the current frame FC of the digital signal (a prediction error signal) to be subjected to prediction synthesis processing by the autoregressive prediction scheme is prestored, for example, in the buffer 100, from which the sample sequence y(0), . . . , y)L−1) is read out by the read/write part 310.
The sample sequence y(0), . . . , y(L−1) is fed to the prediction synthesis part 63, with the first sample in the head (S1). The sample sequence is subjected to the prediction synthesis processing to generate a prediction synthesis signal v(n) (where n=0, . . . , L−1) (S2). The prediction synthesis signal v(n)′ is temporarily stored in the buffer 100. This prediction synthesis utilizes the scheme described previously with reference to
In the auxiliary information decoding part 330 the auxiliary code CAI, which forms part of the code of the current frame FC, is decoded into auxiliary information, from which τ and β are obtained (S3). In some cases, the auxiliary information itself is input to the auxiliary information decoding 320. In the sample sequence acquiring part 340 a sample sequence v(τ), v(τ+p) consisting of a predetermined number p, in this example, of consecutive samples, is replicated from the synthesis signal (sample) sequence v(n) by use of τ, that is, the sample sequence v(τ), . . . , v(τ+p) is acquired with the prediction synthesis signal sequence v(n) unchanged (S4), and this sample sequence is shifted to bring its forefront to the front position in the frame FC to obtain a sample sequence u(n), which is multiplied in the gain multiplying part 350 by the gain β obtained from the auxiliary information, thereby generating a corrected sample sequence u(n)′=βu(n) (S5).
This corrected sample sequence u(n)′ is added to the prediction synthesis sample (signal) sequence v(n) to obtain a normal prediction synthesis signal x(n) (where n=0, . . . , L−1) (S6). The prediction synthesis sample sequence x(n) is:
For n=0, . . . , p−1: x(n)=v(n)+u(n)′
For n=p . . . , L−1: x(n)=v(n)
Since Embodiment 12 corresponds to Embodiment 11, the length ΔU of the corrected sample sequence u(n)′ is not limited-specifically to p, that is, it is not related to the prediction order but is predetermined; and the position of the lead sample of the corrected sample sequence u(n)′ need not always be brought into agreement with the lead sample v(0) of the synthesis signal v(n) and this also predetermined. Moreover, in some cases the gain β is not contained in the auxiliary information but instead it is weighted by a predetermined window function ω(n) for each sample u(n).
Third Mode of Working
In the third mode of working of the present invention, for example, in the case where frame-wise coding of the original digital signal includes processing for generating an autoregressive prediction error signal or interpolation filter processing, the last sample sequence of the (past) frame immediately preceding the current frame or the leading sample sequence of the current frame is coded separately, and the code (auxiliary code) is added to a part of the encoded code of the current frame of the original digital signal. At the time of subjecting the above-mentioned prediction synthesis or interpolation filter processing at the decoding side, when there is no code of the (past) frame preceding the current frame, the auxiliary code is decoded, and decoded sample sequence is used as a rear-end synthesis signal of the preceding frame in the prediction synthesis of the current frame.
A description will be given, with reference to
The input sample sequence x(n) is branched into two, one of which is provided to an auxiliary sample sequence obtaining part 410, wherein the rear-end samples x(−p), . . . , x(−1) of the (past) frame immediately preceding the current frame FC are obtained by a number equal to the prediction order p in the prediction error generating part 51, and the samples thus obtained are provided as an auxiliary sample sequence. The auxiliary sample sequence x(−p), . . . , x(−1) is coded in an auxiliary information coding part 420 to generate an auxiliary code CA, and this auxiliary code CA is used as a part of the encoded code of the original digital signal of the current frame FC. In this example, the main code Im, the error code Pe and the auxiliary code CA are combined in the combining part 19, from which they are output as a set of codes of the current frame FC, which is transmitted or recorded.
The auxiliary information coding part 420 does not always encode the auxiliary sample sequence x(−p), . . . , x(−1) (which is usually a PCM code) but instead may outputs the sample sequence after adding thereto a code indicating that it is an auxiliary sample sequence. Preferably, the auxiliary sample sequence is subjected to compression coding, for example, by a differential PCM code, prediction code (prediction error+prediction coefficient) or vector quantization code.
As indicated by the broken lines in
A description will be given, with reference to
In the absence of the code set of the preceding (past) frame, for example, when the code set (Im, Pe, CA) of the preceding frame is not available due to packet dropout during transmission, or when decoding is started from the code set of an intermediate one of a plurality of consecutive frames for random access, the absence of the code set of the preceding frame is detected in a dropout detecting part 450, then the auxiliary code CA (or CA′) (the auxiliary code CA or CA′ described previously with reference to Embodiment 13) separated in the separating part 32 is decoded in an auxiliary decoding part 460 into the auxiliary sample sequence x(−p), . . . , x(−1) (or x(0), . . . , c(p−1)), then this auxiliary sample sequence is input as a prediction-synthesis rear-end sample sequence x(−p), . . . , c(−1) to the prediction synthesis part 63, then the prediction error signals y(0), . . . , y(L−1) of the current frame are sequentially input to the prediction synthesis part 63, which performs prediction synthesis to generate the synthesis signal x( ), . . . , x(L−1). The auxiliary code CA (CA′) is double and hence is redundant, but a prediction synthesis signal of excellent continuity and quality can be obtained. The decoding scheme in the auxiliary decoding part 460 is a scheme corresponding to the coding scheme in the auxiliary information coding part 420 in
In the above there has been described, with reference to FIGS. 36 to 39, the digital signal processing associated with, for example, the prediction error generating part 51 in the coder 10 and the prediction synthesis part 63 in the decoder in
The most outstanding feature of the embodiments of FIGS. 36 to 39 is such as described below. That is, in the coding and decoding system in
Various codes can be used as the auxiliary code as referred to previously, but since the auxiliary sample sequence consists of a very small number of samples nearly equal to the prediction order, for instance, if a PCM code of the sample sequence, for example, is used as the auxiliary code CA, the auxiliary code CA of the current frame can be used intact as raw auxiliary sample sequence data after detection of the frame dropout at the decoding side, and hence decoding can be started at once. The application of this scheme to the RIF filter of the up-converting part also produces the same effects as mentioned above.
In the case of receiving video, audio or like information being delivered over the Internet, users cannot make random access at any frame and, in general, they are allowed to make random access only at the head PH of a first frame FH of a frame sequence forming a super frame SF shown in
At the point in time the receiving side makes random access to the first frame, it has no information on the preceding frame, and hence it concludes processing only with samples in the first frame. In such an instance, too, if the frame concerned is subjected to the digital signal processing by the present invention described above in its embodiments, it is possible to increase the accuracy of linear prediction immediately after random access and hence start high-quality reception in a short time.
For only the random-access starting frame, the digital processing is concluded with only samples in that frame without using samples of the preceding frame. This permits implementation of either of forward linear prediction and backward linear prediction. On the other hand, at each frame boundary PF it is possible to start linear prediction processing that utilizes samples of the immediately preceding frame.
In this practical embodiment the backward prediction part 511 performs linear prediction backward of the header symbol of the random-access starting frame. The prediction error generating part 51 performs forward linear prediction for the samples of frames. The decision part 512 encodes the prediction error obtained by the forward linear prediction of the samples of the random-access starting frame by the prediction error generating part 51 and encodes the prediction error obtained by the backward linear prediction of the samples of the starting frame by the backward linear prediction part 511, then compares the amounts of codes, and provides select information SL for selecting the code of the smaller amount to a select part 513. The select part 513 selects and outputs the prediction error signal y(n) of the smaller amount of code for the random-access starting frame, and for the subsequent frames the select part selects the output from the prediction error generating par 51. The select information SL is coded in the auxiliary information coding part 514 and output therefrom as the auxiliary code CA.
As described previously, in the prediction error generation processing of the sample sequence at the coding side in the embodiments of FIGS. 17 and of the sample sequence at the coding side in the embodiments of
As shown in
As depicted in
Other Modifications
The second and third modes of working are applicable not only to the case of using the autoregressive filter but also generally to FIR filtering or the like as is the case with the first mode of working of the invention. Furthermore, in each of the above-described embodiments the alternative sample sequences AS and AS′ may be replaced with high-order bits of the sample sequences, or the alternative samples sequences AS and AS′ may be obtained by using only high-order bits of samples of the sample sequences ΔS and ΔS′ extracted from the current frame to form the samples sequences AS and AS′.
While in the above the processing of the current frame utilizes the sample sequence in the current frame as a substitute for sample sequences of the preceding or/and succeeding frames, provision may be made to conclude the processing with samples only in the current frame without using such a substitute sample sequence.
For example, in a short filter of a small tap number, a simple extrapolation can be made in the case of smoothing or interpolating a sample value after up-sampling, for instance. For example, in
For the extrapolation of the sample x(0) the sample x(1) closest thereto is used intact as shown in
The digital signal to be processed in the above is processed usually on the frame-wise basis, but nay signals can be used as long as they require filtering over the frame preceding or/succeeding the current frame; conversely speaking, the present invention is intended for processing that calls for such filtering, and it is not limited specifically to coding and decoding processing, and in the case of coding and decoding, it is applicable to any of reversible coding, reversible decoding and irreversible coding, irreversible decoding.
The digital processor (identified as processing part in some of the accompanying drawings) of the present invention described above can be implemented by executing programs by a computer. That is, programs for causing the computer to performs respective steps of the above-described various digital signal processing methods of the present invention recorded on a recording medium such as a CD-ROM or magnetic disk, or installed via a communication line into the computer for execution.
According to the embodiments of the present invention described above, it can be said that the digital signal processing method has such a configuration mentioned below.
(A) The digital signal processing method is a processing method using a filter that is used in a coding method for frame-wise coding of a digital signal, and in which the current sample and either of at least p (where p is an integer equal to or greater than 1) immediately preceding samples and Q (where Q is an integer equal to or greater than 1) immediately succeeding samples are linearly coupled, and the sample mentioned herein may be an input signal or an intermediate signal such as a prediction error.
The method is characterized in that:
According to the embodiments of the present invention described above, it can be said that the digital signal processing method has such a configuration mentioned below.
(A) The digital signal processing method is a processing method by a filter which is used in a coding method for coding a digital signal on a frame-wise basis, and in which the current sample and either of at least p (where p is an integer equal to or greater than 1) immediately preceding samples and Q (where Q is an integer equal to or greater than 1) immediately succeeding samples are linearly coupled, and the sample mentioned herein may be an input signal or an intermediate signal such as a prediction error.
The processing method is characterized in that:
an alternative p-sample sequence, which consists of p consecutive samples forming part of the current frame is disposed as the p samples immediately preceding the first sample of the current frame;
the first sample and at least one portion of said immediately preceding alternative sample sequence are linearly coupled by said filter, or an alternative Q-sample sequence, which consists of Q consecutive samples forming part of the current frame, is disposed as the Q samples immediately succeeding the last sample of the current frame; and
the last sample and at least one portion of the immediately succeeding alternative samples are linearly coupled by said filter.
Furthermore, it can be said that the digital signal processing method for decoding, for instance, has such a configuration mentioned below.
(B) The method is a processing method using a filter that is used in a decoding method for frame-wise reconstruction of a digital signal by use of a filter, in which the current sample and either of at least p (where p is an integer equal to or greater than 1) immediately preceding samples and Q (where Q is an integer equal to or greater than 1) immediately succeeding samples are linearly coupled, and the sample mentioned herein is an intermediate signal such as a prediction error;
characterized in that:
in the absence of the immediately succeeding frame:
p consecutive samples, which form part of the current frame, are used as the p alternative samples immediately preceding the first sample of the current frame, and the first sample and at least some of the alternative samples are linearly coupled by said filter; and
in the absence of the immediately succeeding frame:
Q consecutive samples, which form part of the current frame, are used as Q alternative samples immediately succeeding the last sample of the current frame, and the last sample and at least some of the alternative samples are linearly coupled.
As described above, according to the present invention, processing can be concluded in the frame concerned while maintaining substantially unchanged the continuity and coding efficiency of reconstructed signal that are obtainable in the presence of the immediately preceding or/and succeeding frames. This provides increased performance when random access is required on a frame-by-frame basis or when a packet loss occurs.
Number | Date | Country | Kind |
---|---|---|---|
2002-338131 | Nov 2002 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP03/14814 | 11/20/2003 | WO | 5/20/2005 |