The conventional technology comprises the following disclosures:
Audio and speech are in general time varying signals. Because the changes are relatively slow they are usually considered quasi-stationary over short period of time. Adaptive filter parameters (e.g. linear predictive coding, LPC, or long-term post filter, LTP) used for processing audio/speech signals are updated once per frame and kept constant over the frame duration, frame usually having length of 2 to 40 milliseconds. Such filtering is effectively time-varying and thus in general produces instabilities and discontinuities even if filtering with frozen filter parameters doesn't [1].
A cross-fade approach is known. The cross-fade approach may be summarized as:
An infinite impulse response (IIR) filter can be presented in the lattice-ladder form [7]. An interpolation of stable IIR filters in the lattice-ladder form should produce stable time-varying IIR filter [8]. Thus it should be possible to generalize the interpolation of the reflection coefficients from [6] in general case of IIR filters. However such approach would be too complex for the LTP filtering as the number of the non-zero reflection coefficients would be equal to the pitch lag (e.g. if used for smoothing the LTP discontinuities in [3] this would lead to filter orders bigger than 250).
Complexity is very important aspect in a real-time codec and it is desirable to use a method for avoiding discontinuities in time varying filtering with least complexity.
A low-complexity technique is advantageous for performing filtering operations.
According to an embodiment, a system for filtering an information input signal, divided into different update intervals, according to parameters varying with the update intervals, to obtain a filtered output signal may have: a first filter unit to filter a first filter input signal at least at an initial subinterval in a current update interval to obtain a first filter output signal, according to parameters associated to the preceding update interval, the first filter unit being configured to scale the parameters along at least the initial subinterval, wherein the first filter unit is configured to scale the parameters associated to the preceding update interval by a first scaling factor changing towards 0, or a value close to 0, along at least the initial subinterval; and a second filter unit to filter a second filter input signal, at the initial subinterval, according to parameters associated to the current update interval to obtain a second filter output signal, the second filter unit being configured to scale the parameters along at least the initial subinterval, wherein the second filter unit is configured to scale the parameters associated to the current update interval by a second scaling factor changing from 0, or a value close to 0, toward a value different from 0, or a value more distant from 0 than the value close to 0, along at least the initial subinterval, wherein the first filter input signal is based on the information input signal, the first filter output signal is an intermediate signal, the second filter input signal is based on the intermediate signal, and the filtered output signal is based on the second filter output signal.
According to another embodiment, a method for filtering an information input signal divided into different update intervals, according to parameters associated with the update intervals, to obtain a filtered output signal, may have the steps of: performing a first filtering at least at an initial subinterval of a current update interval according to parameters associated to the preceding update intervals, wherein the parameters along at least the initial subinterval are scaled by a first scaling factor changing towards 0, or a value close to 0, along at least the initial subinterval; and performing a second filtering at least at the initial subinterval, according to parameters associated to the current update interval, wherein the parameters along the initial subinterval are scaled by a second scaling factor changing from 0, or a value close to 0, toward a value different from 0, or a value more distant from 0 than the value close to 0, along at least the initial subinterval, wherein the first filtering is performed on the information input signal and the second filtering is performed on the signal obtained by the first filtering.
Another embodiment may have a non-transitory digital storage medium having a computer program stored thereon to perform the inventive method for filtering an information input signal divided into different update intervals when said computer program is run by a computer.
We present a low complexity technique for avoiding discontinuities, e.g., when Infinite Impulse Response (IIR) filter parameters change in consecutive frames.
In accordance to examples, there is provided a system for filtering an information input signal, divided into different update intervals, according to parameters varying with the update intervals, to obtain a filtered output signal, the system comprising:
Accordingly, discontinuities are reduced and/or avoided and the complexity is reduced, e.g., with respect to the cross-fade technique. There is no need for performing two different filter operations and, subsequently, for cross-fading the two filtered signals: simply, two filtering operations are performed, hence reducing the necessity of performing calculations.
In accordance to examples, there is provided a third filter unit to filter the information input signal, at a subsequent subinterval in the current update interval after the initial subinterval according to parameters associated to the current update interval.
Accordingly, apart from the initial subinterval, the current update interval may operate using the parameters which are most suited for it.
In accordance to examples, there is provided a selector configured to check if the distance between parameters for the current update interval is 0 or within a first threshold, so as to filter the information input signal at least in the initial subinterval using the third filter unit.
In accordance to examples, the third filter unit is configured to maintain the same filtering status along the subsequent interval and or to maintain the filter parameter constant.
In accordance to examples, at least one of the first, second and third filter units operates as long term, LTP, filter.
In accordance to examples, at least one of the first, second, and third unit has a transfer function comprising a numerator and a denominator, wherein the numerator comprises a gain value indicated by the gain information, and wherein the denominator comprises an integer part of a pitch lag indicated by the pitch lag information and a multi-tap filter depending on a fractional part of the pitch lag.
In accordance to examples, the parameters of at least one of the first, second, and third unit are obtained from harmonicity information, gain information, pitch lag information, the integer part of the pitch lag of the information input signal and/or the fractional part of the pitch lag of the information input signal.
In accordance to examples, the parameters of at least one of the first, second and/or third filter unit are parameters of a filter chosen among at least one or a combination of, linear predictive coding, LPC, filter, infinite impulse response, IIR, filter, and/or finite impulse response, FIR, filter.
In accordance to examples, the first filter unit is configured to scale parameters associated to the preceding update interval by a first scaling factor changing towards 0 along at least the initial subinterval and/or the second filter unit is configured to scale parameters associated to the current update interval by a second scaling factor changing from 0 toward a value different from 0 (or from a value close to 0 to a second value more distant from 0 than the value close to 0) along at least the initial subinterval.
In accordance to examples, the first scaling factor and the second scaling factor are non-negative values complementary with each other to a value greater than 0.
In accordance to examples, the first scaling factor is to change towards 0 towards the final extremity of at least the initial subinterval, and/or the second scaling factor is to change from 0 from the initial extremity of the current update interval towards a non-zero value (or from a value close to 0 to a second value more distant from 0 than the value close to 0).
In accordance to examples, there is provided a fourth filter unit configured to filter the information input signal, at least at the initial subinterval, using parameters obtained by interpolating the parameters associated to the current update interval and the parameters associated to the previous update interval.
In accordance to examples, there is provided a selector configured to check if the distance between parameters for the current update interval is within a second threshold, so as to filter the information input signal at least in the initial subinterval using the fourth filter unit.
The system may be further configured to actively set the second threshold on the basis of values associated to the signal.
In accordance to examples, the system may be configured to:
In accordance to examples, the system may be configured to:
In accordance to examples, the first filter unit is to provide the first filter output signal in the form of
where
where
In accordance to examples, the first filter unit is configured to provide the first filter output signal in the form of
and the second filter unit is configured to provide the filtered output signal (13) in the form of
wherein
is the length of the initial subinterval, {circumflex over (x)}(n) is the information input signal, is the intermediate signal, (n) is the filtered output signal, n is an instant, pintmem and pfrmem are respectively based on the integer part and fractional part of the pitch lag associated to the preceding update interval, pint and pfr are respectively based on the integer part and fractional part of the pitch lag associated to the current update interval, cnum(k) is a coefficient based on the gain value for the current update interval, cden(k, pfr) is a coefficient based on the gain value for the determined update interval and on the fractional part of the pitch, cnummem(k) is a coefficient based on the gain value for preceding update interval, cdenmem(k, pfrmem) is a coefficient based on the gain value for preceding update interval and on the fractional part of the pitch, Lden and Lnum are fixed and/or based on the sampling rate of the input signal.
In accordance to examples, the time length of the initial subinterval is between the 5% and the 40% of the time length of the current update interval.
In accordance to examples, the system is configured to check the gain gk of the current kth frame and the gain gk−1 of the previous (k−1)th frame, so that:
In accordance to examples, the system comprises an encoder side and a decoder side, wherein at least one of the first, second, third, and/or fourth filter units is at the decoder side.
In accordance to examples, the system comprises an encoder side and a decoder side, wherein at least one of the first, second, third, and/or fourth filter units is at the encoder side.
In accordance to examples, the system comprises a converter for converting a first representation of the information signal into a second representation of the information signal.
In accordance to examples, the system is configured to:
In accordance to examples, there is provided a method for filtering an information input signal, including different update intervals, according to parameters corresponding to the update intervals, to obtain a filtered output signal, the method comprising:
In accordance to examples, there is provide a non-transitory storage unit storing instructions which, when executed by a processor, cause the processor to perform one of the methods above or below and/or to implement a system as above or below and/or a component of such a system.
The information input signal may be, for example, an audio signal.
In some examples, the interval is the entire frame. In other examples, the interval is smaller than the frame.
Accordingly, the technique above may be only performed for an initial interval or final interval of the determined frame: in the subsequent interval, the parameters for the determined frame may be used, further reducing the computation complexity.
Accordingly, there is no modification of the output in the subsequent interval.
In examples, the interpolation of coefficients may be used instead of the technique discussed above. This may be controlled on the basis of a selection, to better adapt the filtering operations to the signal conditions.
In examples, it is possible to set the second threshold to the minimum between the integer part of the pitch lag at the determined frame and the integer part of the pitch lag at the previous frame (or the subsequent frame), and/or to set the second threshold to the maximum between the gain at the determined frame and the gain at the previous frame or the subsequent frame. Accordingly, it is possible to use the fourth filter unit when the distance between the gain at the determined frame and the gain at the previous frame is less than the second threshold and/or to use the first and second filter units when the distance between the integer part of the pitch lag at the determined frame and the integer part of the pitch lag at the previous frame or subsequent frame is less than the second threshold.
In examples, it is possible to define a condition associated to the gains of the signal at the determined frame and at the preceding or subsequent frame. Accordingly, it is possible to use the fourth filter unit when at least one of the gains of the signal at the determined frame and at the preceding or subsequent frame is zero. In examples, it is possible to use the first and second filter units when both the gains of the signal at the determined frame and at the preceding or subsequent frame are different from zero.
Accordingly, the parameters associated to the previous frame or the subsequent frame and the parameters associated to the determined frame are modified (e.g., sample-by-sample) to perform a graceful filter which avoids and/or reduces the discontinuity between the frames.
Accordingly, the input signals of the first and the second filter units may be easily and coherently damped.
In examples, the first filter unit is configured to provide the first filter output signal (13) in the form of
and the second filter unit (14) is configured to provide the filtered output signal (13) in the form of
wherein
is the length of the initial subinterval, {circumflex over (x)}(n) is the information input signal (11), is the intermediate signal, (n) is the filtered output signal (15), n is an instant, pintmem and pfrmem are respectively based on the integer part and fractional part of the pitch lag associated to the preceding frame, pint and pfr are respectively based on the integer part and fractional part of the pitch lag associated to the determined frame, cnum(k) and cden(k, pfr) are coefficients based on the gain value for the determined frame, cnummem(k) and cden(k, pfr) are coefficients based on the gain value for preceding frame, Lden and Lnum are fixed and/or based on the sampling rate of the input signal.
In examples, in the lower filtering status the parameters of a same filter are modified so as to have a value closer to 0 than in the higher filtering status. For example, the parameters may be scaled by scaling factors which gradually vary.
The present invention may be used, for example, for Long Term Post Filtering (LTPF). It is a tool for transform-based audio coding that helps at reducing the inter-harmonic noise. It relies on a post-filter that is applied on the time-domain signal after transform decoding. This post-filter is essentially an infinite impulse response (IIR) filter with a comb-like frequency response controlled by one parameter (pitch lag) or two parameters (pitch lag and gain). For better robustness, the post-filter parameters (a pitch lag and a gain per frame) are estimated at the encoder-side and encoded in a bitstream when the gain is non-zero.
Embodiments of the present invention will be detailed subsequently referring to the appended drawings, in which:
a, 2, 3, 4, 5a and 5b show systems according to examples;
a show methods according to examples;
In the approach presented here, filtering parameters (also referred to as filter parameters or parameters) are in general different for different update intervals. Two consecutive update intervals may have different parameters. In examples, the update interval may be signal adaptive and its length may be changed or shifted over time.
In some examples, the signal is divided into frames. For example, the frame may be associated to a fixed number of samples and/or a fixed time length (e.g., 20 ms). When transmitted or stored, the frame may be associated to particular parameters, e.g., filtering parameters. Within the same frame, the parameters may be in general constant.
In some examples, the update interval may correspond to a frame. Therefore, when transmitted or stored, the signal may comprise parameters (e.g., filtering parameters) associated to the frame (which is also the update interval), e.g., by data encoded in association with a particular frame.
In some examples, the update interval does not correspond to a pre-defined frame. When transmitted or stored, the signal may comprise parameters associated to the update interval even if they are signalled as being associated to the frame. In some cases, a new frame (with the new parameters) anticipates the new update interval, and the old update interval is still associated to the old parameters. When the update intervals are adaptive (e.g., their length is determined on the fly, on the basis of the characteristics of the signal, for example), of course they in general do not correspond with fixed-length frames. Hence, it is possible to associate parameters to a particular update interval (which in some cases, is associated to a frame).
The signal 11 may be an information signal, such as an audio signal. A digital representation of an information signal may be used. The signal may comprise a succession of samples, each acquired at a different time instant (e.g., discrete time instants). The signal may be divided into different frames and/or update intervals (e.g., a succession of samples). Each frame and/or update interval may be constituted by a plurality of samples (e.g., 1, 2, . . . , n, . . . ), e.g., each associated to a discrete time instant). Each frame and/or update interval may be subdivided into an initial subinterval and a subsequent subinterval (the subinterval may be a proper subinterval, as its length may be smaller than the length of the update interval). In general, samples of the initial subinterval precede (are before) the samples of the subsequent subinterval of the same frame and/or update interval. A determined (current, presently processed) frame precedes a subsequent frame and/or update interval and is preceded by a preceding (previous, old) frame. A determined (current, present) update interval precedes a subsequent update interval and is preceded by a preceding (previous, old) update interval. The initial subinterval of an update interval may have a time length which is between 1% and 99%, more in particular 20% and 30% (e.g., one quarter), of the time length of the current update interval. The subsequent subinterval may have a time length which is between 1% and 99%, more in particular 70% and 80% (e.g., three quarters), of the time length of the current update interval.
For some examples, it is referred to “at least one initial subinterval” of the update interval, implying that also the 100% of the update interval may be covered in some examples.
The system 10 may filter the information input signal 11 (x) according to parameters varying with the update intervals (e.g., parameters which, in general, vary in time according to the particular update interval they are associated to, e.g., by virtue of the parameters being encoded and associated to a particular frame). The system 10 may provide a filtered output signal 15 (y or y[n]). The system 10 may provide post-filtered audio signal (e.g., a time domain, TD, representation of a post-filtered audio signal).
The system 10 may comprise a first filter unit 12 to filter the information input signal 11 (first filter input signal). The first filter unit 12 may operate with samples of at least an initial subinterval of a current update interval (a present update interval, such as an update interval which is currently processed), to obtain a first filter output signal which is an intermediate signal 13 (y′ or y′ [n]). (The at least an initial subinterval may refer, in examples, to only the initial subinterval or, in examples, to an interval bigger than the initial subinterval, such as the update interval). The intermediate signal 13 may be obtained using parameters associated to the preceding update interval (e.g., the update interval preceding the current update interval). The first filter unit 12 is configured to change (e.g., gradually, e.g., by fading, e.g., monotonically) the parameters along at least the initial subinterval from a higher-filtering status to a lower-filtering status. For example, the parameters may be less reduced and/or less damped (hence implying a higher-filtering status) in correspondence to the first samples in the initial subinterval. The parameters may be more reduced and/or more damped (hence implying a lower-filtering status) in correspondence to the last samples in the at least the initial subinterval (where the parameters may be processed to be closer to 0).
The system 10 may comprise a second filter unit 14. The second filter unit 14 may have a second filter input signal and a second filter output signal. The second filter unit 14 may filter the intermediate signal 13 (which is the second filter input signal). The second filter unit 14 may operate with samples of at least the initial subinterval of the current update interval (e.g., the same at least initial subinterval on which the filter unit 12 operates). The second filter unit 14 may filter the intermediate signal according to parameters associated to the current update interval. The second filter unit 14 may be configured to change (e.g., gradually, e.g., by fading, e.g., monotonically) the parameters along at least the initial subinterval from a lower-filtering status to a higher-filtering status. For example, the parameters may be more reduced and/or more damped (hence implying a lower-filtering status) for the first samples in the at least an initial subinterval (where the parameters may be processed to be 0 or close to 0). The parameters may be less reduced and/or less damped (hence implying a higher-filtering status) for the last samples in the at least an initial subinterval.
The lower-filtering status may be such that the impulse response is closer to the impulse response of the identity filter than the impulse response in the higher-filtering status. In some examples, the lower-filtering status and the higher-filtering status may be such that the lower-filtering status implies an increased energy of the impulse response with respect to the energy of the impulse response of the higher-filtering status. In some examples, the lower-filtering status and the higher-filtering status may be such that the lower-filtering status implies a reduced energy of the impulse response with respect to the energy of the impulse response of the higher-filtering status. When the first filter unit 12 operates at the lower-filtering status, it implies an impulse response which is different from (e.g., lower or higher than) the energy implied by the same first filter unit 12 when it operates at higher filtering-status. The same applies to the second filter unit 14. In the lower-filtering status the parameters may be closer to 0 than in the higher-filtering status.
In examples, the first filtering unit 12 may change from the higher-filtering status towards a lower-filtering status, and/or the second filtering unit 14 may change from the lower-filtering status to a higher-filtering status.
In general terms, when changing from the higher-filtering status to the lower-filtering status, the parameters may be modified (e.g., gradually and/or monotonically and/or by fading and/or by damping). In the lower-filtering status, the parameters may be processed to be 0, or may be processed to be closer to 0 than the parameters in the higher-filtering status. For example, in the higher-filtering status the parameters may be less reduced and/or less damped and/or more distant from 0 than in the lower-filtering status.
In lower-filtering status, a filtering effect may be reduced (e.g., the same effect of or an effect close to the effect of the identity filter may be obtained). In higher-filtering status, a strong filtering effect may be obtained.
In higher-filtering status, the input signal is strongly filtered. In lower-filtering status the input signal is not strongly filtered. In examples, the more the parameters are dampened, the lower the filtering status.
The first and/or second filter units 12, 14 may be configured to damp and/or reduce the parameters so that in the higher-filtering status the reduced and/or damped parameters are less damped (e.g., more distant from 0) and/or less reduced than in the lower-filtering status (e.g., close to 0).
The first filtering unit 12 may be configured to scale (e.g., downscale) the parameters using a first damping factor or scaling factor, and/or the second filtering unit 14 may be configured to scale (e.g., downscale) the parameters using a second damping factor or scaling factor. The first damping factor or scaling factor and/or second damping factor or scaling factor may be closer to 0 in the lower-filtering status than in the higher-filtering status.
The first and/or second filter unit 12, 14 may be configured to damp and/or reduce and/or downscale the parameters so that the parameters are damped and/or reduced by first damping factors and/or scaling factors which are closer to 0 in the lower-filtering status than in the higher-filtering status.
The first filter unit 12 may be configured to change the parameters from the higher-filtering status to the lower-filtering status by gradually damping and/or reducing the parameters associated to the preceding update interval, and/or the second filter unit 14 may be configured to change the parameters by gradually increasing and/or enlarging from parameters close to 0 to the parameters associated with the current update interval.
The first filter unit 12 may be configured to change the parameters from the higher-filtering status to the lower-filtering status by using decreasing damping factors and/or scaling factors (e.g., by using damping factors and/or scaling factors which, e.g. gradually, go towards 0). The second filter unit 14 may be configured to change (e.g., gradually) the parameters from the lower-filtering status to the higher-filtering status by enlarging (e.g., gradually) damping factors and/or scaling factors (e.g., by using damping factors and/or scaling factors which, e.g. gradually, depart from 0).
The first filter unit 12 may be configured to change the parameters from the higher-filtering status to the lower-filtering status by applying a decreasing windowing function to the parameters (e.g., a windowing function which goes towards 0, e.g. gradually), and/or the second filter unit 14 may be configured to change the parameters from the lower-filtering status to the higher-filtering status by applying an enlarging windowing function to the parameters (e.g., a windowing function which departs from 0, e.g. gradually).
The parameters may be provided, for example, as control data in the input signal 11. The filters may be, for example, Liner Time Invariant (LTI) Infinite impulse response (IIR) filters (e.g., for LTP postfiltering). The parameters may be or comprise, for example, gain information and/or pitch information.
In particular, the first and second filter units 12 and 14 may be part of an LTP filter (or postfilter), e.g., at a decoder (e.g., an audio decoder). For example, the parameters may be obtained from harmonicity-based measurements. For example, the parameters may be based on a pitch lag T, on an integer part Tint of the pitch lag, on a fractional part Tfr of a pitch lag, and/or on a gain g, as obtained, for example, at an audio decoder. The parameters may be associated, for example, to an update interval (which in some examples is a frame of a fixed length or may have an adaptive length).
Each of the first and second filter unit may therefore be associated to particular parameters and/or to a particular transfer function. In particular, the transfer function may be of the type having a numerator and a denominator, wherein the numerator comprises a gain value indicated by the gain information, and wherein the denominator comprises an integer part of a pitch lag indicated by the pitch lag information and a multi-tap filter depending on a fractional part of the pitch lag. For example, a transfer function may be:
whose parameters may be determined from parameters estimated at the encoder-side and/or decoded from a bitstream. g may be the decoded gain, Tint and Tfr integer and fractional part of a decoded pitch-lag, α and β two scalars that weight the gain, and B(z,Tfr) a low-pass FIR filter whose coefficients depend on the fractional part of the decoded pitch-lag. The order and the coefficients of B(z, Tfr) can also depend on the bitrate and the output sampling rate. A different frequency response can be designed and tuned for each combination of bitrate and output sampling rate. An example of the transfer function 130 is provided in
Notably, parameters and the transfer function may change for each update interval (which may be one frame according to which the original signal may be subdivided). Therefore, the kth update interval may be associated to an Hk(z) transfer function and parameters such as Tint,k, Tfr,k, gk, while the (k−1)th update interval may be associated to an Hk−1(z) transfer function and parameters such as Tint,k−1, Tfr,k−1, gk−1. Therefore, at the kth frame or update interval, the first filter unit 12 may operate using the old parameters Tint,k−1, while the second filter unit 14 may operated using the updated parameters Tint,k, Tfr,k, gk. This process may be performed for at least the initial subinterval (or, in some examples, for the 100%) of the kth updated interval.
More in general, each or at least one of the filters at elements 10 and 12 may be an LTI IIR filter (which may also be represented as H(z)) represented in the form:
or using a linear difference equation:
The coefficients bi and aj may be filter parameters. Notably, the coefficients bi and aj may vary, in general, for different frames and/or update intervals.
It has been noted that the filtered signal 15 results in a smooth transition between the preceding (k−1)th update interval and the current kth update interval. Discontinuities between different update intervals are therefore avoided and/or reduced.
Moreover, the processes to perform the filtering functions have a particularly reduced complexity. This kind of system may be used, for example, for a long term post filter (LTPF).
The first and second filter units 12 and 14 may be considered as being connected in series (or cascade, or “one after other”).
The parameters stored in the storage element 21b (which had been prepared for the preceding (k−1)th frame) may be applied to a first filtering portion 22 (which may implement functions of the first filter unit 12) for at least the initial subinterval of the current kth frame or update interval. The parameters stored in the storage element 21a (prepared for the current kth frame) may be applied to a second filtering portion 24 (which may implement functions of the second filter unit 14) for at least the initial subinterval of the current kth frame or update interval.
However, the parameters stored in the storage element 21b (originally prepared for the preceding (k−1)th update interval) may be changed, e.g., by a block 23b, to cause the first filtering portion 22 to move (e.g., gradually, e.g., monotonically) from a higher-filtering status to a lower-filtering status within the initial subinterval of the current kth update interval. For example, the parameters may be scaled (e.g., downscaled, damped) using a damping factor which goes to 0, e.g., a decreasing damping factor (e.g., damping by multiplication). Block 23b may apply a windowing function. The windowing function may change towards 0 and/or decrease in at least the initial subinterval (e.g., from a highest positive value, e.g. 1, at the first sample of the kth current update interval, to a lowest value, e.g. 0, at the last sample of the at least initial subinterval).
For example, the reduction of the filtering effect (e.g., from the higher-filtering status to the lower-filtering status) may be obtained by gradually reducing the damping factor (e.g., from a maximum value, e.g. 1, to a minimum or negligible value, e.g. 0). When the damping factor is negligible (or 0), the parameters are modified to a negligible (or 0) value and the output 13 (y′) of the first filtering portion 22 is almost similar to (or the same of) the information input signal 11 (x).
The parameters stored in the storage element 21a (associated to the current kth frame or update interval) may be changed by a block 23a to cause the second filtering portion 24 to move (e.g., gradually, e.g., monotonically) from a lower-filtering status to a higher-filtering status within at least the initial subinterval of the current kth frame or update interval. For example, the parameters may be scaled (e.g., downscaled, damped) by an increasing damping factor (e.g., by multiplication), which may enlarge from 0 or a value close to 0 to a value more distant to 0. Block 23a may apply a windowing function. The windowing function may increase or otherwise change (e.g., from a lower-filtering status towards a higher-filtering status) from an initial time instant of the initial subinterval to a final time instant of the initial subinterval (e.g., from a value close to 0 to a value more distant from 0, and/or from a lowest value at the first sample of the initial subinterval to a positive highest value, e.g. 1, at the last sample of the initial subinterval of the kth current frame or update interval or at the last sample of the frame or update interval).
For example, the increase of the filtering effect (e.g., from a lower-filtering status towards a higher-filtering status) may be obtained by gradually enlarging from 0 or a value close to 0 to a value more distant from 0, e.g., by enlarging or increasing (e.g., monotonically or strictly monotonically) the damping factor (e.g., from a value close to 0 to a value more distant from 0, and/or from a minimum or negligible value, e.g. 0, to a maximum value, e.g. 1). When the damping factor is negligible (or 0), the parameters are modified to a negligible (or 0) value and the output of the second filtering portion 24 is almost similar to (or the same of) its input (which is the intermediate signal y′ or 13).
In examples, the parameters for the first and/or second filter units 12, 14 may be modified during the succession of the samples of at least the initial subinterval, by factors (e.g., the scaling factors of blocks 23a, 23b) which are complementary with each other to a constant value (e.g., a positive value, such as 1), so that their sum is constant. In examples, the variation of the factors may be linear (e.g., describable with a 1st-degree equation).
In examples, the first filtering portion 22 and the second filtering portion 24 may share the same hardware/software structure, whose output changes only by virtue of the input of different inputs and/or parameters and/or factors.
Notably, the parameters may be associated, in the storage elements 21a and 21b, to a particular length of the subinterval or to the 100% of update interval. Therefore, it may be known which percentage (or in any case which portion) of the update interval or frame is to be smoothed. In some cases, a user selection (e.g., set during a configuration session) may define the length of the at least a subinterval.
A selector 32 may monitor the information input signal 11 and change between the use of filtering section 10 for the initial subinterval and use of the filtering section 31 for the subsequent interval. Notably, the filtering section 31 for the subsequent interval (third filter) may be made of structural and/or functional blocks used for the first and/or second filters 12 and 14.
In addition or in alternative, the selector 32 may decide if, during the initial subinterval, the filtering section 10 (with the first and second filters 12, 14) is to be used for the initial subinterval or if the filtering section 31 is to be used for at least the initial subinterval (besides using the filtering section 31 for the subsequent subinterval, in case). The decision may be based on particular conditions, which may, for example, be based on the parameters (e.g., on comparisons between parameters' values associated to the current update interval and parameters' values associated to the previous update interval). Some examples of this decision are provided in following passages.
The filters at elements 22, 24, and 31 may be LTP postfilter as discussed above or more in general LTI IIR filters (which may also be represented as H(z)) and can be presented in the form:
or using a linear difference equation:
The coefficients bI and aj may be filter parameters.
We consider a time varying filter that is in update interval k equal to the LTI IIR filter Hk (which may be an LTP postfilter):
where T relates to the update interval (and may consist in a discrete number of samples) and k is its index, k−1 being associated to the preceding (k−1)th update interval. The third filtering section 31 may be of this type. P and Q may be specific to the filter (e.g., feedforward filter order and feedback filter order, respectively). Q may be for example related to the maximum possible value for Tint.
The first filter elements 12 and/or 22 may output an intermediate signal y′ in the form:
where sk−1[n] changes towards 0 when n increases for kT−Tl≤n<(k+1)T
where Tl relates to the initial subinterval.
The second filter elements 14 and/or 24 may output a filtered output signal y in the form:
Notably, the filtered output value y[n] has an input based on the intermediate filter output y′[n] provided by the first filter.
In some examples, the first, second and third filter units 51, 52, and 53 are obtained as in
The first filter unit 51 may implement the first filter unit 12 and/or the first filtering portion 22 and be used for filtering the initial subinterval of a current kth update interval with parameters (originally associated to the preceding (k−1)th update interval) which move from a higher-filtering status to a lower-filtering status. The second filter unit 52 may implement the second filter unit 14 and/or the second filtering portion 24 and be used for filtering the initial subinterval of the current update interval with parameters (actually associated to the current kth update interval) which move from a lower-filtering status to a higher-filtering status. The third filter unit 53 may implement the third filtering section 31 for the subsequent interval of the current update interval.
The fourth filter unit 54 may implement a filter for filtering the initial subinterval of a current kth update interval with parameters obtained by interpolating the parameters for the preceding (k−1)th update interval and the parameters for the current kth update interval.
The selector 55 may operate:
With reference to the decision for the initial subinterval, the selector 55 may operate, for example, by using a first and/or one second threshold and/or conditions (e.g., conditions on the gain of the signal in subsequent update intervals). For example, the selector 55 may choose:
The second threshold may be actively set, for example, as the minimum between the integer part of the pitch lag at the current update interval and the integer part of the pitch lag at the previous update interval.
In addition or in alternative, it is also possible to use a condition based on the gain of the signal at the previous update interval, so as to use the fourth filter unit 54 when the distance between the gain at the determined (current) update interval and the gain at the previous update interval is less than the first and/or second threshold. Accordingly, the second threshold as well as the condition on the gain may be modified in real time, e.g., to obtain a better filtering behaviour.
At step S61, an input information signal (e.g., x, 11) may be obtained for a (determined) current kth update interval together with parameters associated to the current kth update interval.
Then, first filter operations 61 may be performed (e.g., by any of the components 12, 22, 51) for the initial subinterval of the current kth update interval, by cycling the value (index) n among a plurality of samples of the initial subinterval. At S611, the first sample of the initial subinterval may be taken into consideration by initializing the variable n (as n=0). At S612, an intermediate value y′ [n] is obtained using parameters associated to the preceding (k−1)th update interval. At 613 (“last sample of the initial SIT”), it is checked whether n has reached the value associated to the last sample of the initial subinterval (e.g., it is checked whether n is the last index of the initial subinterval). If n has reached the last value (index) of the initial subinterval, the first filter operations 61 are concluded and the second filter operations 62 are initiated. Otherwise, at S614 (“change the parameters along the initial SI of the current kth UI from a higher-filtering status to a lower-filtering status”), the parameters are changed so as to move from a higher-filtering status to a lower filtering status (e.g., by reducing the factors at block 23b of
The second filter operations 62 may be performed (e.g., by any of the components 14, 24, 52) for the initial subinterval of the current kth update interval, by cycling the value (index) n among a plurality of samples of the initial subinterval. At S621, the first sample (of r n=0) of the initial subinterval may be taken into consideration by initializing the variable n to 0. At S622, a filtered output value y[n] is obtained using parameters associated to the current kth update interval. At S623 (“last sample of the initial SI?”), it is checked whether the index n has reached the value associated to the last sample of the initial subinterval (e.g., it is checked whether n is the last index of the initial subinterval). If the index n has reached the last value of the initial subinterval, the second filter operations 62 are concluded and the third filter operations 63 are initiated. Otherwise, at S624 (“change the parameters along the initial SI of the current kth UI from a lower-filtering status to a higher-filtering status”), the parameters are changed so as to move from a lower-filtering status to a higher filtering status (e.g., enlarging the values from 0 or a value close to 0 to a value more distant from 0, e.g. by increasing the values of the factors at block 23a). At S625, a new sample is taken into consideration (n=n+1) and step S612 is invoked.
The third filter operations 63 are performed (e.g., by any of the components 31, 24, 53) for the subsequent (e.g., final) subinterval of the current kth update interval, by cycling the value (index) n among a plurality of samples of the subsequent interval. At S632, a filtered output value y[n] is obtained using parameters associated to the current kth update interval. At 633, it is determined whether the index n has reached the value associated to the last sample of the current kth update interval (e.g., it is checked whether if n is the last index of the update interval). If n has reached the last value of the update interval, the third filter operations 63 are concluded. Otherwise, at S635, a new sample is taken into consideration (e.g., by updating the index by n=n+1) and step S632 is repeated with the new index n.
At the end of the method, all the values y[n] of the filtered output signal have been obtained. The value of the index k may be updated at S64. Step S61 may be invoked again for a (k+1)th update interval.
It is noted that it is not strictly needed for step S61 to be after any of steps S611-S63. In some examples, the information input signal at a current kth update interval may also have been obtained during the processing of the operations 61-63. In examples, the information input signal at a current kth update interval may have been obtained a priori.
In the examples in which the first and second filtering operations are performed for the 100% of the update interval, the third filtering operations 63 are not performed.
At step S72 (“Are parameters for kth UI and those for the (k−1)th UI the same or is their distance within a first threshold?”), a first comparison is performed: the parameters for the current kth update interval are compared to the parameters for the preceding (k−1)th update interval (e.g., by selector 55). If the parameters are the same or a distance between a parameter of the kth update interval and a parameter of the (k−1)th update interval is within a first (low) threshold, at S73 (“Perform third filtering operations 63 without damping coefficients along the whole kth UI”) third filtering operations 63 are performed using, for the current kth update interval (both the initial subinterval and the subsequent subinterval), the parameters associated to the current kth update interval (as these parameters are the same or almost the same for the two consecutive update intervals, it is not needed to damp or smooth them: therefore, it is possible to apply the third filter 31 or 53 to the whole update interval, without distinguishing the initial subinterval from the subsequent subinterval). Subsequently, at S77, the value of k is updated and the new (k+1)th update interval may now be filtered. This decision may be taken, for example, by any of the selectors shown in
If the distance between the parameters is over the first threshold, a second check may be performed: for example, at S74 (“Is the distance between parameters for kth UI and those for the (k−1)th UI within a second thresholds? gk=0 or gk−1=0?”) the parameters for the current kth update interval are compared to the parameters for the preceding (k−1)th update interval (e.g., at selector 55). The parameters checked at S74 may also be different from those checked at S72. If the distance between the parameters for the current kth update interval and those for the preceding (k−1)th update interval is within a second threshold (which may be higher than the first threshold), at S75 (“Perform fourth filtering operations interpolating the parameters”) a fourth filtering operation (e.g., by the fourth filter unit 54) may be operated at the initial subinterval of the current kth update interval. In this case, the parameters to be applied to the initial subinterval may be obtained by interpolating (and/or averaging) the parameters for the preceding (k−1)th update interval and those for the current kth update interval. After that, a third filtering 63 may be operated at S75′ (“3rd filtering on the subsequent SI”) for the subsequent interval. Subsequently, at S77 (“k=k+1”), k is updated and the new (k+1)th update interval may now be filtered.
If at S74 it is verified that the distance between the parameters for the current kth update interval and those for the preceding (k−1)th update interval is over the second threshold, at step S76 (“Perform first and second filtering operations 61 and 62 on the initial SI of the kth UI and perform the third filtering operations 63 on the subsequent SI of the kth UI”) the first, second, and third filtering operations 61, 62, 63 may be operated (e.g., by the elements 12, 14, 22, 24, 31, 51, 52, and/or 53): therefore, the first subinterval of the current update interval may be first filtered by any of 12, 22, 51, and/or 61 and then by any of 14, 24, 52 and/or 62, and the subsequent subinterval is filtered using any of 31, 53, and/or 63. Subsequently, at S77, k is updated and the new (k+1)th update interval may be filtered.
In examples, at least another condition may be set, in addition or in alternative (in some cases, at step S74). In some examples it is provided that, in order to initiate step S76, the condition has to be verified. In examples, the condition at S74 may comprise at least one of:
In some examples, step S74 may be performed before step S72.
In some examples, only one comparison is performed. Therefore, there are examples, which do not have the steps S72 and S73 and there are examples which do not have the steps S74 and S75.
In some examples, only one comparison is performed. Therefore, there are examples, which do not have the steps S72 and S73 and there are examples which do not have the steps S74 and S75.
In examples, the first and/or second threshold(s) is(are) (or other conditions on the parameters, e.g., on the gain) for the first and/or second steps S72, S74, may be obtained in real time, e.g., from values of the parameters.
In some examples relating to an LTP filter, a second threshold may be a pitch lag distance threshold defined so as to use the fourth filter unit 54 (and/or interpolation) when the distance (e.g., modulo difference) between parameters of the current kth update interval and parameters of the preceding (k−1)th update interval is less than the minimum between the integer part of the pitch lag associated to the kth update interval and the integer part of the pitch lag associated to the (k−1)th update interval. Therefore, the second comparison at step S74 may be:
|Tint,k−Tint,k−1|<min(Tint,k,Tint,k−1)
where Tint,k and Tint,k−1 are the integer parts of the pitch lag at update intervals k and k−1, respectively. Accordingly, the second comparison at S74 may, in some examples, check if both the two following conditions apply:
Tint,k<2Tint,k−1
Tint,k>½Tint,k−1,
Hence, the second comparison at S74 be such that, in order to perform the filtering with the fourth filter unit 54, it is needed that the integer pitch is not increasing (from the (k−1)th update interval to the kth update interval) for more than 100% nor decreasing for more than 50%: i.e., that there is no pitch doubling or halving between the previous update interval and the current update interval.
Analogously, the second comparison at S74 be such that the first and second filtering (e.g., using any of the elements 12, 14, 51, 52, etc.) may be triggered when it is verified that
∥Tint,k−Tint,k−1|≥min(Tint,k,Tint,k−1)
which is how to say that at least one of the two following conditions is verified:
Tint,k≥2Tint,k−1
Tint,k≥½Tint,k−1,
Hence, the first and second filter units may be activated (e.g., by the selector 55) when the integer part of the pitch lag at the current kth update interval varies extremely with respect to the integer part of the pitch lag at the previous (k−1)th update interval.
Another condition may be set. For example, step S74 may provide that, in order to perform the first and second filtering operations 61 and 62 (e.g., with the first and second filter unit 12 and 14), at least one the following conditions is to be verified:
gk=0
gk−1=0
When gk−1=0 an effect may be obtained which is the same of skipping the first filter. When gk=0 an effect may be obtained which is the same of skipping the second filter.
A condition may be that, if both the following conditions is verified:
gk≠0
gk−1≠0
in that case, the difference between the integer parts of the pitch lags (of the current and the previous frames) is checked at S74 (e.g., as discussed above).
In this example it may be seen that:
Notably, (2) increases quality as compared to (1) and (3). (2) has lower complexity then conventional technology.
In some examples, the fourth filter unit 54 is not used and, therefore, the second verification at S74 are not performed and only a comparison with an extremely small threshold (or a comparison on the exact value) may be performed.
Other examples (e.g., non-LTP filters) may be based on other parameters. However, the present method performs for any IIR filter.
In general terms, if there is a parameter difference, then the first and the second filter are used. Otherwise the third filter is used in the initial subinterval.
The present solution may be used for example when LPC parameters change in a Code-Excited Linear Prediction (CELP) codec. That way discontinuities, that exist even after subframe based line spectral frequencies (LSF) interpolation, in CELP may be handled. Another example where this technique can be used is filtering for formant enhancement in CELP codecs.
The apparatus 80 may comprise, for example, an optional prefilter 81a which may be operated, for example, as any of the systems 10, 30, 40, and 50, and which may perform any of the methods above and/or below. In other examples, the apparatus 80 may be avoided.
The apparatus 80 may comprise a windowing block 82. The apparatus 80 may comprise a domain converter 83 which may convert TD (time domain) representation of the information signal into an FD (frequency domain) representation) of the information signal. For example, the converter 83 may be a Modified Discrete Cosine Transform (MDCT) block or Modified Discrete Sine Transform (MDST) block 83 (or a block associated to another lapped transformation), downstream to the windowing block 82, e.g., for a conversion into the frequency domain (FD). The apparatus 80 may comprise a Temporal Noise Shaping (TNS) block 84 to control temporal shape of quantization noise within a window of a transform. The apparatus 80 may comprise a frequency domain noise shaper (FDNS) block 85. The apparatus 80 may comprise a block 87 for obtaining FDNS parameters downstream to the windowing block 82. The apparatus 80 may comprise a quantization block 86 which may also include an entropy coder. The apparatus 80 may comprise a TD/TD (Time Domain Transient Detector) block 88.
The apparatus 80 may comprise an LTP block 89 for obtaining LTP parameters (e.g., harmonicity information, gain information, pitch information such as pitch lag, etc.). At least some of the parameters obtained by the LTP block 89 may be used by devices 10, 30, 40, 50, and/or by methods 60 and/or 70 for each kth update interval of the signal. For example, the parameters for the kth update interval may be the pitch lag and the gain (which is some cases is optional and may be estimated at the decoder side) associated to the signal at the kth update interval. The operations of the LTP block 89 may be independent from the operations of the prefilter 81a: the prefilter 81a may also not to be present, but the LTP block 89 may operate correctly by providing parameters to the decoder side.
The signal may be encoded by a bitstream writer 89′ and may be stored in a memory and/or transmitted to a decoder (e.g., wirelessly, e.g., using a standard protocol such as Bluetooth).
The LTP postfilter 91 may obtain the digital representation 11a (x) of the signal to be filtered from the block 97, for example. The LTP postfilter 91 may obtain the coefficients 11b from the bitstream, for example.
At least one of the systems 80 and/or 90 may perform an analysis operation (e.g., at the block 89) for obtaining the parameters associated to the kth and/or (k+1)th update interval.
A digital input information signal x[n] (11a) may be filtered with a time varying filter, whose parameters change at update interval T (e.g., the current kth update interval), producing the filtered output signal y[n]. The update interval T can also be signal adaptive and thus T can change over time. We may consider filters that can be represented as a liner time invariant (LTI) Infinite impulse response (IIR) filter during a time interval T. Time interval T may be the frame (e.g., the current filter discussed above) or a sub-frame of the digital signal. We may use the term frame and/or update interval for both a frame and a sub-frame.
The LTI IIR filter (which may also be represented as H(z)) can be presented in the form:
or using a linear difference equation:
The coefficients bi and aj are filter parameters (e.g., parameters to be stored in memory elements 21a and 21b, for example). The LTI IIR filter may be uniquely defined by its coefficients (parameters).
We consider a time varying filter that is in update interval k equal to the LTI IIR filter Hk:
Instead of instantly changing the filter parameters at border between update intervals k−1 and k, we process a portion at the beginning of the update interval k with new set of time varying filters:
y′ [n] may be the intermediate output of the first filter unit 12 of
An example of sk [n] and sk−1 [n] is provided in
In
The system as described in [3] may be used as a basis, including Time Domain Transient Detector (TD TD), Windowing, MDCT, TNS, FDNS, OLA, Quantization, Arithmetic Coder and LTP postfiltering (see blocks 82-89 and 92-97 above). The modification may be realized in the LTP postfilter and thus the details of the LTP will be described.
In general terms, the LTP may be seen as a harmonic post-filter for filtering a representation of an information signal. It may be based on a transfer function comprising a numerator and a denominator, wherein the numerator may comprise a gain value indicated by the gain information, and the denominator may comprise an integer part of a pitch lag indicated by the pitch lag information and a multi-tap filter depending on a fractional part of the pitch lag. In examples, the transfer function of the post-filter comprises, in the numerator, a further multi-tap FIR filter for a zero fractional part of the pitch lag. In examples, the denominator comprises a product between the multi-tap filter and the gain value. In examples, the numerator may further comprise a product of a first scalar value and a second scalar value, wherein the denominator comprises the second scalar value and not the first scalar value, wherein the first and second scalar values are predetermined and have values greater than or equal to 0 and lower than or equal to 1, and wherein the second scalar value may be lower than the first scalar value.
An example of LTP filter is provided hereinbelow (see also
At the encoder side (e.g. apparatus 80), LTP parameter calculation may be used as described in [14]:
1. Pitch Estimation
a. First Estimation of the Integer Part of the Pitch Lag
b. Refinement of the Integer Part of the Pitch Lag
c. Estimation of the Fractional Part of the Pitch Lag
2. Gain Estimation and Quantization
The gain may be estimated on the input audio signal at the core encoder sampling rate, but it can also be any audio signal like the LPC weighted audio signal. This signal is noted y[n] and can be the same or different than x[n].
The prediction yP[n] of y[n] may be first found by filtering y[n] with the following filter
P(z)=B(z,Tfr)z−T
with Tint the integer part of the pitch lag (estimated in 1.b.) and B (z, Tfr) a low-pass FIR filter whose coefficients depend on the fractional part of the pitch lag Tfr (estimated in 1.c.).
One example of B(z) when the pitch lag resolution is ¼:
Tfr=0/4B(z)=0.0000z−2+0.2325z−1+0.5349z0+0.2325z1
Tfr=¼B(z)=0.0152z−2+0.3400z−1+0.5094z0+0.1353z1
Tfr= 2/4B(z)=0.0609z−2+0.4391z−1+0.4391z0+0.0609z1
Tfr¾B(z)=0.1353z−2+0.5094z−1+0.3400z0+0.0152z1
The gain g is then computed as follows:
and limited between 0 and 1.
Finally, the gain is quantized e.g. with two bits, using e.g. uniform quantization.
The LTP postfilter from [14] may be used (an example of the transfer function is provided in
whose parameters are determined from the parameters estimated at the encoder-side and decoded from the bitstream. g is the decoded gain, Tint and Tfr the integer and fractional part of the decoded pitch-lag, α and β two scalars that weight the gain, and B(z, Tfr) a low-pass FIR filter whose coefficients depends on the fractional part of the decoded pitch-lag. The order and the coefficients of B(z, Tfr) can also depend on the bitrate and the output sampling rate. A different frequency response can be designed and tuned for each combination of bitrate and output sampling rate.
Difference in the LTP postfilter to [14] may be the transition from one frame to the next. LTP postfilter used in the end portion of the frame k−1 is Hk−1:
and in the end portion of the frame k is Hk:
With bj(Tfr) defined in the following table:
In the beginning portion of the frame k (current kth frame or update interval) there may be three possibilities:
and subsequently by H′k:
For cases where complexity is more important than the quality, the 3. possibility is used whenever at least one of gk≠gk−1, Tint,k≠Tint,k−1, Tfr,k≠Tfr,k−1 is satisfied. Some of these conditions may be more important than some other ones, according to specific examples. In some examples, the difference in the pitch is the most important condition to be verified, for the choice between the 2. and 3. possibilities.
An example of the applications of the equations above for LTP postfilter is provided below with reference to
y[n]=x[n]−=x[n], by virtue of gk=0;
Notably, when it is determined (e.g., by the selector 55) that the first and/or second filtering will not be performed (or will provide, as the output, the same value of the input, basically operating as “identity filter”, which is in general useless), it is possible to bypass the useless filtering unit and/or section (e.g., as in
A discussion is here provided on the operation of the filters for an LTP postfilter. The decoded signal after MDCT or MDST (or any other lapped transform) synthesis may be postfiltered in the time-domain using an IIR filter whose parameters depend on the LTPF bitstream data may be, for example, “pitch_index” and/or “ltpf_active” (the latter activating/deactivating an LTP postfilter operation). To avoid discontinuity when the parameters change from one frame to the next, a transition mechanism may be applied on the first quarter of the current frame.
An LTPF IIR postfilter can be implemented using (see also above):
with {circumflex over (x)}(n) is the filter input signal (i.e. the decoded signal after MDCT or MDST synthesis) and (n) is the filter output signal.
The integer part pint and the fractional part pfr of the LTPF pitch-lag may be computed as follows. First the pitch-lag (e.g., at 12.8 kHz) may be recovered using
The pitch-lag may then be scaled to the output sampling rate f, and converted to integer and fractional parts using
wherein Fs is the sampling rate.
The filter coefficients cnum(k) and cden(k,pfr) may be computed as follows
cnum(k)=0.85*gainltpf*tab_ltpf_num_fs[k] for k=0 . . . Lnum
cden(k,pfr)=gainltpf*tab_ltpf_den_fs_[pfr][k] for k=0 . . . Lden
with
and gainltpf and gainind may be obtained, in some examples, according to procedures such as:
the tables tab_ltpf_num_fs[k] and tab_ltpf_den_fs[pfr][k] are predetermined. Some examples may be (instead of “fs”, an actual bandwidth is used):
Five different cases may be considered:
with cnummem, cdenmem, pintmem and pfrmem are the filter parameters computed in the previous frame
Advantages of the invention are here discussed. The examples above are less complex than the previously used implementations. For example in the exemplary case of the LTP postfilter, the complexity advantage over the LPC method used in [13] is clear. Comparing to the cross-fade methods used in [9], [10], [11] and [12] there is one operation less per sample. To see this, notice that
can be realized by subtracting constant
from βgk−1 and adding constant
sample wise. Together with the filtering using H′k−1 and H′k this is equivalent number of operations to filtering with Hk−1 and Hk followed by weighting of the filtered signals as used in the cross-fade method. The cross-fade method then is continued with the addition of the weighted signals, while the proposed method already has already produced the output.
When it is determined that the first and/or second filtering will not be performed (or will provide, as the output, the same value of the input, basically operating as “identity filter”, which is in general useless), it is possible to bypass the useless filtering unit and/or section (e.g., as in
In general, discontinuities are avoided when filter changes at a frame border from Hk−1:
to Hk:
by filtering the beginning portion with length L of the frame k with time-varying filter H′k−1:
followed by filtering the intermediate output y′[n] of the time-varying filter H′k−1 with the time-varying filter H′k:
H′k−1 is time-varying filter that changes from full filtering with Hk−1 to no filtering. H′k is time-varying filter that changes from no filtering to full filtering with Hk.
A first example (based on the example of
A second example is here provided in pseudocode:
2.
In this 2. implementation we could also include the fractional part of the pitch in addition to checking the difference between the integer part of the pitch.
In examples, the systems 110 and 120 may be the same device.
Depending on certain implementation requirements, examples may be implemented in hardware. The implementation may be performed using a digital storage medium, for example a floppy disk, a Digital Versatile Disc (DVD), a Blu-Ray Disc, a Compact Disc (CD), a Read-only Memory (ROM), a Programmable Read-only Memory (PROM), an Erasable and Programmable Read-only Memory (EPROM), an Electrically Erasable Programmable Read-Only Memory (EEPROM) or a flash memory, having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system such that the respective method is performed. Therefore, the digital storage medium may be computer readable.
Generally, examples may be implemented as a computer program product with program instructions, the program instructions being operative for performing one of the methods when the computer program product runs on a computer. The program instructions may for example be stored on a machine readable medium.
Other examples comprise the computer program for performing one of the methods described herein, stored on a machine readable carrier. In other words, an example of method is, therefore, a computer program having a program instructions for performing one of the methods described herein, when the computer program runs on a computer.
A further example of the methods is, therefore, a data carrier medium (or a digital storage medium, or a computer-readable medium) comprising, recorded thereon, the computer program for performing one of the methods described herein. The data carrier medium, the digital storage medium or the recorded medium are tangible and/or non-transitionary, rather than signals which are intangible and transitory.
A further example comprises a processing unit, for example a computer, or a programmable logic device performing one of the methods described herein.
A further example comprises a computer having installed thereon the computer program for performing one of the methods described herein.
A further example comprises an apparatus or a system transferring (for example, electronically or optically) a computer program for performing one of the methods described herein to a receiver. The receiver may, for example, be a computer, a mobile device, a memory device or the like. The apparatus or system may, for example, comprise a file server for transferring the computer program to the receiver.
In some examples, a programmable logic device (for example, a field programmable gate array) may be used to perform some or all of the functionalities of the methods described herein. In some examples, a field programmable gate array may cooperate with a microprocessor in order to perform one of the methods described herein. Generally, the methods may be performed by any appropriate hardware apparatus.
While this invention has been described in terms of several advantageous embodiments, there are alterations, permutations, and equivalents which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations, and equivalents as fall within the true spirit and scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
17201105 | Nov 2017 | EP | regional |
This application is a continuation of copending International Application No. PCT/EP2018/080837, filed Nov. 9, 2018, which is incorporated herein by reference in its entirety, and additionally claims priority from European Application No. EP 17201105.8, filed Nov. 10, 2017, which is also incorporated herein by reference in its entirety. The present examples relate to systems and methods for performing filtering of signals (e.g., LTP postfilter and/or a pre-filter).
Number | Name | Date | Kind |
---|---|---|---|
4972484 | Link et al. | Nov 1990 | A |
5012517 | Chhatwal et al. | Apr 1991 | A |
5581653 | Todd | Dec 1996 | A |
5651091 | Chen et al. | Jul 1997 | A |
5781888 | Herre | Jul 1998 | A |
5812971 | Herre | Sep 1998 | A |
5819209 | Inoue | Oct 1998 | A |
5909663 | Iijima et al. | Jun 1999 | A |
5999899 | Robinson | Dec 1999 | A |
6018706 | Huang et al. | Jan 2000 | A |
6148288 | Park | Nov 2000 | A |
6167093 | Tsutsui et al. | Dec 2000 | A |
6507814 | Gao | Jan 2003 | B1 |
6570991 | Scheirer et al. | May 2003 | B1 |
6665638 | Kang et al. | Dec 2003 | B1 |
6735561 | Johnston et al. | May 2004 | B1 |
7009533 | Wegener | Mar 2006 | B1 |
7302396 | Cooke | Nov 2007 | B1 |
7353168 | Chen et al. | Apr 2008 | B2 |
7395209 | Dokic et al. | Jul 2008 | B1 |
7539612 | Chen et al. | May 2009 | B2 |
7546240 | Chen et al. | Jun 2009 | B2 |
8015000 | Chen et al. | Sep 2011 | B2 |
8095359 | Boehm et al. | Jan 2012 | B2 |
8280538 | Kim et al. | Oct 2012 | B2 |
8473301 | Chen et al. | Jun 2013 | B2 |
8543389 | Ragot et al. | Sep 2013 | B2 |
8554549 | Morii et al. | Oct 2013 | B2 |
8612240 | Fuchs et al. | Dec 2013 | B2 |
8682681 | Fuchs et al. | Mar 2014 | B2 |
8738385 | Chen | May 2014 | B2 |
8751246 | Bayer et al. | Jun 2014 | B2 |
8847795 | Faure et al. | Sep 2014 | B2 |
8891775 | Mundt al. | Nov 2014 | B2 |
8898068 | Fuchs et al. | Nov 2014 | B2 |
9026451 | Kleijn et al. | May 2015 | B1 |
9123350 | Zhao et al. | Sep 2015 | B2 |
9489961 | Kovesi et al. | Nov 2016 | B2 |
9595262 | Guillaume et al. | Mar 2017 | B2 |
10296959 | Chernikhova et al. | May 2019 | B1 |
10726854 | Ghido et al. | Jul 2020 | B2 |
20010026327 | Schreiber et al. | Oct 2001 | A1 |
20030088408 | Thyssen | May 2003 | A1 |
20030101050 | Cuperman et al. | May 2003 | A1 |
20040158462 | Rutledge et al. | Aug 2004 | A1 |
20040162866 | Malvar et al. | Aug 2004 | A1 |
20050010395 | Chiu et al. | Jan 2005 | A1 |
20050015249 | Chen et al. | Jan 2005 | A1 |
20050192799 | Kim et al. | Sep 2005 | A1 |
20050246178 | Fejzo | Nov 2005 | A1 |
20060288851 | Esima et al. | Dec 2006 | A1 |
20070033056 | Groeschl et al. | Feb 2007 | A1 |
20070078646 | Lei et al. | Apr 2007 | A1 |
20070118361 | Sinha et al. | May 2007 | A1 |
20070118369 | Chen | May 2007 | A1 |
20070124136 | Den Brinker et al. | May 2007 | A1 |
20070127729 | Breebaart et al. | Jun 2007 | A1 |
20070129940 | Geyersberger et al. | Jun 2007 | A1 |
20070154031 | Avendano et al. | Jul 2007 | A1 |
20070276656 | Solbach et al. | Nov 2007 | A1 |
20080033718 | Zopf et al. | Feb 2008 | A1 |
20080091418 | Laaksonen et al. | Apr 2008 | A1 |
20080126086 | Kandhadai et al. | May 2008 | A1 |
20080126096 | Choo et al. | May 2008 | A1 |
20090076805 | Du et al. | Mar 2009 | A1 |
20090076830 | Taleb | Mar 2009 | A1 |
20090089050 | Mo et al. | Apr 2009 | A1 |
20090138267 | Davidson et al. | May 2009 | A1 |
20090248424 | Koishida et al. | Oct 2009 | A1 |
20090254352 | Zhao | Oct 2009 | A1 |
20100010810 | Morii | Jan 2010 | A1 |
20100070270 | Gao | Mar 2010 | A1 |
20100094637 | Vinton | Apr 2010 | A1 |
20100115370 | Ramo et al. | May 2010 | A1 |
20100198588 | Osada et al. | Aug 2010 | A1 |
20100223061 | Ojanpera | Sep 2010 | A1 |
20100312552 | Kandhadai et al. | Dec 2010 | A1 |
20100312553 | Fang et al. | Dec 2010 | A1 |
20100324912 | Choo et al. | Dec 2010 | A1 |
20110015768 | Kim et al. | Jan 2011 | A1 |
20110022924 | Malenovsky et al. | Jan 2011 | A1 |
20110035212 | Briand et al. | Feb 2011 | A1 |
20110060597 | Chen et al. | Mar 2011 | A1 |
20110071839 | Budnikov et al. | Mar 2011 | A1 |
20110095920 | Ashley et al. | Apr 2011 | A1 |
20110096830 | Ashley et al. | Apr 2011 | A1 |
20110116542 | Antonini et al. | May 2011 | A1 |
20110125505 | Gournay et al. | May 2011 | A1 |
20110145003 | Bessette | Jun 2011 | A1 |
20110196673 | Park et al. | Aug 2011 | A1 |
20110200198 | Bayer et al. | Aug 2011 | A1 |
20110238425 | Lecomte et al. | Sep 2011 | A1 |
20110238426 | Borsum et al. | Sep 2011 | A1 |
20120010879 | Kikuiri et al. | Jan 2012 | A1 |
20120022881 | Geiger et al. | Jan 2012 | A1 |
20120072209 | Krishnan et al. | Mar 2012 | A1 |
20120109659 | Chen et al. | May 2012 | A1 |
20120214544 | Rodriguez et al. | Aug 2012 | A1 |
20120245947 | Neuendorf et al. | Sep 2012 | A1 |
20120265540 | Fuchs et al. | Oct 2012 | A1 |
20120265541 | Geiger et al. | Oct 2012 | A1 |
20130030819 | Carlsson et al. | Jan 2013 | A1 |
20130096912 | Kjoerling et al. | Apr 2013 | A1 |
20130226594 | Fuchs et al. | Aug 2013 | A1 |
20130282369 | Ryu et al. | Oct 2013 | A1 |
20140052439 | Nanjundaswamy et al. | Feb 2014 | A1 |
20140067404 | Baumgarte | Mar 2014 | A1 |
20140074486 | Dietz et al. | Mar 2014 | A1 |
20140108020 | Bai et al. | Apr 2014 | A1 |
20140142957 | Lee et al. | May 2014 | A1 |
20140172141 | Mangold | Jun 2014 | A1 |
20140223029 | Bhaskar et al. | Aug 2014 | A1 |
20140330556 | Biswas et al. | Nov 2014 | A1 |
20140358531 | Vos Koen | Dec 2014 | A1 |
20150010155 | Lang et al. | Jan 2015 | A1 |
20150081312 | Fuchs et al. | Mar 2015 | A1 |
20150142452 | Lee et al. | May 2015 | A1 |
20150154969 | Craven et al. | Jun 2015 | A1 |
20150170668 | Kovesi et al. | Jun 2015 | A1 |
20150221311 | Jeon et al. | Aug 2015 | A1 |
20150228287 | Bruhn et al. | Aug 2015 | A1 |
20150255079 | Huang et al. | Sep 2015 | A1 |
20150302859 | Aguilar et al. | Oct 2015 | A1 |
20150302861 | Salami et al. | Oct 2015 | A1 |
20150325246 | Chen et al. | Nov 2015 | A1 |
20150371647 | Faure et al. | Dec 2015 | A1 |
20160019898 | Schreiner et al. | Jan 2016 | A1 |
20160027450 | Gao | Jan 2016 | A1 |
20160078878 | Ravelli et al. | Mar 2016 | A1 |
20160104488 | Schnabel | Apr 2016 | A1 |
20160111094 | Dietz et al. | Apr 2016 | A1 |
20160163326 | Resch et al. | Jun 2016 | A1 |
20160189721 | Johnston et al. | Jun 2016 | A1 |
20160225384 | Kjörling et al. | Aug 2016 | A1 |
20160247506 | Lecomte et al. | Aug 2016 | A1 |
20160285718 | Bruhn | Sep 2016 | A1 |
20160293174 | Atti et al. | Oct 2016 | A1 |
20160293175 | Atti et al. | Oct 2016 | A1 |
20160307576 | Doehla et al. | Oct 2016 | A1 |
20160365097 | Guan et al. | Dec 2016 | A1 |
20160372125 | Atti et al. | Dec 2016 | A1 |
20160372126 | Atti et al. | Dec 2016 | A1 |
20160379655 | Truman et al. | Dec 2016 | A1 |
20170011747 | Faure et al. | Jan 2017 | A1 |
20170053658 | Atti et al. | Feb 2017 | A1 |
20170078794 | Bongiovi et al. | Mar 2017 | A1 |
20170103769 | Laaksonen et al. | Apr 2017 | A1 |
20170110135 | Disch et al. | Apr 2017 | A1 |
20170133029 | Markovic et al. | May 2017 | A1 |
20170140769 | Ravelli et al. | May 2017 | A1 |
20170154631 | Bayer et al. | Jun 2017 | A1 |
20170154635 | Doehla et al. | Jun 2017 | A1 |
20170221495 | Sung et al. | Aug 2017 | A1 |
20170236521 | Atti et al. | Aug 2017 | A1 |
20170249387 | Hatami-Hanza | Aug 2017 | A1 |
20170256266 | Sung et al. | Sep 2017 | A1 |
20170294196 | Bradley et al. | Oct 2017 | A1 |
20170303114 | Johansson et al. | Oct 2017 | A1 |
20190027156 | Sung et al. | Jan 2019 | A1 |
20200335118 | Markovic | Oct 2020 | A1 |
Number | Date | Country |
---|---|---|
101140759 | Mar 2008 | CN |
102779526 | Nov 2012 | CN |
107103908 | Aug 2017 | CN |
0716787 | Jun 1996 | EP |
0732687 | Sep 1996 | EP |
1791115 | May 2007 | EP |
2676266 | Dec 2013 | EP |
2980796 | Feb 2016 | EP |
2980796 | Feb 2016 | EP |
2980799 | Feb 2016 | EP |
3111624 | Jan 2017 | EP |
2944664 | Oct 2010 | FR |
H05-281996 | Oct 1993 | JP |
H07-28499 | Jan 1995 | JP |
H0811644 | Jan 1996 | JP |
H9-190195 | Jul 1997 | JP |
H9-204197 | Aug 1997 | JP |
H10-51313 | Feb 1998 | JP |
H1091194 | Apr 1998 | JP |
H11-330977 | Nov 1999 | JP |
2004-138756 | May 2004 | JP |
2006-527864 | Dec 2006 | JP |
2007519014 | Jul 2007 | JP |
2007-525718 | Sep 2007 | JP |
2009-003387 | Jan 2009 | JP |
2009-008836 | Jan 2009 | JP |
2009-538460 | Nov 2009 | JP |
2010-500631 | Jan 2010 | JP |
2010-501955 | Jan 2010 | JP |
2012-533094 | Dec 2012 | JP |
2015-505992 | Feb 2015 | JP |
2016-523380 | Aug 2016 | JP |
2016-200750 | Dec 2016 | JP |
2017-37328 | Feb 2017 | JP |
2017-522604 | Aug 2017 | JP |
2017-528752 | Sep 2017 | JP |
100261253 | Jul 2000 | KR |
20030031936 | Apr 2003 | KR |
1020050007853 | Jan 2005 | KR |
1020090077951 | Jul 2009 | KR |
10-2010-0136890 | Dec 2010 | KR |
1020100136899 | Dec 2010 | KR |
20130019004 | Feb 2013 | KR |
1020130054993 | May 2013 | KR |
1020160079056 | Jul 2016 | KR |
1020160144978 | Dec 2016 | KR |
20170000933 | Jan 2017 | KR |
2337414 | Oct 2008 | RU |
2376657 | Dec 2009 | RU |
2413312 | Feb 2011 | RU |
2419891 | May 2011 | RU |
2439718 | Jan 2012 | RU |
2483365 | May 2013 | RU |
2520402 | Jun 2014 | RU |
2568381 | Nov 2015 | RU |
2596594 02 | Sep 2016 | RU |
2596596 | Sep 2016 | RU |
2015136540 | Mar 2017 | RU |
2628162 02 | Aug 2017 | RU |
2016105619 | Aug 2017 | RU |
200809770 | Feb 2008 | TW |
201005730 | Feb 2010 | TW |
201126510 | Aug 2011 | TW |
201131550 | Sep 2011 | TW |
201207839 | Feb 2012 | TW |
201243832 | Nov 2012 | TW |
201612896 | Apr 2016 | TW |
201618080 | May 2016 | TW |
201618086 | May 2016 | TW |
201642246 | Dec 2016 | TW |
201642247 | Dec 2016 | TW |
201705126 | Feb 2017 | TW |
201711021 | Mar 2017 | TW |
201713061 | Apr 2017 | TW |
201724085 | Jul 2017 | TW |
201732779 | Sep 2017 | TW |
9916050 | Apr 1999 | WO |
2004072951 | Aug 2004 | WO |
2005086138 | Sep 2005 | WO |
2005086139 | Sep 2005 | WO |
2007073604 | Jul 2007 | WO |
2007138511 | Dec 2007 | WO |
2008025918 | Mar 2008 | WO |
2008046505 | Apr 2008 | WO |
2008072701 | Jun 2008 | WO |
2009066869 | May 2009 | WO |
2011048118 | Apr 2011 | WO |
2011086066 | Jul 2011 | WO |
2011086067 | Jul 2011 | WO |
2012000882 | Jan 2012 | WO |
2012000882 | Jan 2012 | WO |
2012126893 | Sep 2012 | WO |
2014165668 | Oct 2014 | WO |
2014202535 | Dec 2014 | WO |
2014202535 | Dec 2014 | WO |
2015063045 | May 2015 | WO |
2015063227 | May 2015 | WO |
2015071173 | May 2015 | WO |
2015174911 | Nov 2015 | WO |
2016016121 | Feb 2016 | WO |
2016142002 | Sep 2016 | WO |
2016142337 | Sep 2016 | WO |
WO-2019092220 | May 2019 | WO |
Entry |
---|
Sujoy Sarkar, “Examination Report for IN Application No. 202037018091”, dated Jun. 1, 2021, Intellectual Property India, India. |
P.A Volkov, “Office Action for RU Application No. 2020120251”, dated Oct. 28, 2020, ROSPATENT, Russia. |
P.A Volkov, “Office Action for RU Application No. 2020120256”, dated Oct. 28, 2020, ROSPATENT, Russia. |
D.V.Travnikov, “Decision on Grant for RU Application No. 2020118969”, dated Nov. 2, 2020, ROSPATENT, Russia. |
Santosh Mehtry, “Office Action for IN Application No. 202037019203”, dated Mar. 19, 2021, Intellectual Property India, India. |
Hiroshi Ono, “Office Action for JP Application No. 2020-526081”, dated Jun. 22, 2021, JPO, Japan. |
Hiroshi Ono, “Office Action for JP Application No. 2020-526084”, dated Jun. 23, 2021, JPO, Japan. |
Dietz, Martin et al., “Overview of the EVS codec architecture.” 2015 IEEE International Conference on Acoustics, Signal Processing (ICASSP), IEEE, 2015. |
Kazunori Mochimura, “Decision to Grant a Patent for JP application No. 2020-524579”, dated Nov. 29, 2021, JPO, Japan. |
John Tan, “Office Action for SG Application 11202004173P”, dated Jul. 23, 2021, IPOS, Singapore. |
Takeshi Yamashita, “Office Action for JP Application 2020-524877”, dated Jun. 24, 2021, JPO, Japan. |
ROSPATENT Examiner, “Decision on Grant Patent for Invention for RU Application No. 2020118949”, dated Nov. 11, 2020, ROSPATENT, Russia. |
Tsuyoshi Yamashita, “Office Action for JP Application No. 2020-526154”, dated Aug. 25, 2021, JPO, Japan. |
Guojun Lu et al., “A Technique towards Automatic Audio Classification and Retrieval, Forth International Conference an Signal Processing”, 1998, IEEE, Oct. 12, 1998, pp. 1142 to 1145. |
Hiroshi Ono, “Office Action for JP Application No. 2020-526135”, dated May 21, 2021, JPO Japan. |
International Telecommunication Union, “G. 729-based embedded variable bit-rate coder: An 8-32 kbit/s scalable wideband coder bitstream interoperable with G.729”. ITU-T Recommendation G.729.1., May 2006. |
3GGP TS 26.445, “Universal Mobile TElecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); Detailed algorithmic description (3GPP TS 26.445 version 13.4.1 Release 13)”, ETSI TS 126 445 V13.4.1., Apr. 2017. |
Nam Sook Lee, “Office Action for KR Application No. 10-2020-7016100”, dated Jan. 13, 2022, KIPO, Republic of Korea. |
Nam Sook Lee, “Office Action for KR Application No. 10-2020-7016224”, dated Jan. 13, 2022, KIPO, Republic of Korea. |
Mam Sook Lee, “Office Action for KR Application No. 10-2020-7015835”, dated Jan. 13, 2022, KIPO, Republic of Korea. |
Mao Xiaohong, “Examination Report for SG Application No. 11202004228V”, dated Sep. 2, 2021, IPOS, Singapore. |
Mao Xiaohong, “Search Report for SG Application No. 11202004228V”, Sep. 3, 2021, IPOS, Singapore. |
Nam Sook Lee, “Office Action for KR Application No. 10-2020-7015512”, dated Sep. 9, 2021, KIPO, Republic of Korea. |
Tomonori Kikuchi, “Office Action for JP Application No. 2020-524874”, dated Jun. 2, 2021, JPO Japan. |
Tetsuyuki Okumachi, “Office Action for JP Application 2020-118837”, dated Jul. 16, 2021, JPO, Japan. |
Tetsuyuki Okumachi, “Office Action for JP Application 2020-118838”, dated Jul. 16, 2021, JPO, Japan. |
O.E. Groshev, “Office Action for RU Application No. 2020118947”, dated Dec. 1, 2020, ROSPATENT, Russia. |
O.I. Starukhina, “Office Action for RU Application No. 2020118968”, dated Dec. 23, 2020, ROSPATENT, Russia. |
Khalid Sayood, “Introduction to Data Compression”, Elsevier Science & Technology, 2005, Section 16.4, Figure 16. 13, p. 526. |
Patterson et al., “Computer Organization and Design”, The hardware/software Interface, Revised Fourth Edition, Elsevier, 2012. |
ETSI TS 126 445 V13.2.0 (Aug. 2016), Universal Mobile Telecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); Detailed algorithmic description (3GPP TS 26.445 version 13.2.0 Release 13) [Online]. Available: http://www.3gpp.org/ftp/Specs/archive/26_series/26.445/26445-d00.zip. |
Geiger, “Audio Coding based on integer transform”, Ilmenau: https://www.db-thueringen.de/receive/dbt_mods_00010054, 2004. |
Henrique S Malvar, “Biorthogonal and Nonuniform Lapped Transforms for Transform Coding with Reduced Blocking and Ringing Artifacts”, IEEE Transactions on Signal Processing, IEEE Service Center, New York, NY, US, (199804), vol. 46, No. 4, ISSN 1053-587X, XP011058114. |
Anonymous, “ISO/IEC 14496-3:2005/FDAM 9, AAC-ELD”, 82. MPEG Meeting;Oct. 22, 2007-Oct. 26, 2007 Shenzhen; (Motion Picture Expert Group or ISO/IEC JTC1/SC29/WG11),, (Feb. 21, 2008), No. N9499, XP030015994. |
Virette, “Low Delay Transform for High Quality Low Delay Audio Coding”, Université de Rennes 1, (Dec. 10, 2012), pp. 1-195, URL: https://hal.inria.fr/tel-01205574/document, (Mar. 30, 2016), XP055261425. |
SO/IEC 14496-3:2001; Information technology—Coding of audio-visual objects—Part 3: Audio. |
3GPP TS 26.403 v14.0.0 (Mar. 2017); General audio codec audio processing functions; Enhanced acPIus general audio codec; Encoder specification; Advanced Audio Coding (AAC) part; (Release 14). |
ISO/IEC 23003-3; Information technology—MPEG audio technologies—Part 3: Unified speech and audio coding, 2011. |
3GPP TS 26.445 V14.1.0 (Jun. 2017), 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Codec for Enhanced Voice Services (EVS); Detailed Algorithmic Description (Release 14), http://www.3gpp.org/ftp//Specs/archive/26_series/26.445/26445-e10.zip, Section 5.1.6 “Bandwidth detection”. |
Eksler Vaclav et al, “Audio bandwidth detection in the EVS codec”, 2015 IEEE Global Conference on Signal and Information Processing (GLOBALSIP), IEEE, (Dec, 14, 2015), doi:10.1109/GLOBALSIP.2015.7418243, pp. 488-492, XP032871707. |
Oger M et al, “Transform Audio Coding with Arithmetic-Coded Scalar Quantization and Model-Based Bit Allocation”, International Conference on Acoustics, Speech, and SignalProcessing, IEEE, XX,Apr. 15, 2007 (Apr. 15, 2007), p. IV-545, XP002464925. |
Asad et al., “An enhanced least significant bit modification technique for audio steganography”, International Conference on Computer Networks and Information Technology, Jul. 11-13, 2011. |
Makandar et al., “Least Significant Bit Coding Analysis for Audio Steganography”, Journal of Future Generation Computing, vol. 2, No. 3, Mar. 2018. |
ISO/IEC 23008-3:2015; Information technology—High efficiency coding and media delivery in heterogeneous environments—Part 3: 3D audio. |
ITU-T G.718 (Jun. 2008): Series G: Transmission Systems and Media, Digital Systems and Networks, Digital terminal equipments—Coding of voice and audio signals, Frame error robust narrow-band and wideband embedded variable bit-rate coding of speech and audio from 8-32 kbit/s. |
3GPP TS 26.447 V14.1.0 (Jun. 2017), Technical Specification, 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Codec for Enhanced Voice Services (EVS); Error Concealment of Lost Packets (Release 14). |
DVB Organization, “ISO-IEC_23008-3_A3_(E)_(H 3DA FDAM3).docx”, DVB, Digital Video Broadcasting, C/O EBU—17A Ancienne Route—CH-1218 Grand Saconnex, Geneva—Switzerland, (Jun. 13, 2016), XP017851888. |
Hill et al., “Exponential stability of time-varying linear systems,” IMA J Numer Anal, pp. 865-885, 2011. |
3GPP TS 26.090 V14.0.0 (Mar. 2017), 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Mandatory Speech Codec speech processing functions; Adaptive Multi-Rate (AMR) speech codec; Transcoding functions (Release 14). |
3GPP TS 26.190 V14.0.0 (Mar. 2017), Technical Specification, 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Speech codec speech processing functions; Adaptive Multi-Rate—Wideband (AMR-WB) speech codec; Transcoding functions (Release 14). |
3GPP TS 26.290 V14.0.0 (Mar. 2017), Technical Specification, 3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Audio codec processing functions; Extended Adaptive Multi-Rate—Wideband (AMR-WB+) codec; Transcoding functions (Release 14). |
Edler et al., “Perceptual Audio Coding Using a Time-Varying Linear Pre- and Post-Filter,” in AES 109th Convention, Los Angeles, 2000. |
Gray et al., “Digital lattice and ladder filter synthesis,” IEEE Transactions on Audio and Electroacoustics, vol. vol. 21, No. No 6, pp. 491-500, 1973. |
Lamoureux et al., “Stability of time variant filters,” CREWES Research Report—vol. 19, 2007. |
Herre et al., “Enhancing the performance of perceptual audio coders by using temporal noise shaping (TNS).” Audio Engineering Society Convention 101. Audio Engineering Society, 1996. |
Herre et al., “Continuously signal-adaptive filterbank for high-quality perceptual audio coding.” Applications of Signal Processing to Audio and Acoustics, 1997. 1997 IEEE ASSP Workshop on. IEEE, 1997. |
Herre, “Temporal noise shaping, quantization and coding methods in perceptual audio coding: A tutorial ntroduction.” Audio Engineering Society Conference: 17th International Conference: High-Quality Audio Coding. Audio Engineering Society, 1999. |
Fuchs Guillaume et al., “Low delay LPC and MDCT-based audio coding in the EVS codec”, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, (Apr. 19, 2015), doi: 10.1109/ICASSP.2015.7179068, pp. 5723-5727, XP033187858. |
Niamut et al., “RD Optimal Temporal Noise Shaping for Transform Audio Coding”, Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on Toulouse, France May 14-19, 2006, Piscataway, Nj, USA,IEEE, Piscataway, NJ, USA, (Jan. 1, 2006), doi:10.1109/ICASSP.2006.1661244, ISBN 978-1-4244-0469-8, pp. V-V, XP031015996. |
ITU-T G.711 (Sep. 1999): Series G: Transmission Systems and Media, Digital Systems and Networks, Digital transmission systems—Terminal equipments—Coding of analogue signals by pulse code modulation, Pulse code modulation (PCM) of voice frequencies. Appendix I: A high quality low-complexity algorithm for packet loss concealment with G.711. |
Cheveigne et al.,“YIN, a fundamental frequency estimator for speech and music.” The Journal of the Acoustical Society of America 111.4 (2002): 1917-1930. |
Ojala P et al, “A novel pitch-lag search method using adaptive weighting and median filtering”, Speech Coding Proceedings, 1999 IEEE Workshop on Porvoo, Finland Jun. 20-23, 1999, Piscataway, NJ, USA, IEEE, US, (Jun. 20, 1999), doi:10.1109/SCFT.1999.781502, ISBN 978-0-7803-5651-1, pp. 114-116, XP010345546. |
“5 Functional description of the encoder”, Dec. 10, 2014 (Dec. 10, 2014), 3GPP Standard; 26445-C10_1_S05_S0501, 3rd Generation Partnership Project (3GPP)?, Mobile Competence Centre ; 650, Route Des Lucioles ; F-06921 Sophia-Antipolis Cedex ; France Retrieved from the Internet:URL : http://www.3gpp.org/ftp/Specs/2014-12/Rel-12/26_series/ XP050907035. |
Nam Sook Lee, “Office Action for KR Application No. 10-2020-7016424”, dated Feb. 9, 2022, KIPO, Korea. |
Nam Sook Lee, “Office Action for KR Application No. 10-2020-7016503”, dated Feb. 9, 2022, KIPO, Korea. |
Xiong-Malvar, “A Nonuniform Modulated Complex Lapped Transform”, IEEE Signal Processing Letters, vol. 8, No. 9, Sep. 2001. (Year: 2001). |
Raj et al., “An Overview of MDCT for Time Domain Aliasing Cancellation”, 2014 International Conference on Communication and Network Technologies (ICCNT). (Year: 2014). |
Malvar, “Biorthogonal and Nonuniform Lapped Transforms for Transform Coding with Reduced Blocking and Ringing Artifacts”, IEEE Transactions on Signal Processing, vol. 46, No. 4, Apr. 1998. (Year: 1998). |
Malvar, “Lapped Transforms for Efficient Transform/Subband Coding”, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 38, No. 6, Jun. 1990. (Year: 1990). |
Malvar, “Fast Algorithms for Orthogonal and Biorthogonal Modulated Lapped Transforms”, Microsoft Research, 1998. (Year: 1998). |
Princen-Bradley, “Analysis/Synthesis Filter Bank Design Based on Time Domain Aliasing Cancellation”, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-34, No. 5, Oct. 1986. (Year: 1986). |
Shlien, “The Modulated Lapped Transform, Its Time-Varying Forms, and Its Applications to Audio Coding Standards”, IEEE Transactions on Speech and Audio Processing, vol. 5, No. 4, Jul. 1997. (Year: 1997). |
ETSI TS 126 445 V12.0.0, “Universal Mobile Telecommunications System (UMTS); LTE; EVS Codec Detailed Algorithmic Description (3GPP TS 26.445 version 12.0.0 Release 12)”, Nov. 2014. |
ETSI TS 126 403 V6.0.0, “Universal Mobile Telecommunications System (UMTS); General audio codec audio processing functions; Enhanced aacPIus general audio codec; Encoder specification; Advanced Audio Coding (AAC) part (3GPP TS 26.403 version 6.0.0 Release 6)”, Sep. 2004. |
ETSI TS 126 401 V6.2.0, “Universal Mobile Telecommunications System (UMTS); General audio codec audio processing functions; Enhanced aacPIus general audio codec; General description (3GPP TS 26.401 version 6.2.0 Release 6)”, Mar. 2005. |
3GPP TS 26.405, “3rd Generation Partnership Project; Technical Specification Group Services and System Aspects General audio codec audio processing functions; Enhanced aacPlus general audio codec; Encoder specification parametric stereo part (Release 6)”, Sep. 2004. |
3GPP TS 26.447 V12.0.0, “3rd Generation Partnership Project; Technical Specification Group Services and System Aspects; Codec for Enhanced Voice Services (EVS); Error Concealment of Lost Packets (Release 12)”, Sep. 2014. |
ISO/IEC FDIS 23003-3:2011(E), “Information technology—MPEG audio technologies—Part 3: Unified speech and audio coding”, ISO/IEC JTC 1/SC 29/WG 11, Sep. 20, 2011. |
Valin et al., “Definition of the Opus Audio Codec”, Internet Engineering Task Force (IETF) RFC 6716, Sep. 2012. |
Nam Sook Lee, “Decision to Grant a Patent for KR Application No. 10-2020-7015511”, dated Apr. 19, 2022, KIPO, Republic of Korea. |
Nam Sook Lee, “Dedsion to Grant a Patent for KR Application No. 10-2020-7016100”, dated Apr. 21, 2022, KIPO, Republic of Korea. |
Nam Sook Lee, “Dedsion to Grant a Patent for KR Application No. 10-2020-7015836”, dated Apr. 28, 2022, KIPO, Republic of Korea. |
Nam Sook Lee, “Dedsion to Grant a Patent for KR Application No. 10-2020-7015512”, dated Apr. 20, 2022, KIPO, Republic of Korea. |
Nam Sook Lee, “Decision to Grant a Patent for KR Application No. 10-2020-7015835”, dated Apr. 22, 2022, KIPO, Republic of Korea. |
Nam Sook Lee, “Decision to Grant a Patent for KR Application No. 10-2020-7016224”, dated Jul. 25, 2022, KIPO, Republic of Korea. |
Audio codec processing functions, Extended Adaptive Multi-Rate—Wideband (AMR-WB+) codec 3GPP TS 26.290 version 9.0.0 Release 9, 2009.09. |
Number | Date | Country | |
---|---|---|---|
20200335118 A1 | Oct 2020 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/EP2018/080837 | Nov 2018 | US |
Child | 16869143 | US |