The present invention generally relates to digital sampling rate conversion systems, and more specifically to devices and methods for making such systems shift-invariant and/or characterizing such systems.
Digital sampling rate conversion (SRC) systems receive sampled input signals at a given sampling rate and output the signal with a different sampling rate. The SRC system may increase (upsample) or decrease (downsample) the sampling rate of the signal.
The sampling rate conversion systems are widely used in audio or video systems for performing conversion of audio or video files from a given codec to another. SRC systems may have other applications.
Before integrating a sampling rate conversion system in a multimedia (e.g. audio and/or video) architecture, there may be the need of characterizing the SRC system for studying its behaviour.
Different approaches to characterization and performance measurements for multirate systems exist in the prior art.
SRC may be dealt with as a specific case of a multirate system (MRS). In some prior art approaches, fractional rate sampling rate convertors (SRC) are not considered as Linear Periodically Time Varying (LPTV) systems. For an SRC, input and output periods are constrained to be different. In other approaches, assuming an explicit definition of LPTV with potentially different input and output periods, SRC is clearly considered as LPTV. LPTV systems form a wider class than SRC since filter banks belong to LPTV.
LPTV systems may be entirely described in a polyphase context with tools including: analysis network (PPAN), synthesis network (PPSN) and polyphase matrix.
Typically, the following approach may be used in order to describe a LPTV system: Find a representation that embeds a Linear Time Invariant (LTI) system and characterize the LTI system with some suitable method.
Various different approaches are possible. For example, the LTI system may include a MIMO LTI system characterized by a bi-spectrum. Other approaches include a linear switched time varying (LSTV) system.
The characterizing bi-spectrum may be obtained through different means. For example via formal analysis by serial or parallel concatenation of basic building blocks (MIMO LTI system, decimator, expander, modulator), or via black box analysis with excitation of the system with a set of orthogonal test vectors.
Several techniques may be used for characterizing sampling rate conversion systems (SRC) from the outside (the SRC system being considered as a black box or a near black box (or grey) component).
The parameters characterizing the SRC system may be:
the in-band linear characteristics of the system,
the out-of-band characteristics of the system that generates spectrum aliases, and
the additional background noise due to rounding errors.
The characterizing methods may be based on a distortion measurement either on a sine wave or on a sine sweep wave. The characterization methods may also be based on the reconstruction of the impulse response of the SRC system.
However, SRC systems are not shift-invariant (or time-invariant) systems. For an SRC system, the output of a delayed sequence is not necessarily the delayed version of the resulting output of the non-delayed sequence. This lack of shift-invariance generates aliasing, which penalizes the characterization methods.
Therefore, whereas the aliasing behaviour is not due to a non-linear behaviour, aliasing of SRC systems is usually studied as if it was a non-linear distortion with methods and criterion such as total distortion methods.
In fact, for an SRC system, a source of non-linearity may be the rounding errors. Total distortion methods are thus unable to discriminate between aliasing and real non-linearity.
Hence, there is a need for facilitating the characterization of the SRC systems.
Therefore, a first aspect of the invention relates to a method for making SRC systems shift-invariant (or time-invariant).
It is noted that embodiments of the invention may be equally applicable to any LPTV system (e.g. when P and R are not coprime), even though described below in the context to SRC systems only.
According to the first aspect, there is provided a method of converting the sampling rate of an input signal with input/output shift-invariance using a Linear Periodically Time Varying, LPTV, system.
The method comprises generating a set of polyphase components of the input signal.
The LPTV system being a digital Sampling Rate Conversion, SRC, system and having a fractional ratio R/P, with R and P being co-primes, the method further comprises selecting three integer numbers Q, M and q that satisfy the following equations:
M=RQ,
q=PQ, and
MP=qR,
and the generation of each polyphase component of the set of polyphase components comprises
feeding the generated set of polyphase components to the SRC system; and,
generating an output signal corresponding to the input signal with converted sampling rate, by performing interleaving, shifting and addition on signals output by the SRC system corresponding to the generated set of polyphase components processed by said SRC system.
Once the SRC system is made shift-invariant (or time-invariant) LTI (Linear Time Invariant) characterization methods can be applied and true non-linearity of the SRC system can be studied separately.
A second aspect of the invention relates to a method of characterizing a Linear Periodically Time Varying, LPTV system being a Sampling Rate Conversion, SRC system. The method comprises:
inputting at least one input signal being one of the group consisting of: an impulse and a test sequence;
implementing a method according to the first aspect with the input signal; and,
characterizing the SRC system through the output signal.
In a variant, the second aspect relates to a method of characterizing a Linear Periodically Time Varying, LPTV, system comprising:
inputting at least one input signal being one of the group consisting of: a sine-sweep signal, a sine signal, multiple sine signals, a multi-sine signal, multiple multi-sine signals, a maximum length sequence, or an all pass filtered impulse signal;
implementing a method according to the first aspect, with the input signal; and,
characterizing the LPTV system through the output signal.
Embodiments of the invention are recited in the dependent claims attached hereto.
For example, the method may be performed in real time. In such an embodiment, different instantiations of the SRC system may be used for generating in parallel the polyphase components.
In other embodiments, the polyphase components may be generated by a same instantiation of the SRC system.
The output signal may be generated by an overlapping-add operation.
Once the method implemented, SRC systems behave as linear and shift invariant (LSI) systems. Hence, the SRC systems may be modelled as discrete filters operating in the oversampling domain.
According to some embodiments, the alias components are pushed out of the bandwidth of the SRC system, which facilitates the characterization of the SRC system.
Thus, methods for characterizing shift-invariant systems can be applied in the oversampled domain.
For example, the impulse response characterization method or the transfer function characterization method may be used.
Once the SRC system is made shift-invariant, any remaining distortion or background noise in the SRC system may be associated to internal rounding errors caused by the arithmetic of the SRC system.
Therefore, any distortion method identifying these non-linearities can be used in the oversampled domain for characterizing those rounding errors, and then the SRC system.
For example, the Harmonic distortion method, or the Exponential Sine-Sweep may be used.
A third aspect of the present invention relates to a computer program product comprising one or more stored sequences of instructions that are accessible to a processor and which, when executed by the processor, cause the processor to carry out the steps of the method of the first and/or the second aspect of the present invention.
A fourth aspect of the present invention relates to a system for converting the sampling rate of an input signal with input/output shift-invariance comprising:
a Linear Periodically Time Varying, LPTV, system being a Sampling Rate Conversion, SRC system having a fractional ratio R/P, with R and P being co-primes;
a first input port adapted to receive the input signal;
a first output port adapted to output polyphase components of the input signal to the SRC system,
a second input port adapted to receive signals outputted by the SRC system corresponding to the polyphase components processed by said SRC system,
a processing unit being adapted to generate the polyphase components and to select three integer numbers Q, M and q that satisfy the following equations:
M=RQ,
q=PQ, and
MP=qR,
said processing unit comprising:
a time shifting unit adapted to time shift the input signal a number of times comprised between 0 and MP, and
an M-fold decimator adapted to decimate the time shifted input signals. and being further adapted to perform interleaving, shifting and addition on the signals outputted by the SRC system for generating an output signal, and
a second output port adapted to output the generated output signal.
A fifth aspect of the present invention relates to a system configured to characterize a sampling rate conversion system comprising:
a system according to the fourth aspect, and
a characterization unit adapted to perform a method according to the second aspect.
The present invention is illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings, in which like reference numerals refer to similar elements and in which:
a, 2b, 2c are flowcharts illustrating steps of methods according to embodiments of the invention;
a and 3b are a schematic illustrations of multi-rate frameworks according to embodiments of the invention;
In the following, we consider a digital sampling rate conversion (SRC) system having a fractional ratio R/P (R and P being co-primes).
An SRC system 100 may be represented as the association of an R-f old expander 101, followed by a low-pass filter 102, and followed by a P-fold decimator 103, as illustrated in
In some embodiments the system 100 may be seen as a canonical representation for a resampler, 101 combined with 102 as an integer rate upsampler and 102 combined with 103 as an integer rate downsampler.
This representation always exists and is unique. Therefore, it is a canonical representation. The canonical representation of an SRC may be different from the actual implementation. In implementations that include multistage operations with possible interpolation algorithms, the canonical representation of the SRC system may be either unknown or difficult to retrieve.
The parameters characterizing the SRC algorithms used in SRC systems may be defined by the properties of the low pass filter H(z) as it will be apparent to the person with ordinary skill in the art.
Thus, the SRC system may be characterized, for example, by determining the z-transform of the impulse response H(z) of the low-pass filter, or its transfer function
v designating the reduced pulsation, and R the expansion ratio from the SRC system.
According to some embodiments, as illustrated by
For example, as illustrated by
After step S202, a step S205 of feeding the SRC system with the polyphase components is performed.
The polyphase components may be fed to a same instantiation of the SRC system. However, for example for real time applications, wherein the output samples of the different instantiations of the SRC systems produced at the same time are grouped within one frame, the SRC system may be instantiated a number of times equal to the number of polyphase components. Then, the processing of the polyphase components by the instantiations of the SRC system may be performed at the same time (in parallel). Other variants of the number of instantiations and feeding of polyphase components may also be applicable.
Then, after step S205, a step S206 of generating an output signal is performed. The output signal is generated based on the output of the SRC system (or the outputs of the different instantiations of the SRC system). The output signal is generated by interleaving, shifting and adding the signals output by the SRC system (or the outputs of the different instantiations of the SRC system). The output signal may also be generated by an overlapping add operation on the outputs of the SRC system (for example in case of a real time application).
For example, as illustrated by
The above described embodiment is also illustrated by the multi-rate frameworks of
a shows a number PQR of parallel processing lines each comprising a QR-fold decimator, the SRC system with a rate of R/P and a PQ-fold expander. The input signal is fed to a first processing line and to a delay z−1 that outputs the delayed signal to another processing line and to another delay, and so on until the last processing line receives the input signal delayed PQR times. The first processing line outputs a signal to a delay z−1 that feds the delayed signal to an adder that sums the delayed signal and the output of the second processing line and so on. The combination of the delays and the adders performs the operation of overlap add.
b shows an alternative framework using anti-causal advances z. In this framework, the input signal b is fed to the last processing line and the delays z−1 of the left side of the framework are replaced by advances z.
A system according to some embodiments is illustrated by
The system comprises an input port 401 for receiving a signal at a rate M times the operating input rate of the SRC system 402. The input port 401 may also receive a signal used for characterizing the SRC system 402.
The system further comprises a processing unit 403 for generating the polyphase components. These polyphase components may be fed to the SRC system 403 through an output port 404 that may be coupled to an input port 405 of the SRC system.
The system further comprises an input port 406 for receiving signals output by the SRC system through its output port 412 corresponding to the processed polyphase components. These processed polyphase components may be used by the processing unit 403 for generating and outputting the output signal through an output port 407.
The output port 407 receives a signal at a rate q time the operating output rate of the SRC system. The feeding rate of 401 is identical to the filling rate of 407.
The processing unit may comprise a decimator 408, an expander 409, and a time shifting unit 410.
According to some embodiments, the system may comprise a characterization unit 411 for characterizing the SRC system.
Also, the system according to embodiments may comprise a memory unit 413 for storing data for the implementation of a method according to embodiments of the invention, or storing a computer program according to embodiments of the invention.
In the following, the method for making the SRC system shift-invariant is further explicated.
Making the SRC System Shift-Invariant
In the following, it is shown how a set of system responses organized as a sum of interleaved and shifted signal is linear and time-invariant when this set forms a set of regularly delayed signals from 0 to MP−1 samples.
Under those conditions, the z-transforms S(z) of the interleave-shift-added (ISA) system responses follow an expression which allows a characterization of the SRC system for both in-band and out-of-band domains (B(z) being the z-transform from an interleaved test sequence):
S(z)=H(zQ)×B(z).
Let's consider the problem of characterizing an upsampler with a rate of R=3. The upsampler may be characterized by its canonical representation through a low pass filter H(z). The R-polyphase decomposition (hl(zR))lε[0,R) of H(z) is given as:
For the sake of readability, in the following, a sequence (sn)nεN may be assimilated to its z-transform S(z).
Now, we consider a set of test vectors
for instance slightly delayed versions of a band-limited unit impulse or, said differently, an approximation of the shifted unit
(this being only a notation for a sequence describing a fractional delay of the shifted unit impulse, here
does not equal zk).
The individual vectors
are obtained as the M-polyphase decomposition of B(z):
For the matter of this example, consider M=24, which is not too large but still shares a common denominator with most usual resampling ratios (apart ratios translating sampling rates from the 44.1 kHz domain to sampling rates simple multiple of 8 kHz).
Let's start with the processing (i.e. here, the up-sampling) of each test vectors
resulting into the vectors
The label kR/M is chosen in order to remind that if everything was ideal,
would match the fractional delay
In this case, we have M/R=Q=8 and therefore the 8 first processed vectors form a polyphase decomposition of the unit impulse: s0/8(z), s1/8(z), . . . , s7/8(z). Therefore, let's consider S0(z) obtained as the result of interleaving those 8 vectors:
Now, S0(z) is separable; define βl(z) as the second term of this product:
Then, we have:
S
0(z)=H(zQ)×β0(z) (2)
One may note that the 8 following processed vectors form another polyphase decomposition of the unit impulse: s8/8(z), s9/8(z), . . . , s15/8(z). We can consider S1(z) obtained as the result of interleaving those 8 one-sample shifted vectors,
Note also that the 8 last processed vectors form again another decomposition of the unit impulse: s16/8(z), s17/8(z), . . . , s23/8(z). S2(z) is obtained as the result of interleaving those 8 two-sample shifted vectors, i.e. (z−2sk/8(z))kε[16,24):
Finally we have S(z), the z-transform from the sequence obtained as the addition from the three previous sequences:
We can recognize in Σl=0R−1βl(z) the z-transform B(z) (cf. Equation 1). Therefore, we have generated a sequence as the interleave-shift-add (ISA) operation from the processed test vectors
which z-transform, S(z), is not only separable from H(z), but can also be divided by B(z) (i.e. the z-transform from the interleaved test sequences):
S(z)=H(zQ)×B(z) (3)
We consider the case of a simple downsampler of rate 1/P. Test vectors
are obtained as the M-polyphase decomposition of B(z) (cf. example 1, with for instance M=24):
Consider also the continuation of the test vectors
for k≧M:
The down-sampling of any test vector
results into the vector
with ωP being a non-obvious root from XP−1.
Now, consider the z-transform S(z) of the sequence obtained by interleaving MP consecutive sequences derived from
for instance starting from k=0:
Consider the second part from this expression:
Note that XP−1=(X−1)Σ=0P−1Xl. Therefore, if ωPm≠1, then Σl=0P−1ωPlm=0. Said differently and using the Kronecker symbol
Now, let's return to the original evaluation:
This concludes the evaluation of S(z) in the simple down-sampling case:
S(z)=H(zM)×B(z) (4)
This equation is similar to Eq. (3). This proves once again that H(eiπv) can be estimated and that both in-band and out-of-band characteristics can be safely recovered.
We consider the characterization of a resampler with a fractional ratio R/P, R and P being co-primes. Following the upsampler example, we assume that this resampler is characterized by the unknown filter H(z) in its canonical representation.
An integer M is selected so that Q=M/R is an integer number. An integer q is defined as q=MP/R. The following relationships recap the properties between P, R, M and q:
M=RQ, q=PQ, MP=qR.
Finally, we consider the set of test vectors
obtained as the M-polyphase decomposition of B(z) (cf. example 1):
Define also the composite root Ωm,n as:
For further calculus, define also βlm(z) (cf. the upsampler example) and Bm(z) as:
k+IQ with k in ε[0,Q) and I in ε[0, PR) covers the entire range [0, qR). Therefore, the previous sum can be rewritten as:
For the second evaluation, we have ωMPRm=ωRqRm=ωqm and therefore:
Now, let evaluate βlm(z) and finally Bm(z):
The resampling of any test vector
by the fractional ratio R/P results into the vector
(cf. Appendix).
Consider the ISA sequence, S(z), derived from qR consecutive sequences
e.g. with k in [0, qR). This ISA sequence results from the sum of R shifted interleaved sequences, z−lqSl(z). Each of those sequences, Sl(z), is the result from interleaving q consecutive sequences
e.g. for k in [lq, (l+1)q) (cf. the upsampler example for another slightly different—but equivalent—practical description for the ISA procedure). This process is described in
Let evaluate this expression in term of B(Ωm,nz):
Since P and R are co-primes, mR+nP mod(PQR) covers exactly the range [0, PQR) when m is in [0, P) and n in [0,RQ). Therefore, the double sum in m and n can be replaced by a simple sum in i, with m=i×
Since ωPQR is a root of the unity, we have:
This concludes the evaluation of S(z) and generalizes both Eq. (4) and Eq. (3):
S(z)=H(zQ)×B(z) (5)
The transfer function
from the resampler of ratio R/P can be estimated and both in-band and out-of-band characteristics can be recovered.
The object of this appendix is to evaluate the polyphase expression of a re-sampled signal.
Consider the input sequence which z-transform is b(z). Consider a fractional resampler×R/P the canonical representation of which is based on the low pass filter H(z). Let evaluate the z-transform s(z) of the re-sampled signal. In order to do so, consider first the z-transform σ(z) resulting from the R-expansion:
σ(z)=H(z)·b(zR)
The z-transform s(z) is obtained from σ(z) after a P-fold decimation; when we consider a 0 offset for this decimation, we obtain (with ωp being a root of XP−1):
Altogether, we obtain:
The Characterization of an SRC System
In the previous examples, the transfer function for a resampler with a fractional ratio R/P was established in Eq. (5), for instance, evaluated in the oversampled normalized domain:
Note that
includes M/R duplications of
One possible way to recover
from
is therefore to reweight appropriately each duplicate:
It is even possible to take into account spectrum duplicates corresponding to negative frequencies:
with, in that case, the weights updated accordingly:
bk/M was previously presented as an approximation from a shifted unit impulse. However, this assumption is not needed in order to obtain Eq. (3). Indeed, for the same reasons than for regular filter transfer function characterization, a pseudo-noise excitation vector can be preferred to the impulse-response method, in our case, a specifically designed set of test vectors can also be preferred to a band-limit impulse response. The recover formula in Eq. (6) indicates that it is even not necessary for
to be exactly non-zero since the duplications for
produce redundancy and therefore a tolerance to zeros.
An additional possibility for recovering H(z) and to increase tolerance to zeros (and incidentally tolerance to floating-point rounding errors) is to consider multiple answers S(u)(z) to different sets of test vectors B(u)(z) instead of S(z) alone. In such case, the recovery formula Eq. (6) is generalized by:
Said differently, it is enough for the aliased transfer functions
to cover the range [0,R) in the oversampled normalized spectral domain in order to recover safely the transfer function
is obtained with a low-order low-pass filter. The second set of test vectors
is derived from
by a modulation at the Nyquist frequency (i.e. a multiplication by the sequence 1, −1, 1, −1, 1 . . . ).
Instead of recovering the transfer function
the multi-rate framework from
for m in [0, P); any other contribution being pure zero:
In the impulse method, the amount of processed vector needed is reduced: P processed vectors for the impulse method instead of at least PQR for other methods such as the transfer function method. However, it is possible that the redundancies aspects in the transfer function method guaranty a better robustness against quantization noises.
Those simple examples demonstrate the different possibilities available for estimating an upsampler transfer function and to reveal the upsampler design characteristics for both in-band and out-of-band spectral domains.
The Total Harmonic Distortion (THD) method is a power comparison between the original signal (sine wave) and the signal obtained by extracting the harmonic partials (comb filter) generated by the system on the original sine wave.
The Total Distortion (THD+Noise) method is a power comparison between the original signal (sine wave) and the signal obtained by removing the sine wave component from the distorted tone.
The test input signal b is a sine wave which period in the oversampled domain is Mp samples.
Then, the above discussed method making the SRC system shift-invariant is performed in order to produce the output signal s.
The width of the support of s is denoted lgt.
The complex vector Sk of size lgt is Discrete Fourier Transform of the output signal s.
The element indexed by
in the vector Sk corresponds to the source sine wave. The harmonic distortion of the source sine can be found in the vector Sk for every indexes multiple of k0, i.e. for k=jk0, with j from 0 to pR−1.
Since the analysed system is made shift-invariant by the present method, the only source of harmonic non-linearities originates from rounding error due to the floating-point or fixed-point arithmetic used internally by the device.
The Sine Sweep method consists in the generation of a sine sweep as the input signal and the examination in a time-frequency domain (e.g. a spectrogram) of the SRC processing. This examination reveals separable alias trajectories for each aliased components; it reveals also the characteristics from the in-band linear response which is roughly available as the time envelop from the time response.
The Exponential Sine-Sweep consists in the generating an exponential sine-sweep in the oversampled domain and feeding it to the LSI system. The output reveals separately the linear characteristics, the stop-band characteristics and the rounding-error characteristics.
The Exponential Sine Sweep (ESS) method, aims at characterizing simultaneously the linear and non-linear behaviour of a device (e.g. a transducer) provided that this device follows a model restricted to a certain subclass of Volterra Kernels.
When the ESS test vector is fed into a non-linear device, each distorted harmonic component appears as an anti-causal echo. Deconvolution separates each echo and allows objective measurement.
The sine wave used in the Harmonic Distortion method is replaced by an Exponential Sine-Sweep. Then, the Exponential Sine-Sweep (ESS) method is applied in the oversampled domain in order to characterize the shift-invariant system.
The ESS method applied to the SRC system enables to access to the linear characteristics of the SRC (i.e. the in-band and out-of-band characteristics from the core low-pass filter H) as well as the harmonic distortion characteristics of the algorithms (originating from rounding errors in internal arithmetics).
The Silence Sweep method allows the measure of the total distortion instead of the harmonic distortion. The idea relies on replacing the original sine sweep by a sine sweep subtracted from a pseudo-noise.
Embodiments of the present invention can be embedded in a computer program product, which comprises all the features enabling the implementation of the methods described herein, and which—when loaded in an information processing system—is able to carry out these methods. Computer program means or computer program in the present context mean any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after a conversion to another language. Such a computer program can be designed based on the flowchart of
Embodiments of the invention can be embedded in hardware circuits. For example, the circuits are implemented by the preparation of ASICs, FPGAs, microcontrollers or discrete logic elements by interconnecting an appropriate network of conventional component circuits. The circuit may also comprise memory elements such as ROMs, RAMs, EPROMs, EEPROMs, Flash memory, magnetic or optical cards, or any type of media suitable for storing electronic information.
Expressions such as “comprise”, “include”, “incorporate”, “contain”, “is” and “have” are to be construed in a non-exclusive manner when interpreting the description and its associated claims, namely construed to allow for other items or components which are not explicitly defined also to be present. Reference to the singular is also to be construed in be a reference to the plural and vice versa.
While there has been illustrated and described what are presently considered to be the preferred embodiments of the present invention, it will be understood by those skilled in the art that various other modifications may be made, and equivalents may be substituted, without departing from the true scope of the present invention. Additionally, many modifications may be made to adapt a particular situation to the teachings of the present invention without departing from the central inventive concept described herein. Furthermore, an embodiment of the present invention may not include all of the features described above. Therefore, it is intended that the present invention not be limited to the particular embodiments disclosed, but that the invention include all embodiments falling within the scope of the appended claims.
A person skilled in the art will readily appreciate that various parameters disclosed in the description may be modified and that various embodiments disclosed may be combined without departing from the scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
10306182.6 | Oct 2010 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP11/68882 | 10/27/2011 | WO | 00 | 7/9/2013 |
Number | Date | Country | |
---|---|---|---|
61448356 | Mar 2011 | US |