The present invention relates to the field of compressed sensing and more specifically to compressed sensing by using a wavelet dictionary.
Compressed sensing, also known as compressive sensing or compressed sampling or sparse sampling is a rapid growing field that has attracted considerable attention in recent years.
The ever increasing amount of data generated by sensing systems such as wireless sensor networks (WSNs) has dramatically emphasized the need for sampling measurement signals at a low rate, i.e. below the Nyquist frequency. Compressive sensing theory states that a signal can be sampled without information loss at a rate close to its information content (Landau rate).
In general, sampling a signal x below the Nyquist frequency, that is at a frequency lower than fNyq=2fmax, where fmax is the highest frequency present in the spectrum, leads to aliasing and therefore impedes signal recovery. Assuming that the signal is sampled at the Nyquist frequency fNyq in a given interval of a real (e.g. of time or space), it can be represented by a vector denoted x of size N in the canonical basis (instants in time, points in space) of this domain.
However, if the representation of x is K-sparse in a given domain (hereinafter called sparse domain), i.e. has at most K nonzero values in a basis spanning this domain, it can be recovered from a number M of linear measurements which is in the order of magnitude of K log N. If we denote y the measurement vector that is the vector constituted of these M linear measurements, it can be expressed as:
y=Φx=ΦΨs (1)
where s is the K-sparse representation of x in the sparse domain, Y is the transformation matrix (also called sparsity matrix) of size N×N from the basis spanning the dual domain to the canonical basis, and Φ is the measurement matrix (also called acquisition matrix) of size M×N, with typically M<<N, capturing the reduction of dimensionality.
The signal s can then be recovered from the measurement vector y, the signal estimate ŝ being obtained minimizing the L1-norm:
where, l1 and l2 denote respectively the L1-norm and the L2-norm, B(y) ensures that ŝ is consistent with the measurement vector y and ε is a tolerance parameter dependent upon the noise.
Several reconstruction algorithms have been proposed in the literature, e.g. the basic pursuit algorithm (BP), the orthogonal matching pursuit algorithm (OMP) and the tree-based OMP (TOMP).
A comprehensive introduction to compressive sensing can be found in Chap. I of the book entitled “Compressive sensing: theory and its applications” by Y. C. Eldar and G. Kutyniok, Cambridge University Press, 2012.
A first example of compressive sensing method is non-uniform sampling (NUS).
We assume in the following that the signal to be sampled is K-sparse in the frequency domain.
Non-uniform sampling allows a sample rate close to the optimal information rate (i.e. the Landau rate) to be approached without complicated analog processing. In a nutshell, a NUS sampler samples the input signal at irregularly distributed instants by taking only a subset of the samples of a Nyquist converter. Assuming an acquisition window of length Tacq, the number of samples acquired by the NUS is equal to M<N=fNyqTacq, where M samples are chosen at random among the N samples of the Nyquist converter.
As shown on
Using the formalism of expression (1) the non-uniform sampling can be expressed as:
y=RTINF1s (3)
where the transformation matrix is here Ψ=F−1, i.e. the inverse discrete Fourier transform (DFT), IN is the matrix identity of size N×N, with N=fNyqTacq, and RT is a selection matrix of size M×N representing a random selection of M columns among N of the identity matrix. Each row of RT has an element equal to 1 and all other elements equal to 0, the M column indicia of the elements equaling to 1 being chosen at random among {1, . . . , N}. The measurement matrix is therefore simply Φ=RTIN.
The non-uniform sampling method is a simple compressive method but has several shortcomings.
First, in case the signal input to the NUS sampler is wideband, the ADC still needs to capture a snapshot of this wideband signal with frequencies possibly reaching up to the highest frequency present in the spectrum.
Second, although the NUS sampler operates at a sub-Nyquist rate, a clock at the Nyquist rate is required to synchronize the samples.
Third, a NUS sampler is highly sensitive to timing jitter. If the input signal varies rapidly, a small error in the sampling time may lead to an erroneous sample value.
A second example of compressive sensing method is random-modulation pre-integration (RMPI). According to this scheme, the signal is mixed in parallel channels with a plurality of pseudo-random binary sequences (PRBS) at the Nyquist rate, the output results being integrated and sampled at a low rate. The idea behind RMPI is that it is much easier to mix a wideband signal at the Nyquist rate than to accurately sample it at this rate.
More recently, it has been proposed in patent U.S. Pat. No. 8,836,557 to achieve compressed sensing of a signal by exploiting its sparsity in a Gabor frame representation. More specifically, the signal is mixed in a plurality of channels operating in parallel with modulating waveforms belonging to a Gabor frame and the modulation products are integrated over a finite time interval. The modulating waveforms are designed such that the mixing operation produces weighted superposition of various subs-sections of the signal related to different sub-intervals of the finite time interval. The integration results provide sample values that represent the signal and enable its reconstruction.
However, this method is not optimal for a multiband signal inasmuch as it does not exploit the structure of the signal spectrum. In addition, the pseudo-random binary sequence needs to be run at a high rate and therefore leads to a high power consumption.
The purpose of the present invention is therefore to propose a compressed sensing method which does not present the disadvantages of the prior art, in particular which enables sampling of a sparse multi-band signal at a very low rate, close to the information rate.
The present invention is defined in the appended independent claims. Various preferred embodiments are defined in the dependent claims.
The present invention will be better understood from the description of the following embodiments, by way of illustration and in no way limitative thereto:
We will assume in the following that the signal of interest x is sparse in the frequency domain, i.e. that only few (K) coefficients of the vector s=Fx, where F is a Fourier matrix, are non-zero. By non-zero it is meant here that these coefficients are greater in absolute value than a predetermined threshold η. In addition it is assumed that the signal may occupy a plurality NT of bands known a priori. The K non-zero coefficients are clustered within a subset of Nb<NT bands of these bands (Nb≥1), hereinafter referred to as bands of interest. Hence, the frequency support Ω of the signal may be expressed in terms of bands of interest:
where Bi, i=1, . . . , Nb denote the bands where there is at least one spectral coefficient exceeding η.
Finally, it will be assumed that the respective central frequencies fic of the bands of interest Bi are known or can be determined. A method for determining the boundaries of the bands is described further below.
In the present instance, the spectrum of the sparse signal lies within two bands of interest B1,B2 respectively centered around the frequencies f1c,f2c. The non-zero component at the frequency fα lies outside the bands of interest and is considered as an interference signal.
From bandpass sampling theory (see for example the article of R. G. Vaughan et al. entitled “The theory of bandpass sampling published in IEEE Trans. on Signal Proc., Vol. 39, No. 9, September 1991, pp. 1973-1978”, it is known that if a signal exhibits a band of bandwidth Bw located between a frequency fL and a frequency fII, the signal can be sampled at the frequency 2Bw without suffering aliasing coming from within the band Bw itself. In other words, the samples at the Nyquist rate can be decimated by an undersampling factor
Bandpass sampling theory has been generalized to accommodate the sampling of multi-band signals as described in the article of R. Venkataramani et al. entitled “Optimal sub-Nyquist non-uniform sampling and reconstruction for multi-band signals” published in IEEE Trans. on signal processing, Vol. 49, No. 10, October 2001, pp. 2301-2313. Basically, a multiband signal can be sampled by taking samples on a periodic non-uniform time grid where the basic frequency of the sampling grid is chosen as the lowest sampling rate fs guaranteeing the absence of aliasing (that is the overlap of copies of the spectrum, translated by multiples of Ω, λ(Ω)≤fs≤λ([Ω]), where [Ω] is the smallest interval containing Ω and λ(.) denotes the Lebesgue measure. This method of non-uniform sampling is also called multi-coset sampling in the literature.
However, bandpass sampling and multi-coset sampling dramatically suffer from aliasing, coming from outside Ω, and therefore turn out to be less efficient than conventional Nyquist sampling. The signal to noise ratio (SNR) is indeed degraded by the sub-sampling ratio. If the signal of interest is affected by interference located outside the support Ω, the interference lines are folded into the baseband, thus preventing an accurate reconstruction thereof
The idea underlying the invention is first to perform a projection of the signal of interest onto waveforms belonging to a Gabor frame or wavelet frame, the waveforms succeeding one another at the lower bound bandpass sampling rate fΩ=λ(Ω), the parameters of the waveforms being chosen according to the characteristics of the bands Bi i=1, . . . , Nb, and then to non-uniformly sample the projection results thus obtained. The compressed sensing method according to the invention can therefore be regarded as a non-uniform wavelet bandpass sampling method (Wav-NUBPS).
First, it is recalled that a wavelet is a waveform of limited duration, zero average and non-zero norm. A continuous function x(t) in L2() (and more generally L2() can be represented by a set of functions ψs,δ(t) obtained by scaling (with scale s) and shifting (with shift δ) a mother wavelet ψ.
The wavelet transform of a continuous function x(t)ϵL2() is defined by:
where
is a scaled and shifted version of the mother wavelet ψ.
By projecting x(t) onto the wavelet at various continuous scales and positions, a function of two variables Wx(s,δ) is defined, called the continuous wavelet transform (CWT) of x. Thus, the CWT represents a one dimensional signal in a two-dimensional time-frequency domain (s,δ). In practice, the continuous variables s and δ are replaced by their discrete counterparts sj=s0j and δk,j=kδ0s0j, k, jϵ, the time-frequency domain being covered by waveforms belonging to a frame (or dictionary):
Dψ is in general an overcomplete frame and not necessarily a basis of L2(). The projection of x(t) onto the waveforms of the dictionary defines a set of discrete values, Wx(sj,δk,j) called the discrete wavelet transform (DWT) of x.
Similarly, the signal of interest can be projected onto a Gabor frame constituted by the waveforms:
gk,j(t)=g(t−ak)e2πibjt (7)
where g(t) is a window function (e.g. a raised cosine window) having a non-zero value only over a window width and a zero value elsewhere, a is a discretization interval in time and b is a discretization interval in frequency, and kϵ, jϵ. Hence, gk,j(t) is a waveform centered around the instant a·k and the frequency bj.
The time-frequency domain is covered by the overcomplete frame (Gabor frame):
DG={gk,j(t)}kϵ,jϵ (8)
It will be understood that, contrary to the Morlet frame, the waveforms of the Gabor frame all have the same time window width described by the time support of the window function g(t).
Similarly as explained above, the signal of interest x(t) can be projected onto the waveforms of the frame, thereby defining a set of discrete values:
Gx(k·a,j·b)=x,gk,j=∫−∞+∞x(t)g(t−ak)e−2πibjtdt (9)
where denotes the Hermitian dot product.
The signal of interest x(t) being K-sparse in the frequency domain, it can be represented by a vector s having K non-null elements in the Fourier basis, where x=Ψs.
The signal x(t) can be measured by projecting it on a set of waveforms of the dictionary (Gabor frame, Morlet frame or wavelet Haar basis).
The measurement column vector, y, thus obtained can be expressed as in (1) where the sparsifying or transformation matrix Ψ=F−1 is the inverse discrete Fourier transform and the measurement matrix is Φ=RTWT, RT being the selection matrix (selection of M rows of the sampling matrix WT, hence of M columns of the matrix W) and W is a matrix of size N×N, representing the Ns atoms of the dictionary in the N dimension space. More specifically, each of the Ns rows of the sampling matrix WT represents an atom defined by one or more parameters. For example, each row of WT is defined by the parameter pair (s,δ) in case of Morlet wavelets and the parameter pair (a,b) for a Gabor frame.
In other words, the index of the row in the matrix W corresponds to the time index while the index of the column corresponds to a waveform index in the dictionary defined by a given set of parameters.
If the dictionary is a wavelet frame, the elements of the matrix W are given by:
where s0j is the scale of the waveform kδ0s0j is the time-shift of the waveform with respect to the origin of the time axis.
If the dictionary is a Gabor frame, the elements of the matrix W are given by:
Wk,j=g(t−ka)e2πibjt (11)
Therefore the waveform index corresponds here to a couple of scale and time delay in the case of a wavelet frame and a couple of carrier frequency and time delay in the case of a Gabor frame.
The compressed sensing method assumes prior knowledge of the band boundaries of the multi-band sparse signal of interest x(t). If these band boundaries are not known, a support recovery of the spectrum is carried out at step 310 and the bands of interest are derived therefrom. In this sense, step 310 is optional as shown by the dashed line. Various algorithms can be used for this recovery such as the compressive binary search (CBS) method described in the article of M. A. Davenport et al. entitled “Compressive binary search” published in the Proc. of IEEE International Symposium on Information Theory (ISIT), February 2012, included herein by reference. A refined CBS method using wavelets that demonstrates a practical implementation solution will be explained further below in relation with
At step 320, the signal of interest is projected onto, i.e. correlated with, a plurality of waveforms belonging to a dictionary, typically an overcomplete dictionary (Gabor frame, wavelet frame e.g. C-Morlet wavelet frame) or a simple basis (Haar wavelet family). These waveforms succeed one another at the sampling rate
being defined as the undersampling factor. Two consecutive waveforms are therefore separated by TΩ=γTNyq.
More specifically, if the signal of interest is a multiband signal exhibiting Nb bands of respective bandwidths Biw, i=1, . . . , Nb, the undersampling factor is given by:
where
is the aggregated bandwidth of the multiband signal.
Although the number of sampling instants is reduced by the undersampling factor γ with respect to conventional sampling at the Nyquist frequency, it should be understood that the number of points of the sampling grid (corresponding to the number of available atoms) can be much higher than the original dimension space N (corresponding to the number of samples at the standard Nyquist rate N=Tacq/TNyq) when an overcomplete dictionary is used
In case of a Gabor frame, the waveforms are indexed by their central frequencies bj and their time shifts ak, where the time shifts are defined from the beginning of the acquisition window. Hence, the Gabor frame provides a sampling grid of N×Nfc points, the granularity on time axis being TNyq and
being the number of central frequencies used. However, the fact we perform bandpass sampling at fΩ amounts to a selection of NΩ×Nfc points in the sampling grid where NΩ=fΩTacq. In other words, the sampling grid actually used for the present invention exhibits a coarser time granularity than at the Nyquist frequency.
Similarly, in case of a wavelet frame (e.g. C-Morlet frame), the waveforms are indexed by their scales s0j and their time shifts kδ0s0j. Hence, the wavelet frame provides a non-uniform sampling grid of Nsc×Nδ points where Nsc is the number of scales and Nδ the number of elementary shifts available in the dictionary for a given scale. As it can be seen from expression (10) the time shifts depend on the considered scale. A low scale value is associated with short duration wavelets and therefore with short time shifts whereas a high scale value is associated with long duration wavelets and therefore with long time shifts. At any rate, the time shift resolution shows a coarser time granularity than at the Nyquist frequency.
In view of the non-uniform sampling step 330 described further below, it is generally sufficient to project the signal onto only one waveform per bandpass time shift. However, if more diversity is required, the signal can be projected on a plurality of waveforms per time shift, for example different scales (wavelet frame) and/or different central frequencies (Gabor frame).
Additionally, the parameters of the waveforms used for the projection are chosen based on the characteristics of the bands Bi of the input signal. In other words, only a subset of the dictionary may actually be used for the projection.
For example, in case of a Gabor frame, the time width of the window function g(t) is chosen to be shorter than the inverse of the aggregated bandwidth, Baggw, so that each projection captures information over the whole bandwidth of interest. In other words, the spectrum of the window function should be as flat as possible over the bandwidth of interest and not concentrate energy on a specific frequency within the support. Alternatively, a specific window gi(I) can be chosen for each individual sub-band Bi, the time width of the window function gi(t) being shorter that the bandwidth Biw of the bandwidth it is associated with.
Similarly, in case of a wavelet frame, the scale s can be chosen higher than a predetermined value depending upon Baggw in order to capture information of the signal over the whole bandwidth of interest. The benefit of this approach would be to extract information with variable frequency resolution according to the scale.
More generally, the waveforms used for the projection are selected from the dictionary, such that the mutual coherence of the matrix WT and the matrix FΩ−1 where WT is the sampling matrix (or projection matrix) and FΩ−1 is the matrix of the inverse Fourier transform limited to the spectral support Ω. The mutual coherence of WT and FΩ−1 is defined as:
Hence, the lower the mutual coherence of WT and FΩ−1, that is the lesser the mutual coherence between the sampling matrix and the sparsifying matrix, the better the spreading of the measurement over the spectral support Ω.
It has been shown that, in order to minimize the mutual coherence and to optimally spread out the measurements over the spectral support, the ratio between the waveform bandwidth and the aggregated signal bandwidth, that is the relative bandwidth of the waveform, should be higher than 50%.
Furthermore, the waveforms are advantageously chosen so that their central frequencies lie within the spectral support.
The selected atoms are distributed in time every TΩ=γTNyq. They comprise a first plurality of waveforms following one another every
and a second plurality of waveforms following each other every
the waveforms of the first plurality alternating with the waveforms of the second plurality. The central frequencies (bj) of the waveforms of the first plurality are distributed within the band B1 and the central frequencies of the waveforms of the second plurality are distributed within the band B2.
More generally, the waveforms used for the projection of a multiband signal form a sequence recurring with a period γTNyq, the waveforms being distributed among a plurality of sets, each set being associated with a given band of the multiband signal, the waveforms belonging to different sets being interleaved. The waveforms associated with a band Bi form a subsequence of the sequence and recur with a period
the union of these subsequences providing the sequence in question.
Turning back to
If the acquisition interval of the signal is Tacq, the M sampling instants (equivalent to time shifts) are randomly selected among NΩ sampling instants uniformly distributed over the acquisition interval, where NΩ=Tacq·fNyq/γ.
The expression sampling matrix should not be misconstrued: it provides here the correlation results with waveforms of the dictionary instead of Dirac pulses sampling as in the conventional sampling approach (the correlation with Dirac pulses results sampling the signal of interest at the instants of the pulses). By doing so, the invention mitigates drastically the level of aliasing. Indeed, the correlation of the signal of interest x(t) with the waveform ψ(t) is equivalent, in the frequency domain, to a multiplication of the spectrum s(f) of this signal with the conjugate of the Fourier transform of the waveform, denoted Ψ°(−f). Therefore, each measurement produces a filtered version of s(f) instead of the whole spectrum.
For example, in case the dictionary is a Gabor frame, the Gabor atoms exhibit a Gaussian shape in the frequency domain which is very efficient for rejection purposes. As a result the method according to the invention outperforms the bandpass sampling approach in terms of noise and interference out of band rejection efficiency. In addition, it allows for a compression down to the information rate reducing the data stream. Finally, the solution does not require any high speed A/D converter operating at the Nyquist rate.
The input K-sparse multiband signal x(t) is mixed in an analog mixer 510 with a sequence Swav(t) of waveforms following each other at the bandpass sampling interval TΩ=γTNyq, where γ is the undersampling factor defined in (12). In other words, the sequence Swav(t) is constituted of waveforms time shifted at multiples of the bandpass sampling interval TΩ=γTNyq.
The waveforms are generated by a waveform generator 520 that will be described in more detail further below.
The waveforms belong to an overcomplete dictionary such as a wavelet frame or a Gabor frame. Alternately, the Haar basis can also considered.
The output of the mixer 510 is integrated over intervals of length τ. The integration time r is advantageously chosen equal to the maximum time width of the waveforms in order to prevent noise integration. The integrator 530 operates at the rate 1/TΩ so that a correlation value (or projection value) is provided at each and every period TΩ.
The sequence of correlation values is then non-uniformly sampled at random in the ADC converter 540, with an average decimation rate of NΩ/M. In other words, M correlation values are selected randomly out of NΩ and digitally converted. The correlation values are output at the rate
which can be made arbitrarily close to the Landau rate.
According to a preferred variant, the waveform generator does not output a waveform (in other words its output is inhibited) when the correlation value is not sampled by the A/D converter. The sequence of waveforms where M out of NΩ are randomly upheld and the others are omitted is denoted Swavrdm(t). The omitted waveforms are shown in grey in the original sequence Swav. In that instance the output of a pseudo-random binary sequence (PRBS) generator (not shown), in practice a linear feedback shift register (LFSR), controls both the output of the waveform generator 520 and the A/D converter 550.
This second example of implementation differs from the first one in that it comprises a plurality P of projection and non-uniform sampling branches operating in parallel.
More specifically, each branch 590p, p=1, . . . , P comprises an analog mixer 510p mixing the input signal with a subsequence of waveforms Swavp obtained by decimating the sequence Swav by a factor P. More specifically, if we denote Swav(1), Swav(2), Swav(3) . . . the waveforms of the sequence Swav at TΩ, 2TΩ, 3TΩ, . . . the mixer 510p mixes the input signal with the sequence S, comprised of the waveforms Swav(p), Swav(p+P−1), Swav(p+2P−1), . . . generated by a corresponding waveform generator 520p. The integrator 530p integrates the mixer output over intervals of length τ. In contrast to the previous example, the integrator 530p operates at the reduced rate 1/(PTΩ). The subsequence of correlation values is non-uniformly sampled by the A/D converter 540p, with an average rate of NΩ/M, each A/D converter being driven by a corresponding PRBS generator (not shown). The compressed sensing measurements are represented by the set of sampled correlation values provided by the P branches. The correlation values are output at the rate
which can be made arbitrarily close to the Landau rate.
The preferred variant mentioned in relation to
Finally, it should be noted that the sampling rates and the decimation rates of the various branches need not be equal. For example the ith branch, i=1, . . . , P, may use a sampling rate of fsi=fΩ/pi and a decimation rate NΩ/mi, where
In such instance, it can be shown that the number of samples retained over the acquisition time Tacq=NΩTΩ remains equal to M. Preferably, these values pi are chosen such that pi=2i−1.
This third example of implementation differs from the second one in that each branch is dedicated to a given waveform parameter or a given set of waveform parameters. The sequence of waveforms generated by a waveform generator 520p is denoted Swavpar p. For example, each branch can be dedicated to a given scale (in case of use of a wavelet frame) or a given central frequency of the waveform (in case of use of a Gabor frame).
The various branches do not necessarily operate at the reduced rate 1/(PTΩ) but may operate a higher rate up to 1/TΩ. If we assume that the P branches operate synchronously at the bandpass sampling rate 1/TΩ (as shown in the figure), they will collectively provide NΩP correlation values within a time interval Tacq=NΩTΩ, out of which M correlation values will be selected and digitally converted in ADC 550, where at most one correlation value out of P is selected per period Ts. It results that here again the correlation values are output at the rate
which can be made arbitrarily close to the Landau rate.
This example of implementation is advantageous in that it entails more diversity than the second example of implementation.
The waveform generator relies on a cross-coupled Voltage Control Oscillator (VCO). The man skilled in the art will recognize NMOS active cross paired transistors 611 and 612, in parallel with the passive LC tank, 620 that form an oscillator when the power source 630 is activated.
The frequency of the VCO is controlled by the frequency control signal Vf. This signal tunes the LC tank to a specific oscillation frequency.
Every clock period Ts, if the PRBS generator 640 outputs a logic “1” level, a pulse is generated by the baseband pulse shaper 650, having the desired shape of the waveform. The shaped pulse drives the bias current source Ibias so as to provide oscillation during a limited time window defined by the pulse width.
Additional features may be envisaged to further improve the waveform generator.
For example, an additional switch may be provided in parallel to the VCO that creates a shortcut between branches and accelerate oscillation damping. This shortcut avoids any ringing and hangover between pulses.
The waveform generator may be adapted to generate pulse oscillations in quadrature so as to provide complex wavelet. Typically this could be implemented by any QVCO implementation such as doubling frequency, polyphase filtering or combined VCO.
The waveform generator may further comprise means for setting the initial phase of the pulse when starting the oscillation. This could be implemented by providing an unbalance or mismatch between the two branches, that will force oscillation starting in one direction, for example by using NMOS transistors having different W/L ratios.
In the compressed sensing method of
The method performs a search by dichotomy through the bandwidth of the signal. Basically, the method goes through a series of iterations, the bandwidth being split into 2i+i
The method starts at step 710, with a band B split into 2i
The index i is initialized i=1.
At step 720, each bin is split into two half-bins of width
a lower (or left) half-bin and an upper (or right) half-bin. Two wavelets are associated with each half-bin, a left wavelet with a central frequency corresponding to the center of the left half-bin and a right wavelet with a central frequency corresponding to the center of the right half-bin. The left and right wavelets have the same scale which corresponds to the width of the half-bin, i.e.
At step 730, the signal is projected onto the 2i
At step 740, for each bin, the left and the right projection values are compared to each other. The half-bin corresponding to the higher projection value is retained while the half-bin corresponding to the lower projection value is discarded.
The process of bin splitting and projection onto left/right wavelet goes on until a predetermined granularity is reached, i.e. until the size of the bins lies below a predetermined value.
At step 750, it is checked whether i=imax corresponding to the desired resolution level. If it is not the case, the value i is incremented in 755 and the process loops back at 720. If the desired resolution level has been reached, the process continues with step 760 and ends.
The bins that have not been discarded are considered to contain the non-zero components. In other words, as shown in 760, the union of these bins corresponds to the spectral support Ω of the input signal. It is then straightforward to determine the boundaries of the bands present in the spectrum.
The input signal analyzed in the example shown in
and
are retained. These half-bins are used as bins at the next iteration. At the second iteration, the half-bins
and
are retained. The process goes on with scales of increased values until a predetermined resolution level (here
has been reached.
This algorithm works well provided the left and right projection values are well separated. However, where a non-zero spectral component falls at edge of an half-bin there is a risk that the projections values are close to each other, within the noise margin. Hence an incorrect half-bin may be selected and the subsequent iteration will not be able to recover this component.
This ambiguity at the edges of the bins can be eliminated for a real signal. Indeed, in such a case, the Fourier transform of the signal is Hermitian symmetric, that is the spectrum is conjugate symmetric and therefore the support is identical at both ends of the spectrum. To get rid of this ambiguity, it is proposed to introduce a frequency offset in the complex wavelets central frequencies (e.g. C-Morlet wavelets). As a result, the distribution of a wavelet in a (half-)bin is not symmetric anymore. Hence, a non-zero frequency component giving rise to equal values at the edge of a bin at one end of the spectrum gives rise to unequal values at the other end of the spectrum. It is therefore possible to determine unambiguously the (half-) bin where the non-zero component falls into.
Number | Name | Date | Kind |
---|---|---|---|
8804811 | Muqaibel | Aug 2014 | B2 |
9660693 | Ni | May 2017 | B1 |
20100241378 | Baraniuk | Sep 2010 | A1 |
20120250748 | Nguyen | Oct 2012 | A1 |
Number | Date | Country | |
---|---|---|---|
20180131504 A1 | May 2018 | US |