The present invention relates to a method for the non-linear estimation of wireless signals from several sources, the time/frequency representation of which shows an unknown non-zero proportion of zero components, using an array made up of P>2 antennas, when the directional vectors U and V of the sources emitting these signals are additionally known or estimated.
It is commonly necessary to estimate wireless signals originating from radars or communication systems, or acoustic signals (audio or sonar), received by a listening system made up of an antenna array.
The received signal results from a temporal and spectral mixture of no more than 2 sources, the directional vectors of which are assumed to be known, since they are estimated beforehand.
The criterion traditionally used is the maximum likelihood (ML), leading to processing by spatial linear filtering, which improves the signal-to-noise ratio by a factor equal to the number of sensors in the single-source case.
In the scenarios being considered (known directional vectors), this processing leads to an unbiased linear estimate with minimal variance for a signal about which no a priori knowledge is available.
Other methods that are more complicated to carry out can be used, such as Capon filtering, when the directional vectors are imperfectly or not known (“Robust Adaptive Beamforming”, eds P. Stoica and J. Li, Wiley, 2006).
None of these methods exploits any a priori information on the signal, and in particular none makes it possible to correctly delimit the temporal and/or spectral supports of the signal, since a linear (ML) or pseudo-linear (Capon) processing always provides an output signal, even if, at the input, the measurement is only made up of noise.
The problem is to access finer knowledge of the signal.
The invention aims to propose a method enabling a finer determination of the components of the signal.
To that end, the invention relates to a method including the following steps:
The method thus makes it possible to answer the following questions: for each of the signals present (their number being assumed to be limited to 2 locally), what are the temporal and spectral supports of the supposed signal described by the components obtained using a time/frequency analysis? And what is the value of each component when it is not zero? The answers to these questions make it possible to improve the knowledge of the signal.
According to specific embodiments, the method includes one or more of the following features:
The invention will be better understood upon reading the following description, done in reference to the appended drawings:
The device 8 for estimating a mixture of signals coming from several sources 12 according to the invention illustrated in
The invention is suitable both for monopolar antenna arrays and bipolar antenna arrays.
The device further includes computing modules. In different alternative embodiments of the device for estimating signals according to the invention, the computing modules can be arranged in different architectures; in particular, each step of the method can be carried out by a separate module, or on the contrary, all of the steps can be grouped together within a single information processing unit 14.
The computing unit(s) are connected to the sensors by any means suitable for transmitting the received signals.
The computing unit(s) include information storage means, as well as computing means making it possible to carry out the algorithms of
Advantageously, the reception is done on a spatial diversity array (interferometric array), and the demodulation of the signal allowing the “baseband lowering” is done by the same local oscillator for all of the antennas in order to ensure coherence. The received signal is sampled in real or complex form (double demodulation in quadrature or any other method) on each channel.
The received signal, filtered in a band typically of several hundred MHz, is modeled by: s0(t)e2iπf0t, where f0 is the carrier frequency.
This signal is sampled at a rate Te such that 1/Te>>2×band of the wanted signal.
The weighted, overlapping Discrete Fourier Transforms (DFT) of this signal are computed over NDFT points. The weighting serves to reduce the secondary lobes. Since this weighting causes a variation in the contribution of the data to the DFT (the data at the center of the temporal support of the DFT being assigned a much higher weight than the data at the edges of the support), which can go as far as the loss of short signals, overlapping temporal supports are used.
The collected measurements are therefore the results of the DFTs. They constitute a time-frequency grid, the boxes of which are called time-frequency boxes. Each box of the grid contains a complex vector that is the result of the Discrete Fourier Transforms for all of the channels, for a given time interval and frequency interval.
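The overlapping, weighted DFT grid described above can be sketched as follows (a minimal single-channel illustration in Python/NumPy; the function name, the Hann weighting, and the 50% overlap are assumptions of this sketch, not requirements of the method):

```python
import numpy as np

def tf_grid(x, n_dft=256, overlap=0.5):
    """Weighted, overlapping DFTs of one channel (a minimal STFT-like sketch).

    x: complex baseband samples of a single antenna channel.
    Returns an array of shape (n_windows, n_dft); each row holds the
    time/frequency boxes of one time interval.
    """
    hop = int(n_dft * (1.0 - overlap))
    window = np.hanning(n_dft)          # weighting reduces the secondary lobes
    n_win = 1 + (len(x) - n_dft) // hop
    rows = [np.fft.fft(window * x[k * hop:k * hop + n_dft])
            for k in range(n_win)]
    return np.stack(rows)
```

Applying the same transform to every channel and stacking the results gives, for each box, the complex P-vector mentioned below.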
For the frequencies of the signals and the distances involved here, the wave front is considered to be planar. The antennas therefore receive the signal with a phase difference depending on the two angles between the wave plane and the plane of the antennas.
In the monopolar case, the measurements collected on the complete array are therefore written as follows:
Xn=snU+Wn
In the case of a monopolar interferometric array, U is written as follows:
U=g(u1, u2, . . . , uP)T,
where g is a complex scalar dependent on the polarization of the incident wave and its arrival direction, and where ui are modulus 1 complex numbers representing the geometric phase shifts associated with the direction of the incident wave. It is possible to choose one of the antennas of the receiver as phase reference.
In the case of a bipolar interferometric array, U is written as follows:
U=hH+vV,
where h and v are complex scalars such that |h|2+|v|2=1, which express the polarization of the incident wave, and where H (V, resp.) is the response from the array to a horizontally (vertically, resp.) polarized wave. H and V only depend on the direction and frequency of the incident wave.
In all cases, U is of dimension P, where P is the number of antennas used.
It is possible to consider that U is normalized, and that sn carries the power of the signal and the mean gain of the array.
U is preserved by the DFT, which is a linear transform, and is therefore found in the signal output by the DFT.
In the general scenario, it is possible to have a mixture of K signals (K optionally being greater than the resolution of the array). The signal is then written:
The fundamental hypothesis is that when the set of N time/frequency boxes is restricted to a rectangular zone or window with index j, the complexity of the environment is such that in such a window, the mixture of the signals is limited to two signals.
The model then becomes:
Xjn=sj1(n)Uj1+sj2(n)Uj2+Wn Equation 2 Expression of a signal mixture in a window
Windows of predetermined size are defined to cover the time-frequency grid. All of these windows form a division of the grid. The size of the windows is chosen such that no more than two signals from two sources are present in each window.
The vector U or the vectors U and V designating the unitary directional vector(s) formed by the incident signal(s) relative to the array are next extracted (or estimated), using any suitable known means.
A loop is formed to traverse all of the windows defining the division of the grid.
For each window, a step for estimating the signal(s) is carried out using the conditional expectation and approximations that can be made thereto for the specific model of the signals.
The final processing is broken down into:
The estimate of the signal is done over a window where the situation is mono-source or dual-source.
In a mono-source situation, the vectorial signal measured for the box with index n is
Xn=snU+Wn, n=1, 2, . . . Equation 3 Expression of the measured signal (mono-source scenario)
In a dual-source situation, one has:
Xn=snU+cnV+Wn, n=1, 2, . . . Equation 4 Expression of the measured signal (dual-source scenario)
In the writings of Equation 3 and Equation 4, U and V are unitary directional vectors of a monopolar or bipolar array, and sn and cn are complex signals that must be estimated; Wn is the noise, which is white spatially and in n, Gaussian, centered, and with covariance E(WnWn*)=2σ2IP, where IP is the identity matrix of CP (P being the number of receiving channels); * designates the conjugate transpose.
Hereinafter, it will be assumed that U and V are estimated by Û and V̂, for example using a method of the MUSIC type; since this estimate is done over a number of boxes N>>1, it is possible to make the approximation U=Û, V=V̂.
To account for the fact that sn (or cn) can be zero for certain n, without knowing the modulation of the signal in advance, sn (or cn) is modeled as independent samples in n of a random variable whose probability density is a mixture (q, 1−q, with 0<q<1) of two centered Gaussians with respective variances 2σj2 (j=1 for sn and j=2 for cn) and 2τ2, with τ2<σ2<σj2.
2σj2 is the power of the wanted signal when it is present, once τ2 is neglected in the expression (1−q)·2τ2+q·2σj2 of the mean power.
τ is a regularization parameter of the model that makes it possible to apply the Bayes formula for probability laws admitting probability densities relative to the Lebesgue measure; however, the physical reality is that there is no signal where the model uses the centered Gaussian with variance 2τ2 (power 2τ2). That is why, at the end of the calculations, only the limit of the expressions when τ→0 is retained.
sn and cn are considered to be independent.
In both cases, mono-source or dual-source, the measured signal is written in the single form:
Xn=MSn+Wn, n=1, 2, . . . N Equation 5 Matrix form of the measured signal
where M=U and Sn=sn in mono-source, and M=(U V), Sn=(sn cn)T in dual-source, with M and σ2 known.
In the rest of the document, to make the notations lighter, with the understanding that the processing is done on each box with index n independently, the index n is omitted and X denotes the measurement in a time/frequency box, s the mono-source signal and S=(s c)T the dual-source signal in a time/frequency box.
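For illustration, one dual-source box of the model X=MS+W (Equation 5) can be simulated as follows (a sketch with assumed values; the uniform-linear-array steering model is purely illustrative, since the method itself only assumes unit directional vectors U and V):

```python
import numpy as np

rng = np.random.default_rng(0)
P = 8                                    # number of antennas (assumed)

def steering(theta, P):
    """Unit-norm directional vector of an illustrative uniform linear array."""
    u = np.exp(1j * np.pi * np.arange(P) * np.sin(theta))
    return u / np.linalg.norm(u)

U, V = steering(0.3, P), steering(-0.5, P)
M = np.column_stack([U, V])              # M = (U V) in the dual-source case
S = np.array([1.0 + 0.5j, -0.8j])        # S = (s c)^T for this box
sigma2 = 0.01                            # noise power is 2*sigma2 per channel
W = np.sqrt(sigma2) * (rng.standard_normal(P) + 1j * rng.standard_normal(P))
X = M @ S + W                            # Equation 5: X = M S + W
```

Each real and imaginary noise component has variance σ², so that E(WW*)=2σ²IP as in the model.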
The a priori knowledge of S is given by a probability density. In the mono-source case:
In the dual-source case:
where q1=q², q2=q3=q(1−q), q4=(1−q)² if the sources are considered to be independent and equally probable (hereinafter, we generalize to a distribution q1, q2, q3, q4 of the 4 situations not connected by the above expressions),
And
are the covariance matrices of S for the four possible cases.
S is estimated using the conditional expectation by using the mono-source and dual-source models (Equation 5, Equation 6 and Equation 5, Equation 7).
The Conditional Expectation (CE) is the estimator Ŝ that minimizes the mean quadratic deviation E(∥S−Ŝ∥2). It is also unbiased, and provides an explicit solution for Ŝ. It is built as follows:
Let X be the measurement; its probability density, which depends on the parameter to be estimated S, is interpreted as the conditional probability density of X knowing S. One therefore has p(X/S) and p(S) derived from the a priori knowledge of S.
Ŝ is given by the explicit formula:
p(S/X), the conditional probability of S knowing X, is obtained by the Bayes formula.
In the case where X=MS+W and p(S) is Gaussian, centered with covariance C, it is possible to find Ŝ analytically, which is generally not the case otherwise.
This is a linear function of X. In fact (in dimension 2):
Let us take
By completing the “square” in S, we have:
where K2 is a constant (=1/π44σ4) in dimension 2. Σ is a positive definite Hermitian matrix. One deduces from this:
The complete expressions of Equation 12 will be used to find the desired estimator for our problem.
In the case where S is a Gaussian sample, Equation 9, Equation 10, Equation 11, Equation 12 and Equation 13 yield:
which can also be written:
Ŝ=(2σ2Σ−1)−1M*X=(M*M+2σ2C−1)−1M*X
One concludes from this that if 2σ2C−1<<M*M, Ŝ is reduced to the maximum likelihood estimator of S using the model of Equation 5 with no a priori knowledge of S: ŜMV=(M*M)−1M*X.
The condition 2σ2C−1<<M*M as matrices is also expressed by C>>2σ2(M*M)−1, which means that the a priori on S that is defined by C does not provide real information on S.
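This reduction can be checked numerically. The sketch below (illustrative Python; the matrix sizes and the weak-prior covariance C are assumptions) implements both the Gaussian-prior estimator (M*M+2σ2C−1)−1M*X and the maximum likelihood estimator (M*M)−1M*X:

```python
import numpy as np

def s_hat_gaussian(M, X, C, sigma2):
    """Conditional expectation for a Gaussian prior: (M*M + 2s2 C^-1)^-1 M*X."""
    A = M.conj().T @ M + 2.0 * sigma2 * np.linalg.inv(C)
    return np.linalg.solve(A, M.conj().T @ X)

def s_hat_ml(M, X):
    """Maximum likelihood estimate with no a priori on S: (M*M)^-1 M*X."""
    return np.linalg.solve(M.conj().T @ M, M.conj().T @ X)
```

When C>>2σ2(M*M)−1 (an uninformative prior), the two estimates coincide to numerical precision.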
The estimate of the signal in the mono-source case is as indicated below.
In the one-dimensional case for S=s (mono-source), M=U, and therefore M*M=1;
The matrix C reduces to the constant c;
the following is deduced from this:
This then yields:
The conditional expectation is obtained, in the Gaussian case for s, by the quotient of Equation 15 by Equation 16:
The estimate of the signal in the dual-source case is as indicated below.
The conditional expectation estimator is obtained using Equation 15 and Equation 16 for the mixture density of S given by Equation 7.
After simplifying by the common factor
in all of the terms in the numerator and the denominator, one obtains:
Let us take Γj=2σ2Cj−1 (without dimension), and
Qj=(M*M+2σ2Cj−1)−1=(M*M+Γj)−1=Σj/2σ2
We have:
One deduces from this:
The expression of the estimate of the signal is subject to an approximation to allow it to be estimated as indicated below in the dual-source case.
The products of determinants in Equation 19 are respectively equal to the following expressions, an approximation of which is provided for a good signal-to-noise ratio (σ12/σ2>>1 and σ22/σ2>>1) and for τ→0.
detQ1detΓ1=[(1+σ2/σ12)(1+σ2/σ22)−|U*V|2]−1σ4/σ12σ22≈(1−|U*V|2)−1σ4/σ12σ22
detQ2detΓ2=[(1+σ2/σ12)(1+σ2/τ2)−|U*V|2]−1σ4/σ12τ2≈(τ2/σ2)(1+σ2/σ12)−1·(σ4/σ12τ2)≈σ2/σ12
detQ3detΓ3=[(1+σ2/τ2)(1+σ2/σ22)−|U*V|2]−1σ4/σ22τ2≈σ2/σ22
detQ4detΓ4=[(1+σ2/τ2)2−|U*V|2]−1σ4/τ4≈1 Equation 21 det Qj det Γj
Likewise, one finds, for Qj when τ→0:
Q1 is unchanged.
One can see that the products det Qi det Γi have a finite limit in each of the four situations, as do the matrices Qj, which is a satisfactory behavior.
One has thus obtained a first expression of the estimator.
In reality, only one of the terms in Equation 19 is preponderant for each box, which leads to a first simplification. The new estimation processing for Ŝ is deduced from this: Ŝ=Qj0M*X, where
Which is simplified as:
M*X is given by:
The det Qj·det Γj are given by Equation 21.
The Qj are given by Equation 22.
The qj are for example given by
if there is independence of the 4 situations and equal probability for s≠0,c≠0.
F(j) is given by:
F(j1)=ln(q1detQ1detΓ1)+X*MQ1M*X/2σ2=ln(q²σ4/((1−|U*V|2)σ12σ22))+X*MQ1M*X/2σ2
F(j2)=ln(q2detQ2detΓ2)+X*MQ2M*X/2σ2=ln(q(1−q)σ2/σ12)+X*UU*X/2σ2
F(j3)=ln(q3detQ3detΓ3)+X*MQ3M*X/2σ2=ln(q(1−q)σ2/σ22)+X*VV*X/2σ2
F(j4)=ln(q4detQ4detΓ4)+X*MQ4M*X/2σ2=ln((1−q)2)
For j0=1, the maximum likelihood estimator is found. Indeed, in this case,
and Q1=(M*M)−1 such that
For j0=2,ŜT=(U*X,0) (filtering by the directional vector of source 1).
For j0=3,ŜT=(0,V*X) (filtering by the directional vector of source 2).
For j0=4,ŜT=(0,0)
Where the symbol T designates the transpose.
One has thus “linearized” the optimal processing, since one has obtained four linear filters, controlled by the decision on the type of situation for each box: both sources are present/source 1 is present/source 2 is present/neither of the sources is present. The obtained estimator is called “Conditional Expectation with 4 Linear Filters”.
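The “Conditional Expectation with 4 Linear Filters” estimator can be sketched as follows for one time/frequency box (illustrative Python; the parameter names are assumptions, the qj are taken as q1=q², q2=q3=q(1−q), q4=(1−q)² for independent, equally probable sources, and the approximate det Qj det Γj of Equation 21 are used):

```python
import numpy as np

def ce_four_filters(X, U, V, sigma2, sig1sq, sig2sq, q):
    """Decide among the four situations for one box, then filter linearly.

    sigma2 = s^2 (noise power is 2 s^2), sig1sq = s1^2, sig2sq = s2^2,
    q = presence probability of each source (assumed shared).
    Returns (S_hat, j0) with j0 in {1, 2, 3, 4}.
    """
    M = np.column_stack([U, V])
    G = M.conj().T @ M                        # M*M
    b = M.conj().T @ X                        # M*X = (U*X, V*X)^T
    uv2 = abs(np.vdot(U, V)) ** 2
    s_ml = np.linalg.solve(G, b)              # (M*M)^-1 M*X
    quad = [float(np.real(np.vdot(b, s_ml))), # X*M Q1 M*X
            abs(b[0]) ** 2,                   # X*U U*X
            abs(b[1]) ** 2,                   # X*V V*X
            0.0]
    lnq = [np.log(q ** 2 * sigma2 ** 2 / ((1.0 - uv2) * sig1sq * sig2sq)),
           np.log(q * (1.0 - q) * sigma2 / sig1sq),
           np.log(q * (1.0 - q) * sigma2 / sig2sq),
           np.log((1.0 - q) ** 2)]
    F = [lnq[j] + quad[j] / (2.0 * sigma2) for j in range(4)]
    j0 = int(np.argmax(F))
    filters = [s_ml,
               np.array([b[0], 0.0]),         # source 1 only: s = U*X
               np.array([0.0, b[1]]),         # source 2 only: c = V*X
               np.zeros(2, dtype=complex)]    # neither source present
    return filters[j0], j0 + 1
```

The decision reduces to comparing four scalars F(j); the chosen filter is then purely linear, as described above.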
In this way, we have simplified the optimal estimator, by breaking it down into two steps:
It is satisfactory to see that the estimator is independent of τ, which is the expected behavior, since τ is not a physical parameter, but an artifice making it possible to model the “absence of signal” situation by a very tight Gaussian.
This estimator requires the calculation of three quadratic forms and a test. The difficulty remains calculating the unknown parameters qj,j=1 . . . 4, σ12, σ22 (the power of the noise 2σ2 is presumed to be known).
In the specific case where the 2 sources are independent, and have the same power and the same presence rate, the parameters can be estimated by calculating the empirical moments of order 2 and 4.
If we call 2σ′2 the shared value of the variance of the Gaussian representing each source, and q the probability shared by the 2 sources, everything happens as if we were in a mono-source situation, with a single source of variance 2σM′2=2σ′2 and presence probability qM=2q.
σM′2 and qM are then given by the following equations:
In the general case (independent sources), there are 4 parameters of the model: q1, q2, σ12, σ22. One skilled in the art knows how to generalize the method of moments above to higher orders to obtain the estimates of these parameters.
An alternative of the estimating processing consists of simplifying the decision step previously described, as follows:
The conditional expectation considers all four possible situations:
s≠0,c≠0;s≠0,c=0;s=0,c≠0;s=0,c=0
It is normally necessary to address a decision problem with four hypotheses.
To simplify, we propose to test s≠0 against s=0 independently of c on the one hand, and to test c≠0 against c=0 independently of s on the other hand. We therefore perform two tests with two hypotheses instead of one test with four hypotheses.
These tests will be done from preprocessed measurements U*X, V*X.
The test of the 2 hypotheses for s is as follows:
where the (.) indicate the scalar x scalar product
and where u=U*W,v=V*W:(u,v) is therefore Gaussian, centered and with covariance:
and where c is an unknown parameter.
It is a problem invariant under the group of translations along the vector (U*V 1)T and a linear hypothesis problem (see “Testing Statistical Hypotheses”, 3rd edition, E. L. Lehmann, J. P. Romano, Springer, 2005). It may be processed by first performing a projection on the orthogonal of (U*V 1)T to eliminate c, then testing the presence of s using the chi2 test. The projection is written:
The test to be performed therefore pertains to the measurement |U*X−(U*V)V*X|2:
|U*X−(U*V)V*X|2 > or < λ Equation 26 Simplified hypothesis test on s
Which amounts to the same thing as performing the test:
where ŝMV is the estimate within the meaning of the maximum likelihood of s.
In the same way for c, M*X is projected on the orthogonal of (1 V*U)T in order to eliminate the terms in s, and the new measurement to be considered is obtained:
One obtains the following test:
|(V*U)U*X−V*X|2 > or < μ Equation 27 Simplified hypothesis test on c
which can be written
where ĉMV is the estimate of c within the meaning of the maximum likelihood of c.
In the dual-source case, the proposed estimator consists of performing the following operations:
As illustrated in
Then, depending on the situation, spatial filtering is applied in steps 331 to 334 under the following conditions, which makes it possible to obtain the so-called Conditional Expectation with Independent Decisions (CEID) estimator:
If |ŝMV|>threshold and |ĉMV|>threshold: ŜCEID=(M*M)−1M*X (step 331)
If |ŝMV|>threshold and |ĉMV|<threshold: ŝCEID=U*X, ĉCEID=0 (step 332)
If |ŝMV|<threshold and |ĉMV|>threshold: ŝCEID=0, ĉCEID=V*X (step 333)
If |ŝMV|<threshold and |ĉMV|<threshold: ŜCEID=0 (step 334)
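The CEID decision rule can be sketched as follows for one box (illustrative Python; a single shared threshold for both tests is an assumption of this sketch):

```python
import numpy as np

def ceid(X, U, V, threshold):
    """Conditional Expectation with Independent Decisions (a sketch).

    Each source is decided independently by thresholding the modulus of
    its maximum likelihood estimate, then one of four linear filters
    is applied (steps 331 to 334).
    """
    M = np.column_stack([U, V])
    s_ml, c_ml = np.linalg.solve(M.conj().T @ M, M.conj().T @ X)
    s_on, c_on = abs(s_ml) > threshold, abs(c_ml) > threshold
    if s_on and c_on:
        return np.array([s_ml, c_ml])           # both sources present
    if s_on:
        return np.array([np.vdot(U, X), 0.0])   # source 1 only: s = U*X
    if c_on:
        return np.array([0.0, np.vdot(V, X)])   # source 2 only: c = V*X
    return np.zeros(2, dtype=complex)           # neither source present
```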
In the mono-source case, the proposed estimator consists of performing the following operations, as illustrated in
From the calculation of the mono-source maximum likelihood estimator ŝMV=U*X done in step 210, thresholding of the modulus of ŝMV is done in step 350, then, depending on the situation, spatial filtering is applied in step 361 or 362 under the following conditions:
If |ŝMV|>threshold: ŝCEID=U*X (step 361)
If |ŝMV|<threshold: ŝCEID=0 (step 362)
Advantageously, the threshold for steps 320, 350 is determined as follows:
Pfa refers to the probability of deciding s≠0 when in fact s=0, and Pd is the probability of deciding s≠0 when indeed s≠0.
For example and non-limitingly, it is proposed to use the Neyman-Pearson criterion, which consists of setting the Pfa (for example, several percent), and in return, maximizing Pd, which makes it possible to obtain a threshold on λ′ and μ′. For example and non-limitingly, it is also possible to adjust the value of λ′ (μ′, resp.) such that 1−Pd=Pfa around a set SNR.
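As a concrete example of setting the threshold from the Pfa: under the noise model above, when s=0 the projected measurement U*X−(U*V)V*X is a centered circular complex Gaussian of power 2σ2(1−|U*V|2), so its squared modulus is exponentially distributed and the Neyman-Pearson threshold is λ=−2σ2(1−|U*V|2)·ln(Pfa). The sketch below (illustrative Python, derived from the stated noise model rather than taken from the source) computes this threshold:

```python
import numpy as np

def np_threshold(pfa, sigma2, uv):
    """Neyman-Pearson threshold for the test |U*X - (U*V)V*X|^2 > lambda.

    Under s = 0 the test statistic is exponential with mean
    2 sigma2 (1 - |U*V|^2), so P(statistic > lambda) = Pfa gives the
    value below.  sigma2 = s^2, uv = U*V (inner product of the
    directional vectors).
    """
    return -2.0 * sigma2 * (1.0 - abs(uv) ** 2) * np.log(pfa)
```

A quick Monte Carlo draw of the H0 statistic confirms that the empirical false-alarm rate matches the requested Pfa.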
A close alternative of the Maximum Likelihood called Thresholded Maximum Likelihood (TML) consists of performing the following operations:
Calculating the maximum likelihood estimator ŜMV=(M*M)−1M*X
Thresholding the modulus of the components of ŜMV
Depending on the situation, applying spatial filtering
If |ŝMV|<threshold: ŝTML=0, otherwise ŝTML=ŝMV
If |ĉMV|<threshold: ĉTML=0, otherwise ĉTML=ĉMV
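The TML alternative can be sketched as follows (illustrative Python; a single shared threshold for both components is an assumption of this sketch):

```python
import numpy as np

def thresholded_ml(X, U, V, threshold):
    """Thresholded Maximum Likelihood (TML): compute the maximum likelihood
    estimate (M*M)^-1 M*X, then zero every component whose modulus falls
    below the threshold."""
    M = np.column_stack([U, V])
    S = np.linalg.solve(M.conj().T @ M, M.conj().T @ X)
    S[np.abs(S) < threshold] = 0.0
    return S
```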
Another alternative consists of using one of the two previous estimators to obtain an initialization of the unknown parameters qj,j=1 . . . 4, σ12, σ22, then applying the Conditional Expectation estimator or the Conditional Expectation with 4 Linear Filters estimator.
Number | Date | Country | Kind |
---|---|---|---|
1402981 | Dec 2014 | FR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2015/081213 | 12/23/2015 | WO | 00 |