The present invention relates to a method and device for separating a mixture of source signals to regain the source signals.
Recently, several papers concerning signal separation of dynamically mixed source signals have been put forward [1-3, 8, 17, 19, 20]. In principle, it is possible to separate the sources by exploiting only second order statistics, cf. [8]. Several papers solve the blind signal separation problem with dynamic/convolutive mixtures in the frequency domain, cf. [3, 20]. Basically, dynamic source separation in the frequency domain aims to solve a number of static/instantaneous source separation problems, one for each frequency bin in question. In order to obtain the dynamic channel system (mixing matrix), the estimates corresponding to different frequency bins have to be interpolated. This procedure seems to be a nontrivial task, due to scaling and permutation indeterminacies [16]. The approach in the present paper is a “time-domain approach”, see [8], which models the elements of the channel system with Finite Impulse Response (FIR) filters, thus avoiding these indeterminacies.
A quasi-maximum likelihood method for signal separation by second order statistics is presented by Pham and Garat in [11]. An algorithm is presented for static mixtures, i.e. mixing matrices without delays. Each separated signal si, i=1, . . . , M, is filtered with a Linear Time Invariant (LTI) filter hi. The criterion is based on the estimated cross-correlations of these filtered signals. The optimal choice of the filter hi, according to [11], is the filter with frequency response inversely proportional to the spectral density of the corresponding source signal. The filters hi, i=1, . . . , M, are thus whitening filters. However, the spectral densities of the source signals are usually unknown, and perhaps time varying. One approach is to estimate these filters, as done in the present paper and in the prediction error method presented in [1]. Moreover, several aspects of the algorithm presented in [8] remained open.
Characterizing features of the present invention, i.e. the method for separating a mixture of source signals to regain the source signals, are: bringing each measured signal to a separation structure including an adaptive filter, the adaptive filter comprising filter coefficients; using a generalised criterion function for obtaining the filter coefficients, the generalised criterion function comprising cross correlation functions and a weighting matrix, the cross correlation functions being dependent on the filter coefficients; estimating the filter coefficients, the resulting estimates of the filter coefficients corresponding to a minimum value of the generalised criterion function; and updating the adaptive filter with the filter coefficients.
Other characterizing features of the present invention, i.e. the device for separating a mixture of source signals to regain the source signals, the input to the device being based on measured signals, are that the device comprises: signaling links for bringing each measured signal to a separation structure including an adaptive filter, the adaptive filter comprising filter coefficients; a generalised criterion function means for obtaining the filter coefficients, the generalised criterion function means comprising cross correlation functions and a weighting matrix, the cross correlation functions being dependent on the filter coefficients; means for estimating the filter coefficients, the resulting estimates of the filter coefficients corresponding to a minimum value output of the generalised criterion function; and updating means for updating the adaptive filter with the filter coefficients.
Specific fields of application of the present invention include mobile telephone technology, data communication, hearing aids and medical measuring equipment, such as ECG. Also included is echo cancelling which can primarily occur in the telecommunications field.
In the present invention a signal separation algorithm is derived and presented. A main result of the analysis is an optimal weighting matrix. The weighting matrix is used to devise a practical algorithm for signal separation of dynamically mixed sources. The derived algorithm significantly improves the parameter estimates in cases where the sources have similar color. In addition, the statistical analysis can be used to reveal the attainable (asymptotic) parameter variance given a number of known parameters.
The basis for the source signals, in the present paper, are M mutually uncorrelated white sequences. These white sequences are termed source generating signals and denoted by ξk(n) where k=1, . . . , M. The source generating signals are convolved with linear time-invariant filters Gk(q)/Fk(q) and the outputs are,
referred to as the source signals, where q and T are the time shift operator and matrix transpose, respectively. The following assumptions are introduced:
A1. The generating signal ξ(n) is a realization of a stationary, white zero-mean Gaussian process:
$$\xi(n) \in \mathcal{N}(0, \Sigma), \qquad \Sigma = \operatorname{diag}(\sigma_1^2, \ldots, \sigma_M^2)$$
A2. The elements of K(q) are filters which are asymptotically stable and have minimum phase.
Condition A1 is somewhat restrictive because of the Gaussian assumption. However, it appears to be very difficult to evaluate some of the involved statistical expectations unless the Gaussian assumption is invoked.
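The signal model under A1 and A2 can be illustrated with a minimal sketch. It assumes pure autoregressive source filters (Gk(q)=1), which is only one instance of the filters Gk(q)/Fk(q) above. The function names `ar_filter` and `generate_sources` and the coefficient values are hypothetical and chosen for illustration only.

```python
import random

def ar_filter(a, xi):
    """Filter xi through 1/F(q), with F(q) = 1 + a[0] q^-1 + a[1] q^-2 + ..."""
    x = []
    for n, e in enumerate(xi):
        acc = e
        for k, ak in enumerate(a, start=1):
            if n - k >= 0:
                acc -= ak * x[n - k]
        x.append(acc)
    return x

def generate_sources(ar_coeffs, sigmas, n_samples, seed=0):
    """Draw one white Gaussian generating signal per source and colour it."""
    rng = random.Random(seed)
    sources = []
    for a, sigma in zip(ar_coeffs, sigmas):
        xi = [rng.gauss(0.0, sigma) for _ in range(n_samples)]
        sources.append(ar_filter(a, xi))
    return sources

# Two hypothetical, asymptotically stable AR(2) sources (illustrative values).
x1, x2 = generate_sources([[-0.9, 0.5], [0.4, 0.3]], [1.0, 1.0], 1000)
```

Each generating signal is drawn independently, so the two sources are mutually uncorrelated by construction, in line with the assumption on the white sequences.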
The source signals x(n) are unmeasurable and are the inputs to a system, referred to as the channel system. The channel system produces M outputs, collected in a vector y(n),
$$y(n) = [y_1(n) \; \ldots \; y_M(n)]^T = B(q)\,x(n),$$
which are measurable and referred to as the observables. In the present paper the channel system, B(q), is given by
where Bij(q), i,j=1, . . . M, are FIR filters. The objective is to extract the source signals from the observables. The extraction can be accomplished by means of an adaptive separation structure, cf. [8]. The inputs to the separation structure are the observable signals. The outputs from the separation structure, s1(n), . . . ,sM(n), depend on the adaptive filters, Dij(q,θ), i,j=1, . . . M, and can be written as
$$s(n,\theta) = [s_1(n,\theta) \; \ldots \; s_M(n,\theta)]^T = D(q,\theta)\,y(n),$$
where θ is a parameter vector containing the filter coefficients of the adaptive filters. That is, the parameter vector is $\theta = [d_{11}^T \ldots d_{MM}^T]^T$, where dij, i,j=1, . . . M, are vectors containing the coefficients of Dij(q, θ), i,j=1, . . . M, respectively. Note that, unlike B(q), the separation matrix D(q, θ) does not contain a fixed diagonal, cf. [6, 13].
Most of the expressions and calculations in the present paper will be derived for the two-input two-output (TITO) case, M=2. The main reason for using the TITO case is that it has been shown to be parameter identifiable under a set of conditions, cf. [8]. However, the analysis in the current paper is applicable to the more general multiple-input multiple-output (MIMO) case, assuming that problem to be parameter identifiable as well.
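The relation between the channel system and the separation structure in the TITO case can be sketched as follows. The sketch assumes, for illustration only, a channel with unit diagonal and FIR cross-couplings, and picks the separation filters as the adjugate of that channel. Both choices, and the helper names `fir` and `apply_matrix`, are assumptions and not taken from the text. Under these assumptions each separated signal equals the corresponding source filtered by the channel determinant 1 − B12(q)B21(q).

```python
def fir(b, x):
    """Apply the FIR filter b[0] + b[1] q^-1 + ... to x (zero initial state)."""
    return [sum(bk * x[n - k] for k, bk in enumerate(b) if n - k >= 0)
            for n in range(len(x))]

def apply_matrix(filters, u1, u2):
    """Apply a 2x2 matrix of FIR filters to the signal pair (u1, u2)."""
    (f11, f12), (f21, f22) = filters
    v1 = [a + b for a, b in zip(fir(f11, u1), fir(f12, u2))]
    v2 = [a + b for a, b in zip(fir(f21, u1), fir(f22, u2))]
    return v1, v2

# Assumed channel B(q) with unit diagonal (illustrative cross-coupling values).
b12, b21 = [0.3, 0.1], [0.1, 0.7]
B = [[[1.0], b12], [b21, [1.0]]]
# Assumed separation filters: the adjugate of B(q), so that D(q)B(q) = det{B(q)} I.
D = [[[1.0], [-c for c in b12]], [[-c for c in b21], [1.0]]]

x1 = [1.0, 0.0, 0.0, 0.0, 0.0]      # toy source signals
x2 = [0.0, 2.0, -1.0, 0.5, 0.0]
y1, y2 = apply_matrix(B, x1, x2)    # observables y(n) = B(q) x(n)
s1, s2 = apply_matrix(D, y1, y2)    # separated signals s(n) = D(q) y(n)

# Each source filtered by det{B(q)} = 1 - B12(q) B21(q).
expected1 = [a - b for a, b in zip(x1, fir(b12, fir(b21, x1)))]
expected2 = [a - b for a, b in zip(x2, fir(b12, fir(b21, x2)))]
```

Comparing `s1` with `expected1` (and `s2` with `expected2`) shows that the cross-talk is removed while a common determinant filter remains on each output.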
Assuming that N samples of y1(n) and y2(n) are available, the criterion function proposed in [8], reads as
where
To emphasize the dependence on θ equation (2.6) can be rewritten as
where the notation d12(i) denotes the i:th coefficient of the filter D12(q).
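The estimated cross-covariances of the separated signals, on which the criterion is built, can be sketched as below. The lag convention s2(n−k), the 1/N normalization, and the absence of any scaling constant in the quadratic form are assumptions made for this illustration. The function names are hypothetical. With the optional weighting left out, the quadratic form reduces to a plain sum of squared cross-covariances over the lags −U, . . . , U.

```python
def cross_covariance(s1, s2, k):
    """Estimate R_{s1 s2}(k) as (1/N) * sum_n s1(n) * s2(n - k) (assumed convention)."""
    n_samples = len(s1)
    total = 0.0
    for n in range(n_samples):
        if 0 <= n - k < n_samples:
            total += s1[n] * s2[n - k]
    return total / n_samples

def covariance_vector(s1, s2, max_lag):
    """Stack the estimates for lags -U, ..., U into one vector of length 2U+1."""
    return [cross_covariance(s1, s2, k) for k in range(-max_lag, max_lag + 1)]

def criterion(r, weights=None):
    """Quadratic form r^T W r; weights=None means W is the identity matrix."""
    m = len(r)
    if weights is None:
        return sum(v * v for v in r)
    return sum(r[i] * weights[i][j] * r[j] for i in range(m) for j in range(m))

r = covariance_vector([1.0, 2.0, 3.0], [1.0, 0.0, 0.0], max_lag=1)
value = criterion(r)
```

In an actual estimator, the separated signals would be recomputed from the observables for every candidate parameter vector before evaluating the criterion.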
For notational simplicity, introduce the following vector
$$\hat{r}_N(\theta) = [\hat{R}_{s_1 s_2}(-U, \theta) \; \ldots \; \hat{R}_{s_1 s_2}(U, \theta)]^T$$
where the subscript N indicates that the estimated cross-covariances are based on N samples. Furthermore, introduce a positive definite weighting matrix W(θ) which possibly depends on θ too. Thus, the criterion, in equation (2.6), can be generalized as
which will be investigated. Note that the studied estimator is closely related to the type of non-linear regressions studied in [15]. The estimate of the parameters of interest is obtained as
Although the signal separation based on the criterion (2.6) has been demonstrated to perform well in practice, see for example [12], there are a couple of open problems in the contribution [8];
The purpose of the present contribution is to:
In addition to A1 and A2 the following assumptions are considered to hold throughout the description:
This section deals with the statistical analysis, beginning with consistency. The asymptotic properties (as N→∞) of the estimate θ^N are established in the following. However, first some preliminary observations are made. In [8] it was shown that
The convergence in (3.1) is uniform in a set DM, where θ is a member
Furthermore, since the applied separation structure is of finite impulse response (FIR) type, the gradient is bounded
for N larger than some N0<∞. In equation (3.4), C is some constant, C<∞, and nθ denotes the dimension of θ. The above discussion, together with the identifiability analysis [8] then shows the following result:
Having established (strong) consistency, the asymptotic distribution of θ^N is considered. Since θ^N minimizes the criterion VN(θ), V′N(θ^N)=0, where V′N denotes the gradient of VN. By the mean value theorem,
$$0 = V_N'(\hat{\theta}_N) = V_N'(\theta_0) + V_N''(\theta_\xi)(\hat{\theta}_N - \theta_0),$$
where θξ lies on the line segment between θ^N and θ0. Note that, since θ^N is consistent, θ^N→θ0 and, consequently, θξ→θ0 as N→∞.
Next, investigate the gradient evaluated at θ0 (for notational simplicity, let W(θ)=W)
where
Note, evaluation of G is straightforward, see for example [8]. The introduced approximation does not affect the asymptotics, since the approximation error goes to zero at a faster rate than does the estimate r^N(θ0) of rN(θ0). Furthermore, since rN(θ0)=0, the asymptotic distribution of V′N(θ0) is identical to the asymptotic distribution of GTWr^N(θ0), where
Applying for example Lemma B.3 in [15], and using the fact that both s1(n;θ0) and s2(n;θ0) are stationary ARMA processes, one can show that (√N)GTWr^N(θ0) converges in distribution to a Gaussian random vector, i.e.
$$\sqrt{N}\, G^T W \hat{r}_N(\theta_0) \in \mathrm{AsN}(0,\; G^T W M W G),$$
where
This means that the gradient vector is asymptotically normally distributed, with zero mean and covariance matrix GTWMWG.
Before presenting the main result of the current paper, the convergence of the Hessian matrix V″N must be investigated. Assuming that the limit exists, define
To establish the convergence of V″N(θξ), the following (standard) inequality is applied
where ∥ ∥F denotes the Frobenius norm. Due to the FIR separation structure, the second order derivatives are continuous. Moreover, since θξ converges w.p.1 to θ0, the first term converges to zero w.p.1. The second term also converges to zero w.p.1. This can be shown using a methodology similar to that used to show (3.3). Note also that since the third order derivatives are bounded, the convergence is uniform in θ.
It is now straightforward to see that the limiting Hessian V̄″ can be written as
Thus, for large N,
$$\sqrt{N}(\hat{\theta}_N - \theta_0) \cong \sqrt{N}\,(G^T W G)^{-1} G^T W \hat{r}_N(\theta_0),$$
assuming the inverses exist (generically guaranteed by the identifiability conditions in A3). Here all approximation errors that go to zero faster than O(1/√(N)) have been neglected. Finally, the following result can be stated.
Consider the signal separation method based on second order statistics, where θ^N is obtained from (2.10). Then the normalized estimation error, √(N)(θ^N−θ0), has a limiting zero-mean Gaussian distribution
$$\sqrt{N}(\hat{\theta}_N - \theta_0) \in \mathrm{AsN}(0, P),$$
where
$$P = (G^T W G)^{-1} G^T W M W G\, (G^T W G)^{-1}.$$
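The form of P can be traced back to the linear approximation of the estimation error. The short derivation below is a sketch of that step. It assumes a symmetric weighting matrix W and uses the shorthand A, which is introduced here for illustration only and does not appear in the text.

```latex
% A is shorthand introduced for this sketch; W is assumed symmetric.
\begin{align*}
A &= (G^T W G)^{-1} G^T W, \\
\sqrt{N}\,(\hat{\theta}_N - \theta_0) &\cong A\,\sqrt{N}\,\hat{r}_N(\theta_0), \\
\sqrt{N}\,\hat{r}_N(\theta_0) \in \mathrm{AsN}(0, M)
  \;&\Longrightarrow\;
  P = A M A^T = (G^T W G)^{-1} G^T W M W G\,(G^T W G)^{-1}.
\end{align*}
```

The last equality uses $A^T = W G\,(G^T W G)^{-1}$, which holds because both W and $G^T W G$ are symmetric.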
Obviously, the matrix M plays a central role, and it is of interest to find a more explicit expression. For simplicity we consider only the case when the generating signals are zero-mean, Gaussian and white (as stated in Assumption A1). It seems to be difficult to find explicit expressions for the non-Gaussian case. Note also that this is really the place where the normality assumption in A1 is crucial. For example, the asymptotic normality of √(N)r^N(θ0) holds under weaker assumptions.
Theorem 6.4.1 in [5] indicates precisely how the components of M can be computed. These elements are actually rather easy to compute, as the following will demonstrate. Let
Furthermore, introduce the following Z-transforms
Then it follows that
Thus, the βτ's are the covariances of an ARMA process with power spectrum
Computation of ARMA covariances is a standard topic, and simple and efficient algorithms for doing this exist, see for example [15, Complement C7.7]. Given βτ for τ=0, . . . , 2U, the weighting matrix can, hence, be constructed as
Thus, in the present problem formulation the separated signals are distorted by the determinant of the channel system. The channel system determinant equals det{B(z)}, and one may define the reconstructed signals as
as long as det{D(q, θ0)} is minimum phase.
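The role of the minimum phase condition can be illustrated with a small sketch. It assumes that the reconstruction amounts to filtering each separated signal through the inverse of a determinant polynomial, which is an interpretation of the passage above and not a formula quoted from it. The determinant coefficients are hypothetical values corresponding to 1 − B12(q)B21(q) with B12(q)=0.3+0.1q−1 and B21(q)=0.1+0.7q−1. The names `fir` and `inverse_fir` are illustrative.

```python
def fir(b, x):
    """Apply the FIR filter b[0] + b[1] q^-1 + ... to x (zero initial state)."""
    return [sum(bk * x[n - k] for k, bk in enumerate(b) if n - k >= 0)
            for n in range(len(x))]

def inverse_fir(c, s):
    """Filter s through 1/C(q), C(q) = c[0] + c[1] q^-1 + ..., with c[0] != 0.

    The recursion is only stable when C(q) is minimum phase.
    """
    x = []
    for n in range(len(s)):
        acc = s[n]
        for k in range(1, len(c)):
            if n - k >= 0:
                acc -= c[k] * x[n - k]
        x.append(acc / c[0])
    return x

# Hypothetical determinant polynomial; its zeros lie inside the unit circle.
det_coeffs = [0.97, -0.22, -0.07]
source = [1.0, -0.5, 0.25, 2.0, 0.0, -1.0]
distorted = fir(det_coeffs, source)          # separated signal, still distorted
reconstructed = inverse_fir(det_coeffs, distorted)
```

If the determinant had a zero outside the unit circle, the same recursion would amplify rounding errors and diverge, which is why the minimum phase condition matters.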
To complete our discussion, it is also pointed out how the matrix G can be computed. The elements of G are all obtained in the following manner. Using Equation (2.8), it follows that
which are straightforward to compute.
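For the ARMA covariances βτ mentioned above, a brute-force sketch is given below. It is not the algorithm of [15, Complement C7.7]. Instead it sums products of a truncated impulse response, which is a simple substitute that works for asymptotically stable filters. The function names and the truncation length are assumptions.

```python
def impulse_response(b, a, length):
    """First `length` impulse-response samples of B(q)/A(q).

    b = [b0, b1, ...] and a = [1, a1, a2, ...] (monic denominator).
    """
    h = []
    for n in range(length):
        acc = b[n] if n < len(b) else 0.0
        for k in range(1, len(a)):
            if n - k >= 0:
                acc -= a[k] * h[n - k]
        h.append(acc)
    return h

def arma_covariances(b, a, sigma2, max_lag, length=2000):
    """Approximate beta_tau = sigma2 * sum_k h(k) h(k + tau) for tau = 0..max_lag."""
    h = impulse_response(b, a, length)
    return [sigma2 * sum(h[k] * h[k + tau] for k in range(length - tau))
            for tau in range(max_lag + 1)]

# AR(1) check case with a pole at 0.5: beta_0 = 4/3 and beta_1 = 2/3.
betas = arma_covariances([1.0], [1.0, -0.5], sigma2=1.0, max_lag=2)
```

The truncation length must be large compared with the decay time of the impulse response; poles close to the unit circle therefore call for a longer truncation or an exact method.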
Next, consider the problem of choosing W. Our findings are collected in the following result. The asymptotic accuracy of θ^N, obtained as the minimizing argument of the criterion (2.10), is optimized if
$$W = W_0 = M^{-1}$$
For this choice of weighting,
$$P(W_0) = (G^T M^{-1} G)^{-1}$$
The accuracy is optimized in the sense that P(W)−P(W0) is positive semi-definite for all positive definite weighting matrices W.
The proof follows from well-known matrix optimization results, see for example [9, Appendix 2].
The result could have been derived directly from the ABC theory in [15, Complement C4.4]. However, the result above itself is a useful result motivating the presented analysis.
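The effect of the weighting can be checked numerically. The sketch below evaluates the asymptotic covariance expression for an identity weighting and for the weighting W0=M−1 in a single-parameter case, so that the comparison reduces to scalars. The matrices G and M are hypothetical values chosen only so that M is positive definite, and the small matrix helpers are written out to keep the example self-contained.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def inverse(a):
    """Gauss-Jordan inverse with partial pivoting (small dense matrices only)."""
    n = len(a)
    aug = [[float(v) for v in row] + [float(i == j) for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def asymptotic_covariance(G, W, M):
    """P = (G^T W G)^-1 G^T W M W G (G^T W G)^-1."""
    GtW = matmul(transpose(G), W)
    outer = inverse(matmul(GtW, G))
    middle = matmul(matmul(matmul(GtW, M), W), G)
    return matmul(matmul(outer, middle), outer)

G = [[1.0], [2.0], [1.0]]                                  # hypothetical 3x1 matrix G
M = [[2.0, 0.5, 0.0], [0.5, 1.0, 0.3], [0.0, 0.3, 1.5]]    # hypothetical covariance M
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

P_identity = asymptotic_covariance(G, identity, M)
P_optimal = asymptotic_covariance(G, inverse(M), M)
```

For these values `P_optimal[0][0]` is smaller than `P_identity[0][0]`, and it coincides with the inverse of GTM−1G up to rounding.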
Before considering the actual implementation of the optimal weighting strategy, let us make a note on the selection of U. This parameter is a user-defined quantity, and it would be interesting to gain some insight into how it should be chosen. Note, Assumption A3 states a lower bound for U with respect to identifiability. The following result may be useful.
Assume that the optimal weighting W0 is applied in the criterion (2.10). Let PU(W0) denote the asymptotic covariance for this case. Then
$$P_U(W_0) \geq P_{U+1}(W_0)$$
The proof follows immediately from the calculations in [15, Complement C4.4]. Note that, when the optimal weighting W0 is applied, the matrices {PU(W0)} form a non-increasing sequence. However, in practice one must be aware that too large a value of U may in fact deteriorate the performance. This phenomenon may be explained by the fact that a large value of U means that a larger value of N is required in order for the asymptotic results to be valid.
In the present section, a comparison is made between signal separation based on an algorithm within the scope of the present invention and the algorithm in [8]. The purpose of the comparison is to show the contribution of the present invention. Put differently, does the weighting lead to a significant decrease of the parameter variance? In all of our simulations, U=6. Furthermore, the term relative frequency is used in several figures. Here relative frequency corresponds to frel=2F/FS where FS is the
Here the channel system is defined by B12(q)=0.3+0.1q−1 and B21(q)=0.1+0.7q−1. The source signal x1(n) is an AR(2) process with poles at radius 0.8 and angles π/4. The second source signal is, also, an AR(2) process. However, the poles are moved by adjusting the angles in the interval [0,π/2], while keeping the radius constant at 0.8. At each angle 200 realizations have been generated and processed by the channel system and separation structure. That is to say, for each angle the resulting parameter estimates have been averaged. Finally, each realization consists of 4000 samples.
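The simulation setup can be written down compactly. The sketch below converts a pole radius and angle into AR(2) denominator coefficients, assuming that the poles form a complex-conjugate pair at plus and minus the stated angle. The function name `ar2_from_poles` and the number of angle steps in the sweep are hypothetical; the channel coefficients, the radius 0.8, the angle π/4, the 200 realizations and the 4000 samples are taken from the description above.

```python
import math

def ar2_from_poles(radius, angle):
    """Coefficients [a1, a2] of F(q) = 1 + a1 q^-1 + a2 q^-2.

    Assumes a conjugate pole pair at radius * exp(+/- j * angle).
    """
    return [-2.0 * radius * math.cos(angle), radius ** 2]

# Channel cross-couplings as stated in the text.
b12 = [0.3, 0.1]    # B12(q) = 0.3 + 0.1 q^-1
b21 = [0.1, 0.7]    # B21(q) = 0.1 + 0.7 q^-1

# First source: poles at radius 0.8 and angle pi/4.
a_source1 = ar2_from_poles(0.8, math.pi / 4)

# Second source: same radius, angle swept over [0, pi/2].
n_steps = 10        # hypothetical grid resolution
sweep = [ar2_from_poles(0.8, k * (math.pi / 2) / n_steps)
         for k in range(n_steps + 1)]

n_realizations = 200
n_samples = 4000
```

Each entry of `sweep` would be combined with `a_source1` to generate one pair of source signals per realization.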
Apparently, it gets more difficult to estimate the channel parameters when the source colors are similar.
Further details of the present invention are that the method, according to the device for separating signals, is repeatedly performed on the measured signal or on fractions thereof. Also, the method may be repeatedly performed according to a predetermined updating frequency. It should be noted that the predetermined updating frequency need not be constant. Further, the number of filter coefficients is predetermined. Finally, the number of filter coefficients is arranged to be predetermined in the above embodiment.
This application is a continuation of International Application No. PCT/SE00/00451 filed on Mar. 7, 2000.
Number | Date | Country
---|---|---
20020051500 A1 | May 2002 | US
 | Number | Date | Country
---|---|---|---
Parent | PCT/SE00/00451 | Mar 2000 | US
Child | 09947755 | | US