Signal analysis device for modeling spatial characteristics of source signals, signal analysis method, and recording medium

Information

  • Patent Grant
  • 11423924
  • Patent Number
    11,423,924
  • Date Filed
    Friday, February 1, 2019
    5 years ago
  • Date Issued
    Tuesday, August 23, 2022
    2 years ago
Abstract
A signal analysis device includes a memory and processing circuitry coupled to the memory and configured to obtain, for a spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, a simultaneous decorrelation matrix P as a matrix in which all PHRjP are diagonal matrices, or/and Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to the J source signals for observation signal vectors based on observation signals acquired at I (I is an integral number equal to or larger than 2) different positions.
Description
CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is based on PCT filing PCT/JP2019/003671, filed Feb. 1, 2019, which claims priority to JP 2018-030701, filed Feb. 23, 2018, JP 2018-133592, filed Jul. 13, 2018, and JP 2018-164051, filed Aug. 31, 2018, the entire contents of each are incorporated herein by reference.


TECHNICAL FIELD

The present invention relates to a signal analysis device, a signal analysis method, and a recording medium.


BACKGROUND ART

In the related art, there has been developed a method of estimating, in a situation in which two source signals are present in a mixed manner, a source signal component as a component derived from each of the source signals from a plurality of observation signals acquired at different positions.


CITATION LIST

[Non Patent Citation]




  • Non Patent Document 1: N. Q. K. Duong, E. Vincent and R. Gribonval, “Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial covariance Model”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 7, pp. 1830-1840, September 2010.



SUMMARY OF INVENTION
Technical Problem

The following describes a signal analysis device in the related art with reference to FIG. 24. FIG. 24 is a diagram illustrating a configuration of the signal analysis device in the related art. As illustrated in FIG. 24, a signal analysis device 1P in the related art includes a multichannel Wiener filter unit 10a. In a case of denoting a vector, a matrix, or a scalar A as “{circumflex over ( )}A”, “{circumflex over ( )}A” is assumed to be equivalent to a “symbol with “{circumflex over ( )}” added right above “A””.


The multichannel Wiener filter unit 10a calculates an estimation value of a source signal component based on an observation signal yi(k) (i is an index of the observation signal, and k is an index of a sample point), a variance parameter vj(k) (j is an index of a source signal) as a parameter for modeling a variance for each sample point of the source signal, and a spatial covariance matrix Rj as a parameter for modeling a spatial characteristic of the source signal.


Specifically, the multichannel Wiener filter unit 10a creates an observation signal vector y(k) as an I-dimensional column vector including all observation signals as elements by the following expression (1). In the following description, a superscript of T represents transposition.

y(k)=(y1(k) . . . yI(k))T  (1)


The observation signal vector y(k) is modeled by the following expression (2) as a sum of components (source signal components) x1(k) and x2(k) derived from two source signals.

y(k)=x1(k)+x2(k)  (2)


The multichannel Wiener filter unit 10a further calculates an estimation value {circumflex over ( )}xj(k) of a source signal component xj(k) by the following expression (3) by applying a multichannel Wiener filter Wj(k) as a matrix of I×I to the observation signal vector y(k).












x
^

j



(
k
)


=





v
j



(
k
)






R
j



(




v
1



(
k
)




R
1


+



v
2



(
k
)




R
2



)



-
1







W
j



(
k
)






y


(
k
)







(
3
)







However, the signal analysis device 1P in the related art is requested to perform an inverse matrix operation that requests a large calculation amount for each sample point. Specifically, the signal analysis device 1P in the related art is requested to calculate an inverse matrix (v1(k)R1+v2(k)R2)−1 for each k. Thus, with the signal analysis device 1P in the related art, a large calculation amount is requested for calculating the estimation value of the source signal component as a component derived from the source signal.


The present invention is made in view of such a situation, and provides a signal analysis device, a signal analysis method, and a signal analysis program that can calculate an estimation value of a source signal component with a small calculation amount.


Solution to Problem

To solve the above problem and attain the object, a signal analysis device according to the present invention includes: a memory; and processing circuitry coupled to the memory and configured to: obtain, for a spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, a simultaneous decorrelation matrix P as a matrix in which all PHRjP are diagonal matrices, or/and Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to the J source signals for observation signal vectors based on observation signals acquired at I (I is an integral number equal to or larger than 2) different positions.


A signal analysis device according to the present invention includes: a memory; and processing circuitry coupled to the memory and configured to: obtain a simultaneous decorrelation matrix P as a matrix including I (I is an integral number equal to or larger than 2) different generalized eigenvectors p1, . . . , pI as column vectors satisfying piHR2pm=0 (i and m are any integral numbers equal to or larger than 1 and equal to or smaller than I different from each other) for spatial covariance matrices R1 and R2 for modeling spatial characteristics of two source signals that are present in a mixed manner, or Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to two source signals for observation signal vectors based on observation signals acquired at I positions.


A signal analysis device according to the present invention includes: a memory; and processing circuitry coupled to the memory and configured to: obtain a simultaneously decorrelated observation signal vector in which components corresponding to J source signals are decorrelated by assuming, for a spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, a matrix P in which all PHRjP become diagonal matrices as a simultaneous decorrelation matrix for decorrelating components corresponding to the J source signals for observation signal vectors based on observation signals acquired at I (I is an integral number equal to or larger than 2) different positions, and multiplying Hermitian transposition of the simultaneous decorrelation matrix by the observation signal vector.


A signal analysis device according to the present invention includes: a memory; and processing circuitry coupled to the memory and configured to: obtain a spatial covariance matrix as a parameter for modeling a spatial characteristic of a source signal based on a covariance matrix subjected to simultaneous decorrelation obtained from J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, or/and obtain a variance parameter as a parameter for modeling a variance for each sample point of the source signal based on the covariance matrix subjected to simultaneous decorrelation.


Advantageous Effects of Invention

According to the present invention, an estimation value of a source signal component can be calculated with a small calculation amount.





BRIEF DESCRIPTION OF DRAWINGS


FIG. 1 is a diagram illustrating an example of a configuration of a signal analysis device according to a first embodiment.



FIG. 2 is a flowchart illustrating an example of a processing procedure of signal analysis processing according to the first embodiment.



FIG. 3 is a diagram illustrating an example of a configuration of a signal analysis device according to a second embodiment.



FIG. 4 is a flowchart illustrating an example of a processing procedure of signal analysis processing according to the second embodiment.



FIG. 5 is a diagram illustrating an example of a configuration of a signal analysis device according to a third embodiment and a seventh embodiment.



FIG. 6 is a flowchart illustrating an example of a processing procedure of signal analysis processing according to the third embodiment and the seventh embodiment.



FIG. 7 is a diagram illustrating an experiment condition for a sound source separation experiment using a conventional method and a method according to the third embodiment.



FIG. 8 is a diagram illustrating a result of the sound source separation experiment (real time factor) using the conventional method and the method according to the third embodiment.



FIG. 9 is a diagram illustrating a result of the sound source separation experiment (sound source separation performance) using the conventional method and the method according to the third embodiment.



FIG. 10 is a diagram illustrating an example of a configuration of a signal analysis device according to a fourth embodiment.



FIG. 11 is a flowchart illustrating an example of a processing procedure of signal analysis processing according to the fourth embodiment.



FIG. 12 is a diagram illustrating an example of a configuration of a signal analysis device according to a fifth embodiment.



FIG. 13 is a flowchart illustrating an example of a processing procedure of signal analysis processing according to the fifth embodiment.



FIG. 14 is a diagram illustrating an experiment condition for a sound source separation experiment using the conventional method and a method according to the fifth embodiment.



FIG. 15 is a diagram illustrating a result of the sound source separation experiment (real time factor) using the conventional method and the method according to the fifth embodiment.



FIG. 16 is a diagram illustrating a result of the sound source separation experiment (sound source separation performance) using the conventional method and the method according to the fifth embodiment.



FIG. 17 is a diagram illustrating an example of a configuration of a signal analysis device according to a sixth embodiment.



FIG. 18 is a flowchart illustrating an example of a processing procedure of signal analysis processing according to the sixth embodiment.



FIG. 19 is a diagram illustrating an example of a configuration of a frequency domain signal separation unit.



FIG. 20 is a flowchart illustrating an example of a processing procedure of frequency domain signal separation processing.



FIG. 21 is a diagram illustrating an example of a configuration of a signal analysis device according to an eighth embodiment.



FIG. 22 is a flowchart illustrating an example of a processing procedure of signal analysis processing according to the eighth embodiment.



FIG. 23 is a diagram illustrating an example of a computer in which a computer program is executed to implement a signal analysis device.



FIG. 24 is a diagram illustrating a configuration of a conventional signal analysis device.





EMBODIMENTS FOR CARRYING OUT THE INVENTION

The following describes embodiments of a signal analysis device, a signal analysis method, and a signal analysis program according to the present application in detail based on the drawings. The present invention is not limited to the embodiments described below. In the following description, in a case of denoting a vector, a matrix, or a scalar A as “{circumflex over ( )}A”, “{circumflex over ( )}A” is assumed to be equivalent to a “symbol with “{circumflex over ( )}” added right above “A””. In a case of denoting a vector, a matrix, or a scalar A as “{tilde over ( )}A”, “{tilde over ( )}A” is assumed to be equivalent to a “symbol with” added right above “A””. In the following description, signal separation is assumed to be a subordinate concept of signal analysis.


First Embodiment

First, the following describes a configuration, a processing procedure, and an effect of a signal analysis device according to a first embodiment. In the first embodiment, in a situation in which two source signals are present in a mixed manner, I (I is an integral number equal to or larger than 2) observation signals yi(k) (i=1, . . . ; I is an index of the observation signal, and k is an index of a sample point) acquired by a sensor such as a microphone at different positions, a variance parameter vj(k) (j=1, 2 is an index of the source signal) as a parameter for modeling a variance (power) for each sample point of the source signal, and a spatial covariance matrix Rj as a parameter for modeling a spatial characteristic of the source signal are assumed to be input to the signal analysis device.


Examples of a “signal” in the first embodiment include sound, brainwaves, and radio signals. The “observation signal” in the first embodiment indicates a signal acquired by a plurality of sensors (sensor array). Examples of the sensor array include a microphone array, an electroencephalograph, a magnetoencephalograph, and an antenna array. The “source signal” in the first embodiment indicates a signal as a constituent element of the observation signal. The observation signal is modeled as a sum of the two source signals.


In addition to a problem setting (for example, sound source separation) for estimating, in a situation in which two object signals (for example, voice) are present in a mixed manner, components corresponding thereto, a problem setting for “signal separation” in the first embodiment includes a problem setting (also referred to as noise removal) for estimating, in a situation in which an object signal (for example, voice) and noise (for example, music coming through a television, or a rowdy crowd) are present in a mixed manner, components corresponding thereto (or only a component corresponding to the object signal). The source signal in the former case is the object signal (for example, voice). The source signals in the latter case are the object signal (for example, voice) and noise (for example, music coming through a television, or a rowdy crowd). The “rowdy crowd” is mixture of many kinds of noise, which is collectively regarded as one source signal in this case.


[Configuration and Processing of Signal Analysis Device]


First, with reference to FIG. 1 and FIG. 2, the following describes a configuration of the signal analysis device and a processing procedure of signal analysis processing according to the first embodiment. FIG. 1 is a diagram illustrating an example of the configuration of the signal analysis device according to the first embodiment. FIG. 2 is a flowchart illustrating an example of the processing procedure of the signal analysis processing according to the first embodiment. A signal analysis device 1 according to the first embodiment is, for example, implemented by a predetermined computer program being read by a computer and the like including a read-only memory (ROM), a random-access memory (RAM), a central processing unit (CPU), and the like, and the CPU executing the predetermined computer program.


As illustrated in FIG. 1, the signal analysis device 1 includes an observation signal vector creation unit 10, a signal parameter analysis unit 20, a simultaneous decorrelation unit 30, a single channel Wiener filter unit 40, and a simultaneous decorrelation inverse conversion unit 50.


First, the following describes an outline of each unit of the signal analysis device 1. The observation signal vector creation unit 10 acquires an input observation signal yi(k) (Step S10), and creates an observation signal vector as an I-dimensional column vector constituted of all observation signals represented by expression (4) for each sample point (Step S11).

y(k)=(y1(k) . . . yI(k))T  (4)


The signal parameter analysis unit 20 acquires an input spatial covariance matrix Rj, and solves a generalized eigenvalue problem to calculate a generalized eigenvalue matrix Λ and a generalized eigenvector matrix P (Step S12).


The simultaneous decorrelation unit 30 calculates an observation signal vector y′(k) subjected to simultaneous decorrelation based on the observation signal vector y(k) from the observation signal vector creation unit 10 and the generalized eigenvector matrix P from the signal parameter analysis unit 20 (Step S13). Specifically, as represented by expression (5), the simultaneous decorrelation unit 30 multiplies the observation signal vector y(k) by Hermitian transposition PH of the generalized eigenvector matrix P to calculate the observation signal vector y′(k) subjected to simultaneous decorrelation.

y′(k)=PHy(k)  (5)


In this case, “′ (superscript of prime)” represents a variable subjected to simultaneous decorrelation. Simultaneous decorrelation indicates linear transformation of simultaneously decorrelating components (source signal components) corresponding to two source signals included in the observation signal vector y(k).


The single channel Wiener filter unit 40 acquires the input variance parameter vj(k), and calculates an estimation value {circumflex over ( )}x′j (k) of a source signal component x′j(k) subjected to simultaneous decorrelation based on the generalized eigenvalue matrix Λ from the signal parameter analysis unit 20 and the observation signal vector y′(k) subjected to simultaneous decorrelation from the simultaneous decorrelation unit 30 (Step S14). Specifically, the single channel Wiener filter unit 40 calculates the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation by the expressions (6) and (7).












x
^

1




(
k
)


=


(








v
1



(
k
)




[
Λ
]


11






v
1



(
k
)




[
Λ
]


11

+


v
2



(
k
)






[


y




(
k
)


]


1



















v
1



(
k
)




[
Λ
]


II






v
1



(
k
)




[
Λ
]


II

+


v
2



(
k
)






[


y




(
k
)


]


I


)

T





(
6
)









x
^

2




(
k
)


=


(






v
2



(
k
)







v
1



(
k
)




[
Λ
]


11

+


v
2



(
k
)






[


y




(
k
)


]


1

















v
2



(
k
)







v
1



(
k
)




[
Λ
]


II

+


v
2



(
k
)






[


y




(
k
)


]


I


)

T





(
7
)







Regarding a vector α, [α]l represents the l-th element of α, and regarding a matrix A, [A]lm represents an (l, m) element of A. Additionally, {circumflex over ( )} (hat) above a variable represents an estimation value.


The simultaneous decorrelation inverse conversion unit 50 calculates and outputs an estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) based on the generalized eigenvector matrix P from the signal parameter analysis unit 20 and the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 40 (Step S15). Specifically, as represented by expression (8), the simultaneous decorrelation inverse conversion unit 50 multiplies the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation by an inverse matrix (PH)−1 of the Hermitian transposition PH of the generalized eigenvector matrix P multiplied by the simultaneous decorrelation unit 30 to calculate and output the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k).

{circumflex over (x)}j(k)=(PH)−1{circumflex over (x)}′j(k)  (8)


Background of Processing in First Embodiment

Next, the following describes a background of processing in the first embodiment. The following describes a case in which the observation signal is represented by a complex number (description of a case in which the observation signal is represented by a real number is almost the same, so that the description thereof is omitted). In the first embodiment, the observation signal vector y(k) is modeled by the following expression (9) as a sum of components (source signal components) x1(k) and x2(k) corresponding to the two source signals.

y(k)=x1(k)+x2(k)  (9)


It is assumed that a mean of the source signal component xj(k) is 0, a covariance matrix thereof is Sj(k), and the two source signal components x1(k) and x2(k) are not correlated with each other. Both of the two source signal components x1(k) and x2(k) may be the source signal components corresponding to the object signal (for example, voice). Alternatively, one of the source signal components x1(k) and x2(k) may be the source signal component corresponding to the object signal (for example, voice), and the other one thereof may be the source signal component corresponding to noise (for example, noise from a television or a rowdy crowd).


As a method of estimating the source signal component xj(k) from the observation signal vector y(k), there is known a method based on a multichannel Wiener filter. In this method, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated by expression (10).












x
^

j



(
k
)


=





S
j



(
k
)





(



S
1



(
k
)


+


S
2



(
k
)



)


-
1






multichannel





wiener





filter





y


(
k
)







(
10
)







That is, in this method, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated by multiplying the observation signal vector y(k) by a multichannel Wiener filter Sj (k)(S1(k)+S2 (k))−1 as a matrix of I×I.


[Conventional Signal Analysis Method]


In a conventional signal analysis method (for example, the signal analysis method described in Non Patent Document 1), the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated based on expression (10). In this case, the covariance matrix Sj(k) of the source signal component xj(k) is modeled by the following expression (11). In expression (11), E represents an expected value operation.

Sj(k)=E(xj(k)xj(k)H)=vj(k)Rj  (11)


In this case, vj(k) is a variance parameter as a parameter for modeling variance for each sample point (for each k) of the source signal, and Rj is a spatial covariance matrix as a parameter for modeling a spatial characteristic of the source signal. In the conventional signal analysis method, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated by using expression (12) that is obtained by substituting expression (11) for expression (10).












x
^

j



(
k
)


=





v
j



(
k
)






R
j



(




v
1



(
k
)




R
1


+



v
2



(
k
)




R
2



)



-
1






multichannel





wiener





filter





y


(
k
)







(
12
)







There has been the problem that the conventional signal analysis method described above is difficult to be applied to processing of a large-scale data set (the number of sample points k is extensive), real-time processing (short-time processing is requested), an appliance having a low calculation capacity, and the like because a huge amount of calculation is requested.


Actually, in the related art, at the time of calculating the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k), inverse matrix calculation (specifically, calculation of (v1(k)R1+v2 (k)R2)−1)) is requested to be performed for each sample point (that is, multiple times). The inverse matrix calculation is an arithmetic operation that requests a large calculation amount of O(I3) for the I-th order matrix. The number of sample points ranges typically from several hundreds to tens of millions. Thus, in the conventional signal analysis method, a huge amount of calculation is requested at the time of calculating the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k).


[Signal Analysis Method According to First Embodiment]


On the other hand, the first embodiment provides a method of calculating the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) with a small calculation amount based on simultaneous decorrelation of the two source signal components x1(k) and x2(k). The following describes a concept of the first embodiment.


When a covariance matrix E(α(k)α(k)H) of a signal α(k) is a diagonal matrix with respect to each sample point k, α(k) is decorrelated. To put it briefly, this means that “elements having different vectors α(k) are not correlated with each other”. Assuming that covariance matrices vj(k)Rj (j is 1 or 2) of the source signal components xj(k) in expression (11) are both diagonal matrices, the inverse matrix operation (v1(k)R1+v2(k)R2)−1 in expression (12) can be implemented by a simple reciprocal operation for each diagonal element, so that calculation can be performed with a small calculation amount. However, normally, elements having different source signal components xj(k) are strongly correlated with each other, so that the covariance matrix vj(k)Rj thereof is not a diagonal matrix. Thus, in the first embodiment, it is considered that the two source signal components x1(k) and x2(k) are simultaneously decorrelated.


As represented by expression (13) and expression (14), simultaneous decorrelation of the two source signal components x1(k) and x2(k) indicates that linear transformation is performed by multiplying x1(k) and x2(k) by a regular matrix V to cause source signal components x′1(k) and x′2(k) after the linear transformation to be simultaneously decorrelated (that is, covariance matrices S′1(k) and S′2(k) of x′1(k) and x′2(k) become diagonal matrices at the same time). Due to this simultaneous decorrelation, the covariance matrices vj(k)Rj of the source signal components xj(k) are both converted into diagonal matrices. Thus, due to the simultaneous decorrelation, the inverse matrix operation (v1(k)R1+v2(k)R2)−1 in expression (12) is changed into an inverse matrix operation for the diagonal matrix, which can be implemented by a simple reciprocal operation for each diagonal element, so that calculation can be performed with a small calculation amount.

x′1(k)=Vx1(k)  (13)
x′2(k)=Vx2(k)  (14)


When the covariance matrices S′1(k) and S′2(k) of x′1(k) and x′2(k) are calculated, the following expressions (15) to (19) are obtained.











S
1




(
k
)


=

E


(



x
1




(
k
)






x
1




(
k
)


H


)






(
15
)






=


VE


(



x
1



(
k
)






x
1



(
k
)



I

I



)




V
H






(
16
)






=



VS
1



(
k
)




V
H






(
17
)






=



v
1



(
k
)




VR
1



V
H






(
18
)








S
2




(
k
)


=



v
2



(
k
)




VR
2



V
H






(
19
)







The condition for simultaneous decorrelation is that these matrices become diagonal matrices at the same time, that is, the following expression (20) and expression (21) are established at the same time. In this case, Λ1 and Λ2 are certain diagonal matrices.

VR1VH1(diagonal matrix)  (20)
VR2VH2(diagonal matrix)  (21)


That is, an object of simultaneous decorrelation is to obtain the regular matrix V such that, when spatial covariance matrices R1 and R2 are multiplied by the regular matrix V from the left and multiplied by Hermitian transposition VH of V from the right, these matrices become diagonal matrices at the same time (simultaneous diagonalization).


The matrix V can be obtained as follows based on the generalized eigenvalue problem. The generalized eigenvalue problem for the spatial covariance matrices R1 and R2 represented by expression (22) is solved.

R1p=λR2p  (22)


As a result, I generalized eigenvalues λ1, . . . , λI, and I generalized eigenvectors p1, . . . , pI respectively corresponding to λ1, . . . , λI are obtained. In this case, the generalized eigenvectors p1, . . . , pI can be taken to satisfy the following expression (23).











p
l
H



R
2



p
m


=

{




1
,





l
=
m







0
,





l

m










(
23
)







When the matrix P is determined by the following expression (24) using these generalized eigenvectors p1, . . . , pI, the following expression (25) and expression (26) are satisfied (for example, refer to reference document 1 “G. Strang, “Linear Algebra and Its Applications”, Harcourt Brace Jovanovich, 1988”).

P=(p1 . . . pI)  (24)
PHR1P=Λ  (25)
PHR2P=I  (26)


In this case, I is a unit matrix, and Λ is a diagonal matrix of expression (27) constituted of generalized eigenvalues λ1, . . . , λI.









Λ
=

(




λ
1



0





0




0



λ
2






0


















0


0






λ
I




)





(
27
)







Thus, when the matrix V is determined by the following expression (28), expression (20) and expression (21) as the conditions for simultaneous decorrelation are satisfied (Λ1=Λ, and Λ2=I).

V=PH  (28)


Hereinafter, the diagonal matrix Λ constituted of the generalized eigenvalues in expression (27) is referred to as a generalized eigenvalue matrix, and the matrix P constituted of the generalized eigenvectors in expression (24) is referred to as a generalized eigenvector matrix in some cases. As described above, P is also a matrix that implements simultaneous decorrelation, so that P is also referred to as a simultaneous decorrelation matrix from that point of view.


The signal parameter analysis unit 20 calculates the generalized eigenvalues λ1, . . . , λI for the spatial covariance matrices R1 and R2 and the generalized eigenvectors p1, . . . , pI respectively corresponding to λ1, . . . , λI that satisfy expression (23), and creates the generalized eigenvalue matrix Λ as a diagonal matrix including the generalized eigenvalues λ1, . . . , λI as diagonal elements and the generalized eigenvector matrix P as a regular matrix including the generalized eigenvectors p1, . . . , pI as column vectors by expression (27) and expression (24).


As described above, the two source signal components x1(k) and x2(k) are simultaneously decorrelated by linear transformation of multiplying the source signal components x1(k) and x2(k) by the Hermitian transposition PH of the matrix P defined by expression (24). When both sides of expression (9) representing a model of the observation signal vector are subjected to this linear transformation, the following expression (29) is obtained.












P
H



y


(
k
)






=


y




(
k
)





=




P
H




x
1



(
k
)






=


x
1




(
k
)





+



P
H




x
2



(
k
)






=


x
2




(
k
)










(
29
)







In this case, based on expression (18), expression (19), expression (25), expression (26), and expression (28), the covariance matrices S′1(k) and S′2(k) of x′1(k) and x′2(k) are both diagonal matrices as represented by the following expression (30) and expression (31).

S′1(k)=v1(k)Λ  (30)
S′2(k)=v2(k)I  (31)


In this case, Λ is a diagonal matrix defined by expression (27). In a case of applying the multichannel Wiener filter to expression (29), the estimation value {circumflex over ( )}x′j(k) of x′j(k) is given by the following expression (32).












x
^

j




(
k
)


=





S
j




(
k
)





(



S
1




(
k
)


+


S
2




(
k
)



)


-
1






multichannel





wiener





filter





y


(
k
)







(
32
)







The covariance matrices S′1(k) and S′2(k) in expression (32) are diagonal matrices as represented by expression (30) and expression (31). Thus, expression (32) becomes very simple as represented by the following expression (36) and expression (40). In this way, due to simultaneous decorrelation, the inverse matrix operation (v1(k)R1+v2(k)R2)−1 in expression (12) is changed into the inverse matrix operation (v1(k)Λ+v2(k)I)−1 for the diagonal matrix as represented by expression (33) and expression (37), which can be implemented by a simple reciprocal operation for each diagonal element, so that calculation can be performed with a small calculation amount.

















x
^

1




(
k
)


=



v
1



(
k
)





Λ


(




v
1



(
k
)



Λ

+



v
2



(
k
)



I


)



-
1





y




(
k
)








(
33
)






=



v
1



(
k
)




(




λ
1



0





0




0



λ
2






0


















0


0






λ
I




)

×


(







v
1



(
k
)




λ
1


+


v
2



(
k
)





0





0




0






v
1



(
k
)




λ
2


+


v
2



(
k
)








0


















0


0









v
1



(
k
)




λ
I


+


v
2



(
k
)






)


-
1





y




(
k
)







(
34
)






=


(







v
1



(
k
)




λ
1






v
1



(
k
)




λ
1


+


v
2



(
k
)






0





0




0






v
1



(
k
)




λ
2






v
1



(
k
)




λ
2


+


v
2



(
k
)









0


















0


0









v
1



(
k
)




λ
I






v
1



(
k
)




λ
I


+


v
2



(
k
)







)




y




(
k
)







(
35
)






=


(









v
1



(
k
)




λ
1






v
1



(
k
)




λ
1


+


v
2



(
k
)






[


y




(
k
)


]


1












v
1



(
k
)




λ
1






v
1



(
k
)




λ
1


+


v
2



(
k
)






[


y




(
k
)


]


I




)

T





(
36
)














x
^

2




(
k
)


=



v
2



(
k
)





(




v
1



(
k
)



Λ

+



v
2



(
k
)



I


)


-
1





y




(
k
)








(
37
)






=



v
2



(
k
)





(







v
1



(
k
)




λ
1


+


v
2



(
k
)





0





0




0






v
1



(
k
)




λ
2


+


v
2



(
k
)








0


















0


0









v
1



(
k
)




λ
I


+


v
2



(
k
)






)


-
1





y




(
k
)







(
38
)






=


(






v
2



(
k
)






v
1



(
k
)




λ
1


+


v
2



(
k
)






0





0




0





v
2



(
k
)






v
1



(
k
)




λ
2


+


v
2



(
k
)









0


















0


0








v
2



(
k
)






v
1



(
k
)




λ
I


+


v
2



(
k
)







)




y




(
k
)







(
39
)






=


(








v
2



(
k
)






v
1



(
k
)




λ
1


+


v
2



(
k
)






[


y




(
k
)


]


1











v
2



(
k
)






v
1



(
k
)




λ
1


+


v
2



(
k
)






[


y




(
k
)


]


I




)

T





(
40
)







The single channel Wiener filter unit 40 calculates the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation by expression (36) and expression (40).


Processing of expression (36) and expression (40) can also be interpreted as processing based on the single channel Wiener filter. To explain that fact, first, the following explains what the single channel Wiener filter is. While the multichannel Wiener filter performs estimation by using a plurality of (I) observation signals, the single channel Wiener filter performs estimation by using only one observation signal. With the single channel Wiener filter, one observation signal (scalar) y(k) is modeled as a sum of the two source signal components (scalars) x1(k) and x2(k) by the following expression (41).

y(k)=x1(k)+x2(k)  (41)


In this case, in accordance with the fact that there is only one observation signal, y(k), x1(k), and x2(k) are all scalars. It is assumed that a mean of the source signal component xj(k) is 0, variance thereof is σ2j(k), and the two source signal components x1(k) and x2(k) are not correlated with each other. In the method based on the single channel Wiener filter, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated by the following expression (42).












x
^

j



(
k
)


=





σ
j
2



(
k
)





σ
1
2



(
k
)


+


σ
2
2



(
k
)







single





channel





wiener





filter





y


(
k
)







(
42
)







That is, in this method, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated by multiplying the observation signal y(k) by a single channel Wiener filter σ2j(k)/(σ21(k)+σ22(k)).


In a case of seeing expression (29) for each element, the following expression (43) is obtained based on the i-th element.

[y′(k)]i=[x′1(k)]i+[x′2(k)]i  (43)


It is considered that [x′j(k)]i is estimated from [y′(k)]i using the single channel Wiener filter based on the above expression. Typically, when a covariance matrix of a probability vector α is A, a variance of the l-th element [α]l of α is an (l, l) element [A]ll of A, so that a variance of [x′j(k)]i is [S′j(k)]ii. Thus, based on expression (30) and expression (31), the variance of [x′1(k)]i is v1(k)λi, and the variance of [x′2(k)]i is v2(k). Accordingly, an estimation value [{circumflex over ( )}x′j(k)]i of [x′j(k)]i based on the single channel Wiener filter is given by the following expression (44) and expression (45).











[



x
^

1




(
k
)


]

i

=







v
1



(
k
)




λ
i






v
1



(
k
)




λ
i


+


v
2



(
k
)







single





channel





wiener





filter





[



y
^





(
k
)


]


i





(
44
)








[



x
^

2




(
k
)


]

i

=






v
2



(
k
)






v
1



(
k
)




λ
i


+


v
2



(
k
)







single





channel





wiener





filter





[



y
^





(
k
)


]


i





(
45
)







These values are nothing but the i-th elements of expression (36) and expression (40), respectively.


The estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k)=PHxj(k) subjected to simultaneous decorrelation that is obtained as described above is expected to take a value close to a true value as represented by the following expression (46) if estimation has been successively performed.

{circumflex over (x)}′j(k)≈PHxj(k)  (46)


The following expression (47) is obtained by multiplying both sides of the above expression by the matrix (PH)−1.

xj(k)≈(PH)−1{circumflex over (x)}′j(k)  (47)


Thus, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated by the following expression (48).

{circumflex over (x)}j(k)=(PH)−1{circumflex over (x)}′j(k)  (48)


In this way, according to the first embodiment, an inverse matrix operation is not requested to be performed for each sample point in calculating the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k), so that signal separation can be implemented with a small calculation amount. It is noted that the above expression includes the inverse matrix (PH)−1, but the inverse matrix (PH)−1 does not depend on k, so that calculation is not requested to be performed for each sample point but may be performed only once.


The estimation value of the source signal component xj(k) in the conventional signal analysis method is represented by {circumflex over ( )}xjconv (k), and the estimation value of the source signal component xj(k) in the first embodiment is represented by {circumflex over ( )}xjprop(k). It is demonstrated that {circumflex over ( )}xjconv(k) is identical to {circumflex over ( )}xjprop(k) in theory. In this case, {circumflex over ( )}xjconv(k) is given by expression (49).

{circumflex over (x)}jconv(k)=vj(k)Rj(v1(k)R1+v2(k)R2)−1y(k)  (49)


Based on expression (5), expression (33) to expression (40), and expression (48), {circumflex over ( )}xjprop(k) is equal to the following expression (50) and expression (51).

[Expression 41]
{circumflex over (x)}1prop(k)=(PH)−1(v1(k)Λ)(v1(k)Λ+v2(k)I)−1PHy(k)  (50)
{circumflex over (x)}2prop(k)=(PH)−1(v2(k)(v1(k)Λ+v2(k)I)−1)PHy(k)  (51)


The following describes a case of j=1. A case of j=2 is the same as the case of j=1, so that the description thereof will not be made. Based on expression (25) and expression (26), the following expression (52) and expression (53) are derived.

R1=(PH)−1ΛP−1  (52)
R2=(PH)−1P−1  (53)


By substituting these expressions for expression (49), the following expression (54) to expression (57) are obtained.












x
^

1
conv



(
k
)


=



v
1



(
k
)





(

P
H

)


-
1



Λ








P

-
1




(




v
1



(
k
)





(

P
H

)


-
1



Λ






P

-
1



+



v
2



(
k
)





(

P
H

)


-
1




P

-
1




)



-
1




y


(
k
)







(
54
)











=



v
1



(
k
)





(

P
H

)


-
1



Λ








P

-
1




(



(

P
H

)


-
1




(




v
1



(
k
)



Λ

+



v
2



(
k
)



I


)



P

-
1



)



-
1




y


(
k
)








(
55
)











=



v
1



(
k
)





(

P
H

)


-
1



Λ






P

-
1





P


(




v
1



(
k
)



Λ

+



v
2



(
k
)



I


)



-
1




P
H



y


(
k
)








(
56
)











=



x
^

1
prop



(
k
)







(
57
)







According to the above description, calculation results of the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) are identical to each other in theory between the conventional signal analysis method and the first embodiment. However, as a matter of course, a calculation error is caused by a computer in some cases.


In this way, in the signal analysis device 1 according to the first embodiment, the signal parameter analysis unit 20 obtains the simultaneous decorrelation matrix P as a matrix including the I (I is an integral number equal to or larger than 2) generalized eigenvectors p1, . . . , pI different from each other as column vectors satisfying piHR2pm=0 (i and m are any integral numbers equal to or larger than 1 and equal to or smaller than I different from each other) for the spatial covariance matrices R1 and R2 that model spatial characteristics of the two respective source signals that are present in a mixed manner, or the Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to the two source signals for observation signal vectors based on observation signals acquired at I positions. The signal analysis device 1 performs simultaneous decorrelation using this parameter for decorrelation, so that it is sufficient that multiplication may be performed for each element with a small calculation amount in filtering, and the calculation amount of the estimation value of the source signal component can be reduced.


By multiplying the observation signal vectors based on the input I observation signals by the Hermitian transposition PH of the simultaneous decorrelation matrix P obtained by the signal parameter analysis unit 20, the simultaneous decorrelation unit 30 obtains a simultaneously decorrelated observation signal vector the components of which corresponding to the two source signals are decorrelated.


The single channel Wiener filter unit 40 then applies the single channel Wiener filter based on a diagonal component of a matrix PHRjP (j is an integral number equal to or larger than 1 and equal to or smaller than 2) based on the simultaneous decorrelation matrix P to each element of the simultaneously decorrelated observation signal vector obtained by the simultaneous decorrelation unit 30, and obtains a vector based on a simultaneously decorrelated source signal component estimation value.


Additionally, by multiplying the vector based on the simultaneously decorrelated source signal component estimation value obtained by the single channel Wiener filter unit 40 by the inverse matrix (PH)−1 of the Hermitian transposition PH of the simultaneous decorrelation matrix P, the simultaneous decorrelation inverse conversion unit 50 obtains the estimation value of the source signal component.


Effect of First Embodiment

According to the first embodiment having such a configuration, as described above, signal separation can be implemented with a smaller calculation amount without deteriorating signal separation performance as compared with the conventional signal analysis method. That is, in the first embodiment, the observation signal vector is subjected to filtering after being subjected to simultaneous decorrelation, and lastly subjected to inverse conversion of simultaneous decorrelation to calculate the estimation value of the source signal component. In this case, the matrix P that implements simultaneous decorrelation can be obtained by solving the generalized eigenvalue problem for the spatial covariance matrix.


As a result, in the first embodiment, the estimation value of the source signal component can be calculated with a smaller calculation amount as compared with the related art. The main reason therefor is described below.


First, in the first embodiment, filtering becomes processing of multiplication for each element with a small calculation amount due to simultaneous decorrelation, and processing of an inverse matrix operation for each sample point with a large calculation amount is not requested. Second, in the first embodiment, simultaneous decorrelation is performed by using the matrix P that is obtained by solving the generalized eigenvalue problem for the spatial covariance matrix without using the sample point, so that the generalized eigenvalue problem is not requested to be solved for each sample point but may be solved once. This processing is enabled by modeling the covariance matrix of the source signal component to be scalar multiplication of the spatial covariance matrix without using the sample point. Third, in the first embodiment, an inverse matrix operation for the matrix PH of simultaneous decorrelation is requested to be performed at the time of performing inverse conversion of simultaneous decorrelation, but the inverse matrix operation is not requested to be performed for each sample point but may be performed only once based on the same reason.


Furthermore, in the first embodiment, the processing of performing filtering on the observation signal vector after performing simultaneous decorrelation, and lastly performing inverse conversion of simultaneous decorrelation in the first embodiment is equivalent to the processing of directly performing filtering on the observation signal vector in the related art. Due to this, according to the first embodiment, the estimation value of the source signal component can be calculated with a smaller calculation amount than that in the related art without deteriorating the signal separation performance as compared with the related art.


First Modification of First Embodiment

Next, the following describes a first modification of the first embodiment. In the first embodiment, the generalized eigenvectors p1, . . . , pI are assumed to satisfy the constraint condition of expression (23). The first modification of the first embodiment describes the signal analysis device in a case of assuming that the generalized eigenvectors p1, . . . , pI satisfy the constraint condition other than expression (23). The signal parameter analysis unit of the signal analysis device according to the first modification of the first embodiment solves a generalized eigenvalue problem represented by expression (58) for the spatial covariance matrices R1 and R2.

R1p=λR2p  (58)


As a result, the I generalized eigenvalues λ1, . . . , λI and the I generalized eigenvectors p1, . . . , pI respectively corresponding to λ1, . . . , λI are obtained. In the example of the first embodiment described above, the generalized eigenvectors p1, . . . , pI are taken to satisfy the following expression (59).











p
l
H



R
2



p
m


=

{




1
,




l
=
m






0
,




l

m









(
59
)







On the other hand, in the first modification of the first embodiment, the generalized eigenvectors p1, . . . , pI are taken to satisfy the following expression (60) (each σl is a predetermined positive number).











p
l
H



R
2



p
m


=

{





σ
l

,




l
=
m






0
,




l

m









(
60
)







When the matrix P is determined by the following expression (61) using these generalized eigenvectors p1, . . . , pI, expression (62) and expression (63) are satisfied.

P=(p1 . . . pI)  (61)
PHR1P=Λ1  (62)
PHR2P=Λ2  (63)


In this case, Λ1 and Λ2 are diagonal matrices represented by the following expression (64) and expression (65).










Λ
1

=

(





λ
1



σ
1




0





0




0




λ
2



σ
2







0


















0


0







λ
I



σ
I





)





(
64
)







Λ
2

=

(




σ
1



0





0




0



σ
2






0


















0


0






σ
I




)





(
65
)







Thus, when the matrix V is determined by the following expression (66), expression (20) and expression (21) as the conditions for simultaneous decorrelation are satisfied. In this case, Λ1 and Λ2 in expression (64) and expression (65) correspond to Λ1 and Λ2 in expression (20) and expression (21).

V=PH  (66)


Hereinafter, the diagonal matrix Λ constituted of the generalized eigenvalues in expression (27) is referred to as a generalized eigenvalue matrix, and the matrix P constituted of the generalized eigenvectors in expression (61) is referred to as a generalized eigenvector matrix in some cases. The matrix P obtained as described above can be used as the simultaneous decorrelation matrix. The single channel Wiener filter unit of the signal analysis device according to the first modification of the first embodiment calculates the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation by the following expression (67) and expression (68).












x
^

1




(
k
)


=


(










v
1



(
k
)




[

Λ
1

]


11






v
1



(
k
)




[

Λ
1

]


11

+




v
2



(
k
)




[

Λ
2

]


11





[


y




(
k
)


]


1













v
1



(
k
)




[

Λ
1

]


11






v
1



(
k
)




[

Λ
1

]


11

+




v
2



(
k
)




[

Λ
2

]


11





[


y




(
k
)


]


I




)

T





(
67
)









x
^

2




(
k
)


=


(










v
2



(
k
)




[

Λ
2

]


11






v
1



(
k
)




[

Λ
1

]


11

+




v
2



(
k
)




[

Λ
2

]


11





[


y




(
k
)


]


1













v
2



(
k
)




[

Λ
2

]


11






v
1



(
k
)




[

Λ
1

]


11

+




v
2



(
k
)




[

Λ
2

]


11





[


y




(
k
)


]


I




)

T





(
68
)







Pieces of processing performed by the other units are the same as those in the first embodiment described above.


In this way, in the first modification of the first embodiment, the signal parameter analysis unit 20 obtains the simultaneous decorrelation matrix P as a matrix including the I (I is an integral number equal to or larger than 2) generalized eigenvectors p1, . . . , pI different from each other as column vectors satisfying piHR2pm=0 (i and m are any integral numbers equal to or larger than 1 and equal to or smaller than I different from each other) for the spatial covariance matrices R1 and R2 that model spatial characteristics of the two respective source signals that are present in a mixed manner, or the Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to the two source signals for observation signal vectors based on observation signals acquired at I positions.


Second Modification of First Embodiment

Next, the following describes a second modification of the first embodiment. The second modification of the first embodiment describes another modification of the processing performed by the signal parameter analysis unit. In the first embodiment, the signal parameter analysis unit 20 obtains the matrix P in which PHR1P and PHR2P become diagonal matrices, and the diagonal matrix Λ=PHR1P at this point, based on eigenvalue analysis. On the other hand, in the second modification of the first embodiment, the signal parameter analysis unit 20 obtains the matrix P in which Λ1=PHR1P and Λ2=PHR2P are caused to be closer to certain diagonal matrices Σ1 and Σ2 as much as possible, and Λ1=PHR1P and Λ2=PHR2P at this point, based on optimization.


That is, in the second modification of the first embodiment, the signal parameter analysis unit 20 obtains the matrix P as described below, and Λ1=PHR1P and Λ2=PHR2P at this point (refer to expression (69) and expression (70)).

PHR1P≃Σ1(diagonal matrix)  (69)
PHR2P≃Σ2(diagonal matrix)  (70)


Specifically, by minimizing a cost function of the following expression (71) for the matrix P and the diagonal matrices Σ1 and Σ2, the matrix P can be obtained.
















P
H



R
2


P

-


2




F
2




error





between






P
H



R
1


P





and






Σ
1




+







P
H



R
1


P

-


1




F
2




error





between






P
H



R
2


P





and






Σ
2








(
71
)







In this case, ∥A∥F represents a Frobenius norm of the matrix A. For example, minimization of the cost function of the above expression can be performed in accordance with an algorithm described in reference document 2 “A. Yeredor, “Non-Orthogonal Joint Diagonalization in the Least-Squares Sense with Application in Blind Source Separation”, IEEE Transactions on Signal Processing, vol. 50, no. 7, pp. 1545-1553, July 2002.”. The matrix P obtained as described above can be used as the simultaneous decorrelation matrix.


Based on Λ1=PHR1P and Λ2=PHR2P, similarly to the first modification of the first embodiment, the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation can be calculated by the single channel Wiener filter unit 40. Pieces of processing performed by the other units are the same as those in the first embodiment described above.


In this way, in the second modification of the first embodiment, the signal parameter analysis unit 20 obtains the simultaneous decorrelation matrix P as a matrix that is obtained so that all PHRjP are caused to be close to the diagonal matrix or/and the Hermitian transposition PH thereof, for the spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling respective spatial characteristics of J (J is 2 in this modification) source signals that are present in a mixed manner, as a parameter for decorrelating components corresponding to the J source signals for the observation signal vectors based on the observation signals acquired at I (I is an integral number equal to or larger than 2) positions different from each other.


Third Modification of First Embodiment

Next, the following describes a third modification of the first embodiment. The third modification of the first embodiment describes a case in which J source signals are typically present, not limited to the case in which the two source signals are present, based on the method in the second modification of the first embodiment.


In the third modification of the first embodiment, in a situation in which the J source signals are present in a mixed manner, the I observation signals yi(k) acquired at different positions, the variance parameter vj(k) (j=1, . . . ; J is an index of the source signal) as a parameter for modeling variance (power) for each sample point of the source signal, and the spatial covariance matrix Rj as a parameter for modeling the spatial characteristic of the source signal are assumed to be input to the signal analysis device 1.


In the third modification of the first embodiment, the signal parameter analysis unit 20 obtains the matrix P in which Λ1=PHR1P, . . . , ΛJ=PHRJP are caused to be closer to certain diagonal matrices Σ1, . . . , ΣJ as much as possible, and Λ1=PHR1P, . . . , ΛJ=PHRJP at this point, based on optimization.


That is, in the third modification of the first embodiment, the signal parameter analysis unit 20 obtains the matrix P represented by the following expression (72) to expression (73), and Λ1=PHR1P, . . . , ΛJ=PHRJP at this point.

PHR1P≃Σ1(diagonal matrix)  (72)
. . .
PHRJP≃ΣJ(diagonal matrix)  (73)


Specifically, by minimizing a cost function of the following expression (74) for the matrix P and each diagonal matrix Σj (j is an integral number equal to or larger than 1 and equal to or smaller than J), the matrix P can be obtained.












j
=
1

J









P
H



R
j


P

-


j




F
2




error





between






P
H



R
j


P





and






Σ
j








(
74
)







For example, minimization of the cost function of the above expression can be performed in accordance with the algorithm described in reference document 2. In the third modification of the first embodiment, P that is obtained as described above can be used as the simultaneous decorrelation matrix. Additionally, based on Λ1=PHR1P, . . . , ΛJ=PHRJP, the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation can be calculated.


Next, the following describes a configuration and processing of the signal analysis device 1 in a case in which the number of source signals is J. The signal analysis device 1 includes the observation signal vector creation unit 10, the signal parameter analysis unit 20, the simultaneous decorrelation unit 30, the single channel Wiener filter unit 40, and the simultaneous decorrelation inverse conversion unit 50. The observation signal vector creation unit 10 acquires the input observation signal yi(k), and creates the observation signal vector y(k) as an I-dimensional column vector constituted of all the observation signals for each sample point. The observation signal vector y(k) is represented by the following expression (75).

y(k)=(y1(k) . . . yI(k))T  (75)


The signal parameter analysis unit 20 acquires the input spatial covariance matrix Rj.











P
H



R
1


P

=

Λ
1





(
76
)









(
77
)








P
H



R
J


P

=

Λ
J





(
78
)







The signal parameter analysis unit 20 then calculates the simultaneous decorrelation matrix P in which each matrix Λj (j is an integral number equal to or larger than 1 and equal to or smaller than J) defined by the expressions (76) to (78) described above becomes a diagonal matrix, and Λ1, . . . , ΛJ at this point.


The simultaneous decorrelation unit 30 calculates the observation signal vector y′(k) subjected to simultaneous decorrelation based on the observation signal vector y(k) from the observation signal vector creation unit 10 and the simultaneous decorrelation matrix P from the signal parameter analysis unit 20. Specifically, as represented by expression (79), the simultaneous decorrelation unit 30 multiplies the observation signal vector y(k) by the Hermitian transposition PH of the simultaneous decorrelation matrix P to calculate the observation signal vector y′(k) subjected to simultaneous decorrelation.

y′(k)=PHy(k)  (79)


The simultaneous decorrelation indicates linear transformation for simultaneously decorrelating components (source signal components) corresponding to the J source signals included in the observation signal vector y(k).


The single channel Wiener filter unit 40 acquires the input variance parameter vj(k), and calculates the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation based on the matrix Λj from the signal parameter analysis unit 20 and the observation signal vector y′(k) subjected to simultaneous decorrelation from the simultaneous decorrelation unit 30. Specifically, the single channel Wiener filter unit 40 calculates the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation by the following expression (80).












x
^

j




(
k
)


=


(










v
j



(
k
)




[

Λ
j

]


11





l
=
1

J






v
l



(
k
)




[

Λ
l

]


11





[


y




(
k
)


]


1













v
j



(
k
)




[

Λ
j

]


II





l
=
1

J






v
l



(
k
)




[

Λ
l

]


II





[


y




(
k
)


]


I




)

T





(
80
)







The simultaneous decorrelation inverse conversion unit 50 calculates and outputs the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) based on the simultaneous decorrelation matrix P from the signal parameter analysis unit 20 and the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 40. Specifically, as represented by expression (81), the simultaneous decorrelation inverse conversion unit 50 multiplies the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation by the inverse matrix (PH)−1 of the matrix PH multiplied by the simultaneous decorrelation unit 30 to calculate and output the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k).

{circumflex over (x)}j(k)=(PH)−1{circumflex over (x)}′j(k)  (81)


In this way, in the third modification of the first embodiment, the signal parameter analysis unit 20 obtains the simultaneous decorrelation matrix P as a matrix that is obtained so that all PHRjP are caused to be close to the diagonal matrix or/and the Hermitian transposition PH thereof, for the spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling the respective spatial characteristics of the J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, as a parameter for decorrelating components corresponding to the J source signals for the observation signal vectors based on the observation signals acquired at the I (I is an integral number equal to or larger than 2) positions different from each other.


Effects of First Embodiment and First to Third Modifications of First Embodiment

In this way, according to the first embodiment and the first to the third modifications of the first embodiment, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) can be calculated with a small calculation amount. Specifically, with the configuration based on eigenvalue analysis in the first embodiment, signal separation can be performed with a smaller calculation amount than that in the related art without deteriorating the signal separation performance as compared with the related art.


In the second and the third modifications of the first embodiment, the matrix Λj is obtained to be closer to the diagonal matrix Σj. However, actually, the matrix Λj does not become the diagonal matrix in some cases. In this case, processing of the single channel Wiener filter is performed by using, as λ, a “diagonal component” of the matrix Λj=matrix PHRjP that is close to the diagonal matrix. Thus, in the processing of the single channel Wiener filter according to the second and the third modifications of the first embodiment, only the diagonal component is used without using a nondiagonal component of the matrix Λj, so that the configuration may be such that the nondiagonal component is not specified.


Second Embodiment

Next, the following describes a configuration, a processing procedure, and an effect of the signal analysis device according to a second embodiment.


As described above in the first embodiment, in a case in which the variance parameter vj(k) and the spatial covariance matrix Rj are known, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) can be calculated based on the variance parameter vj(k) and the spatial covariance matrix Rj. On the other hand, in a case in which the variance parameter vj(k) and the spatial covariance matrix Rj are unknown, the variance parameter vj(k) and the spatial covariance matrix Rj are requested to be estimated.


Thus, the second embodiment describes a method of estimating the variance parameter vj(k) and the spatial covariance matrix Rj, and the source signal component xj(k) at the same time using an iterative method in a situation in which the two source signals are present in a mixed manner. In the second embodiment, the I observation signals yi(k) acquired at different positions are assumed to be input to the signal analysis device.


[Configuration and Processing of Signal Analysis Device]


With reference to FIG. 3 and FIG. 4, the following describes a configuration of the signal analysis device and a processing procedure of the signal analysis processing according to the second embodiment. FIG. 3 is a diagram illustrating an example of the configuration of the signal analysis device according to the second embodiment. FIG. 4 is a flowchart illustrating an example of the processing procedure of the signal analysis processing according to the second embodiment.


As illustrated in FIG. 3, a signal analysis device 201 includes an initialization unit (not illustrated), an observation signal vector creation unit 210, a signal parameter analysis unit 220, a simultaneous decorrelation unit 230, a single channel Wiener filter unit 240, a posterior covariance matrix update unit 250, a variance parameter update unit 260, a spatial covariance matrix update unit 270, a convergence determination unit (not illustrated), and a simultaneous decorrelation inverse conversion unit 280.


First, the following describes an outline of each unit of the signal analysis device 201. The initialization unit (not illustrated) sets an initial estimation value {circumflex over ( )}vj(0)(k) of the variance parameter vj(k) and an initial estimation value {circumflex over ( )}Rj(0) of the spatial covariance matrix Rj (Step S20). These values may be set based on a random number, for example.


The observation signal vector creation unit 210 acquires the input observation signal yi(k) (Step S21), and creates the observation signal vector y(k) as an I-dimensional column vector constituted of all the observation signals for each sample point (Step S22).


The signal parameter analysis unit 220 updates the generalized eigenvalue matrix Λ and the generalized eigenvector matrix P by solving the generalized eigenvalue problem based on the estimation value {circumflex over ( )}R′j of the spatial covariance matrix R′j subjected to simultaneous decorrelation from the spatial covariance matrix update unit 270 (as an exception, the initial estimation value {circumflex over ( )}Rj(0) of the input spatial covariance matrix Rj is used at the time of the first processing performed by the signal parameter analysis unit 220) (Step S23).


The simultaneous decorrelation unit 230 updates the observation signal vector y′(k) subjected to simultaneous decorrelation based on the observation signal vector y(k) from the observation signal vector creation unit 210 and the generalized eigenvector matrix P from the signal parameter analysis unit 220 (Step S24).


The single channel Wiener filter unit 240 updates the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation based on the generalized eigenvalue matrix Λ from the signal parameter analysis unit 220, the observation signal vector y′(k) subjected to simultaneous decorrelation from the simultaneous decorrelation unit 230, and the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 260 (as an exception, the initial estimation value {circumflex over ( )}vj(0)(k) of the input variance parameter vj(k) is used at the time of the first processing performed by the single channel Wiener filter unit 240) (Step S25).


The posterior covariance matrix update unit 250 updates a posterior covariance matrix Σ′(k) subjected to simultaneous decorrelation based on the generalized eigenvalue matrix Λ from the signal parameter analysis unit 220, and the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 260 (as an exception, the initial estimation value {circumflex over ( )}vj(0)(k) of the input variance parameter vj(k) is used at the time of the first processing performed by the posterior covariance matrix update unit 250) (Step S26).


The variance parameter update unit 260 updates the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) based on the generalized eigenvalue matrix Λ from the signal parameter analysis unit 220, the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 240, and the posterior covariance matrix Σ′(k) subjected to simultaneous decorrelation from the posterior covariance matrix update unit 250 (Step S27).


The spatial covariance matrix update unit 270 updates the estimation value {circumflex over ( )}R′j of the spatial covariance matrix R′j subjected to simultaneous decorrelation based on the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 240, the posterior covariance matrix E′(k) subjected to simultaneous decorrelation from the posterior covariance matrix update unit 250, and the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 260 (Step S28).


The convergence determination unit (not illustrated) determines whether the update converges (Step S29). If the convergence determination unit determines that the update does not converge (No at Step S29), the process returns to the processing performed by the signal parameter analysis unit 220 at Step S23, and the processing is continued.


On the other hand, if the convergence determination unit determines that the update converges (Yes at Step S29), the following processing is performed by the simultaneous decorrelation inverse conversion unit 280.


Specifically, the simultaneous decorrelation inverse conversion unit 280 calculates and outputs the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) based on the generalized eigenvector matrix P from the signal parameter analysis unit 220 and the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 240 (Step S30).


Next, the following describes details about each unit of the signal analysis device 201. The initialization unit is described above.


The observation signal vector creation unit 210 acquires the input observation signal yi(k), and creates the observation signal vector y(k) as an I-dimensional column vector constituted of all the observation signals by the following expression (82).

y(k)=(y1(k) . . . yI(k))T  (82)


The signal parameter analysis unit 220 receives the initial estimation value {circumflex over ( )}Rj(0) of the spatial covariance matrix Rj from the initialization unit at the time of the first processing performed by the signal parameter analysis unit 220. The signal parameter analysis unit 220 calculates the generalized eigenvalues λ1, . . . , λI, and the generalized eigenvectors p1, . . . , pI respectively corresponding to λ1, . . . , λI satisfying expression (83) by solving the generalized eigenvalue problem for {circumflex over ( )}R1(0) and {circumflex over ( )}R2(0).











p
l
H




R
^

2

(
0
)




p
m


=

{




1
,




l
=
m






0
,




l

m









(
83
)







Due to this, the signal parameter analysis unit 220 updates the generalized eigenvalue matrix Λ and the generalized eigenvector matrix P by the following expression (84) and expression (85).









Λ


(




λ
1



0





0




0



λ
2






0


















0


0






λ
I




)





(
84
)






P


(




p
1







p
l




)





(
85
)







The signal parameter analysis unit 220 calculates the generalized eigenvalues λ1, . . . , λI, and the generalized eigenvectors q1, . . . , qI respectively corresponding to λ1, . . . , λI satisfying expression (86) by receiving the estimation value {circumflex over ( )}R′j of the spatial covariance matrix R′j subjected to simultaneous decorrelation from the spatial covariance matrix update unit 270 and solving the generalized eigenvalue problem for {circumflex over ( )}R′1 and {circumflex over ( )}R′2 at the time of the second and subsequent processing performed by the signal parameter analysis unit 220.











q
l
H




R
^

2




q
m


=

{




1
,




l
=
m






0
,




l

m









(
86
)







The signal parameter analysis unit 220 then updates the generalized eigenvalue matrix Λ and the matrix Q by expression (87) and expression (88).









Λ


(




λ
1



0





0




0



λ
2






0


















0


0






λ
I




)





(
87
)









Q←(q1 . . . qI)  (88)


The signal parameter analysis unit 220 then updates the generalized eigenvector matrix P by the following expression (89).

P←PQ  (89)


As represented by expression (90), the simultaneous decorrelation unit 230 updates the observation signal vector y′(k) subjected to simultaneous decorrelation by multiplying the observation signal vector y(k) by the Hermitian transposition PH of the generalized eigenvector matrix P.

y′(k)←PHy(k)  (90)


The single channel Wiener filter unit 240 updates the estimation value {circumflex over ( )}x′j(k) of the source signal component x′j(k) subjected to simultaneous decorrelation by expression (91) and expression (92).












x
^

1




(
k
)





(











v
^

1



(
k
)




[
Λ
]


11







v
^

1



(
k
)




[
Λ
]


11

+



v
^

2



(
k
)






[


y




(
k
)


]


1














v
^

1



(
k
)




[
Λ
]


II







v
^

1



(
k
)




[
Λ
]


II

+



v
^

2



(
k
)






[


y




(
k
)


]


I




)

T





(
91
)









x
^

2




(
k
)





(









v
^

2



(
k
)








v
^

1



(
k
)




[
Λ
]


11

+



v
^

2



(
k
)






[


y




(
k
)


]


1












v
^

2



(
k
)








v
^

1



(
k
)




[
Λ
]


II

+



v
^

2



(
k
)






[


y




(
k
)


]


I




)

T





(
92
)







As an exception, at the time of the first processing performed by the single channel Wiener filter unit 240, the initial estimation value {circumflex over ( )}vj(0)(k) of the variance parameter vj(k) from the initialization unit is used instead of the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 260 in the above two expressions.


The posterior covariance matrix update unit 250 updates the posterior covariance matrix Σ′(k) subjected to simultaneous decorrelation by the following expression (93).











Σ




(
k
)




(








v
^

1



(
k
)








v
^

2



(
k
)




[
Λ
]


11








v
^

1



(
k
)




[
Λ
]


11

+



v
^

2



(
k
)






0





0




0







v
^

1



(
k
)








v
^

2



(
k
)




[
Λ
]


22








v
^

1



(
k
)




[
Λ
]


22

+



v
^

2



(
k
)









0


















0


0










v
^

1



(
k
)








v
^

2



(
k
)




[
Λ
]


II








v
^

1



(
k
)




[
Λ
]


II

+



v
^

2



(
k
)







)





(
93
)







As an exception, at the time of the first processing performed by the posterior covariance matrix update unit 250, the initial estimation value {circumflex over ( )}vj(0)(k) of the variance parameter vj(k) from the initialization unit is used instead of the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 260 in the above expression.


The variance parameter update unit 260 updates the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) by the following expression (94).












v
^

j



(
k
)




{






1
I






i
=
1

I









[



x
^

1




(
k
)


]

i



2

+


[


Σ




(
k
)


]

ii




[
Λ
]

ii




,




j
=
1








1
I






i
=
1

I



(






[



x
^

2




(
k
)


]

i



2

+


[


Σ




(
k
)


]

ii


)



,




j
=
2









(
94
)







The spatial covariance matrix update unit 270 updates the estimation value {circumflex over ( )}R′j of the spatial covariance matrix R′j subjected to simultaneous decorrelation by the following expression (95). In expression (95), K represents the number of sample points.











R
^

j





1
K






k
=
1

K




1



v
^

j



(
k
)





(





x
^

j




(
k
)







x
^

j




(
k
)


H


+


Σ




(
k
)



)








(
95
)







The simultaneous decorrelation inverse conversion unit 280 calculates and outputs the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) by the following expression (96).

{circumflex over (x)}j(k)=(PH)−1{circumflex over (x)}′j(k)  (96)


[Background of Processing Performed by Signal Analysis Device According to Second Embodiment]


[Conventional Signal Analysis Method]


Next, the following describes a background of processing performed by the signal analysis device 201. First, the following describes the related art. Non Patent Document 1 discloses the method of estimating the variance parameter vj(k) and the spatial covariance matrix Rj, and the source signal component xj(k) at the same time using the iterative method in a situation in which two source signals are present in a mixed manner.


The following describes modeling of the observation signal vector y(k) in the related art. The source signal component xj(k) in expression (9) is assumed to follow a complex gaussian distribution the mean of which is 0 and the covariance matrix thereof is vj(k)Rj as represented by the following expression (97).

p(xj(k))=custom character(xj(k);0,vj(k)Rj)  (97)


In this case, N(z; μ, Σ) represents a complex gaussian distribution (expression (98)) the mean of which is μ and the covariance matrix thereof is Σ.










𝒩


(


z
;
μ

,
Σ

)


=


1

det


(
πΣ
)





exp


(


-


(

z
-
μ

)

H







-
1




(

z
-
μ

)



)







(
98
)







It is assumed that the two source signal components x1(k) and x2(k) are not correlated with each other. Under a condition to which the observation signal vector y(k) is given, a posterior probability density of the source signal component xj(k) is given by the following expression (99) (for example, refer to Non Patent Document 1).

p(xj(k)|y(k))=custom character(xj(k);{circumflex over (x)}j(k),Σj(k))  (99)


In this case, the mean {circumflex over ( )}xj(k) and the covariance matrix Σj(k) are given by the following expression (100) and expression (101).

{circumflex over (x)}j(k)=vj(k)Rj(v1(k)R1+v2(k)R2)−1y(k)  (100)
Σj(k)=(v1(k)R1)(v1(k)R1+v2(k)R2)−1(v2(k)R2)  (101)


The mean {circumflex over ( )}xj(k) in expression (100) is nothing but the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) based on the multichannel Wiener filter in expression (10) (due to this, the same symbol is used for both). Thus, it is found that the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) based on the multichannel Wiener filter in expression (10) can be interpreted as a point for maximizing the posterior probability density of the source signal component xj(k) under a condition to which the observation signal vector y(k) is given. Expression (101) is independent of j, so that it is written as expression (102).

Σ1(k)=Σ2(k)=Σ(k)  (102)


In the related art, through the first processing to the fourth processing described below, the variance parameter vj(k) and the spatial covariance matrix Rj, and the source signal component xj(k) are estimated at the same time.


In the first processing, the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) and the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj are initialized. These values may be initialized based on a random number, for example.


In the second processing, the source signal component xj(k) is estimated based on the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) and the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj. While one estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated in the first embodiment, the posterior probability density of the source signal component xj(k) (refer to expression (99)) is updated herein. Specifically, the mean {circumflex over ( )}xj(k) of expression (99) and the covariance matrix (k) (which is equal to Σ1(k) and Σ2(k) as represented by expression (102). Also refer to expression (101)) are updated by the following expression (103) and expression (104).

{circumflex over (x)}j(k)←{circumflex over (v)}j(k){circumflex over (R)}j({circumflex over (v)}1(k){circumflex over (R)}1+{circumflex over (v)}2(k){circumflex over (R)}2)−1y(k)  (103)
Σ(k)←({circumflex over (v)}1(k){circumflex over (R)}1)({circumflex over (v)}1(k){circumflex over (R)}1+v2(k){circumflex over (R)}2)−1({circumflex over (v)}2(k){circumflex over (R)}2)  (104)


In the third processing, the estimation value {circumflex over ( )}vj(k) of the variance parameter v (k) and the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj are updated by the following expression (105) and expression (106) based on the mean {circumflex over ( )}xj (k) of the posterior probability density of the source signal component xj(k) and the covariance matrix Σ(k). In this case, tr represents a trace.












v
^

j



(
k
)





1
I



tr
[



(


R
^

j

)


-
1




(





x
^

j



(
k
)







x
^

j



(
k
)


H


+

Σ


(
k
)



)


]






(
105
)








R
^

j




1
K






k
=
1

K




1



v
^

j



(
k
)





(





x
^

j



(
k
)







x
^

j



(
k
)


H


+

Σ


(
k
)



)








(
106
)







In the fourth processing, the second and the third processing are alternately iterated until the update converges, and in a case in which it converges, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is output.


As a problem of the related art, the related art is difficult to be applied to processing of a large-scale data set in which the number of sample points k is extensive, real-time processing requested to be performed in a short time, an appliance having a low calculation capacity, or the like because a huge amount of calculation is requested.


Actually, in the related art, at the time of calculating the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) and the posterior covariance matrix Σ(k), inverse matrix calculation (specifically, calculation of (v1(k)R1+v2(k)R2)−1) and matrix multiplication (specifically, calculation of (v1(k)R1) (v1(k)R1+v2(k)R2)−1(v2(k)R2)) are requested to be performed for each sample point (that is, multiple times). Both of the inverse matrix calculation and the matrix multiplication are arithmetic operations requesting a large calculation amount of O(I3) for the I-th order matrix. The number of sample points typically ranges from several hundreds to tens of millions.


Due to this, in the conventional signal analysis method, a huge amount of calculation is requested to calculate the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) and the posterior covariance matrix Σ(k). Additionally, in the related art, the entire signal analysis device is requested to perform iteration in the fourth processing described above (the number of iterations typically reaches several tens of times), so that the calculation amount becomes much more extensive.


[Signal Analysis Method According to Second Embodiment]


On the other hand, the second embodiment provides a method of calculating the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k), the posterior covariance matrix Σ(k), the spatial covariance matrix R, and the variance parameter v with a small calculation amount based on simultaneous decorrelation of the two source signal components x1(k) and x2(k). As a result, signal analysis can be implemented with a small calculation amount by the signal analysis device 201 as a whole. The following describes a concept of the second embodiment.


By clearly indicating the number of times in sequence of iteration in expression (103) to expression (106) as update rules in the related art (refer to Non Patent Document 1), the following expression (107) to expression (110) are obtained. In this case, n=1, . . . , N is an index representing the number of times in sequence of iteration and N is the number of times of iteration.












x
^

j

(
n
)




(
k
)


=




v
^

j

(

n
-
1

)




(
k
)







R
^

j

(

n
-
1

)


(





v
^

1

(

n
-
1

)




(
k
)





R
^

1

(

n
-
1

)



+




v
^

2

(

n
-
1

)




(
k
)





R
^

2

(

n
-
1

)




)


-
1




y


(
k
)







(
107
)










(
n
)




(
k
)


=


(




v
^

1

(

n
-
1

)




(
k
)





R
^

1

(

n
-
1

)



)




(





v
^

1

(

n
-
1

)




(
k
)





R
^

1

(

n
-
1

)



+




v
^

2

(

n
-
1

)




(
k
)





R
^

2

(

n
-
1

)




)


-
1




(




v
^

2

(

n
-
1

)




(
k
)





R
^

2

(

n
-
1

)



)






(
108
)














v
^

j

(
n
)




(
k
)


=


1
I



tr
[



(


R
^

j

(

n
-
1

)


)


-
1




(





x
^

j

(
n
)




(
k
)







x
^

j

(
n
)




(
k
)


H


+




(
n
)




(
k
)



)


]







(
109
)













R
^

j

(
n
)


=


1
K






k
=
1

K




1



v
^

j

(
n
)




(
k
)





(





x
^

j

(
n
)




(
k
)







x
^

j

(
n
)




(
k
)


H


+




(
n
)




(
k
)



)









(
110
)







The generalized eigenvalue matrix and the generalized eigenvector matrix that are defined similarly to the first embodiment for the generalized eigenvalue problem of estimation values {circumflex over ( )}R1(n-1) and {circumflex over ( )}R2(n-1) of the spatial covariance matrices R1 and R2 are represented by Λ(n-1) and P(n-1). These satisfy expression (111) and expression (112).

P(n-1)H{circumflex over (R)}1(n-1)P(n-1)(n-1)  (111)
P(n-1)H{circumflex over (R)}2(n-2)P(n-1)=I  (112)


That is, Λ(n-1) and P(n-1) satisfy expression (113) and expression (114). P(n-1) is a matrix for implementing simultaneous decorrelation (simultaneous diagonalization), so that P(n-1) is also referred to as a simultaneous decorrelation matrix from that point of view.

{circumflex over (R)}1(n-1)=(P(n-1)H)−1Λ(n-1)(P(n-1))−1  (113)
{circumflex over (R)}2(n-1)=(P(n-1)H)−1(P(n-1))−1  (114)


By substituting expression (113) and expression (114) for expression (107) to be organized to eliminate the inverse matrix operation for each sample point from expression (107), the following expression (115) and expression (116) are obtained similarly to the first embodiment.












x
^

1

(
n
)


(
k
)

=



(

P


(

n
-
1

)


H


)


-
1




(









υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11







υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11

+



υ
^

2

(

n
-
1

)


(
k
)








0















0











υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II







υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II

+



υ
^

2

(

n
-
1

)


(
k
)






)




y



(

n
-
1

)


(
k
)






(
115
)















x
^

2

(
n
)


(
k
)

=



(

P


(

n
-
1

)


H


)


-
1




(







υ
^

2

(

n
-
1

)


(
k
)







υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11

+



υ
^

2

(

n
-
1

)


(
k
)








0















0









υ
^

2

(

n
-
1

)


(
k
)







υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II

+



υ
^

2

(

n
-
1

)


(
k
)






)




y



(

n
-
1

)


(
k
)






(
116
)












where
,



y



(

n
-
1

)


(
k
)

=


P


(

n
-
1

)


H




y

(
k
)







(
117
)








is the observation signal vector subjected to simultaneous decorrelation.


Similarly, by substituting expression (113) and expression (114) for expression (108) to eliminate the inverse matrix operation for each sample point from expression (108), the following expression (118) to expression (121) are obtained.













(
n
)



(
k
)


=




υ
^

1

(

n
-
1

)


(
k
)




(

P


(

n
-
1

)


H


)


-
1






Λ

(

n
-
1

)


(

P

(

n
-
1

)


)


-
1


×


(





υ
^

1

(

n
-
1

)


(
k
)




(

P


(

n
-
1

)


H


)


-
1






Λ

(

n
-
1

)


(

P

(

n
-
1

)


)


-
1



+




υ
^

2

(

n
-
1

)


(
k
)




(

P


(

n
-
1

)


H


)


-
1





(

P

(

n
-
1

)


)


-
1




)


-
1


⁠⁠
×

(




υ
^

2

(

n
-
1

)


(
k
)




(

P


(

n
-
1

)


H


)


-
1





(

P

(

n
-
1

)


)


-
1



)






(
118
)














=




υ
^

1

(

n
-
1

)


(
k
)




(

P


(

n
-
1

)


H


)


-
1






Λ

(

n
-
1

)


(

P

(

n
-
1

)


)


-
1


×



P

(

n
-
1

)


(





υ
^

1

(

n
-
1

)


(
k
)



Λ

(

n
-
1

)



+




υ
^

2

(

n
-
1

)


(
k
)


I


)


-
1




P


(

n
-
1

)


H


×

(




υ
^

2

(

n
-
1

)


(
k
)




(

P


(

n
-
1

)


H


)


-
1





(

P

(

n
-
1

)


)


-
1



)







(
119
)














=



(

P


(

n
-
1

)


H


)


-
1




(




υ
^

1

(

n
-
1

)


(
k
)





υ
^

2

(

n
-
1

)


(
k
)



Λ

(

n
-
1

)



)




(





υ
^

1

(

n
-
1

)


(
k
)



Λ

(

n
-
1

)



+




υ
^

2

(

n
-
1

)


(
k
)


I


)


-
1





(

P

(

n
-
1

)


)


-
1








(
120
)














=



(

P


(

n
-
1

)


H


)


-
1




(








υ
^

1

(

n
-
1

)


(
k
)







υ
^

2

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11








υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11

+



υ
^

2

(

n
-
1

)


(
k
)








0















0










υ
^

1

(

n
-
1

)


(
k
)







υ
^

2

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II








υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II

+



υ
^

2

(

n
-
1

)


(
k
)






)




(

P

(

n
-
1

)


)


-
1








(
121
)







In this way, the inverse matrix operation for each sample point can be eliminated. However, expression (121) still includes matrix multiplication for each sample point. Thus, it is considered that Σ′(n)(k) defined by expression (122) is updated instead of Σ(n)(k).

Σ′(n)(k)=P(n-1)HΣ(n)(k)P(n-1)  (122)


Based on expression (121), Σ′(n)(k) can be represented by expression (123).















(
n
)



(
k
)


=

(








υ
^

1

(

n
-
1

)


(
k
)







υ
^

2

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11








υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11

+



υ
^

2

(

n
-
1

)


(
k
)








0















0










υ
^

1

(

n
-
1

)


(
k
)







υ
^

2

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II








υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II

+



υ
^

2

(

n
-
1

)


(
k
)






)





(
123
)







Expression (123) does not include matrix multiplication for each sample point. Σ′(n)(k) can be regarded as a posterior covariance matrix subjected to linear transformation of simultaneous decorrelation, so that Σ′(n)(k) is referred to as the posterior covariance matrix subjected to simultaneous decorrelation hereinafter.


In expression (115) and expression (116), it is considered that {circumflex over ( )}x′j(n)(k) defined by expression (124) is updated instead of {circumflex over ( )}xj(n)(k).

{circumflex over (x)}′j(n)(k)=P(n-1)H{circumflex over (x)}j(n)(k)  (124)


Based on expression (115) and expression (116), expression (125) and expression (126) are obtained.












x
^

1



(
n
)


(
k
)

=


(









υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11







υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11

+



υ
^

2

(

n
-
1

)


(
k
)








0















0











υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II







υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II

+



υ
^

2

(

n
-
1

)


(
k
)






)




y



(

n
-
1

)


(
k
)






(
125
)















x
^

2



(
n
)


(
k
)

=


(







υ
^

2

(

n
-
1

)


(
k
)







υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

11

+



υ
^

2

(

n
-
1

)


(
k
)








0















0









υ
^

2

(

n
-
1

)


(
k
)







υ
^

1

(

n
-
1

)


(
k
)

[

Λ

(

n
-
1

)


]

II

+



υ
^

2

(

n
-
1

)


(
k
)






)




y



(

n
-
1

)


(
k
)






(
126
)







{circumflex over ( )}x′j(n)(k) is a source signal component subjected to simultaneous decorrelation.


Next, focusing on expression (109), expression (109) depends on {circumflex over ( )}xj(n)(k) and Σ(n)(k). As described above, in the second embodiment, {circumflex over ( )}x′j(n)(k) and Σ′(n)(k) are updated instead of {circumflex over ( )}xj(n)(k) and Σ(n)(k). Thus, expression (109) is requested to be rewritten into a form that can be calculated by using {circumflex over ( )}x′j(n)(k) and Σ′(n) (k). Based on expression (124) and expression (122), {circumflex over ( )}xj(n)(k) and Σ(n)(k) can be represented as expression (127) and expression (128).

{circumflex over (x)}j(n)(k)=(P(n-1)H)−1{circumflex over (x)}′j(n)(k)  (127)
Σ(n)(k)=(P(n-1)H)−1Σ′(n)(k)(P(n-1))−1  (128)


By substituting these for expression (109), expression (134) is obtained through expression (129) to expression (133).












υ
^

j

(
n
)


(
k
)

=


1
I



tr
[



(


R
^

j

(

n
-
1

)


)


-
1




(




(

P


(

n
-
1

)


H


)


-
1






x
^

j



(
n
)


(
k
)






x
^

j



(
n
)


(
k
)

H




(

P

(

n
-
1

)


)


-
1



+



(

P


(

n
-
1

)


H


)


-
1










(
n
)




(
k
)




(

P

(

n
-
1

)


)


-
1






)


]






(
129
)














=


1
I



tr
[



(


R
^

j

(

n
-
1

)


)


-
1





(

P


(

n
-
1

)


H


)


-
1




(





x
^

j



(
n
)


(
k
)






x
^

j



(
n
)


(
k
)

H


+






(
n
)



(
k
)



)




(

P

(

n
-
1

)


)


-
1



]







(
130
)














=


1
I



tr
[



(

P

(

n
-
1

)


)


-
1





(


R
^

j

(

n
-
1

)


)


-
1





(

P


(

n
-
1

)


H


)


-
1




(





x
^

j



(
n
)


(
k
)






x
^

j



(
n
)


(
k
)

H


+






(
n
)



(
k
)



)


]







(
131
)














=


1
I



tr
[



(


P


(

n
-
1

)


H





R
^

j

(

n
-
1

)




P

(

n
-
1

)



)


-
1




(





x
^

j



(
n
)


(
k
)






x
^

j



(
n
)


(
k
)

H


+






(
n
)



(
k
)



)


]







(
132
)














=

{






1
I



tr
[



(

Λ

(

n
-
1

)


)


-
1




(





x
^

1



(
n
)


(
k
)






x
^

1



(
n
)


(
k
)

H


+






(
n
)



(
k
)



)


]


,




j
=
1








1
I



tr

(





x
^

2



(
n
)


(
k
)






x
^

2



(
n
)


(
k
)

H


+






(
n
)



(
k
)



)


,




j
=
2










(
133
)














=

{






1
I






i
=
1

I







"\[LeftBracketingBar]"



[



x
^

1



(
n
)


(
k
)

]

i



"\[RightBracketingBar]"


2

+


[






(
n
)



(
k
)


]

ii




[

Λ

(

n
-
1

)


]

ii




,




j
=
1








1
I






i
=
1

I


(





"\[LeftBracketingBar]"



[



x
^

2



(
n
)


(
k
)

]

i



"\[RightBracketingBar]"


2

+


[






(
n
)



(
k
)


]

ii


)



,




j
=
2










(
134
)







Similarly, by substituting expression (127) and expression (128) for expression (110) to be rewritten into a form that can be calculated by using {circumflex over ( )}x′j(n)(k) and Σ′(n)(k), the following expression (135) and expression (136) are obtained.











R
^

j

(
n
)


=


1
K






k
=
1

K



1



υ
^

j

(
n
)


(
k
)




(




(

P


(

n
-
1

)


H


)


-
1






x
^

j



(
n
)


(
k
)






x
^

j



(
n
)


(
k
)

H




(

P

(

n
-
1

)


)


-
1



+



(

P


(

n
-
1

)


H


)


-
1









(
n
)




(
k
)




(

P

(

n
-
1

)


)


-
1






)








(
135
)














=




(

P


(

n
-
1

)


H


)


-
1


[


1
K






k
=
1

K



1



υ
^

j

(
n
)


(
k
)




(





x
^

j



(
n
)


(
k
)






x
^

j



(
n
)


(
k
)

H


+






(
n
)



(
k
)



)




]




(

P

(

n
-
1

)


)


-
1








(
136
)







Also in this case, {circumflex over ( )}R′j(n) defined by expression (137) is updated instead of {circumflex over ( )}Rj(n).

{circumflex over (R)}′j(n)=P(n-1)H{circumflex over (R)}j(n)P(n-1)  (137)


Due to this, the inverse matrix operation and the matrix multiplication can be reduced in some degree, so that the calculation amount can be reduced. {circumflex over ( )}R′j(n) can be represented by expression (138) based on expression (136).











R
^

j



(
n
)


=


1
K






k
=
1

K



1



υ
^

j

(
n
)


(
k
)




(





x
^

j



(
n
)


(
k
)






x
^

j



(
n
)


(
k
)

H


+






(
n
)



(
k
)



)








(
138
)







{circumflex over ( )}R′j(n) can be regarded as a spatial covariance matrix subjected to linear transformation of simultaneous decorrelation, so that {circumflex over ( )}R′j(n) is referred to as a spatial covariance matrix subjected to simultaneous decorrelation hereinafter.


Lastly, the following describes update of the generalized eigenvalue matrix Λ and the generalized eigenvector matrix P. At the first iteration, Λ(0) and P(0) may be obtained by solving the generalized eigenvalue problem for the initial estimation values {circumflex over ( )}R1(0) and {circumflex over ( )}R2(0) of the spatial covariance matrices R1 and R2.


In the second and subsequent iteration, estimation values {circumflex over ( )}R′1(n) and {circumflex over ( )}R′2(n) of the spatial covariance matrix subjected to simultaneous decorrelation are obtained. It is possible to apply the generalized eigenvalue problem thereto after returning the estimation values {circumflex over ( )}R′1(n) and {circumflex over ( )}R′2(n) to {circumflex over ( )}R1(n) and {circumflex over ( )}R2(n) by inverse conversion of simultaneous decorrelation, but a slightly waste calculation is generated.


Thus, calculation can be performed more efficiently as described below. First, the generalized eigenvalue problem for {circumflex over ( )}R′1(n) and {circumflex over ( )}R′2(n) is solved to obtain a diagonal matrix Λ(n) including the generalized eigenvalue as the diagonal component and a matrix Q(n) including the generalized eigenvectors as column vectors. These are assumed to satisfy (Q(n))H({circumflex over ( )}R′1(n))Q(n)(n) and (Q(n))H({circumflex over ( )}R′2(n))Q(n)=I.


Next, P(n) is updated by expression (139).

P(n)=P(n-1)Q(n)  (139)


In this case, Λ(n) and P(n) become the generalized eigenvalue matrix and the generalized eigenvector matrix for {circumflex over ( )}R1(n) and {circumflex over ( )}R2(n). This fact can be confirmed by the following expression (140) to expression (147).

[Expression 114]












R
^

1

(
n
)




P

(

n
-
1

)




Q

(
n
)



=



(

P


(

n
-
1

)


H


)


-
1




P


(

n
-
1

)


H





R
^

1

(
n
)




P

(

n
-
1

)




Q

(
n
)







(
140
)














=



(

P


(

n
-
1

)


H


)


-
1





R
^

1



(
n
)




Q

(
n
)








(
141
)














=



(

P


(

n
-
1

)


H


)


-
1





R
^

2



(
n
)




Q

(
n
)




Λ

(
n
)








(
142
)














=



(

P


(

n
-
1

)


H


)


-
1




P


(

n
-
1

)


H





R
^

2

(
n
)




P

(

n
-
1

)




Q

(
n
)




Λ

(
n
)








(
143
)














=



R
^

2

(
n
)




P

(

n
-
1

)




Q

(
n
)




Λ

(
n
)








(
144
)















(


P

(

n
-
1

)




Q

(
n
)



)

H





R
^

2

(
n
)


(


P

(

n
-
1

)




Q

(
n
)



)


=


Q


(
n
)


H




P


(

n
-
1

)


H





R
^

2

(
n
)




P

(

n
-
1

)




Q

(
n
)







(
145
)














=


Q


(
n
)


H





R
^

2



(
n
)




Q

(
n
)








(
146
)














=
I





(
147
)







Effect of Second Embodiment

In this way, according to the second embodiment, the estimation value of the source signal component (subjected to simultaneous decorrelation) and the posterior covariance matrix (subjected to simultaneous decorrelation), and the spatial covariance matrix and the variance parameter can be calculated with a smaller calculation amount as compared with the related art, so that signal separation can be implemented with a smaller calculation amount as compared with the related art.


The reason for that is as follows. First, similarly to the first embodiment, the inverse matrix operation for each sample point is not requested based on simultaneous decorrelation. Second, by updating the posterior covariance matrix Σ′(n)(k) subjected to simultaneous decorrelation instead of the posterior covariance matrix Σ(n)(k) itself, matrix multiplication for each sample point is not requested.


As described above, the processing of the conventional signal analysis method is equivalent to the processing in the second embodiment in theory, so that signal analysis results of the conventional signal analysis method and the second embodiment are identical to each other in theory (however, as a matter of course, a calculation error may be caused by a computer in some cases). Thus, according to the second embodiment, signal analysis can be implemented with a smaller calculation amount as compared with the conventional signal analysis method without deteriorating the signal analysis performance.


Unlike the first embodiment, according to the second embodiment, the variance parameter vj(k) and the spatial covariance matrix Rj, and the source signal component xj(k) can be estimated at the same time even in a case in which the variance parameter vj(k) and the spatial covariance matrix Rj are unknown.


Third Embodiment

Next, the following describes a configuration, a processing procedure, and an effect of the signal analysis device according to a third embodiment.


In the third embodiment, I (I is an integral number equal to or larger than 2) observation signals yi(t) acquired by a microphone at different positions are assumed to be input to the signal analysis device in a situation in which two sound source signals are present in a mixed manner. The observation signal yi(t) is subjected to frequency resolution, subjected to signal separation using the method described in the second embodiment for each frequency, and converted into a time domain at last.


[Configuration and Processing of Signal Analysis Device]


With reference to FIG. 5 and FIG. 6, the following describes a configuration of the signal analysis device according to the third embodiment and a processing procedure of signal analysis processing. FIG. 5 is a diagram illustrating an example of the configuration of the signal analysis device according to the third embodiment. FIG. 6 is a flowchart illustrating an example of the processing procedure of the signal analysis processing according to the third embodiment.


As illustrated in FIG. 5, a signal analysis device 301 includes a frequency domain conversion unit 310, a frequency domain signal separation unit 320, and a time domain conversion unit 330.


First, the following describes an outline of each unit of the signal analysis device 301. The frequency domain conversion unit 310 receives the input observation signal yi(t) (Step S40), and calculates an observation signal yi(k, f) of a time-frequency domain by short-time Fourier transformation and the like (Step S41). In this case, k represents an index of a time frame, and f represents an index of a frequency bin. Hereinafter, yi(k, f) is represented as yif(k).


The frequency domain signal separation unit 320 performs signal separation in a frequency domain by applying the method of the second embodiment to each f (Step S42). That is, by using yi1(k) as an input for f=1, estimation values {circumflex over ( )}x11(k) and {circumflex over ( )}x21(k) of the source signal components are obtained by using the method in the second embodiment.


By using yi2(k) as an input for f=2, estimation values {circumflex over ( )}x12(k) and {circumflex over ( )}x22(k) of the source signal components are obtained by using the method in the second embodiment. This processing is repeated for each f.


By using yiF(k) as an input for f=F, estimation values {circumflex over ( )}x1F(k) and {circumflex over ( )}x2F(k) of the source signal components are obtained by using the method in the second embodiment. Hereinafter, {circumflex over ( )}xjf(k) is represented as {circumflex over ( )}xj(k, f).


The time domain conversion unit 330 calculates and outputs an estimation value {circumflex over ( )}xj(t) of the source signal component of the time domain by inverse short-time Fourier transformation and the like based on the estimation value {circumflex over ( )}xj(k, f) of the source signal component of the time-frequency domain (Step S43).


In this way, in the third embodiment, the method according to the second embodiment is applied to each frequency bin f in the time-frequency domain. On the other hand, Non Patent Document 1 discloses a method of applying the related art described in the second embodiment to each frequency bin f in the time-frequency domain.


There has been the problem that this related art is difficult to be applied to processing of a large-scale data set, real-time processing (short-time processing is requested), an appliance having a low calculation capacity, and the like because a huge amount of calculation is requested. Actually, in the related art, inverse matrix calculation and matrix multiplication are requested to be performed for each frequency bin, for each frame, and for each iteration (that is, multiple times). Due to this, in the conventional signal analysis method, a huge amount of calculation is requested.


For example, in an experiment described later, the number of frequency bins was 256, the number of frames was 249, the number of iterations was 10, and it took 750 seconds for the processing, for the observation signal of 8 seconds. Assuming a data set of 80000 seconds (about 22 hours), it will take 7500000 seconds (about 87 days) for the processing according to calculations.


Effect of Third Embodiment

To confirm the effect of the present invention, the following describes a sound source separation experiment using the conventional method and the third embodiment.



FIG. 7 is a diagram illustrating an experiment condition for the sound source separation experiment using the conventional method and the method according to the third embodiment. FIG. 8 is a diagram illustrating a result of the sound source separation experiment (real time factor) using the conventional method and the third embodiment. FIG. 9 is a diagram illustrating a result of the sound source separation experiment (sound source separation performance) using the conventional method and the third embodiment.


A reverberation voice was generated by convoluting a reverberation impulse response that was measured in a laboratory illustrated in FIG. 7 into an English voice of 8 seconds, and reverberation voices of a plurality of persons were added to generate the observation signal. FIG. 8 and FIG. 9 illustrate performances in a case of performing sound source separation on the observation signal using the respective methods. FIG. 8 illustrates the real time factor (time requested for processing when an observation signal length is 1). FIG. 9 illustrates a signal-to-distortion ratio as sound source separation performance (as a value becomes larger, the sound source separation performance is higher).


As illustrated in FIG. 8 and FIG. 9, with the method according to the third embodiment, it is found that calculation time can be reduced by 250 times or more as compared with the conventional method without deteriorating the sound source separation performance. In a case in which a reverberation time is 130 milliseconds and 440 milliseconds, the signal-to-distortion ratio obtained by the method according to the third embodiment is slightly reduced as compared with the conventional method, but this is due to a calculation error caused by a computer. In this way, the method according to the third embodiment is equivalent to the conventional method in theory, but a difference in performance may be actually caused due to a calculation error. However, the difference is very small as illustrated in FIG. 8.


In this way, according to the third embodiment, the calculation time can be greatly reduced as compared with the conventional method without deteriorating the sound source separation performance.


Fourth Embodiment

Next, the following describes a configuration, a processing procedure, and an effect of the signal analysis device according to a fourth embodiment.


The fourth embodiment is obtained by changing the second embodiment to have a configuration of estimating the variance parameter vj(k) and the spatial covariance matrix Rj using an auxiliary function method.


[Configuration and Processing of Signal Analysis Device]


With reference to FIG. 10 and FIG. 11, the following describes a configuration of the signal analysis device according to the fourth embodiment and a processing procedure of the signal analysis processing. FIG. 10 is a diagram illustrating an example of the configuration of the signal analysis device according to the fourth embodiment. FIG. 11 is a flowchart illustrating an example of the processing procedure of the signal analysis processing according to the fourth embodiment.


As illustrated in FIG. 10, a signal analysis device 401 includes an initialization unit (not illustrated), an observation signal vector creation unit 410, a signal parameter analysis unit 420, a simultaneous decorrelation unit 430, a variance parameter update unit 440, a spatial covariance matrix update unit 450, and a convergence determination unit (not illustrated).


First, the following describes an outline of each unit of the signal analysis device 401. The initialization unit (not illustrated) sets the initial estimation value {circumflex over ( )}vj(0)(k) of the variance parameter vj(k) and the initial estimation value {circumflex over ( )}Rj(0) of the spatial covariance matrix Rj (Step S51). These values may be set based on a random number, for example.


The observation signal vector creation unit 410 acquires the input observation signal yi(k) (Step S52), and creates the observation signal vector y(k) as an I-dimensional column vector constituted of all the observation signals for each sample point (Step S53).


The signal parameter analysis unit 420 updates the generalized eigenvalue matrix Λ and the generalized eigenvector matrix P by solving the generalized eigenvalue problem based on the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj from the spatial covariance matrix update unit 450 (Step S54). Specifically, the signal parameter analysis unit 420 obtains the generalized eigenvalue matrix Λ and the generalized eigenvector matrix P that are defined similarly to the first embodiment for the generalized eigenvalue problem of the estimation values {circumflex over ( )}R1 and {circumflex over ( )}R2 of the spatial covariance matrices R1 and R2. These are assumed to satisfy expression (148) and expression (149).

PH{circumflex over (R)}1P=Λ  (148)
PH{circumflex over (R)}2P=I  (149)


As an exception, at the time of the first processing performed by the signal parameter analysis unit 420, the initial estimation value {circumflex over ( )}Rj(0) of the input spatial covariance matrix Rj is used instead of the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj from the spatial covariance matrix update unit 450 in the processing described above performed by the signal parameter analysis unit 420.


The simultaneous decorrelation unit 430 updates the observation signal vector y′(k) subjected to simultaneous decorrelation by expression (90) based on the observation signal vector y(k) from the observation signal vector creation unit 410 and the generalized eigenvector matrix P from the signal parameter analysis unit 420 (Step S55).


The variance parameter update unit 440 updates the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) by the following expression (150) and expression (151) based on the observation signal vector y′(k) subjected to simultaneous decorrelation from the simultaneous decorrelation unit 430 and the generalized eigenvalue matrix Λ from the signal parameter analysis unit 420 (Step S56).












υ
^

1

(
k
)






υ
^

1

(
k
)








i
=
1

I





[
Λ
]

ii



(






υ
^

1

(
k
)

[
Λ
]

ii

+



υ
^

2

(
k
)


)

2







"\[LeftBracketingBar]"



[


y


(
k
)

]

i



"\[RightBracketingBar]"


2







i
=
1

I




[
Λ
]

ii







υ
^

1

(
k
)

[
Λ
]

ii

+



υ
^

2

(
k
)











(
150
)















υ
^

2

(
k
)






υ
^

2

(
k
)








i
=
1

I



1


(






υ
^

2

(
k
)

[
Λ
]

ii

+



υ
^

2

(
k
)


)

2







"\[LeftBracketingBar]"



[


y


(
k
)

]

i



"\[RightBracketingBar]"


2







i
=
1

I


1






υ
^

1

(
k
)

[
Λ
]

ii

+



υ
^

2

(
k
)











(
151
)







In this case, {circumflex over ( )}v1(k) and {circumflex over ( )}v2(k) on the right side represent estimation values of the variance parameter obtained in the previous processing performed by the variance parameter update unit 440 (as an exception, the initial estimation value of the variance parameter from the initialization unit at the time of the first processing performed by the variance parameter update unit 440).


The spatial covariance matrix update unit 450 updates the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj based on the observation signal vector y′(k) subjected to simultaneous decorrelation from the simultaneous decorrelation unit 430, the generalized eigenvector matrix P and the generalized eigenvalue matrix Λ from the signal parameter analysis unit 420, and the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 440 (Step S57). Specifically, the spatial covariance matrix update unit 450 firstly updates a matrix Cj by the following expression (152).










C
j




P

(







k
=
1

K





υ
^

j

(
k
)







υ
^

1

(
k
)

[
Λ
]

11

+



υ
^

2

(
k
)






0





0




0






k
=
1

K





υ
^

j

(
k
)







υ
^

1

(
k
)

[
Λ
]

22

+



υ
^

2

(
k
)









0


















0


0









k
=
1

K





υ
^

j

(
k
)







υ
^

1

(
k
)

[
Λ
]

II

+



υ
^

2

(
k
)







)



P
H






(
152
)







Next, the spatial covariance matrix update unit 450 updates a vector z(k) by the following expression (153).










z

(
k
)



(






1






υ
^

1

(
k
)

[
Λ
]

11

+



υ
^

2

(
k
)



[


y


(
k
)

]


1












1






υ
^

1

(
k
)

[
Λ
]

II

+



υ
^

2

(
k
)



[


y


(
k
)

]


I


)

T









(
153
)







Next, the spatial covariance matrix update unit 450 updates a matrix Dj by the following expression (154).










D
j





(

P

-
1


)

H




Λ
j

[




k
=
1

K





υ
^

j

(
k
)



z

(
k
)




z

(
k
)

H



]



Λ
j



P

-
1







(
154
)







Next, the spatial covariance matrix update unit 450 updates the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj by the following expression (155).











R
^

j






D
j

1
2


(


(


D
j

1
2




C
j



D
j

1
2



)


1
2


)


-
1




D
j

1
2







(
155
)







The convergence determination unit (not illustrated) determines whether the update converges (Step S58). If the convergence determination unit determines that the update does not converge (No at Step S58), the process returns to the processing performed by the signal parameter analysis unit 420 (Step S54), and the processing is continued.


On the other hand, if the convergence determination unit determines that the update converges (Yes at Step S58), the variance parameter update unit 440 outputs the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k), and the spatial covariance matrix update unit 450 outputs the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj (Step S59).


Background of Fourth Embodiment

The following describes a background of the fourth embodiment. On the basis of the auxiliary function method, the variance parameter vj(k) and the spatial covariance matrix Rj can be estimated as follows. The estimation values of the variance parameter vj(k) and the spatial covariance matrix Rj are initialized in advance, and updated by the iteration processing described below. The estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) is updated by the following expression (156).












υ
^

j

(
k
)






υ
^

j

(
k
)







y

(
k
)

H




(




l
=
1

2





υ
^

l

(
k
)




R
^

l



)


-
1







R
^

j

(




l
=
1

2





υ
^

l

(
k
)




R
^

l



)


-
1




y

(
k
)



tr
[



(




l
=
1

2





υ
^

l

(
k
)




R
^

l



)


-
1





R
^

j


]








(
156
)







In update of the spatial covariance matrix Rj, the matrix Cj and the matrix Dj are firstly updated by expression (157) and expression (158), and the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj is secondly updated by expression (159).












C
j






k
=
1

K





υ
^

j

(
k
)




(




l
=
1

2





υ
^

l

(
k
)




R
^

l



)


-
1









(
157
)













D
j






R
^

j

[




k
=
1

K





υ
^

j

(
k
)




(




l
=
1

2





υ
^

l

(
k
)




R
^

l



)


-
1




y

(
k
)




y

(
k
)

H




(




l
=
1

2





υ
^

l

(
k
)




R
^

l



)


-
1




]




R
^

j






(
158
)
















R
^

j






D
j

1
2


(


(


D
j

1
2




C
j



D
j

1
2



)


1
2


)


-
1




D
j

1
2








(
159
)







In this case, a square root A1/2 of a positive semidefinite Hermitian matrix A is defined by expression (161) assuming that expression (160) is eigenvalue decomposition of A. In this case, U represents a unitary matrix constituted of eigenvectors of A, Λ represents a diagonal matrix constituted of eigenvalues of A, and Σ1/2 represents a diagonal matrix obtained by replacing each diagonal element [Σ]ii of Σ with a square root ([Σ]ii)1/2 thereof.









A
=

U





Σ






U
H






(
160
)







A

1
2


=

U






Σ

1
2




U
H






(
161
)







The update of the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) and the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj described above is iterated until the update converges.


Update rules for the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) and the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj described above are derived by using a method similar to the methods disclosed in reference document 3 “Kazuyoshi Yoshii, Ryota Tomioka, Daichi Mochihashi, and Masataka Goto, “Infinite Positive Semidefinite Tensor Factorization for Source Separation of Mixture Signals”, Proceedings of the 30th International Conference on Machine Learning, 2013.” and reference document 4 “Hiroshi Sawada, Hirokazu Kameoka, Shoko Araki, and Naonori Ueda, “Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data”, IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 21, No. 5, May 2013.”.


There has been such a problem in the methods described above that a huge amount of calculation is requested because an inverse matrix operation is requested to be performed for each sample point in each piece of the processing of expression (156) to expression (158).


On the other hand, in the fourth embodiment, the calculation amount can be reduced by avoiding the inverse matrix operation for each sample point based on simultaneous decorrelation (simultaneous diagonalization). Actually, by substituting expression (148) and expression (149) that are solved for {circumflex over ( )}R1 and {circumflex over ( )}R2, for expression (156) to expression (158), the update rules for the estimation values of the variance parameter vj(k) and the spatial covariance matrix Rj are obtained as follows.


The update rule for the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) is represented by the following expression (162) and expression (163).












v
^

j



(
k
)









v
^

j



(
k
)









y




(
k
)


H




(




l
=
1

2






v
^

l



(
k
)




Λ
l



)


-
1






Λ
j

(




l
=
1

2






v
^

l



(
k
)




Λ
l



)


-
1





y




(
k
)




tr
[



(




l
=
1

2






v
^

l



(
k
)




Λ
l



)


-
1




Λ
j


]











(
162
)







=






v
^

j



(
k
)









i
=
1

I






[

Λ
j

]

ii



(






v
^

1



(
k
)




[
Λ
]


ii

+



v
^

2



(
k
)



)

2








[


y




(
k
)


]

i



2







i
=
1

I





[

Λ
j

]

ii







v
^

1



(
k
)




[
Λ
]


ii

+



v
^

2



(
k
)












   


(
163
)








In this case, y′(k) represents a simultaneously decorrelated observation signal vector of expression (164), Λ1 represents the generalized eigenvalue matrix Λ, and Λ2 represents a unit matrix I.

y′(k)=PHy(k)  (164)


The update rule for the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj is represented by the following expression (165) and expression (166). First, the matrix Cj is represented by the following expression.










C
j






P
[




k
=
1

K






v
^

j



(
k
)





(




l
=
1

2






v
^

l



(
k
)




Λ
l



)


-
1




]



P
H






                


(
165
)







=




P


(







k
=
1

K






v
^

j



(
k
)








v
^

1



(
k
)




[
Λ
]


11

+



v
^

2



(
k
)







0





0




0






k
=
1

K






v
^

j



(
k
)








v
^

1



(
k
)




[
Λ
]


22

+



v
^

2



(
k
)










0


















0


0









k
=
1

K






v
^

j



(
k
)








v
^

1



(
k
)




[
Λ
]


II

+



v
^

2



(
k
)








)




P
H









(
166
)








The matrix Dj is represented by the following expression (167) and expression (168).










D
j







(

P

-
1


)

H




Λ
j

[




k
=
1

K






v
^

j



(
k
)





(




l
=
1

2






v
^

l



(
k
)




Λ
l



)


-
1





y




(
k
)










                              


(
167
)














y




(
k
)


H




(




l
=
1

2






v
^

l



(
k
)




Λ
l



)


-
1



]



Λ
j



P

-
1











=





(

P

-
1


)

H




Λ
j

[




k
=
1

K






v
^

j



(
k
)




z


(
k
)





z


(
k
)


H



]



Λ
j



P

-
1










(
168
)








In this case, vector z(k) is defined by the following expression (169) to expression (171).










z


(
k
)


=





(




l
=
1

2






v
^

l



(
k
)




Λ
l



)


-
1





y




(
k
)










(
169
)







=




(




1






v
^

1



(
k
)




[
Λ
]


11

+



v
^

2



(
k
)






0





0




0



1






v
^

1



(
k
)




[
Λ
]


22

+



v
^

2



(
k
)









0


















0


0






1






v
^

1



(
k
)




[
Λ
]


II

+



v
^

2



(
k
)







)




y




(
k
)










(
170
)







=




(






1






v
^

1



(
k
)




[
Λ
]


11

+



v
^

2



(
k
)






[


y




(
k
)


]


1









1






v
^

1



(
k
)




[
Λ
]


II

+



v
^

2



(
k
)






[


y




(
k
)


]


I




)

T





    


(
171
)








Effect of Fourth Embodiment

In this way, necessity for the inverse matrix operation for each sample point is removed by simultaneous decorrelation (simultaneous diagonalization), and the calculation amount can be reduced. Specifically, an update expression for the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) can be represented by using the diagonal matrix Λj=PHRjP as in expression (162) based on simultaneous decorrelation (simultaneous diagonalization), and as a result, the inverse matrix operation can be made unnecessary as in expression (163). The same applies to the update rule for the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj.


Similarly to the processing of expression (159), the inverse matrix operation is requested to be performed in the processing of expression (155), but the inverse matrix operation is not requested to be performed for each sample point but may be performed only once for each iteration, so that the calculation amount thereof is relatively small. Similarly, eigenvalue decomposition is requested to be performed for calculating a square root of the matrix in the processing of expression (155) similarly to the processing of expression (159), but the eigenvalue decomposition is not requested to be performed for each sample point but may be performed only twice for each iteration, so that the calculation amount thereof is relatively small. The processing of expression (166), expression (168), and expression (155) includes matrix multiplication as an arithmetic operation with a large calculation amount similarly to the inverse matrix operation, but the matrix multiplication is also not requested to be performed for each sample point, so that the calculation amount thereof is relatively small.


Modification of Fourth Embodiment

The present embodiment describes the case in which the two source signals are present, but can also be applied to a case in which J source signals are typically present, not limited to the case in which the two source signals are present, based on a concept similar to that of the third modification of the first embodiment.


As described above, in the first to the fourth embodiments, the signal parameter analysis unit obtains the simultaneous decorrelation matrix P as a matrix in which PHRjP all becomes the diagonal matrix or/and the Hermitian transposition PH thereof, for the spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling respective spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, as a parameter for decorrelating components corresponding to the J source signals for observation signal vectors based on the observation signals acquired at I (I is an integral number equal to or larger than 2) positions different from each other. It is sufficient that the signal analysis devices 1, 201, 301, and 401 perform multiplication for each element with a small calculation amount in filtering by performing simultaneous decorrelation using this parameter for performing decorrelation, and the calculation amount for the estimation value of the source signal component can be reduced.


Fifth Embodiment

Next, the following describes a configuration, a processing procedure, and an effect of the signal analysis device according to a fifth embodiment. The fifth embodiment is a variation of the third modification of the first embodiment. The third modification of the first embodiment has the precondition that the variance parameter vj(k) and the spatial covariance matrix Rj are input to the signal analysis device. On the other hand, in the fifth embodiment, the variance parameter vj(k) and the spatial covariance matrix Rj are not requested to be input to the signal analysis device, and the variance parameter vj(k), the simultaneous decorrelation matrix P and the spatial covariance matrix Λj subjected to simultaneous decorrelation as parameters for modeling the spatial covariance matrix Rj, and the source signal component xj(k) are estimated at the same time.


The fifth embodiment describes a method of implementing signal separation by estimating the variance parameter vj(k), the simultaneous decorrelation matrix P and the spatial covariance matrix Λj subjected to simultaneous decorrelation as parameters for modeling the spatial covariance matrix Rj, and the source signal component xj(k) at the same time in a situation in which J source signals are typically present in a mixed manner and the variance parameter vj(k) and the spatial covariance matrix Rj are unknown. In the fifth embodiment, I observation signals yi(k) acquired at different positions are assumed to be input to the signal analysis device.


[Configuration and Processing of Signal Analysis Device]


With reference to FIG. 12 and FIG. 13, the following describes a configuration of the signal analysis device and a processing procedure of the signal analysis processing according to the fifth embodiment. FIG. 12 is a diagram illustrating an example of the configuration of the signal analysis device according to the fifth embodiment. FIG. 13 is a flowchart illustrating an example of the processing procedure of the signal analysis processing according to the fifth embodiment.


As illustrated in FIG. 12, a signal analysis device 501 includes an initialization unit (not illustrated), an observation signal vector creation unit 510, a simultaneous decorrelation unit 530, a single channel Wiener filter unit 540, a posterior covariance matrix update unit 550, a variance parameter update unit 560, a spatial covariance matrix update unit 570, a signal parameter analysis unit 520, a convergence determination unit (not illustrated), and a simultaneous decorrelation inverse conversion unit 580.


First, the following describes an outline of each unit of the signal analysis device 501. The initialization unit (not illustrated) sets the initial estimation value {circumflex over ( )}vj(0)(k) of the variance parameter vj(k), the initial estimation value {circumflex over ( )}P(0) of the simultaneous decorrelation matrix P, and the initial estimation value {circumflex over ( )}Λj(0) of the spatial covariance matrix Λj subjected to simultaneous decorrelation (Step S61). These values may be set based on a random number, for example.


The observation signal vector creation unit 510 acquires I observation signals yi(k) input and acquired at different positions (Step S62), and creates the observation signal vector y(k) as an I-dimensional column vector for each sample point by expression (82) (Step S63).


The simultaneous decorrelation unit 530 updates the observation signal vector y′(k) subjected to simultaneous decorrelation by the following expression (172) based on the observation signal vector y(k) from the observation signal vector creation unit 510 and the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P from the signal parameter analysis unit 520 (as an exception, the initial estimation value {circumflex over ( )}P(0) of the simultaneous decorrelation matrix P from the initialization unit is used at the time of the first processing performed by the simultaneous decorrelation unit 530) (Step S64).

y′(k)←{circumflex over (P)}Hy(k)  (172)


However, at the time of the first processing performed by the simultaneous decorrelation unit 530, {circumflex over ( )}P is replaced with {circumflex over ( )}P(0) on the right side of expression (172).


The single channel Wiener filter unit 540 updates an estimation value {circumflex over ( )}xj′(k) of a source signal component xj′(k) subjected to simultaneous decorrelation by the following expression (173) based on the observation signal vector y′(k) subjected to simultaneous decorrelation from the simultaneous decorrelation unit 530, the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 560 (as an exception, the initial estimation value {circumflex over ( )}vj(0)(k) of the variance parameter vj(k) from the initialization unit is used at the time of the first processing performed by the single channel Wiener filter unit 540), and the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the spatial covariance matrix update unit 570 (as an exception, the initial estimation value {circumflex over ( )}Λj(0) of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the initialization unit is used at the time of the first processing performed by the single channel Wiener filter unit 540) (Step S65).











[



x
^

j




(
k
)


]

i










v
^

j



(
k
)




[


Λ
^

j

]


ii





l
=
1

J







v
^

l



(
k
)




[


Λ
^

l

]


ii





[


y




(
k
)


]


i





(
173
)







However, at the time of the first processing performed by the single channel Wiener filter unit 540, {circumflex over ( )}Λj (j=1, . . . , J) is replaced with {circumflex over ( )}Λj(0) (j=1, . . . , J), and {circumflex over ( )}vj(k) (j=1, . . . , J; k=1, . . . , K) is replaced with {circumflex over ( )}vj(0)(k) (j=1, . . . , J; k=1, . . . , K) on the right side of expression (173).


The posterior covariance matrix update unit 550 updates a posterior covariance matrix Σj′(k) subjected to simultaneous decorrelation by the following expression (174) based on the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 560 (as an exception, the initial estimation value {circumflex over ( )}vj(0)(k) of the variance parameter vj(k) from the initialization unit is used at the time of the first processing performed by the posterior covariance matrix update unit 550), and the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the spatial covariance matrix update unit 570 (as an exception, the initial estimation value {circumflex over ( )}Λj(0) of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the initialization unit is used at the time of the first processing performed by the posterior covariance matrix update unit 550) (Step S66).











[


Σ
j




(
k
)


]

im

=

{










v
^

j



(
k
)




[


Λ
^

j

]


ii

-






v
^

j



(
k
)


2




(


[


Λ
^

j

]

ii

)

2






l
=
1

J







v
^

l



(
k
)




[


Λ
^

l

]


ii




,




i
=
m






0
,




i

m









(
174
)







However, at the time of the first processing performed by the posterior covariance matrix update unit 550, {circumflex over ( )}Λj is replaced with {circumflex over ( )}Λj(0), and {circumflex over ( )}vj(k) is replaced with {circumflex over ( )}vj(0)(k) on the right side of expression (174).


The variance parameter update unit 560 updates the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) by the following expression (175) based on the estimation value {circumflex over ( )}xj′(k) of the source signal component xj′(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 540, the posterior covariance matrix Σj′(k) subjected to simultaneous decorrelation from the posterior covariance matrix update unit 550, and the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the spatial covariance matrix update unit 570 (as an exception, the initial estimation value {circumflex over ( )}Λj(0) of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the initialization unit is used at the time of the first processing performed by the variance parameter update unit 560) (Step S67).












v
^

j



(
k
)





1
I






i
=
1

I









[



x
^

j




(
k
)


]

i



2

+


[


Σ
j




(
k
)


]

ii




[


Λ
^

j

]

ii








(
175
)







However, at the time of the first processing performed by the variance parameter update unit 560, {circumflex over ( )}Λj is replaced with {circumflex over ( )}Λj(0) on the right side of expression (175).


The spatial covariance matrix update unit 570 updates the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation by the following expression (176) based on the estimation value {circumflex over ( )}xj′(k) of the source signal component xj′(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 540, the posterior covariance matrix Σj′(k) subjected to simultaneous decorrelation from the posterior covariance matrix update unit 550, and the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 560 (Step S68).











[


Λ
^

j

]

im



{






1
K






k
=
1

K









[



x
^

j




(
k
)


]

i



2

+


[


Σ
j




(
k
)


]

ii





v
^

j



(
k
)





,




i
=
m






0
,




i

m









(
176
)







The signal parameter analysis unit 520 receives the estimation value {circumflex over ( )}xj′(k) of the source signal component xj′(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 540, the posterior covariance matrix Σj′(k) subjected to simultaneous decorrelation from the posterior covariance matrix update unit 550, the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 560, and the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the spatial covariance matrix update unit 570 (and the initial estimation value {circumflex over ( )}P(0) of the simultaneous decorrelation matrix P from the initialization unit at the time of the first update of the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P performed by the signal parameter analysis unit 520), and updates the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P by the following expression (177) (Step S69). In this case, ei represents the i-th column of the unit matrix, and [A]i represents the i-th column of the matrix A.











[

P
^

]

i

=




P
^

(


1
JK






j
=
1

J




1


[


Λ
^

j

]

ii







k
=
1

K




1



v
^

j



(
k
)





(





x
^

j




(
k
)







x
^

j




(
k
)


H


+


Σ
j




(
k
)



)






)


-
1




e
i






(
177
)







However, at the time of the first update of the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P performed by the signal parameter analysis unit 520, {circumflex over ( )}P is replaced with {circumflex over ( )}P(0) on the right side of expression (177). Alternatively, the signal parameter analysis unit 520 may update the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P by successively applying expression (177) multiple times.


The convergence determination unit (not illustrated) determines whether the update converges (Step S70). If the convergence determination unit determines that the update does not converge (No at Step S70), the process returns to the processing at Step S64 performed by the simultaneous decorrelation unit 530, and the processing is continued.


On the other hand, if the convergence determination unit determines that the update converges (Yes at Step S70), processing by the simultaneous decorrelation unit 530 (Step S71), processing by the single channel Wiener filter unit 540 (Step S72), and processing by the simultaneous decorrelation inverse conversion unit 580 (Step S73) are performed. The processing performed by the simultaneous decorrelation unit 530 and the single channel Wiener filter unit 540 is described above, so that the following describes the processing performed by the simultaneous decorrelation inverse conversion unit 580. The simultaneous decorrelation inverse conversion unit 580 calculates and outputs the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) by the following expression (178) based on the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P from the signal parameter analysis unit 520, and the estimation value {circumflex over ( )}xj′(k) of the source signal component xj′(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 540 (Step S73).

{circumflex over (x)}j(k)←({circumflex over (P)}H)−1{circumflex over (x)}′j(k)  (178)


[Background of Processing Performed by Signal Analysis Device]


Next, the following describes a background of the processing performed by the signal analysis device 501 according to the fifth embodiment. In the fifth embodiment, the observation signal vector y(k) is modeled as a sum of J source signal components x1(k), . . . , xJ(k) by the following expression (179).










y


(
k
)


=




j
=
1

J




x
j



(
k
)







(
179
)







The source signal component xj(k) is assumed to follow a complex gaussian distribution the mean of which is 0 and the covariance matrix thereof is Sj(k)=vj(k)Rj as represented by expression (97). In this case, vj(k) is a variance parameter for modeling a variance of the j-th source signal for each sample point, and Rj is a spatial covariance matrix as a parameter for modeling a spatial characteristic of the j-th source signal. It is assumed that the variance parameter vj(k) is positive, and the spatial covariance matrix Rj is a positive-definite Hermitian matrix. It is assumed that the J source signal components x1(k), . . . , xJ(k) are not correlated with each other.


As a method of estimating the source signal component xj(k) from the observation signal vector y(k), there is known a method based on the multichannel Wiener filter. In this method, the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated by the following expression (180).












x
^

j



(
k
)


=



v
j



(
k
)






R
j

(




l
=
1

J





v
l



(
k
)




R
l



)


-
1




y


(
k
)







(
180
)







As in the fifth embodiment, to calculate the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) (that is, to implement signal separation) by expression (180) in a situation in which the variance parameter vj(k) and the spatial covariance matrix Rj are unknown, these parameters are requested to be estimated.


[Conventional Signal Analysis Method]


Non Patent Document 1 discloses a method of implementing signal separation by estimating the variance parameter vj(k), the spatial covariance matrix Rj, and the source signal component xj(k) at the same time in a situation in which the variance parameter vj(k) and the spatial covariance matrix Rj are unknown.


In the related art, maximum likelihood estimation of the variance parameter vj(k) and the spatial covariance matrix Rj is performed. This maximum likelihood estimation is formulated as an optimization problem of the following expression (181).












max
Ψ









L
1



(
Ψ
)








s
.
t
.






v
j



(
k
)





>

0






(


j
=
1

,





,

J
;

k
=
1


,





,
K

)



,






R
j



0






(


j
=
1

,





,
J

)







(
181
)







In this case, Ψ represents parameters as a whole, and constituted of the variance parameter vj(k) (j=1, . . . , J; k=1, . . . , K) and the spatial covariance matrix Rj (j=1, . . . , J). L1(Ψ) represents a log-likelihood function of the following expression (182).











L
1



(
Ψ
)


=




k
=
1

K



ln






𝒩
(



y


(
k
)


;
0

,




j
=
1

J





v
j



(
k
)




R
j




)







(
182
)







The following expression (183) represents that the matrix A is a positive-definite Hermitian matrix.

Acustom character0  (183)


In the related art, maximum likelihood estimation is performed based on an EM algorithm. In the EM algorithm, an E-step and an M-step are alternately iterated.


At the E-step, a posterior probability p(xj(k)|y(k), {circumflex over ( )}Ψ) of the source signal component xj(k) based on a present estimation value {circumflex over ( )}Ψ of the parameter Ψ. It is indicated that this posterior probability is a complex gaussian distribution as represented by the following expression (184).

p(xj(k)|y(k),{circumflex over (Ψ)})=custom character(xj(k);{circumflex over (x)}j(k),Σj(k))  (184)


In this case, the mean {circumflex over ( )}xj(k) and the covariance matrix Σj(k) are represented by the following expression (185) and expression (186) using the present estimation value {circumflex over ( )}Ψ of the parameter ({circumflex over ( )}vj(k) is a present estimation value of the variance parameter vj(k), and {circumflex over ( )}Rj is a present estimation value of the spatial covariance matrix Rj).












x
^

j



(
k
)


=




v
^

j



(
k
)







R
^

j

(




l
=
1

J






v
^

l



(
k
)





R
^

l



)


-
1




y


(
k
)







(
185
)








Σ
j



(
k
)


=





v
^

j



(
k
)





R
^

j


-


(




v
^

j



(
k
)





R
^

j


)




(




l
=
1

J






v
^

l



(
k
)





R
^

l



)


-
1




(




v
^

j



(
k
)





R
^

j


)







(
186
)







Thus, at the E-step, the mean {circumflex over ( )}xj(k) and the covariance matrix Σj(k) are updated by expression (185) and expression (186). Expression (185) is nothing but an estimation value of the source signal component xj(k) based on the multichannel Wiener filter in expression (180). Hereinafter, {circumflex over ( )}xj(k) is referred to as an estimation value of the source signal component xj(k), and Σj(k) is referred to as a posterior covariance matrix.


At the M-step, the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) and the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj are updated based on the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) and the posterior covariance matrix Σj(k) that are updated at the E-step. These values are updated based on update rules described below (expression (187) and expression (188)).












v
^

j



(
k
)





1
I



tr


[



(


R
^

j

)


-
1




(





x
^

j



(
k
)







x
^

j



(
k
)


H


+


Σ
j



(
k
)



)


]







(
187
)








R
^

j




1
K






k
=
1

K




1



v
^

j



(
k
)





(





x
^

j



(
k
)







x
^

j



(
k
)


H


+


Σ
j



(
k
)



)








(
188
)







However, in the related art, in the update of the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) represented by expression (185) and the update of the posterior covariance matrix Σj(k) represented by expression (186), the inverse matrix calculation and the matrix multiplication are requested to be performed for each sample point. Specifically, the inverse matrix calculation represented by expression (189) (once for each iteration and each sample point) and the matrix multiplication represented by expression (190) (2J times for each iteration and each sample point) are requested to be performed.










(




l
=
1

J






v
^

l



(
k
)





R
^

l



)


-
1





(
189
)







(




v
^

j



(
k
)





R
^

j


)




(




l
=
1

J






v
^

l



(
k
)





R
^

l



)


-
1




(




v
^

j



(
k
)





R
^

j


)





(
190
)







Thus, a huge amount of calculation is requested in the related art, so that the related art is difficult to be applied to a case in which the number of the sample points K is large (the observation signal length is long) or a situation in which calculation resources are limited, such as signal separation in real time, variance processing in a sensor network, and the like.


[Signal Analysis Method According to Fifth Embodiment]


On the other hand, in the fifth embodiment, signal separation is enabled to be performed with a small calculation amount by implementing update of the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) and the posterior covariance matrix Σj(k) at the E-step with a small calculation amount based on simultaneous decorrelation of the J source signal components x1(k), . . . , xJ(k). The following describes a concept of the fifth embodiment.


Assuming that the covariance matrix vj(k)Rj of the source signal component xj(k) is a diagonal matrix in both of expression (185) and expression (186), the inverse matrix operation and the matrix multiplication in expression (185) and expression (186) result in a simple reciprocal operation and multiplication for each diagonal element, so that calculation can be performed with a small calculation amount. However, normally, elements having different vectors xj(k) (that is, the j-th source signals observed at different positions) are correlated with each other, so that a nondiagonal element of the covariance matrix vj(k)Rj is not 0, that is, the covariance matrix vj(k)Rj is not a diagonal matrix.


Thus, in the fifth embodiment, it is considered that the J source signal components x1(k), . . . , xJ(k) are subjected to simultaneous decorrelation by conversion of the following expression (191) using the Hermitian transposition PH of the regular matrix P.

x′j(k)=PHxj(k)  (191)


The covariance matrix of xj′(k) after conversion is given by the following expression (192) and expression (193).










E


(



x
j




(
k
)






x
j




(
k
)


H


)


=




P
H



E


(



x
j



(
k
)






x
j



(
k
)


H


)



P





                                                   


(
192
)







=





v
j



(
k
)




P
H



R
j


P








(
193
)








A condition under which xj′(k) (j=1, . . . , J) after conversion are all decorrelated, that is, a condition under which all covariance matrices given by expression (193) become diagonal matrices is represented by the following expression (194) to expression (196).

[Expression 156]











P
H



R
1


P

=


Λ
1







(
diagonal
)






(
194
)









(
195
)








P
H



R
J


P

=


Λ
J







(
diagonal
)






(
196
)







As already described above, in a case in which the number of signals is J=2, the regular matrix P and the diagonal matrices Λ1, . . . , ΛJ satisfying expression (194) to expression (196) can be obtained by solving the generalized eigenvalue problem. On the other hand, in the fifth embodiment, as an approach that can be applied not only to the case in which J=2 but also to a case in which the typical number of signals is J, maximum likelihood estimation of the regular matrix P and the diagonal matrices Λ1, . . . , ΛJ is performed under the constraint condition that J unknown spatial covariance matrices R1, . . . , RJ are subjected to simultaneous diagonalization based on a certain regular matrix P, that is, the constraint condition that expression (194) to expression (196) are established for the certain regular matrix P and the certain J diagonal matrices Λ1, . . . , ΛJ. The constraint condition described above is equivalent to the fact that the following expressions (expression (197) to expression (199)) are established for the certain regular matrix P and the certain J diagonal matrices Λ1, . . . , ΛJ.










R
1

=



(

P

-
1


)

H



Λ
1



P

-
1







(
197
)









(
198
)







R
J

=



(

P

-
1


)

H



Λ
J



P

-
1







(
199
)







In the fifth embodiment, it is assumed that the spatial covariance matrices R1, . . . , RJ are represented (modeled) as expression (197) to expression (199) using the regular matrix P as a parameter and the diagonal matrices Λ1, . . . , ΛJ as parameters. In this case, estimation of the spatial covariance matrices R1, . . . , RJ results in estimation of the matrix P (which is a matrix for implementing simultaneous decorrelation, so that it is referred to as a simultaneous decorrelation matrix hereinafter) and the diagonal matrices Λ1, . . . , ΛJ (which are matrices obtained by converting the spatial covariance matrices R1, . . . , RJ as represented by expression (194) to expression (196) using the simultaneous decorrelation matrix P, so that they are referred to as spatial covariance matrices subjected to simultaneous decorrelation hereinafter).


Thus, in the fifth embodiment, the simultaneous decorrelation matrix P, the spatial covariance matrix Λj subjected to simultaneous decorrelation, and the variance parameter vj(k) are estimated at the same time using a maximum likelihood method. Due to this, simultaneous decorrelation of the source signal component (simultaneous diagonalization of the covariance matrix of the source signal component) is implemented for the typical number of signals J, so that signal separation is enabled to be performed with a small calculation amount.


The maximum likelihood estimation described above is formulated as the following optimization problem (expression (200)).










max
Θ





L
2



(
Θ
)




s
.
t
.





{







v
j



(
k
)


>

0






(


j
=
1

,





,

J
;

k
=
1


,





,
K

)



,







P


GL


(

I
,


)



,


Λ
j



0


:






diagonal






matrix


(


j
=
1

,





,
J

)















(
200
)







In this case, Θ represents all parameters as a whole, and is constituted of the variance parameter vj(k) (j=1, . . . , J; k=1, . . . , K), the simultaneous decorrelation matrix P, and the spatial covariance matrix Λj (j=1, . . . , J) subjected to simultaneous decorrelation. L2(Θ) is a log-likelihood function in the following expression (201), and can be obtained by substituting expression (197) to expression (199) for expression (182). GL(I, C) is a set constituted of all regular I-th order complex matrices.











L
2



(
Θ
)


=




k
=
1

K



ln






𝒩
(



y


(
k
)


;
0

,




j
=
1

J





v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1





)







(
201
)







In the fifth embodiment, the maximum likelihood estimation described above is implemented by using the EM algorithm. The EM algorithm is constituted of the E-step and the M-step.


At the E-step, the posterior probability of the source signal component is updated based on a present estimation value {circumflex over ( )}Θ of the parameter Θ. Similarly to the case of the related art, a posterior probability p(xj(k)|y(k), {circumflex over ( )}Θ) of xj(k) is a complex gaussian distribution on the right side of expression (184), the mean thereof can be given by expression (185), and the covariance matrix thereof can be given by expression (186). However, unlike the case of the related art, the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj in expression (185) and expression (186) is represented by the following expression (202) to expression (204) using the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P and the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation.











R
^

1

=



(


P
^


-
1


)

H




Λ
^

1




P
^


-
1







(
202
)









(
203
)








R
^

J

=



(


P
^


-
1


)

H




Λ
^

J




P
^


-
1







(
204
)







By substituting expression (202) to expression (204) for expression (185) and expression (186), the following expression (205) to expression (208) are obtained.
















P
^

H





x
^

j



(
k
)








x
^

j


(
k
)



=






v
^

j



(
k
)







Λ
^

j

(




l
=
1

J






v
^

l



(
k
)





Λ
^

l



)


-
1







P
^

H



y


(
k
)







y


(
k
)








    


(
205
)







=




(











v
^

j



(
k
)




[


Λ
^

j

]


11





l
=
1

J







v
^

l



(
k
)




[


Λ
^

l

]


11





[


y




(
k
)


]


1














v
^

j



(
k
)




[


Λ
^

j

]


II





l
=
1

J







v
^

l



(
k
)




[


Λ
^

l

]


II





[


y




(
k
)


]


I




)

T








(
206
)






















P
^

H




Σ
j



(
k
)




P
^






Σ
j


(
k
)



=







v
^

j



(
k
)





Λ
^

j


-




v
^

j



(
k
)







Λ
^

j

(




l
=
1

J






v
^

l



(
k
)





Λ
^

l



)


-
1




(




v
^

j



(
k
)





Λ
^

j


)







             


(
207
)







=



(












v
^

j



(
k
)




[


Λ
^

j

]


11

-











v
^

j



(
k
)


2




(


[


Λ
^

j

]

11

)

2






l
=
1

J







v
^

l



(
k
)




[


Λ
^

l

]


11













O



















O
















v
^

j



(
k
)




[


Λ
^

j

]


II

-











v
^

j



(
k
)


2




(


[


Λ
^

j

]

II

)

2






l
=
1

J







v
^

l



(
k
)




[


Λ
^

l

]


II









)








(
208
)












Thus, {circumflex over ( )}xj′(k) that is obtained by base-converting the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) by the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P (hereinafter, referred to as an estimation value of the source signal component subjected to simultaneous decorrelation) can be updated by expression (173), and Σj′(k) that is obtained by base-converting the posterior covariance matrix Σj(k) by the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P (hereinafter, referred to as a posterior covariance matrix subjected to simultaneous decorrelation) can be updated by expression (174). At this point, the inverse matrix operation and the matrix multiplication become an inverse matrix operation and matrix multiplication for a diagonal matrix due to simultaneous diagonalization, so that the calculation amount thereof is not O(I3) but O(I), and calculation can be performed with a small calculation amount.


At the M-step, the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k), the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation, and the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P are updated based on maximization of a Q-function described below (refer to expression (209) and expression (210)). However, a constant term independent of Θ is ignored in expression (209) and expression (210), so that an equal sign represents that both sides thereof are equal to each other except a difference between constants independent of Θ.










Q


(
Θ
)


=



-




k
=
1

K






j
=
1

J



[


ln






det
(



v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1



)


+









           


(
209
)










tr


{



(



v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1



)


-
1




(





x
^

j



(
k
)







x
^

j



(
k
)


H


+


Σ
j



(
k
)



)


}


]









=



-




k
=
1

K






j
=
1

J



[


ln






det
(



v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1



)


+












(
210
)










tr


{



(



v
j



(
k
)




Λ
j


)


-
1




(





x
^

j




(
k
)







x
^

j




(
k
)


H


+


Σ
j




(
k
)



)


}


]










By partially differentiating the Q-function in expression (210) with respect to the variance parameter vj(k) to be 0, the following expression (211) and expression (212) are obtained.











v
j



(
k
)


=




1
I



tr


[



(

Λ
j

)


-
1




(





x
^

j




(
k
)







x
^

j




(
k
)


H


+


Σ
j




(
k
)



)


]







                                           


(
211
)







=




1
I






i
=
1

I









[



x
^

j




(
k
)


]

i



2

+


[


Σ
j




(
k
)


]

ii




[

Λ
j

]

ii











(
212
)








In this way, an update expression of the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) in expression (175) is derived.


Assuming that the Q-function in expression (210) that is partially differentiated with respect to the spatial covariance matrix Λj subjected to simultaneous decorrelation is equal to the zero matrix, the following expression (213) and expression (214) are obtained.










Λ
j

=




1
K






k
=
1

K




1


v
j



(
k
)





(


diag


(






x
^

j




(
k
)




2

)


+


Σ
j




(
k
)



)








      


(
213
)







=



(








1
K






k
=
1

K




1


v
j



(
k
)





(






[



x
^

j




(
k
)


]

1



2

+












[


Σ
j




(
k
)


]

11

)











O



















O












1
K






k
=
1

K




1


v
j



(
k
)





(






[



x
^

j




(
k
)


]

I



2

+












[


Σ
j




(
k
)


]

II

)







)








(
214
)








In this way, an update rule for the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation in expression (176) is derived. In expression (213), an absolute value square |•|2 is assumed to be calculated for each element of a vector, and diag(α) with respect to the vector α represents a diagonal matrix including an element of α as a diagonal element.


By partially differentiating the Q-function in expression (209) with respect to a complex conjugate P* of the simultaneous decorrelation matrix P, the following expression (215) is obtained.












Q




P
*



=



KJ


(

P
H

)



-
1


-




j
=
1

J




(




k
=
1

K




1


v
j



(
k
)





(





x
^

j



(
k
)







x
^

j



(
k
)


H


+


Σ
j



(
k
)



)



)


P






Λ
j

-
1









(
215
)







Assuming that this expression is equal to the zero matrix and taking vec of both sides thereof, the following expression (216) is obtained.










vec


(
P
)


=



[


1
JK






j
=
1

J




Λ
j

-
1





(




k
=
1

K




1


v
j



(
k
)





(





x
^

j



(
k
)







x
^

j



(
k
)


H


+


Σ
j



(
k
)



)



)




]


-
1




vec
(


(

P
H

)


-
1


)






(
216
)







In this case, vec(A) with respect to the matrix A represents a column vector obtained by longitudinally arranging a column vector of the matrix A. Regarding this fact, a formula of the following expression (217) is established.

vec(AXB)=(BT⊗A)vec(X)  (217)


A symbol circling “x” represents a Kronecker product. Based on a property of a block diagonal matrix, expression (216) is equivalent to the following expression (218) to expression (220).











[
P
]

i

=






(


1
JK






j
=
1

J




1


[

Λ
j

]

ii







k
=
1

K




1


v
j



(
k
)





(





x
^

j



(
k
)







x
^

j



(
k
)


H


+


Σ
j



(
k
)



)






)


-
1


[


(

P
H

)


-
1


]

i





     


(
218
)







=




P
(


1
JK






j
=
1

J




1


[

Λ
j

]

ii







k
=
1

K




1


v
j



(
k
)





(





x
^

j




(
k
)







x
^

j




(
k
)


H


+


Σ
j




(
k
)



)






)


-
1









(
219
)












P
H

[


(

P
H

)


-
1


]

i










=





P
(


1
JK






j
=
1

J




1


[

Λ
j

]

ii







k
=
1

K




1


v
j



(
k
)





(





x
^

j




(
k
)







x
^

j




(
k
)


H


+


Σ
j




(
k
)



)






)


-
1




e
i









(
220
)








Thus, based on a fixed point method, the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P may be updated by expression (177). In this way, on the basis of the fixed point method, a learning rate is not requested to be adjusted unlike a gradient method or a natural gradient method, so that optimization can be stably performed.


The following describes derivation of the posterior probability in detail. Logarithm posterior probabilities of hidden variables x1(k), . . . , xJ-1(k) can be written down as represented by the following expression (221) to expression (223).










ln






p


(



x
1



(
k
)


,





,



x

J
-
1




(
k
)




y


(
k
)



,
Θ

)





=
c





ln






p


(



y


(
k
)





x
1



(
k
)



,





,


x

J
-
1




(
k
)


,
Θ

)



+

ln






p


(



x
1



(
k
)


,





,



x

J
-
1




(
k
)



Θ


)






=
c

-





(
221
)










(


y


(
k
)


-




j
=
1


J
-
1





x
j



(
k
)




)

H




(



v
J



(
k
)




R
J


)


-
1




(


y


(
k
)


-




j
=
1


J
-
1





x
j



(
k
)




)


-




j
=
1


J
-
1







x
j



(
k
)


H




(



v
j



(
k
)




R
j


)


-
1





x
j



(
k
)





=
-




(
222
)









(


y


(
k
)


-

A







x
~



(
k
)




)

H




(



v
J



(
k
)




R
J


)


-
1




(


y


(
k
)


-

A







x
~



(
k
)




)


-




x
~



(
k
)


H




(






v
1



(
k
)




R
1




O





O




O





v
2



(
k
)




R
2







O


















O


O








v

J
-
1




(
k
)




R

J
-
1






)


-
1





x
~



(
k
)







(
223
)







In this case, a symbol with “c” added above the equal sign “=” represents that both sides are equal to each other except a difference between constants independent of x1(k), . . . , xJ-1(k). The matrix A represents a matrix of the following expression (224) in which J−1 I-th order unit matrices I are horizontally arranged. The spatial covariance matrix Rj (j=1, . . . , J) is given by expression (197) to expression (199). In the present specification, I is used for representing two kinds of meaning, that is, the number of observation signals (scalar) and the unit matrix (matrix), but confusion will not be caused because whether I is a scalar or a matrix is clear from the context.

Acustom character(I . . . I)  (224)


{tilde over ( )}x(k) represents a vector of the following expression (225) in which x1(k), . . . , xJ-1(k) are longitudinally arranged.

{tilde over (x)}(k)=(x1(k)T . . . xJ-1(k)T)T  (225)


Based on expression (223), it is derived that the posterior probabilities of the hidden variables x1(k), . . . , xJ-1(k) are a complex gaussian distribution of the following expression (226).

p(x1(k), . . . ,xJ-1(k)|y(k),Θ)=custom character({tilde over (x)}(k);{tilde over ({circumflex over (x)})}(k),{tilde over (Σ)}(k))  (226)


In this case, a covariance matrix {tilde over ( )}Σ(k) and a mean {circumflex over ( )}{tilde over ( )}x(k) are given by the following expression (227) to expression (230) and expression (231) to expression (233). An inverse matrix lemma is used at the time of transforming expression (227) into expression (228).












~



(
k
)


=



(





A
H



(



v
J



(
k
)




R
J


)



-
1



A

+









(
227
)












(






v
1



(
k
)




R
1




O





O




O





v
2



(
k
)




R
2







O


















O


O








v

J
-
1




(
k
)




R

J
-
1






)


-
1


)


-
1










=




(






v
1



(
k
)




R
1




O





O




O





v
2



(
k
)




R
2







O


















O


O








v

J
-
1




(
k
)




R

J
-
1






)

-














(






v
1



(
k
)




R
1




O





O




O





v
2



(
k
)




R
2







O


















O


O








v

J
-
1




(
k
)




R

J
-
1






)

×

A
H














(




v
J



(
k
)




R
J


+

A


(






v
1



(
k
)




R
1




O





O




O





v
2



(
k
)




R
2







O


















O


O








v

J
-
1




(
k
)




R

J
-
1






)

















A
H

)


A
×












(






v
1



(
k
)




R
1




O





O




O





v
2



(
k
)




R
2







O


















O


O








v

J
-
1




(
k
)




R

J
-
1






)








(
228
)







=




(






v
1



(
k
)




R
1




O





O




O





v
2



(
k
)




R
2







O


















O


O








v

J
-
1




(
k
)




R

J
-
1






)

-








(
229
)











(






v
1



(
k
)




R
1














v

J
-
1




(
k
)




R

J
-
1






)




(




j
=
1

J









v
j



(
k
)




R
j



)


-
1














(
230
)










(



v
1



(
k
)




R
1














v

J
-
1




(
k
)




R

J
-
1



)













x
~

^



(
k
)


=





~




(
k
)





A
H



(



v
J



(
k
)




R
J


)



-
1




y


(
k
)











(
231
)







=



(


(






v
1



(
k
)




R
1














v

J
-
1




(
k
)




R

J
-
1






)

-


(






v
1



(
k
)




R
1














v

J
-
1




(
k
)




R

J
-
1






)




(




j
=
1

J









v
j



(
k
)




R
j



)


-
1






















(




j
=
1


J
-
1










v
j



(
k
)




R
j



)

)








(



v
J



(
k
)




R
J


)


-
1




y


(
k
)









(
232
)







=




(






v
1



(
k
)




R
1














v

J
-
1




(
k
)




R

J
-
1






)



(




j
=
1

J









v
j



(
k
)




R
j



)



y


(
k
)










(
233
)








As it can be found later, at the E-step, the entire matrix {tilde over ( )}Σ is not requested to be calculated, and only a diagonal block may be calculated, the diagonal block at the time when the matrix {tilde over ( )}Σ is regarded as the (J−1)-th order square block matrix including the I-th order square matrix as an element. Where the j-th diagonal block is assumed to be written as Σj(k), this can be calculated as the following expression (234). Clearly, in a case of J=2, expression (234) is equivalent to expression (101).












j



(
k
)


=




v
j



(
k
)




R
j


-



v
j



(
k
)






R
j



(




l
=
1

J









v
l



(
k
)




R
l



)



-
1




(



v
j



(
k
)




R
j


)







(
234
)







When {circumflex over ( )}xj(k) is defined as (j, 1) block at the time when {circumflex over ( )}{tilde over ( )}x(k) is regarded as a block matrix having a size of (J−1)×1 and including an I-dimensional column vector as an element, this is represented by the following expression (235).












x
^

j



(
k
)


=



v
j



(
k
)






R
j



(




l
=
1

J









v
l



(
k
)




R
l



)



-
1




y


(
k
)







(
235
)







It is noted that {circumflex over ( )}xj(k) and Σj(k) are also represented by the following expression (236) and expression (237).

{circumflex over (x)}j(k)=ε(xj(k)|y(k))  (236)
Σj(k)=ε((xj(k)−{circumflex over (x)}j(k))(xj(k)−{circumflex over (x)}j(k))H|y(k))  (237)


In the above expression, j=1, . . . , J−1. E(•|y(k)) is an expected value (expression (238)) related to a posterior probability p (x1(k), . . . , xJ-1(k)|y(k), Θ) of the hidden variable.

ε(⋅|y(k))custom charactercustom character . . . custom character(⋅)p(x1(k), . . . ,xJ-1(k)|y(k),{circumflex over (Θ)})dx1(k) . . . dxJ-1(k)   (238)


Similarly to this operation, with respect to j=J, {circumflex over ( )}xJ(k) and ΣJ(k) are defined as represented by the following expression (239) to expression (243) and expression (244) to expression (251).












x
^

J



(
k
)




=






ɛ


(



y


(
k
)


-




j
=
1


J
-
1





x
j



(
k
)






y


(
k
)



)









(
239
)







=



ɛ


(



y


(
k
)


-

A



x
~



(
k
)






y


(
k
)



)









(
240
)







=




y


(
k
)


-

A




x
~

^



(
k
)











(
241
)







=




y


(
k
)


-




j
=
1


J
-
1






x
^

j



(
k
)











(
242
)







=





v
J



(
k
)






R
J



(




l
=
1

J









v
l



(
k
)




R
l



)



-
1




y


(
k
)










(
243
)










J



(
k
)


=



ɛ
(


(


y


(
k
)


-




j
=
1


J
-
1





x
j



(
k
)



-

y


(
k
)


+




j
=
1


J
-
1






x
^

j



(
k
)




)

×









(
244
)












(


y


(
k
)


-




j
=
1


J
-
1





x
j



(
k
)



-

y


(
k
)


+




j
=
1


J
-
1






x
^

j



(
k
)




)

H



y


(
k
)



)









=



ɛ
(

(





j
=
1


J
-
1





x
j



(
k
)



-




j
=
1


J
-
1






x
^

j



(
k
)




)









(
245
)












(





j
=
1


J
-
1





x
j



(
k
)



-




j
=
1


J
-
1






x
^

j



(
k
)




)

H



y


(
k
)



)









=



ɛ


(



A


(



x
~



(
k
)


-



x
~

^



(
k
)



)





(



x
~



(
k
)


-



x
~

^



(
k
)



)

H



A
H




y


(
k
)



)









(
246
)







=



A






ɛ


(



(



x
~



(
k
)


-



x
~

^



(
k
)



)




(



x
~



(
k
)


-



x
~

^



(
k
)



)

H




y


(
k
)



)




A
H









(
247
)







=



A




~




(
k
)



A
H











(
248
)







=







j
=
1


J
-
1






v
j



(
k
)




R
j



-


(




j
=
1


J
-
1






v
j



(
k
)




R
j



)




(




j
=
1

J





v
j



(
k
)




R
j



)


-
1











(
249
)










(




j
=
1


J
-
1






v
j



(
k
)




R
j



)










=




(




j
=
1


J
-
1






v
j



(
k
)




R
j



)




(




j
=
1

J





v
j



(
k
)




R
j



)


-
1




(



v
J



(
k
)




R
J


)









(
250
)








=






v
J



(
k
)




R
J


-


(



v
J



(
k
)




R
J


)




(




j
=
1

J





v
j



(
k
)




R
j



)


-
1




(



v
J



(
k
)




R
J


)















(
251
)








Accordingly, the following expression (252) and expression (253) are established with respect to j=1, . . . , J.












x
^

j



(
k
)


=



v
j



(
k
)






R
j



(




l
=
1

J









v
l



(
k
)




R
l



)



-
1




y


(
k
)







(
252
)









j



(
k
)


=




v
j



(
k
)




R
j


-



v
j



(
k
)






R
j



(




l
=
1

J









v
l



(
k
)




R
l



)



-
1




(



v
j



(
k
)




R
j


)







(
253
)







The following describes derivation of the Q-function in detail. Logarithms of joint distribution of the observation signal vector y(k) and the hidden variables x1(k), . . . , xJ-1(k) are written down as represented by the following expression (254) to expression (256). However, a constant term is ignored in expression (254) to expression (256), so that the equal sign represents that both sides thereof are equal to each other except a difference between constants.










ln






p


(



x
1



(
k
)


,





,



x

J
-
1




(
k
)



Θ


)



=



ln






p


(



y


(
k
)





x
1



(
k
)



,





,


x

J
-
1




(
k
)


,
Θ

)



+

ln






p


(



x
1



(
k
)


,





,



x

J
-
1




(
k
)



Θ


)




=
-





(
254
)










j
=
1


J
-
1




ln






det


(



v
j



(
k
)




R
j


)




-



(


y


(
k
)


-




j
=
1


J
-
1





x
j



(
k
)




)

H




(



v
J



(
k
)




R
J


)


-
1




(


y


(
k
)


-




j
=
1


J
-
1





x
j



(
k
)




)


-




(
255
)














j
=
1


J
-
1







x
j



(
k
)


H




(



v
j



(
k
)




R
j


)


-
1





x
j



(
k
)








(
256
)







In this case, it is noted that the following expressions (expression (257) to expression (262)) are established.










ɛ


(




x
j



(
k
)






x
j



(
k
)


H




y


(
k
)



)


=


ɛ


(



(



x
j



(
k
)


-



x
^

j



(
k
)


+



x
^

j



(
k
)



)




(



x
j



(
k
)


-



x
^

j



(
k
)


+



x
^

j



(
k
)



)

H




y


(
k
)



)


=





(
257
)














j



(
k
)


+




x
^

j



(
k
)







x
^

j



(
k
)


H







(


j
=
1

,





,

J
-
1


)







(
258
)












ɛ


(



(


y


(
k
)


-




j
=
1


J
-
1





x
j



(
k
)




)




(


y


(
k
)


-




j
=
1


J
-
1





x
j



(
k
)




)

H




y


(
k
)



)


=





(
259
)







ɛ


(



(


y


(
k
)


-




j
=
1


J
-
1






x
^

j



(
k
)



+




j
=
1


J
-
1






x
^

j



(
k
)



-




j
=
1


J
-
1





x
j



(
k
)




)




(


y


(
k
)


-




j
=
1


J
-
1






x
^

j



(
k
)



+




j
=
1


J
-
1






x
^

j



(
k
)



-




j
=
1


J
-
1





x
j



(
k
)




)

H




y


(
k
)



)


=




(
260
)














(


y


(
k
)


-




j
=
1


J
-
1






x
^

j



(
k
)




)




(


y


(
k
)


-




j
=
1


J
-
1






x
^

j



(
k
)




)

H


+



j



(
k
)



=





(
261
)














j



(
k
)


+




x
^

j



(
k
)







x
^

j



(
k
)


H







(
262
)







Thus, the Q-function is given by the following expressions (expression (263) to expression (266)). Expression (264) is nothing but expression (209).









Q


=
c








k
=
1

K






j
=
1

J



[



-
ln






det






(



v
j



(
k
)




R
j


)


-

tr
(


(



v
j



(
k
)




R
j


)


-
1














(
263
)











(




j



(
k
)


+




x
^

j



(
k
)







x
^

j



(
k
)


H



)

)

]









=






k
=
1

K






j
=
1

J



[



-
ln






det






(



v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1



)


-











(
264
)










tr


{



(



v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1



)


-
1




(




j



(
k
)


+




x
^

j



(
k
)







x
^

j



(
k
)


H



)


}


]









=






k
=
1

K






j
=
1

J



[



-
ln






det






(



v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1



)


-











(
265
)










tr


{



(



v
j



(
k
)




Λ
j


)


-
1




(




j




(
k
)


+




x
^

j




(
k
)







x
^

j




(
k
)


H



)


}


]









=






k
=
1

K






j
=
1

J



[



-
ln






det






(



v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1



)


-











(
266
)










tr


{



(



v
j



(
k
)




Λ
j


)


-
1




(




j




(
k
)


+

diag


(






x
^

j




(
k
)




2

)



)


}


]










In this case, a square of an absolute value |•|2 in expression (266) is assumed to be taken for each element. The symbol with “c” added above the equal sign “=” represents that both sides are equal to each other except a difference between constants independent of Θ (it is noted that the symbol is used for representing meaning different from the meaning previously described).


First Modification of Fifth Embodiment

Next, the following describes a first modification of the fifth embodiment. The first modification of the fifth embodiment describes a modification of the processing performed by the signal parameter analysis unit. In the fifth embodiment, the signal parameter analysis unit updates the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P based on the fixed point method. On the other hand, the signal parameter analysis unit according to the first modification of the fifth embodiment updates the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P based on any optimization method (for example, a gradient method, a natural gradient method, and a Newton's method) other than the fixed point method.


For example, in a case of the gradient method, the signal parameter analysis unit according to the first modification of the fifth embodiment updates the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P by the following expression (267) (this is directly derived from expression (215)).










P
^




P
^

+

μ


[



(


P
^

H

)


-
1


-


1
KJ




(


P
^

H

)


-
1







j
=
1

J








(




k
=
1

K








1



v
^

j



(
k
)





(





x
^

j




(
k
)







x
^

j




(
k
)


H


+



j




(
k
)



)



)




Λ
^

j

-
1






]







(
267
)







However, at the time of the first update of the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P performed by the signal parameter analysis unit, {circumflex over ( )}P is replaced with {circumflex over ( )}P(0) on the right side of expression (267). μ is a positive number that is called a learning rate, and may be determined to be μ=0.05, for example.


In a case of the natural gradient method, the signal parameter analysis unit according to the first modification of the fifth embodiment updates the estimation value of the simultaneous decorrelation matrix P by the following expression (268) (this is also directly derived from expression (215)).










P
^





(

1
+
μ

)



P
^


-


μ
KJ




(


P
^

H

)


-
1







j
=
1

J








(




k
=
1

K








1



v
^

j



(
k
)





(





x
^

j




(
k
)







x
^

j




(
k
)


H


+



j




(
k
)



)



)




Λ
^

j

-
1





P
^

H



P
^









(
268
)







However, at the time of the first update of the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P performed by the signal parameter analysis unit, {circumflex over ( )}P is replaced with {circumflex over ( )}P(0) on the right side of expression (268). μ is a positive number that is called a learning rate, and may be determined to be μ=0.05, for example. According to the natural gradient method, measurement of a space is appropriately considered, so that it is typically known that update can be performed more stably than the gradient method.


Second Modification of Fifth Embodiment

Next, the following describes a second modification of the fifth embodiment. The second modification of the fifth embodiment describes a modification of the processing performed by the signal parameter analysis unit. In the fifth embodiment and the first modification of the fifth embodiment, the signal parameter analysis unit updates the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P based on the Q-function. On the other hand, the signal parameter analysis unit according to the second modification of the fifth embodiment directly updates the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P based on a log-likelihood function L2(Θ).


The signal parameter analysis unit according to the second modification of the fifth embodiment updates the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P by the following expression (expression (269)) (it is assumed that E represents a unit matrix) based on the observation signal vector y′(k) subjected to simultaneous decorrelation from the simultaneous decorrelation unit, the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the spatial covariance matrix update unit, and the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit (and the initial estimation value {circumflex over ( )}P(0) of the simultaneous decorrelation matrix P from the initialization unit at the time of the first update of the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P performed by the signal parameter analysis unit).













[
P

^

]

i







P
^



(


1
K






k
=
1

K




1




j
=
1

J







v
^

j



(
k
)




[


Λ
^

j

]


ii






y




(
k
)






y




(
k
)


H




)



-
1




[
E
]


i





(
269
)







However, at the time of the first update of {circumflex over ( )}P performed by the signal parameter analysis unit, {circumflex over ( )}P is replaced with {circumflex over ( )}P(0) on the right side of expression (269). Alternatively, the signal parameter analysis unit may update the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P by successively applying expression (269) multiple times.


The following describes derivation of the update rule for the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P based on the log-likelihood function L2(Θ) in expression (269). By partially differentiating the log-likelihood function in expression (201) with respect to the complex conjugate P* of the simultaneous decorrelation matrix P, the following expression (270) is obtained.














L
2



(
Θ
)






P
*



=



K


(

P
H

)



-
1


-




k
=
1

K




y


(
k
)





y


(
k
)


H




P


(




j
=
1

J





v
j



(
k
)




Λ
j



)



-
1









(
270
)







Assuming that expression (270) is equal to a zero matrix, and taking vec of both sides thereof, the following expression (expression (271)) is obtained.










vec


(


(

P
H

)


-
1


)


=


[


1
K






k
=
1

K





(




j
=
1

J





v
j



(
k
)




Λ
j



)


-
1




(


y


(
k
)





y


(
k
)


H


)




]



vec


(
P
)







(
271
)







The matrix on the right side of expression (271) can be transformed as represented by the following expression (272) and expression (273).











1
K






k
=
1

K





(




j
=
1

J





v
j



(
k
)




Λ
j



)


-
1




(


y


(
k
)





y


(
k
)


H


)




=



1
K






k
=
1

K









(





1







j
=
1

J




v
j



(
k
)









[

Λ
j

]

11







y


(
k
)





y


(
k
)


H









O



















O









1







j
=
1

J




v
j



(
k
)









[

Λ
j

]

II







y


(
k
)





y


(
k
)


H









)




=





(
272
)






(





1
K






k
=
1

K




1







j
=
1

J




v
j



(
k
)









[

Λ
j

]

11







y


(
k
)





y


(
k
)


H











O



















O









1
K






k
=
1

K




1







j
=
1

J




v
j



(
k
)









[

Λ
j

]

II







y


(
k
)





y


(
k
)


H











)




(
273
)







Accordingly, the following expressions (expression (274) and expression (275)) are obtained.











[


(

P
H

)


-
1


]

i

=



(


1
K






k
=
1

K




1




j
=
1

J






v
j



(
k
)




[

Λ
j

]


ii





y


(
k
)





y


(
k
)


H




)



[
P
]


i





(
274
)










[
P
]

i


=




(


1
K






k
=
1

K




1




j
=
1

J






v
j



(
k
)




[

Λ
j

]


ii





y


(
k
)





y


(
k
)


H




)


-
1


[


(

P
H

)


-
1


]

i





(
275
)







By substituting expression (276) for expression (275), it is possible to obtain the update rule for the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P based on the fixed point method of expression (269).

y(k)=(PH)−1y′(k)  (276)


In this way, on the basis of the fixed point method, the learning rate is not requested to be adjusted unlike the gradient method and the natural gradient method, so that optimization can be stably performed.


Similarly to the first modification of the fifth embodiment, in the second modification of the fifth embodiment, the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P may be updated by using not only the fixed point method but also any optimization method (a gradient method, a natural gradient method, a Newton's method, and the like). The following describes such an example.


In a case of the gradient method, the signal parameter analysis unit according to the second modification of the fifth embodiment updates the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P by the following expression (277) (this is directly derived from expression (270)) based on the observation signal vector y(k) from the observation signal vector creation unit, the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the spatial covariance matrix update unit, and the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit (and the initial estimation value {circumflex over ( )}P(0) of the simultaneous decorrelation matrix P from the initialization unit at the time of the first update of the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P performed by the signal parameter analysis unit).










P
^




P
^

+

μ
[



(


P
^

H

)


-
1


-


1
K






k
=
1

K




y


(
k
)





y


(
k
)


H





P
^

(




j
=
1

J






v
^

j



(
k
)





Λ
^

j



)


-
1






]






(
277
)







However, at the time of the first update of the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P performed by the signal parameter analysis unit, {circumflex over ( )}P is replaced with {circumflex over ( )}P(0) on the right side of expression (277). Alternatively, the signal parameter analysis unit may update the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P by successively applying expression (277) multiple times. The signal parameter analysis unit may also use, instead of expression (277), an expression represented by using the observation signal vector y′(k) subjected to simultaneous decorrelation that is obtained by substituting expression (276) for expression (277). μ is a predetermined positive number that is called a learning rate, and may be determined to be μ=0.05 and the like, for example. In this case of the gradient method, the inverse matrix calculation is performed only once per one time of application of expression (277) and the matrix multiplication is not requested (the inverse matrix calculation for the diagonal matrix is not counted), so that the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P can be updated with a smaller calculation amount as compared with the case of the fixed point method.


In a case of the natural gradient method, the signal parameter analysis unit according to the second modification of the fifth embodiment updates the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P by the following expression (278) (this is also directly derived from expression (270)) based on the observation signal vector y(k) from the observation signal vector creation unit, the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the spatial covariance matrix update unit, and the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit (and the initial estimation value {circumflex over ( )}P(0) of the simultaneous decorrelation matrix P from the initialization unit at the time of the first update of the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P performed by the signal parameter analysis unit).










P
^





(

1
+
μ

)



P
^


-


μ
K






k
=
1

K




y


(
k
)





y


(
k
)


H





P
^

(




j
=
1

J






v
^

j



(
k
)





Λ
^

j



)


-
1





P
^

H



P
^









(
278
)







However, at the time of the first update of the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P performed by the signal parameter analysis unit, {circumflex over ( )}P is replaced with {circumflex over ( )}P(0) on the right side of expression (278). Alternatively, the signal parameter analysis unit may update the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P by successively applying expression (278) multiple times. The signal parameter analysis unit may also use, instead of expression (278), an expression represented by using the observation signal vector y′(k) subjected to simultaneous decorrelation that is obtained by substituting expression (276) for expression (278). μ is a predetermined positive number that is called a learning rate, and may be determined to be μ=0.05 and the like, for example. According to the natural gradient method, measurement of a space is appropriately considered, so that it is typically known that update can be performed more stably than the gradient method.


Third Modification of Fifth Embodiment

Next, the following describes a third modification of the fifth embodiment. In the third modification of the fifth embodiment, the fifth embodiment is applied for each frequency similarly to the third embodiment.


In the third modification of the fifth embodiment, it is assumed that I (I is an integral number equal to or larger than 2) observation signals yi(t) acquired by a microphone at different positions are input to the signal analysis device in a situation in which J sound source signals are present in a mixed manner. The observation signal yi(t) is subjected to frequency resolution, subjected to signal separation using the method described in the fifth embodiment for each frequency, and converted into a time domain at last.


With reference to FIG. 5 and FIG. 6, the following describes a configuration of the signal analysis device according to the third modification of the fifth embodiment, and a processing procedure of the signal analysis processing.


Pieces of processing performed by the frequency domain conversion unit 310 (Step S40, Step S41) and the time domain conversion unit 330 (Step S43) are the same as those in the third embodiment. The following describes processing performed by the frequency domain signal separation unit 320 (Step S42) as a difference.


The frequency domain signal separation unit 320 receives the observation signal of the time frequency domain represented by expression (279) from the frequency domain conversion unit 310, calculates the estimation value of the source signal component of the time-frequency domain represented by expression (280) by performing signal separation in the frequency domain using the method of the fifth embodiment for each f, and passes the estimation value to the time domain conversion unit 330 (Step S42).

yif(k)  (279)
{circumflex over (x)}jf(k)  (280)


That is, by using the observation signal represented by expression (281) as an input for f=1, the estimation value of the source signal component of expression (282) is obtained by using the method in the fifth embodiment.

yi1(k)  (281)
{circumflex over (x)}11(k), . . . ,{circumflex over (x)}J1(k)  (282)


By using the observation signal represented by expression (283) as an input for f=2, the estimation value of the source signal component of expression (284) is obtained by using the method in the fifth embodiment.

yi2(k)  (283)
{circumflex over (x)}12(k), . . . ,{circumflex over (x)}J2(k)  (284)


The same processing is repeatedly performed for each f, and finally, by using the observation signal represented by expression (285) as an input for f=F, the estimation value of the source signal component of expression (286) is obtained by using the method in the fifth embodiment.

yiF(k)  (285)
{circumflex over (x)}1F(k), . . . ,{circumflex over (x)}JF(k)  (286)


The estimation value of the source signal component of the time-frequency domain that is obtained in this way (refer to expression (287)) is passed to the time domain conversion unit 330.

{circumflex over (x)}jf(k)  (287)


Effect of Fifth Embodiment

To confirm the effect of the present invention, the following describes a sound source separation experiment using the conventional method and the fifth embodiment.



FIG. 14 is a diagram illustrating an experiment condition for the sound source separation experiment using the conventional method and the fifth embodiment. FIG. 15 is a diagram illustrating a result of the sound source separation experiment (real time factor) using the conventional method and the fifth embodiment. FIG. 16 is a diagram illustrating a result of the sound source separation experiment (sound source separation performance) using the conventional method and the fifth embodiment.


A reverberation voice was generated by convoluting a reverberation impulse response illustrated in FIG. 14 that was measured in a laboratory into an English voice of 8 seconds, and reverberation voices of a plurality of persons were added to generate the observation signal. FIG. 15 and FIG. 16 illustrate performances in a case of performing signal separation on the observation signal using respective methods. FIG. 15 illustrates the real time factor (time requested for processing when the observation signal length is 1). FIG. 16 illustrates a signal-to-distortion ratio as sound source separation performance (as a value becomes larger, the sound source separation performance is higher).


As illustrated in FIG. 15 and FIG. 16, with the method according to the fifth embodiment, the calculation time was able to be reduced by 420 times or more even in a case in which the number of sound sources was equal to or larger than 3, and sound source separation performance higher than that in the conventional method was able to be obtained.


In this way, with the fifth embodiment, the calculation time can be greatly reduced as compared with the conventional method, and signal separation performance higher than that in the conventional method can be implemented. The fifth embodiment can also be applied to a case in which the number of signals is not 2, and can be applied to a case in which the number of signals is equal to or larger than 3.


Sixth Embodiment

Next, the following describes a configuration, a processing procedure, and an effect of the signal analysis device according to a sixth embodiment. The sixth embodiment describes a specific method for implementing the modification of the fourth embodiment.


Similarly to the fifth embodiment, the sixth embodiment describes a method of implementing signal separation by estimating the variance parameter vj(k), the spatial covariance matrix Λj subjected to simultaneous decorrelation and the simultaneous decorrelation matrix P as parameters for modeling the spatial covariance matrix Rj, and the source signal component xj(k) at the same time in a situation in which J source signals are typically present in a mixed manner and the variance parameter vj(k) and the spatial covariance matrix Rj are unknown.


While the fifth embodiment is based on the EM algorithm, the sixth embodiment is based on the auxiliary function method. In the sixth embodiment, it is assumed that I observation signals yi(k) acquired at different positions are input to the signal analysis device.


[Configuration and Processing of Signal Analysis Device]


With reference to FIG. 17 and FIG. 18, the following describes the configuration of the signal analysis device and the processing procedure of the signal analysis processing according to the sixth embodiment. FIG. 17 is a diagram illustrating an example of the configuration of the signal analysis device according to the sixth embodiment. FIG. 18 is a flowchart illustrating an example of the processing procedure of the signal analysis processing according to the sixth embodiment.


As illustrated in FIG. 17, a signal analysis device 601 includes an initialization unit (not illustrated), an observation signal vector creation unit 610, a simultaneous decorrelation unit 630, a single channel Wiener filter unit 640, an observation covariance matrix update unit 650, a variance parameter update unit 660, a spatial covariance matrix update unit 670, a signal parameter analysis unit 620, a convergence determination unit (not illustrated), and a simultaneous decorrelation inverse conversion unit 680.


First, the following describes an outline of each unit of the signal analysis device. The processing performed by the initialization unit (not illustrated) (Step S81), the processing performed by the observation signal vector creation unit 610 (Steps S82 and S83), the processing performed by the simultaneous decorrelation unit 630 (Step S84), and the processing performed by the single channel Wiener filter unit 640 (Step S85) are the same as those in the fifth embodiment, so that the description thereof will not be repeated herein.


The observation covariance matrix update unit 650 receives the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 660 (as an exception, the initial estimation value {circumflex over ( )}vj(0)(k) of the variance parameter vj(k) from the initialization unit is used at the time of the first processing performed by the observation covariance matrix update unit 650) and the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the spatial covariance matrix update unit 670 (as an exception, the initial estimation value {circumflex over ( )}Λj(0) of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the initialization unit at the time of the first processing performed by the observation covariance matrix update unit 650), and updates an observation covariance matrix U′(k) subjected to simultaneous decorrelation by the following expression (288) (Step S86).











U




(
k
)







j
=
1

J






v
^

j



(
k
)





Λ
^

j







(
288
)







However, at the time of the first processing performed by the observation covariance matrix update unit 650, {circumflex over ( )}vj(k) is replaced with {circumflex over ( )}vj(0)(k), and {circumflex over ( )}Λj is replaced with {circumflex over ( )}Λj(0) in the above expression.


The variance parameter update unit 660 receives the estimation value {circumflex over ( )}xj′(k) of the source signal component xj′(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 640, the observation covariance matrix U′(k) subjected to simultaneous decorrelation from the observation covariance matrix update unit 650, and the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the spatial covariance matrix update unit 670 (as an exception, the initial estimation value {circumflex over ( )}Λj(0) of the spatial covariance matrix Λj subjected to simultaneous decorrelation from the initialization unit is used at the time of the first processing performed by the variance parameter update unit 660), and updates the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) by the following expression (expression (289)) (Step S87).












v
^

j



(
k
)









i
=
1

I








[



x
^

j




(
k
)


]

i



2



[


Λ
^

j

]

ii







i
=
1

I





[


Λ
^

j

]

ii



[


U




(
k
)


]

ii









(
289
)







However, at the time of the first processing performed by the variance parameter update unit 660, {circumflex over ( )}Λj is replaced with {circumflex over ( )}Λj(0) in the above expression.


The spatial covariance matrix update unit 670 receives the estimation value {circumflex over ( )}xj′(k) of the source signal component xj′(k) subjected to simultaneous decorrelation from the single channel Wiener filter unit 640, the observation covariance matrix U′(k) subjected to simultaneous decorrelation from the observation covariance matrix update unit 650, and the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) from the variance parameter update unit 660, and updates the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation by the following expression (290) (Step S88).











[


Λ
^

j

]

im



{










k
=
1

K








[



x
^

j




(
k
)


]

ii



2




v
^

j



(
k
)








k
=
1

K






v
^

j



(
k
)




[


U




(
k
)


]

ii





,




i
=
m






0
,




i

m









(
290
)







The signal parameter analysis unit 620 performs processing similar to that in the second modification of the fifth embodiment to update the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P (Step S89).


The convergence determination unit (not illustrated) determines whether the update converges (Step S90). If the convergence determination unit determines that the update does not converge (No at Step S90), the process returns to the processing performed by the simultaneous decorrelation unit 630 (Step S84), and the processing is continued.


On the other hand, if the convergence determination unit determines that the update converges (Yes at Step S90), the processing performed by the simultaneous decorrelation unit 630 (Step S91), the processing performed by the single channel Wiener filter unit 640 (Step S92), and the processing performed by the simultaneous decorrelation inverse conversion unit 680 (Step S93) are performed similarly to the fifth embodiment, and the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) is calculated to be output, accordingly.


[Background of Processing Performed by Signal Analysis Device]


Next, the following describes a background of the signal analysis device according to the sixth embodiment. As described above in the fifth embodiment, to calculate the estimation value {circumflex over ( )}xj(k) of the source signal component xj(k) (that is, to implement signal separation) by expression (180) in a situation in which the variance parameter vj(k) and the spatial covariance matrix Rj are unknown, the variance parameter vj(k) and the spatial covariance matrix Rj are requested to be estimated.


By solving the optimization problem of expression (181) based on the auxiliary function method, maximum likelihood estimation of the variance parameter vj(k) and the spatial covariance matrix Rj can be performed.


In the auxiliary function method, a step of maximizing an auxiliary function L1(Ψ, Φ) for an auxiliary variable Φ to update Φ based on the auxiliary variable Φ and the auxiliary function L1(Ψ, Φ) satisfying the following expression, and a step of updating the parameter Ψ so that the auxiliary function L1(Ψ, Φ) is not reduced are alternately iterated (refer to expression (291)).










(






Ψ

)



(



max
Φ




L
1
-



(

Ψ
,
Φ

)



=


L
1



(
Ψ
)



)





(
291
)







In the sixth embodiment, L1(Ψ, Φ) in expression (292) is used as the auxiliary function.











L
1
-



(

Ψ
,
Φ

)




=
Δ




-




k
=
1

K



[


ln





det






U


(
k
)



+




j
=
1

J



tr


(



v
j



(
k
)




R
j




U


(
k
)



-
1



)



+




j
=
1

J



tr


{


y


(
k
)





y


(
k
)


H





G
j



(
k
)


H




(



v
j



(
k
)




R
j


)


-
1





G
j



(
k
)



}




]



+
constant





(
292
)







In this case, the “constant” on the right side is a constant independent of the parameter Ψ and the auxiliary variable Φ. It can be indicated that expression (292) satisfies the condition for the auxiliary function of expression (291) similarly to reference document 3 and reference document 4. It is assumed that the auxiliary variable Φ is constituted of U(k) and Gj(k), U(k) is a positive-definite Hermitian matrix, and Gj(k) satisfies expression (293).













j
=
1

J




G
j



(
k
)



=
E




(
293
)







At the step of maximizing the auxiliary function L1(Ψ, Φ) for the auxiliary variable Φ to update Φ, U(k) and Gj(k) are updated by expression (294) and expression (295) based on the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) and the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj. U(k) is nothing but a covariance matrix of the observation signal vector y(k), so that U(k) is referred to as an observation covariance matrix.










U


(
k
)







j
=
1

J






v
^

j



(
k
)





R
^

j







(
294
)








G
j



(
k
)







v
^

j



(
k
)







R
^

j

(




l
=
1

J






v
^

l



(
k
)





R
^

l



)


-
1







(
295
)







At the step of updating the parameter Ψ so that the auxiliary function L1(Ψ, Φ) is not reduced, the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) and the estimation value {circumflex over ( )}Rj of the spatial covariance matrix Rj are updated by the following expression (296) and expression (297) based on the auxiliary variable that is updated as described above.












v
^

j



(
k
)






tr


(



G
j



(
k
)




y


(
k
)





y


(
k
)


H





G
j



(
k
)


H




R
^

j

-
1



)



tr


(



U


(
k
)



-
1





R
^

j


)








(
296
)








R
^

j




C
j

-
1



#


D
j






(
297
)







In this case, A#B is a geometric mean of A and B defined by expression (298).

A#B=B1/2(B−1/2AB−1/2)1/2B1/2  (298)


The matrix Cj and the matrix Dj are calculated by expression (299) and expression (300), respectively.










C
j






k
=
1

K






v
^

j



(
k
)





U


(
k
)



-
1








(
299
)







D
j






k
=
1

K




1



v
^

j



(
k
)






G
j



(
k
)




y


(
k
)





y


(
k
)


H





G
j



(
k
)


H







(
300
)







Update expressions of expression (294) to expression (300) can be derived similarly to reference document 3 and reference document 4. Additionally, by eliminating the auxiliary variable Gj(k) by substitution, an update rule in which the auxiliary variable Gj(k) does not appear can be derived. The update rule (one time of iteration) in this case is represented by the following expression (301) to expression (305). Clearly, in a case of J=2, update based on expression (301) to expression (305) is equivalent to the update based on expression (156) to expression (159).










U


(
k
)







j
=
1

J






v
^

j



(
k
)





R
^

j







(
301
)









v
^

j



(
k
)






v
j



(
k
)






tr
(



U


(
k
)



-
1




y


(
k
)





y


(
k
)


H




U


(
k
)



-
1





R
^

j


)


tr
(



U


(
k
)



-
1





R
^

j


)








(
302
)







C
j






k
=
1

K






v
^

j



(
k
)





U


(
k
)



-
1








(
303
)







D
j






R
^

j

[




k
=
1

K






v
^

j



(
k
)





U


(
k
)



-
1




y


(
k
)





y


(
k
)


H




U


(
k
)



-
1




]




R
^

j






(
304
)








R
^

j




C
j

-
1



#


D
j






(
305
)







However, in the method described above, the inverse matrix calculation U(k)−1 included in expression (302) to expression (304) is requested to be performed for each sample point. Thus, the method described above requests a huge amount of calculation.


On the other hand, in the sixth embodiment, similarly to the fifth embodiment, by performing the inverse matrix calculation U(k)−1 to result in an inverse matrix calculation with respect to a diagonal matrix that can be calculated with a small calculation amount based on simultaneous decorrelation of the J source signal components x1(k), . . . , xJ(k), signal separation with a small calculation amount is implemented. Specifically, in the sixth embodiment, similarly to the fifth embodiment, under the constraint condition that J spatial covariance matrices are represented in the forms of expression (197) to expression (199), maximum likelihood estimation of the simultaneous decorrelation matrix P and the spatial covariance matrices Λ1, . . . , ΛJ subjected to simultaneous decorrelation is performed. This maximum likelihood estimation is formulated as the optimization problem of expression (200). In the sixth embodiment, the maximum likelihood estimation described above is implemented based on the auxiliary function method. In this method, by applying one time of iteration in the auxiliary function method, a step of updating the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) and the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation, and a step of updating the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P are alternately iterated.


The step of updating the estimation value {circumflex over ( )}vj(k) of the variance parameter vj(k) and the estimation value {circumflex over ( )}Λj of the spatial covariance matrix Λj subjected to simultaneous decorrelation on the basis of the auxiliary function method is based on the auxiliary function L2(Θ, Φ) of expression (306).











L
2
-



(

Θ
,
Φ

)




=
Δ







k
=
1

K



[


ln





det






U


(
k
)



+




j
=
1

J



tr
(



v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1





U


(
k
)



-
1



)


+




j
=
1

J



tr


{


y


(
k
)





y


(
k
)


H





G
j



(
k
)


H




(



v
j



(
k
)





(

P

-
1


)

H



Λ
j



P

-
1



)


-
1





G
j



(
k
)



}




]


+
constant





(
306
)







In this case, the “constant” on the right side is a constant independent of the parameter Θ and the auxiliary variable Φ.


Similarly to the above process, by eliminating the auxiliary variable Gj(k) by substitution, the update rule is given by the following expression (307) to expression (311). In this case, D in italic on the right side of expression (310) is an operator for replacing a nondiagonal component of the matrix with 0.










U


(
k
)







j
=
1

J






v
^

j



(
k
)





(


P
^


-
1


)

H




Λ
^

j




P
^


-
1








(
307
)









v
^

j



(
k
)







v
^

j



(
k
)






tr
(



U


(
k
)



-
1




y


(
k
)





y


(
k
)


H




U


(
k
)



-
1





(


P
^


-
1


)

H




Λ
^

j




P
^


-
1



)


tr
(



U


(
k
)



-
1





(


P
^


-
1


)

H




Λ
^

j




P
^


-
1



)








(
308
)







C
j







k
=
1

K






v
^

j



(
k
)





P
^


-
1





U


(
k
)



-
1





(


P
^


-
1


)

H







(
309
)







D
j







Λ
^

j

[




k
=
1

K






v
^

j



(
k
)



𝒟


{



P
^


-
1





U


(
k
)



-
1




y


(
k
)





y


(
k
)


H




U


(
k
)



-
1





(


P
^


-
1


)

H


}



]




Λ
^

j






(
310
)








Λ
^

j





(

C
j


)


-
1



#


D
j







(
311
)







These can be further rewritten to be the following expression (312) to expression (315).











y




(
k
)






P
^

H



y


(
k
)







(
312
)








U




(
k
)







j
=
1

J






v
^

j



(
k
)





Λ
^

j







(
313
)









v
^

j



(
k
)










x
^

j




(
k
)


H




Λ
^

j

-
1






x
^

j




(
k
)




tr
(




U




(
k
)



-
1





Λ
^

j


)







(
314
)








Λ
^

j





(




k
=
1

K






v
^

j



(
k
)






U




(
k
)



-
1




)


-

1
2






(




k
=
1

K




1



v
^

j



(
k
)





𝒟


(




x
^

j




(
k
)







x
^

j




(
k
)


H


)




)


1
2







(
315
)







Derivation of the update rule for the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P is the same as that in the second modification of the fifth embodiment, so that description thereof will not be repeated herein. In this way, the update rule in the sixth embodiment is derived.


Modification of Sixth Embodiment

In the sixth embodiment, the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P is updated based on log likelihood, but the estimation value {circumflex over ( )}P of the simultaneous decorrelation matrix P may be updated based on the auxiliary function. Details are the same as that in the fifth embodiment and the first modification of the fifth embodiment, so that the description thereof will not be repeated herein.


Effect of Sixth Embodiment

According to the sixth embodiment, a calculation amount can be significantly reduced based on simultaneous decorrelation. With the EM algorithm, high signal separation performance is not necessarily implemented in a case in which the number of times of iteration is small. However, according to the sixth embodiment, high signal separation performance can be implemented even in a case in which the number of times of iteration is small based on the auxiliary function method. The sixth embodiment can also be applied to a case in which the number of signals is equal to or larger than 3.


Seventh Embodiment

A seventh embodiment describes a method of enabling signal separation to be performed with a small calculation amount even in a case in which the total number L of the source signals included in the observation signal satisfies L≥3. Such methods are also described in the fifth embodiment and the sixth embodiment, but the present embodiment describes another implementation method.


In the fifth embodiment and the sixth embodiment, modeling (refer to expression (179)) is performed such that “the observation signal includes all of the L source signals at all time-frequency points”, and signal separation is implemented with a small calculation amount even in a case of L≥3 based on simultaneous decorrelation of all of the L source signals (simultaneous diagonalization of spatial covariance matrices of all of the L source signals). This simultaneous decorrelation (simultaneous diagonalization) is based on the constraint condition (refer to expression (197) to expression (199)) that “the spatial covariance matrices of all of the L source signals can be subjected to simultaneous diagonalization with one simultaneous decorrelation matrix”. However, this constraint condition is not actually realistic in some cases. In such a case, modeling accuracy may be lowered in the fifth embodiment and the sixth embodiment, and as a result, signal separation performance may be lowered. The fifth embodiment and the sixth embodiment are based on simultaneous decorrelation of all of the L source signals (simultaneous diagonalization of the spatial covariance matrices of all of the L source signals), so that J=L is satisfied assuming that a maximum number of the source signals subjected to simultaneous decorrelation (maximum number of spatial covariance matrices of the source signals subjected to simultaneous diagonalization) is J. Thus, both of J and L are represented as J in the fifth embodiment and the sixth embodiment.


On the other hand, in the seventh embodiment, a sparse property of the source signal in the time-frequency domain is noted. The source signal often has a sparse property in the time-frequency domain, that is, the property that “energy concentrates on a small number of time-frequency components” (for example, refer to reference document 5 “O. Yilmaz and S. Rickard, “Blind separation of speech mixtures via time-frequency masking”, IEEE Transactions on Signal Processing, vol. 52, no. 7, pp. 1830-1847, July 2004”). In this case, it can be often assumed that the observation signal does not include all of the L source signals but includes source signals the number of which is smaller than L at each time-frequency point.


Thus, in the seventh embodiment, modeling is performed such that “the observation signal includes only two source signals at each time frequency point”, and signal separation is implemented with a small calculation amount even in a case of L≥3 based on simultaneous decorrelation of only two source signals included in the observation signal at each time-frequency point (simultaneous diagonalization of the spatial covariance matrix of only two source signals included in the observation signal at each time-frequency point). Two or less source signals can be subjected to simultaneous decorrelation by one simultaneous decorrelation matrix at all times (the spatial covariance matrices of two or less source signals can be subjected to simultaneous diagonalization by one simultaneous decorrelation matrix at all times) based on the generalized eigenvalue problem, so that an exceptional constraint condition is not requested to be imposed on the spatial covariance matrix of the source signal in the seventh embodiment unlike the fifth embodiment and the sixth embodiment.


Thus, according to the seventh embodiment, modeling can be performed with high accuracy even in a case in which the constraint condition such that “the spatial covariance matrices of all of the L source signals can be subjected to simultaneous diagonalization by one simultaneous decorrelation matrix” is not realistic, and as a result, high signal separation performance can be implemented even in such a case. In the seventh embodiment, the maximum number J of the source signals subjected to simultaneous decorrelation (maximum number J of the spatial covariance matrices of the source signals subjected to simultaneous diagonalization) is represented as J=2.


In the seventh embodiment, it is assumed that I (I is a natural number equal to or larger than 2) observation signals yi(t) acquired at different positions are input to the signal analysis device in a situation in which L (L is a natural number equal to or larger than 2) source signals are present in a mixed manner.


[Configuration and Processing of Signal Analysis Device]


With reference to FIG. 5 and FIG. 6, a configuration of the signal analysis device and a processing procedure of the signal analysis processing according to the seventh embodiment. FIG. 5 is a diagram illustrating an example of the configuration of the signal analysis device according to the seventh embodiment. FIG. 6 is a flowchart illustrating an example of the processing procedure of the signal analysis processing according to the seventh embodiment.


As illustrated in FIG. 5, the signal analysis device 301 includes the frequency domain conversion unit 310, the frequency domain signal separation unit 320, and the time domain conversion unit 330.


First, the following describes an outline of each unit of the signal analysis device 301. The frequency domain conversion unit 310 receives the input observation signal yi(t) (Step S40), and calculates the observation signal yi(k, f) of the time-frequency domain by short-time Fourier transformation and the like (Step S41). In this case, k represents an index of the time frame, and f represents an index of the frequency bin. yi(k, f) is also represented as yif(k).


The frequency domain signal separation unit 320 performs signal separation for each f (Step S42), and obtains the estimation value {circumflex over ( )}xj(k, f) of the source signal component xj(k, f) of the time-frequency domain. {circumflex over ( )}xj(k, f) is also represented as {circumflex over ( )}xjf(k).


The time domain conversion unit 330 calculates the estimation value {circumflex over ( )}xj(t) of the source signal component xj(t) of the time domain by inverse short-time Fourier transformation and the like based on the estimation value {circumflex over ( )}xj(k, f) of the source signal component xj(k, f) of the time-frequency domain, and outputs the estimation value (Step S43).


Next, the following describes the frequency domain signal separation unit 320 in more detail. With reference to FIG. 19 and FIG. 20, the following describes the configuration of the frequency domain signal separation unit 320, and a processing procedure of frequency domain signal separation processing. FIG. 19 is a diagram illustrating an example of the configuration of the frequency domain signal separation unit 320. FIG. 20 is a flowchart illustrating an example of the processing procedure of frequency domain signal separation processing.


As illustrated in FIG. 19, the frequency domain signal separation unit 320 includes an initialization unit (not illustrated), an observation signal vector creation unit 710, a signal parameter analysis unit 720, a simultaneous decorrelation unit 730, a signal presence posterior probability update unit 740, a single channel Wiener filter unit 750, a posterior covariance matrix update unit 760, a mixture weight update unit 770, a variance parameter update unit 780, a spatial covariance matrix update unit 790, a convergence determination unit (not illustrated), and a source signal component estimation unit 800. Processing by each of these units of the frequency domain signal separation unit 320 is performed for each frequency bin.


The frequency domain signal separation unit 320 initializes an index f of the frequency bin by f←0 (Step S101). The frequency domain signal separation unit 320 increments the index f of the frequency bin by 1 by f←f+1 (Step S102).


The initialization unit (not illustrated) sets an initial estimation value {circumflex over ( )}αB(0)(f) of a mixture weight αB(f), an initial estimation value {circumflex over ( )}vj(0)(k, f) of a variance parameter vj(k, f), and an initial estimation value {circumflex over ( )}Rj(0)(f) of a spatial covariance matrix Rj(f) (Step S103). These values may be set based on a random number, for example.


The observation signal vector creation unit 710 receives the observation signal yi(k, f) of the time-frequency domain from the frequency domain conversion unit (Step S104), and creates the observation signal vector y(k, f) by expression (316) (Step S105).

y(k,f)=[y1(k,f) . . . yI(k,f)]T  (316)


The following describes modeling of the observation signal vector y(k, f) in the seventh embodiment. In the present embodiment, it is assumed that the observation signal vector y(k, f) includes only two source signal components at each time-frequency point (k, f). The source signal components included in the observation signal vector y(k, f) at each time-frequency point (k, f) can be represented by a set B(k, f) as the whole index of the source signal components (in this case, it is assumed that an original number #B(k, f) of the set B(k, f) satisfies #B(k, f)≤2). In this case, the observation signal vector y(k, f) is modeled by expression (317) as a sum of source signal components xj(k, f) with respect to all sources j of B(k, f).










y


(

k
,
f

)


=




j


B


(

k
,
f

)







x
j



(

k
,
f

)







(
317
)







For example, in a case in which B(k, f)={1, 3} is established at a certain time-frequency point (k, f), expression (317) will be expression (318) at (k, f).

y(k,f)=x1(k,f)+x3(k,f)  (318)


The whole values (set) that can be taken by B(k, f) are represented as a set system H constituted of subsets of set {1, . . . , L} that is not empty (that is, subsets of a power set 2{1, . . . . , L} not including an empty set as a source). For example, in a case in which B(k, f) may be any set B⊂{1, . . . , L} that is not empty and satisfies #B≤2, H is given by expression (319) and expression (320).










=



{

B
|

B




[

1
,





,
L

}






and





1



#

B


2



}





                        


(
319
)







=



{


{
1
}

,





,

{
L
}

,

{

1
,
2

}

,

{

1
,
3

}

,





,

{


L
-
1

,
L

}


}








(
320
)








The set system H of expression (319) and expression (320) is represented as Hmax. In the seventh embodiment, it is assumed that the set system H is given in advance and H⊂Hmax is satisfied. It is also assumed that at least a set BEH satisfying #B=2 is present.


That is, it is assumed that a source (set) B of the set system H is present, and #B=2 is satisfied.


The signal parameter analysis unit 720 receives the estimation value {circumflex over ( )}Rj(f) of the spatial covariance matrix Rj(f) from the spatial covariance matrix update unit 790 (as an exception, the initial estimation value {circumflex over ( )}Rj(0)(f) of the spatial covariance matrix Rj(f) from the initialization unit is used at the time of the first processing performed by the signal parameter analysis unit 720), and calculates a simultaneous decorrelation matrix P{j, l}(f) and a diagonal matrix Λj,{j, l}(f) for any two components xj(k, f) and x1(k, f) (j and l are any different natural numbers equal to or larger than 1 and equal to or smaller than L) among L source signal components (Step S106). In this case, the simultaneous decorrelation matrix P{j, l}(f) is a matrix for performing simultaneous decorrelation on the two source signal components xj(k, f) and xl(k, f) (that is, a matrix for performing simultaneous diagonalization on spatial covariance matrices of the two source signal components xj(k, f) and xl(k, f)), and the set {j, l} as a subscript represents an index of the source signal components xj(k, f) and xl(k, f) subjected to simultaneous decorrelation by the simultaneous decorrelation matrix P{j, l}(f). The diagonal matrix Λj,{j, l}(f) is a diagonal matrix at the time of diagonalizing the spatial covariance matrix of the source signal component xj(k, f) by the simultaneous decorrelation matrix P{j, l}(f).


Specifically, the signal parameter analysis unit 720 performs the following processing. First, by solving the generalized eigenvalue problem {circumflex over ( )}Rl(f)p=λ{circumflex over ( )}Rj(f)p for the estimation values of the spatial covariance matrices of the two source signal components xj(k, f) and xl(k, f) (j and l are any two natural numbers satisfying 1≤j<l≤L) (as an exception, by solving the generalized eigenvalue problem {circumflex over ( )}Rl(0)(f)p=λ{circumflex over ( )}Rj(0)(f)p for initial estimation values of the spatial covariance matrices of the source signal components at the time of the first processing performed by the signal parameter analysis unit 720), generalized eigenvalues λ1,{j, l}(f), . . . , λI,{j, l}(f) and generalized eigenvectors p1,{j, l}(f), . . . , pI,{j, l}(f) respectively corresponding to the generalized eigenvalues λ1,{j, l}(f), . . . , λI,{j, l}(f) are obtained. The generalized eigenvectors p1,{j, l}(f), . . . , pI,{j, l}(f) are assumed to be taken to satisfy expression (321).













p

i
,

{

j
,
l

}





(
f
)


H





R
^

j



(
f
)





p

m
,

{

j
,
l

}





(
f
)



=

{




1
,




i
=
m






0
,




i

m









(
321
)







At the time of the first processing performed by the signal parameter analysis unit 720, {circumflex over ( )}Rj(f) is replaced with {circumflex over ( )}Rj(0)(f) in expression (321). Next, the signal parameter analysis unit 720 transforms a matrix including the generalized eigenvectors p1,{j, l}(f), . . . , p1,{j, l}(f) as column vectors to the simultaneous decorrelation matrix P{j, l}(f), transforms an I-th order unit matrix to the diagonal matrix Λj,{j, l}(f), and transforms a diagonal matrix including the generalized eigenvalues Λ1,{j, l}(f), . . . , λI,{j, l}(f) as diagonal elements to a diagonal matrix Λl,{j, l}(f).


The simultaneous decorrelation unit 730 receives the observation signal vector y(k, f) from the observation signal vector creation unit 710 and the simultaneous decorrelation matrix P{j, l}(f) from the signal parameter analysis unit 720, and calculates an observation signal vector y′{j, l}(k, f) that is subjected to simultaneous decorrelation for the source signal components xj(k, f) and xl(k, f) (j and l are any two natural numbers satisfying 1≤j<l≤L) by the following expression (322) (Step S107).

y′{j,l}(k,f)←P{j,l}(f)Hy(k,f)  (322)


In this case, the set {j, l} as a subscript represents indexes of the source signal components xj(k, f) and xl(k, f) subjected to simultaneous decorrelation.


The signal presence posterior probability update unit 740 receives the simultaneous decorrelation matrix P{j, l}(f) and the diagonal matrix Λj,{j, l}(f) from the signal parameter analysis unit 720, the observation signal vector y′{j, l}(k, f) subjected to simultaneous decorrelation from the simultaneous decorrelation unit 730, an estimation value {circumflex over ( )}αB(f) of the mixture weight αB(f) from the mixture weight update unit 770 (as an exception, the initial estimation value {circumflex over ( )}αB(0)(f) of the mixture weight {circumflex over ( )}αB(f) from the initialization unit is used at the time of the first processing performed by the signal presence posterior probability update unit 740), and an estimation value {circumflex over ( )}vj(k, f) of the variance parameter vj(k, f) from the variance parameter update unit 780 (as an exception, the initial estimation value {circumflex over ( )}vj(0) (k, f) of the variance parameter vj(k, f) from the initialization unit is used at the time of the first processing performed by the signal presence posterior probability update unit 740), and updates a signal presence posterior probability γB(k, f) for each source (set) B of the set system H by expression (323) (Step S108).














(
323
)









γ
B



(

k
,
f

)










α
^

B



(
f
)







det







P
B



(
f
)





2






i
=
1

I






j

B








v
^

j



(

k
,
f

)




[


Λ

j
,
B




(
f
)


]


ii






exp
(

-




i
=
1

I








[


y
B




(

k
,
f

)


]

i



2





j

B








v
^

j



(

k
,
f

)




[


Λ

j
,
B




(
f
)


]


ii





)







B













α
^


B





(
f
)







det







P

B





(
f
)





2









i
=
1

I






j


B








v
^

j



(

k
,
f

)










[


Λ

j
,

B






(
f
)


]

ii







exp
(

-




i
=
1

I








[


y

B






(

k
,
f

)


]

i



2





j


B










v
^

j



(

k
,
f

)




[


Λ

j
,

B






(
f
)


]


ii





)











Depending on a definition of the set system H, a simultaneous decorrelation matrix P{j}(f), a diagonal matrix Λj,{j}(f), and an observation signal vector y′{j}(k, f) subjected to simultaneous decorrelation with respect to a set (singleton set) {j} constituted of only one source may appear in expression (323), and these are defined by expression (324), expression (325), and expression (326) using P{j, l}(f), Λj,{j, l}(f), and y′{j, l}(k, f) with respect to a binary set.

P{j}(f)custom characterP{j,ρ(j)}(f)  (324)
Λj,{j}(f)custom characterΛj,{j,ρ(j)}(f)  (325)
y′{j}(k,f)=y′{j,ρ(j)}(k,f)  (326)


In this case, ρ:{1, . . . , L}→{1, . . . , L} is a predetermined function satisfying “ρ(j)≠j for all j∈{1, . . . , L}”. The reason for this definition is as follows.


That is, simultaneous decorrelation of one source signal component xj(k, f) is nothing but simple decorrelation of xj(k, f), and this is obviously implemented by simultaneous decorrelation of two source signal components xj(k, f) and xρ(j)(k, f).


The single channel Wiener filter unit 750 receives the observation signal vector y′{j, l}(k, f) subjected to simultaneous decorrelation from the simultaneous decorrelation unit 730, the diagonal matrix Λj,{j, l}(f) from the signal parameter analysis unit 720, and the estimation value {circumflex over ( )}vj(k, f) of the variance parameter vj(k, f) from the variance parameter update unit 780 (as an exception, the initial estimation value {circumflex over ( )}vj(0)(k, f) of the variance parameter vj (k, f) from the initialization unit is used at the time of the first processing performed by the single channel Wiener filter unit 750), and updates a posterior mean m′j,{j, l}(k, f) (j and l are any different natural numbers equal to or larger than 1 and equal to or smaller than L) subjected to simultaneous decorrelation by expression (327) (Step S109).











[


m

j
,

{

j
,
l

}






(

k
,
f

)


]

i










v
^

j



(

k
,
f

)




[


Λ

j
,

{

j
,
l

}





(
f
)


]


ii







v
^

j



(

k
,
f

)




[


Λ

j
,

{

j
,
l

}





(
f
)


]


ii

+





v
^

l



(

k
,
f

)




[


Λ

l
,

{

j
,
l

}





(
f
)


]


ii





[


y

{

j
,
l

}





(

k
,
f

)


]


i





(
327
)







The posterior covariance matrix update unit 760 receives the diagonal matrix Λj,{j, l}(f) from the signal parameter analysis unit 720, and the estimation value {circumflex over ( )}vj(k, f) of the variance parameter vj(k, f) from the variance parameter update unit 780 (as an exception, the initial estimation value {circumflex over ( )}vj(0)(k, f) of the variance parameter vj(k, f) from the initialization unit is used at the time of the first processing performed by the posterior covariance matrix update unit 760), and updates a posterior covariance matrix Σ′{j, l}(k, f) subjected to simultaneous decorrelation (j and l are any two natural numbers satisfying 1≤j<l≤L) by the following expression (328) (Step S110).











[


Σ

{

j
,
l

}





(

k
,
f

)


]

ii







v
^

j



(

k
,
f

)








v
^

l



(

k
,
f

)




[


Λ

l
,

{

j
,
l

}





(
f
)


]


ii






v
^

j



(

k
,
f

)


+





v
^

l



(

k
,
f

)




[


Λ

l
,

{

j
,
l

}





(
f
)


]


ii







(
328
)







The mixture weight update unit 770 receives the signal presence posterior probability γB(k, f) from the signal presence posterior probability update unit 740, and updates the estimation value {circumflex over ( )}αB(f) of the mixture weight αB(f) (B is any source (set) of the set system H) by expression (329) (Step S111).












α
^

B



(
f
)





1
K






k
=
1

K




γ
B



(

k
,
f

)








(
329
)







The variance parameter update unit 780 receives the diagonal matrix Λj,{j, l}(f) from the signal parameter analysis unit 720, the signal presence posterior probability γB(k, f) from the signal presence posterior probability update unit 740, the posterior mean m′j,{j, l}(k, f) subjected to simultaneous decorrelation from the single channel Wiener filter unit 750, and the posterior covariance matrix Σ′{j, l}(k, f) subjected to simultaneous decorrelation from the posterior covariance matrix update unit 760, and updates the estimation value {circumflex over ( )}vj(k, f) of the variance parameter vj(k, f) by expression (330) (Step S112).












v
^

j



(

k
,
f

)








B
:





j

B









γ
B



(

k
,
f

)




1
I






i
=
1

I









[


m

j
,
B





(

k
,
f

)


]

i



2

+


[


Σ
B




(

k
,
f

)


]

ii




[


Λ

j
,
B




(
f
)


]

ii









B
:





j

B








γ
B



(

k
,
f

)








(
330
)







Depending on the definition of the set system H, a posterior mean m′j,{j}(k, f) subjected to simultaneous decorrelation and a posterior covariance matrix Σ′{j}(k, f) subjected to simultaneous decorrelation with respect to the singleton set {j} may appear in expression (330), and these are defined by expression (331) and expression (332).

m′j,{j}(k,f)custom charactery′(k,f)  (331)
Σ′{j}(k,f)custom characterO  (332)


The spatial covariance matrix update unit 790 receives the simultaneous decorrelation matrix P{j, l}(f) from the signal parameter analysis unit 720, the signal presence posterior probability γB(k, f) from the signal presence posterior probability update unit 740, the posterior mean m′j,{j, l}(k, f) subjected to simultaneous decorrelation from the single channel Wiener filter unit 750, the posterior covariance matrix Σ′{j, l}(k, f) subjected to simultaneous decorrelation from the posterior covariance matrix update unit 760, and the estimation value {circumflex over ( )}vj(k, f) of the variance parameter vj(k, f) from the variance parameter update unit 780, and updates the estimation value {circumflex over ( )}Rj(f) of the spatial covariance matrix Rj(f) by expression (333) (Step S113).












R
^

j



(
f
)











B
:





j

B









(



P
B



(
f
)



-
1


)

H

[








k
=
1

K





γ
B



(

k
,
f

)








1


v
j



(

k
,
f

)






















(




m

j
,
B





(

k
,
f

)






m

j
,
B





(

k
,
f

)


H


+


Σ
B




(

k
,
f

)



)

]





P
B



(
f
)



-
1










B
:





j

B










k
=
1

K




γ
B



(

k
,
f

)









(
333
)







The convergence determination unit (not illustrated) determines whether the update converges (Step S114). If the convergence determination unit determines that the update does not converge (No at Step S114), the process returns to the processing performed by the signal parameter analysis unit 720, and the processing is continued.


On the other hand, if the convergence determination unit determines that the update converges (Yes at Step S114), the processing is performed by the source signal component estimation unit 800. The source signal component estimation unit 800 receives the simultaneous decorrelation matrix P{j, l}(f) from the signal parameter analysis unit 720, the signal presence posterior probability γB(k, f) from the signal presence posterior probability update unit 740, and the posterior mean m′j,{j, l}(k, f) subjected to simultaneous decorrelation from the single channel Wiener filter unit 750, calculates the estimation value {circumflex over ( )}xj(k, f) of the source signal component xj(k, f) by expression (334) (Step S115), and outputs the estimation value {circumflex over ( )}xj(k, f).












x
^

j



(

k
,
f

)







B
:





j

B









γ
B



(

k
,
f

)





(



P
B



(
f
)



-
1


)

H




m

j
,
B





(

k
,
f

)








(
334
)







The frequency domain signal separation unit 320 determines whether f=F is satisfied (Step S116). If it is determined that f=F is not satisfied (No at Step S116), the process returns to the processing of f←f+1 performed by the frequency domain signal separation unit 320, and the processing is continued.


On the other hand, if it is determined that f=F is satisfied (Yes at Step S116), the processing performed by the frequency domain signal separation unit 320 ends.


[Background of Processing Performed by Signal Analysis Device]


Next, the following describes a background of processing performed by the signal analysis device according to the seventh embodiment. In the seventh embodiment, the observation signal vector y(k, f) is modeled by expression (317).


In the seventh embodiment, the source signal component xj(k, f) is assumed to follow a complex gaussian distribution the mean of which is 0 and the covariance matrix thereof is Sj(k, f)=vj(k, f)Rj(f). In this case, the probability distribution of the observation signal vector y(k, f) under the condition that B(k, f) takes a value of B∈H is given by the following expression (335).










p


(




y


(

k
,
f

)


|

B


(

k
,
f

)



=
B

,

Φ
1


)


=

𝒩
(



y


(

k
,
f

)


;
0

,




j

B






v
j



(

k
,
f

)





R
j



(
f
)





)





(
335
)







In this case, Φl collectively represents all model parameters. The probability distribution of the set B(k, f) is given by the following expression (336).

P(B(k,f)=B|Φ1)=αB(f)  (336)


In this case, αB(f) satisfies a constraint condition of the following expression (337).













B







α
B



(
f
)



=
1




(
337
)







Based on expression (338) and expression (337), the probability distribution of the observation signal vector y(k, f) is given by the following expression (338).










p


(


y


(

k
,
f

)


|

Φ
1


)


=




B








α
B



(
f
)




𝒩
(



y


(

k
,
f

)


;
0

,




j

B






v
j



(

k
,
f

)





R
j



(
f
)





)







(
338
)







Hereinafter, αB(f) is referred to as a mixture weight. The parameter Φ1 is constituted of the mixture weight αB(f), the variance parameter vj(k, f), and the spatial covariance matrix Rj(f). The parameter Φ1 can be estimated by maximizing a likelihood function in the following expression (339).












k
=
1

K






f
=
1

F



p


(


y


(

k
,
f

)


|

Φ
1


)







(
339
)







Reference document 6 “R. Ikeshita, M. Togami, Y. Kawaguchi, Y. Fujita, and K. Nagamatsu, “Local Gaussian model with source-set constraints in audio source separation”, IEEE International Workshop on Machine Learning for Signal Processing, September 2017.” discloses a method of estimating the parameter Φ1 by maximizing the likelihood function in the above expression based on the EM algorithm. In this related art, the parameter Φ1 is estimated by alternately iterating the E-step and the M-step.


At the E-step, the signal presence posterior probability γB(k, f), a posterior mean mj, B(k, f), and a posterior covariance matrix ΣB(k, f) are calculated by the following expression (340) to expression (344).


















γ
B




(

k
,
f

)









α
^

B



(
f
)




𝒩
(



y


(

k
,
f

)


;
0

,




j

B











v
^

j



(

k
,
f

)






R
^

j



(
f
)





)







B











α
^


B





(
f
)




𝒩
(






y


(

k
,
f

)


;
0

,









j


B













v
^

j



(

k
,
f

)






R
^

j



(
f
)







)






(

B



)







(
340
)













m

j
,

{
j
}





(

k
,
f

)





y


(

k
,
f

)








(


{
j
}




)







(
341
)









m

j
,

{

j
,
i

}





(

k
,
f

)







v
^

j



(

k
,
f

)






R
^

j



(
f
)





(





v
^

j



(

k
,
f

)






R
^

j



(
f
)



+




v
^

l



(

k
,
f

)






R
^

l



(
f
)




)


-
1




y


(

k
,
f

)














(


j

l

,


{

j
,
l

}





)





(
342
)










{
j
}




(

k
,
f

)




O






(


{
j
}




)






(
343
)











{

j
,
l

}




(

k
,
f

)







v
^

j



(

k
,
f

)






R
^

j



(
f
)





(





v
^

j



(

k
,
f

)






R
^

j



(
f
)



+




v
^

l



(

k
,
f

)






R
^

l



(
f
)




)


-
1




(




v
^

l



(

k
,
f

)






R
^

l



(
f
)



)













(


j

l

,


{

j
,
l

}





)





(
344
)







In this case, the signal presence posterior probability γB(k, f) is defined by the following expression (345) as a posterior distribution that is a conditional probability distribution of the set B(k, f) under the condition that the observation signal vector y(k, f) is given.

γB(k,f)custom characterP(B(k,f)=B|y(k,f);{circumflex over (Φ)}1)(B∈H)  (345)


In this case, {circumflex over ( )}Φ1 represents a present estimation value of the parameter Φ1. The posterior mean mj, B(k, f) and the posterior covariance matrix ΣB(k, f) are defined as a mean and a covariance matrix of a posterior distribution (that is indicated to be a complex gaussian distribution) that is a conditional probability distribution of the source signal component xj(k, f) under the condition that the observation signal vector y(k, f) and the set B(k, f) are given. This posterior distribution is represented by the following expression (346).

p(xj(k,f)|y(k,f),B(k,f)=B;{circumflex over (Φ)}1)=N(xj(k,f);mj,B(k,f),ΣB(k,f))(j∈B∈H)  (346)


At the M-step, the estimation value {circumflex over ( )}αB(f) of the mixture weight αB(f), the estimation value {circumflex over ( )}vj(k, f) of the variance parameter vj(k, f), and the estimation value {circumflex over ( )}Rj(f) of the spatial covariance matrix Rj(f) are updated by update rules of the following expression (347) to expression (349).












α
^

B



(
f
)





1
K






k
=
1

K









γ
B



(

k
,
f

)








(

B



)








(
347
)









v
^

j



(

k
,
f

)








B
:

j

B













γ
B



(

k
,
f

)




1
I


t





r









(




R
^

j



(
f
)



-
1














(


m

j
,
B




(

k
,
f

)











m

j
,
B




(

k
,
f

)


H

+














B



(

k
,
f

)


)

)

)













B
:

j

B












γ
B



(

k
,
f

)








(
348
)









R
^

j



(
f
)











B
:

j

B










k
=
1

K





γ
B



(

k
,
f

)




1



v
^

j



(

k
,
f

)











(




m

j
,
B




(

k
,
f

)






m

j
,
B




(

k
,
f

)


H


+



B



(

k
,
f

)



)








B
:

j

B










k
=
1

K




γ
B



(

k
,
f

)









(
349
)







However, in this related art, in update of the signal presence posterior probability γB(k, f) in expression (340), update of the posterior mean mj, B(k, f) in expression (342), and update of the posterior covariance matrix Σ{j, l}(k, f) in expression (344), determinant calculation, inverse matrix calculation, and matrix multiplication are requested to be performed for each time-frequency point. Specifically, determinant calculation in a form of expression (350) in expression (340), inverse matrix calculation in a form of expression (351) in expression (340), expression (342), and expression (344), and matrix multiplication in a form of expression (352) in expression (344) are requested to be performed for each time-frequency point (k, f).

det({circumflex over (v)}j(k,f){circumflex over (R)}j(f)+{circumflex over (v)}l(k,f){circumflex over (R)}l(f))  (350)
({circumflex over (v)}j(k,f){circumflex over (R)}j(f)+{circumflex over (v)}l(k,f){circumflex over (R)}l(f))−1  (351)
{circumflex over (v)}j(k,f){circumflex over (R)}j(f)({circumflex over (v)}j(k,f){circumflex over (R)}j(f)+{circumflex over (v)}l(k,f){circumflex over (R)}l(f))−1({circumflex over (v)}l(k,f){circumflex over (R)}l(f))  (352)


Each of the determinant calculation, the inverse matrix calculation, and the matrix multiplication is a matrix arithmetic operation that requests a calculation amount of an order of O(I3), and the number of time-frequency points is typically very large, so that a huge amount of calculation is requested in this related art.


Assuming that the covariance matrices {circumflex over ( )}vj(k, f){circumflex over ( )}Rj(f) and {circumflex over ( )}vl(k, f){circumflex over ( )}Rl(f) in expression (350) to expression (352) are both diagonal matrices, determinant calculation, inverse matrix calculation, and matrix multiplication in expression (350) to expression (352) result in a simple scalar arithmetic operation for a diagonal element, so that calculation can be performed with a small calculation amount. However, different elements of the source signal component (vector) are typically correlated with each other, so that the covariance matrices {circumflex over ( )}vj(k, f){circumflex over ( )}Rj(f) and {circumflex over ( )}vl(k, f){circumflex over ( )}Rl(f) are not diagonal matrices.


Thus, in the seventh embodiment, it is considered that the two spatial covariance matrices {circumflex over ( )}Rj(f) and {circumflex over ( )}Rl(f) are subjected to simultaneous diagonalization based on the generalized eigenvalue problem. It is assumed that λ1,{j, l}(f), . . . , λI,{j, l}(f) are generalized eigenvalues of the generalized eigenvalue problem (expression (353)) for the spatial covariance matrices {circumflex over ( )}Rl(f) and {circumflex over ( )}Rj(f), and p1,{j, l}(f), . . . , pI,{j, l}(f) are generalized eigenvectors corresponding to the generalized eigenvalues λ1,{j, l}(f), . . . , λI,{j, l}(f).

{circumflex over (R)}l(f)p=λ{circumflex over (R)}j(f)p  (353)


In this case, the generalized eigenvectors p1,{j, l}(f), . . . , pI,{j, l}(f) are taken to satisfy the following expression (354).













p

i
,

{

j
,
l

}





(
f
)


H





R
^

j



(
f
)





p

m
,

{

j
,
l

}





(
f
)



=

{




1
,




i
=
m






0
,




i

m









(
354
)







Assuming that p{j, l}(f) is a matrix including the generalized eigenvectors p1,{j, l}(f), . . . , pI,{j, l}(f) as column vectors, and Λ{j, l}(f) is a diagonal matrix including the generalized eigenvalues λ1,{j, l}(f), . . . , λI,{j, l}(f) as diagonal elements, the following expression (355) is established.









{








P

{

j
,
l

}




(
f
)


H





R
^

j



(
f
)





P

{

j
,
l

}




(
f
)



=
I










P

{

j
,
l

}




(
f
)


H





R
^

l



(
f
)





P

{

j
,
l

}




(
f
)



=


Λ

{

j
,
l

}




(
f
)










(
355
)







Expression (355) indicates that the matrix P{j, l}(f) simultaneously diagonalizes the two spatial covariance matrices {circumflex over ( )}Rj(f) and {circumflex over ( )}Rl(f) (simultaneously decorrelates the two source signal components xj(k, f) and xl(k, f)). Thus, the matrix P{j, l}(f) is referred to as a simultaneous decorrelation matrix.


By solving expression (355) for the spatial covariance matrices {circumflex over ( )}Rj(f) and {circumflex over ( )}Rl(f), and substituting expression (355) for expression (340), expression (342), and expression (344) to be defined as expression (356) to expression (358) (j and l are any different integral numbers equal to or larger than 1 and equal to or smaller than L), expression (362), expression (363), and expression (364) (described later) are obtained.

m′j,{j,l}(k,f)custom characterP{j,l}(f)Hmj,{j,l}(k,f)  (356)
Σ′{j,l}(k,f)custom characterP{j,l}(f)HΣ{j,l}(k,f)P{j,l}(f)  (357)
y′{j,l}(k,f)custom characterP{j,l}(f)Hy(k,f)  (358)


By solving expression (356) and expression (357) for the posterior mean mj,{j, l}(k, f) and the posterior covariance matrix Σ{j, l}(k, f) to be substituted for expression (348) and expression (349), expression (366) and expression (368) are obtained.


In this way, the update rules for the signal analysis device according to the seventh embodiment are derived.


In this way, in the signal analysis device according to the seventh embodiment, the signal parameter analysis unit 720 obtains the simultaneous decorrelation matrix P{j, l}(f) as a matrix including I (I is an integral number equal to or larger than 2) different generalized eigenvectors p1,{j, l}(f), . . . , pI,{j, l}(f) as column vectors satisfying pi,{j, l}(f)H{circumflex over ( )}Rj(f)pm,{j, l}(f)=0 (i and m are any integral numbers equal to or larger than 1 and equal to or smaller than I different from each other) for the spatial covariance matrices {circumflex over ( )}Rl(f) and {circumflex over ( )}Rj(f) for modeling spatial characteristics of two source signals that are present in a mixed manner, or Hermitian transposition P{j, l}(f)H thereof, as a parameter for decorrelating components corresponding to the two source signals for the observation signal vectors based on the observation signals acquired at I positions.


Example of Seventh Embodiment

The present example describes a specific example of the set system H. In the present example, the set system H is assumed to be expression (359).

custom character={{1,2},{1,3}, . . . ,{1,J}}  (359)


This means that the observation signal vector at each time-frequency point always includes the first source signal component, and includes just one of the second to the j-th source signal components. For example, the first source signal component models a source signal component that is not sparse such as background noise, and the second to the J-th source signal components model a source signal component that is sparse such as a voice.


Expression (360) to expression (370) represent update expressions of processing performed by the signal parameter analysis unit 720, the simultaneous decorrelation unit 730, the signal presence posterior probability update unit 740, the single channel Wiener filter unit 750, the posterior covariance matrix update unit 760, the mixture weight update unit 770, the variance parameter update unit 780, the spatial covariance matrix update unit 790, and the source signal component estimation unit 800. In the seventh embodiment, sets {1, 2}, {1, 3}, . . . , {1, J} as sources of H correspond to indexes 2, 3, . . . , J on a one-to-one basis, so that the following notation is used for simplification. That is, y′{1, j}(k, f) is written as y′j(k, f), P{1, j}(f) is written as Pj(f), γ{1, j}(k, f) is written as γj(k, f), {circumflex over ( )}α{1, j}(f) is written as {circumflex over ( )}αj(f), Λl,{1, j}(f) is written as Λlj(f), m′l,{l, j}(k, f) is written as m′lj(k, f), and Σ′{1, j}(k, f) is written as Σ′j(k, f). In this case, it is assumed that j=2, . . . , J, and l=1, j (the same applies to update the expressions described below). In the present example, the signal parameter analysis unit 720 is not requested to calculate the simultaneous decorrelation matrix P{j, l}(f) and the diagonal matrix Λj,{j, l}(f) for any two xj(k, f) and xl(k, f) of the L source signal components (j and l are any different natural numbers equal to or larger than 1 and equal to or smaller than L). As represented by expression (360), it is sufficient that the signal parameter analysis unit 720 calculates the simultaneous decorrelation matrix P{1, j}(f) (that is, Pj(f)) and the diagonal matrix Λl,{1, j}(f) (that is, Λlj(f)) for x1(k, f) and xj(k, f) (j is any natural number equal to or larger than 2 and equal to or smaller than L).


By solving generalized eigenvalue problem {circumflex over (R)}j(f)p=λ{circumflex over (R)}1(f)p, simultaneous decorrelation matrix Pj(f) and diagonal matrix Λlj(f) are obtained.

    • (360)
















y
j




(

k
,
f

)







P
j



(
f
)


H



y


(

k
,
f

)








(
361
)













γ
j



(

k
,
f

)











α
^

j



(
f
)






i
=
1

I



(




v
^

1



(

k
,
f

)


+





v
^

j



(

k
,
f

)




[


Λ

j
,
j




(
f
)


]


ii


)








exp


(

-




i
=
1

I












[


y
j




(

k
,
f

)


]

i



2





v
^

1



(

k
,
f

)


+





v
^

j



(

k
,
f

)




[


Λ

j
,
j




(
f
)


]


ii





)













j


=
2

L










α
^


j





(
f
)






i
=
1

I



(




v
^

1



(

k
,
f

)


+





v
^


j





(

k
,
f

)




[


Λ


j


,

j






(
f
)


]


ii


)









exp


(

-




i
=
1

I












[


y

j






(

k
,
f

)


]

i



2





v
^

1



(

k
,
f

)


+





v
^


j





(

k
,
f

)




[


Λ


j


,

j






(
f
)


]


ii





)











(
362
)













[


m

l
,
j





(

k
,
f

)


]

i










v
^

l



(

k
,
f

)




[


Λ
lj



(
f
)


]


ii





v
^

1



(

k
,
f

)


+





v
^

j



(

k
,
f

)




[


Λ
jj



(
f
)


]


ii





[


y
j




(

k
,
f

)


]


i






(
363
)













[



j




(

k
,
f

)


]

ii







v
^

1



(

k
,
f

)








v
^

j



(

k
,
f

)




[


Λ
jj



(
f
)


]


ii






v
^

1



(

k
,
f

)


+





v
^

j



(

k
,
f

)




[


Λ
jj



(
f
)


]


ii








(
364
)














α
^

j



(
f
)





1
K






k
=
1

K








γ
j



(

k
,
f

)









(
365
)














v
^

j



(

k
,
f

)





1
I






i
=
1

I









[


m
jj




(

k
,
f

)


]

i



2

+


[



j




(

k
,
f

)


]

ii




[


Λ
jj



(
f
)


]

ii









(
366
)









v
^

1



(

k
,
f

)







j
=
2

L





γ
j



(

k
,
f

)




1
I






i
=
1

I



(






[


m

1





j





(

k
,
f

)


]

i



2

+


[



j




(

k
,
f

)


]

ii


)








(
367
)









R
^

j



(
f
)








(



P
j



(
f
)



-
1


)

H



[







k
=
1

K





γ
j



(

k
,
f

)




1



v
^

j



(

k
,
f

)










(




m
jj




(

k
,
f

)






m
jj




(

k
,
f

)


H


+



j




(

k
,
f

)



)




]






P
j



(
f
)



-
1







k
=
1

K




γ
j



(

k
,
f

)








(
368
)









R
^

l



(
f
)







j
=
2

J






(



P
j



(
f
)



-
1


)

H

[






1
K






k
=
1

K





γ
j



(

k
,
f

)




1



v
^

l



(

k
,
f

)





(




m
ij




(

k
,
f

)






m
ij




(

k
,
f

)


H


+



j




(

k
,
f

)



)




]





P
j



(
f
)



-
1








(
369
)














x
^

j



(

k
,
f

)






γ
j



(

k
,
f

)





(



P
j



(
f
)



-
1


)

H




m
jj




(

k
,
f

)








(
370
)







Eighth Embodiment

An eighth embodiment describes another configuration example of the signal analysis device according to the third embodiment. FIG. 21 is a diagram illustrating an example of a configuration of the signal analysis device according to the eighth embodiment. FIG. 22 is a flowchart illustrating an example of a processing procedure of signal analysis processing according to the eighth embodiment. As illustrated in FIG. 21, a signal analysis device 801 according to the eighth embodiment includes the frequency domain conversion unit 310, a frequency domain signal separation unit 320-1, and the time domain conversion unit 330.


In the signal analysis device 801, the frequency domain signal separation unit 320-1 calculates estimation values of a spectral basis tn(f), an activation basis wn(k), a partition function znj, a spatial covariance matrix Λj(f) subjected to simultaneous decorrelation, and a simultaneous decorrelation matrix P(f).


The spectral basis tn(f), the activation basis wn(k), partition function znj are parameters for modeling the variance parameter vj(k, f). The variance parameter vj(k, f) is modeled by expression (371) using these parameters.











v
j



(

k
,
f

)


=




n
=
1

N








z
nj




t
n



(
f
)





w
n



(
k
)








(
371
)







In this case, n is an index of the spectral basis and the activation basis. N is the number of spectral bases and activation bases.


The frequency domain signal separation unit 320-1 includes an initialization unit (not illustrated), an observation signal vector creation unit 810, a simultaneous decorrelation unit 830, a single channel Wiener filter unit 840, an observation covariance matrix update unit 850, a variance parameter update unit 860, a spatial covariance matrix update unit 870, a signal parameter analysis unit 820, a convergence determination unit (not illustrated), and a simultaneous decorrelation inverse conversion unit 880.


[Configuration and Processing of Signal Analysis Device]


With reference to FIG. 21 and FIG. 22, the following describes a configuration of the signal analysis device and a processing procedure of the signal analysis processing according to the eighth embodiment.


The initialization unit (not illustrated) performs initialization for calculating initial estimation values of the variance parameter vj(k, f), the spectral basis tn(f), the activation basis wn(k), the partition function znj, the spatial covariance matrix Λj(f) subjected to simultaneous decorrelation, and the simultaneous decorrelation matrix P(f) (Step S121). The initial estimation value is represented as {circumflex over ( )}vj(0)(k, f) and the like by adding a superscript of (0).


The frequency domain conversion unit 310 acquires the observation signal (Step S122), and converts the observation signal for a frequency domain (Step S123) to calculate the observation signal yi(k, f) of the time-frequency domain.


The observation signal vector creation unit 810 creates the observation signal vector y(k, f) in the same way as the seventh embodiment (Step S124).


The simultaneous decorrelation unit 830 calculates the observation signal vector y′(k, f) subjected to simultaneous decorrelation by expression (372) to be updated (Step S125).

y′(k,f)←{circumflex over (P)}(f)Hy(k,f)  (372)


In this case, {circumflex over ( )}P(f) is an estimation value of the simultaneous decorrelation matrix from the signal parameter analysis unit 820.


The observation covariance matrix update unit 850 updates an observation covariance matrix U′(k, f) subjected to simultaneous decorrelation by expression (373) (Step S126).











U




(

k
,
f

)







j
=
1

J










v
^

j



(

k
,
f

)






Λ
^

j



(
f
)








(
373
)







In this case, {circumflex over ( )}vj(k, f) is an estimation value of the variance parameter from the variance parameter update unit 860. {circumflex over ( )}Λj(f) is an estimation value of the spatial covariance matrix subjected to simultaneous decorrelation from the spatial covariance matrix update unit 870.


The variance parameter update unit 860 updates the estimation value {circumflex over ( )}vj(k, f) of the variance parameter using the following procedure (Step S127).


First, the variance parameter update unit 860 updates a spatial covariance matrix Σn(f) subjected to simultaneous decorrelation corresponding to the n-th basis by expression (374). In expression (374), {circumflex over ( )}znj is an estimation value of the partition function.












n



(
f
)







j
=
1

J





z
^

nj





Λ
^

j



(
f
)








(
374
)







Next, the variance parameter update unit updates an estimation value {circumflex over ( )}tn(f) of the spectral basis by expression (375). In expression (375), {circumflex over ( )}wn(k) is an estimation value of the activation basis.












t
^

n



(
f
)







t
^

n



(
f
)









k
=
1

K










w
^

n



(
k
)




tr


(







U




(

k
,
f

)



-
1





y




(

k
,
f

)











y




(

k
,
f

)


H





U




(

k
,
f

)



-
1






n



(
f
)






)








k
=
1

K










w
^

n



(
k
)




tr


(




U




(

k
,
f

)



-
1






n



(
f
)



)











(
375
)







Next, the variance parameter update unit 860 updates the estimation value {circumflex over ( )}wn(k) of the activation basis by expression (376).












w
^

n



(
k
)







w
^

n



(
k
)









f
=
1

F










t
^

n



(
f
)




tr


(







U




(

k
,
f

)



-
1





y




(

k
,
f

)











y




(

k
,
f

)


H





U




(

k
,
f

)



-
1






n



(
f
)






)








f
=
1

F










t
^

n



(
f
)




tr


(




U




(

k
,
f

)



-
1






n



(
f
)



)











(
376
)







Next, the variance parameter update unit 860 updates the estimation value {circumflex over ( )}znj of the partition function by expression (377).











z
^

nj





z
^

nj











k
=
1

K






f
=
1

F










t
^

n



(
f
)






w
^

n



(
k
)



tr











(




U




(

k
,
f

)



-
1





y




(

k
,
f

)













y




(

k
,
f

)


H





U




(

k
,
f

)



-
1






Λ
^

j



(
f
)



)











k
=
1

K






f
=
1

F










t
^

n



(
f
)






w
^

n



(
k
)




tr


(




U




(

k
,
f

)



-
1






Λ
^

j



(
f
)



)












(
377
)







Finally, the variance parameter update unit 860 updates the estimation value {circumflex over ( )}vj(k, f) of the variance parameter by expression (378).












v
^

j



(

k
,
f

)







n
=
1

N









z
^

nj





t
^

n



(
f
)






w
^

n



(
k
)








(
378
)







The spatial covariance matrix update unit 870 updates the estimation value {circumflex over ( )}Λj(f) of the spatial covariance matrix subjected to simultaneous decorrelation by expression (379) (Step S128).

{circumflex over (Λ)}j(f)←Aj(f)−1#Bj(f)  (379)


In this case, # is a geometric mean of a matrix, and Aj(f) and Bj(f) are matrices calculated by expression (380) and expression (381).
















A
j



(
f
)







j
=
1

J










v
^

j



(

k
,
f

)






U




(

k
,
f

)



-
1









(
380
)








B
j



(
f
)







Λ
^

j



(
f
)




(




j
=
1

J










v
^

j



(

k
,
f

)






U




(

k
,
f

)



-
1




𝒟


(



y




(

k
,
f

)






y




(

k
,
f

)


H


)






U




(

k
,
f

)



-
1




)





Λ
^

j



(
f
)







(
381
)







The signal parameter analysis unit 820 updates the estimation value {circumflex over ( )}P(f) of the simultaneous decorrelation matrix by expression (382) (Step S129).











[


P
^



(
f
)


]

i






(


1
K






k
=
1

K




1


[


U




(

k
,
f

)


]

ii




y


(

k
,
f

)





y


(

k
,
f

)


H




)


-
1




[



P
^



(
f
)


H

]


i





(
382
)







The convergence determination unit (not illustrated) determines whether the update converges (Step S130). If the convergence determination unit determines that the update does not converge (No at Step S130), the process returns to the processing performed by the simultaneous decorrelation unit 830 (Step S125), and the processing is continued.


On the other hand, if the convergence determination unit determines that the update converges (Yes at Step S130), the single channel Wiener filter unit 840 calculates an estimation value {circumflex over ( )}x′j(k, f) of the source signal component subjected to simultaneous decorrelation by expression (383) (Step S131).











[



x
^

j




(

k
,
f

)


]

i










v
^

j



(

k
,
f

)




[



Λ
^

j



(
f
)


]


ii





l
=
1

J











v
^

l



(

k
,
f

)




[



Λ
^

l



(
f
)


]


ii





[


y




(

k
,
f

)


]


i





(
383
)







The single channel Wiener filter unit 840 updates the estimation value {circumflex over ( )}x′j(k, f) of the source signal component subjected to simultaneous decorrelation (Step S132).


The simultaneous decorrelation inverse conversion unit 880 calculates the estimation value {circumflex over ( )}xj(k, f) of the source signal component by expression (384) (Step S133).

{circumflex over (x)}j(k,f)←(P(f)−1)H{circumflex over (x)}′j(k,f)  (384)


The time domain conversion unit 330 performs processing similar to that in the seventh embodiment to calculate (Step S134) and output the estimation value {circumflex over ( )}xj(t) of the source signal component of the time domain.


Effect of Eighth Embodiment

In the third embodiment, signal processing is performed based on only directivity of sound. On the other hand, in the eighth embodiment, signal analysis is performed while considering a structure (parts) of sound, so that accuracy in sound source separation can be improved.


The eighth embodiment exhibits the effect that a permutation problem is not caused in principle. In the eighth embodiment, processing is performed while considering what kinds of parts are present in all frequencies, so that processing is not independently performed for each frequency, and the permutation problem is not caused in principle.


[System Configuration and the Like]


The constituent elements of the devices illustrated in the drawings are merely conceptual, and it is not requested that they be physically configured as illustrated necessarily. That is, specific forms of distribution and integration of the devices are not limited to those illustrated in the drawings. All or part thereof may be functionally or physically distributed and/or integrated in any units depending on various loads and/or usage states. Furthermore, all or any part of the processing functions performed by each device may be implemented by a CPU and a computer program that is analyzed and executed by the CPU, or implemented as hardware based on wired logic.


Among the pieces of processing described in the embodiments, all or some of the pieces of the processing described as being automatically performed can be manually performed, or all or some of the pieces of the processing described as being manually performed can be automatically performed using a known method. The processing procedures, the control procedures, the specific names, and the information including various pieces of data and parameters described herein or illustrated in the drawings can be optionally changed unless otherwise specifically noted. That is, the processing described in the learning method and the voice recognition method described above is not only performed in the described order on a time-series basis, but also may be performed in parallel or individually as needed or depending on a processing capacity of a device that performs the processing.


[Computer Program]



FIG. 23 is a diagram illustrating an example of a computer in which a computer program is executed to implement the signal analysis devices 1, 201, 301, 401, 501, and 601. A computer 1000 includes, for example, a memory 1010 and a CPU 1020. The computer 1000 also includes a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface 1070. These components are connected to each other via a bus 1080.


The memory 1010 includes a ROM 1011 and a RAM 1012. The ROM 1011 stores therein, for example, a boot program such as Basic Input Output System (BIOS). The hard disk drive interface 1030 is connected to a hard disk drive 1090. The disk drive interface 1040 is connected to a disk drive 1100. For example, a removable storage medium such as a magnetic disk and an optical disc is inserted into the disk drive 1100. The serial port interface 1050 is connected, for example, to a mouse 1110 and a keyboard 1120. The video adapter 1060 is connected to a display 1130, for example.


The hard disk drive 1090 stores therein, for example, an operating system (OS) 1091, an application program 1092, a program module 1093, and program data 1094. That is, a computer program that specifies each of the pieces of processing performed by the signal analysis devices 1, 201, 301, 401, 501, and 601 is implemented as the program module 1093 in which a code that can be executed by the computer 1000 is written. The program module 1093 is stored in the hard disk drive 1090, for example. For example, the hard disk drive 1090 stores therein the program module 1093 for executing processing similar to the functional configuration of each of the signal analysis devices 1, 201, 301, 401, 501, and 601. The hard disk drive 1090 may be replaced with a solid state drive (SSD).


Setting data used for the processing in the embodiments described above is stored in the memory 1010 or the hard disk drive 1090, for example, as the program data 1094. The CPU 1020 then reads the program module 1093 or the program data 1094 stored in the memory 1010 or the hard disk drive 1090 into the RAM 1012 to be executed, as needed.


The program module 1093 and the program data 1094 are not necessarily stored in the hard disk drive 1090, but may be stored in a removable storage medium and read by the CPU 1020 via the disk drive 1100 and the like, for example. Alternatively, the program module 1093 and the program data 1094 may be stored in another computer connected via a network (a local area network (LAN), a wide area network (WAN), and the like). The program module 1093 and the program data 1094 may be read out from another computer by the CPU 1020 via the network interface 1070.


The embodiments to which the invention made by the present inventors is applied are described above, but the present invention is not limited to the description and the drawings constituting part of the disclosure of the present invention according to the embodiments. That is, other embodiments, examples, operation techniques, and the like that are performed by those skilled in the art based on the embodiments are all encompassed by a scope of the present invention.


DESCRIPTION OF SIGNS






    • 1, 201, 301, 401, 501, 601, 801 Signal analysis device


    • 10, 210, 410, 510, 610, 710, 810 Observation signal vector creation unit


    • 20, 220, 420, 520, 620, 720, 820 Signal parameter analysis unit


    • 30, 230, 430, 530, 630, 730, 830 Simultaneous decorrelation unit


    • 40, 240, 540, 640, 750, 840 Single channel Wiener filter unit


    • 50, 280, 580, 680, 880 Simultaneous decorrelation inverse conversion unit


    • 250, 550, 760 Posterior covariance matrix update unit


    • 260, 440, 560, 780, 860 Variance parameter update unit


    • 270, 450, 570, 790, 870 Spatial covariance matrix update unit


    • 310 Frequency domain conversion unit


    • 320 Frequency domain signal separation unit


    • 330 Time domain conversion unit


    • 650, 850 Observation covariance matrix update unit


    • 740 Signal presence posterior probability update unit


    • 770 Mixture weight update unit


    • 800 Source signal component estimation unit




Claims
  • 1. A signal analysis device comprising: a memory; andprocessing circuitry coupled to the memory and configured to:obtain, for a spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, a simultaneous decorrelation matrix P as a matrix in which all PHRjP are diagonal matrices, or/and Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to the J source signals for observation signal vectors based on observation signals acquired at I (I is an integral number equal to or larger than 2) different positions, wherein the source signals are sound, brain waves, or radio signals.
  • 2. The signal analysis device according to claim 1, wherein the processing circuitry is further configured to obtain, for the spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J), the simultaneous decorrelation matrix P as a matrix that is obtained so that all PHRjP become closer to diagonal matrices, or/and the Hermitian transposition PH thereof, as a parameter for decorrelating the components corresponding to the J source signals for the observation signal vectors based on the observation signals acquired at I (I is an integral number equal to or larger than 2) different positions.
  • 3. The signal analysis device according to claim 1, wherein the processing circuitry is further configured to update an estimation value of the simultaneous decorrelation matrix P based on a fixed point method, a gradient method, a natural gradient method, or a Newton's method.
  • 4. The signal analysis device according to claim 1, wherein the processing circuitry is further configured to obtain a simultaneously decorrelated observation signal vector in which the components corresponding to the J source signals are decorrelated by multiplying the Hermitian transposition PH of the simultaneous decorrelation matrix P obtained by observation signal vectors based on I input observation signals.
  • 5. The signal analysis device according to claim 4, wherein the processing circuitry is further configured to obtain a vector based on a simultaneously decorrelated source signal component estimation value by applying, to each element of the simultaneously decorrelated observation signal vector obtained, a single channel Wiener filter based on a diagonal component of a matrix PHRjP (j is an integral number equal to or larger than 1 and equal to or smaller than J) based on the simultaneous decorrelation matrix P.
  • 6. The signal analysis device according to claim 5, wherein the processing circuitry is further configured to obtain an estimation value of a source signal component by multiplying the vector based on the simultaneously decorrelated source signal component estimation value obtained by an inverse matrix of the Hermitian transposition PH of the simultaneous decorrelation matrix P.
  • 7. A signal analysis device comprising: a memory; andprocessing circuitry coupled to the memory and configured to:obtain a simultaneous decorrelation matrix P as a matrix including I (I is an integral number equal to or larger than 2) different generalized eigenvectors p1, . . . , pI as column vectors satisfying piHR2pm=0 (i and m are any integral numbers equal to or larger than 1 and equal to or smaller than I different from each other) for spatial covariance matrices R1 and R2 for modeling spatial characteristics of two source signals that are present in a mixed manner, or Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to two source signals for observation signal vectors based on observation signals acquired at I positions, wherein the source signals are sound, brain waves, or radio signals.
  • 8. The signal analysis device according to claim 7, wherein the processing circuitry is further configured to obtain a simultaneously decorrelated observation signal vector in which components corresponding to two source signals are decorrelated by multiplying the Hermitian transposition PH of the simultaneous decorrelation matrix P obtained by observation signal vectors based on I input observation signals.
  • 9. The signal analysis device according to claim 8, wherein the processing circuitry is further configured to obtain a vector based on a simultaneously decorrelated source signal component estimation value by applying, to each element of the simultaneously decorrelated observation signal vector obtained, a single channel Wiener filter based on a diagonal component of a matrix PHRjP (j is 1 or 2) based on the simultaneous decorrelation matrix P.
  • 10. The signal analysis device according to claim 9, wherein the processing circuitry is further configured to obtain an estimation value of a source signal component by multiplying the vector based on the simultaneously decorrelated source signal component estimation value obtained by an inverse matrix of the Hermitian transposition PH of the simultaneous decorrelation matrix P.
  • 11. A signal analysis device comprising: a memory; andprocessing circuitry coupled to the memory and configured to:obtain a simultaneously decorrelated observation signal vector in which components corresponding to J source signals are decorrelated by assuming, for a spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, a matrix P in which all PHRjP become diagonal matrices as a simultaneous decorrelation matrix for decorrelating components corresponding to the J source signals for observation signal vectors based on observation signals acquired at I (I is an integral number equal to or larger than 2) different positions, and multiplying Hermitian transposition of the simultaneous decorrelation matrix by the observation signal vector, wherein the source signals are sound, brain waves, or radio signals.
  • 12. The signal analysis device according to claim 11, wherein the processing circuitry is further configured to obtain a vector based on a simultaneously decorrelated source signal component estimation value by applying, to each element of the simultaneously decorrelated observation signal vector obtained, a single channel Wiener filter based on a diagonal component of a matrix PHRjP (j is an integral number equal to or larger than 1 and equal to or smaller than J) based on the simultaneous decorrelation matrix P.
  • 13. The signal analysis device according to claim 12, wherein the processing circuitry is further configured to obtain an estimation value of a source signal component by multiplying the vector based on the simultaneously decorrelated source signal component estimation value obtained by an inverse matrix of the Hermitian transposition PH of the simultaneous decorrelation matrix P.
  • 14. A signal analysis device comprising: a memory; andprocessing circuitry coupled to the memory and configured to:obtain a spatial covariance matrix as a parameter for modeling a spatial characteristic of a source signal based on a covariance matrix subjected to simultaneous decorrelation obtained from J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, and obtain a variance parameter as a parameter for modeling a variance for each sample point of the source signal based on the covariance matrix subjected to simultaneous decorrelation, wherein the source signals are sound, brain waves, or radio signals.
  • 15. The signal analysis device according to claim 14, wherein the processing circuitry is further configured to obtain the spatial covariance matrix or/and the variance parameter using an auxiliary function method.
  • 16. A signal analysis method executed by a computer, the signal analysis method comprising: obtaining, for a spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, a simultaneous decorrelation matrix P as a matrix in which all PHRjP are diagonal matrices, or/and Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to the J source signals for observation signal vectors based on observation signals acquired at I (I is an integral number equal to or larger than 2) different positions, wherein the source signals are sound, brain waves, or radio signals.
  • 17. A signal analysis method executed by a computer, the signal analysis method comprising: obtaining a simultaneous decorrelation matrix P as a matrix including I (I is an integral number equal to or larger than 2) different generalized eigenvectors p1, . . . , pI as column vectors satisfying piHR2pm=0 (i and m are any integral numbers equal to or larger than 1 and equal to or smaller than I different from each other) for spatial covariance matrices R1 and R2 for modeling spatial characteristics of two source signals that are present in a mixed manner, or Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to two source signals for observation signal vectors based on observation signals acquired at I positions, wherein the source signals are sound, brain waves, or radio signals.
  • 18. A signal analysis method executed by a computer, the signal analysis method comprising: obtaining a simultaneously decorrelated observation signal vector in which components corresponding to J source signals are decorrelated by assuming, for a spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, a matrix P in which all PHRjP become diagonal matrices as a simultaneous decorrelation matrix for decorrelating components corresponding to the J source signals for observation signal vectors based on observation signals acquired at I (I is an integral number equal to or larger than 2) different positions, and multiplying Hermitian transposition of the simultaneous decorrelation matrix by the observation signal vector, wherein the source signals are sound, brain waves, or radio signals.
  • 19. A signal analysis method executed by a computer, the signal analysis method comprising: obtaining a spatial covariance matrix as a parameter for modeling a spatial characteristic of a source signal based on a covariance matrix subjected to simultaneous decorrelation obtained from J (J is an integral number equal to or larger than 2) source signals that are present in a mixed with manner, and obtain a variance parameter as a parameter for modeling a variance for each sample point of the source signal based on the covariance matrix subjected to simultaneous decorrelation, wherein the source signals are sound, brain waves, or radio signals.
  • 20. A non-transitory computer-readable recording medium storing therein a signal analysis program that causes a computer to execute a process comprising: obtaining, for a spatial covariance matrix Rj (j is an integral number equal to or larger than 1 and equal to or smaller than J) for modeling spatial characteristics of J (J is an integral number equal to or larger than 2) source signals that are present in a mixed manner, a simultaneous decorrelation matrix P as a matrix in which all PHRjP are diagonal matrices, or/and Hermitian transposition PH thereof, as a parameter for decorrelating components corresponding to the J source signals for observation signal vectors based on observation signals acquired at I (I is an integral number equal to or larger than 2) different positions, wherein the source signals are sound, brain waves, or radio signals.
Priority Claims (3)
Number Date Country Kind
JP2018-030701 Feb 2018 JP national
JP2018-133592 Jul 2018 JP national
JP2018-164051 Aug 2018 JP national
PCT Information
Filing Document Filing Date Country Kind
PCT/JP2019/003671 2/1/2019 WO
Publishing Document Publishing Date Country Kind
WO2019/163487 8/29/2019 WO A
US Referenced Citations (6)
Number Name Date Kind
20140056435 Kjems Feb 2014 A1
20170140771 Taniguchi May 2017 A1
20170347206 Pedersen Nov 2017 A1
20190191259 Wang Jun 2019 A1
20190267019 Ito Aug 2019 A1
20190318757 Chen Oct 2019 A1
Non-Patent Literature Citations (10)
Entry
International Search Report and Written Opinion dated Apr. 23, 2019 for PCT/JP2019/003671 filed on Feb. 31, 2019, 8 pages including English Translation of the International Search Report.
Duong, N.Q.K., et al., “Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, No. 7, Sep. 2010, pp. 1830-1840.
Ito, N. and Nakatani, T., “FASTFCA-AS: Joint Diagonalization Based Acceleration of Full-Rank Spatial Covariance Analysis for Separating Any Number of Sources,” arXiv preprint, May 24, 2018, 5 pages.
Ito, N. and Nakatani, T., “Multiplicative Updates and Joint Diagonalization Based Acceleration for Under-Determined BSS Using a Full-Rank Spatial Covariance Model,” Proceeding of 2018 EEE Global Conference on Signal and Information Processing (GlobalSIP), Nov. 2018, pp. 231-235.
Ito, N. and Nakatani, T., FASTFCA-AS: Joint Diagonalization Based Acceration of Full-Rank Spatial Covariance Analysis for Separating Any Number of Sources, International Workshop on Acoustic Signal Enhancement (IWAENC2018), Tokyo, Japan, Sep. 2018, pp. 151-155.
Ito, N., et al., “FASTFCA: A Joint Diagonalization Based Fast Algorithm for Audio Source Separation Using a Full-Rank Spatial Covariance Model,” arXiv preprint, May 17, 2018, 5 pages.
Ito, N., et al., “FastFCA: Acceleration of Sound Source Categorization Methods that Use Time-Varying Complex Gaussian Distributions Based on Simultaneous Diagonalization of Spatial Covariance Matrices,” Proceedings of the 2018 Spring Meeting of the Acoustical Society of Japan, Feb. 27, 2018, pp. 427-430 (with English Translation).
Ito, N., et al., “FastFCA: Joint Diagonalization Based Acceleration of Audio Source Separation Using a Full-Rank Spatial Covariance Model,” 2018 26th European Signal Processing Conference (EUSIPCO), Sep. 2018, pp. 1681-1685.
Ito, N., et al., “New application of the simultaneous diagonalization based on the problem of universalization eigenvalue: Speedup of the sound source separation,” Program of the 14th Joint Meeting of Study Groups, The Japan Society for Industrial and Applied Mathematics, Mar. 12, 2018, 20 pages (with partial translation).
Ito, N. and Nakatani, T., Accepted Papers; FASTFCA-AS: Joint Diagonalization Based Acceleration of Full-Rank Spatial Covariance Analysis for Separating Any Number of Sources, International Workshop on Acoustic Signal Enhancement (IWAENC2018), http://www.iwaenc2018.org/abstracts/57798.html, Tokyo, Japan, Sep. 2018, 1 Page.
Related Publications (1)
Number Date Country
20200411031 A1 Dec 2020 US