The present invention relates to a technique for separating and reproducing acoustic signals, and more particularly, to a technique for separating and reproducing different acoustic signals that are mixed with one another.
The signal separating and reproducing apparatus 1000 operates in the following manner. Individual channel input signals Xj(t) are supplied to the input terminal 1 and the input terminal 2. Here, j indicates the channel number (j=1, 2), and t indicates the time sample number. Both individual channel input signals are supplied to the separation filter analyzing unit 3.
The separation filter analyzing unit 3 separates acoustic and voice signals that are convoluted in the individual channel input signals. More specifically, the separation filter analyzing unit 3 performs a frequency transform on each of the individual channel input signals, so as to calculate a frequency sequence Xj(k,n). Here, k indicates the frequency component number (k=0, 1, . . . , N/2−1), N indicates the block length of the frequency transform, and n presents the frame number (n=0, 1, . . . ). The separation filter analyzing unit 3 regards every frequency component as an instantaneous mixture, and carries out an independent component analysis (hereinafter referred to as the “frequency region independent component analysis”), so as to calculate a separation filter frequency characteristics matrix W(k).
The separation filter frequency characteristics matrix W(k) is a matrix formed with two rows and two columns, with Wij(k) being the elements of the matrix as shown in the following equation. Here, i presents the separation signal number (i=1, 2), and j indicates the channel number.
The frequency region independent component analysis is a technique for separating linearly-coupled signals, based on the statistical independence between signals. Such a technique is disclosed in the later described Non Patent Literature 1, for example. It is known that such a frequency region independent component analysis has the problem that the order of separation signal numbers (i=1, 2) of the matrix elements Wij(k) at the respective frequency components becomes uncertain, and the problem that the sizes of the matrix elements Wij(k) at the respective frequency components become uncertain. To eliminate the uncertainty about the order, which is the former problem, there is a technique by which the continuity of the frequency direction is used, or a technique by which the arrival direction is used, for example.
To solve the problem about the sizes of the matrix elements, which is the latter problem, the following technique has been known. In a case where a separation and reproduction filter frequency characteristics matrix Mi(k) is generated by combining the separation filter frequency characteristics matrix W(k) and the reproduction filter frequency characteristics matrix W−1(k), which is the inverse matrix formed from the separation filter frequency characteristics matrix W(k) at the respective frequencies, uncertainty is not caused in the sizes of the matrix elements. The separation and reproduction filter frequency characteristics matrix Mi(k) is expressed by the following equation (2):
Mi(k)=W−1(k)·Pi(k)·W(k)i=1,2 (2)
Here, only the element on the ith row and the ith column of the matrix Pi(k) is “1”, and the other elements of the matrix Pi(k) are “0”, as expressed by the following equation (3):
A matrix W′(k) that is formed by adding coefficients a(k) and b(k) indicating the size uncertainties of the matrix elements of the separation filter frequency characteristics matrix W(k) to the separation filter frequency characteristics matrix W(k) is expressed by the following equation (4):
The separation and reproduction filter frequency characteristics matrix M′i(k) using the above matrix W′(k) can be expressed by the following equation (5):
As is apparent from the above, the separation and reproduction filter frequency characteristics matrix does not contain uncertainty about the sizes of the matrix elements.
The separation and reproduction filter calculating unit 4 performs an operation to eliminate the uncertainty about the sizes by the above described technique. More specifically, the reproduction filter frequency characteristics matrix W−1(k) is calculated by transforming the separation filter frequency characteristics matrix W(k) into an inverse matrix at the respective frequencies. The matrix W−1(k) and the original matrix W(k) are then combined, so that the above mentioned separation and reproduction filter frequency characteristics matrix Mi(k) is calculated. Further, an inverse frequency transform is performed on the separation and reproduction filter frequency characteristics matrix Mi(k) for each of the matrix elements Mij(i)(k) (i=1, 2; I=1, 2; j=1, 2), so as to calculate eight separation and reproduction filter coefficients Mij(i)(s) (s=0, 1, . . . , N−1). Here, I indicates the channel number of each separation signal (I=1, 2).
The separation and reproduction filter unit 5 implements filtering on input signals xj(t) (j=1, 2) for the respective channels, with use of four separation and reproduction filter coefficients Mij(1)(k) (I=1, 2; j=1, 2). Synthesized signals zI(1)(t) of the respective channels are then calculated according to the following equation (6). Here, “*” indicates a convolution operation.
zi(1)(t)=mI1(1)(s)*xi(t)+mI2(1)(s)*x2(t)I=1,2 (6)
Like the separation and reproduction filter unit 5, the other separation and reproduction filter unit 6 implements filtering on input signals xj(t) (j=1, 2) for the respective channels, with use of four separation and reproduction filter coefficients Mij(2)(k) (I=1, 2, j=1, 2). Synthesized signals zI(2)(t) of the respective channels are then calculated according to the following equation (7).
zI(2)(t)=mI1(2)(s)*xI(t)+mI2(2)(s)*x2(t)I=1,2 (7)
As a result of the above operation, the output terminal 7 outputs the synthesized signals zI(1)(t) of the corresponding channel, the output terminal 8 outputs the synthesized signal z2(1)(t) of the corresponding channel, the output terminal 9 outputs the synthesized signals zI(2)(t) of the corresponding channel, and the output terminal 10 outputs the synthesized signals z2(2)(t) of the corresponding channel.
{Citation List}
{Non Patent Literature}
{NPL 1} Shuxue. Ding, Masashi Otsuka, Masaki Ashizawa, Teruo Niitsuma, Kazuyoshi Sugai, “Blind source separation of real-world acoustic signals based on ICA in time-frequency-domain”, Technical Report of IEICE, SP2001-1, p.p. 1-8, April 2001
In a conventional signal separating and reproducing apparatus, however, there is uncertainty remaining about each separation filter and reproduction filter. Therefore, those filters cannot be calculated independently of each other. This is because, while the sizes of the matrix elements of the separation filter frequency characteristics matrix W(k) are uncertain with respect to each frequency component, an inverse frequency transform is performed only on the separation and reproduction filter frequency characteristics matrix Mi(k) obtained by combining the separation filter frequency characteristics matrix W(k) and its inverse matrix W−1(k), so as to eliminate the uncertainty. The separation and reproduction filter coefficients Mij(i)(s) are calculated in this manner.
In a case where a separation filter and a reproduction filter cannot be calculated separately from each other as described above, it is difficult to change the characteristics of one of the filters. Particularly, to control a localization of a sound source of channel-specific synthesized signals obtained at last, adjusting the characteristics of the reproduction filter is advantageous. In a conventional apparatus, however, it is difficult to recognize only the characteristics of the reproduction filter for the above described reasons. Therefore, it is also difficult to control the reproduction filter through external control operations.
The object of the present invention is to provide a technique for calculating a separation filter and a reproduction filter independently of each other in a signal separating and reproducing apparatus.
A signal separating and reproducing apparatus according to the present invention includes: a separation filter analyzing unit calculating a first matrix which indicates frequency characteristics of a separation filter from input signals of a plurality of channels; a filter coefficient restricting unit calculating restriction coefficients for restricting the separation filter, calculating a second matrix by using the restriction coefficients and the first matrix, and calculating separation filter coefficients by using the second matrix; a separation filter unit calculating separation signals by filtering on the input signals of the plurality of channels by using the separation filter coefficients; a reproduction filter calculating unit calculating a third matrix by transforming the second matrix into an inverse matrix at each frequency, and calculating reproduction filter coefficients by using the third matrix; and a reproduction filter unit calculating synthesized signals corresponding to the respective channels by filtering on the separation signals by using the reproduction filter coefficients, wherein the filter coefficient restricting unit calculates the restriction coefficients so that the reproduction filter coefficients indicate filter coefficients which perform a sound source localization on the separation signals.
In accordance with the present invention, a separation filter and a reproduction filter can be calculated independently of each other. Accordingly, it is possible to independently handle the reproduction filter having a characteristic of localizing a sound source and the separation filter having other characteristics than that of the reproduction filter.
{FIG. 1} A block diagram illustrates the structure of a first embodiment of the present invention;
{FIG. 2} A block diagram illustrates the structure of a second embodiment of the present invention;
{FIG. 3} A block diagram illustrates the structure of a third embodiment of the present invention; and
{FIG. 4} A block diagram illustrates the structure of a conventional signal separating and reproducing apparatus.
1, 2, 103, 121, 122: input terminal
3: separation filter analyzing unit
4: separation and reproduction filter calculating unit
5, 6: separation and reproduction filter unit
7, 8, 9, 10, 112, 113: output terminal
101: filter coefficient restricting unit
102: reproduction filter calculating unit
104: separation filter unit
105, 106: reproduction filter unit
110: reproduction filter feature quantity extracting unit
111: compression processing unit
120: reproduction filter creating unit
123: decoding unit
1000, 1001: signal separating and reproducing apparatus
1002: signal separating and reproducing system
1002A: signal separating and compressing apparatus
1002B: signal decoding and reproducing apparatus
A filter coefficient restricting unit 101 uses the separation filter frequency characteristics matrix W(k) calculated by the separation filter analyzing unit 3, to calculate a restricted separation filter frequency characteristics matrix Ws(k). The restricted separation filter frequency characteristics matrix Ws(k) is equivalent to the second matrix of the present invention. The filter coefficient restricting unit 101 also uses the restricted separated filter frequency characteristics matrix Ws(k), to calculate restricted separation filter coefficients wsij(s). So as to eliminate the uncertainty about the size of the separation filter frequency characteristics matrix W(k), the former restricted separation filter frequency characteristics matrix Ws(k) is calculated according to the following equation (8):
Here, Ci(k) (i=1, 2) indicate the restriction coefficients. The restriction coefficients Ci(k) of this embodiment are calculated so that the restricted separation signals ysi(t) (i=1, 2) from the later described separation filter unit 104 become the sum signals of the synthesized signals of the respective channels.
The calculation of the restriction coefficients Ci(k) is now described. The frequency characteristics of the separation signal separated by the separation filter frequency characteristics matrix W(k) are indicated as Yi(k), the frequency characteristics of the restricted separation signal separated by the restricted separation filter frequency characteristics matrix Ws(k) are indicated as Ysi(k), and the frequency characteristics of each synthesized signal of the respective channels are indicated as zii(k). When input acoustic signals are separated and reproduced, the synthesized signals of the respective channels to be ultimately obtained are the same, regardless of whether the separation filter frequency characteristics matrix W(k) or the restricted separation filter frequency characteristics Ws(k) is used. This is because uncertainty is not caused in separation and reproduction filters. In view of the above, the following equations (9) and (10) are established:
Also, the conditions under which the restricted separation signal ysi(t) becomes equal to the sum signal of synthesized signals of respective channels can be expressed by the following equation (11):
Ysi(k)=z1i(k)+z2i(k) (11)
Accordingly, the restricted coefficient Ci(k) should be calculated so as to satisfy the above equation (11). More specifically, the restricted coefficient Ci(k) is calculated according to the following equations (12) and (13):
The filter coefficient restricting unit 101 performs an inverse frequency transform on the restricted separation filter frequency characteristics matrix Ws(k) calculated according to the above equation (8), for each matrix element Wsij(k) (i=1, 2; j=1, 2). By doing so, the filter coefficient restricting unit 101 calculates four restricted separation filter coefficients wsij(s) (s=0, 1, 2, . . . , N−1).
The reproduction filter calculating unit 102 uses the restricted separation filter frequency characteristics matrix Ws(k) calculated by the filter coefficient restricting unit 101, so as to calculate corrected reproduction filter coefficients a′ij(s) (s=0, 1, 2, . . . , N−1). To do so, the reproduction filter calculating unit 102 first calculates the restricted reproduction filter frequency characteristics matrix Ws−1(k) by transforming the restricted separation filter frequency characteristics matrix Ws(k) into an inverse matrix at the respective frequencies. The matrix Ws−1(k) is equivalent to the third matrix of the present invention. With the restricted reproduction filter frequency characteristics matrix Ws−1(k) being used, the synthesized signals of the respective channels are expressed by the following equation (14):
By adding the equation (11) expressing that the restricted separation signal ysi(t) is equal to the sum signal of the synthesized signals of the respective channels, to the equation (14), the relationships expressed by the following equations (15) and (16) are established:
The filter characteristics indicated as the corrected reproduction filter coefficients a′Ii(s) can be such characteristics that retransform the sum signal of the synthesized signals of the channels into the synthesized signals of the respective channels, that is, characteristics which performs a localization of a sound source on the restricted separation signals ysi(t). This is based on the fact that the filter coefficient restricting unit 101 calculates the restriction coefficients Ci(k) so that the restricted separation signal ysi(t) becomes equal to the sum signal of the synthesized signals of each channel.
Where the matrix elements of the restricted reproduction filter frequency characteristics matrix Ws−1(k) are indicated as AIi(k) (I=1, 2; i=1, 2), the relationship among the matrix elements is expressed by the following equation (17):
A1i(k)+A2i(k)=1 (17)
Here, the amplitude difference CLD and the phase difference CPD between the channels of the synthesized signals are explained. It is a known fact that the amplitude difference CLD and the phase difference CPD between the channels are important aspects for feeling the localization of a sound source where the separation signals come from. The amplitude difference CLD and the phase difference CPD between the channels of the synthesized signals are expressed by the following equations (18) and (19):
CLDi(k)=|A2i(k)|/|A1i(k)| (18)
CPDi(k)=∠A2i(k)−∠A1i(k) (19)
Here, “|A|” is the amplitude of a complex number A, and “∠A” is the phase of the complex number A.
The reproduction filter calculating unit 102 calculates the corrected reproduction filter frequency characteristics matrix A′(k) by correcting the restricted reproduction filter frequency characteristics matrix Ws−1(k) in accordance with an external control signal that is supplied via the output terminal 103. The matrix elements of the corrected reproduction filter frequency characteristics matrix A′(k) are indicated as A′Ii(k) (I=1, 2; i=1, 2). For example, the localized position of the sound source of the corrected synthesized signals or CLD and CPD may be used as the external control signal.
As an example of the correction technique, an operation to be performed when only CLD is supplied as the external control signal is described. With CLD supplied through external control being βi(k), the energy of the restricted reproduction filter frequency characteristics matrix Ws−1(k) is calculated according to the following equation (20):
The number of the channel having a large energy is then selected for each separation signal. If the selection result is “I=1”, for example, the matrix element A′1i(k) of the corrected restriction filter frequency characteristics matrix A′(k) corresponding to the channel number “I=1” is calculated so as to satisfy the following equations (21) and (22):
Also, in accordance with the relationship between the calculated matrix element A′1i(k) and the following equation (23) based on the equation (17), the matrix element A′2i(k) corresponding to the other channel number “I=2” is calculated:
A′1i(k)+A′2i(k)=1 (23)
The reproduction filter calculating unit 102 calculates four corrected reproduction filter coefficients a′Ii(s) (s=0, 1, 2, . . . , N−1) by performing an inverse frequency transform on each matrix element A′Ii(k) (I=1, 2; i=1, 2) of the corrected reproduction filter frequency characteristics matrix A′(k) obtained by the above result.
The separation filter unit 104 uses the four restricted separation filter coefficients wsij(s) (i=1, 2; j=1, 2) calculated by the filter coefficient restricting unit 101, so as to perform filtering on the input signals xj(t) (j=1, 2) of the respective channels, and calculates the restricted separation signals ysi(t) according to the following equation (24):
ysi(t)=wsi1(s)*xI(t)+wsi2(s)*x2(t)i=1,2 (24)
The reproduction filter unit 105 uses the two corrected reproduction filter coefficients a′I1(s) (I=1, 2) to perform filtering on the restricted separation signal ysi(t), and calculates the synthesized signal zI(1)(t) of each channel according to the following equation (25):
zI(1)(t)=a′I1(s)*xI(t)I=1,2 (25)
The reproduction filter unit 106 uses the two corrected reproduction filter coefficients a′I2(s) (I=1, 2) to perform filtering on the restricted separation signal ys2(t), and calculates the synthesized signal zI(2)(t) of each channel according to the following equation (26):
zI(2)(t)=a′I2(s)*x2(t)I=1,2 (26)
As a result of the above operation, the output terminal 7 outputs the synthesized signal zI(1)(t) of the corresponding channel, the output terminal 8 outputs the synthesized signal z2(1)(t) of the corresponding channel, the output terminal 9 outputs the synthesized signal zI(2)(t) of the corresponding channel, and the output terminal 10 outputs the synthesized signal z2(2)(t) of the corresponding channel.
In accordance with this embodiment, the filter coefficient restricting unit 101 calculates the separation filter coefficients wsij(s), so that the restricted separation signals ysi(t) are the sum signals of the synthesized signals of the respective channels. Accordingly, separation filters can be calculated independently of reproduction filters. Furthermore, the filter characteristics indicated as the corrected reproduction filter coefficients a′Ii(s) are such characteristics that retransform the sum signal of the synthesized signals of the respective channels into the synthesized signals of the respective channels, that is, the characteristic of localizing the sound source. Thus, the reproduction filter calculating unit 102 can control the sound source localization of the synthesized signals in accordance with external control signals.
Next, a second embodiment of the present invention is described in detail, with reference to the accompanying drawings. This embodiment concerns a signal separating and compressing apparatus that separates and compresses input acoustic signals.
A reproduction filter feature quantity extracting unit 110 calculates the restricted reproduction filter frequency characteristics matrix Ws−1(k) which is the inverse matrix formed from the restricted separation filter frequency characteristics matrix Ws(k) calculated by the filter coefficient restricting unit 101. The inversion is performed at each frequency. Further, the matrix elements AIi(k) (I=1, 2; i=1, 2) of the restricted reproduction filter frequency characteristics matrix Ws−1(k) are quantized and encoded in each sub band, and the filter feature quantities are output to an output terminal 112. The sub bands may be defined in an unjustified dividing manner such as the Barkscale, or may be defined based on the restricted reproduction filter frequency characteristics matrix.
A compressing unit 111 compresses the restricted separation signals ysi(t) (i=1, 2) calculated by the separation filter unit 104, and outputs the compressed signals as signal compression data to an output terminal 113. In the compressing process, for example, the restricted separations signals ysi(t) may be compressed independently of one another by a transform coding technique that is known as one of the techniques for efficiently encoding audio signals of music and the likes.
Like the filter coefficient restricting unit 101 of the first embodiment, the filter coefficient restricting unit 101 of this embodiment calculates separation filter coefficients wsij(s) so that the restricted separation signals ysi(t) are equivalent to the sum signals of the synthesized signals of the respective channels. Accordingly, in accordance with this embodiment, acoustic input signals can be separated and reproduced independently of one another. Also, the signal separating and compressing apparatus 1002A of this embodiment compresses the separation signals ysi(t) obtained through separating operations, and then outputs the compressed signals. Accordingly, the separation signals can be transmitted together with filter feature quantity information to other apparatus, for example.
Next, a third embodiment of the present invention is described in detail, with reference to the accompanying drawings. This embodiment concerns a signal separating and reproducing system 1002 that includes the signal separating and compressing apparatus 1002A, and a signal decoding and reproducing apparatus 1002B that is connected to the signal separating and compressing apparatus 1002A and performs reproducing operations.
In the structure of the signal decoding and reproducing apparatus 1002B, the four output terminals 7, 8, 9, and 10 are the same as those of the signal separating and reproducing apparatus 1000 shown in
A reproduction filter creating unit 120 calculates the restricted reproduction filter frequency characteristics matrix Ws−1(k), based on the filter feature quantities supplied from the signal separating and compressing apparatus 1002A via an input terminal 121. Like the reproduction filter calculating unit 102 (
A decoding unit 123 performs decoding on the signal compression data supplied from the signal separating and compressing apparatus 1002A via an input terminal 122, so as to generate the restricted separation signals ysi(t). This decoding operation is the opposite operation of the operation performed by the compressing unit 104 (
In accordance with this embodiment, the filter characteristics indicated as the corrected reproduction filter coefficients a′Ii(s) are the characteristic of localizing the sound source, the same as in the first embodiment. Thus, the reproduction filter calculating unit 102 can control the sound source localization of the synthesized signals in accordance with the external control signals.
In each of the above embodiments, the restriction coefficients Ci(k) are calculated so that a filter characteristics obtained by adding all the channels of the corrected reproduction filter coefficients a′Ii(s) has an entire bandpass characteristics, that is, all the synthesized signals contained in the separation signals ysi(t) are output from the reproduction filter units (105, 106). However, it is also possible to calculate the restriction coefficient Ci(k) so that the synthesized signals contained in the separation signals are partially output.
The present invention can be applied to various structures that separate and generate original signals from mixed signals of acoustic and voice signals. Also, the present invention may be realized by a computer program.
Number | Date | Country | Kind |
---|---|---|---|
2006-334440 | Dec 2006 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2007/073677 | 12/7/2007 | WO | 00 | 6/11/2009 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2008/072566 | 6/19/2008 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6590871 | Adachi | Jul 2003 | B1 |
7200561 | Moriya et al. | Apr 2007 | B2 |
7593449 | Shattil | Sep 2009 | B2 |
7894611 | Beckmann et al. | Feb 2011 | B2 |
8160273 | Visser et al. | Apr 2012 | B2 |
20020073128 | Egelmeers et al. | Jun 2002 | A1 |
20030046064 | Moriya et al. | Mar 2003 | A1 |
20090022336 | Visser et al. | Jan 2009 | A1 |
Number | Date | Country |
---|---|---|
8-110794 | Apr 1996 | JP |
11-109014 | Apr 1999 | JP |
2000-181499 | Jun 2000 | JP |
2002-149190 | May 2002 | JP |
2003-333682 | Nov 2003 | JP |
2004-507923 | Mar 2004 | JP |
2004-523752 | Aug 2004 | JP |
2004-302122 | Oct 2004 | JP |
2005-091560 | Apr 2005 | JP |
2005-266797 | Sep 2005 | JP |
2006-017961 | Jan 2006 | JP |
2006-084928 | Mar 2006 | JP |
2006-154314 | Jun 2006 | JP |
WO-0217488 | Feb 2002 | WO |
WO-2005024788 | Mar 2005 | WO |
Number | Date | Country | |
---|---|---|---|
20100030554 A1 | Feb 2010 | US |