SOUND-SOURCE SEPARATION METHOD, APPARATUS, AND PROGRAM

Description

TECHNICAL FIELD

The present disclosure relates to sound-source separation method, apparatus, and program that form a directivity toward a sound source located in an arbitrary direction based on a sound wave signal.

BACKGROUND ART

In order to precisely separate sound waves of a target sound source, and to suppress target external sound like noises, in general, it is necessary to apply a directional microphone, and to dispose the plural directional microphones side by side at equal to or wider than a certain pitch. However, in the case of a compact sound collector device like an IC recorder, it is difficult to apply the sound collecting technology of employing a directional microphone and of utilizing the plural microphones with a wide pitch. In addition, a precise sound-source separation by an application of such a sound collecting technologies to recorded sound from plural sound sources and having undergone an artificial down-mix process is also difficult.

Hence, a large number of technologies of analyzing an amplitude difference and a phase difference between signals output by respective microphones after recording of sound wave, and performing a signal processing in accordance with an analysis result, thereby separating and extracting a target sound source have been proposed. In recent years, a statistical analysis, a frequency analysis, a complex analysis, etc., are applied to detect a difference in waveform structure of input signals, and the detection result is utilized for a sound-source separation process.

For example, a signal processing such that a conversion from a time axis to a frequency axis is performed on an input signal, a phase difference for each frequency is calculated, a frequency band of an input sound wave from a target sound source is specified based on the calculated difference, and the sound wave within that frequency band is emphasized is performed (see Patent Document 1).

In addition, in the signal processing, it is determined whether or not an input sound wave is in a target direction based on input signals from two microphones closely disposed to each other, a phase difference between the two input signals is corrected, thereby emphasizing sound present in the target direction (see Patent Document 2). The two input signals are referred to each other, and a filter is sequentially updated based on an obtained signal (see Patent Document 3).

SUMMARY OF INVENTION
Technical Problem

Downsizing of sound collector devices or devices equipped with the sound collector device involves a further narrowing of the disposing pitch of microphones, and thus an amplitude difference and a phase difference between signals are quite small. Hence, a large amount of efforts to clearly specify such amplitude difference and phase difference is necessary. This is particularly remarkable in a low-frequency range that has a longer wavelength of several ten times or more than the pitch of two microphones, and in a high-frequency range where the phase difference of sound wave reaching the two microphones becomes equal to or longer than a cycle.

In recent years, as disclosed in Patent Documents 1 to 3, the frequency analysis, the complex analysis, or the statistical analysis to a waveform structure is becoming highly sophisticated, thereby coping with the narrow disposing pitch of microphones. However, the sophistication of the analysis results in an elongation of a frame length, a large number of delay devices, a long filter length, and a long filter coefficient in the case of a conversion to a frequency range. Hence, because of the capacity of the arithmetic processing performance, it becomes difficult to form a real-time directivity. In order to reduce the arithmetic processing load, the number of microphones can be increased, but due to the limited dimension of a device, the pitch between microphones becomes further narrow.

The present disclosure has been made in order to address the above-explained technical problems of conventional technologies, and it is an objective of the present disclosure to provide sound-source separation method, apparatus, and program which can emphasize or suppress and output sound coming from an arbitrary direction with a little amount of calculation using microphones closely disposed to each other and without a highly sophisticated analysis.

Solution to Problem

To accomplish the above objective, a sound-source separation method according to an embodiment is to form a directivity in a specific direction relative to a pair of input signals, and the method includes:

a filtering step of filtering containing a delay by a specific time on one of the pair of input signals;

an interchanging step of, after the filtering step, alternately interchanging the pair of input signals through an interchanging circuit for each sampling, and generating a pair of interchanged signals;

a generating step of multiplying one of the interchanged signals by a coefficient m, and generating an error signal between the interchanged signals;

an updating step of calculating a recurrence formula of the coefficient m containing the error signal, and updating the coefficient m for each sampling; and an outputting step of multiplying the pair of input signals by the sequentially updated coefficient m and outputting resultant signals,

in which:

the specific time in the filtering step is equivalent to a time difference of sound wave that reaches a pair of microphones from the specific direction; and

in the filtering step, the pair of input signals originating from the sound wave from the specific direction is adjusted so as to have a same amplitude and a same phase.

In the filtering step, filtering may be performed on the one of the pair of input signals by a transfer function T1 that delays the input signal by the specific time, and

when a transfer function of sound wave from the specific direction to the microphone which outputs the input signal subjected to filtering is C11, and a transfer function of the sound wave to the other microphone is C12, the transfer function T1 may substantially satisfy T1×C11=C12.

The sound-source separation method may further include a delaying step of causing, to the other one of the pair of input signals, a delay time that is equal to or longer than a necessary time for sound wave to travel a distance between the pair of microphones,

in which in the filtering step, filtering may be performed on the one of the pair of input signals, the filtering containing a time delay obtained by adding the delay time by the delaying step and the specific time.

In the filtering step, filtering may be performed on the one of the pair of input signals by a transfer function T1 that delays the input signal by a specific time,

in the delaying step, the other one of the pair of input signals may be delayed by a transfer function D1 that delays the input signal by the delay time, and

In the generating and updating steps:

one of the interchanged signals may be caused to pass through a first integrator set with −1 time of a past coefficient m calculated one sampling before;

after through the first integrator, the pair of interchanged signals may be caused to pass through a first adder that adds those signals;

after through the first adder, the addition signal may be caused to pass through a second integrator set with a constant μ;

after through the second integrator, a resultant signal may be caused to pass through a third integrator set with the one interchanged signal before multiplied by the past coefficient m; and

after through the third integrator, a resultant signal may be caused to pass through a second adder set with a past coefficient m calculated one sampling before,

thereby updating the coefficient m for each sampling.

To accomplish the above objective, a sound-source separation apparatus according to an embodiment forms a directivity in a specific direction relative to a pair of input signals, and the apparatus includes:

a filter filtering containing a delay by a specific time on the one of the pair of input signals;

an interchanger alternately interchanging, after the filtering, the pair of input signals for each sampling, and generating a pair of interchanged signals;

an error signal generator multiplying one of the interchanged signals by a coefficient m, and generating an error signal between the interchanged signals;

a recurrence formula calculator calculating a recurrence formula of the coefficient m containing the error signal, and updating the coefficient m for each sampling; and

an integrator multiplying the pair of input signals by the sequentially updated coefficient m and outputting resultant signals,

in which:

the specific time in the filtering is equivalent to a time difference of sound wave that reaches a pair of microphones from the specific direction; and

in the filtering, the pair of input signals originating from the sound wave from the specific direction is adjusted so as to have a same amplitude and a same phase.

The filter may perform filtering on the one of the pair of input signals by a transfer function T1 that delays the input signal by the specific time, and

The sound-source separation apparatus may further include a delay that causes, to the other one of the pair of input signals, a delay time that is equal to or longer than a necessary time for sound wave to travel a distance between the pair of microphones,

in which the filter may perform filtering on the one of the pair of input signals, the filtering containing a time delay obtained by adding the delay time by the delaying step and the specific time.

The filter may perform filtering on the one of the pair of input signals by a transfer function T1 that delays the input signal by a specific time,

the delay may delay the other one of the pair of input signals by a transfer function D1 that delays the input signal by the delay time, and

The error signal generator and the recurrence formula calculator may:

cause one of the interchanged signals to pass through a first integrator set with −1 time of a past coefficient m calculated one sampling before;

after through the first integrator, cause the pair of interchanged signals to pass through a first adder that adds those signals;

after through the first adder, cause the addition signal to pass through a second integrator set with a constant μ;

after through the second integrator, cause a resultant signal to pass through a third integrator set with the one interchanged signal before multiplied by the past coefficient m; and

after through the third integrator, cause a resultant signal to pass through a second adder set with a past coefficient m calculated one sampling before,

thereby updating the coefficient m for each sampling.

To accomplish the above objective, a sound-source separation program according to an embodiment causes a computer to form a directivity in a specific direction relative to a pair of input signals, and the program causes the computer to function as:

a filter filtering containing a delay by a specific time on the one of the pair of input signals;

an interchanger alternately interchanging, after the filtering, the pair of input signals for each sampling, and generating a pair of interchanged signals;

an error signal generator multiplying one of the interchanged signals by a coefficient m, and generating an error signal between the interchanged signals;

a recurrence formula calculator calculating a recurrence formula of the coefficient m containing the error signal, and updating the coefficient m for each sampling; and

an integrator multiplying the pair of input signals by the sequentially updated coefficient m and outputting resultant signals,

in which:

the specific time in the filtering is equivalent to a time difference of sound wave that reaches a pair of microphones from the specific direction; and

in the filtering, the pair of input signals originating from the sound wave from the specific direction is adjusted so as to have a same amplitude and a same phase.

The filter may perform filtering on the one of the pair of input signals by a transfer function T1 that delays the input signal by the specific time, and

The sound-source separation program may further cause the computer to function as a delay that causes, to the other one of the pair of input signals, a delay time that is equal to or longer than a necessary time for sound wave to travel a distance between the pair of microphones,