(1) Field of the Invention
The present invention relates to an apparatus, a method and a program for distinguishing between a sound having the same fundamental period as a target sound but which differs therefrom and the target sound, and analyzing whether or not the target sound is contained in an evaluation sound. In particular, the present invention relates to an apparatus, a method and a program for analyzing whether or not a target sound is contained in an evaluation sound by determining a time period or a frequency band of the existence of a fundamental period of the target sound in the evaluation sound.
(2) Description of the Related Art
Techniques for analyzing fundamental periods are utilized and play important roles in a wide range of fields including mixed sound separation, sound discrimination and voice synthesis. For instance, a technique used in the field of mixed sound separation uses pitch, which is the fundamental period of voice, to extract voice from mixed sound containing aperiodic noise. In addition, there is a technique that uses fundamental periods of musical sounds to separate a performance of an orchestra into its respective instruments. Furthermore, a technique used in the field of voice synthesis creates synthetic voice by extracting pitch, which is a fundamental period of voice, as a parameter.
In a first conventional technique for analyzing fundamental periods, a fundamental period is extracted by calculating autocorrelation using a time-frequency structure (spectrogram) created using an auditory filter or through Fourier transform (for instance, refer to Slaney, Malcolm, et al., “A Perceptual Pitch Detector”, 1990, ICASSP (International Conference on Acoustics, Speech, and Signal Processing), IEEE, Chapter 3).
The first conventional technique performs Fourier transform on signals inputted at predetermined time intervals to calculate a time-frequency structure (spectrogram). Then, for a predetermined frequency, a fundamental period is extracted by calculating an autocorrelation of a power spectrum in the direction of the temporal axis.
Methods of determining autocorrelation and fundamental periods will now be described.
If a power spectrum at a given point in time (sample number)
n [Formula 1]
of a given frequency may be expressed as
X(n) [Formula 2]
autocorrelation
R(τ) [Formula 3]
may be calculated using Formula 4,
where
τ [Formula 5]
represents a candidate of the fundamental period (fundamental period candidate) and
N [Formula 6]
represents the number of samples in an area of analysis.
A fundamental period
tp [Formula 7]
is determined as the fundamental period candidate having the maximum autocorrelation (Formula 3), as expressed by Formula 8:
tp = argmax_τ R(τ). [Formula 8]
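The selection of the fundamental period by maximum autocorrelation can be illustrated with a minimal sketch. Formula 4 is not reproduced in this text, so the plain sum of products used for R(τ) below is an assumed, commonly used form, and the function names, the candidate range and the sample data are hypothetical.

```python
import math


def autocorrelation(x, tau):
    """R(tau) as a plain sum of products x[n] * x[n + tau] (assumed form)."""
    return sum(x[n] * x[n + tau] for n in range(len(x) - tau))


def fundamental_period(x, tau_min, tau_max):
    """Return the candidate lag with the maximum autocorrelation."""
    candidates = range(tau_min, tau_max + 1)
    return max(candidates, key=lambda tau: autocorrelation(x, tau))


# Hypothetical power-spectrum sequence X(n) that repeats every 8 samples.
x = [1.0 + math.cos(2 * math.pi * n / 8) for n in range(64)]
tp = fundamental_period(x, tau_min=2, tau_max=12)
```

The candidate range is limited to less than two periods in this sketch so that a multiple of the true period is not selected.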
In the example shown in
A second conventional technique for analyzing fundamental periods extracts a fundamental period by obtaining a time interval in which the size of a power spectrum equals or exceeds a predetermined threshold value using a temporal structure of a power spectrum at a given frequency, which is created through wavelet transform (for instance, refer to Japanese Unexamined Patent Application Publication No. 2004-126855 (claim 1, FIGS. 3 and 4)).
The second conventional technique performs wavelet transform on signals inputted at predetermined time intervals to calculate a temporal structure of a power spectrum. For instance, a binary wavelet transformed value
DyWT [Formula 9]
of an inputted signal
x(t) [Formula 10]
may be calculated using a scale parameter
a=2^j [Formula 11]
quantized by a binary sequence and a shift parameter
b [Formula 12]
according to Formula 13.
In this case, a frequency band to be analyzed is determined by the scale parameter (Formula 11). The shift parameter (Formula 12) corresponds to the number of samples.
In Formula 13,
g(x) [Formula 14]
is a wavelet function, while
g*(x) [Formula 15]
is a complex conjugate of the wavelet function (Formula 14).
a=2^4. [Formula 16]
The ordinate represents the power spectrum (Formula 13) while the abscissa represents sample numbers (Formula 12).
As shown in
A0 [Formula 17]
for detecting peaks in the power spectrum has been set, whereby the size of the spectrum and the threshold value (Formula 17) are compared to determine a peak that equals or exceeds the threshold value. The time interval of a peak that exceeds the threshold value is considered to be the fundamental period
tp. [Formula 18]
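The threshold-based peak detection of the second conventional technique can be sketched as follows. This is only an illustration under stated assumptions: the local-maximum test, the function names and the sample values are hypothetical and are not taken from the cited publication.

```python
def peak_positions(power, a0):
    """Indices of local maxima whose value equals or exceeds the threshold a0."""
    return [
        b for b in range(1, len(power) - 1)
        if power[b] >= a0 and power[b] > power[b - 1] and power[b] >= power[b + 1]
    ]


def peak_intervals(power, a0):
    """Sample intervals between successive peaks at or above the threshold."""
    peaks = peak_positions(power, a0)
    return [later - earlier for earlier, later in zip(peaks, peaks[1:])]


# Hypothetical temporal structure of a power spectrum at one scale parameter.
power = [0.0, 0.2, 1.0, 0.3, 0.1, 0.0, 0.2, 0.9, 0.3, 0.1, 0.0, 0.2, 1.1, 0.2]
intervals = peak_intervals(power, a0=0.8)
```

Because the peak heights scale with the volume of the sound, a fixed value of the threshold in such a sketch would need to be retuned whenever the volume changes.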
In the example shown in
A third conventional technique for analyzing fundamental periods determines a fundamental period (pitch) using a residual waveform pattern obtained by passing an original voice through a filter set to an inverse filter characteristic of a vocal tract articulatory equivalent filter. In this case, a cross-correlation between a residual waveform pattern at a given time interval and a single pitch waveform pattern (basic waveform pattern) used when synthesizing a voiced voice is determined, whereby the time interval of the peak of the cross-correlation is considered to be the fundamental period (pitch) (for instance, refer to Japanese Unexamined Patent Application Publication No. 63-5398 (claim 1, FIG. 3)).
The residual waveform pattern depicted in
However, with the first conventional technique, there is a problem in that, even for a sound having the same fundamental period as a target sound but which differs therefrom, since the same fundamental period value as the target sound is outputted, it is difficult to analyze fundamental periods while distinguishing between the sound having the same fundamental period as a target sound but which differs therefrom and the target sound. For instance, it is difficult to analyze fundamental periods while distinguishing between the voices of two male speakers with similar fundamental periods (pitches). As a result, it is difficult to analyze whether or not an evaluation sound contains the target sound.
In addition, the second conventional technique also has the problem in that, even for a sound having the same fundamental period as a target sound but which differs therefrom, since the same fundamental period value as the target sound is outputted, it is difficult to analyze fundamental periods while distinguishing between the sound having the same fundamental period as a target sound but which differs therefrom and the target sound. Therefore, it is difficult to analyze whether or not an evaluation sound contains the target sound. For instance, when analyzing fundamental periods while distinguishing between the voices of two male speakers with similar fundamental periods, since the maximum value of a power spectrum fluctuates according to the volume of a voice, it is difficult to set a threshold value when the maximum value of the power spectrum of the speaker that is not the target is greater than the maximum value of the power spectrum of the speaker that is the target.
Furthermore, the third conventional technique also has the problem in that, even for a sound having the same fundamental period as a target sound but which differs therefrom, since the same fundamental period value as the target sound is outputted, it is difficult to analyze fundamental periods while distinguishing between the sound having the same fundamental period as a target sound but which differs therefrom and the target sound. Therefore, it is difficult to analyze whether or not an evaluation sound contains the target sound.
The present invention has been made in consideration of the above problems, and an object thereof is to provide a target sound analysis apparatus and the like capable of distinguishing between a “target sound” and a “sound having the same fundamental period as a target sound but which differs therefrom”, and of analyzing whether or not the target sound is contained in an evaluation sound. In particular, the present invention is aimed at providing a target sound analysis apparatus and the like that determines a time period or a frequency band in which a fundamental period of the target sound exists in the evaluation sound.
In order to achieve the object, the target sound analysis apparatus according to the present invention analyzes whether or not an evaluation sound contains a target sound. The target sound analysis apparatus includes: a target sound preparation unit operable to prepare the target sound that is an analysis waveform to be used for analyzing a fundamental period; an evaluation sound preparation unit operable to prepare the evaluation sound that is a to-be-analyzed waveform in which a fundamental period is to be analyzed; and an analysis unit operable to (i) sequentially calculate differential values between the evaluation sound and the target sound at corresponding points in time, by temporally shifting the target sound with respect to the evaluation sound, (ii) calculate an iterative interval between the points in time where the differential value is equal to or lower than a predetermined threshold value, and (iii) judge whether or not the target sound exists in the evaluation sound, based on a period of the iterative interval and the fundamental period of the target sound.
Thus, since a differential value between an evaluation sound and a target sound is calculated and whether or not the target sound exists in the evaluation sound is judged based on a period of an iterative interval when the differential value is equal to or lower than a predetermined threshold value and a fundamental period of the target sound, it is now possible to distinguish between a sound having the same fundamental period as a target sound but which differs therefrom and the target sound and analyze the presence or absence of the target sound. This is due to the fact that the minimum value of the differential values becomes approximately zero when the evaluation sound is the target sound, whereas the minimum value of the differential values takes a large value that is distanced from zero when the evaluation sound has the same fundamental period as the target sound but differs from the target sound.
It is preferable that the target sound preparation unit is operable to prepare a target sound frequency pattern obtained by performing a frequency analysis on the target sound, that the evaluation sound preparation unit is operable to prepare an evaluation sound frequency pattern obtained by performing a frequency analysis on the evaluation sound, and that the analysis unit is operable to (i) sequentially calculate differential values between the evaluation sound frequency pattern and the target sound frequency pattern at corresponding points in time, by temporally shifting the target sound frequency pattern with respect to the evaluation sound frequency pattern, (ii) calculate an iterative interval between the points in time where the differential value is equal to or lower than a predetermined threshold value, and (iii) judge whether or not the target sound exists in the evaluation sound, based on a period of the iterative interval and the fundamental period of the target sound.
Thus, since a differential value between an evaluation sound frequency pattern and a target sound frequency pattern is calculated and whether or not the target sound exists in the evaluation sound is judged based on a period of an iterative interval when the differential value is equal to or lower than a predetermined threshold value and a fundamental period of the target sound, it is now possible to distinguish between a sound having the same fundamental period as a target sound but which differs therefrom and the target sound and analyze the presence or absence of the target sound. In this case, since the evaluation sound frequency pattern resulting from a frequency analysis of the evaluation sound and the target sound frequency pattern resulting from a frequency analysis of the target sound are used, it is now possible to analyze the presence or absence of the target sound on a per-frequency band basis. For instance, when analyzing an evaluation sound in which the target sound and noise are mixed, the presence or absence of the target sound may be analyzed by selecting a frequency band that is free of noise.
It is preferable that the target sound analysis apparatus further includes a sound information setting unit operable to set sound information regarding the target sound, wherein the target sound preparation unit is operable to prepare the target sound or the target sound frequency pattern, based on the set sound information.
Thus, since the target sound preparation unit prepares a target sound based on sound information set by the sound information setting unit, the target sound analysis apparatus is now capable of controlling a target sound to be prepared by the target sound preparation unit. In addition, since the target sound preparation unit prepares a target sound frequency pattern based on target sound-related sound information set by the sound information setting unit, the target sound analysis apparatus is now capable of controlling a target sound frequency pattern to be prepared by the target sound preparation unit. As a result, a user is now capable of setting a target sound using the sound information setting unit.
It is preferable that the sound information setting unit is operable to receive input of the target sound and set the inputted target sound as the sound information, and that the target sound preparation unit is operable to either set the inputted target sound as the target sound to be prepared or prepare the target sound frequency pattern by performing a frequency analysis on the inputted target sound.
Thus, since the target sound preparation unit uses a target sound inputted by the sound information setting unit as the target sound to be prepared, the target sound preparation unit is no longer required to prepare in advance a plurality of sounds to be used as candidates for the target sound (target sound candidates), and a reduction of storage capacity may be achieved. In addition, since the target sound preparation unit uses a target sound inputted by the sound information setting unit to create a target sound frequency pattern, the target sound preparation unit is no longer required to prepare in advance a plurality of target sound frequency patterns corresponding to the target sound candidates, and a reduction of storage capacity may be achieved.
It is further preferable that the target sound preparation unit is operable to store a plurality of candidates for the target sound or a plurality of candidates for the target sound frequency pattern, that the target sound analysis apparatus further includes a sound information setting unit operable to receive a selection signal for selecting one of the plurality of candidates for the target sound or one of the plurality of candidates for the target sound frequency pattern, and that the target sound preparation unit is operable to set the candidate for the target sound selected by the selection signal or the candidate for the target sound frequency pattern selected by the selection signal as the target sound to be prepared or the target sound frequency pattern to be prepared, respectively.
Thus, since a target sound may be prepared using target sound candidates stored in the target sound preparation unit, there is no need to input a target sound. As a result, the presence or absence of a target sound may be analyzed even when a target sound cannot be inputted. For instance, when analyzing the presence or absence of a male voice in ambient noise, while it is impossible to pick up a male voice in a quiet environment in ambient noise, the presence or absence of the male voice may be analyzed by using the male voice in a quiet environment stored in the target sound preparation unit. In addition, since the time required for inputting a target sound may be omitted, real time processing may be achieved.
Furthermore, since a target sound frequency pattern may now be prepared using candidates for the target sound frequency pattern (target sound frequency pattern candidates) stored in the target sound preparation unit, there is no need to input a target sound, perform frequency analysis, and create a target sound frequency pattern. As a result, a target sound may be analyzed even when the target sound cannot be inputted. For instance, when analyzing the presence or absence of a male voice in ambient noise, while it will be impossible to pick up a male voice in a quiet environment in ambient noise, the presence or absence of the male voice may be analyzed by using a target sound frequency pattern created by performing frequency analysis on the male voice in a quiet environment stored in the target sound preparation unit. In addition, since the time required for inputting a target sound or performing frequency analysis on the inputted target sound may be omitted, real time processing may be achieved.
It is still further preferable that the target sound analysis apparatus further includes a threshold value setting unit operable to (i) sequentially calculate differential values between the evaluation sound and the target sound at corresponding points in time, by temporally shifting the target sound with respect to a plurality of the evaluation sounds, (ii) calculate a minimum value among the differential values, and (iii) set the predetermined threshold value based on a maximum value of the plurality of the minimum values corresponding to the plurality of the evaluation sounds.
As a result, it is now possible to set a threshold value that is shared by a plurality of evaluation sounds. For instance, even for the same motorcycle sound, when a motorcycle sound collected in ambient noise and a motorcycle sound collected in an environment without ambient noise are respectively set as evaluation sounds, a threshold value shared by the two motorcycle sounds may be set. Therefore, an appropriate threshold value with respect to a plurality of target sounds may be set and the presence or absence of target sounds may be analyzed with respect to a plurality of target sounds. In addition, analytical errors on the presence or absence of a target sound may be reduced by appropriately controlling the threshold value.
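The threshold setting described above can be sketched as follows. The sum of absolute differences used as the differential value, the additive margin, and all names and sample values are assumptions made for illustration; the text only states that the threshold is based on the maximum of the per-evaluation-sound minimum values.

```python
def differential_values(evaluation, target):
    """Sum of absolute differences for each shift m of the target (assumed form)."""
    width = len(target)
    return [
        sum(abs(evaluation[m + n] - target[n]) for n in range(width))
        for m in range(len(evaluation) - width + 1)
    ]


def shared_threshold(evaluations, target, margin):
    """Largest of the per-sound minimum differential values, plus a margin."""
    minima = [min(differential_values(e, target)) for e in evaluations]
    return max(minima) + margin


target = [0.0, 1.0, 0.0, -1.0]        # one hypothetical fundamental period
quiet = target * 3                    # target collected without ambient noise
noisy = [s + 0.1 for s in quiet]      # same sound with a small disturbance
threshold = shared_threshold([quiet, noisy], target, margin=0.05)
```

In this sketch the quiet recording has a minimum differential value of zero and the disturbed recording a minimum of about 0.4, so the shared threshold comes out at about 0.45 and covers both.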
It is still further preferable that the target sound preparation unit is operable to prepare the target sound frequency pattern that includes at least one of an amplitude spectrum and a phase spectrum, the included spectrum being calculated from a cross correlation between the target sound and an aperiodic analysis waveform consisting of a predetermined frequency component, and the evaluation sound preparation unit is operable to prepare the evaluation sound frequency pattern that includes at least one of an amplitude spectrum and a phase spectrum, the included spectrum being calculated from a cross correlation between the evaluation sound and the aperiodic analysis waveform.
Thus, since a fundamental period of a target sound is analyzed using a target sound frequency pattern and an evaluation sound frequency pattern created using an aperiodic analysis waveform, periodic characteristics of the target sound and the evaluation sound appear. As a result, the presence or absence of the target sound may be analyzed. For instance, since the fundamental period of the target sound will even appear in a target sound frequency pattern of a frequency band that is higher than the fundamental period of the target sound, the presence or absence of the target sound may be analyzed even when noise is superimposed on a frequency band corresponding to the fundamental period of the target sound. In addition, since the fundamental period of the target sound appears in target sound frequency patterns across all frequency bands, fundamental periods may be analyzed on a per-frequency band basis to be used for target sound extraction.
It is still further preferable that the target sound preparation unit is operable to prepare the target sound frequency pattern that includes at least one of an amplitude spectrum and a phase spectrum, the included spectrum being calculated from respective cross correlations between the target sound and a plurality of local analysis waveforms that forms a portion of an analysis waveform consisting of a predetermined frequency component and that has predetermined temporal resolution, the evaluation sound preparation unit is operable to prepare the evaluation sound frequency pattern that includes at least one of an amplitude spectrum and a phase spectrum, the included spectrum being calculated from respective cross correlations between the evaluation sound and the plurality of the local analysis waveforms, and the analysis unit is operable to analyze the fundamental period of the target sound, by using, as a single group of data, the target sound frequency pattern prepared using the plurality of the local analysis waveforms and the evaluation sound frequency pattern prepared using the plurality of the local analysis waveforms, respectively.
Thus, since target sound frequency patterns prepared using a plurality of local analysis waveforms and evaluation sound frequency patterns prepared using a plurality of local analysis waveforms are respectively used as a single group of data to analyze a fundamental period, changes in temporal frequency structures at the frequency resolution of the analysis waveforms may be accommodated, and a fundamental period may be analyzed by seemingly increasing the frequency resolution. For instance, for a mixed sound, a fundamental period may be analyzed in a narrow frequency band with a low noise level. As a result, the presence or absence of a target sound in a mixed sound (evaluation sound) may be judged with greater accuracy.
It is still further preferable that the target sound analysis apparatus further includes a frequency setting unit operable to set each frequency band of the target sound frequency pattern and the evaluation sound frequency pattern which are used by the analysis unit, wherein the analysis unit is operable to analyze the fundamental period of the target sound, by using the target sound frequency pattern and the evaluation sound frequency pattern whose frequency band is set by the frequency setting unit.
Thus, frequency bands of target sound frequency patterns and evaluation sound frequency patterns used by the analysis unit may be controlled using the frequency setting unit. As a result, it is now possible to change a frequency band to be analyzed or the bandwidth of a frequency band to be analyzed. For instance, when analyzing the presence or absence of a target sound from an evaluation sound in which the target sound and noise are mixed, the fundamental period may be analyzed by selecting a frequency band that is free of noise.
The present invention may be achieved not only as a target sound analysis apparatus provided with such characteristic units, but also as a target sound analysis method that includes, as steps, the characteristic units included in the target sound analysis apparatus, as well as a program that enables a computer to function as the characteristic units included in the target sound analysis apparatus. It is needless to say that such programs may be distributed via a recording medium such as a CD-ROM (Compact Disc-Read Only Memory) or a communication network such as the Internet.
As seen, when a differential value of an evaluation sound and a target sound is calculated by temporally shifting the target sound with respect to the evaluation sound, the present invention is capable of distinguishing between a “target sound” and a “sound having the same fundamental period as a target sound but which differs therefrom” and analyzing whether or not the target sound is contained in the evaluation sound by judging whether or not the target sound exists in the evaluation sound based on a period of an iterative interval when the differential value is equal to or lower than a predetermined threshold value and the fundamental period of the target sound. In addition, even when the evaluation sound contains a noise or the like having a waveform pattern that suddenly resembles that of the target sound, accurate analysis may be performed on whether the evaluation sound is really a sudden noise or is the target sound.
Further Information about Technical Background to this Application
The disclosure of Japanese Patent Application No. 2006-005178 filed on Jan. 12, 2006 including specification, drawings and claims is incorporated herein by reference in its entirety.
The disclosure of PCT application No. PCT/JP2006/325548 filed Dec. 21, 2006, including specification, drawings and claims is incorporated herein by reference in its entirety.
These and other objects, advantages and features of the present invention will become apparent from the following description thereof taken in conjunction with the accompanying drawings that illustrate a specific embodiment of the invention. In the Drawings:
First, the concept of a target sound analysis method according to the present invention will be described.
The description will now start with a case where an evaluation sound is a target sound. By temporally shifting the target sound shown in
Next, a case will be described where the evaluation sound has the same fundamental period as the target sound, but is a sound that differs from the target sound. By temporally shifting the target sound shown in
As described above, differential values between an evaluation sound and a target sound are calculated, and an analysis is performed on whether or not the target sound exists in an evaluation sound based on an iterative interval of a differential value that is equal to or lower than the predetermined threshold value. In other words, analysis is performed such that the target sound is judged to exist in the evaluation sound when the period of the iterative time interval is approximately equal to the fundamental period of the target sound, and the target sound is judged not to exist in the evaluation sound when the period of the iterative time interval is not approximately equal to the fundamental period of the target sound. This configuration enables analysis to be performed on whether or not a target sound exists in an evaluation sound while distinguishing between a sound that has the same fundamental period as the target sound but differs therefrom and the target sound.
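The judgment described above can be sketched as follows, starting from differential values that have already been calculated. The grouping of adjacent below-threshold points into a single point, the tolerance parameter, and all names and sample values are assumptions made for illustration.

```python
def low_points(diffs, threshold):
    """One index per contiguous run of values <= threshold (the run's minimum)."""
    points, run = [], []
    for m, d in enumerate(diffs):
        if d <= threshold:
            run.append(m)
        elif run:
            points.append(min(run, key=lambda i: diffs[i]))
            run = []
    if run:
        points.append(min(run, key=lambda i: diffs[i]))
    return points


def target_exists(diffs, threshold, period, tolerance):
    """True when every iterative interval is approximately the fundamental period."""
    points = low_points(diffs, threshold)
    intervals = [b - a for a, b in zip(points, points[1:])]
    return bool(intervals) and all(abs(i - period) <= tolerance for i in intervals)


# Hypothetical differential values; the fundamental period is 4 samples in both cases.
same_sound = [0.0, 4.0, 4.2, 4.0, 0.0, 4.0, 4.2, 4.0, 0.0]
other_sound = [2.0, 4.0, 4.2, 4.0, 2.0, 4.0, 4.2, 4.0, 2.0]
found_same = target_exists(same_sound, threshold=0.5, period=4, tolerance=0)
found_other = target_exists(other_sound, threshold=0.5, period=4, tolerance=0)
```

The second sequence dips with the same period but never reaches the threshold, so no iterative interval is formed and the target sound is judged not to exist.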
In addition, by analyzing, based on iterative intervals, whether or not a target sound exists in an evaluation sound, even when the evaluation sound contains a noise or the like having a waveform pattern that partially resembles that of the target sound, accurate analysis may be performed on whether the evaluation sound is really a sudden noise or is the target sound (the details are described in the first embodiment).
The threshold value introduced in the present invention may be set as a value that is slightly greater than zero when the fundamental waveform pattern of the target sound does not fluctuate. In addition, when the fundamental waveform pattern of the target sound fluctuates, the threshold value may be set, by taking into consideration the fluctuation width of the fundamental waveform pattern of the target sound, to a value that is slightly larger than the maximum value of variation due to the fluctuation of the minimum value of the differential values. Furthermore, the threshold value may be adjusted through feedback of analysis error results. Moreover, when handling a plurality of target sounds, it is also possible to set a value for each target sound.
To provide a comparison with the present invention, results from a case where the third conventional technique is used are schematically shown in
The embodiments of the present invention will now be described with reference to the drawings.
A vehicle detection system 100 is a system that detects whether or not an evaluation sound S100 is a motorcycle sound, and if so, outputs an alarm sound S103. The vehicle detection system 100 includes a fundamental period analysis unit 101 and an alarm sound output unit 105.
The fundamental period analysis unit 101 is a processing unit that analyzes a fundamental period of the evaluation sound S100, and includes a target sound preparation unit 102, an evaluation sound preparation unit 103 and an analysis unit 104.
The target sound preparation unit 102 stores a target sound S101 and a fundamental period S105 of the target sound S101. The analysis unit 104 stores a threshold value S104. The target sound preparation unit 102 outputs the target sound S101 and the fundamental period S105 to the analysis unit 104. The evaluation sound preparation unit 103 receives the evaluation sound S100 as input and outputs it to the analysis unit 104. The analysis unit 104 temporally shifts the target sound S101 with respect to the evaluation sound S100 in order to sequentially calculate differential values of the evaluation sound S100 and the target sound S101 at corresponding points in time. It then analyzes whether or not the target sound S101 exists in the evaluation sound S100, based on the fundamental period S105 of the target sound S101 and on a period of an iterative time interval between differential values that are equal to or lower than the threshold value S104, and outputs a detection signal S102 to the alarm sound output unit 105 when the target sound S101 exists in the evaluation sound S100.
The target sound preparation unit 102 is an example of a target sound preparation unit that prepares a target sound that is an analysis waveform pattern to be used for analyzing a fundamental period.
The evaluation sound preparation unit 103 is an example of an evaluation sound preparation unit that prepares an evaluation sound that is a to-be-analyzed waveform pattern in which a fundamental period will be analyzed.
The analysis unit 104 is an example of an analysis unit that temporally shifts the target sound with respect to the evaluation sound in order to sequentially calculate differential values of the evaluation sound and the target sound at corresponding points in time, calculates an iterative interval between the points in time where the differential value is equal to or lower than a predetermined threshold value, and judges whether or not the target sound exists in the evaluation sound based on a period of the iterative interval and the fundamental period of the target sound.
The alarm sound output unit 105 presents the alarm sound S103 to the user when the detection signal S102 is inputted.
Next, operations of the vehicle detection system 100 configured as above will be described.
In this example, prior to the shipment of the vehicle detection system 100, a motorcycle sound is stored as the target sound S101 in the target sound preparation unit 102 (step 200), and the fundamental period S105 of the motorcycle sound that is the target sound S101 is also stored. In addition, the threshold value S104 is stored in the analysis unit 104.
An example of a motorcycle sound is shown in
First, activation of the vehicle detection system 100 causes the evaluation sound preparation unit 103 to start retrieving peripheral sounds of the user, which is an evaluation sound S100, using a microphone (step 201). In this example, the evaluation sound is retrieved from peripheral sounds of the user in 9 ms intervals which include several fundamental periods of the motorcycle sound. In other words, the peripheral sounds of the user are segmented every 9 ms and inputted for analysis of the fundamental period of the motorcycle sound.
Next, analysis is performed on whether or not the fundamental period of the motorcycle sound that is the target sound S101 stored in the target sound preparation unit 102 is included in the evaluation sound S100 which includes peripheral sounds of the user (step 202). More specifically, the analysis unit 104 temporally shifts the target sound S101 with respect to the evaluation sound S100 in order to sequentially calculate differential values of the evaluation sound S100 and the target sound S101 at corresponding points in time, and analyzes the fundamental period of the target sound S101 based on a period of an iterative time interval between differential values that are equal to or lower than the threshold value S104. Then, using the fundamental period S105, the analysis unit 104 outputs a detection signal S102 to the alarm sound output unit 105 when the target sound S101 exists in the evaluation sound S100.
An example of an evaluation sound is shown in
BH(n) (n=0, 1, . . . , L), [Formula 19]
where n is a value of discretized time, and, for this example, L is a value corresponding to 9 ms.
An example of an evaluation sound is shown in
BT(n) (n=0, 1, . . . , W), [Formula 20]
where n is a value of discretized time, and, for this example, W is a value corresponding to 3 ms that is the fundamental period of the target sound S101.
A differential value when the target sound S101 is temporally shifted with respect to the evaluation sound S100 is shown in
where m is a value of discretized time which corresponds to the point in time of the start of the evaluation sound S100 for which a differential value is determined. The differential value is a summation of the differences between the evaluation sound and the target sound for a time width W. In this example, since the evaluation sound is the target sound, the iterative time interval between the differential values is 3 ms, which matches the fundamental period S105 of the target sound.
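The formula for the differential value is not reproduced in this text. As a hedged sketch, the description "a summation of the differences between the evaluation sound and the target sound for a time width W" may be read as a sum of absolute differences; the absolute value is an assumption, and a squared difference would serve equally well. The names BH, BT, m and W follow the text, while the function name is hypothetical.

```python
def differential_values(evaluation, target):
    """Return D(m) = sum over n of |BH(m + n) - BT(n)| for every valid shift m.

    `evaluation` plays the role of BH(n) and `target` the role of BT(n).
    """
    w = len(target)
    if w == 0 or w > len(evaluation):
        raise ValueError("target must be non-empty and no longer than evaluation")
    return [
        sum(abs(evaluation[m + n] - target[n]) for n in range(w))
        for m in range(len(evaluation) - w + 1)
    ]


# Toy data: the evaluation sound is the target pattern repeated three times,
# so the differential value returns to zero once per repetition.
pattern = [0.0, 1.0, 0.5, -1.0]
d = differential_values(pattern * 3, pattern)
```

In this toy case the zeros of `d` are spaced by the length of the pattern, which mirrors the 3 ms iterative interval described above.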
At this point, the threshold value S104 is introduced. This threshold value S104 will be expressed as θ. In this example, the threshold value S104 has been stored in the analysis unit 104 prior to shipment of the vehicle detection system 100 and, in consideration of the fluctuation width of the fundamental waveform pattern of the target sound, is set to a value that is slightly greater than the maximum value of a variation due to the fluctuation of the minimum value of the differential values.
An example of an analysis method of the fundamental period of an evaluation sound is shown in
Next, since the fundamental period of the evaluation sound is 3 ms and is therefore in the range of 2.9-3.2 ms that is the fundamental period S105 of the target sound, the analysis unit 104 judges that the target sound S101 exists in the evaluation sound S100, and outputs the detection signal S102 to the alarm sound output unit 105 (step 203). The alarm sound output unit 105 presents the alarm sound S103 to the user at a timing where the detection signal S102 is inputted.
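The judgment of step 203 can be sketched as below, under several stated assumptions: the point in time of each dip is taken as the first sample of a run of differential values at or below the threshold, every interval between successive dips must fall in the 2.9-3.2 ms range, and the sampling rate is supplied by the caller. None of these details is fixed by the specification, and the function names are hypothetical.

```python
def below_threshold_onsets(diff_values, threshold):
    """Indices at which the differential value first drops to or below the threshold."""
    onsets = []
    previous_below = False
    for m, value in enumerate(diff_values):
        below = value <= threshold
        if below and not previous_below:
            onsets.append(m)
        previous_below = below
    return onsets


def target_exists(diff_values, threshold, sample_rate_hz, period_range_ms=(2.9, 3.2)):
    """Judge existence from the iterative interval between below-threshold points."""
    onsets = below_threshold_onsets(diff_values, threshold)
    if len(onsets) < 2:
        return False  # no repetition, hence no fundamental period
    intervals_ms = [
        (b - a) * 1000.0 / sample_rate_hz for a, b in zip(onsets, onsets[1:])
    ]
    low, high = period_range_ms
    # Assumption: all observed intervals must lie in the target's range.
    return all(low <= t <= high for t in intervals_ms)


# Synthetic differential values at an assumed 10 kHz sampling rate.
every_3ms = [0.0 if m % 30 == 0 else 1.0 for m in range(91)]
every_5ms = [0.0 if m % 50 == 0 else 1.0 for m in range(151)]
```

With these inputs, `target_exists(every_3ms, 0.2, 10000)` is true and `target_exists(every_5ms, 0.2, 10000)` is false, since a 5 ms interval lies outside the range of the fundamental period S105.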
In addition,
An example of the evaluation sound S100 is shown in
A differential value when the target sound S101 is temporally shifted with respect to the evaluation sound S100 is shown in
At this point, the threshold value S104 is introduced. In this example, similarly, the threshold value S104 has been stored in the analysis unit 104 prior to shipment of the vehicle detection system 100, and in consideration of the fluctuation width of the fundamental waveform pattern of the target sound, is set to a value that is slightly greater than the maximum value of a variation due to the fluctuation of the minimum value of the differential values. This value is the same as the value in the examples shown in
In such a case, either a fundamental period of the evaluation sound S100 does not exist or, even if one does exist, it is not in the range of 2.9-3.2 ms that is the fundamental period S105 of the target sound S101. The analysis unit 104 therefore judges that the target sound S101 does not exist in the evaluation sound S100, and does not output the detection signal S102 to the alarm sound output unit 105 (step 203). As a result, since the detection signal S102 is not inputted, the alarm sound output unit 105 does not present the alarm sound S103 to the user.
When the evaluation sound S100 has a fundamental period that differs from that of the target sound S101, the fundamental period S105 of the target sound S101 does not appear in the fundamental period of the evaluation sound S100. Therefore, the analysis unit 104 judges that the target sound S101 does not exist in the evaluation sound S100, and the alarm sound S103 is not presented to the user.
Finally, the operations of the above-described steps 201 to 203 are repeated until the vehicle detection system 100 is brought to a stop (step 204).
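The repetition of steps 201 to 204 can be sketched as a simple loop. The callables `analyze`, `alarm` and `should_stop` are hypothetical stand-ins for the analysis unit 104, the alarm sound output unit 105 and the stop condition of the system; they are not part of the specification.

```python
def run_detection(frames, analyze, alarm, should_stop=lambda: False):
    """Process evaluation-sound frames until the system is stopped."""
    alarms = 0
    for frame in frames:          # step 201: retrieve the next evaluation sound
        if should_stop():         # step 204: stop when the system is halted
            break
        if analyze(frame):        # step 202: is the target sound present?
            alarm()               # step 203: present the alarm sound
            alarms += 1
    return alarms


# Toy usage with a placeholder analysis rule (any positive frame sum).
events = []
count = run_detection(
    [[0, 0], [1, 0], [0, 2]],
    analyze=lambda frame: sum(frame) > 0,
    alarm=lambda: events.append("alarm"),
)
```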
As described above, according to the first embodiment of the present invention, a differential value between an evaluation sound and a target sound is calculated, and judgment is made on whether or not the target sound exists in the evaluation sound based on the period of an iterative interval and the fundamental period of the target sound for a differential value that is equal to or lower than the predetermined threshold value. As a result, analysis may now be performed on whether or not a target sound exists in an evaluation sound while distinguishing between a “sound that has the same fundamental period as the target sound but differs from the target sound” and the “target sound”.
A case will now be considered where, instead of the analysis unit 104, the existence of a target sound is judged solely by differential values between an evaluation sound and a target sound without analyzing the period of an iterative time interval. In other words, the target sound is judged to exist when the differential value is either zero or approaches zero. A method of judging the existence of a target sound solely by differential values is shown in
A first variation of the first embodiment will now be described.
The vehicle detection system 200 includes a fundamental period analysis unit 201 and the alarm sound output unit 105. The fundamental period analysis unit 201 includes a sound information setting unit 700, a target sound preparation unit 701, the evaluation sound preparation unit 103 and the analysis unit 104.
The analysis unit 104 stores a threshold value S104. The sound information setting unit 700 sets sound information S700 regarding the target sound, and outputs the sound information S700 to the target sound preparation unit 701. The target sound preparation unit 701 prepares the target sound S101 based on sound information S700 and at the same time prepares the fundamental period S105 of the target sound S101, and outputs the target sound S101 and the fundamental period S105 to the analysis unit 104. The evaluation sound preparation unit 103 inputs the evaluation sound S100, and outputs the same to the analysis unit 104. The analysis unit 104 sequentially calculates the differential values of the evaluation sound S100 and the target sound S101 at corresponding points in time, by temporally shifting the target sound S101 with respect to the evaluation sound S100. The analysis unit 104 analyzes whether or not the target sound S101 exists in the evaluation sound S100 based on the period of an iterative time interval of a differential value equal to or lower than the threshold value S104 and the fundamental period S105 of the target sound S101. The analysis unit 104 outputs a detection signal S102 to the alarm sound output unit 105 when the target sound S101 exists in the evaluation sound S100. The alarm sound output unit 105 presents the alarm sound S103 to the user when the detection signal S102 is inputted.
Next, operations of the vehicle detection system 200 configured as above will be described.
In this example, the threshold value S104 is stored in the analysis unit 104 prior to the shipment of the vehicle detection system 200. The threshold value S104 in this example is set to 0.2, which is a value that is slightly greater than zero.
First, the sound information setting unit 700 uses a microphone to retrieve a motorcycle sound that is sound information S700, and outputs the motorcycle sound to the target sound preparation unit 701 (step 800).
Next, the target sound preparation unit 701 prepares the target sound S101 by clipping a portion of the motorcycle sound that is sound information S700 (step 801). At the same time, the fundamental period of the motorcycle sound is determined and set as the fundamental period S105. In this example, since the motorcycle sound is the only target sound and no other sounds having the same fundamental period as the motorcycle sound are included, the fundamental period of the motorcycle sound is determined using the method according to the first conventional technique.
Activation of the vehicle detection system 200 causes the evaluation sound preparation unit 103 to start retrieving peripheral sounds of the user, which is an evaluation sound S100, using a microphone (step 201).
Next, analysis is performed on whether or not the fundamental period of the motorcycle sound that is the target sound S101 prepared by the target sound preparation unit 701 is included in the evaluation sound S100 which includes peripheral sounds of the user (step 202).
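As a rough illustration of step 801, the sketch below estimates a period by autocorrelation and clips one period as the target sound. It is a simplification: the first conventional technique computes the autocorrelation of a power spectrum along the temporal axis, whereas this sketch works directly on the time-domain waveform. The lag search range, the sampling rate and the function names are assumptions.

```python
import math


def estimate_period_samples(signal, min_lag, max_lag):
    """Lag in [min_lag, max_lag] with the largest normalized autocorrelation."""
    best_lag, best_score = None, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        terms = len(signal) - lag
        # Normalize by the number of terms so that longer lags are not penalized.
        score = sum(signal[n] * signal[n + lag] for n in range(terms)) / terms
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def clip_target(signal, start, period):
    """Clip one fundamental period of the sound as the target sound."""
    return signal[start:start + period]


# Synthetic stand-in for the recorded sound: a sinusoid repeating every
# 30 samples, i.e. 3 ms at an assumed 10 kHz sampling rate.
recorded = [math.sin(2 * math.pi * n / 30) for n in range(300)]
period = estimate_period_samples(recorded, min_lag=10, max_lag=50)
target = clip_target(recorded, start=0, period=period)
```

The upper lag bound is kept below twice the expected period so that the second repetition is not mistaken for the fundamental period.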
Next, judgment is made on whether or not an alarm sound should be presented. When the target sound exists, an alarm sound is outputted (step 203).
Since the steps 201, 202 and 203 are the same as in the first embodiment, descriptions thereof will be omitted.
Finally, the operations of the above-described steps 201 to 203 are repeated until the vehicle detection system 200 is brought to a stop (step 204).
As described above, since the target sound preparation unit 701 sets a target sound inputted by the sound information setting unit as the target sound to be prepared, the target sound preparation unit 701 is no longer required to prepare in advance a plurality of sounds to be used as target sound candidates, and reduction of storage capacity may be achieved.
Alternatively, in step 800, an evaluation sound S100 including the motorcycle sound may be inputted as sound information S700, and in step 801, a target sound S101 may be prepared by clipping the portion of the motorcycle sound from the sound information S700. In this case, the target sound S101 may be prepared even when sounds other than the target sound exist.
Another example of the sound information setting unit 700 and the target sound preparation unit 701 will now be described.
In this example, prior to the shipment of the vehicle detection system 200, a motorcycle sound, an engine sound of an automobile and a siren sound are stored as target sound candidates in the target sound preparation unit 701. In addition, a fundamental period corresponding to each target sound candidate is stored in the target sound preparation unit 701. Furthermore, the threshold value S104 is stored in the analysis unit 104.
An example of an engine sound of an automobile is shown in
Examples of target sound candidates are shown in
First, the sound information setting unit 700 presents the target sound candidates to the user.
Next, the target sound preparation unit 701 sets a target sound corresponding to the selection signal that is the sound information S700 as the target sound S101 (step 801). In addition, the fundamental period of the target sound corresponding to the selection signal is set as the fundamental period S105. In this example, the target sound S101 is the motorcycle sound and the fundamental period S105 is 2.9-3.2 ms, which is the fundamental period of the motorcycle sound.
Activation of the vehicle detection system 200 causes the evaluation sound preparation unit 103 to start retrieving peripheral sounds of the user, which is the evaluation sound S100, using a microphone (step 201).
Next, analysis is performed on whether or not the fundamental period of the motorcycle sound that is the target sound S101 prepared by the target sound preparation unit 701 is included in the evaluation sound S100 which includes peripheral sounds of the user (step 202).
Next, judgment is made on whether or not an alarm sound should be presented. When a target sound exists, an alarm sound is outputted (step 203).
Since the steps 201, 202 and 203 are the same as in the first embodiment, descriptions thereof will be omitted.
Finally, the operations of the above-described steps 201 to 203 are repeated until the vehicle detection system 200 is brought to a stop (step 204).
As described above, since a target sound may be prepared using target sound candidates stored in the target sound preparation unit 701, there is no need to input a target sound. As a result, a target sound may be analyzed even when a target sound cannot be inputted. For instance, when the existence of a motorcycle sound in ambient noise is analyzed, a motorcycle sound free of noise cannot be picked up in the noisy environment itself, but the existence of the motorcycle sound may still be analyzed by using the motorcycle sound recorded in a quiet environment and stored in the target sound preparation unit 701. In addition, since the time required for inputting a target sound may be omitted, real time processing may be achieved.
As described above, according to the first variation of the first embodiment of the present invention, since the target sound preparation unit 701 prepares a target sound based on sound information set by the sound information setting unit 700, the target sound to be prepared by the target sound preparation unit 701 may be controlled. As a result, a user is now capable of setting a target sound using the sound information setting unit 700.
A second variation of the first embodiment will now be described.
A vehicle detection system 300 includes a fundamental period analysis unit 301 and the alarm sound output unit 105.
The fundamental period analysis unit 301 includes a threshold value setting unit 1100, the sound information setting unit 700, the target sound preparation unit 701, the evaluation sound preparation unit 103 and the analysis unit 104.
A method will now be described in which the threshold value setting unit 1100 sets a threshold value based on a target sound prepared by the target sound preparation unit 701. In this example, the threshold value setting unit 1100 uses a “selection signal S1100A” shown in
In this example, prior to the shipment of the vehicle detection system, a “motorcycle sound”, an “engine sound of an automobile” and a “siren sound” are stored as target sound candidates in the target sound preparation unit 701. In addition, a fundamental period corresponding to each target sound candidate is stored in the target sound preparation unit 701. Furthermore, a threshold value corresponding to each target sound candidate stored in the target sound preparation unit 701 is stored in the threshold value setting unit 1100. In this case, a “threshold value of the motorcycle sound”, a “threshold value of the engine sound of an automobile” and a “threshold value of the siren sound” are stored. These threshold values are respectively set for each target sound candidate to a value that is slightly greater than the maximum value of a variation due to the fluctuation of the minimum value of differential values in consideration of the fluctuation width of the fundamental waveform pattern of the target sound candidate.
A threshold value setting method is shown in
The sound information setting unit 700 sets sound information S700 regarding the target sound, and outputs the sound information S700 to the target sound preparation unit 701. The target sound preparation unit 701 prepares the target sound S101 based on the sound information S700 and at the same time prepares the fundamental period S105 of the target sound S101, and outputs the target sound S101 and the fundamental period S105 to the analysis unit 104. The threshold value setting unit 1100 sets the threshold value S104 based on the target sound S101 prepared by the target sound preparation unit 701. The evaluation sound preparation unit 103 inputs the evaluation sound S100, and outputs the same to the analysis unit 104. The analysis unit 104 sequentially calculates the differential values of the evaluation sound S100 and the target sound S101 at corresponding points in time, by temporally shifting the target sound S101 with respect to the evaluation sound S100. The analysis unit 104 analyzes whether or not the target sound S101 exists in the evaluation sound S100 based on the period of an iterative time interval of a differential value equal to or lower than the threshold value S104 and the fundamental period S105 of the target sound S101. The analysis unit 104 outputs a detection signal S102 to the alarm sound output unit 105 when the target sound S101 exists in the evaluation sound S100. The alarm sound output unit 105 presents the alarm sound S103 to the user when the detection signal S102 is inputted.
Next, operations of the vehicle detection system 300 configured as above will be described.
In this example, the sound information setting unit 700 presents target sound candidates to the user to have the user select a target sound, and creates a selection signal (step 800). In this example, a motorcycle sound is selected.
Next, the target sound preparation unit 701 sets a target sound corresponding to the selection signal S1100A that is the sound information S700 as the target sound S101 (step 801). In this example, the motorcycle sound is selected as the target sound S101. In addition, the fundamental period of the target sound S101 corresponding to the selection signal S1100A is set as the fundamental period S105. In this example, the fundamental period S105 is 2.9-3.2 ms, which is the fundamental period of the motorcycle sound.
Since the steps 800 and 801 are the same as in the other example of the first variation of the first embodiment, descriptions thereof will be omitted.
Next, the threshold value setting unit 1100 sets a threshold value corresponding to the target sound S101 prepared by the target sound preparation unit 701 from the threshold values stored in the threshold value setting unit 1100 as the threshold value S104. In this example, since the motorcycle sound is selected as the target sound, a threshold value corresponding to the motorcycle sound is set as the threshold value S104 (step 1200).
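The selection of a target sound, its fundamental period and its threshold value from stored candidates (steps 800, 801 and 1200) amounts to a table lookup, sketched below. Only the 2.9-3.2 ms range of the motorcycle sound comes from the text; every other number, the key names and the `Candidate` type are hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    period_range_ms: tuple  # fundamental period, as (low, high) in ms
    threshold: float        # threshold value for this candidate


# Stored prior to shipment. Values marked "hypothetical" are placeholders,
# not figures from the specification.
CANDIDATES = {
    "motorcycle": Candidate((2.9, 3.2), 0.2),         # threshold hypothetical
    "automobile_engine": Candidate((5.0, 6.0), 0.3),  # hypothetical
    "siren": Candidate((1.0, 1.2), 0.1),              # hypothetical
}


def configure(selection_signal):
    """Return the fundamental period and threshold for the selected candidate."""
    candidate = CANDIDATES[selection_signal]
    return candidate.period_range_ms, candidate.threshold


period_range, threshold = configure("motorcycle")
```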
Activation of the vehicle detection system 300 causes the evaluation sound preparation unit 103 to start retrieving peripheral sounds of the user, which is the evaluation sound S100, using a microphone (step 201).
Next, analysis is performed on whether or not the fundamental period of the motorcycle sound that is the target sound S101 prepared by the target sound preparation unit 701 is included in the evaluation sound S100 which includes peripheral sounds of the user (step 202).
Next, judgment is made on whether or not an alarm sound should be presented. When a target sound exists, an alarm sound is outputted (step 203).
Since the steps 201, 202 and 203 are the same as in the first embodiment, descriptions thereof will be omitted.
Finally, the operations of the above-described steps 201 to 203 are repeated until the vehicle detection system 300 is brought to a stop (step 204).
As described above, since the analysis unit 104 is capable of analyzing a fundamental period using a threshold value corresponding to a target sound, it is now possible to switch among the target sounds whose existence is to be analyzed.
A method will now be described in which the user uses the threshold value setting unit 1100 to set a threshold value. In this example, the threshold value setting unit 1100 uses the “threshold value information S1100B” shown in
In this example, prior to the shipment of the vehicle detection system 300, a “motorcycle sound”, an “engine sound of an automobile” and a “siren sound” are stored as target sound candidates in the target sound preparation unit 701. In addition, a fundamental period corresponding to each target sound candidate is stored in the target sound preparation unit 701. Furthermore, the threshold value S104 is stored in the analysis unit 104. The threshold value is set to a value that is slightly greater than the maximum value of a variation due to the fluctuation of the minimum value of differential values in consideration of the fluctuation widths of the fundamental waveform patterns of all of the target sound candidates.
The sound information setting unit 700 sets sound information S700 regarding the target sound, and outputs the sound information S700 to the target sound preparation unit 701. The target sound preparation unit 701 prepares the target sound S101 based on the sound information S700 and at the same time prepares the fundamental period S105 of the target sound S101, and outputs the target sound S101 and the fundamental period S105 to the analysis unit 104. The threshold value setting unit 1100 sets the threshold value S104 based on the threshold value information S1100B inputted by the user. The evaluation sound preparation unit 103 inputs the evaluation sound S100, and outputs the same to the analysis unit 104. The analysis unit 104 sequentially calculates the differential values of the evaluation sound S100 and the target sound S101 at corresponding points in time, by temporally shifting the target sound S101 with respect to the evaluation sound S100. The analysis unit 104 judges whether or not the target sound S101 exists in the evaluation sound S100 based on the period of an iterative time interval of a differential value equal to or lower than the threshold value S104 and the fundamental period S105 of the target sound S101. When the analysis unit judges that the target sound S101 exists, the analysis unit 104 outputs a detection signal S102 to the alarm sound output unit 105. The alarm sound output unit 105 presents the alarm sound S103 to the user when the detection signal S102 is inputted.
Next, operations of the vehicle detection system 300 configured as above will be described.
First, the sound information setting unit 700 presents target sound candidates to the user to have the user select a target sound, and creates a selection signal (step 800). In this example, a motorcycle sound is selected.
Next, the target sound preparation unit 701 sets a target sound corresponding to the selection signal that is the sound information S700 as the target sound S101 (step 801). In this example, the motorcycle sound is selected as the target sound S101.
Since the steps 800 and 801 are the same as in the other example of the first variation according to the first embodiment, descriptions thereof will be omitted.
The threshold value setting unit 1100 then sets the value of the threshold value that is the threshold value information S1100B inputted by the user as the threshold value S104 (step 1200). As an alternative method, a threshold value stored in the analysis unit 104 may be adjusted in accordance with an increase/decrease in the threshold value that is the threshold value information S1100B inputted by the user, and set as the threshold value S104.
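The two ways of applying the threshold value information S1100B described above (an absolute value, or an increase/decrease of the stored value) can be sketched as follows. The function name, the keyword arguments and the rejection of negative results are assumptions made for illustration.

```python
def set_threshold(stored_threshold, user_value=None, user_delta=None):
    """Return the threshold value S104 from user-supplied threshold information."""
    if user_value is not None:
        result = user_value                    # user gives the threshold directly
    elif user_delta is not None:
        result = stored_threshold + user_delta  # user raises or lowers the stored value
    else:
        result = stored_threshold               # nothing supplied: keep the stored value
    if result < 0:
        # Assumption: the differential value is non-negative, so a negative
        # threshold could never be met.
        raise ValueError("threshold must be non-negative")
    return result
```

For example, `set_threshold(0.2, user_delta=0.1)` yields a value of about 0.3, while `set_threshold(0.2, user_value=0.5)` replaces the stored value outright.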
Activation of the vehicle detection system 300 causes the evaluation sound preparation unit 103 to start retrieving peripheral sounds of the user, which is the evaluation sound S100, using a microphone (step 201).
Next, analysis is performed on whether or not the motorcycle sound that is the target sound S101 prepared by the target sound preparation unit 701 is included in the evaluation sound S100 which includes peripheral sounds of the user (step 202).
Next, judgment is made on whether or not an alarm sound should be presented. When a target sound exists, an alarm sound is outputted (step 203).
Since the steps 201, 202 and 203 are the same as in the first embodiment, descriptions thereof will be omitted.
Finally, the operations of the above-described steps 201 to 203 are repeated until the vehicle detection system 300 is brought to a stop (step 204).
As described above, a user may now set an appropriate threshold value for a target sound using the threshold value setting unit 1100. As a result, analytical errors may be reduced.
A method will now be described in which the threshold value setting unit 1100 sets a threshold value based on the fluctuation width of the fundamental waveform pattern of the target sound S101 prepared by the target sound preparation unit 701. In this example, the threshold value setting unit 1100 uses “sound information S1100C” shown in
The sound information setting unit 700 outputs a sound that includes a target sound that is the sound information S700 regarding the target sound to the target sound preparation unit 701. The target sound preparation unit 701 prepares the target sound S101 based on the sound information S700 and at the same time prepares the fundamental period S105 of the target sound S101, and outputs the target sound S101 and the fundamental period S105 to the analysis unit 104. The threshold value setting unit 1100 sets a threshold value based on the fluctuation width of the fundamental waveform pattern of the target sound S101 prepared by the target sound preparation unit 701. The evaluation sound preparation unit 103 inputs the evaluation sound S100, and outputs the same to the analysis unit 104. The analysis unit 104 sequentially calculates the differential values of the evaluation sound S100 and the target sound S101 at corresponding points in time, by temporally shifting the target sound S101 with respect to the evaluation sound S100. The analysis unit 104 analyzes whether or not the target sound S101 exists in the evaluation sound S100 based on the period of an iterative time interval of a differential value equal to or lower than the threshold value S104 and the fundamental period S105 of the target sound S101. The analysis unit 104 outputs a detection signal S102 to the alarm sound output unit 105 when the target sound S101 exists in the evaluation sound S100. The alarm sound output unit 105 presents the alarm sound S103 to the user when the detection signal S102 is inputted.
Next, operations of the vehicle detection system 300 configured as above will be described.
First, the sound information setting unit 700 uses a microphone to retrieve a motorcycle sound that is sound information S700, and outputs the motorcycle sound to the target sound preparation unit 701 (step 800).
Next, the target sound preparation unit 701 prepares the target sound S101 by clipping a portion of the motorcycle sound that is the sound information S700 (step 801). At the same time, the fundamental period of the motorcycle sound is determined and set as the fundamental period S105. In this example, since the motorcycle sound is the only target sound and no other sounds having the same fundamental period as the motorcycle sound are included, the fundamental period of the motorcycle sound is determined using the method according to the first conventional technique.
Since the steps 800 and 801 are the same as in the first variation according to the first embodiment, descriptions thereof will be omitted.
Next, for the target sound S101, the threshold value setting unit 1100 inputs the motorcycle sound that is the sound information S700 as the sound information S1100C, and in consideration of the fluctuation width of the fundamental waveform pattern of the motorcycle sound, sets the threshold value S104 as a value that is slightly greater than the maximum value of a variation due to the fluctuation of the minimum value of the differential values (step 1200). In other words, the threshold value S104 is set in consideration of the fluctuation width of the fundamental waveform pattern of the target sound S101. In this example, the threshold value S104 is set using the same method as shown in
Activation of the vehicle detection system 300 causes the evaluation sound preparation unit 103 to start retrieving peripheral sounds of the user, which is the evaluation sound S100, using a microphone (step 201).
Next, analysis is performed on whether or not the fundamental period of the motorcycle sound that is the target sound S101 prepared by the target sound preparation unit 701 is included in the evaluation sound S100 which includes peripheral sounds of the user (step 202).
Next, judgment is made on whether or not an alarm sound should be presented. When a target sound exists, an alarm sound is outputted (step 203).
Since the steps 201, 202 and 203 are the same as in the first embodiment, descriptions thereof will be omitted.
Finally, the operations of the above-described steps 201 to 203 are repeated until the vehicle detection system 300 is brought to a stop (step 204).
As described above, since the threshold value setting unit 1100 is capable of automatically determining a threshold value that is appropriate for a target sound, there is no need to prepare a threshold value in advance. As a result, when target sounds to be analyzed are added, the user will not be required to set threshold values for the added target sounds, and improved usability may be achieved.
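One possible reading of the automatic threshold setting in step 1200 is sketched below. It assumes that the input sound has already been divided into aligned cycles of one fundamental period each, that the first cycle serves as the reference pattern, that differences are summed as absolute values, and that "slightly greater" means a 10% margin. All of these choices are assumptions rather than details from the specification.

```python
def threshold_from_fluctuation(cycles, margin=1.1):
    """Set the threshold just above the largest cycle-to-cycle differential value."""
    if len(cycles) < 2:
        raise ValueError("at least two cycles are needed to observe fluctuation")
    reference = cycles[0]
    # Differential value between the reference pattern and each later cycle:
    # this is how far the minimum of the differential values rises above
    # zero because of waveform fluctuation.
    minima = [
        sum(abs(a - b) for a, b in zip(reference, cycle)) for cycle in cycles[1:]
    ]
    return margin * max(minima)


# Toy cycles of a slightly fluctuating waveform pattern.
cycles = [
    [0.0, 1.0, 0.0, -1.0],
    [0.0, 1.1, 0.0, -1.0],
    [0.0, 0.9, 0.1, -1.0],
]
auto_threshold = threshold_from_fluctuation(cycles)
```

In this toy case the largest fluctuation is about 0.2, so the resulting threshold is about 0.22.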
As described above, according to the second variation of the first embodiment of the present invention, it is now possible to control the threshold value to be used by the analysis unit 104 using the threshold value setting unit 1100. Therefore, appropriate threshold values may be set for a plurality of target sounds and an analysis on whether or not a target sound exists may be respectively performed for the plurality of target sounds. In addition, analytical errors on whether or not a target sound exists may be reduced by appropriately controlling the threshold values.
Another method by which the analysis unit analyzes the existence of a target sound will be additionally described below. In this example, a method will be described in which the existence of a target sound is analyzed by clipping a portion of an evaluation sound and using the clipped portion as the target sound, and determining a fundamental period of the evaluation sound. In this case, the fundamental period of the target sound has not been stored in the fundamental period analysis unit.
A fundamental period analysis method according to this example is shown in
Differential values between the evaluation sound and the target sound A are determined in the same manner as in the first embodiment. In addition, differential values between the evaluation sound and the target sound B are determined in the same manner as in the first embodiment. The determined differential values are shown in
An auditory assistance system 1700 includes a fundamental period analysis unit 1701 and a sound extraction unit 1705. The fundamental period analysis unit 1701 includes a target sound preparation unit 1702, an evaluation sound preparation unit 1703 and an analysis unit 1704.
The target sound preparation unit 1702 stores a target sound frequency pattern S1702 for each frequency band obtained through frequency analysis of the target sound, and a fundamental period S1706 of the target sound. The analysis unit 1704 stores a threshold value S1705. The target sound preparation unit 1702 outputs the target sound frequency pattern S1702 and the fundamental period S1706 to the analysis unit 1704. The evaluation sound preparation unit 1703 inputs an evaluation sound S1700, and performs frequency analysis on the evaluation sound S1700 to output an evaluation sound frequency pattern S1701 for each frequency band to the analysis unit 1704. For each frequency band, the analysis unit 1704 sequentially calculates the differential values of the evaluation sound frequency pattern S1701 and the target sound frequency pattern S1702 at corresponding points in time, by temporally shifting the target sound frequency pattern S1702 with respect to the evaluation sound frequency pattern S1701. Based on the period of an iterative time interval of a differential value equal to or lower than the threshold value S1705 and the fundamental period S1706 of the target sound, the analysis unit 1704 outputs area information S1703 that is information regarding a time-frequency area in which the target sound exists in the evaluation sound S1700 to the sound extraction unit 1705. The sound extraction unit 1705 extracts a target sound using the area information S1703 and the evaluation sound frequency pattern S1701, and presents the target sound to the user.
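The per-band analysis performed by the analysis unit 1704 can be sketched as below. The sketch assumes that the frequency patterns are given as one sequence of (possibly complex) values per frequency band, that the differential value is a sum of absolute differences, and that the fundamental period is expressed in samples of that sequence. The area information is simplified to a mapping from band index to the time spans in which the period matches; the actual form of the area information S1703 is not specified here, and all names are hypothetical.

```python
def band_differentials(eval_band, target_band):
    """Differential values for one band, shifting the target over the evaluation."""
    w = len(target_band)
    return [
        sum(abs(eval_band[m + n] - target_band[n]) for n in range(w))
        for m in range(len(eval_band) - w + 1)
    ]


def matching_spans(diffs, threshold, period_range):
    """Pairs of successive below-threshold onsets whose spacing fits the period."""
    onsets, previous_below = [], False
    for m, value in enumerate(diffs):
        below = value <= threshold
        if below and not previous_below:
            onsets.append(m)
        previous_below = below
    low, high = period_range
    return [(a, b) for a, b in zip(onsets, onsets[1:]) if low <= b - a <= high]


def area_information(eval_patterns, target_patterns, thresholds, period_range):
    """Map each band index to the time spans in which the target sound exists."""
    area = {}
    for k in eval_patterns:
        diffs = band_differentials(eval_patterns[k], target_patterns[k])
        spans = matching_spans(diffs, thresholds[k], period_range)
        if spans:
            area[k] = spans
    return area


# Toy data: band 0 contains the repeated target pattern, band 1 does not.
pattern = [1.0, 0.0, -1.0, 0.5]
area = area_information(
    eval_patterns={0: pattern * 3, 1: [0.3] * 12},
    target_patterns={0: pattern, 1: pattern},
    thresholds={0: 0.1, 1: 0.1},
    period_range=(4, 4),
)
```

A sound extraction stage could then keep only the time-frequency cells listed in `area` when resynthesizing the target sound.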
The target sound preparation unit 1702 is an example of a target sound preparation unit that prepares a target sound frequency pattern obtained by performing frequency analysis on a target sound.
The evaluation sound preparation unit 1703 is an example of an evaluation sound preparation unit that prepares an evaluation sound frequency pattern obtained by performing frequency analysis on an evaluation sound.
The analysis unit 1704 is an example of an analysis unit that sequentially calculates differential values of the evaluation sound frequency pattern and the target sound frequency pattern at corresponding points in time, by temporally shifting the target sound frequency pattern with respect to the evaluation sound frequency pattern, calculates an iterative interval between the points in time where the differential value is equal to or lower than a predetermined threshold value, and judges whether or not the target sound exists in the evaluation sound based on a period of the iterative interval and the fundamental period of the target sound.
Next, operations of the auditory assistance system 1700 configured as above will be described.
In this example, prior to the shipment of the auditory assistance system 1700, a frequency pattern for each frequency band obtained by performing frequency analysis on the voice of the speaker A is stored as the target sound frequency pattern S1702 in the target sound preparation unit 1702 (step 1800), and the fundamental period S1706 of the voice of the speaker A that is the target sound is also stored. Furthermore, the threshold value S1705 is stored for each frequency band in the analysis unit 1704. In this example, the fundamental period S1706 of the voice of the speaker A that is the target sound is 3-12 ms. In addition, the target sound frequency pattern used herein may be obtained by performing discrete Fourier transform on the target sound in the same manner as in the first embodiment. Note that, in this example, the target sound is not a motorcycle sound but the voice of the speaker A.
where N is a window length of Fourier transform which is set shorter than the length W of the target sound, and k represents an index at the frequency band to be analyzed. Here,
BT(n) (n=0, 1, . . . , N) [Formula 23]
represents the target sound, while
represents an analysis waveform pattern.
In addition, the target sound frequency pattern S1702 may be expressed as
where t represents the point in time of the start of the target sound to be analyzed. The target sound frequency pattern represents a temporal structure at the frequency of the target sound. In this example, target sound frequency patterns are calculated by shifting t by 1 point.
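As an illustration of the above, the calculation of a frequency pattern for one frequency band while shifting the start point t by 1 point may be sketched as follows. This is a minimal sketch under stated assumptions and is not the formula of the embodiment: the function name frequency_pattern, the complex exponential used as the analysis waveform pattern, and the test tone are hypothetical and are introduced for illustration only.

```python
import cmath
import math


def frequency_pattern(signal, window_len, k):
    """Return the DFT coefficient of band k for every window start point t."""
    # Assumed analysis waveform pattern: one complex exponential of length N.
    kernel = [cmath.exp(-2j * math.pi * k * n / window_len)
              for n in range(window_len)]
    pattern = []
    # Shift the start point t by one sample at a time.
    for t in range(len(signal) - window_len + 1):
        coeff = sum(signal[t + n] * kernel[n] for n in range(window_len))
        pattern.append(coeff)
    return pattern


# Hypothetical target sound of length W: a cosine with 2 cycles per window N.
W, N = 48, 16
target = [math.cos(2 * math.pi * 2 * n / N) for n in range(W)]
target_pattern = frequency_pattern(target, N, k=2)
```

With a window length N shorter than the length W of the sound, the pattern holds W−N+1 values, and for this stationary test tone only the phase of the values changes as t advances.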
First, activation of the auditory assistance system 1700 causes the evaluation sound preparation unit 1703 to start using a microphone to retrieve the evaluation sound S1700, which is the peripheral sound of the user and which, in this example, is the mixed sound of the voices of the three speakers. In this example, the evaluation sound is retrieved in 30 ms segments, each of which includes several fundamental periods of the voice of the speaker A. In other words, the fundamental period of the voice of the speaker A is analyzed while the mixed sound is segmented every 30 ms and the segments are inputted. Frequency analysis is then performed on the evaluation sound S1700 to create an evaluation sound frequency pattern S1701 for each frequency band (step 1801). The method of creating evaluation sound frequency patterns is the same as the method of creating target sound frequency patterns, except that the target sound is replaced by the evaluation sound S1700. Let an evaluation sound frequency pattern at a given point in time be expressed as
where N is a window length of Fourier transform which is set shorter than the length L of the evaluation sound S1700, and k represents an index at the frequency band to be analyzed. Here,
BH(n) (n=1, 2, . . . , N) [Formula 27]
represents evaluation sound.
In addition, the evaluation sound frequency pattern S1701 may be expressed as
Next, analysis is performed on whether or not the fundamental period of the voice of the speaker A that is the target sound stored in the target sound preparation unit 1702 is included in the evaluation sound S1700 which includes a mixed sound of the voices of the three speakers (step 1802). More specifically, for each frequency band, the analysis unit 1704 sequentially calculates the differential values of the evaluation sound frequency pattern S1701 and the target sound frequency pattern S1702 at corresponding points in time, by temporally shifting the target sound frequency pattern S1702 with respect to the evaluation sound frequency pattern S1701. The analysis unit 1704 analyzes the fundamental period of the target sound based on the iterative time interval between differential values that are equal to or lower than the threshold value S1705. Using the fundamental period S1706, the analysis unit 1704 then outputs area information S1703 that is information regarding a time-frequency area in which the target sound exists in the evaluation sound S1700 to the sound extraction unit 1705.
where m is a value of discretized time which corresponds to the point in time of the start of the evaluation sound frequency pattern S1701 for which a differential value will be determined. The differential value is a summation of the differences between the evaluation sound frequency pattern and the target sound frequency pattern for a time width (W−N). In this example, since the evaluation sound frequency pattern is the target sound frequency pattern, the iterative time interval between the differential values matches the fundamental period S1706 of the target sound (3-12 ms). In this example, the iterative time interval between the differential values is 6 ms.
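The sequential calculation of differential values described above may be sketched as follows. This is only a sketch under stated assumptions: the sum of absolute differences is assumed as the difference measure, the function name differential_values is hypothetical, and the short real-valued sequences stand in for frequency patterns of one frequency band.

```python
def differential_values(eval_pattern, target_pattern):
    """For each shift m, sum |evaluation - target| over the target pattern."""
    span = len(target_pattern)
    diffs = []
    # m is the start point in the evaluation pattern being compared.
    for m in range(len(eval_pattern) - span + 1):
        d = sum(abs(eval_pattern[m + t] - target_pattern[t])
                for t in range(span))
        diffs.append(d)
    return diffs


# Hypothetical patterns: the evaluation pattern repeats every 4 points.
evaluation = [0.0, 1.0, 0.0, -1.0] * 5
target = [0.0, 1.0, 0.0, -1.0, 0.0, 1.0]
diffs = differential_values(evaluation, target)
```

In this toy case the differential value falls to zero at every shift that is a multiple of 4 points, so the iterative interval between the low differential values equals the period of the repeated pattern.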
At this point, the threshold value S1705 is introduced. Let the threshold value S1705 at the frequency band k be expressed as Θk. In this example, the threshold value S1705 has been stored in the analysis unit 1704 prior to shipment of the auditory assistance system, and in consideration of the fluctuation width of the fundamental waveform patterns of the target sound frequency pattern, the threshold value S1705 is set to a value that is slightly greater than the maximum value of a variation due to the fluctuation of the minimum value of the differential values.
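One way of deriving such a threshold value from measured data may be sketched as follows. The function name choose_threshold, the additive margin, and the sample values are assumptions made for illustration and do not appear in the embodiment.

```python
def choose_threshold(observed_minima, margin):
    """Return a value slightly above the largest observed minimum.

    observed_minima: minimum differential values measured while the
    fundamental waveform pattern of the target sound fluctuates.
    margin: small positive amount added on top (hypothetical parameter).
    """
    if not observed_minima:
        raise ValueError("at least one observed minimum is required")
    if margin <= 0:
        raise ValueError("margin must be positive")
    return max(observed_minima) + margin


# Hypothetical measurements for one frequency band k.
theta_k = choose_threshold([0.30, 0.42, 0.38], margin=0.05)
```

A threshold chosen in this manner stays above every minimum that was observed, while remaining close enough to them that shifts at which the patterns do not match are rejected.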
Next, since the fundamental period of the evaluation sound frequency pattern is 6 ms and is within the range of 3-12 ms that is the fundamental period S1706 of the target sound, the target sound is judged to exist in the evaluation sound frequency pattern S1701, and area information S1703 to the effect that “the target sound exists in frequency band k” is created.
In addition, with respect to the analysis unit 1704,
At this point, the threshold value S1705 is introduced. In this example, the threshold value S1705 has similarly been stored in the analysis unit 1704 prior to shipment of the auditory assistance system, and in consideration of the fluctuation width of the fundamental waveform pattern of the target sound frequency pattern, the threshold value S1705 is set to a value that is slightly greater than the maximum value of a variation due to the fluctuation of the minimum value of the differential values. This value is the same as the value in the example shown in
Next, since a fundamental period of the evaluation sound frequency pattern does not exist and therefore is not within the range of 3-12 ms that is the fundamental period S1706 of the target sound, it is judged that the target sound does not exist in the evaluation sound frequency pattern S1701, and area information S1703 to the effect that “the target sound does not exist in frequency band k” is created.
When the evaluation sound frequency pattern at the frequency band k is a sound that has a different fundamental period from the target sound, the fundamental period S1706 of the target sound does not appear in the fundamental period of the evaluation sound frequency pattern S1701 at the frequency band k. Thus, the analysis unit 1704 judges that the target sound does not exist in the evaluation sound frequency pattern S1701, and area information S1703 to the effect that “the target sound does not exist in frequency band k” is created.
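The three cases described above may be summarized by the following sketch of the judgment for one frequency band. The sketch makes several simplifying assumptions that are not stated in the embodiment: each dip of the differential value is assumed to fall to or below the threshold value at a single shift, every iterative interval is required to lie within the stored range, and the function names and the 1 kHz shift rate are hypothetical.

```python
def iterative_intervals(diffs, threshold):
    """Gaps, in shifts, between successive differential values <= threshold."""
    hits = [m for m, d in enumerate(diffs) if d <= threshold]
    return [b - a for a, b in zip(hits, hits[1:])]


def target_exists(diffs, threshold, shift_rate_hz, period_range_ms=(3.0, 12.0)):
    """Judge whether the target sound exists in this frequency band."""
    intervals = iterative_intervals(diffs, threshold)
    if not intervals:
        # No iterative interval: no fundamental period was found.
        return False
    low, high = period_range_ms
    # Assumption: all intervals must fall inside the stored range.
    return all(low <= 1000.0 * i / shift_rate_hz <= high for i in intervals)


# Hypothetical differential values, one shift per millisecond.
matching = [0.1 if m % 6 == 0 else 5.0 for m in range(30)]      # 6 ms interval
no_period = [5.0] * 30                                          # never below threshold
other_period = [0.1 if m % 20 == 0 else 5.0 for m in range(60)]  # 20 ms interval
```

With a threshold value of 0.5, only the first sequence yields an iterative interval (6 ms) that lies within the range of 3-12 ms, so only that frequency band would be recorded in the area information as containing the target sound.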
The above-described processing is performed for all frequency bands k (k=1, 2, . . . , N) to create finalized area information S1703.
Next, the sound extraction unit 1705 extracts a target sound using the area information S1703 and the evaluation sound frequency pattern S1701, and presents the target sound to the user (step 1803).
In this example, the frequency pattern of the time-frequency area of the evaluation sound frequency pattern S1701 described in the area information S1703 as “the target sound does not exist in frequency band k” is replaced with a zero value, while a frequency pattern of the extracted sound is created using the evaluation sound frequency pattern S1701 from the frequency pattern of the time-frequency area described as “the target sound exists in frequency band k”. The extracted sound S1704 is then created by performing an inverse Fourier transform on the frequency pattern of the extracted sound, and presented to the user through a speaker.
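The replacement with a zero value and the subsequent inverse Fourier transform may be sketched for a single segment as follows. This is a simplified sketch: a direct (non-fast) discrete Fourier transform is used for brevity, the area information is reduced to a list of band indices for one segment, and the function names dft, idft and extract as well as the two-tone test segment are assumptions introduced for illustration.

```python
import cmath
import math


def dft(x):
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


def extract(segment, bands_with_target):
    """Zero every band not listed in the area information, then invert."""
    spectrum = dft(segment)
    n = len(spectrum)
    keep = set()
    for k in bands_with_target:
        keep.add(k % n)
        keep.add((-k) % n)  # mirror bin, so that the output stays real
    masked = [c if k in keep else 0.0 for k, c in enumerate(spectrum)]
    return idft(masked)


# Hypothetical segment: band 2 holds the target sound, band 5 another sound.
n_points = 16
segment = [math.cos(2 * math.pi * 2 * t / n_points)
           + math.cos(2 * math.pi * 5 * t / n_points) for t in range(n_points)]
extracted = extract(segment, bands_with_target=[2])
```

In this toy case the extracted sound retains only the component of band 2, which corresponds to the area described as containing the target sound.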
Finally, the operations of the above-described steps 1801 to 1803 are repeated until the auditory assistance system 1700 is brought to a stop (step 1804).
As described above, since the second embodiment of the present invention calculates differential values between an evaluation sound frequency pattern and a target sound frequency pattern and analyzes a fundamental period based on an iterative interval between differential values that are equal to or lower than a predetermined threshold value, analysis of a fundamental period may be performed while distinguishing between a sound that differs from a target sound but has the same fundamental period as the target sound and the target sound. In this case, since an evaluation sound frequency pattern and a target sound frequency pattern resulting from respective frequency analyses of the evaluation sound and a target sound are used, it is now possible to analyze fundamental periods on a per-frequency band basis. For instance, mixed sound separation may be achieved by extracting the frequency pattern of a target sound from the frequency pattern of the mixed sound for each frequency band. As a result, it is now possible to judge whether or not an evaluation sound contains the target sound.
(Variation of the Second Embodiment)
A variation of the second embodiment will now be described.
An auditory assistance system 1800 includes a fundamental period analysis unit 1801 and the sound extraction unit 1705. The fundamental period analysis unit 1801 includes a sound information setting unit 2300, a target sound preparation unit 2301, the evaluation sound preparation unit 1703 and the analysis unit 1704.
The analysis unit 1704 stores a threshold value S1705. The sound information setting unit 2300 sets sound information S2300 regarding the target sound, and outputs the sound information S2300 to the target sound preparation unit 2301. The target sound preparation unit 2301 prepares a target sound frequency pattern S1702 based on the sound information S2300 and at the same time prepares the fundamental period S1706 of the target sound, and outputs the target sound frequency pattern S1702 and the fundamental period S1706 to the analysis unit 1704. The evaluation sound preparation unit 1703 inputs an evaluation sound S1700, and performs frequency analysis on the evaluation sound S1700 to output an evaluation sound frequency pattern S1701 for each frequency band to the analysis unit 1704. For each frequency band, the analysis unit 1704 sequentially calculates the differential values of the evaluation sound frequency pattern S1701 and the target sound frequency pattern S1702 at corresponding points in time, by temporally shifting the target sound frequency pattern S1702 with respect to the evaluation sound frequency pattern S1701. Based on the period of an iterative time interval of a differential value equal to or lower than the threshold value S1705 and the fundamental period S1706 of the target sound, the analysis unit 1704 outputs area information S1703 that is information regarding a time-frequency area in which the target sound exists in the evaluation sound S1700 to the sound extraction unit 1705. The sound extraction unit 1705 extracts a target sound using the area information S1703 and the evaluation sound frequency pattern S1701, and presents the target sound to the user.
Next, operations of the auditory assistance system 1800 configured as above will be described.
In this example, the threshold value S1705 is stored in the analysis unit 1704 prior to the shipment of the auditory assistance system 1800. For all frequency bands in this example, the threshold value S1705 is set to 0.5, which is a value that is slightly greater than zero.
First, the sound information setting unit 2300 uses a microphone to retrieve a voice of the speaker A that is sound information S2300, and outputs the voice of the speaker A to the target sound preparation unit 2301 (step 2400).
Next, the target sound preparation unit 2301 prepares a target sound frequency pattern S1702 by clipping a portion of the voice of the speaker A that is sound information S2300 and performing frequency analysis of the clipped portion (step 2401). In this example, the target sound frequency pattern is created by discrete Fourier transform in the same manner as in the second embodiment. At the same time, the fundamental period of the voice of the speaker A is determined and set as the fundamental period S1706. In this example, since the voice of the speaker A is the only target sound and no other sounds having the same fundamental period as the voice of the speaker A are included, the fundamental period of the voice of the speaker A is determined using the method according to the first conventional technique.
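The determination of a fundamental period by autocorrelation, as in the first conventional technique, may be sketched as follows. This is a simplified illustration and not the procedure of the cited reference: the function name estimate_period, the lag search range, and the synthetic power sequence with one value per millisecond are assumptions.

```python
import math


def estimate_period(values, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the largest autocorrelation."""
    mean = sum(values) / len(values)
    centered = [v - mean for v in values]

    def autocorrelation(lag):
        return sum(centered[i] * centered[i + lag]
                   for i in range(len(centered) - lag))

    return max(range(min_lag, max_lag + 1), key=autocorrelation)


# Hypothetical power sequence of one frequency band along the temporal axis,
# sampled every 1 ms and repeating every 6 ms.
power = [1.0 + math.cos(2 * math.pi * n / 6) for n in range(60)]
period_ms = estimate_period(power, min_lag=3, max_lag=12)
```

Restricting the search to lags of 3 to 12 points mirrors the range of 3-12 ms given for the voice of the speaker A. Such an estimate is only meaningful when, as stated above, no other sound having the same fundamental period is mixed in.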
Activation of the auditory assistance system 1800 causes the evaluation sound preparation unit 1703 to start using a microphone to retrieve the evaluation sound S1700, which is the peripheral sound of the user and which, in this example, is the mixed sound of the voices of the three speakers. Frequency analysis is then performed on the evaluation sound S1700 to create an evaluation sound frequency pattern S1701 for each frequency band (step 1801).
Analysis is performed on whether or not the fundamental period of the voice of the speaker A, for which the target sound frequency pattern S1702 has been prepared by the target sound preparation unit 2301, is included in the evaluation sound frequency pattern S1701, which includes the mixed sound of the voices of the three speakers, and area information S1703 is created (step 1802).
Next, the sound extraction unit 1705 extracts a target sound using the area information S1703 and the evaluation sound frequency pattern S1701, and presents the target sound to the user (step 1803).
Since the steps 1801, 1802 and 1803 are the same as in the second embodiment, descriptions thereof will be omitted.
Finally, the operations of the above-described steps 1801 to 1803 are repeated until the auditory assistance system 1800 is brought to a stop (step 1804).
As described above, since the target sound preparation unit 2301 uses a target sound inputted by the sound information setting unit 2300 as the target sound to be prepared, the target sound preparation unit 2301 is no longer required to prepare in advance a plurality of sounds to be used as target sound candidates, and a reduction of storage capacity may be achieved.
<Another Example>
Another example of the sound information setting unit 2300 and the target sound preparation unit 2301 will now be described.
In this example, prior to shipment of the auditory assistance system 1800, a frequency pattern of the voice of the speaker A, a frequency pattern of the voice of the speaker B and a frequency pattern of the voice of the speaker C have been stored as target sound frequency pattern candidates in the target sound preparation unit 2301. In addition, a fundamental period corresponding to each target sound (target sound frequency pattern) candidate is stored in the target sound preparation unit 2301. Furthermore, the threshold value S1705 is stored for each frequency band in the analysis unit 1704.
First, the sound information setting unit 2300 presents the target sound candidates to the user. In this case, the voice of the speaker A is selected, and a selection signal to the effect of “voices of speaker A” is created (step 2400).
Next, the target sound preparation unit 2301 sets a target sound frequency pattern corresponding to the selection signal that is the sound information S2300 as the target sound frequency pattern S1702 (step 2401). In this example, the frequency pattern of the voice of the speaker A is the target sound frequency pattern S1702. In addition, the fundamental period of the target sound corresponding to the selection signal is set as the fundamental period S1706. In this case, the fundamental period S1706 is 3-12 ms, which is the fundamental period of the voice of the speaker A.
Activation of the auditory assistance system 1800 causes the evaluation sound preparation unit 1703 to start using a microphone to retrieve the evaluation sound S1700, which is the peripheral sound of the user and which, in this example, is the mixed sound of the voices of the three speakers. Frequency analysis is then performed on the evaluation sound S1700 to create an evaluation sound frequency pattern S1701 for each frequency band (step 1801).
Analysis is performed on whether or not the fundamental period of the voice of the speaker A, for which the target sound frequency pattern S1702 has been prepared by the target sound preparation unit 2301, is included in the evaluation sound frequency pattern S1701, which includes the mixed sound of the voices of the three speakers, and area information S1703 is created (step 1802).
Next, the sound extraction unit 1705 extracts a target sound using the area information S1703 and the evaluation sound frequency pattern S1701, and presents the target sound to the user (step 1803).
Since the steps 1801, 1802 and 1803 are the same as in the second embodiment, descriptions thereof will be omitted.
Finally, the operations of the above-described steps 1801 to 1803 are repeated until the auditory assistance system 1800 is brought to a stop (step 1804).
As described above, since a target sound frequency pattern may now be prepared using target sound frequency pattern candidates stored in the target sound preparation unit 2301, there is no need to input a target sound or to perform frequency analysis thereon to create a target sound frequency pattern. As a result, the presence or absence of a target sound may be analyzed even when a target sound cannot be inputted. For instance, when analyzing the fundamental period of the voice of the speaker A in ambient noise, the voice of the speaker A as heard in a quiet environment cannot be picked up in the ambient noise. Even so, the presence or absence of the voice of the speaker A may be analyzed by using a target sound frequency pattern that is stored in the target sound preparation unit 2301 and that was created by performing frequency analysis on the voice of the speaker A in a quiet environment. In addition, since the time required for inputting a target sound or performing frequency analysis on the inputted sound may be omitted, real time processing may be achieved.
Incidentally, in the same manner as in the second variation of the first embodiment, a threshold value setting unit may be added in order to control the threshold value to be used by the analysis unit 1704. As a result, an appropriate threshold value with respect to a plurality of target sounds may be set and fundamental periods may be analyzed with respect to a plurality of target sounds. In addition, analytical errors on fundamental periods may be reduced by appropriately controlling the threshold values. Furthermore, while a threshold value has been set for each target sound in the second variation of the first embodiment, a threshold value may now be set for each frequency band. As a result, analytical errors may be further reduced.
<Yet Another Example>
Preferably, the target sound preparation unit 2301 prepares a target sound frequency pattern that includes at least one of an amplitude spectrum and a phase spectrum calculated from a cross correlation between the target sound and an aperiodic analysis waveform pattern which includes a predetermined frequency component, and the evaluation sound preparation unit 1703 prepares an evaluation sound frequency pattern that includes at least one of an amplitude spectrum and a phase spectrum calculated from a cross correlation between the evaluation sound and the analysis waveform pattern which includes a predetermined frequency component.
As a result, since a fundamental period of the target sound is analyzed using a target sound frequency pattern and an evaluation sound frequency pattern created using an aperiodic analysis waveform pattern, periodic characteristics of the target sound and the evaluation sound appear. Thus, a fundamental period of the target sound may be analyzed. For instance, since the fundamental period of the target sound appears even in a target sound frequency pattern of a frequency band that is higher than the frequency corresponding to the fundamental period of the target sound, the fundamental period may be analyzed even when noise is superimposed on the frequency band that corresponds to the fundamental period of the target sound. In addition, since the fundamental period of the target sound will appear in target sound frequency patterns across all frequency bands, fundamental periods may be analyzed on a per-frequency band basis. As a result, it is now possible to judge whether or not an evaluation sound contains the target sound.
<Still Yet Another Example>
Preferably, the target sound preparation unit 2301 prepares a target sound frequency pattern that includes at least one of an amplitude spectrum and a phase spectrum calculated from respective cross correlations between the target sound and a plurality of local analysis waveform patterns that form portions of an analysis waveform pattern which includes a predetermined frequency component and which has a predetermined temporal resolution. The evaluation sound preparation unit 1703 prepares an evaluation sound frequency pattern that includes at least one of an amplitude spectrum and a phase spectrum calculated from respective cross correlations between the evaluation sound and the plurality of local analysis waveform patterns. The analysis unit 1704 uses the target sound frequency pattern prepared using the plurality of local analysis waveform patterns as a single group of data, likewise uses the evaluation sound frequency pattern prepared using the plurality of local analysis waveform patterns as a single group of data, analyzes the fundamental period of the target sound, and judges the existence of the target sound.
a) shows an analysis waveform pattern which includes a cosine waveform pattern corresponding to three periods. When a frequency pattern is created by convolving the analysis waveform pattern with an evaluation sound or a target sound, a single value is determined using the cosine waveform pattern corresponding to three periods, and the temporal resolution therefore equals the length of the cosine waveform pattern corresponding to three periods.
On the other hand, as shown in
A description will now be given on the handling of frequency information contained in the frequency pattern determined using the cosine waveform pattern corresponding to three periods which is made possible by using frequency patterns prepared using a plurality of local analysis waveform patterns as a single group of data.
In this example, frequency patterns are created using discrete cosine transform.
If a frequency pattern of an analysis waveform pattern which includes a cosine waveform pattern corresponding to three periods may be expressed as
then frequency patterns of the local analysis waveform patterns may be expressed as
and N represents a number of samples of the window length of the discrete cosine transform. An evaluation sound or a target sound is represented as
Xn. [Formula 38]
Here, the relationship between the frequency pattern of the analysis waveform pattern and the frequency patterns of the local analysis waveform patterns may be expressed as
X_f = X_f^1 + X_f^2 + X_f^3 + X_f^4 + X_f^5 + X_f^6. [Formula 39]
Since the frequency pattern of the analysis waveform pattern may be created by using frequency patterns prepared using six local analysis waveform patterns as a single group of data, frequency patterns of local analysis waveform patterns may be handled in the same way as the frequency pattern of the analysis waveform pattern by using the frequency patterns of local analysis waveform patterns as a single group of data.
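The relationship of Formula 39 may be checked numerically with the following sketch. The sketch assumes a plain cosine of three periods as the analysis waveform pattern, divided into six local analysis waveform patterns of half a period each. The window length of 60 samples, the test sound, and the function name coefficient are hypothetical.

```python
import math


def coefficient(sound, kernel, start, stop):
    """Correlate the sound with the kernel over samples start..stop-1."""
    return sum(sound[n] * kernel[n] for n in range(start, stop))


N = 60  # assumed number of samples of the window length
# Assumed analysis waveform pattern: a cosine corresponding to three periods.
kernel = [math.cos(2 * math.pi * 3 * n / N) for n in range(N)]
# Arbitrary test sound X_n.
sound = [math.sin(0.37 * n) + 0.5 * math.cos(1.1 * n) for n in range(N)]

# Frequency pattern of the whole analysis waveform pattern.
full = coefficient(sound, kernel, 0, N)

# Frequency patterns of the six local analysis waveform patterns.
segment_len = N // 6
local = [coefficient(sound, kernel, i * segment_len, (i + 1) * segment_len)
         for i in range(6)]

# The sum of the six local values reproduces the whole-pattern value.
assert abs(full - sum(local)) < 1e-9
```

Because each local value is a partial sum of the same products, the equality holds for any sound, while the six separate values additionally indicate where within the window each contribution arises.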
As described above, it is now clear that the frequency patterns of the six local analysis waveform patterns, when handled as a single group of data, contain, in addition to the frequency information held by the frequency pattern of the analysis waveform pattern, information regarding changes in temporal frequency structure.
Similar to
On the other hand, as shown in
In this example, since the frequency pattern of the analysis waveform pattern may also be expressed as a sum of three frequency patterns, frequency patterns prepared using three local analysis waveform patterns may be handled in the same way as the frequency pattern determined from the cosine waveform pattern corresponding to three periods by using the frequency patterns prepared using the three local analysis waveform patterns as a single group of data.
a) shows a frequency pattern at 2 kHz of a mixed sound of the voices of three speakers analyzed using the local analysis waveform patterns shown in
then frequency patterns of the local analysis waveform patterns of the target sound may be expressed by
where W is the same as in the second embodiment, N represents the number of samples of the window length of the discrete cosine transform, and Ck represents Formula 37. In addition, if the frequency pattern of the analysis waveform pattern of the evaluation sound is expressed as
then frequency patterns of the local analysis waveform patterns of the evaluation sound may be expressed by
where W is the same as in the second embodiment, N represents the number of samples of the window length of the discrete cosine transform, and Ck represents Formula 37.
In this example, for a frequency band f, a differential value when the target sound frequency pattern is temporally shifted with respect to the evaluation sound frequency pattern is expressed by a Euclidean distance. The differential value at the frequency pattern of the analysis waveform pattern may be expressed as
Then, the differential value at the frequency patterns of the local analysis waveform patterns may be expressed as
Considering now the distance between the frequency pattern XH and the frequency pattern XT using
Thus, since a target sound frequency pattern prepared using a plurality of local analysis waveform patterns and an evaluation sound frequency pattern prepared using a plurality of local analysis waveform patterns are each used as a single group of data in order to analyze a fundamental period, changes in temporal frequency structure within the frequency information obtained at the frequency resolution of the analysis waveform patterns may be accommodated, and a fundamental period may be analyzed with an apparent increase in frequency resolution.
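The effect of handling the local values as a single group of data may be illustrated for a single point in time by the following sketch. The two sets of six local values are hypothetical and are chosen so that their sums coincide. The function names are assumptions, and math.dist requires Python 3.8 or later.

```python
import math


def distance_whole(eval_local, target_local):
    """Distance when only the whole-pattern value (the sum) is compared."""
    return abs(sum(eval_local) - sum(target_local))


def distance_grouped(eval_local, target_local):
    """Euclidean distance when the local values are compared as one group."""
    return math.dist(eval_local, target_local)


# Hypothetical local values: the same energy appears at different positions.
evaluation_local = [1.0, 0.0, 0.0, 0.0, 0.0, 0.0]
target_local = [0.0, 0.0, 0.0, 0.0, 0.0, 1.0]

whole = distance_whole(evaluation_local, target_local)
grouped = distance_grouped(evaluation_local, target_local)
```

In this case the whole-pattern distance is zero although the two sounds differ in temporal structure, whereas the grouped distance is greater than zero and therefore distinguishes them.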
A vehicle detection system 3002 includes a fundamental period analysis unit 3003 and the alarm sound output unit 105. The fundamental period analysis unit 3003 includes the target sound preparation unit 1702, the evaluation sound preparation unit 1703, a frequency setting unit 3000 and an analysis unit 3001.
In this example, the frequency setting unit 3000 uses “band information AS3001A” shown in
The target sound preparation unit 1702 stores a target sound frequency pattern S1702 for each frequency band obtained through frequency analysis of the target sound, and a fundamental period S1706 of the target sound. The analysis unit 3001 stores a threshold value S1705. The target sound preparation unit 1702 outputs the target sound frequency pattern S1702 and the fundamental period S1706 to the analysis unit 3001. The evaluation sound preparation unit 1703 inputs an evaluation sound S100, and performs frequency analysis on the evaluation sound S100 to output an evaluation sound frequency pattern S1701 for each frequency band to the analysis unit 3001. The frequency setting unit 3000 inputs band information AS3001A to create band information S3000, and outputs the same to the analysis unit 3001. For a frequency band based on the band information S3000, the analysis unit 3001 sequentially calculates the differential values of the evaluation sound frequency pattern S1701 and the target sound frequency pattern S1702 at corresponding points in time, by temporally shifting the target sound frequency pattern S1702 with respect to the evaluation sound frequency pattern S1701. The analysis unit 3001 judges whether or not the target sound exists in the evaluation sound S100 based on the period of an iterative time interval of a differential value equal to or lower than the threshold value S1705 and the fundamental period S1706 of the target sound. When the target sound exists, the analysis unit 3001 outputs a detection signal S102 to the alarm sound output unit 105. The alarm sound output unit 105 presents the alarm sound S103 to the user when the detection signal S102 is inputted.
Next, operations of the vehicle detection system 3002 configured as above will be described.
In this example, prior to the shipment of the vehicle detection system 3002, a frequency pattern for each frequency band obtained by performing frequency analysis on the motorcycle sound is stored as the target sound frequency pattern S1702 in the target sound preparation unit 1702 (step 1800), and the fundamental period S1706 of the motorcycle sound that is the target sound is also stored. Furthermore, the threshold value S1705 is stored for each frequency band in the analysis unit 3001.
Activation of the vehicle detection system 3002 causes the evaluation sound preparation unit 1703 to start retrieving peripheral sounds of the user, which is an evaluation sound S100, using a microphone. Frequency analysis is then performed on the evaluation sound S100 to create an evaluation sound frequency pattern S1701 for each frequency band (step 1801).
Next, the user uses the frequency setting unit 3000 to input a frequency band on which fundamental period analysis is to be performed. In this example, the frequency bands of 200 Hz and 500 Hz, at which the power of the motorcycle sound that is the target sound is high, are inputted. Thus, “200 Hz, 500 Hz” that is the band information S3000 is inputted to the analysis unit 3001 (step 3100). When the noise included in the evaluation sound S100 is superimposed on the 200 Hz band, only 500 Hz may be set as the frequency band on which fundamental period analysis is to be performed.
Next, analysis is performed on whether or not the fundamental period of the motorcycle sound that is the target sound stored in the target sound preparation unit 1702 is included in the evaluation sound S100 (step 3101). In this example, since the band information S3000 is “200 Hz and 500 Hz”, the fundamental period of the target sound is analyzed in the same manner as in the second embodiment for a frequency pattern at 200 Hz and a frequency pattern at 500 Hz. Next, from the analysis results for 200 Hz and 500 Hz, when the target sound is judged to exist in even one of the frequency bands, a detection signal S102 to the effect that “the target sound exists” is outputted to the alarm sound output unit 105. Meanwhile, when it is judged that the target sound does not exist in either of the frequency bands, the detection signal S102 is not outputted to the alarm sound output unit 105.
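The combination of the per-band judgments into the detection signal may be sketched as follows. The function name detection_signal and the dictionary of per-band results are assumptions introduced for illustration. The judgment for each band itself is taken as given.

```python
def detection_signal(band_results, selected_bands_hz):
    """Return True when the target sound is judged to exist in any selected band.

    band_results: mapping from band (Hz) to the judgment for that band.
    selected_bands_hz: the bands named in the band information.
    """
    return any(band_results.get(band, False) for band in selected_bands_hz)


# Hypothetical judgments for the two bands of this example.
results = {200: False, 500: True}
alarm = detection_signal(results, [200, 500])          # one band suffices
alarm_200_only = detection_signal(results, [200])      # no band with the target
```

Restricting the selected bands to 500 Hz alone, as suggested above for the case where noise lies on the 200 Hz band, leaves the result for this example unchanged.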
Next, when the detection signal S102 is inputted, the alarm sound output unit 105 presents the alarm sound S103 to the user (step 203).
Since the steps 1800, 1801 and 203 are the same as in the first and second embodiments, descriptions thereof will be omitted.
Finally, the operations of the above-described steps 1801, 3100, 3101 and 203 are repeated until the vehicle detection system 3002 is brought to a stop (step 3102).
As described above, frequency bands of target sound frequency patterns and evaluation sound frequency patterns used by the analysis unit 3001 may be controlled using the frequency setting unit 3000. As a result, it is now possible to change a frequency band to be analyzed or the bandwidth of a frequency band to be analyzed. For instance, when analyzing an evaluation sound in which the target sound and noise are mixed, the fundamental period of the evaluation sound may be analyzed by selecting a frequency band that is free of noise, and in turn, the existence of the target sound may be judged.
<Another Example>
Another example of the frequency setting unit will now be described.
In this example, the frequency setting unit 3000 uses “band information BS3001B” and “band information CS3001C” shown in
The target sound preparation unit 1702 stores a target sound frequency pattern S1702 for each frequency band obtained through frequency analysis of the target sound, and a fundamental period S1706 of the target sound. The analysis unit 3001 stores a threshold value S1705. The target sound preparation unit 1702 outputs the target sound frequency pattern S1702 and the fundamental period S1706 to the analysis unit 3001. The evaluation sound preparation unit 1703 receives an evaluation sound S100, and performs frequency analysis on the evaluation sound S100 to output an evaluation sound frequency pattern S1701 for each frequency band to the analysis unit 3001. The frequency setting unit 3000 receives the band information CS3001C obtained from the evaluation sound S100 and the band information BS3001B obtained from the target sound preparation unit 1702, creates band information S3000 from them, and outputs the band information S3000 to the analysis unit 3001. For a frequency band based on the band information S3000, the analysis unit 3001 sequentially calculates the differential values of the evaluation sound frequency pattern S1701 and the target sound frequency pattern S1702 at corresponding points in time, by temporally shifting the target sound frequency pattern S1702 with respect to the evaluation sound frequency pattern S1701. The analysis unit 3001 judges whether or not the target sound exists in the evaluation sound S100 based on the period of an iterative time interval of a differential value equal to or lower than the threshold value S1705 and the fundamental period S1706 of the target sound. When the target sound exists, the analysis unit 3001 outputs a detection signal S102 to the alarm sound output unit 105. The alarm sound output unit 105 presents the alarm sound S103 to the user when the detection signal S102 is inputted.
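The processing of the analysis unit 3001 for a single frequency band may be sketched in Python as follows. This is only an illustration under stated assumptions: the frequency patterns are represented as plain sequences of real values, the differential value is taken as a mean absolute difference, and the period of the iterative time interval is accepted when every interval between matching shifts lies within a tolerance of the fundamental period. The function names and the `tolerance` parameter are hypothetical and do not appear in the embodiments.

```python
def shifted_differences(evaluation, target):
    """Differential value (assumed: mean absolute difference) between the
    target pattern and the evaluation pattern at every temporal shift."""
    n = len(target)
    return [
        sum(abs(evaluation[shift + i] - target[i]) for i in range(n)) / n
        for shift in range(len(evaluation) - n + 1)
    ]


def matching_shifts(diffs, threshold):
    """Shifts whose differential value is at or below the threshold.
    Consecutive qualifying shifts are merged into their best (lowest) one."""
    matches, run = [], []
    for shift, diff in enumerate(diffs):
        if diff <= threshold:
            run.append(shift)
        elif run:
            matches.append(min(run, key=lambda s: diffs[s]))
            run = []
    if run:
        matches.append(min(run, key=lambda s: diffs[s]))
    return matches


def target_exists(evaluation, target, threshold, fundamental_period,
                  tolerance=1):
    """Judge the target sound as present when matches recur at intervals
    equal (within the assumed tolerance) to its fundamental period."""
    diffs = shifted_differences(evaluation, target)
    matches = matching_shifts(diffs, threshold)
    intervals = [b - a for a, b in zip(matches, matches[1:])]
    return bool(intervals) and all(
        abs(interval - fundamental_period) <= tolerance
        for interval in intervals
    )
```

In this sketch, a pattern that matches the target sound but recurs at an interval other than the fundamental period is rejected, as is an evaluation sound in which no shift yields a differential value at or below the threshold.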
Next, operations of the vehicle detection system 3002 configured as above will be described.
In this example, prior to the shipment of the vehicle detection system 3002, a frequency pattern for each frequency band obtained by performing frequency analysis on the motorcycle sound is stored as the target sound frequency pattern S1702 in the target sound preparation unit 1702 (step 1800), and the fundamental period S1706 of the motorcycle sound that is the target sound is also stored. Furthermore, the threshold value S1705 is stored for each frequency band in the analysis unit 3001.
Activation of the vehicle detection system 3002 causes the evaluation sound preparation unit 1703 to start capturing, using a microphone, the sounds surrounding the user as the evaluation sound S100. Frequency analysis is then performed on the evaluation sound S100 to create an evaluation sound frequency pattern S1701 for each frequency band (step 1801).
Next, the frequency setting unit 3000 selects, from the target sound, the frequency bands in which the power of the target sound is high, as the band information BS3001B. In this case, 200 Hz and 500 Hz are selected. In addition, the frequency setting unit 3000 selects, from the evaluation sound S100, the frequency bands in which the power of the noise included in the evaluation sound S100 is high, as the band information CS3001C. In this case, 200 Hz is selected. Then, a frequency band in which the power of the target sound is high and which does not contain noise is set as the band information S3000. In this example, the band information S3000 is “500 Hz”.
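The band selection described above may be sketched in Python as follows. The sketch assumes that per-band power values are available for the target sound and for the noise in the evaluation sound, and that "high" power is decided by simple thresholds. The function name, the threshold parameters, and the power figures in the usage note are hypothetical values chosen only to reproduce the result of this example.

```python
def select_bands(target_power, noise_power,
                 target_power_min, noise_power_min):
    """Create band information from two per-band power tables.

    target_power     : {band in Hz: power of the target sound}
    noise_power      : {band in Hz: power of noise in the evaluation sound}
    target_power_min : assumed threshold above which target power is "high"
    noise_power_min  : assumed threshold above which noise power is "high"
    """
    # Band information B: bands in which the target sound is strong.
    band_b = {band for band, power in target_power.items()
              if power >= target_power_min}
    # Band information C: bands in which the noise is strong.
    band_c = {band for band, power in noise_power.items()
              if power >= noise_power_min}
    # Band information S3000: strong target bands that are free of noise.
    return sorted(band_b - band_c)
```

For instance, with illustrative target powers `{100: 0.1, 200: 0.9, 500: 0.8, 1000: 0.2}`, noise powers `{100: 0.1, 200: 0.7, 500: 0.05, 1000: 0.1}`, and both thresholds set to 0.5, band information B is 200 Hz and 500 Hz, band information C is 200 Hz, and the function returns `[500]`.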
Next, analysis is performed on whether or not the fundamental period of the motorcycle sound that is the target sound stored in the target sound preparation unit 1702 is included in the evaluation sound S100 (step 3101). In this example, since the band information S3000 is “500 Hz”, the fundamental period of the target sound is analyzed in the same manner as in the second embodiment for a frequency pattern at 500 Hz. When the target sound is judged to exist from the analysis result for 500 Hz, a detection signal S102 to the effect that “the target sound exists” is outputted to the alarm sound output unit 105.
When the detection signal S102 is inputted, the alarm sound output unit 105 presents the alarm sound S103 to the user (step 203).
Since the steps 1800, 1801 and 203 are the same as in the first and second embodiments, descriptions thereof will be omitted.
As described above, since the frequency setting unit 3000 is capable of automatically determining a frequency band that is appropriate for a target sound, there is no need to prepare a frequency band in advance, and greater usability is achieved.
The target sound analysis apparatus according to the present invention is deployable to a wide range of products incorporating the functions of mixed sound separation, sound discrimination and voice synthesis, such as vehicle detection systems, hearing aids, mobile phones and television conference systems.
Number | Date | Country | Kind |
---|---|---|---|
2006-005178 | Jan 2006 | JP | national |
This is a continuation application of PCT application No. PCT/JP2006/325548 filed Dec. 21, 2006, designating the United States of America.
 | Number | Date | Country |
---|---|---|---|
Parent | PCT/JP2006/325548 | Dec 2006 | US |
Child | 11902731 | | US |