This application is a National Stage Entry of PCT/JP2014/050653 filed on Jan. 16, 2014, which claims priority from Japanese Patent Application 2013-025001 filed on Feb. 12, 2013, the contents of all of which are incorporated herein by reference, in their entirety.
The present invention relates to a technique of acquiring a signal from a sound mixture including noise and a desired signal.
In the above technical field, patent literature 1 discloses a technique of providing a sound insulating member between two microphones and acquiring a piece of speech in a sound space where a piece of speech and noise coexist.
Patent literature 1: International Publication No. 2012/096072
In the technique described in the above literature, however, an L-shaped or conical sound insulating member is provided aiming at increasing the difference between pieces of speech input to the two microphones. Hence, it is sometimes impossible to acquire a piece of speech of much higher level as compared to noise depending on the direction of the piece of speech or noise.
The present invention enables to provide a technique of solving the above-described problem.
One aspect of the present invention provides a speech processing apparatus comprising:
a first microphone that is provided on one of a ceiling member in a vehicle and an accessory thereof, inputs a sound mixture including a voice of a passenger of the vehicle and noise in the vehicle, and outputs a first signal;
a second microphone that is provided on one of the ceiling member in the vehicle and the accessory thereof at a position farther than the first microphone when viewed from the passenger of the vehicle, inputs the noise in the vehicle while insulating the voice of the passenger of the vehicle using one of the ceiling member of the vehicle and the accessory thereof, and outputs a second signal; and
a noise suppressor that outputs an enhanced speech signal based on the first signal and the second signal.
Another aspect of the present invention provides a speech processing method comprising:
inputting a sound mixture including a voice of a passenger of a vehicle and noise in the vehicle and outputting a first signal using a first microphone provided on one of a ceiling member in the vehicle and an accessory thereof;
inputting the noise in the vehicle while insulating the voice of the passenger of the vehicle using one of the ceiling member of the vehicle and the accessory thereof and outputting a second signal using a second microphone provided on one of the ceiling member in the vehicle and the accessory thereof at a position farther than the first microphone when viewed from the passenger of the vehicle; and
outputting an enhanced speech signal based on the first signal and the second signal.
Still other aspect of the present invention provides a speech processing program for causing a computer to execute a method comprising:
inputting a sound mixture including a voice of a passenger of a vehicle and noise in the vehicle and outputting a first signal using a first microphone provided on one of a ceiling member in the vehicle and an accessory thereof;
inputting the noise in the vehicle while insulating the voice of the passenger of the vehicle using one of the ceiling member of the vehicle and the accessory thereof and outputting a second signal using a second microphone provided on one of the ceiling member in the vehicle and the accessory thereof at a position farther than the first microphone when viewed from the passenger of the vehicle; and
outputting an enhanced speech signal based on the first signal and the second signal.
Still other aspect of the present invention provides a method of attaching a speech processing method to a vehicle, the method comprising:
attaching a first microphone that inputs a sound mixture including a voice of a passenger of a vehicle and noise in the vehicle and outputs a first signal to on one of a ceiling member in the vehicle and an accessory thereof;
attaching a second microphone that inputs the noise in the vehicle while insulating the voice of the passenger of the vehicle using one of the ceiling member of the vehicle and the accessory thereof and outputs a second signal to one of the ceiling member in the vehicle and the accessory thereof at a position farther than the first microphone when viewed from the passenger of the vehicle; and
connecting the first microphone and the second microphone to a noise suppressor that outputs an enhanced speech signal, based on the first signal and the second signal.
Still other aspect of the present invention provides a ceiling member comprising the speech processing apparatus.
Still other aspect of the present invention provides a vehicle comprising the speech processing apparatus.
According to the present invention, it is possible to input the voice of the passenger of a vehicle and output a high-quality enhanced speech signal independently of the direction of a piece of speech or noise.
Preferred embodiments of the present invention will now be described in detail with reference to the drawings. It should be noted that the relative arrangement of the components, the numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present invention unless it is specifically stated otherwise. Note that “speech signal” in the following explanation indicates a direct electrical change that occurs in accordance with the influence of speech or another sound. The speech signal transmits speech or another sound.
A speech processing apparatus 100 according to the first embodiment of the present invention will be described with reference to
As shown in
The first microphone 101 is provided on the ceiling member in a vehicle 150 or an accessory thereof, inputs a sound mixture including a voice 170 of a passenger 160 of the vehicle 150 and noise 180 in the vehicle, and outputs a first signal 104.
The second microphone 102 is provided on the ceiling member in the vehicle 150 or an accessory thereof at a position farther than the first microphone 101 when viewed from the passenger 160 of the vehicle 150, inputs the noise 180 in the vehicle while insulating the voice 170 of the passenger 160 of the vehicle 150 using the ceiling member of the vehicle 150 or the accessory thereof, and outputs a second signal 105.
The noise suppressor 103 outputs an enhanced speech signal based on the first signal 104 and the second signal 105.
According to the above-described arrangement, the voice of the passenger of the vehicle is insulated using the ceiling member of the vehicle or an accessory thereof. It is therefore possible to input the voice of the passenger of the vehicle and output a high-quality enhanced speech signal while ensuring high productivity.
A speech processing apparatus according to the second embodiment of the present invention will be described next with reference to
<<Overall Arrangement>>
Referring to
The microphone 201 is provided on the ceiling member in a vehicle 250 or an accessory thereof, catches a voice 270 of a passenger 260 of the vehicle 250, outputs a signal X1, and provides it to the noise suppressor 203. The microphone 202 is provided on the ceiling member in the vehicle 250 or an accessory thereof at a position farther than the microphone 201 when viewed from the passenger 260 of the vehicle 250. The microphone 202 catches noise 280 in the vehicle, outputs a signal X2, and provides it to the noise suppressor 203. The noise 280 in the vehicle includes not only noise from the engine, motor, air conditioner, audio system, blinker, and windshield wipers generated in the vehicle but also road noise, sound of rain, sound of wind, and the like generated outside the car.
Both the signal X1 and the signal X2 are mixture signals including a speech signal and a noise signal. The signal X1 includes the speech signal in a relatively large amount. On the other hand, the noise 280 caught by the microphone 201 and that caught by the microphone 202 preferably have no large difference. In other words, the signal X1 includes the speech signal and the noise signal at a ratio different from that in the signal X2, and the ratio of the speech signal is higher in the signal X1 than in the signal X2.
The noise suppressor 203 outputs an enhanced speech signal 207 based on the signal X1 and the signal X2. The speech recognizer 208 recognizes the utterance contents of the passenger 260 based on the enhanced speech signal 207. The car navigation device 209 is operated by the piece of recognized speech. The voice of the passenger 260 is used not only to operate the car navigation device 209 but also for another purpose, for example, to operate the audio system or air conditioner in the car or to do a speech communication via a mobile phone.
<<Arrangement of Noise Suppressor>>
The noise suppressor 203 also includes an adaptive filter (XF) 304 serving as an estimated speech signal generator that generates the estimated speech signal Y2 from the enhanced speech signal E1 (207) that is the output signal of the subtracter 301. The adaptive filter 304 generates the estimated speech signal Y2 from the enhanced speech signal E1 using a parameter that changes based on the enhanced noise signal E2. A detailed example of the adaptive filter 304 is described in detail in International Publication No. 2005/024787.
Even if the voice of the passenger 260 is input to the microphone 202, and the speech signal is included in the signal X2, the adaptive filter 304 can prevent the subtracter 301 from erroneously removing the speech signal from the signal X1. With this arrangement, the subtracter 301 subtracts the estimated noise signal Y1 from the signal X1 transmitted from the microphone 201 and outputs the enhanced speech signal E1.
The noise suppressor 203 can be any one of an analog circuit, a digital circuit, and a mixture thereof. When the noise suppressor 203 is an analog circuit, the enhanced speech signal E1 is converted into a digital signal by an A/D converter and used for digital control. On the other hand, when the noise suppressor 203 is a digital circuit, a signal from the microphone is converted into a digital signal by an A/D converter before input to the noise suppressor 203. If both an analog circuit and a digital circuit are included, for example, the subtracter 301 or 303 can be formed from an analog circuit, and the adaptive filter 302 or 304 can be formed from an analog circuit controlled by a digital circuit.
The noise suppressor 203 shown in
<<Arrangement of Microphones>>
A windshield 402 is normally fixed to a body ceiling member 403 of the vehicle 250 by an adhesive or the like. The internal ceiling member 401 is separately attached to the body ceiling member 403. For this reason, a gap exists between the windshield 402 and the internal ceiling member 401. The microphone 202 is attached to the gap. An end of the internal ceiling member 401 thus insulates input of the voice 270 of the passenger 260 to the microphone 202.
Similarly, it is possible to use the microphone 201b as the first microphone and the microphone 202a as the second microphone for the driver's seat and the microphone 201c as the first microphone and the microphone 202a as the second microphone for the assistant driver's seat. Alternatively, each of the microphones 201b and 201c may be used as the first microphone, the microphone 202a may be shared as the second microphone, and a signal selector that automatically selects one of the microphones 201b and 201c with a stronger signal may be provided. In this case, the number of constituent elements can be decreased by sharing the microphone 202a. Note that the expressions of “driver's seat side” and “assistant driver's seat side” used here assume a car with a right-hand steering wheel but are not limited to these depending on the model.
In this embodiment, since the microphone configured to catch noise in the car is arranged in the gap between the windshield and the internal ceiling member, as described above, a high-quality enhanced speech signal can be obtained very easily without adding any new component to the conventional internal structure. It is possible to catch uniform noise from all directions by placing the microphone on the ceiling member.
A speech processing apparatus 300 according to the third embodiment of the present invention will be described next with reference to
Referring to
For example, upon determining that the air conditioner 654 is operating, the noise suppression module 603 actively suppresses wind noise from the input signals of the microphones 201 and 202. At this time, the suppression level may be controlled by determining that the input signal from the microphone 202 includes a larger amount of wind noise as compared to the microphone 201.
For example, upon determining that the windshield wiper 653 is operating, the noise suppression module 603 actively suppresses the operation noise of the windshield wiper and the noise of rain from the input signals of the microphones 201 and 202. At this time, the suppression level may be controlled by determining that the input signal from the microphone 202 includes a larger amount of the operation noise of the windshield wiper and the noise of rain as compared to the microphone 201.
Note that the electronic control unit 651 physically includes, for example, a CPU (Central Processing Unit), a memory, and an input/output interface. The memory includes, for example, a ROM (Read Only Memory) and an HDD (Hard Disk Drive) which store programs and data to be processed by the CPU and a RAM (Random Access Memory) mainly used as various work areas for control processing. These elements are connected to each other via a bus. The CPU executes a program (for example, noise suppression module) stored in the ROM and processes a signal received via the input/output interface, a signal input from a microphone, data expanded on the RAM, and the like, thereby implementing the function as the speech processing apparatus 300.
As described above, according to this embodiment, the noise suppression method and level are changed in accordance with the operation of the vehicle, thereby obtaining an enhanced speech signal of higher quality.
A speech processing apparatus according to the fourth embodiment of the present invention will be described with reference to
As shown in
A speech processing apparatus according to the fifth embodiment of the present invention will be described with reference to
The microphone 902 is arranged ahead of the overhead console 990. Since the overhead console 990 insulates the voice of the passenger 260, a stronger speech signal is input to the microphone 901 as compared to the microphone 902. Hence, according to the microphone arrangement of this embodiment, a high-quality enhanced speech signal can be obtained.
As for the microphone arrangement, a plurality of combinations are possible, as in
A speech processing apparatus according to the sixth embodiment of the present invention will be described with reference to
Since the projecting portion 1042 insulates the voice of the passenger 260, a stronger speech signal is input to the microphone 1001 as compared to the microphone 1002. Hence, according to the microphone arrangement of this embodiment, a high-quality enhanced speech signal can be obtained.
While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all such modifications and equivalent structures and functions.
The present invention is applicable to a system including a plurality of devices or a single apparatus. The present invention is also applicable even when an information processing program for implementing the functions of the embodiments is supplied to the system or apparatus directly or from a remote site. Hence, the present invention also incorporates the program installed in a computer to implement the functions of the present invention on the computer, a medium storing the program, and a WWW (World Wide Web) server that causes a user to download the program. Especially, the present invention incorporates at least a non-transitory computer readable medium.
This application claims the benefit of Japanese Patent Application No. 2013-025001 filed on Feb. 12, 2013, which is hereby incorporated by reference herein in its entirety.
Number | Date | Country | Kind |
---|---|---|---|
2013-025001 | Feb 2013 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2014/050653 | 1/16/2014 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2014/125860 | 8/21/2014 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20040059571 | Ohtomo | Mar 2004 | A1 |
20040138882 | Miyazawa | Jul 2004 | A1 |
20060031067 | Kaminuma | Feb 2006 | A1 |
20060174581 | Katcherian | Aug 2006 | A1 |
20060178169 | Dunn, Jr. et al. | Aug 2006 | A1 |
20110211705 | Hutt | Sep 2011 | A1 |
20120284023 | Vitte | Nov 2012 | A1 |
Number | Date | Country |
---|---|---|
2003-111185 | Apr 2003 | JP |
2004-120717 | Apr 2004 | JP |
2006-050303 | Feb 2006 | JP |
2006-222969 | Aug 2006 | JP |
2012096072 | Jul 2012 | WO |
2012165657 | Dec 2012 | WO |
Entry |
---|
International Search Report for PCT Application No. PCT/JP2014/050653 dated Apr. 1, 2014. |
Number | Date | Country | |
---|---|---|---|
20160049161 A1 | Feb 2016 | US |