The present invention relates to an in-vehicle voice processing device enabling, for example, communication between an occupant in one vehicle and an occupant in an intended vehicle.
In recent years, there are vehicles equipped with an acoustic system using surround speakers (a speaker array) that realizes a realistic sound field. PTL 1 presents a technology where one vehicle (vehicle A) transmits its position and the voice of an utterer in the one vehicle, and an intended vehicle (vehicle B) calculates a positional relationship between vehicle B and vehicle A from the received position of vehicle A and the position of vehicle B and outputs the received voice from surround speakers so that its voice can be heard from the direction of vehicle A.
In the technology in PTL 1, the voice is transmitted to not only the intended vehicle an utterer in one vehicle wants to speak to but also many and unspecified vehicles around the one vehicle. Then, the volume of the voice is adjusted according to the distance from the one vehicle (the farther away a vehicle is from the one vehicle, the lower the volume of the voice).
However, of other vehicles around one vehicle, the intended vehicle an utterer in the one vehicle wants to speak to is not always a vehicle nearest to the one vehicle. Therefore, an occupant in the intended vehicle may be less likely to find him/herself spoken to, and there is concern that there may arise a situation in which it is difficult to perform smooth communication with the intended vehicle.
The present invention has been made in view of the above, and an object of the invention is to provide a voice processing device enabling smooth communication between an occupant in one vehicle and an occupant in an intended vehicle.
An in-vehicle voice processing device according to the present invention for solving the problem includes: a vehicle-position acquiring unit that acquires a position of a vehicle; a voice acquiring unit that acquires a voice of an utterer in the vehicle; an utterance-direction detecting unit that detects a direction of utterance of the utterer; and a transmitting unit that transmits the position of the vehicle, the voice, and the direction of utterance to many and unspecified other vehicles around the vehicle.
In addition, an in-vehicle voice processing device according to another aspect of the present invention includes: a vehicle-position acquiring unit that acquires a position of vehicle; a receiving unit that receives a position of another vehicle, a voice of an utterer in the other vehicle, and a direction of utterance of the utterer in the other vehicle that are transmitted from the other vehicle; and a voice output unit that calculates volume of the voice to be output on the basis of the position of the vehicle, the position of the other vehicle, and the direction of utterance of the utterer in the other vehicle, and processes the voice so that a virtual source of the voice is formed in a direction of the position of the other vehicle in a sound field formed by a speaker array composed of a plurality of speakers, and then outputs the voice at the volume from the speaker array.
According to the present invention, smooth communication between vehicles is possible. Incidentally, the problems, configurations, and advantageous effects other than those described above are revealed in the following description of embodiments.
The best mode for carrying out the present invention is described below with examples while referring to drawings.
A communication system in the present invention is for performing wireless communication between at least two or more vehicles; in the present example, each vehicle is equipped with a wireless communication device 10. The wireless communication device 10 includes a transmitting unit 11 and a receiving unit 12, and enables one vehicle equipped with the wireless communication device 10 to communicate information including voice data with another vehicle equipped with the same wireless communication device. The transmitting unit 11 broadcasts information of the one vehicle so that many and unspecified other vehicles around the one vehicle can receive the information. The receiving unit 12 receives information of another vehicle transmitted from the other vehicle.
An in-vehicle voice processing device 20 is connected to the wireless communication device 10. Then, a plurality of microphones 31 composing a microphone array, a GPS device 32, and a gyro sensor 33 are connected to the input side of the in-vehicle voice processing device 20; a plurality of speakers 41 composing a speaker array are connected to the output side of the in-vehicle voice processing device 20.
The in-vehicle voice processing device 20 includes a vehicle-position acquiring unit 21 that acquires the position of the one vehicle, a voice acquiring unit 22 that acquires the voice of an utterer in the one vehicle, an utterance-direction detecting unit 23 that detects the direction of utterance of the utterer in the one vehicle, and a reproduced-voice output unit 24 that reproduces and outputs the utterer's voice in the other vehicle on the basis of information received from the other vehicle. The transmitting unit 11 of the wireless communication device 10 transmits information on the position of the one vehicle, the voice, and the direction of utterance.
The vehicle-position acquiring unit 21 acquires the position and orientation of the one vehicle on the basis of information from the GPS device 32 and information from the gyro sensor 33. The position of the one vehicle is represented by the latitude and longitude; the orientation of the one vehicle is represented by the azimuth direction (such as north, south, east, and west) based on the position of the one vehicle. The azimuth direction can also be represented by how many degrees, for example, from the north. As a method for detecting the orientation of the one vehicle, a geomagnetic sensor can be used instead of the gyro sensor 33. The voice acquiring unit 22 acquires the voice of an utterer in the one vehicle that has been input from the microphones 31. The voice acquiring unit 22 converts the voice from analog data to digital data. The utterance-direction detecting unit 23 detects the direction of utterance that is the direction the utterer is facing and speaking on the basis of the voice input from the microphones 31. The direction of utterance is represented by, for example, the azimuth direction based on a signal from the gyro sensor 33.
The reproduced-voice output unit 24 performs a process of calculating the volume at which reproduced voice is to be output in the one vehicle on the basis of the position of the one vehicle, the position of the other vehicle, and the direction of utterance of an utterer in the other vehicle, processing the voice so that the virtual source of the voice is formed in a direction of the position of the other vehicle in a sound field formed by the speaker array composed of the plurality of speakers, and outputting the voice at the calculated volume from the speaker array. Incidentally, as a method of processing the voice so that the virtual source of the voice is formed in a direction of the position of the other vehicle, the publicly-known technology presented in PTL 1 can be used.
The volume of reproduced voice output by the reproduced-voice output unit 24 is set so as to be highest when an utterer in the other vehicle is facing and speaking in the direction of the one vehicle, and is set so as to get lower as the direction of utterance of the utterer in the other vehicle gets farther away from the one vehicle.
The reproduced-voice output unit 24 changes the volume of reproduced voice according to the degree of coincidence between the direction of utterance in the other vehicle and the relative direction from the other vehicle to the one vehicle. The volume V1 of reproduced voice is calculated by the following equation (1).
In the above equation (1), V0 denotes the volume of the voice uttered by an utterer in the other vehicle (the volume of utterance); in the present example, it shows that the volume V1 of reproduced voice in the one vehicle is proportional to the volume V0 of utterance.
A term of direction calculation in the above equation (1) is a term that indicates the degree of coincidence between the direction of utterance that is the direction in which the utterer in the other vehicle is facing (vector d) and the relative direction from the other vehicle that is an utterance transmitting vehicle to the one vehicle that is an utterance receiving vehicle (vector P1). In the present example, the term of direction calculation adopts a value obtained by dividing the inner product of the above two vectors by the magnitude of the two vectors; if the directions agree completely, this term is 1; if the directions differ by 90 degrees, this term is 0. Incidentally, if this value is negative, the term is set to 0. Therefore, the higher the degree of coincidence between the directions, the higher the volume V1 of reproduced voice in the one vehicle. In the present example, there is described the case where the volume is gradually lowered as the degree of coincidence gets lower; alternatively, a predetermined angular range of less than 90 degrees is set, and the volume of reproduced voice can be held constant when the angle is within the predetermined angular range and be set to 0 if the angle deviates from the predetermined angular range.
A term of sound attenuation in distance in the above equation (1) is a term for calculating the attenuation of volume according to the distance P1 from the other vehicle that is an utterance transmitting vehicle to the one vehicle that is an utterance receiving vehicle. In the present example, the value of this term is inversely proportional to the square of the distance from the other vehicle to the one vehicle; the farther the distance, the lower the volume V1 of reproduced voice in the one vehicle.
In the present example, four microphones 31 and four speakers 41 are placed so as to surround seats of the vehicle 201. The four microphones 31 acquire the voice so that which direction an utterer in the one vehicle is facing and speaking can be recognized. The four speakers 41 form a sound field in the interior of the one vehicle, and output reproduced voice so that the virtual source of the utterer's voice acquired in the other vehicle is formed in a direction of the position of the other vehicle, i.e., so that the utterer's voice in the other vehicle is heard from the direction of the other vehicle.
The transmitting unit 11 of the in-vehicle voice processing device 20 transmits information of the one vehicle as packet data. The packet data has the packet format shown in
At step S401, the microphones 31 detect the voice of an utterer in the one vehicle, and a process of converting the detected voice into a format that the transmitting unit 11 can transmit is performed. Then, at step S402, a process of detecting the direction of utterance that is the direction in which the occupant is facing and speaking is performed. In the present example, the direction of utterance is detected on the basis of the voice detected by the microphones 31. At step S403, a process of transmitting information on the position and direction of utterance and the voice data through the transmitting unit 11 is performed. In the transmitting process, broadcasting to many and unspecified other vehicles existing within a predetermined range around the one vehicle is performed.
At step S501, a radio receiving process of receiving information of another vehicle broadcasted from the other vehicle is performed. Accordingly, the position of the other vehicle, the direction of utterance of an utterer in the other vehicle, and voice data of the utterer in the other vehicle are acquired. At step S502, a direction/distance calculating process of calculating the relative direction of utterance in the other vehicle to the one vehicle and the relative distance is performed. Then, at step S503, a process of calculating the volume of reproduced voice of the utterer in the other vehicle to be output from the speakers 41 on the basis of the relative direction of utterance of the utterer in the other vehicle to the one vehicle and the relative distance to the other vehicle that have been calculated at step S502 is performed. At step S504, a reproducing process of processing the voice so that the source of the utterer's voice in the other vehicle is formed a direction of the position of the other vehicle and outputting the voice from the speakers 41 at the volume calculated at step S503 is performed.
In an example shown in
In the conventional technology in PTL 1, the volume of the voice output in the leading receiving vehicle Mr2 nearer to the transmitting vehicle Mc is higher than that in the following receiving vehicle Mr1 farther away from the transmitting vehicle Mc. However, in this situation, the utterer in the transmitting vehicle Mc wants to speak to not an occupant in the leading receiving vehicle Mr2 but the occupant in the following receiving vehicle Mr1; therefore, there may be interference with smooth communication.
On the other hand, according to the communication system in the present example, respective volumes of reproduced voice in the receiving vehicles Mr1 and Mr2 are adjusted according to information on the direction of utterance of the utterer 601 in the transmitting vehicle Mc. Therefore, the volume of reproduced voice in the following receiving vehicle Mr1 located in the direction (d) of utterance of the utterer in the transmitting vehicle Mc is higher than that in the leading receiving vehicle Mr2. Therefore, an occupant 602 in the following receiving vehicle Mr1 can recognize that the utterer 601 in the transmitting vehicle Mc is speaking to the occupant 602 and becomes able to respond to the occupant 601 in the transmitting vehicle Mc, which makes it possible to have a you-are-there conversation between vehicles. Therefore, smooth communication can be performed as if it were communication between persons who are walking.
Then, an occupant 603 in the leading receiving vehicle Mr2 can hear the voice of the utterer 601 in the transmitting vehicle Mc from the direction of the transmitting vehicle Mc; however, its volume is lower than that in the following receiving vehicle Mr1, so the occupant 603 can recognize that the utterer 603 in the transmitting vehicle Mc is speaking to the occupant 602 in the following receiving vehicle Mr1.
Subsequently, Example 2 of the present invention is described. Incidentally, the same component as Example 1 is assigned the same reference numeral, and its detailed description is omitted.
The characteristic of Example 2 is that it is configured to detect the direction of utterance of an utterer on the basis of the utterer's face image taken by a camera. As shown in
The in-vehicle voice processing device 20 acquires the voice of an utterer and generates voice data, and also detects the utterer's gaze on the basis of the image taken by the camera 34, and detects the direction of utterance on the basis of the gaze. Then, the in-vehicle voice processing device 20 performs a process of generating vehicle information including the voice data, information on the direction of utterance, and information on the position of the vehicle, and transmitting the generated vehicle information from the transmitting unit 11.
As shown in
Information on the direction of utterance is broadcasted together with respective pieces of information on the position of the vehicle, the direction of utterance, and voice data as packet data. The subsequent processes are the same as Example 1.
According to the present example, the direction of utterance of an utterer can be detected certainly, and the vehicle the utterer wants to speak to can be identified accurately. Therefore, a you-are-there conversation between vehicles can be made, and smoother communication than ever before is possible.
Subsequently, Example 3 of the present invention is described. Incidentally, the same component as Example 1 or 2 is assigned the same reference numeral, and its detailed description is omitted.
The characteristic of the present example is that it is configured to enable communication between a vehicle equipped with a wireless communication device 10 including the transmitting unit 11 only and a vehicle equipped with a wireless communication device 10 including the receiving unit 12 only.
In the above-described Examples 1 and 2, there is described an example where each vehicle is equipped with both the transmitting unit and the receiving unit, and it is possible to have a conversation between vehicles; however, the present invention can be also applied to between a vehicle including the transmitting unit only and a vehicle including the receiving unit only. For example, emergency vehicles such as ambulances are equipped with the transmitting unit only, and general vehicles are equipped with the receiving unit only, so an emergency vehicle can transmit the voice of an utterer in the emergency vehicle telling general vehicles on the route of the emergency vehicle to pull over to the side of a road as the emergency vehicle is about to pass. In response, occupants in general vehicles located in the direction of utterance can recognize that they are being spoken to from the volume of reproduced voice, and can pull over to the side of a road promptly.
A wireless communication device 10 shown in
A wireless communication device 10 shown in
The embodiment of the present invention is described in detail above; however, the present invention is not limited to the above-described embodiment, and various design changes can be made without departing from the spirit of the invention described in claims. For example, the above embodiment is described in detail to explain the present invention clearly, and is not always limited to include all the described configurations. Furthermore, part of the configuration of one embodiment can be replaced with that of another embodiment, or the configuration of the other embodiment can be added to the configuration of the one embodiment. Moreover, part of the configuration of each embodiment can be subjected to addition/deletion/replacement with that of another embodiment.
Number | Date | Country | Kind |
---|---|---|---|
2014-225032 | Nov 2014 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2015/076828 | 9/24/2015 | WO | 00 |