The present invention relates to a localization system for measuring the position of an object which generates sound waves. In particular, the present invention relates to a localization system for measuring the position of an object when an sound wave reflecting object is present near the object, a robot and a localization method utilizing this localization system, and its sound source localization program.
Knowing positions (one-dimensional position, two-dimensional position, and three-dimensional position) of objects and human beings is an important technique in human interfaces and robots. For instance, regarding a robot which performs voice interaction with a human being, it is expected that the voice recognition performance will be improved by directing the microphone orientation to a user with whom the robot will have a dialogue, and it is also expected that causing the face of the robot to face the user when they have a dialogue will provide effects in smoothly proceeding a dialogue between the person and the robot. Further, in the case where a human being and a robot share an object present in the space shared by the human being and the robot, the robot needs to know the accurate position of the object. For example, when the user instructs the robot to bring an object, the robot cannot operate the actuator to grab the object unless the robot detects the position of the instructed object.
Conventionally, methods for knowing three-dimensional positions of objects and human beings include a method in which images captured by a camera is processed so that objects and people are detected, and a method in which tags emitting or reflecting electromagnetic waves and infrared rays, such as RFID tags and infrared tags, are attached to objects and people and the positions of the tags are detected by a sensor. In such localization methods, ultrasonic tags are used to realize a localization method, in which tags emit ultrasonic waves which are received by a microphone array so that the positions of the tags are calculated.
The ultrasonic tags are characterized in that the position can be detected with about several cm accuracy, which is more accurate compared to other measures. Various techniques relating to three-dimensional localization devices using such ultrasonic tags have been disclosed. For example, an ultrasonic-type three-dimensional localization device has been known (e.g., Patent Document 1) in which an ultrasonic tag is called up by using a radio wave to be caused to emit an ultrasonic wave, and the ultrasonic wave is received by an ultrasonic microphone array, and then, the position of the ultrasonic tag is identified by the principle of triangulation, based on the time the sound wave takes to travel from the emission source of the ultrasonic tag to respective microphones of the ultrasonic microphone array, and the relative positional relationship between the respective microphones.
Patent Document 1: Japanese Patent Laid-Open Publication No. 2005-156250
However, in the conventional example, if an attempt is made to use a localization system utilizing an ultrasonic tag in a place where various objects are disposed such as in a room, localization may not be performed accurately because an ultrasonic wave is reflected on walls, objects, and the like.
That is, in the case of performing three-dimensional localization in a system utilizing an ultrasonic tag, as three-dimensional position is estimated based on an assumption that an ultrasonic wave emitted from the ultrasonic wave transmission unit of the ultrasonic tag directly reaches the respective microphones of the microphone array for detecting the ultrasonic wave without being reflected on other objects, if reflection is caused on other objects, localization cannot be performed accurately.
In other words, in an environment where a microphone array and an ultrasonic tag are disposed, how the ultrasonic wave is reflected to reach the respective microphone cannot be determined only from information obtained from the respective microphones. As such, the conventional example described above involves a disadvantage that accurate localization cannot be performed.
As shown in
That is, as the ultrasonic wave cannot directly reach the microphone M1 because there is the object S on the path linearly linking the microphone M1 and the ultrasonic tag T, it is possible that a distance of a reflex path along which the ultrasonic wave emitted from the ultrasonic tag T travels, by being reflected at the wall K to reach the microphones M1, is determined as a linear distance between the microphone M1 and the ultrasonic tag T. This phenomenon is frequently caused in the case where a microphone array is mounted on a robot located near the floor, for example.
The present invention has been developed in view of the above-described circumstances, and an object of the present invention is to provide a localization system capable of measuring the position of a sound source about a microphone on the receiving side even in an environment where another object is present between the sound source such as an ultrasonic tag and a microphone or around thereof so that reflection of an ultrasonic wave is caused, and to provide a robot using the localization system, a localization method, and its sound source localization program.
In order to achieve the object, a localization system according to the present invention is a system for identifying the position of a sound source using propagation times of a sound wave propagating from the sound source to a plurality of microphones, including an object detection unit which detects the position, the shape and the like of a surrounding object present around the plurality of microphones, and a sound source position estimation unit which estimates the position of the sound source based on the propagation times.
The sound source position estimation unit has a reflection wave path estimation function for estimating a reflection wave path and its distance from the surrounding object identified by information from the object detection unit, and a sound source position estimation function for estimating the position of the sound source based on the reflection wave path and the distance.
With this configuration, as the positions and the shapes of reflective objects and interrupting objects around the microphones are recognized by the function of the object detection unit, it is possible to estimate the shortest paths of propagation paths of the sound wave (ultrasonic wave), and at the same time, the position of the sound source viewed from the respective microphone can be calculated based on the estimated shortest propagation paths of the sound and the corresponding propagation times of the sound wave. As such, by determining an area in which candidate areas where the ultrasonic tag may be present calculated from the respective microphones are overlapped as the position of the ultrasonic tag, the three-dimensional position of the sound source such as an ultrasonic tag can be calculated with higher accuracy, for example, than the case of not considering reflection caused by surrounding objects. Thereby, it is possible to measure the position of the above-described sound source (e.g., ultrasonic tag) accurately.
Further, a sound source localization program according to the present invention is a program for calculating propagation times of an ultrasonic wave propagating from the sound source to a plurality of microphones, and calculating the position of the sound source based on the respective propagation times of different ultrasonic waves detected by the plurality of microphones. The program is configured as to cause a computer to perform: a transmitting operation control function for controlling a radio wave transmission operation of a radio wave transmission unit which transmits a radio wave including an ultrasonic wave transmission command to an ultrasonic tag; a propagation time calculation function for calculating propagation times of an ultrasonic wave, transmitted from the ultrasonic tag and received by the plurality of microphones, from the time that the radio wave is transmitted until the time that the ultrasonic wave reaches the respective microphones; and a position estimation computing function for, if the position of a reflective object present around the microphone array has been detected beforehand, estimating a reflective propagation path of the ultrasonic wave from the detection result of the reflective object, and calculating the position of the ultrasonic tag based on the estimated reflective propagation path of the ultrasonic wave and the propagation time for each of the microphones calculated by the propagation time calculation function.
As the present invention is configured and works as described above, the object detection unit effectively works to recognize the arranging state of surrounding objects present around a plurality of microphones, and while considering reflection of an ultrasonic wave generated depending on the arranging state of the objects, candidate areas where the ultrasonic tag may be present are calculated from the positions of the microphones in the microphone array, and an area where the candidate areas where the ultrasonic tag may be present calculated from the respective microphones are overlapped is determined as the position of the ultrasonic tag. As such, compared from the case where reflection by the surrounding objects is not considered, the three-dimensional position of the sound source such as an ultrasonic tag can be calculated accurately for example, and even in a room where reflection of an ultrasonic wave by objects is frequently caused, measurement of the position of the sound source, that is, the positional relationship between the ultrasonic tag and the microphones can be measured accurately, for example. As such, it is possible to provide an excellent localization system which has not been able to be achieved conventionally, and a robot and a localization method using the localization system, and its sound source localization program.
Hereinafter, exemplary embodiments of the invention will be described in accordance with accompanying drawings.
A first exemplary embodiment will be described based on
First, the overview of the exemplary embodiment and the principle contents of the exemplary embodiment will be described, and then a specific exemplary embodiment of the invention will be described.
First, a localization system of the exemplary embodiment acquires the positions and the shapes of objects present around a microphone array and an ultrasonic tag by an object detection unit. Thereby, the system is capable of calculating the shortest paths from the ultrasonic wave transmission unit provided to the ultrasonic tag to the respective microphones configuring the microphone array, while considering reflection of the sound wave on the objects. With this configuration, with the elapsed time from the time that the sound wave is emitted from the ultrasonic wave transmission unit of the ultrasonic tag until it reaches the respective microphones being observed, it is possible to accurately calculate candidate areas for the ultrasonic tag which is present at a position where the sound wave can reach within the observed elapsed time.
More specifically, a candidate area where an ultrasonic tag may be present will be on a spherical face centered on a microphone having a radius of the length calculated by multiplying the elapsed time by the acoustic velocity, if there is no obstacle. However, if there is any obstacle (that is, object), a candidate area where the ultrasonic tag may be present is generally a combination of a plurality of faces present inside the sphere. In this case, a candidate area where the ultrasonic tag may be present can be estimated by detecting the position and the shape of the object by the object detection unit.
In this way, the candidate areas where the ultrasonic tag may be present are estimated for the respective microphones configuring the microphone array by the position estimation unit, and since the ultrasonic tag is present in a shared part of the respective candidate areas, the three-dimensional position of the ultrasonic tag can be calculated.
Next, the principle of the localization system utilizing an ultrasonic tag which is also a sound source will be described with reference to the drawings.
Now, a place where an microphone array MA11 including microphones M11, M12, and M13, an ultrasonic tag T11, an object S11, and a wall K11 are arranged respectively on a plane is set as shown in
In the case where the microphones MN1 to M13, the ultrasonic tag 11 and the object S11 are arranged as shown in
In a typical localization method (localization) utilizing an ultrasonic tag, a three-dimensional position of the ultrasonic tag T11 has been calculated when the relative positional relationships of the respective microphones MN1 to M13 and the paths lengths from the ultrasonic tag T11 to the respective microphones MN1 to M13 are given. That is, in the localization utilizing the conventional ultrasonic tag T11 in which reflection at the wall K is not considered, in a state where the ultrasonic wave reflects at the wall K11 as shown in
In other words, although the direct linear path from the ultrasonic tag T11 to the microphone M11 is short actually, a path of the ultrasonic wave U11 from the ultrasonic tag T11 reflected on the wall K11 and made incident on the microphones M11 is considered to be the shortest path. As such, an area where a sphere with a radius having a length of the path of the ultrasonic wave U11 reflected on the wall K11 and made incident on the microphone MN1 and respective spheres with radiuses having lengths of the linear paths of the ultrasonic waves U12 and U13 made incident on the microphones M12 and M13 overlap with one another is determined as an area where the ultrasonic tag T11 is present. Accordingly, an area different from the area where the ultrasonic tag T11 is actually present is determined as an area where the ultrasonic tag T11 is present.
However, according to the localization of the ultrasonic tag T11 of the exemplary embodiment, an area which is behind the object S11 and where an ultrasonic wave from the ultrasonic tag T11 does not propagate to the respective microphones MN1 to M13 is determined, with use of information regarding the result of detecting the position and the shape of the object S11 by another sensing device, as shown in
For example, with an assumption that a time period from the time an ultrasonic wave is emitted until it is received by the microphone M11 is D11, a time period from the time an ultrasonic wave is emitted until it is received by the microphone M12 is D12, and a time period from the time an ultrasonic wave is emitted until it is received by the microphone M13 is D13, and the acoustic velocity is v, an area where the sound source (ultrasonic tag T11) from which an ultrasonic wave propagates with a path length (D11*v) to the microphone M11 may be present is within a scope of a circle A11 as shown in
Further, an area where the sound source (ultrasonic tag T11) from which an ultrasonic wave propagates with a path length (D12*v) to the microphone M12 may be present is within a scope of a circle A12, and an area where the sound source (ultrasonic tag T11) from which an ultrasonic wave propagates with a path length (D13*v) to the microphone M13 may be present is within a scope of a circle A13.
That is, in order that an ultrasonic wave propagates with a path length (D1*v) for the microphone M11, if reflection of the object S11 is considered, the sound source (ultrasonic tag T11) has to be within the area A11 indicated by a dotted line in
In other words, when the positions of the wall K11 and the object S11 being obstacles are detected by an LRF (laser range finder), considering the reflection at the object S11, it is found that the area where the ultrasonic wave propagation distance from the microphone M11 is D11*v is on the line of A11. It is also found that the area where the ultrasonic wave propagation distance from the microphone M12 is D12*v is on the line of A12, considering the reflection at the object S11. Further, it is also found that the area where the ultrasonic wave propagation distance from the microphone M13 is D13*v is on the line of A13, considering the reflection at the object S11.
As such, it is possible to determine that the area A14 where these three circles A11, A12, and A13 overlap with one another is an area where the ultrasonic tag T11 is present.
Further, in a localization system using the ultrasonic tag according to the exemplary embodiment, as the system includes a sensor moving mechanism and the object detection unit considering the movement, object detection can be performed in a wider area as described later. For example, a range finder generally used for detecting objects involves a problem that an observable range is up to an object positioned in front of measuring part (e.g., in the case of a laser range finder, laser light emitting unit and receiving unit) of the range finder so that an object positioned over such object is not observable (a problem so-called occlusion) As such, if object detection is performed by fixing the sensor, objects present in the environment cannot be observed sufficiently. In view of the above, the localization system of the present invention is adapted to allow observation of objects present in a wider range and estimation of reflecting state of sound waves with higher accuracy, by moving the sensor so as to observe objects in an area hidden behind the front object.
Further, in the localization system using the ultrasonic tag according to the exemplary embodiment, the accuracy of the object map can be improved by detecting an object present around the microphones and the ultrasonic tag by an object matching unit and utilizing the shape information of the object that the object matching unit stores beforehand.
For example, by arranging the shape of an object stored in the object matching unit on an object map, it is possible to estimate the shape of the object which cannot be observed by the sensor due to occlusion or the like. In general, there is a case where all of the objects present in the environment may not be observed by the object sensor due to problems such as occlusion, observable range, and accuracy. As such, by storing the shape of the objects beforehand, it is possible to efficiently acquire the shapes of the objects present in the environment without observing all of the shapes of the objects each time.
Hereinafter, the localization system using an ultrasonic tag according to the exemplary embodiment will be described specifically with reference to the drawings.
The localization system further includes an object detection unit 104 which detects objects around the microphone array, and a position estimation unit 105 which calculates the position of the ultrasonic tag 200 from an arrival time at each microphone calculated by the propagation time calculation unit 103 and an object detection result while considering reflection of the ultrasonic wave. As described later, in the position estimation unit 105, the sound source position estimation unit has a reflection wave path estimation function for estimating a reflection wave path from a surrounding object identified by information from the object detection unit and the distance thereof, and a sound source position estimation function for estimating the position of the sound source according to the reflection wave path and the distance. Further, the ultrasonic tag 200 includes an ultrasonic wave transmission unit 201 which receives a radio wave transmitted from the radio wave transmission unit 101 and if the radio wave is an instruction to transmit an ultrasonic wave, transmits an ultrasonic wave.
The ultrasonic wave reception array unit 102 is formed of a microphone array including three or more microphones. Further, the object detection unit 104 has a relative position detection function for detecting shapes, positions, and relative positions with respect to the microphone array of objects reflecting ultrasonic waves, based on the surrounding environment where the microphone array and the sound source included in the ultrasonic wave reception array unit are placed. That is, the object detection unit 104 detects the structure of a surface of an object reflecting a ultrasonic wave in the environment where the microphone array and the ultrasonic tag 200 are placed, and a relative position with the microphone array. The objects include walls, furniture and other objects.
The object detection unit 104 can use and realize a method for performing shape measurement with a range sensor using laser beam and the like, a method for moving a range sensor (e.g., one utilizing laser light) capable of measuring a distance on a two-dimensional plane to thereby estimate a three-dimensional shape from the position and measurement result, a method for shape restoration by stereo view using a plurality of cameras, a method for shape restoration by means of a factorization method utilizing movement of cameras, and a method for shape restoration using gradation on object surface such as “shape from shading” from an image to a camera, for example. Further, the structure of a surface of an object and relative position with the microphone array can be calculated by using various sensors.
Next, operation of the localization system as shown in
First, the radio wave transmission unit 101 transmits a signal to which the ultrasonic tag 200, or a call object, responds (radio wave transmission step). At this time, the transmission time is stored as T0 (step S11). Then, the ultrasonic wave transmission unit 201 provided to the ultrasonic tag 200 side analyzes the radio wave transmitted from the radio wave transmission unit 101, and transmits an ultrasonic wave if it is a signal to which the self ultrasonic tag 200 has to respond (step S12). The time period from when the ultrasonic wave transmission unit 2o1 receives the radio wave till when it transmits the ultrasonic wave needs to be almost constant, and the time interval from the reception of the radio wave to the transmission of the ultrasonic wave is preferably minute.
Note that the radio wave transmission unit 101 may be configured such that the procedures of radio wave transmission operation is programmed as a transmission operation controlling function which is to be executed by a computer.
Next, when respective microphones of the ultrasonic wave reception array unit 102 (assumed to be composed of n pieces of microphones M11 to M1n) receive an ultrasonic wave from the ultrasonic tag 200, received times TR1 to TRk are recorded for respective microphones (step S13: ultrasonic wave reception step).
Then, the object detection unit 104 detects the positional information and the information regarding shapes, sizes, and the like of objects present around the microphones M11 to M1n, and generates an object map (step S14: surrounding object detection step). That is, as it is necessary to have positional relationships between objects and the microphone array from when the ultrasonic wave is transmitted from the ultrasonic tag 200 till when the ultrasonic wave reaches to each microphone, and object shapes, if in the environment where the microphone arrays and objects are standstill for example, a process of generating this object map can be performed in advance. If it is not the case, this may be realized by procedures in which a process of generating an object map is performed at the same time as transmission and reception of the ultrasonic wave.
The surrounding object detection step at step S14 includes an object information storing step for detecting present positions, shapes, and sizes of surrounding objects present around the microphone array beforehand and storing then as surrounding map information, and an object map generation step for generating an object map for identifying relative positions with the surrounding objects viewed from the respective microphones configuring the microphone array based on the stored surrounding map information. By performing these steps, the object map is generated.
Note that the surrounding object detection step may be configured such that the performing contents are programmed as a surrounding object identifying function which is to be executed by a computer.
Next, the propagation time calculation unit 103 calculates for each microphone a time difference from the time that the radio wave transmission unit 101 transmits a radio wave until each of the microphone array receives an ultrasonic wave. For example, a time difference between transmission and reception of a microphone Mi is set to be “Di=TRi−T0” (step S15). The difference D1 calculated here is a time interval from the time that an ultrasonic wave is transmitted from the ultrasonic wave transmission unit 201 until the ultrasonic wave reaches each of the microphones configuring the ultrasonic wave reception array unit 102. As such, if the time lag g from the time that the ultrasonic tag 200 side receives a radio wave until the time that it transmits an ultrasonic wave is long, it is necessary to subtract the time lag g to be “Di=TRi−T0−g” (propagation time calculation step).
Now, the propagation time calculation step for calculating the time period from when the radio wave transmission unit 101 transmits a radio wave till when each microphone of the microphone array receives an ultrasonic wave may be formed as a program which is to be executed by a computer as a propagation time calculation function.
Next, the position estimation unit 105 calculates the position of the ultrasonic tag (sound source) 200 as shown below using the object map calculated by the object detection unit 104 and the ultrasonic wave arrival times (D1 to Dn) of respective microphones calculated by the propagation time calculation unit 103 (step S16: sound source position estimation step).
Now, a position calculating process of the ultrasonic tag 200 performed by the position estimation unit 105 will be described by way of an example with reference to a flowchart.
In
Next, where i=i+1, for a case where the microphone Mi is a microphone M2 and a case where the microphone Mi is a microphone M3, the sound source position candidate calculation unit 105A sequentially calculates all points where the shortest path distance between the microphone Mi and the ultrasonic tag 200 is Di*v, determines a set of the points as Zi (step S23), and regarding all of the microphones M1 to Mn, calculates all points where the shortest path distance between the microphone Mi and the ultrasonic tag 200 is Di*v, determines a set of the points as Zi (step S24).
Next, the sound source position candidate calculation unit 105A finds a common element in all sets {Zi} (i=1, - - - , n), and determines the element as the position of the ultrasonic tag 200. If there are a plurality of common elements, the position of the ultrasonic tag 200 is estimated from the elements. The methods for estimation include a method in which the center of gravity of the common elements is set as the position of the ultrasonic tag 200, and a method in which the common elements are clustered by their positions and the position of the center of gravity of a cluster having the maximum elements is set as the position of the ultrasonic tag 200.
Further, if there is no common element in all of the sets {Zi} (i=1, - - - , n), an element included in the largest number of sets may be determined as the position of the ultrasonic tag 200 (step S25). In that case, the position of gravity of an element included in more than a certain number of sets may be determined as the position of the ultrasonic tag 200.
Next, an exemplary embodiment of the part calculating the shortest path distance in the exemplary operation of the position estimation unit 105 will be described.
Next, it is determined whether or not there is an area Ri (step S33), and if there is no area Ri (No in step S33), a set {Yi} of points where “MiYi=Di*v” is calculated (step S48), and the processing goes to step S45. That is, in “{Xij}j=1, - - - , Li” processing of setting a union of a set in which there is no m where XijεSim(m≠j) and an object exists on MiXij and a set of {Yi} as Zi (step S45).
On the other hand, if an area Ri exists in step S33 (Yes in step S33), an area Ci where a line can be taken from the microphone Mi without interrupted by an object within the area Ri is obtained (step S34). Next, the area Ci is divided into small areas, and points are arranged in the respective areas. Then, the points are set to be “{Pij(1)}j=1, - - - , Li”. For each of the points {Pij(1)}, a half line Kij(1) which is parallel to a line segment which is line-symmetry to a line segment MiPij(1) relative to a perpendicular line of a tangent plane with the area Ci and starts from Pij is calculated (step S35).
Next, beginning from j=1 (step S36), for each of the points “{Pij(1)}j=1, - - - , Li”, it is determined whether the half line Kij(1) crosses an object surface with a distance from Pij(1) being within “Di*v−MiPij(1)” (step S37).
If the half line Kij(1) does not cross an object surface, a point Xij which is on the half line Kij and has a distance of “(Di*v)−MiPij (1)−Pij (1)Xij” from Pij (1). Note that the path MiPij(1) and Pij(1)Xij are set to be Sij (step S38).
Meanwhile, if the half line Kij (1) crosses an object surface in step S37 (Yes in step S37), beginning from k=1, a point closest to the Pij(1) among the points that the half line Kij(1) crosses an object surface is set to be Pij (2), and a half line Kij (2) which is line-symmetry to Kij(1) relative to the perpendicular line of a tangent plane of the object surface including the Pij (2) and starts from the Pij (2) is calculated (step S40).
Next, it is determined whether the half line Kij (2) crosses an object surface when a distance from Pij (2) is equal to or less than a value expressed by the following expression (1) (step S41), and if it crosses an object surface (Yes in step S41), it is set that k=k+1 (step S41a) and returns to step S40, and then, among the points that the half line Kij(k) crosses the object surface, a point closest to Pij(k) is set to be Pij(k+1), and a half line Kij(k+1) which is line symmetry to Kij(k) relative to a perpendicular line of a tangent plane of the object surface including Pij(k+1) and starts from Pij(k+1) is calculated (step S40).
On the other hand, if the half line Kij (2) does not cross the object surface when the distance from Pij (2) is equal to or less than a value expressed by the formula (1) in the determination of step S41 (No in step S41), a point Xij which is on the half line Kij(k) and the distance from Pij(k) is expressed by the following expression (2) is calculated. Note that the path MiPij(1), {Pij(n)Pij(n+1)}(n=1, - - - k−1), Pij(k)Xij are set to be Sij (step S42).
In this manner, addition is sequentially performed with j=j+1 (step S43), and when j reaches Li and processing of steps S37 to S43 has been completed for all “{Pij(1)}j=1, . . . , Li” (step S44), and a union of a set of {Yi} and a set in which there is no “m” satisfying “XijεSim(m≠j)” in “{Xij}j=1, . . . , Li” and an object is present on MiXij is determined as Zi (step S45).
Next, addition is sequentially performed with i=i+1 (step S46), and after processing for all of the microphones M1 to Mn has been performed (step S47), a common part in {Zi}(i=1, . . . , n) is searched, and the common part is determined as a present position of a ultrasonic tag. At this time, if there is no common part for all Zi, an element included in the largest number of Zi is determined as a present position of an ultrasonic tag (step S49).
When a microphone Mi and an object S11 are present on a plane as shown in
Then, as shown in
Similarly, as shown in
Based on the points “Xi1, Xi2, . . . , Xin” calculated in this manner, a union of a set of {Yi} and a set in which there is no “m” satisfying “XijεSim(m≠j)” and an object is present on MiXij is determined as Z1, and a common part in {Zi}(i=1, . . . , n) is searched, and the part is determined as a present position of an ultrasonic tag (sound source).
Now, the step of estimating the position of the sound source described above, that is, the step of estimating the reflection propagation path of the ultrasonic wave based on the positional information of the surrounding object having been detected and identified around the microphone array MA11 and the propagation time for each of the microphones calculated in the propagation time calculation step, and estimating the position of the ultrasonic tag (position of the sound source) may be configured such that the content of performance is programmed and executed by a computer.
As described above, in the first exemplary embodiment, the arrangement status of an object present around the microphone array is recognized using a distance measuring device such as an LRF (Laser Range Finder), and by considering reflection of an ultrasonic wave generated due to the arrangement states of the object, candidate areas where the ultrasonic tag may be present are calculated from the respective positions of the microphones in the microphone array, and an area where the candidate areas in which the ultrasonic tag (sound source) is present, calculated from the respective microphones, are overlapped with one another is set to be a present position of the ultrasonic tag. Thereby, the three-dimensional position of the ultrasonic tag can be calculated with higher accuracy than the case where reflection by the object is not considered.
As such, it is possible to provide a localization system capable of measuring the position using an ultrasonic tag, that is, the positional relationship between the ultrasonic tag and microphones, with high accuracy even in a room where reflection of ultrasonic waves by objects frequently occurs, and a robot and a localization method utilizing the localization system, and a program for calculating the sound source position thereof.
Next, a second exemplary embodiment will be described based on
The second exemplary embodiment shown in
As shown in
In other words, the object detection unit 104 includes a sensor for detection surrounding objects and the sensor moving mechanism 106 for moving the sensor, and further, a surrounding map creating function for creating a surrounding object map by detecting surrounding objects in a plurality of locations based on the operation of the sensor moving mechanism 106, and an object position identifying function for identifying positions of the objects using the created surrounding object map as described below.
Other configurations are the same as those of the first exemplary embodiment described above.
Next, operation of the system according to the second exemplary embodiment will be described based on
In
By moving the sensor, positional information and information about shape and size of objects present around the microphones M11 to M1n are detected, and an object map is created (step S52: surrounding object detection step).
In this case, the surrounding object detection step in the step S52 includes an object information storing step and an object map generation step. In the object information storing step, the positions, shapes and sizes of surrounding objects present in a wide area around the microphone array are detected beforehand and the detection result and the sensor movement amount are stored as surrounding map information, and in the object map creation step, an object map for identifying relative positions of the surrounding objects viewed from the respective microphones configuring the microphone array is generated based on the stored surrounding map information. With these steps being carried out, an object map is generated.
Note that the object information storing step and the object map generation step described above may be configured such that the performance contents are programmed so as to be executed by a computer.
Specific methods of creating a surrounding map for the surrounding objects include a method combining a self position identifying technique of the sensor moving mechanism 106 and an object detection technique and a method of creating a map utilizing SLAM (Simultaneous Localization And Mapping).
Then, whether to observe an ultrasonic tag is determined (step S53). Conditions on which the determination of this step is made include a method of carrying out observation of the ultrasonic tag at predetermined times, a method of determining whether to carry out observation of the ultrasonic tag according to the state of creating a map (for example, observing the ultrasonic tag when a map is created with a sufficient area), and a method of determining whether to carry out observation of the ultrasonic tag according to a request by the user.
Next, after creating the object map by repeating the above-described processing of steps S51 to S53, processing which is almost the same as that of steps S11 to S16 in the flowchart of
Next, when the respective microphones (including n pieces of microphones M11 to M1n) of the ultrasonic wave receiving array unit 102 receives the ultrasonic wave from the ultrasonic tag 200, reception times TR1 to TRk are recorded for the respective microphones (step S56). Further, the object detection unit 104 detects objects present around the ultrasonic wave receiving array unit 102 from the current position managed by the sensor moving mechanism 106 and the object map managed by the object detection unit 104, and identifies an object map (step S57). In other words, the object detection unit 104 has the surrounding map creation function described above, and in addition, an object position identifying function for identifying the positions of the surrounding objects by using the surrounding object map created by the surrounding map creation function.
Then, the propagation time calculation unit 103 calculates, for the respective microphones, a difference between times from when the radio wave transmission unit 101 transmits a radio wave till when each of the microphone array receives an ultrasonic wave. For example, a time difference between transmission and reception for a microphone Mi is assumed to be “Di=TRi−T0” (step S58). Further, the position estimation unit 105 calculates the position of the ultrasonic tag (sound source) 200 with use of the object map calculated by the object detection unit 104 and the ultrasonic wave arrival times (D1 to Dn) at the respective microphones calculated by the propagation time calculation unit 103 (step S59).
In this way, the same operation effects as those of the first exemplary embodiment can be achieved, and further, by considering the movement of the objects in this manner, it is possible to generate an object map of a wider range than the range that the object detection unit 104 can sense at once as described above, whereby the position of the ultrasonic tag 200 which is the sound source is calculated as described above.
Further, as the sensing range of various sensors (e.g., laser range finder and images) used for object detection does not reach objects located at distant areas from the sensor, it is impossible to sufficiently detect objects present in a space, where an ultrasonic tag is used, by sensing from only one location. According to the above-described localization system, however, as the shapes of objects in an area hidden behind another object can be detected, it is possible to calculate the paths of ultrasonic wave reaching from the ultrasonic tag to the microphone array with higher accuracy. As such, by detecting objects within a larger range, it is possible to accurately estimate the ultrasonic wave from the ultrasonic tag is reflected at various objects. Thereby, localization of the ultrasonic tag (sound source) can be performed with higher accuracy.
Next, a third exemplary embodiment will be described based on
The localization system according to the third exemplary embodiment disclosed in
Further, the object detection unit 104 has a function of creating an object map for identifying objects in an area which is outside the sensor measuring range by using the object shape information provided from the object matching unit 107.
Here, the object matching unit 107 includes an object identifying unit 107A for identifying objects and an object shape storing unit 107B for storing object shapes. The object identifying unit 107A identifies an object placed near the ultrasonic microphone array and the ultrasonic tag as a certain object, from among the objects whose shapes are stored in the object shape storing unit 107B. More specifically, the object shape storing unit 107B detects and identifies various tags such as RFID tags, ultrasonic tags, and image markers attached to the objects, or identifies objects by image matching with use of a camera.
Further, the object shape storing unit 107B stores an outer shape diagram of objects required for estimating reflection of the ultrasonic wave at object surfaces, and uses the diagram for reflecting the shape information on the object map in the measurement environment if the position and orientation of an object can be recognized. Further, it is also possible that the object matching unit 107 calculates the position and the orientation of an object and transmits them to the object detection unit 104.
Note that methods for detecting the position and the orientation of an object include a method of detecting the position and the orientation of an object with use of tags such as RFID tags, ultrasonic tags, and image markers, and a method of detecting the position and the orientation by processing images obtained from a camera. More specifically, in the case of a method of using RFID tags, there is a method in which the position and the orientation are detected with a plurality of RFID tags being attached to an object beforehand. Meanwhile, in the case of using images obtained from a camera, there is a method in which matching with object shapes registered beforehand is performed.
Next, operation of the localization system of the third exemplary embodiment shown in
Next, the ultrasonic wave transmission unit 201 present on the ultrasonic tag 200 side analyzes the radio wave transmitted from the radio wave transmission unit 101, and if it is a signal to which the self ultrasonic tag 200 has to respond, the ultrasonic wave transmission unit 201 transmits an ultrasonic wave (step S62). Further, when the respective microphones (consisting of n pieces of microphones M11 to M1n) of the ultrasonic wave receiving array unit 102 receive the ultrasonic wave from the ultrasonic tag 200, the reception times TR1 to TRk for the respective microphones are recorded (step S63).
Next, processing unique to the localization system in the third exemplary embodiment is performed. That is, the object matching unit 107 matches a surrounding object with one of the previously registered objects, and further, detects the position of the object, and transmits the previously registered shape information of respective objects and the position and the orientation of the object to the object detection unit 104 (step S64).
Then, the object detection unit 104 detects surrounding objects and generates an object map (step S65). Further, the object detection unit 104 arranges the object shape received from the object matching unit 107 on the object map to thereby generate the object map (step S66: object map generation step). Through these steps, it is possible to detect objects which may not be detected by the sensor of the object detection unit 104 due to occlusion or the like of the object itself.
Then, the same processing as that of steps S15 to S16 of a flowchart of the first exemplary embodiment shown in
Specifically, the propagation time calculation unit 103 calculates a time difference between the time that the radio wave transmission unit 101 transmits a radio wave and the time that each of the microphone array receives an ultrasonic wave, for each of the microphones. For example, the time difference between transmission and reception of the microphone Mi is assumed to be “Di=TRi−T0” (step S67). Next, the position estimation unit 105 calculates the position of the ultrasonic tag 200 (position of the sound source) with use of the object map calculated by the object detection unit 104 and the ultrasonic wave arrival times (D1 to Dn) of the respective microphones calculated by the propagation time calculation unit 103 (step S68).
As described above, according to the exemplary embodiment, the object matching unit identifies objects present in each environment, and with use of the shape information previously stored in the object matching unit 107, an object map is generated by estimating the object shape of areas which cannot be sensed directly.
With this configuration, the area required to be directly sensed by the object detection unit 104 can be reduced. Consequently, the time required for sensing can be reduced by reducing the area required to be directly sensed, and as the range where objects have to be sensed can be reduced, sensing of objects can be performed more effectively.
As described above, the respective exemplary embodiments are configured such that the object detection unit 104 effectively works to recognize the arrangement status of the surrounding objects S11 and K11 present around the microphones M11 to M13, and while considering reflection of the ultrasonic wave generated according to the arrangement states, calculates the candidate areas where the ultrasonic tag may be present based on the respective positions of the microphones M11 to M13 in the microphone array AM 11. Further, an area A14, in which the candidate areas calculated for the respective microphones where the ultrasonic tag T11 may be present overlap with one another, is set to be the present position of the ultrasonic tag (sound source). As such, it is possible to acquire the three-dimensional position of the sound source such as an ultrasonic tag with higher accuracy, for example, than the case of not considering reflection by the surrounding objects, whereby it is possible to provide a localization system having an excellent advantage that localization of the sound source such as the positional relationship between the ultrasonic tag T11 and the microphones M11 to M13 can be measured with high accuracy even in a room where reflection of ultrasonic waves by objects is frequently caused, and a robot and a localization method using the localization system, and a program for calculating the position of the sound source.
The exemplary embodiments of the present invention may be configured as described below.
By acquiring the shapes of the objects located around the microphone array and the ultrasonic tag by the object detection unit, it is possible to calculate the shortest paths from the ultrasonic wave transmission unit provided to the ultrasonic tag to the respective microphones by considering reflection of the sound wave to those objects. With this configuration, by observing the elapsed time from when the sound wave is transmitted from the ultrasonic wave transmission unit till when it reaches each of the microphones, candidate areas where the ultrasonic tag may be present when the sound wave reaches in the elapsed time can be calculated at real time.
More specifically, although a candidate area where the ultrasonic tag may be present becomes a sphere about a microphone with a radius of a length obtained by multiplying the elapsed time by the acoustic velocity if there is no obstacle. However, if there is an obstacle (that is, object), candidate areas would generally be a combination of a plurality of plane present inside the sphere. In that case, by detecting the object shape by the object detection unit, the candidate areas can be estimated. In this way, as candidate areas where the ultrasonic tag may be present are estimated for respective microphones configuring the microphone array are estimated by the position estimation unit described above, and as the ultrasonic tag is present in a shared part of the respective candidate areas, the three-dimensional position of the ultrasonic tag can be accurately calculated.
Further, the object detection unit may be configured to have a relative position detecting function for detecting the shapes and positions of objects reflecting an ultrasonic wave and the relative positions of the objects with the microphone array based on the microphone array included in the ultrasonic wave reception array unit and the surrounding environment where the sound source is provided.
With this configuration, as the shapes and positions of the reflection objects and the relative positions with the microphone array become clear by the relative position detecting function, the propagation path of an ultrasonic wave can be estimated with high accuracy, whereby accuracy in estimating the position of the sound source can be significantly improved based on the relationship with the positions of the respective microphones.
Further, the position estimation unit may be configured to include a shortest reflection path calculation unit (sound source position candidate calculation unit) for estimating reflection paths using information of the object for the respective microphones configuring the microphone array included in the ultrasonic wave reception array unit and acquiring areas where the shortest path lengths of the ultrasonic wave correspond to the times calculated by the propagation time calculation unit as candidate areas of the ultrasonic wave transmission unit, and a sound source position calculation unit for calculating the position of the sound source from the relationship between the candidate areas acquired for the respective microphones.
With this configuration, as the shortest reflection path calculation unit and the sound source position calculation unit works together, an advantage that identification of the position of the sound source can be performed faster is achieved.
Further, the object detection unit may be configured to measure and detect the position and the shape of an object, by using at least one of; a method of performing shape measurement by a range sensor using laser light; a method of estimating a three-dimensional shape from the positions of the surrounding objects and the observation result by functioning a range sensor capable of performing range measurement on a two-dimensional plane; a method of restoring the shape by stereo view using a plurality of cameras; a method of restoring the shape by the factorization method using the movement of a camera; and a method of restoring the shape from gradation of the object surface using images captured by a camera.
With this configuration, all of the numbers, shapes, positions and the sizes of the surrounding objects can be visually recognized, whereby estimation and confirmation of the shortest reflection paths can be performed easily. As such, an advantage that the positional calculation of the sound source can be performed more accurately is achieved.
Further, the object detection unit is characterized as to include a sensor for defecting surrounding objects and the sensor moving function for moving the sensor, and also the surrounding map creating function for creating a surrounding object map by detecting surrounding objects at a plurality of locations based on the operation of the sensor moving mechanism, and the object position identifying function for identifying the positions of the objects using the created surrounding object map.
This configuration provides an advantage that presence of surrounding objects can be recognized for a wider range, and particularly, even objects which are present behind a large object can be recognized accurately.
As the localization system has the sensor moving mechanism and the object detection unit considering the movement, detection of objections can be performed for a wider range. For example, although a range finder generally used for detecting objects can observe up to objects located before the measuring part (in the case of a laser range finder, laser light transmission unit and receiving unit) of the range finder, there is a problem of occlusion that the states of objects located over such object cannot be observed. Accordingly, if detection of objects is performed with the sensor being fixed, objects present in the environment cannot be sufficiently observed. As such, by observing objects in an area hidden by the front object while moving the sensor with the sensor moving mechanism as the localization system of the present invention, observation of objects can be performed for a wider range, wherein reflection propagation paths of a ultrasonic wave can be estimated with higher accuracy.
Further, it is also acceptable that the object detection unit also has an object matching unit which detects what the surrounding objects are and transmits shape information regarding stored surrounding objects upon request, and that the object detection unit has a measurement range outside map creating function for creating an object map for the surrounding objects in an area outside of the measurement range of the sensor provided to the object detection unit using the shape information from the object matching unit.
With this configuration, as presence of surrounding objects are identified beforehand on an object map, there is no need to directly sense the surrounding objects, providing an advantage that the shortest propagation path from the sound source to each of the microphones can be estimated faster, so that the position of the sound source can be identified more rapidly.
Further, the object matching unit may be configured as to include a function of detecting tags such as RFID tags, ultrasonic tags, and image markers attached to objects, acquiring information for identifying the surrounding objects to which those tags are attached based on information of the detected tags, and transmitting the information to the object detection unit.
Further, the object matching unit may be configured as to include a function of identifying surrounding objects by performing image matching, and transmitting image information regarding the surrounding objects to the object detection unit.
Further, the object matching unit may be configured as to include a function of detecting the tags such as RFID tags, ultrasonic tags and image markers attached to the objects, identifying the positions and orientations of the objects based on information obtained from the tags, and transmitting information regarding the identified surrounding objects to the object detection unit.
Further, the object matching unit may be configured as to include, if at least one of RFID tags, ultrasonic tags, and image markers is set as the type of tag and multiple pieces of the tags are attached to the surrounding objects, a function of identifying the positions and the orientations of the surrounding objects by detecting the positions of the attached tags, and transmitting information regarding the identifying surrounding objects to the object detection unit.
Further, the object matching unit may be configured as to detect the positions and the orientations of the objects by matching the shape of the objects observed by the object detection unit.
With this configuration, accuracy of the object map can be improved by detecting objects located around the microphones and the ultrasonic tags by the object matching unit, and using the shape information of the object stored in the object matching unit. For example, by arranging the shapes of the objects stored in the object matching unit on the object map, it is possible to estimates shapes of parts that the sensor cannot be observed due to occlusion or the like. Generally, it is often difficult to observe all objects present in the environment with a sensor for detecting objects because of problems of occlusion, observation ranges, and accuracy. As such, by storing shapes of objects in the object matching unit beforehand as the localization system of the exemplary embodiment, it is possible to effectively acquire the shapes of the objects present in the environment without observing all shapes of the objects each time.
Further, the localization system may be configured as to be mounted on a robot for searching objects.
Further, it is also possible to include a sound source estimation step for calculating the position of the ultrasonic tag by estimating reflection propagation paths of the ultrasonic wave based on the positional information of the surrounding objects detected and identified beforehand regarding the surrounding objects present around the microphone array and the propagation time for each of the microphones calculated in the propagation time calculation step, and estimating the position of the sound source.
This configuration provides advantages that a sound source (e.g., object to be delivered with an ultrasonic tag) in which the position thereof is not identified can be detected and recognized at real time, searching for the sound source can be performed continuously, and the positions can be identified even for the case of a plurality of sound sources.
Note that before the radio wave transmission step is performed, a surrounding object detection step for detecting surrounding objects present around the microphone array and generating an object map may be provided.
Further, the surrounding object detection step may be configured as to include an object information storing step for storing the positions, shapes and sizes of the surrounding objects present around the microphone array as surrounding map information, and an object map generation step for generating an object map for identifying the relative positions of the surrounding objects viewed from the respective microphones configuring the microphone array based on the stored surrounding map information.
Further, the object map generation step in the surrounding object detection step may be configured such that shape information of the objects corresponding to the surrounding objects may be taken out from the object matching unit which detects and stores the positions, shapes and sizes of the surrounding objects present around the microphone array beforehand, and arranged on the identified object map.
With this configuration, as the surrounding objects have been identified in any cases, an object map can be identified rapidly and easily. Thereby, even for a sound source that the position thereof is not identified, estimation of the sound propagation paths and identification of the positions can be performed rapidly with high accuracy.
Note that for performing the position estimation computing function, the system may include a surrounding object identifying function for storing the positional information and the shape information of the surrounding object present around the microphone array which have been acquired by the object detection unit beforehand, and generating an object map based on the detection result, which is executed by the computer.
Further, the surrounding object identifying function may be configured as to include an object information storing function for, if information about the positions, shapes, and sizes of the surrounding objects present around the microphone array are detected by the object detection unit separately provided beforehand, storing this information in the form of surrounding map information, and an object map generating function for generating an object map in order to identify relative positions with the surrounding objects viewed from the respective microphones configuring the microphone array based on the stored surrounding map information.
Further, in the object map generating function, it is also acceptable to extracting shape information of objects corresponding to the surrounding objects from the object matching unit which detects and stores beforehand the positions, shapes and sizes of the surrounding objects present around the microphone array, and arranging the information on the object map, which is to be executed by the computer.
The above embodiments are provided for exemplary purposes, and the present invention is not limited to the scope of the embodiments shown in the drawings and may be changed in various ways within the scope of the claims.
While the present invention has been described with reference to embodiments (and examples) thereof, the present invention is not limited to these embodiments (and examples). Various changes in form and details, understood by those skilled in the art, may be made within the scope of the present invention.
This application is based upon and claims the benefit of priority from Japanese patent application No. 2006-233015, filed on Aug. 30, 2006, the disclosure of which is incorporated herein in its entirety by reference.
The localization system of the present invention can be effectively used in public facilities such as an exhibition hall for configuring an optimum layout by measuring the arrangement relationship between exhibitions.
Number | Date | Country | Kind |
---|---|---|---|
2006-233015 | Aug 2006 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2007/066098 | 8/20/2007 | WO | 00 | 2/25/2009 |