The present invention relates to a method for establishing semantic distance map and related moving device, and more particularly, to a method for establishing semantic distance map and related moving device capable of precisely determining obstacles for performing sweeping.
Sweeping robots are commonly utilized in families. When in an operation of the sweeping robot, all kinds of obstacles should be avoided indoors. The conventional sweeping robot usually adopts a single lens camera for distance measurement, and performs the obstacle avoidance according to the measured distance. Therefore, the conventional sweeping robot directly avoids an obstacle when detecting the obstacle exists in the front, and cannot enter a bottom space of the obstacle for sweeping. Or, when the seeping robot cannot determine the height from the ground to the bottom of the obstacle, the seeping robot may directly hit the obstacle or get stuck when entering the bottom of the obstacle for sweeping.
Therefore, improvements are necessary to the prior art.
In light of this, the present invention provides a method for establishing semantic distance map and related moving device, which improves the ability of obstacle avoidance of the moving device when sweeping.
An embodiment of the present invention discloses an establishing method of semantic distance map for a moving device, comprises capturing an image; obtaining a single-point distance measurement result of the image; performing recognition for the image to obtain a recognition result of each obstacle in the image; and determining a semantic distance map corresponding to the image according to the image, the single-point distance measurement result and the recognition result of each obstacle of in the image; wherein each pixel of the semantic distance map includes an obstacle information, which includes a distance between the moving device and an obstacle, a type of the obstacle, and a recognition probability of the obstacle.
Another embodiment of the present invention discloses a moving device, comprises a monocular image capturing device, configured to capture an image; a non-contact range finding module, configured to obtain a single-point distance measurement result of the image; and a processing unit, coupled to the monocular image capturing device and the non-contact range finding module, configured to perform recognition for the image to obtain a recognition result of each obstacle in the image; and determine a semantic distance map corresponding to the image according to the image, the single-point distance measurement result and the recognition result of each obstacle of in the image; wherein each pixel of the semantic distance map includes an obstacle information, which includes a distance between the moving device and an obstacle, a type of the obstacle, and a recognition probability of the obstacle.
These and other objectives of the present invention will no doubt become obvious to those of ordinary skill in the art after reading the following detailed description of the preferred embodiment that is illustrated in the various figures and drawings.
Please refer to
In detail, please refer to
Each of the pixels of the semantic distance map may include the obstacle information of the image captured by the monocular image capturing device 102, wherein each of the pixels includes at most a piece of obstacle information. In other words, each of the pixels of the semantic distance map includes at most a piece of obstacle information of the moving device 10.
The obstacle information includes a distance between the moving device 10 and an obstacle, a type of the obstacle and a recognition probability of the obstacle. The obstacle information of each pixel of the semantic distance map may be stored in the memory unit 106. In an example, the recognition probability may be a confidence level of a recognition result. In addition, before establishing the semantic distance map, a default type of obstacle of the single-point distance measurement result obtained by the non-contact range finding module 104 is wall.
Since the distance between the non-contact range finding module 104 and the obstacle measured by the non-contact range finding module 104 of the moving device 10 is a real distance. When an obstacle is determined in the front of the non-contact range finding module 104, the processing unit 108 is configured to set the recognition probability of the corresponding pixel to 100%. In an embodiment, an object recognition network model, e.g. Yolov4-tiny or Yolov5s of YOLO (You Only Look Once) series may be adopted to perform the object recognition to reduce overall parameters and computation demands.
After the processing unit 108 may obtain the obstacle information and an obstacle bounding box BB of the obstacle in the image after performing the recognition for the image, wherein a lowest coordinate height of the obstacle bounding box BB corresponding to a real distance to the obstacle may be determined according to a mapping table of coordinate-height-to-distance. Therefore, the processing unit 108 may determine a depth information of the obstacle according to the obstacle in the image and the single-point distance measurement result of the pixel corresponding to the image. In an embodiment, the mapping table of coordinate-height-to-distance may be stored in the memory unit 106.
As shown in
By the above method, the processing unit 108 may recognize whether the lowest coordinate height of the obstacle bounding box BB is lower than a highest imaging height on the ground or not to determine a ground obstacle on the ground, wherein the highest imaging height on the ground denotes a coordinate height of a horizon in the image, i.e. a coordinate height of a boundary between the ground and other places.
In detail, please refer to
When the non-contact range finding module 104 determines that the obstacle type of the pixel is the wall, the ground imaging height H2 is determined by looking up the mapping table of coordinate-height-to-distance; in contrast, when the non-contact range finding module 104 determines that the obstacle type of the pixel is not the wall, the height H1 is equal to the height H2. Therefore, the processing unit 108 according to an embodiment of the present invention may determine the highest imaging height on the ground according to the lower value of the ground imaging height H2 and the ground imaging height H1 in the image captured by the monocular image capturing device 102.
After the processing unit 108 determines the obstacle type, the recognition probability and the distance to the obstacle, the processing unit 108 may update the recognition probability of each pixel accordingly and store the updated recognition probability to the obstacle information of the memory unit 106. In an embodiment, a formula (1) for updating the recognition probability of each pixel is:
wherein p3 denotes the updated recognition probability, p1 is a previous recognition probability, p2 is a current recognition probability and the ratio is a weighted value for the update. Since the formula (1) takes extreme rates, i.e. the probability close to 0% or 100%, as with higher confidence level, 50% of the probability is taken as the weighted value for the update. Therefore, the moving device 10 according to an embodiment of the present invention may rapidly recognize the objects in the image and improve a precision rate of the recognition to improve an ability of obstacle avoidance.
Notably, when the updated recognition probability is larger than or equal to a default probability, the processing unit 108 stores the corresponding obstacle information, i.e. the distance to the obstacle and the obstacle type, to the memory unit 106; in contrast, when the updated recognition probability is smaller than the default probability, the processing unit 108 does not update the corresponding obstacle information. For example, when the updated recognition probability of the pixel is larger than or equal to 65% (i.e. the default probability), the corresponding obstacle information is updated; in contrast, when the updated recognition probability of the pixel is smaller than 65% (i.e. the default probability), no corresponding obstacle information is update.
Therefore, the present invention may update all pixels in the image with the updated recognition probability to complete a global update, and the updated recognition probability, the obstacle information are updated to the memory unit 106.
In an embodiment, please refer to
Since the distance measured by the non-contact range finding module 104 according to an embodiment of the present invention is a real distance, when the distance to the obstacle measured by the non-contact range finding module 104 is different to that determined according to the mapping table of coordinate-height-to-distance corresponding to the pixel, the distance measured by the non-contact range finding module 104 is adopted as the distance to complete a local update.
In another embodiment, as shown in
After the moving device 10 finishes updating the semantic distance map, an obstacle avoidance strategy may be determined according to the semantic distance map.
Then, the moving device 10 may look up the mapping table of coordinate-height-to-distance to determine the distance to the obstacle according to the single-point distance measurement result of the non-contact range finding module 104 when performing the local update to determine whether a space exists under the obstacle or not, e.g. a height from the ground to bottom of a sofa or a cabinet, to determine an obstacle avoidance route of the moving device 10. Therefore, when pixels corresponding to the semantic distance map of the moving route of the moving device 10 existing a wall surrounded by the obstacle pixels, and the distance from the wall to the moving device 10 is larger than the distance from the obstacle to the moving device 10, the processing unit 108 of the moving device 10 may calculate an obstacle bottom level height to determine whether the moving device 10 can enter the space under the obstacle or not.
Please refer to
Based on the distances D1, D2, the pixel numbers PN_1, PN_2 and the mapping of the monocular image capturing device 102, the processing unit 108 according to an embodiment of the present invention may calculate a real height of the bottom of the obstacle to determine whether the moving device 10 can enter the bottom space of the obstacle or not.
That is, when the bottom height of the obstacle is larger than a height of the moving device 10, the moving device 10 may neglect the distance to the obstacle and takes the distance to the wall for the obstacle avoidance, such that the moving device 10 can enter the bottom of the obstacle; in contrast, when the bottom height of the obstacle is smaller than the height of the moving device 10, the moving device 10 avoids the obstacle.
An operation of the moving device 10 may be concluded to an establishing method of a semantic distance map 1000, as shown in
On the other hand, an operation of determining the obstacle avoidance route of the moving device when cleaning may be concluded to an obstacle avoidance method 1100, the obstacle avoidance method 1100 includes the following steps:
Notably, the embodiments of the present invention illustrated above may be properly modified by those skilled in the art, and are not limited thereto. For example, the moving device 10 may further include an object recognition model for performing the objection recognition for the images. In addition, the updated recognition probability or the default probability of the updated recognition probability may all be adjusted according to requirements of a user and computer system, and not limited thereto, which are all within the scope of the present invention.
In summary, the present invention provides a method for establishing semantic distance map and related moving device which reduces physical collisions according to a semantic distance map and a non-contact range finding module when the moving device is in operation and determines whether a bottom space of an obstacle exists or not to improve a convenience of the moving device with a local update method.
Those skilled in the art will readily observe that numerous modifications and alterations of the device and method may be made while retaining the teachings of the invention. Accordingly, the above disclosure should be construed as limited only by the metes and bounds of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
111116954 | May 2022 | TW | national |
Number | Name | Date | Kind |
---|---|---|---|
11604967 | Smolyanskiy | Mar 2023 | B2 |
20180012370 | Aghamohammadi | Jan 2018 | A1 |
20200209401 | Motoyama | Jul 2020 | A1 |
20200225673 | Ebrahimi Afrouzi | Jul 2020 | A1 |
20200326715 | Hurd | Oct 2020 | A1 |
20200349846 | Siboni | Nov 2020 | A1 |
20200409387 | Tsurumi | Dec 2020 | A1 |
20210049376 | Cui | Feb 2021 | A1 |
20220044474 | Zhong | Feb 2022 | A1 |
20220057806 | Guo | Feb 2022 | A1 |
20220075074 | Kang | Mar 2022 | A1 |
20220236075 | Ho | Jul 2022 | A1 |
Number | Date | Country |
---|---|---|
109682381 | Apr 2019 | CN |
113228137 | Aug 2021 | CN |
3712841 | Sep 2020 | EP |
201920982 | Jun 2019 | TW |
WO-2021262943 | Dec 2021 | WO |
WO-2022000260 | Jan 2022 | WO |
WO-2022066583 | Mar 2022 | WO |
WO-2022115766 | Jun 2022 | WO |
Number | Date | Country | |
---|---|---|---|
20230360239 A1 | Nov 2023 | US |