The present disclosure relates to a robot generating a map based on multi sensors and artificial intelligence and moving based on the map.
A large-scale retail store, a department store, an airport, a golf course, and the like are places where exchange of goods and services takes place between people. Robots may be useful in the places to offer information or convenience to people.
Robots may be classified as guide robots, security robots, cleaning robots and the like. The robots move in a space, confirming their positions.
The robots are required for holding information on a space, on their current positions, or on a path previously moved by the robots and the like such that the robots move confirming their positions and avoiding obstacles.
The robots may store maps to confirm a space and to move in the space. To generate a map, the robots may draw up a map using a variety of sensors, and may match and store various pieces of information in the map.
However, the sensors have their own features and make errors while the robots are moving. Accordingly, technologies for generating a map reflecting the features and errors are required.
As a means to solve the above-described problems, the present disclosure is directed to allow a robot to generate heterogeneous maps for a space using various sensors.
Additionally, the present disclosure is directed to implement fusion SLAM that identifies the position of a robot using various sensors in a space.
Further, the present disclosure is directed to separate out from heterogeneous maps and provide the separated maps, thereby enabling a robot to use a single type of sensor.
Objectives of the present disclosure are not limited to what has been described. Additionally, other objectives and advantages that have not been mentioned may be clearly understood from the following description and may be more clearly understood from embodiments. Further, it will be understood that the objectives and advantages of the present disclosure may be realized via means and a combination thereof that are described in the appended claims.
A robot generating a map based on multi sensors and artificial intelligence according to an embodiment may include a LiDAR sensor sensing a distance between an object outside of the robot and the robot and generating a LiDAR frame, and a camera sensor capturing an image of an object placed outside of the robot and generating a visual frame.
The robot generating a map based on multi sensors and artificial intelligence according to an embodiment may include a controller that generates a pose graph comprised of a LiDAR branch including one or more of the LiDAR frames, a visual branch including one or more of the visual frames, and a backbone including two or more frame nodes registered with any one or more of the LiDAR frames or the visual frames, and that generates orodometry information which is generated while the robot is moving between the frame nodes.
The robot generating a map based on multi sensors and artificial intelligence according to an embodiment may include a map storage storing the LiDAR branch, the visual branch, the backbone, the odometry information between the frame nodes and the pose graph.
The robot generating a map based on multi sensors and artificial intelligence according to an embodiment may generate a new frame node on the basis of a distance or an angle between the position of a previous frame node and a current position, and may search the map storage for any one or more of the visual frames or the LiDAR frames capable of being registered at the current position, and may register the searched frame in the new frame node generated.
The robot generating a map based on multi sensors and artificial intelligence according to an embodiment may generate a vision-only pose graph in which a LiDAR frame registered in the frame node constituting the backbone is removed from the pose graph.
The robot generating a map based on multi sensors and artificial intelligence according to an embodiment may generate a LiDAR-only pose graph in which a visual frame registered in the frame node constituting the backbone is removed from the pose graph.
The robot moving using a map generated based on multi sensors and artificial intelligence according to an embodiment may include a controller that calculates a current position of the robot by comparing a frame registered in the frame node of the pose graph with a first LiDAR frame generated by the LiDAR sensor or with a first visual frame generated by the camera sensor or by using the odometry information.
The present disclosure according to embodiments enables a robot to generate heterogeneous maps for a space using various sensors.
Additionally, the present disclosure according to embodiments enables a robot to perform fusion-SLAM in a space using various sensors.
Further, the present disclosure according to embodiments makes it possible to extract a partial map from a map comprised of various sensors and to perform SLAM appropriate for a robot.
Effects of the present disclosure are not limited to the above-described ones, and one having ordinary skill in the art to which the disclosure pertains may easily draw various effects from the configuration of the disclosure.
Hereinafter, embodiments of the present disclosure will be described in detail with reference to the drawings so that those skilled in the art to which the present disclosure pertains can easily implement the present disclosure. The present disclosure may be implemented in many different manners and is not limited to the embodiments described herein.
In order to clearly illustrate the present disclosure, technical explanation that is not directly related to the present disclosure may be omitted, and same or similar components are denoted by a same reference numeral throughout the specification. Further, some embodiments of the present disclosure will be described in detail with reference to the drawings. In adding reference numerals to components of each drawing, the same components may have the same reference numeral as possible even if they are displayed on different drawings. Further, in describing the present disclosure, a detailed description of related known configurations and functions will be omitted when it is determined that it may obscure the gist of the present disclosure.
In describing components of the present disclosure, it is possible to use the terms such as first, second, A, B, (a), and (b), etc. These terms are only intended to distinguish a component from another component, and a nature, an order, a sequence, or the number of the corresponding components is not limited by that term. When a component is described as being “connected,” “coupled” or “connected” to another component, the component may be directly connected or able to be connected to the other component; however, it is also to be understood that an additional component may be “interposed” between the two components, or the two components may be “connected,” “coupled” or “connected” through an additional component.
Further, with respect to embodiments of the present disclosure, for convenience of explanation, the present disclosure may be described by subdividing an individual component, but the components of the present disclosure may be implemented within a device or a module, or a component of the present disclosure may be implemented by being divided into a plurality of devices or modules.
In this specification, a robot includes devices that are used for specific purposes (cleaning, ensuring security, monitoring, guiding and the like) or that moves offering functions according to features of a space in which the robot is moving, hereunder. Accordingly, in this specification, devices that have transportation means capable of moving using predetermined information and sensors, and that offer predetermined functions are generally referred to as a robot.
In this specification, a robot may move with a map stored in it. The map denotes information on fixed objects such as fixed walls, fixed stairs and the like that do not move in a space. Additionally, information on movable obstacles that are disposed periodically, i.e., information on dynamic objects may be stored on the map.
As an example, information on obstacles disposed within a certain range with respect to a direction in which the robot moves forward may also be stored in the map. In this case, unlike the map in which the above-described fixed objects are stored, the map includes information on obstacles, which is registered temporarily, and then removes the information after the robot moves.
Further, in this specification, the robot may confirm an external dynamic object using various sensors. When the robot moves to a destination in an environment that is crowded with a large number of pedestrians after confirming the external dynamic object, the robot may confirm a state in which waypoints to the destination are occupied by obstacles.
Furthermore, the robot may determine the robot arrives at a waypoint on the basis of a degree in a change of directions of the waypoint, and the robot moves to the destination of the next waypoint successfully.
A main body 10 may be configured to be long in the up-down direction, and may have the shape of a roly poly toy that gradually becomes slimmer from the lower portion toward the upper portion, as a whole.
The main body 10 may include a case 30 that forms the appearance of the robot 1. The case 30 may include a top cover 31 disposed on the upper side, a first middle cover 32 disposed on the lower side of the top cover 31, a second middle cover 33 disposed on the lower side of the first middle cover 32, and a bottom cover 34 disposed on the lower side of the second middle cover 33. The first middle cover 32 and the second middle cover 33 may constitute a single middle cover.
The top cover 31 may be disposed at the uppermost end of the robot 1, and may have the shape of a hemisphere or a dome. The top cover 31 may be disposed at a height below the average height for adults to readily receive an instruction from a user. Additionally, the top cover 31 may be configured to rotate at a predetermined angle.
The robot 1 may further include a control module 150 therein. The control module 150 controls the robot 1 like a type of computer or a type of processor. Accordingly, the control module 150 may be disposed in the robot 1, may perform functions similar to those of a main processor, and may interact with a user
The control module 150 is disposed in the robot 1 to control the robot during robot's movement by sensing objects around the robot. The control module 150 of the robot may be implemented as a software module, a chip in which a software module is implemented as hardware, and the like.
A display unit 31a that receives an instruction from a user or that outputs information, and sensors, for example, a camera 31b and a microphone 31c may be disposed on one side of the front surface of the top cover 31.
In addition to the display unit 31a of the top cover 31, a display unit 20 is also disposed on one side of the middle cover 32.
Information may be output by all the two display units 31a, 20 or may be output by any one of the two display units 31a, 20 according to functions of the robot.
Additionally, various obstacle sensors (220 in
Additionally, the robot in
The shape of the robot in
The robot in
In a state in which a plurality of the robots in
A LiDAR sensor 220 may sense surrounding objects two-dimensionally or three-dimensionally. A two-dimensional LiDAR sensor may sense positions of objects within 360-degree ranges with respect to the robot. LiDAR information sensed in a specific position may constitute a single LiDAR frame. That is, the LiDAR sensor 220 senses a distance between an object disposed outside the robot and the robot to generate a LiDAR frame.
As an example, a camera sensor 230 is a regular camera. To overcome viewing angle limitations, two or more camera sensors 230 may be used. An image captured in a specific position constitutes vision information. That is, the camera sensor 230 captures image of an object outside the robot and generates a visual frame including vision information.
The robot 1, to which the present disclosure is applied, performs fusion- simultaneous localization and mapping (Fusion-SLAM) using the LiDAR sensor 220 and the camera sensor 230.
In fusion SLAM, LiDAR information and vision information may be combinedly used. The LiDAR information and vision information may be configured as maps.
Unlike a robot that uses a single sensor (LiDAR-only SLAM, visual-only SLAM), a robot that uses fusion-SLAM may enhance accuracy of estimating a position. That is, when fusion SLAM is performed by combining the LiDAR information and vision information, map quality may be enhanced.
The map quality is a criterion applied to both of the vision map comprised of pieces of vision information, and the LiDAR map comprised of pieces of LiDAR information. At the time of fusion SLAM, map quality of each of the vision map and LiDAR map is enhanced because sensors may share information that is not sufficiently acquired by each of the sensors.
Additionally, LiDAR information or vision information may be extracted from a single map and may be used. For example, LiDAR information or vision information, or all the LiDAR information and vision information may be used for localization of the robot in accordance with an amount of memory held by the robot or a calculation capability of a calculation processor, and the like.
An interface unit 290 receives information input by a user. The interface unit 290 receives various pieces of information such as a touch, a voice and the like input by the user, and outputs results of the input. Additionally, the interface unit 290 may output a map stored by the robot 1 or may output a course in which the robot moves by overlapping on the the map.
Further, the interface unit 290 may supply predetermined information to a user.
A controller 250 generates a map as in
A communication unit 280 may allow the robot 1 to communicate with another robot or an external server and to receive and transmit information.
The robot 1 may generate each map using each of the sensors (a LiDAR sensor and a camera sensor), or may generate a single map using each of the sensors and then may generate another map in which details corresponding to a specific sensor are only extracted from the single map.
Additionally, the map of the present disclosure may include odometry information on the basis of rotations of wheels. The odometry information is information on distances moved by the robot, which are calculated using frequencies of rotations of a wheel of the robot or using a difference in frequencies of rotations of both wheels of the robot, and the like. The robot may calculate a distance moved by the robot using the odometry information in addition to information sensed by sensors.
The controller 250 in
A plurality of LiDAR sensors 220 and camera sensors 230 may be disposed outside of the robot 1 to identify external objects.
In addition to the LiDAR sensor 220 and camera sensor 230 in
The artificial intelligence unit 255 may input information that is processed by the LiDAR sensor 220, the camera sensor 230 and the other sensors, or information that is accumulated and stored while the robot 1 is moving, and the like, and may output results required for the controller 250 to determine an external situation, to process information and to generate a moving path.
As an example, the robot 1 may store information on positions of various objects, disposed in a space in which the robot is moving, as a map. The objects include a fixed object such as a wall, a door and the like, and a movable object such as a flower pot, a desk and the like. The artificial intelligence unit 255 may output data on a path taken by the robot, a range of work covered by the robot, and the like, using map information and information supplied by the LiDAR sensor 220, the camera sensor 230 and the other sensors.
Additionally, the artificial intelligence unit 255 may recognize objects disposed around the robot using information supplied by the LiDAR sensor 220, the camera sensor 230 and the other sensors. The artificial intelligence unit 255 may output meta information on an image by receiving the image. The meta information includes information on the name of an object in an image, a distance between an object and the robot, the sort of an object, whether an object is disposed on a map, and the like.
Information supplied by the LiDAR sensor 220, the camera sensor 230 and the other sensors is input to an input node of a deep learning network of the artificial intelligence unit 255, and then results are output from an output node of the artificial intelligence unit 255 through information processing of a hidden layer of the deep learning network of the artificial intelligence unit 255.
The controller 250 may calculate a moving path of the robot using date calculated by the artificial intelligence unit 255 or using data processed by various sensors.
Additionally, the robot may store information sensed by the camera sensor in a specific spot, in the map storage 210 using the camera sensor 230 while the robot is moving in the space 40.
The backbone is information on a trajectory of the robot. Additionally, the backbone includes one or more frame nodes corresponding to the trajectory. The frame nodes further include constraint information in a relation between the frame nodes and other frame nodes. An edge between nodes denotes constraint information. The edge denotes odometry constraint information (odometry constraint) or loop constraint information (loop constraint).
The LiDAR branch of the second layer is comprised of LiDAR frames. The LiDAR frames include a LiDAR sensing value that is sensed while the robot is moving. At least one or more of the LiDAR frames are set as a LiDAR keyframe.
The LiDAR keyframe has a corresponding relation with the nodes of the backbone. In
The visual branch of the second layer is comprised of visual keyframes. The visual keyframes indicate one or more visual feature nodes that are camera sensing values (i.e., an image captured by the camera) sensed while the robot is moving. The robot may generate a plurality of visual feature nodes on the basis of the number of camera sensors disposed in the robot.
In the map structure of
Poses of the robot at the LiDAR or the visual keyframe are same, and the LiDar or the visual keyframe is connected with each frame node. An extrinsic parameter may be added for each keyframe on the basis of a position of the robot, to which the LiDAR sensor or the camera sensor is attached. The extrinsic parameter denotes information on a relative position at which a sensor is attached from the center of the robot.
The visual keyframe has a corresponding relation with the node of the backbone. In
Edges are displayed between nodes v1 to v5 constituting the backbone of the first layer. e12, e23, e34, and e45 are edges between adjacent nodes, and e13, e35, and e25 are edges between non-adjacent nodes.
Odometry constraint information, or for short, odometry information denotes constraints between adjacent frame nodes such as e12, e23, e34, and e45. Loop constraint information, or for short, loop information denotes constraints between non-adjacent frames such as e13, e25, and e35.
The backbone is comprised of a plurality of keyframes. The controller 250 may perform an initial mapping process to add the plurality of keyframes to the backbone. The initial mapping process includes adding the LiDAR keyframe and the visual frame based on the keyframe.
The structure of
Additionally, the backbone includes two or more frame nodes in which any one or more of a LiDAR frame or a visual frame are registered. In this case, the LiDAR frame or the visual frame registered in the frame node is referred to as a keyframe. A pose graph includes the LiDAR branch, the visual branch and the backbone.
Further, the pose graph includes odometry information, loop information and the like among frame nodes. The odometry information includes information on rotations, directions, and the like of wheels, which is generated while the robot is moving between frames nodes. The loop information is based on a set of frame nodes connected using specific constraints between visual keyframes around a specific frame node within a maximum sensing distance of the LiDAR sensor 220.
The controller 250 generates the pose graph in
First, the controller 250 generates a frame node constituting a backbone of a first layer (S45). The frame node is based on a constant distance and angle between nodes. Additionally, the controller 250 generates a frame node when there is any one of LiDAR information (a LiDAR frame) or vision information (a visual frame) at a corresponding position.
Further, the controller 250 may generate a node additionally to generate a specific destination (a drugstore, an information desk and the like). This is possible when a user preset a specific destination or when a user designates the corresponding position as a node previously during the process of generating a frame node.
By reflecting a distance and an angle between nodes in the generation of a frame node, an overlapped node is prevented from being generated when the robot stops.
Next, in the process of registering a visual keyframe (S46), the visual keyframe is registered when there is a big difference in angles between an existing frame and the robot. This is also to prevent the generation of an overlapped keyframe.
Additionally, the controller 250 confirms whether a captured image has enough information to be registered as a visual keyframe, by calculating the number of feature points in the image. If the image is same-colored as a whole (e.g., a wall), or if the image shows people (without feature points), the controller 250 does not register the image as a visual keyframe.
Further, in the process of registering a LiDAR keyframe, the robot generates a keyframe when overlapped areas in LiDAR information are reduced, and the LiDAR information is sensed while the robot is moving. LiDAR information within close distances may be the same. Accordingly, only when pieces of LiDAR information are different from each other because of a difference in distances, the LiDAR frame can be registered as a keyframe.
In steps 45 to 47, the robot collects LiDAR information and vision information in real time while the robot is moving, and the controller 250 may register a frame node of a backbone and a visual keyframe, LiDAR keyframe in real time.
Alternately, the controller 250 may register a frame node of a backbone and a visual keyframe, a LiDAR keyframe after the robot collects and stores LiDAR information and vision information previously while moving primarily.
Additionally, the controller 250 may add odometry constraint information and loop constraint information between nodes as edges while generating a frame node of a backbone.
Certainly, when there is no visual frame/LiDAR frame with respect to the new frame node, the controller 250 may also generate a new LiDAR frame/visual frame using the LiDAR sensor 220 or the camera sensor 230.
The controller 250 performs initialization (S51). Initialization denotes that the controller 250 sets a set of neighbor nodes (Vneighbor) to an empty set (Φ) and sets the number of neighbor nodes (nneighbor) to 0
Additionally, the controller 250 includes nodes located within a maximum angle of a candidate area (Rmax) among nodes located within a certain distance from a candidate visual keyframe (vvcandidate) as neighbor nodes (Vneighbor) (S52). The maximum angle of a candidate area (Rmax) denotes a maximum search range required for identifying an overlapped visual keyframe. For example, when the maximum angle of a candidate area (Rmax) is small, a map with high density may be drawn up.
When a set of neighbor nodes are set, there are neighbor nodes having a difference in directions of θreg or less with respect to a visual keyframe, and when edges of the visual keyframe and the neighbor nodes are matched with a pose graph, this may be determined as an important keyframe for estimating a position (S53).
In the following step, the controller confirms whether the candidate visual keyframe has sufficient 3D points. If the candidate visual keyframe has sufficient 3D points, the candidate visual keyframe may be added as a keyframe in the pose graph including a backbone in
A criterion for 3D points, Nmin3D_point, denotes a minimum 3D point for registering as a keyframe.
The process in
Additionally, as illustrated in
The controller 250 may generate a LiDAR frame at a specific position while the robot is moving. Additionally, the controller registers a first LiDAR frame as a LiDAR keyframe (S56), and, while the robot 1 is moving, generates a new LiDAR frame (S57). The controller 250 selects the new LiDAR frame (S58).
The controller 250 compares the LiDAR frame selected and a keyframe previously registered, and, when an overlapped size is smaller than a certain criterion, registers the LiDAR frame selected as a keyframe (S59). Steps 57 to 59 may be repeated while the robot is moving.
According to the process in
A LiDAR branch, as illustrated in
The generation of a visual keyframe and the generation of a LiDAR keyframe in
The controller 250 registers a keyframe as a frame node of a backbone among keyframes generated as in
The controller 250 according to an embodiment of the present disclosure may generate a visual keyframe and a LiDAR keyframe described with reference to the process of
Specifically, in a user's manual registration state (S61), a user converts a mode into a POI (Point of Interest) input mode.
That is, the interface unit 290 may receive an input of frame node addition. The interface unit may receive meta data on a current position together with the frame node addition.
In response to the input of the user, the controller 250 generates a frame node at the current position, searches a visual frame or a LiDAR frame that may be registered in the generated frame node, and registers the searched visual frame or the searched LiDAR frame in the frame node. Alternately, the controller 250 may control the LiDAR sensor 220 or the camera sensor 230, may generate a new visual frame/LiDAR frame, and may register the generated frames in the frame node.
The controller 250 controls the camera sensor 230 at the current position of the robot 1 to generate vision information and controls the LiDAR sensor 220 to generate LiDAR information.
Additionally, the controller 250 generates a visual keyframe using the generated vision information and generates a LiDAR keyframe using the generated LiDAR information (S62). Then the controller 250 generates a frame node at the position (S63), and registers the visual keyframe and the LiDAR keyframe in the generated frame node (or associates the visual keyframe and the LiDAR keyframe with the generated frame node) (S64).
In the case of a user's non-manual registration state in step 61, the controller 250 determines whether a current position is generated as a frame node on the basis of a difference between a position of a previous frame node (vfprevious) and a current position of the robot (pcurrent) (S65).
For example, the controller 250 uses a criterion that is a distance (Tdist) and an angle (Tangle) required to be satisfied between the previous frame node and a new frame node at a minimum level.
Specifically, the controller 250 confirms whether a difference between the position of the previous frame node (vfprevious) and the current position of the robot (pcurrent) is greater than Tdist or whether an angle between the position of the previous frame node (vfprevious) and the current position of the robot (pcurrent) is greater than Tangle. Additionally, when there is a visual keyframe or a registable LiDAR keyframe for registration at the position (S66) in the case in which conditions in step 65 are satisfied, the controller 250 generates a frame node (S67).
When the frame node is generated, the controller 250 registers the registable visual keyframe and the registable LiDAR keyframe in the frame node (S68) at the position. The controller 250 may register only a visual keyframe in a frame node. Alternately, the controller 250 may register only a LiDAR keyframe in a frame node. Alternately, the controller 250 may register all the visual keyframe/LiDAR keyframe in the frame node.
The process in
According to the above-described process, the robot 1 may generate two types of maps (a vision map, and a LiDAR map) respectively, and may combine the maps with respect to a backbone. Alternately, the robot 1 may extract a map (a vision map, or a LiDAR map) required for a pose graph that has already included a single integrated backbone, and may apply the map to SLAM of the robot 1. This may be applicable in the case in which the robot 1 is equipped only with any one sensor (220 or 230), or in the case in which the robot 1 performs SLAM with respect to any one sensor.
Accordingly, in the above-described process of generating a map, the robot 1 including all the LiDAR sensor 220 and the camera sensor 230 generates two types of maps (a vision map and a LiDAR map) and generates a pose graph including a backbone, using keyframes between the maps.
Then the LiDAR MAP is installed in a robot equipped only with the LiDAR sensor 220, and the vision map is installed in a robot equipped only with the camera sensor 230. As a result, the robots that are respectively equipped with different sensors may estimate a position. Certainly, various scenarios for estimating a position, capable of being used by the robot, may be applied on the basis of features of a map.
The controller 250 loads the pose graph including a frame node onto a memory (S71). Additionally, step 71 is branched on the basis of whether a vision-only pose graph is generated (S72). In the case in which the vision-only pose graph is generated, the controller 250 removes a LiDAR frame, registered in a frame node constituting a backbone, from the loaded pose graph (S73). Additionally, the controller 250 generates a vision-only pose graph in which a LiDAR branch is removed from the pose graph (S74).
During SLAM of a robot that does not include the LiDAR sensor 220, the controller 250 may estimate a position of the robot using the map in
In the case in which a LiDAR-only pose graph is generated in step 71, the controller 250 removes a visual frame, registered in a frame node constituting a backbone, from the pose graph (S75). Additionally, the controller 250 generates a LiDAR-only pose graph in which a visual branch is removed from the pose graph (S76).
During SLAM of a robot that does not include the camera sensor 230, the controller 250 may estimate a position of the robot using the map in
The LiDAR sensor 220 may sense a distance between an object placed outside of the robot and the robot, and may generate a first LiDAR frame (S81). Additionally, the controller 250 selects a frame node stored in the map storage 210 (S82). In selecting a frame node, odometry information, in which the robot moves with respect to the frame node that is confirmed previously, may be used.
The controller 250 compares the generated first LiDAR frame and a second LiDAR frame registered in the frame node (S83). When a position can be estimated on the basis of results of the comparison (S84), the controller 250 estimates a current position of the robot (S86).
When failing to estimate a position in step 84, the controller 250 searches a new frame node in a backbone of the map storage 210 or estimates a position using a first visual frame generated by the camera sensor 230 (S85).
The map storage 210 stores the above-described pose graph. The map storage 210 stores a LiDAR branch including a plurality of LiDAR frames that may be compared with the first LiDAR frame. Further, the map storage 210 stores a visual branch including a plurality of visual frames that may be compared with the first visual frame.
The map storage 210 stores a pose graph including a backbone including two or more frame nodes registered with any one or more of the stored LiDAR frame or the stored visual frame. Additionally, the map storage 210 stores odometry information between frame nodes.
According to the type of a sensor provided by the robot, the map storage 210 may store a pose graph from which a LiDAR branch is removed (
Additionally, the controller 250 may estimate a position by comparing a frame, registered in the frame node of the pose graph, with the first LiDAR frame or the first visual frame. Alternately, the controller 250 may calculate a current position of the robot using the odometry information.
When failing to estimate a position with the LiDAR sensor, the controller 250 extracts a new LiDAR frame. For example, the controller 250 searches the backbone for a first frame node corresponding to the current position to extract a second LiDAR frame registered in the first frame node.
Additionally, the controller 250 may determine that the two frames are different as a result of comparison between the first LiDAR frame and the second LiDAR frame. The controller 250 may extract a third LiDAR frame adjacent to the second LiDAR frame from the LiDAR branch, may compare the first LiDAR frame and the third LiDAR frame, and may calculate a current position of the robot.
When still failing to estimate the position with the LiDAR sensor, the robot may use a visual frame. That is, the controller 250 extracts a second visual frame registered in the first frame node when the first LiDAR frame and the third LiDAR frame are different as a result of comparison between the two frames.
Additionally, the controller 250 calculates a current position of the robot by comparing the first visual frame and the second visual frame.
When failing to estimate the position with respect to the frame node currently confirmed, using the LiDAR frame or the visual frame, the controller 250 may search a new frame node. For example, when failing to estimate a position even though the controller 250 determines that a frame node is v2 in the configuration of
Alternately, the controller 250 may extract a visual frame or a LiDAR frame from the map storage 210 in addition to the keyframe currently registered in the frame node, and may compare the visual frame or the LiDAR frame with a first LiDAR frame/first visual frame acquired at the current position.
The camera sensor 230 captures image of an object placed outside of the robot, and generates a first visual frame (S91).
Additionally, the controller 250 selects a frame node stored in the map storage 210 (S92). In selecting a frame node, odometry information, in which the robot moves with respect to a frame node previously confirmed, may be used.
The controller 250 compares the generated first visual frame and a second visual frame registered in the frame node (S93). When a position can be estimated on the basis of results of the comparison (S94), the controller 250 estimates a current position of the robot (S96).
When the position cannot be estimated in step 94, the controller 250 estimates the position by searching a backbone of the map storage 210 for a new frame node or by using a first LiDAR frame generated by the LiDAR sensor 220 (S95).
Specifically, when a position cannot be estimated with the camera sensor, the controller 250 extracts a new visual frame. For example, the controller 250 searches the backbone for a first frame node corresponding to the current position and extracts a second visual frame registered in the first frame node.
Additionally, the controller 250 may determine that the first visual frame and the second visual frame are different as a result of the comparison between the two frames. The controller 250 may extract a third visual frame adjacent to the second visual frame from a visual branch, may compare the first visual frame and the third visual frame and may calculate a current position of the robot.
When still failing to estimate the position with the camera sensor, the robot may use a LiDAR frame. That is, the controller 250 extracts a second LiDAR frame registered in the first frame node when the first visual frame and the third visual frame are different as a result of the comparison of the two frames.
Additionally, the controller 250 calculates a current position of the robot by comparing the first LiDAR frame and the second LiDAR frame.
When failing to estimate the position with respect to the frame node currently confirmed, using the LiDAR frame or the visual frame, the controller 250 may search a new frame node. For example, when failing to estimate a position even though the controller 250 determines that a frame node is v2 in the configuration of
Alternately, the controller 250 may extract a visual frame or a LiDAR frame from the map storage 210 in addition to the keyframe currently registered in the frame node, and may compare the visual frame or the LiDAR frame with a first LiDAR frame/first visual frame acquired at the current position.
When a LiDAR frame or a visual frame among frame nodes is not registered or a position cannot be estimated, in
Additionally, the controller 250 may extract a second frame node in which the LiDAR frame or the visual frame is registered, among routes taken by the robot, and may calculate a current position of the robot using odometry information from the second frame node to the first frame node.
According to the above-described embodiments, the controller 250 may generate a single map by combining a camera sensor generating vision information and a LiDAR sensor generating LiDAR information. Additionally, the controller 250 may generate a map on the basis of the information sensed by the sensors, as in
Thus, high-quality maps that are mutually compatible may be generated and may be applied to the estimation of positions using frames and keyframes based on different sensors.
Although in embodiments, all the elements that constitute the embodiments of the present disclosure are described as being coupled to one or as being coupled to one so as to operate, the disclosure is not limited to the embodiments. One or more of all the elements may be optionally coupled to operate within the scope of the present disclosure. Additionally, each of the elements may be implemented as single independent hardware, or some or all of the elements may be optionally combined and implemented as a computer program that includes a program module for performing some or all of the combined functions in single hardware or a plurality of hardware. Codes or segments that constitute the computer program may be readily inferred by one having ordinary skill in the art. The computer program is recorded on computer-readable media and read and executed by a computer to implement the embodiments. Storage media that store computer programs includes storage media magnetic recording media, optical recording media, and semiconductor recording devices. Additionally, the computer program that embodies the embodiments includes a program module that is transmitted in real time through an external device.
The embodiments of the present disclosure have been described. However, the embodiments may be changed and modified in different forms by one having ordinary skill in the art. Thus, it should be understood that the changes and modifications are also included within the scope of the present disclosure.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/KR2019/005305 | 5/3/2019 | WO | 00 |