This disclosure relates generally to mobile robots, and more particularly to controlling mobile robots using various sensors.
Robot motion planning is a fundamental problem in robotics that involve determining a sequence of actions or motions for a robot to navigate from its current state to a desired goal state, while avoiding obstacles and satisfying various constraints. Robot motion planning is a complex task that requires finding feasible paths that satisfy criteria such as trajectory efficiency, power efficiency, safety, and task-completion. Robot motion planning algorithms are crucial as they allow robots to operate autonomously, making them adaptable to dynamic environments and capable of performing tasks without human intervention.
In general, planning algorithms and techniques work well when the state estimation is accurate and reliable. However, when the robot is uncertain about its state, then the uncertainty poses significant challenges on planning and controls: inaccurate state estimation leads to incorrect perceptions of the robot's environment, obstacle locations, and its own position. As a result, the planned paths may be suboptimal or unsafe, leading to collisions or failures. A major contribution for state uncertainty comes from sensor measurements. These uncertainties can arise due to noise, biases, calibration errors, or limitations in the sensing capabilities.
The following is a summary of certain embodiments described in detail below. The described aspects are presented merely to provide the reader with a brief summary of these certain embodiments and the description of these aspects is not intended to limit the scope of this disclosure. Indeed, this disclosure may encompass a variety of aspects that may not be explicitly set forth below.
According to at least one aspect, a computer-implemented method includes receiving sensor data from one or more sensors. The method includes generating state data using the sensor data. The state data includes a position estimate of the mobile robot with respect to a target location. The method includes identifying a current confident zone on a unified confident zone map using the state data. The unified confident zone map includes a number of confident zones. Each confident zone is indicative of a given confidence level of given state data of a selected sensor modality for a given location. The method includes generating assessment data indicating whether or not the current confident zone is deemed a failure zone using predetermined criteria. The method includes generating a control command using the unified confident zone map. The method includes controlling the mobile robot based on the control command. The control command relates to a recovery plan of moving the mobile robot out of the current confident zone and to another confident zone with another confidence level when the assessment data indicates that the current confident zone is the failure zone. The another confidence level is greater than a current confidence level of the current confidence zone. The control command relates to another plan when the assessment data indicates that the current confident zone is not the failure zone.
According to at least one aspect, a system includes one or more processors and one or more memory. The one or more memory are in data communication with the one or more processors. The one or more memory include computer readable data stored thereon. The computer readable data include instructions that, when executed by the one or more processors, perform a method. The method includes generating state data using the sensor data. The state data includes a position estimate of the mobile robot with respect to a target location. The method includes identifying a current confident zone on a unified confident zone map using the state data. The unified confident zone map includes a number of confident zones. Each confident zone is indicative of a given confidence level of given state data of a selected sensor modality for a given location. The method includes generating assessment data indicating whether or not the current confident zone is deemed a failure zone using predetermined criteria. The method includes generating a control command using the unified confident zone map. The method includes controlling the mobile robot based on the control command. The control command relates to a recovery plan of moving the mobile robot out of the current confident zone and to another confident zone with another confidence level when the assessment data indicates that the current confident zone is the failure zone. The another confidence level is greater than a current confidence level of the current confidence zone. The control command relates to another plan when the assessment data indicates that the current confident zone is not the failure zone.
According to at least one aspect, one or more non-transitory computer-readable media have computer readable data stored thereon. The computer readable data includes instructions that, when executed by one or more processors, cause the one or more processors to perform a method. The method includes generating state data using the sensor data. The state data includes a position estimate of the mobile robot with respect to a target location. The method includes identifying a current confident zone on a unified confident zone map using the state data. The unified confident zone map includes a number of confident zones. Each confident zone is indicative of a given confidence level of given state data of a selected sensor modality for a given location. The method includes generating assessment data indicating whether or not the current confident zone is deemed a failure zone using predetermined criteria. The method includes generating a control command using the unified confident zone map. The method includes controlling the mobile robot based on the control command. The control command relates to a recovery plan of moving the mobile robot out of the current confident zone and to another confident zone with another confidence level when the assessment data indicates that the current confident zone is the failure zone. The another confidence level is greater than a current confidence level of the current confidence zone. The control command relates to another plan when the assessment data indicates that the current confident zone is not the failure zone.
These and other features, aspects, and advantages of the present invention are discussed in the following detailed description in accordance with the accompanying drawings throughout which like characters represent similar or like parts. Furthermore, the drawings are not necessarily to scale, as some features could be exaggerated or minimized to show details of particular components.
The embodiments described herein, which have been shown and described by way of example, and many of their advantages will be understood by the foregoing description, and it will be apparent that various changes can be made in the form, construction, and arrangement of the components without departing from the disclosed subject matter or without sacrificing one or more of its advantages. Indeed, the described forms of these embodiments are merely explanatory. These embodiments are susceptible to various modifications and alternative forms, and the following claims are intended to encompass and include such changes and not be limited to the particular forms disclosed, but rather to cover all modifications, equivalents, and alternatives falling with the spirit and scope of this disclosure.
As shown in
The environment module 102 is configured to receive and/or obtain environment data from an environment of the mobile robot. The environment data includes sensor data obtained via one or more sensors of the mobile robot, a state of the mobile robot, a goal (e.g., reference location, target location, or docking station location) of the mobile robot, environmental conditions (e.g., weather, temperature, etc.) of the environment of the mobile robot, etc. Upon obtaining this environment data relating to a current environment of the mobile robot, the environment module 102 transmits this environment data to the perception module 104.
The perception module 104 is configured to receive environment data from the environment module 102. The perception module 104 is configured to generate perception data using the sensor data. In the example shown in
The state estimation module 110 is configured to perform state estimation and generate state data, which include a position estimate of the mobile robot. The state estimation module 110 includes a set of sensor modules. Each sensor module corresponds to a particular sensor modality. For example, in
The wireless module 116 is configured to perform state estimation using wireless features. For example, the wireless module 116 is configured to extract wireless features obtained from one or more wireless sensors and generate state data including a position estimate using one or more of these wireless features. The wireless features may include received signal strength Indicator (RSSI) data, fine tune measurement (FTM) data, channel state information (CSI) data, other wireless attributes, or any combination thereof.
The visual input module 118 includes fiducial tag-base state estimation. Fiducial tags are also known as visual markers, which are specially designed patterns or symbols placed in the environment to provide reference points for robot perception systems. These tags are typically designed to be easily detectable and distinguishable by cameras, thereby allowing robots to accurately recognize their position and orientation relative to the tags. Fiducial tags come in various forms, such as quick response (QR) codes, barcodes, specialized marker patterns like April Tags, etc.
A process for fiducial tag-based state estimation involves (1) detection, (2) recognition, (3) pose estimation, and (4) iteration. With respect to the first step of detection, the process includes capturing images, via a camera of the mobile robot, of the environment and performing image processing techniques (e.g., thresholding, edge detection, etc.) to identify the fiducial tags present in the scene. With respect to the second step of recognition, the process includes matching the detected fiducial tags against a known library of tag patterns to identify their unique IDs. With respect to the third step of pose estimation, the process includes using the known properties of the fiducial tags, such as their size and shape, along the detected image coordinates. A program calculates the pose (i.e., position and orientation) of each tag relative to the camera. With respect to the iteration step, this process is repeated over time as new images are captured, allowing for continuous updating of the robot's state estimation based on the detection and recognition of fiducial tags in the scene.
The visual input modality is used besides wheel odometry because, with skid-steer configuration, the robot turning rate is a function of both wheel velocities and skidding rate. As wheel odometry does not consider skidding, the corresponding state estimates are inaccurate. Thus, the fiducial tag-based modality may provide better state estimates that may be used in planning and control.
The IMU module 120 is configured to generate state estimation data using inertial measurement units. The IMU module 120 is configured to generate a position estimate using IMU data from one or more IMU sensors, which may include an accelerometer, a gyroscope, a magnetometer, etc.
The wheel encoder module 122 is configured to generate state estimation data using information obtained from wheels of the mobile robot. For instance, in an example, the mobile robot may comprise a four wheeled skid-steer configuration robot. The wheel encoders therefore comprise rotary encoders, which track motor shaft rotation to generate position and motion information based on wheel movement. The wheel encoder module 122 is therefore configured to generate state estimation data from wheel encoders and/or wheel odometry.
Also, as shown in
As aforementioned, the perception module 104 is configured to generate perception data. The perception data includes state data (e.g. position estimate), a set of confident zone maps, and a unified confident zone map, or any combination thereof. The state data includes a position estimate such as (x, y, θ), where x and y are cartesian position coordinates of the mobile robot and where θ is an orientation of the robot. Also, the perception module 104 includes known sensor models (i.e., mathematical models that describes the relation between the actual sensor output and the robot state in global frame) for all the sensor modalities. In addition, the perception module 104 is configured to transmit the perception data to the motion planner 106. The perception module 104 is also configured to transmit (i) the mapping data from the mapping module 112 and/or (ii) prediction data from the prediction module 114, to the motion planner 106.
The motion planner 106 is configured to receive perception data from the perception module 104 and environment data from the environment module 102. The motion planner 106 is also configured to receive mapping data from the mapping module 112 and prediction data from the prediction module 114. The motion planner 106 is configured to generate motion planning data using the perception data and the environment data. The motion planning data includes a nominal path for the mobile robot. The motion planning data includes control commands for the mobile robot. The control commands include a plan for the mobile robot. The control commands specify a linear velocity of the mobile robot and an angular velocity of the mobile robot. The motion planner 106 is configured to transmit the motion planning data to the control system 108.
The control system 108 is configured to receive motion planning data from the motion planner 106. For example, the motion planning data includes a nominal path for the control system 108 to control a movement of the mobile robot. In response to receiving the motion planning data, the control system 108 is configured to transmit a control signal and/or perform an action that advances the mobile robot according to the nominal path. In addition, the control system 108 is configured to update the environment module 102.
Precise navigation requires accurate and robust state estimation, coupled with effective path-planning and control strategies. The system 100 (e.g., the motion planner 106) receives the state data 204, represented as (x, y, θ), as input data and is configured to generate control commands, represented as (v, w), as output data. With respect to the output data, v represents linear velocity and w represents angular velocity.
More specifically, as an example, the mobile robot 200 may be a rover, which performs smart-docking maneuvers on the surface of the moon. In this case, the rover is initialized with a rough state estimate, some distance, D, away from a stationary charging coil. The rover is configured to autonomously perform precise navigation to and docking with the charging coil, despite the possible existence of negative environmental factors (e.g., low-light conditions, high glare/reflectivity on the fiducial marker, lunar dust obscuring part of the rover's camera lens, etc.).
Also, the mobile robot 200 may be a four wheeled skid-steer configuration robot. The mobile robot includes sensors, such as wheel encoders and RGB cameras and, thus, the system 100 is configured to consider wheel odometry and information from visual fiducial tag, placed at the goal location, for state estimation or noisy state estimation. In addition, a ‘cone of interest,’ may refer to the space from where mobile robot 200 senses the tag and estimates state data therefrom.
Referring to
At step 302, according to an example, the process includes a global planner, which includes any heuristic-based search involving the unified confident zone map. The global planner receives the goal (e.g., target location) and the unified confident zone map as input and generates a series of intermediate goals (step 304) as output. The global planner is configured to generate a global plan from a current location (or position estimate) of the mobile robot 200 to the target location (e.g., goal, destination location, docking/charging station location, etc.). For example, the global plan includes performing a heuristic search based on sensor confidence zones. In this regard, the heuristic search may include identifying confidence zones with confidence levels of at least one predetermined confidence range for intermediate goals. In addition to confidence level, the heuristic search may also distance as a factor for identifying a set of intermediate goals between the source location and the target location.
At step 304, according to an example, the process includes generating a set of intermediate goals or a series of intermediate goals via the heuristic-based search of step 302. The intermediate goals may refer to intermediate locations between a current location (or position estimate) of the mobile robot 200 and the target location (e.g., goal, destination location, docking/charging station location, etc.).
As shown in
Also, as shown in
At step 306, according to an example, the process includes obtaining a current confident zone using the state data and the unified confident zone map. The current confident zone refers to a confident zone of the unified confident zone map that corresponds to the state data.
In general, a confident zone refers to a spatial area that spans the locations where state estimates have the same uncertainty range in the values. There are a number of uncertainty ranges. A similar or same uncertainty range is categorized as a particular confidence level. Upon obtaining the current confident zone that corresponds to the state data, the process proceeds to step 308.
At step 308, according to an example, the process includes assessing whether or not the current confident zone is deemed a failure zone. The process includes generating assessment data indicative of whether or not the current confident zone is deemed a failure zone. If the assessment data indicates that the current confident zone is deemed a failure zone, then the process proceeds to step 312 to generate a recovery plan. In this regard, even though the strategies are robust, there may be instances in which the mobile robot 200 realizes later that the state data and/or unified confident zone map indicates that the mobile robot 200 is located in a highly uncertain sensing zone, or in the wrong state, by not following an optimal path. For instance, the scenarios may occur (i) when the robot misaligns with the charging coil at the goal state, thereby resulting in ineffective charging; and/or (ii) when the mobile robot 200 detects early failure zones, such as zones from where the mobile robot 200 cannot make it to desired final pose irrespective of the planning efforts, as illustrated in
At step 310, according to an example, the process includes determining the responsible sensor modality for the current confident zone of the unified confident zone map and assessing whether or not the one or more sensors, which generated the state data (e.g. position estimate), is of the same sensor modality as the responsible sensor modality. The process may generate assessment data indicative of whether or not the responsible sensor modality is active with respect to the current confident zone. Upon determining that the responsible sensor modality, associated with the current confident zone, is active for the mobile robot at that state data, then the process proceeds to step 316. Alternatively, upon determining that the responsible sensor modality, associated with the current confident zone, is inactive for the mobile robot at that state data, then the process proceeds to step 314.
The responsible sensor modality is the best sensor modality that gives a greatest confidence level (e.g. less error and less uncertainty) and greatest reliability for a given location. Each confident zone is associated with a responsible sensor modality. As a non-limiting example, if a fiducial tag based sensor modality provides greater reliability and exhibits a greater confidence level than wheel odometry sensor modality for a given location, then the fiducial tag based modality is selected as the responsible sensor modality for that given location and its confidence level is selected to represent the confidence zone for that given location on the unified confident zone map. Also, if the mobile robot 200 generated state data via one or more sensors associated with fiducial tag based modality, then the responsible sensor modality is deemed active. Alternatively, if the state data was generated from one or more sensors associated with another sensor modality (e.g., wheel odometry) that did not include the fiducial tag based modality, then the response sensor modality is deemed inactive.
At step 312, according to an example, the process includes generating at least one control command that relates to a recovery plan or a recovery mode for the mobile robot 200. More specifically, the recovery plan and/or recovery mode moves the mobile robot 200 to a more confident space and then replans the mobile robot's next actions. Here, the recovery mode and/or the recovery plan may comprise a receding-horizon method, a hard control, or a combination of receding-horizon method and a hard control. By using this recovery mode feature, the motion planner 106 is configured to activate recovery when the mobile robot 200 is not at a final desired state. Moreover, with this recovery mode feature, the motion planner 106 also does early kick-in if a simulated future plan cannot land the mobile robot 200 in the desired state; thereby avoid unnecessary replanning efforts.
At step 314, according to an example, the process includes generating at least one control command that relates to hard control or performing a hard control. In general, a hard control includes any predefined motion. The predefined motion may include rotation, moving forward, moving backward, or a predefined trajectory.
With respect to step 314, sometimes, the obtained confidence spaces may not be active, for example, if the fiducial tag is not in the camera's view for a fiducial tag-based state estimation modality. In order to activate those spaces, the motion planner 106 makes use of hard control by suspending receding horizon control for some time. For instance, as a non-limiting example, the hard control may include a simple rotation around the mobile robot's vertical axis to find the fiducial tag. Once the fiducial tag comes into view, the confidence space gets activated. In this example, hard control is engaged only when the mobile robot 200 is in a confidence space, with the corresponding sensor modality responsible for that confidence is not active. In the example, consider a scenario with fiducial tag not in view and mobile robot's location (x, y) is determined using only wheel odometry. Also, at (x, y), if fiducial tag based sensor modality is more confident than wheel odometry sensor modality, then the hard control is engaged to make the fiducial modality active. The type of hard control, which is generated at step 314, depends on the robot configuration, speed limits, robot control frequency, etc.
At step 316, according to an example, the process includes generating at least one control command that relates to receding horizon planning via a dynamic cost function. As an example, the receding horizon planning includes a model-predictive type with path integral formulation. For instance, the receding horizon planning comprises model predictive path integral (MPPI), as set forth in TABLE 2. The motion planner 106 is configured to generate a nominal path for the mobile robot 200 to travel from a current location or current position estimate to the goal or an intermediate goal.
As aforementioned, the receding horizon planning includes and uses a dynamic cost function. In this regard, the cost function weights or its formulation may change, depending on which underlying confident zone an agent (e.g., mobile robot) is presently in and what the next goal waypoint is; this allows the planner to dynamically replan, based on various conditions, in order to get more desirable motion characteristics (e.g., slowing down the robot for more precise navigation, or turning sharply to enter a higher-confidence region) which will give the agent a better chance of reaching the overall goal location successfully, as indicated in equation 1.
In equation 1, w is a weight parameter for a cost function feature, for example ƒ=w1*ƒ1, where ƒ1 is distance-to-goal cost; (w1, w2) and (w1new, w2new) are the old weights and the new weights of the cost function ƒ, respectively; g is the new cost function i.e., ƒ changes to g with same or different weights.
In addition,
As shown in
The mobile robot 200 is configured to include at least one sensor system 604. The sensor system 604 senses the environment and generates sensor data based thereupon. The sensor system 604 is in data communication with the processing system 602. The sensor system 604 is also directly or indirectly in data communication with the memory system 606. The sensor system 604 includes a number of sensors. As aforementioned, the sensor system 604 includes various sensors of various sensor modalities. For example, the sensor system 604 includes at least an image sensor (e.g., a camera), a wireless sensor (e.g., Wi-Fi 2.4 GHz on an ESP32 wireless chip), IMU technology (e.g., accelerometer, a gyroscope, a magnetometer, etc.), a light detection and ranging (LIDAR) sensor, a radar sensor, wheel encoders, a motion capture system, any applicable sensor, or any number and combination thereof. Also, the sensor system 604 may include a thermal sensor, an ultrasonic sensor, an infrared sensor, a motion sensor, or any number and combination thereof. The sensor system 604 may include a satellite-based radio navigation sensor (e.g., GPS sensor). In this regard, the sensor system 604 includes a set of sensors that enable the mobile robot 200 to sense its environment and use that sensing information to operate effectively in its environment.
The mobile robot 200 includes a memory system 606, which is in data communication with the processing system 602. In an example embodiment, the memory system 606 includes at least one non-transitory computer readable storage medium, which is configured to store and provide access to various data to enable at least the processing system 602 to perform the operations and functionality, as disclosed herein. The memory system 606 comprises a single memory device or a plurality of memory devices. The memory system 606 may include electrical, electronic, magnetic, optical, semiconductor, electromagnetic, or any suitable storage technology that is operable with the mobile robot 200. For instance, the memory system 606 includes random access memory (RAM), read only memory (ROM), flash memory, a disk drive, a memory card, an optical storage device, a magnetic storage device, a memory module, any suitable type of memory device, or any number and combination thereof.
The memory system 606 includes at least the system 100, which includes at least the environment module 102, the perception module 104, the motion planner 106, and the control system 108. In addition, the memory system 606 includes other relevant data 608. The system 100 and the other relevant data 608 are stored on the memory system 606. The system 100 includes computer readable data. The computer readable data includes instructions. In addition, the computer readable data may include various code, various routines, various related data, any software technology, or any number and combination thereof. The instructions, which, when executed by the processing system 602, is configured to perform at least the functions described in this disclosure. Meanwhile, the other relevant data 608 provides various data (e.g., operating system, etc.), which relate to one or more components of the mobile robot 200 and enables the mobile robot 200 to perform the functions as discussed herein.
In addition, the mobile robot 200 includes other functional modules 610. For example, the other functional modules 610 include a power source (e.g., one or more batteries, etc.). The power source may be chargeable by a power supply of a docking station. The other functional modules 610 include communication technology (e.g., wired communication technology, wireless communication technology, or a combination thereof) that enables components of the mobile robot 200 to communicate with each other, communicate with one or more other communication/computer devices, or any number and combination thereof. The other functional modules 610 may include one or more I/O devices (e.g., display device, speaker device, etc.).
Also, the other functional modules 610 may include any relevant hardware, software, or combination thereof that assist with or contribute to the functioning of the mobile robot 200. For example, the other functional modules 610 include a set of actuators, as well as related actuation systems. The set of actuators include one or more actuators, which relate to enabling the mobile robot 200 to perform one or more of the actions and functions as described herein. For example, the set of actuators may include one or more actuators, which relate to driving wheels of the mobile robot 200 so that the mobile robot 200 is configured to move around its environment. The set of actuators may include one or more actuators, which relate steering the mobile robot 200. The set of actuators may include one or more actuators, which relate to a braking system that stops a movement of the wheels of the mobile robot 200. Also, the set of actuators may include one or more actuators, which relate to other actions and/or functions of the mobile robot 200. In general, the other functional modules 610 include various components of the mobile robot 200 that enable the mobile robot 200 to move around its environment, and optionally perform one or more tasks in its environment.
As described in this disclosure, the system 100 provides several advantages and benefits. For example, the system 100 includes optimal control strategies that account for various inaccuracies in sensor measurements due to noise, biases, calibration errors, or limitations in the sensing capabilities. In addition, the system 100 is configured to make decisions based on this imperfect sensor information.
In order to make planning strategies practically robust under sensor uncertainties and enable a mobile robot 200 to navigate safely and effectively, the system 100 includes optimal control strategies that include zone-based planning schemes with recovery mode. Here, the motion planner 106 is a combination of a receding horizon planning method and hard-control. The motion planner 106 is configured to make the mobile robot 200 traverse through perception spaces of high confidence. The system 100 is configured to understand the strengths and weaknesses of each sensor modality and obtain sensor confident zones. Once these confident zones are mapped, the motion planner 106 is configured to use a dynamic cost function to generate trajectory plans to execute. The weights of the planner cost function change to go through confident spaces in order to reach the goal, thereby the state estimates are accurate along the path.
Furthermore, although this disclosure included an example use-case of a rover that performs smart-docking maneuvers on the surface of the moon, the system 100 is applicable to various real-world applications and may be used in various settings. For example, the system 100 may also be used in settings where satellite-based navigation systems, e.g., GPS, is unreliable or unavailable, such as in a warehouse, a home, a commercial space, a garage, etc.
Also, the system 100 uses a combination of receding horizon control and hard control for planning. Upon obtaining the unified confident zone map, the motion planner 106 is configured to traverse through the more sensor confident spaces via a dynamic cost function and optionally place intermediate goal points in those spaces, as shown in
Furthermore, the above description is intended to be illustrative, and not restrictive, and provided in the context of a particular application and its requirements. Those skilled in the art can appreciate from the foregoing description that the present invention may be implemented in a variety of forms, and that the various embodiments may be implemented alone or in combination. Therefore, while the embodiments of the present invention have been described in connection with particular examples thereof, the general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the described embodiments, and the true scope of the embodiments and/or methods of the present invention are not limited to the embodiments shown and described, since various modifications will become apparent to the skilled practitioner upon a study of the drawings, specification, and following claims. Additionally, or alternatively, components and functionality may be separated or combined differently than in the manner of the various described embodiments and may be described using different terminology. These and other variations, modifications, additions, and improvements may fall within the scope of the disclosure as defined in the claims that follow.
The present application is related to the following patent applications: U.S. patent application Ser. No. ______ (RBPA0481PUS_R409654, filed on Dec. 29, 2023) and U.S. patent application Ser. No. ______ (RBPA0480_R410678, filed on Dec. 29, 2023), which are both incorporated by reference in their entireties herein.
At least one or more portions of this invention may have been made with government support under U.S. Government Contract No. 80LARC21C0013, awarded by National Aeronautics and Space Administration (NASA). The U.S. Government may therefore have certain rights in this invention.