1. Field of the Description
The present description relates, in general, to techniques for remotely controlling movements and operations of vehicles such as ground-based vehicles (e.g., a two-to-four or more wheeled vehicle) and aerial vehicles (e.g., a multi-copter, a drone, or the like), and, more particularly, to methods and systems adapted for providing shared control (i.e., both remote and local operator control) over a semi-autonomous vehicle.
2. Relevant Background
There are many settings where it is desirable to provide shared control over the operations of a robot such as a terrestrial vehicle or an aerial vehicle. For example, an amusement park may provide a ride in which a passenger vehicle is driven along a race track or through an obstacle course, and it may be desirable to allow the passenger to “drive” the vehicle by providing user input indicating a direction they wish to travel and a velocity or speed they want their vehicle to move in the selected direction.
However, the vehicle control is preferably shared to provide collision avoidance with static obstacles and also with other vehicles (which may be driven by other ride participants). In other cases, a human operator may remotely control a flying drone such as a multi-copter in a particular air space, but the control is shared with a central controller to avoid collisions or achieve other goals.
Human-robot shared control has been applied in a variety of settings, but there remains a need for improved shared-control methods that enhance human control such as by providing the human operator more local control or operational freedom while still safely avoiding collisions. For example, shared control has been applied in the field of tele-robot operation for tasks including space exploration and surgery. In the case of mobile robots, methods have been developed for human interaction with a forklift, and shared control has been used with a formation of aerial vehicles via a haptic device. Work has also progressed for human-robot shared control with semi-autonomous vehicles such as wheelchairs or similar vehicles and with self-driving cars and other urban transporters.
The present description provides a shared control method (and relative system for implementing this control method) for a vehicle (e.g., any wheeled or terrestrial vehicle, any aerial vehicle, and the like). A human driver, who may be in the vehicle or remote to the vehicle, commands (i.e., provides user input to provide) a preferred velocity. The method involves transforming this user input into a collision-free local motion that respects actuator constraints (e.g., vehicle kinematics) and allows for smooth and safe control. Collision-free local motions are achieved with an extension of velocity obstacles that takes into account dynamic constraints and also a grid-based map representation of the space through which the vehicle is traveling under share controlled.
To limit the freedom of control of the driver or human operator, a global guidance trajectory (or “ghost vehicle”) can be used in the shared control method. The global guidance trajectory may specify the areas (or volumes) where the vehicle is allowed to drive or move in each time instance. Further, to improve operations in many environments, schedules are imposed along with or as part of the global guidance trajectory to ensure the vehicles, even under human control, achieve broadly stated goals of reaching particular places (move from a first or start position or location to a second or finish position or location) under predefined time schedules.
The low computational complexity of the method makes it well suited for multi-agent settings and high update rates, and a centralized function and a distributed function (or centralized and distributed algorithms) are described herein that allow for real-time, concurrent control of many vehicles (e.g., several to 10 to 40 or more vehicles). Extensive experiments have been performed, and the results of these experiments (using a motorized, robotic wheelchair as the vehicle) at relatively high speeds in tight dynamic scenarios are presented in the following description.
More particularly, a method is described for achieving shared control over movement of a vehicle within a space (2D or 3D space depending on whether the vehicle is a wheeled/tracked vehicle or boat or is an aerial vehicle). The shared control method includes receiving user input related to a velocity (e.g., speed and direction) for the vehicle within the space. The method then includes processing the user input with a local motion planner selectively adjusting the velocity and the direction based on a set of predefined constraints (and the presence of neighboring or other vehicles within the space) to generate a trajectory (which may also be considered a velocity as it defines speed and direction) for the vehicle in the space. The method further includes communicating motion control signals including the trajectory to the vehicle and then operating drive mechanisms in the vehicle based on the motion control signals to move the vehicle from a first position to a second position within the space during an upcoming time period.
In some cases, the trajectory defines a preferred speed for the vehicle and a direction for movement in the space. The set of predefined constraints may include a grid map (e.g., image) of the space that defines locations of obstacles in the space (physical objects such as walls and also, in some cases, virtual objects that may be projected into the space or be displayed on a monitor/display device in the vehicle). There are at least two ways to treat virtual objects: (1) if the virtual object is fixed, it can be placed in the grid map and treated like a fixed physical object; and (2) if the virtual object is created for a transient period only or if it is moving, it can be treated like another vehicle. The processing of the user input includes searching the grid map (e.g., with the vehicle's current position in the space) to identify the obstacles and to define a path from the first position to the second position in the space that avoids the obstacles.
In some implementations of the method, the set of predefined constraints includes a guidance trajectory defining a passageway through the space that can be traveled by the vehicle without collision with the obstacles, and the trajectory is defined so as to control movement of the vehicle to retain the vehicle within the passageway. The passageway may include a set of interconnected waypoints in the space and an active area in which the vehicle may move is associated with each of the waypoints. At least some of the active areas differ in size or shape defining differing areas in which the vehicle may move within the space. In some embodiments, the set of predefined constraints includes kinematic data for the vehicle, and the processing based on neighboring vehicles in the space includes processing data for collision avoidance for other vehicles moving in the space, velocity predictions for ones of the other vehicles neighboring the vehicle in the space, and/or predictions of movements of the other vehicles during an upcoming time period.
In brief, methods and systems are described herein that are designed to provide shared control of semi-autonomous vehicles (terrestrial or ground based vehicles and/or aerial vehicles) based, in many cases, on velocity space optimization. The method for shared control of a semi-autonomous vehicle is performed based on or using: (1) real-time input from a driver or operator of the vehicle; (2) a guidance trajectory that specifies the areas (which may be 3D spaces or volumes for aerial vehicles) where the vehicle is allowed to drive or move in each time instance; (3) avoidance constraints with respect to other agents (which may be other semi-autonomous vehicles) and/or with respect to a grid map defining the location, size, and shape of static obstacles (real or virtual such as in the form of projected images in the vehicle's drive path/space, images displayed in a vehicle's head's up display, and the like); and (4) motion continuity constraints. The local planner may provide local planning of vehicle movements using efficient centralized and/or distributed algorithms based on convex optimization and non-convex search to achieve real-time local planning where an exact grid-based representation of the map can be used in conjunction with a velocity obstacles framework. This may be desirable to allow for real-time control of a larger number of vehicles (e.g., 10 to 40 or more vehicles in a space).
The vehicle 110 may include a processor 112 running a local controller 116 that controls operations of a drive mechanism(s)/actuator 118 to cause the vehicle 110 to move in a particular direction and velocity within the space 105. This control may be based on motion control signals 138 from a central control system 130 and also based on input from user input devices 114. For example, a driver/operator 114 may move a steering wheel and accelerator pad to cause the local controller 116 to operate the drive mechanism/actuator 118 within an active area defined/allowed by the motion control signal 138 from central control system to avoid collision and to follow a schedule as the vehicle 110 moves along a guidance trajectory 144. The user input device 114 may also take the form of a joystick that can be manipulated by the driver/operator of vehicle 110.
The local controller 116 and/or other components on the vehicle 110 may operate to transmit (wirelessly in most cases) user input data 119 in response to operations of the user input device(s) 114 by the driver/operator, and the user input data 119 typically will include a user-preferred velocity (e.g., direction and speed for the vehicle 110 in drive space 105). The communicated data 119 may also include sensor or tracking system data that is useful for determining the present location of other/neighbor vehicles 110 in the space and, in some cases, their present velocity, both of which can be used by the control system 120 in controlling the vehicle 110 via motion control signals 138 to avoid collisions among the vehicles 110 in the space 105.
The system 100 further includes a central control system 120 that may take the form of a computer or computer-based system specially configured to control the vehicles 110 via motion control signals 138. Particularly, the control system 120 includes one or more processors 122 managing operations of input/output (I/O) devices such as a monitor, a keyboard, a mouse, a touchpad, a touchscreen, and the like for receiving input from an operator and for displaying data (such as displaying present locations of vehicles 110 in space 105 and the like). The processor 122 also runs or executes code in a computer readable medium to provide a controller module 130, which provides all or a large portion of the shared motion control described herein. The controller module 130 may include a local motion planner 132 to generate the motion control signals 138, and, to this end, the local motion planner 132 may include a trajectory controller 134 along with an optimization engine 136 whose functions and design are described in detail below.
The processor 122 manages one or data storages devices or memory 140 to store (at least temporarily) data used by the controller module 130 to provide the shared control of the vehicles 110. As shown, for example, the memory 140 may be used to store received user input and sensor data 142 from the vehicle 110. Further, the memory 140 may store a guidance trajectory 144 defined by the controller module 130 for controlling the vehicle 110 including defining active areas 145 within the drive space 105 in which the vehicle 110 may be driven by an operator/driver while following a ride path or other planned trajectory. The memory 140 also stores a number of optimization constraints 150 that are used by the optimization engine 136. The constraints 150 may include a grid map 152 that maps physical obstacles 154 and, in some embodiments, virtual objects 156 (e.g., displayed 2D or 3D images of objects projected into space 105 or displayed to within or nearby the vehicle 110 to appear to be in the space 105) to locations or positions within the drive space 105. The controller module 130 may search the grid map 152 based on a present location of the vehicle 110 to generate a motion control signal 138 that avoids collision with obstacles.
Additionally, the memory 140 may be used to store vehicle properties and/or kinematics 158 for the vehicle 110 that can be used to determine the motion control signal 138 (e.g., an acceptable local trajectory for a next control cycle or at a present time). Other constraints 150 may include data on other vehicles/agents 160 that may be used to avoid collisions and may be determined from sensor data 119 from the vehicle 110 or from an external tracking system (not shown) operating to determine present locations of each of the vehicles 110 in the drive space 105. The optimization constraints 150 may include velocity predictions 164 for other or neighbor vehicles 110 that may be nearby to the vehicle 110 (“ego vehicle”) in the drive space 105. This information can be used by the optimization engine 136 to control the vehicle 110 via signals 138 so as to avoid collisions with other flying or airborne vehicles 110 in the 3D space 105.
With the system 100 understood and in mind, it may be useful to provide an overview of the shared control method that may be provided by operation of the system 100 and similar systems. Then, the description can proceed with explanation of a useful vehicle model and control, an exemplary guidance trajectory, a shared control technique, local motion planning, algorithms for implementing the methods taught herein, and experimental results.
Consider a group of n vehicles of which m are controlled, either in a distributed or centralized way. The position, velocity, and acceleration of a robot, i, are denoted by pi∈2, vi={dot over (p)}i, and ai={umlaut over (p)}i respectively. A disk, D(p,r), of radius ri centered at pi defines its enveloping shape. The enlarged obstacle map is given by a static grid map Or of the positions p where a disk D(p,r) is in collision with a static obstacle.
The driver/operator of the vehicle (or robot) commands the vehicle through the vehicle's user input devices by specifying a desired velocity. A guidance trajectory along with or together with active areas (i.e., areas (or spaces) where the vehicle is allowed to drive (or fly)) can be specified to: (1) guide the motion of the vehicle; (2) limit its movements; (3) guarantee that the vehicle has a certain performance even in the case where the driver does not specify a velocity; and (4) control the throughput of vehicles throughout a drive space or ride area. In brief, the desired velocity and the guidance trajectory (two inputs to the controller module) are combined and a local trajectory is computed (which is used to generate motion control signals, e.g., after optimization, for the vehicle's drive mechanism/actuator).
For each vehicle, the set of local trajectories is given by a controller (of sufficient continuity, C2, in this description) towards a straight-line reference trajectory at velocity u∈2 (see
Throughout this description, x denotes scalars, x vectors, x=∥x∥ its Euclidean norm, X matrices, X sets, and X⊕y their Minkowski sum. The super index *k indicates the value at time tk and subindex *i indicates agent i and xij=xi−xj relative vectors. Time relative to the current time step is employed when appropriate and denoted by {tilde over (t)}=t−tk. In practice, estimated states are used, but, to simplify the notation, no distinction is made herein between estimated and real states.
Turning now to a vehicle model and control, given a reference trajectory at constant velocity pref(t)=pk+u{tilde over (t)} for {tilde over (t)}∈[0,∞), with pk being the position of i at the current time-step k and u being the reference velocity, one can consider a trajectory tracking controller ƒ(zk,u,t)=p(t) that is continuous in the initial state, zk=[pk,{dot over (p)}k,{umlaut over (p)}k, . . . ] of the agent and that is converging to the reference trajectory. The set of feasible motions is given by:
(zk)={u∈2, such that the trajectory ƒ(zk,u,t) respects all dynamic restraints} Eq. (1)
This set can be pre-computed by forward simulation and depends on {dot over (p)}k and {umlaut over (p)}k. A trajectory controller appropriate for the vehicle kinematics can then be formulated.
With regard to such a trajectory controller, and recalling the methods herein described are independent of the chosen controller, one such example is as follows. If the system is modeled as the second order integrator (continuity in acceleration) and the appropriate state feedback control low towards the reference trajectory is formulated, the trajectory ƒ(zk,u,t) is given (for
p(t)=pk+u
with F=({umlaut over (p)}k+2w0({dot over (p)}k−v∞))/(w0−w1)2 and w0 and w1 being design parameters related to the decay of the initial velocity and acceleration. Saturation limits are added, e.g., maximum linear and angular speed (|v1|≦v1,max, |v2|≦v2,max) and maximum linear and angular accelerations (|{dot over (v)}1|≦v1,max, |{dot over (v)}2|≦{dot over (v)}2,max).
Turning now to kinematics of the vehicle, the inventors employed wheelchairs for the semi-autonomous vehicles that can have shared control. With regard to wheelchair (vehicle, in this example) kinematics, the unicycle model is used, with
[
A (fully controllable or holonomic) point p=
{dot over (p)}=ξ,v1=ξ·[cos δ, sin δ]T,v2=ξ·[−sin δ, cos δ]T/Δ Eq. (4)
The local trajectory is applied at this point, and a circle of radius r is considered as the enveloping shape of the vehicle.
With regard to use of a guidance trajectory, it may be desirable for some applications of shared control to have higher control over the motion of the vehicles. In these cases, a roadmap or set of guidance trajectories can be created. A method 300 of creating a roadmap or set of guidance trajectories is shown in
At step 320, a set of waypoints is specified, and, for each waypoint, an arrival time, Tl, (or, alternatively, a speed over the segment) and an active area (where the vehicle is allowed to freely move) of a predefined radius, ρl, around the waypoint is provided. At 340, the method 300 continues with connecting consecutive waypoints. Then, at 350, the method 300 involves generating or defining the guidance trajectory along with the active area around it (e.g., by interpolation or the like). The method 300 then ends at 390. In practice, when two or more active areas (e.g., from different trajectories) intersect, the vehicle can be assigned to an active area depending on its current position and specified attributes of the guidance trajectories, such as if more than one vehicle can follow that trajectory or if the vehicle must stay in the current one.
In the method 300, one can select the trajectory, s, that minimizes the weighted distance to the current point on the guidance trajectory, qs, relative to the current radius of the active area, ρs:
argsmin(1−σs)∥p−qs∥/ρs Eq. (5)
where σs∈(0,1) is an attribute indicating the predilection to stay in one trajectory. This framework provides an intuitive way to guide a team of vehicles throughout the drive space while giving the vehicles and their drivers/operators limited freedom in their movement via the active areas. An active area with a large radius, ρ, implies greater freedom of movement for the vehicle while an active area with a small radius, ρ, imposes less freedom and accurate tracking of the vehicle along and within the guidance trajectory.
Now, with regard to shared control, in each time step, for each vehicle i, a preferred velocity ūi is computed, based on the input from the driver, the guidance trajectory, and the neighboring obstacles as follows:
ūi=μ1(μ2uijoy+uitraj)+(1−μ1)uiA*+uirep Eq. (6)
where μ2=1 if the vehicle is inside the active area and μ2=0 otherwise, and μ1=1 if q is in line of sight of the vehicle and μ1=0 otherwise (see
uitraj=Kp(qi−pi)+{dot over (q)}i Eq. (7)
where qi is the ideal position on the guidance trajectory at the current time. The repulsive velocity uirep is computed with a potential field approach that pushes the vehicle away from both static and dynamic obstacles. A linear function weighting the distance to other vehicles is used as follows:
where Vr is the maximum repulsive velocity, D and
In cases where the guidance point, qi, is not in direct line of view from the current position, pi, a preferred velocity solely based on the previous terms could be prone to deadlocks. To avoid this, a term, uiA*, computed from a global path from pi to qi taking into account the map Or but ignoring the kinematic constraints and other vehicles may be added. This term is given by the first control input of a standard A* search in the grid map.
With regard to local motion planning, in each control loop, given the preferred velocity, ūi, of the ego vehicle and the current position and velocity of neighboring vehicles, a local trajectory is given by an optimal reference velocity, u*i, is computed by solving an optimization problem in reference velocity space. The optimization problem may be formulated as a combination of a convex optimization with quadratic cost and linear and quadratic constraints and a non-convex optimization.
Two alternative formulations are presented: (1) a centralized optimization where the optimal reference velocities of all vehicles, u*1:m=[u*1, . . . , u*m], are jointly computed and (2) a distributed optimization where each vehicle, i, independently solves an optimization where its optimal reference velocity, u*i, is computed. The position pj and velocity vj of neighboring agents is known. The optimization cost is given in two parts. The first one is a regularizing term penalizing changes in velocity, and the second one minimizing the deviation to the preferred reference velocity. For the centralized cast the cost may be provided as:
C(u1:m):=Ko∥u1:m−v1:m∥2+∥u1:m−ū1:m∥2 Eq. (9)
where Ko is a design constant. For the distributed case, the cost may be provided as:
C(ui):=Ko∥ui−vi∥2+∥ui−ūi∥2 Eq. (10)
The kinematic constraints of the vehicle are included where the vehicle's radius is enlarged by a value ∈>0 and the local trajectories are limited to those with a tracking error below ∈ with respect to the reference trajectory (parameterized by u).
With regard to a first constraint (“Constraint 1”) concerning avoidance of other agents, for every pair of neighboring agents i≦m and j≦n, the constraint is given by the reference velocities such that ∥pi−pj+(ui−uj)t∥≧ri+j=ri+∈i+rj+∈j, for all t∈[0,τ]. This constraint can be rewritten as ui−uj ∉ VOijτ=∪t=0τ((D(pj,rj+∈j) ⊕ D(pi,ri+∈i/t) representing a truncated cone. For example,
where γ+=α+β, γ−=α−β, α=a tan 2(−pij) and β=a cos (ri+j/pij). The first and last constraints represent avoidance to the right and to the left, and the middle one a head-on maneuver (collision-free up to t=τ). The linearization of 2\VOijτ is obtained by selecting the linear constraint with maximum constraint satisfaction for the current relative velocity (minl(nijlvij−bijl)). Other sensible choices include linearization with respect to the preferred velocity ūij or fixed avoidance side (right/left). For j≦m, this linear constraint is added to the centralized optimization. For j>m (dynamic obstacles) or when distributed, it can be rewritten as follows.
In the distributed case, all agents are considered as independent decision-making agents solving their independent optimizations. To globally maintain the constraint satisfaction and avoid collisions, an assumption on agent j's reference velocity should be determined. Variable sharing of avoidance effort might be considered with Δvi=λΔvij and the assumption that Δvj=−(1−λ)Δvij. For collaborative agents that equally share the avoidance effort, λ=0.5. If it is considered that agent j ignores agent i and continues with its current velocity, then λ=1 and full avoidance is performed by agent i. For λ∈[0,1], the constraint is given by:
With regard to a second constraint (“Constraint 2”) concerning active areas, this constraint guarantees that the agent locally remains within the active area. For agent i≦m, the constraint is given by ∥pi−qi+(ui+{dot over (q)}i)τ∥2≦pi2 with qi being the point on the guidance trajectory for the current time, {dot over (q)}i being speed, and ρi the radius of its associated active area (by interpolation between the previous and next waypoint). With the constant velocity assumption for the guidance trajectory, if the vehicle is initially within the active area, this constraint guarantees that the vehicle remains inside for time t≦τ. If the vehicle is initially outside, the constraint guarantees that the vehicle enters within time τ. The constraint can be written as:
This is a quadratic constant (circle in reference velocity space) that is ignored if no active areas are given.
With regard to a third constraint (“Constraint 3”) concerning avoidance of static obstacles, for agent i≦m, the constraint is given by the reference velocities such that the new positions are not in collision with the enlarged obstacle map, pi+uit ∉ Or
With regard to a fourth constraint (“Constraint 4”) concerning dynamic restrictions, each agent should remain within ∈i of its reference trajectory. For a maximum tracking error, ∈i, and a current state, zi=[pi,{dot over (p)}i,{umlaut over (p)}i], the set of reference velocities, ui, that can be achieved with position error below ∈i is given by Ri:=R(zi,∈i) such that:
Ri:={ui∈(zi)|∥(pi+tui)−ƒ(zi,ui,t)∥≦∈i,∀t>0} Eq. (14)
If the trajectory controller defined in part by Eq. (2) is used, the set of reachable reference velocities, Ri, depends on {dot over (p)}i, {umlaut over (p)}i, and ∈i. A mapping, γ, from initial state, {dot over (p)}, {umlaut over (p)}, and the reference velocity, u, to a maximum tracking error is precomputed and stored in a look up table as:
An example of this constraint is shown in
With regards to a useful optimization algorithm and guarantees, the optimization includes two types of constraints: (1) convex (linear and quadratic) and non-convex (grid-based). To efficiently find a solution, the optimization is divided into two parts. First, a convex subproblem is solved resulting in uic, and, second, a search within the grid-based constraints is performed that is restricted to the convex area defined by the linear and quadratic constraints. The set of convex constraints (Constraints 1 and 2 and bounding box of Constraint 4) is denoted by C. The set of non-convex constraints (Constraints 3 and 4 for agent I, with respect to a grid of the same resolution) is denoted by {tilde over (C)}i. For both the centralized (N=m) and the distributed (N=1, without loss of generality) optimizations, the algorithm may be stated as:
Function feasibleDynamics(ui) checks in a precomputed grid if the tracking error is below ∈i given the initial state of the vehicle:
if ui∈Ri (See Eq. 14) then return ‘true’; else ‘false’ Eq. (21)
Function feasibleMap(ui) checks if ui leads to a trajectory in collision with static obstacles given by the grid map O. This is efficiently checked in the precomputed dilated map, Or
if segment(pi,pi+uiτ)∩Or
The function expandNeighbors adds the neighboring grid points if they are within the convex region defined by the convex constraints in C, and they were not previously explored. This algorithm is recursive for N>1, although only one level of recursion is used to reduce computational time at the expense of optimality:
If the optimization is unfeasible, constraints 2 are removed and the optimization is recomputed. If still unfeasible, agents i≦N decelerate on their last feasible local trajectory following the time remap:
γ(t)=−tƒ2/(2τ)+(1+tƒ/τ)t−t2/(2τ) Eq. (16)
where tƒ is the time where the last feasible local motion was obtained. This reparameterization of the trajectories guarantees that the vehicles stop within the time horizon of the local planner, unless the optimization becomes feasible in a later time step.
With regard to computational complexity, with a limit in the number of linear constraints for collision avoidance, the complexity of the algorithm can be kept linear with the number of agents, although the centralized is of higher cost. If distributed, for each agent, the optimization is made up of two variables, one quadratic constraint, four linear constraints (bounding box of Constraint 5), a maximum of m linear constraints of Type 1 (in practice limited to a constant Kc) and a wave expansion within the 2D grid. If centralized, the optimization is made up of 2n variables, n quadratic constraints, less than
linear constraints (limited to 4n+nkc) and a joint wave expansion in n 2D grid maps.
With regard to safety guarantees, if feasible, the local trajectories are collision free up to time τ, with the assumption that other vehicles follow the same algorithm or maintain constant velocity. If unfeasible, no collision-free solution exists that respects all the constraints. If the time horizon is longer than the required time to stop, safety is preserved if all vehicles drive their last feasible trajectory with a time reparameterization to reach stop before a collision arises:
{dot over (γ)}(tƒ+τ)=(1+tƒ/τ)−(tƒ+τ)/τ=0 Eq. (17)
Note, though, that the optimization can be unfeasible due to several causes. First, there may not be enough time to find the solution within the allocated time. Second, there may be differences between the model vehicle and the real world vehicle. Third, there may be noise in localization and estimation of the vehicle's state. Fourth, optimization may be unfeasible due to the limited local planning horizon together with over simplification of motion capabilities by reducing them to the set of local motions. Fifth, if distributed, unfeasibility may be caused given the use of pair-wise partitions of velocity space with either the assumption of equal effort in the avoidance or constant speed. Hence, not all world constraints are taken into account for the neighboring agents/vehicles. As a result, a vehicle may have conflicting partitions with respect to different neighbors, static obstacles, and/or kinematics that render optimization unfeasible for this vehicle.
With regards to deadlock-free guarantees, for a single agent/vehicle, deadlock-free navigation is achieved in part due to the uA* term in the optimization that drives the vehicle toward its goal position or guidance trajectory. In the multi-agent (or vehicle) scenario, deadlocks are still possible although in degenerated situations. Giving priority to those agents/vehicles further in their guidance trajectory can help resolve this situation, though. The input from the driver of the vehicle can also act as a deadlock-breaking input.
Turning to experimental results, the algorithms have been extensively tested in simulation with larger groups of robots/agents. Due to space limitations in the test setup, the analysis relied upon real wheelchairs for the vehicles with a driver (see
As part of the experiment with shared control, semi-autonomous driving was tested without a guidance trajectory and where each vehicle was driven by a human that inputted a desired velocity, ujoy. Over 50 hours of experiments showed that the algorithms described above are safe and collisions with walls and other obstacles as well as with other vehicles were avoided.
With regard to constrained driving experiments using a guidance trajectory along with a grid map of obstacles,
As will be appreciated based on the above description, a method is taught for shared control of a vehicle. In this shared control method, the driver commands the vehicle by specifying a preferred velocity. The specified velocity is then transformed into a local motion that respects the actuator limits of the vehicle and is collision free with respect to other moving vehicles and static obstacles (physical or virtual) given by (or defined by) a grid map of the drive space. This allows for smooth and safe control of the vehicle. In order to limit the freedom of the driver, a global guidance trajectory can be included in some embodiments of the shared control method, and the global guidance trajectory specifies the areas (or spaces) where the vehicle is allowed to drive in each time instance. Good performance has been observed in extensive experimental tests at speeds of up to 3 m/s and in close proximity to other vehicles and obstacles (e.g., walls). Further, the low computational time of the optimization algorithm allows for real-time control of numerous vehicles in the drive space.
The above description teaches the inventors' shared control for vehicles that are wheeled or driving on the ground in a 2D drive space. The inventors further explored application and/or extension of their shared control method (and associated system) to collision avoidance for aerial vehicles especially in multi-agent (or multi-aerial vehicle) scenarios. The following paragraphs describe an investigation of local motion planning and collision avoidance for a set of decision-making agents (vehicles) navigating or flying in 3D space. The shared control method is applicable to agents that are heterogeneous or homogeneous in size, dynamics, and aggressiveness. The method may use and expand upon the concept of velocity obstacles (VO), which characterizes the set of trajectories that lead to a collision between interacting agents.
Motion continuity constraints are satisfied by using a trajectory tracking controller and by constraining the set of available local trajectories in an optimization. Collision-free motion is obtained by selecting a feasible trajectory from the VO's complement, where reciprocity can also be encoded. The following three algorithms for local motion planning are presented herein: (1) a centralized convex optimization in which a joint quadratic cost function is minimized subject to linear and quadratic constraints; (2) a distributed convex optimization derived from the centralized convex optimization; and (3) a centralized non-convex optimization with binary variables in which the global optimum can be found but at a higher computational cost. The following description also presents results from experiments with up to four vehicles/agents in the form of physical quadrotors (or multi-copters) flying in close proximity and with two quadrotors avoiding a human in the 3D space (or flying space or air space).
The last decade has seen significant interest in unmanned aerial vehicles (UAVs) including multi-rotor helicopters (or multi-copters) due to their mechanical simplicity and agility. Successful navigational control builds on the interconnected competences of localization, mapping, and motion planning and/or control. The real-time computation of global collision-free trajectories in a multi-agent setting had proven challenging. One approach used was to hierarchically decompose the problem into a global planner, which may omit motion constraints, and a local reactive component that locally modifies the trajectory to incorporate all relevant constraints. The local reactive component is one topic of the following paragraphs, and this description teaches a real-time reactive local motion planner that takes into account motion constraints, static obstacles, and other flying vehicles or agents. Further, the important case where other agents/vehicles are themselves decision-making vehicles is addressed with the shared control method (and system).
Among the inventors' contributions provided in their shared control method is a derivation and evaluation of three methods for local planning in 3D environments or air spaces (e.g., based on recent extensions of the VO concept). A first method is a distributed convex optimization in which a quadratic cost function is minimized subject to linear and quadratic constraints. Only knowledge of relative position, velocity, and size of neighboring agents/vehicles is utilized. A second method for local planning uses a centralized convex optimization in which a joint quadratic cost function that is minimized subject to quadratic and linear constraints. This variant scales well with the number of agents and presents real-time capability but may provide, in some cases, a sub-optimal but still useful solution. The third method uses a centralized non-convex optimization in which a joint quadratic cost function is minimized subject to linear, quadratic, and non-convex constraints. This optimization may be formulated as a mixed integer quadratic program, and this variant explores the entire solution space but can, in some situations, scale poorly with the number of agents/vehicles. Each of the optimization methods can: (1) characterize agents as decision making agents that collaborate to achieve collision avoidance; (2) incorporate motion continuity constraints; and (3) handle heterogeneous groups of agents. The following description also provides a complexity analysis and a discussion of the advantages and disadvantages of each optimization method.
The control problem can be thought of as a consideration of a group of n agents freely moving in 3D space. For each agent, i, pi∈3 denotes its position, vi={dot over (p)}i its velocity, and ai={umlaut over (p)}i its acceleration. The goal is to compute for each such agent, at high frequency (e.g., 10 Hz or the like), a local motion that respects the kinematics and dynamic constraints of the ego agent or aerial vehicle. The local motion should also be collision free with respect to neighboring agents or aerial vehicles in the 3D space for a short time horizon (e.g., for 2 to 10 or more seconds), which may be thought of as providing local motion that provides collision avoidance.
Two scenarios are discussed herein: (1) a centralized one where a central unit computes local motions for all agents simultaneously and (2) a distributed one where each agent or vehicle computes independently its local motion. For the second case, no communication between the agents is required, but each agent is able to maintain an estimate of the position and velocity of its neighbors. For the distributed case, neighboring agents are labeled as dynamic obstacles or decision-making agents, which employ an algorithm that is identical to that in the ego agent or vehicle. The local motion planning problem is first formulated as an efficient convex optimization (either centralized or distributed), followed by a non-convex optimization (centralized) that trades off efficiency for richness of the solution space.
The shape of each agent or vehicle is modeled by an enveloping cylinder of radius, ri, and height, 2hi, centered at the agent or vehicle. The local motion planning framework is applied to quadrotor helicopters, which are useful in many applications due to their agility and mechanical simplicity. Implementation details, including an appropriate control framework, are described in the coming paragraphs followed by discussion of extensive experimental results using up to four quadrotors along with a human moving through the 3D space.
As an overview to local motion planning, it may be useful to first introduce the terminology used to describe motion planning and then follow this with a definition of the local trajectories and an overview of the optimization problem solved to obtain collision-free motions. With regard to terminology,
A “global trajectory” is a trajectory between two configurations embedded in a cost field with simplified dynamics and constraints.” A “preferred velocity” is an ideal velocity computed by a guidance system to track the global velocity, and it may be denoted as ū∈3. A “local trajectory” is the trajectory executed by the agent or vehicle for a short time horizon and is selected from a set of candidate local trajectories. A “candidate local trajectory” is one that respects the kinematics and dynamic constraints of the agent or vehicle and is computed from a candidate reference trajectory via a bijective mapping or the like. A “candidate reference trajectory” is a trajectory given by a constant velocity, u∈3, that indicates the target direction and velocity of the agent or aerial vehicle. An “optimal reference trajectory” is a trajectory given by the optimal reference velocity, u*∈3, selected from the set of candidate reference velocities and which minimizes the deviation to ū∈3. Further, this trajectory defines the local trajectory.
One focus of the present description is on local motion planning using a preferred velocity, ū, computed by a guidance or control system in order to track a global trajectory. An optimal reference trajectory is computed such that its associated local trajectory is collision free. This provides a parameterization of local motions given by u and allows for an efficient optimization in candidate reference velocity space, 3, to achieve collision-free motions.
Each candidate reference trajectory is that of an omni-directional agent moving at a constant velocity, u, and starting at the current position, pk, of the agent as shown by:
Pref(t)=pk+u(t−tk),t∈[tk,∞) Eq. (18)
A trajectory tracking controller, respecting the kinematics and dynamic constraints of the agent, q({tilde over (t)})=ƒ(zk,u,{tilde over (t)}) is considered, continuous in the initial state zk=[pk,{dot over (p)}k,{umlaut over (p)}k, . . . ] of the agent and converging to the candidate reference trajectory. This defines the candidate local trajectories given by q({tilde over (t)}) and a bijective mapping (for a given zk) from candidate reference velocities, u.
As an overview of the optimization problem, a collision-free local trajectory (described above and parameterized by u*) is obtained via an optimization in candidate reference velocity space where the deviation to the preferred velocity, ū, is minimized. In the following description, two alternative formulations are presented as a centralized optimization and a distributed optimization, formulated as a convex optimization with quadratic cost and linearized constraints for the following: (a) Motion Continuity: The position error between the candidate reference trajectory and the local trajectory should be below a limit, ∈, to guarantee that the local trajectory is collision free if the candidate reference trajectory is. Continuity in position, velocity, acceleration, and further derivatives (optional) is guaranteed by construction; and (2) Collision Avoidance: The candidate reference velocities leading to a collision are described by the velocity obstacle, a cone in relative candidate reference velocity space for agents of ∈-enlarged radius. Consider Ci the set of collision avoidance constraints for an agent, i, with respect to its neighbors and C=∪i∈ACi.
These non-convex constraints can be linearized leading to a convex optimization with quadratic cost and linear quadratic constraints. The optimization problem is formulated below, and the convex optimization is described for the case of heterogeneous groups of agents of unknown dynamics. Further, an algorithm is described to solve the original non-convex optimization along with an extension for homogeneous groups of agents (same dynamics and controller). In the centralized case, a single convex optimization is solved where the optimal reference velocity of all agents, u*1:m=[u*1, . . . , u*m], are jointly computed. In the distributed case, each agent, i∈A, independently solves an optimization where its optimal reference velocity, u*i, is computed. In this case, it can be considered that agents collaborate in the avoidance and apply the same algorithm, otherwise they are treated as dynamic obstacles. Only information about the neighbor's current position, velocity, and shape is utilized in the optimization.
Now, it may be appropriate to further describe a formulation for local motion planning, and the following discusses in detail the optimization cost and the optimization constraints used in local motion planning. With regard to preferred velocity, in each local planning iteration, a preferred velocity, ū, can be computed given the current position and velocity of the vehicle so as to follow a global plan to achieve either a goal or a trajectory. With regard to achieving a trajectory, the preferred velocity, ū, is given by a trajectory tracking controller for omnidirectional agents.
With regard to achieving a goal, the preferred velocity, ū, is given by a proportional controller towards the goal position, gi, saturated at the agent's or vehicle's preferred speed,
where K
With regard to repulsive velocity, formulating the local motion planning as an optimization where avoidance constraints are given by Velocity Obstacles produces collision-free motions but typically reduces inter-agent distance to zero. Although optimal, this property can lead to unsafe motions in real scenarios with sensor noise. To appropriately maintain a minimum inter-agent distance, the preferred velocity, ū, is corrected by adding a repulsive velocity, , given by a vector field such as those shown in one of the two examples of repulsive velocities shown in the graph 1400 of
The optimization cost can be given by two parts, with the first one being a regularizing term penalizing changes in reference velocity and the second one being chosen for minimizing the deviation to the preferred velocity, ūi, corrected by the repulsive velocity, i. For example, it has been shown that pedestrians prefer to maintain a constant velocity in order to minimize energy, and, hence, preference was given to turning instead of changing speed. Furthermore, penalizing changes in speed leads to a reduction of deadlock situations. This idea is formalized as an elliptical cost, higher in the direction parallel to ūi+i, and lower perpendicular to it. One can let λ>0 represent the relative weight and then define L=diag(λ,1,1) and Di the rotation matrix of the transformation onto the desired reference frame.
In the distributed case, the quadratic cost can be given by:
C(ui):=Ko∥ui−vi∥2+∥L1/2Di(ui−(ūi+i))∥2 Eq. (20)
where, with the exception of the optimization variables, ui, the values of all other variables are given for the current time instance and Ko is a design constant.
In the centralized case, the quadratic cost can be given as:
C(u1:m):=KoΣi=1m
where
Turning now to motion continuity constraints, in order to guarantee collision-free motions, it can be guaranteed that each agent remains within ∈i of its reference trajectory, with ∈i<minj∥pj−pi∥/2. So, a first constraint (“Constraint 1”) for motion continuity can be stated: for a maximum tracking error, ∈i, and current state, zi=[pi,{dot over (p)}i,{umlaut over (p)}i], the set of candidate reference velocities, ui, that can be achieved with position error lower than the maximum tracking error, ∈i, is given by:
Ri=R(zi,∈i)={ui|∥(pi+{tilde over (t)}ui)−ƒ(zi,ui,{tilde over (t)})∥≦∈i,∀{tilde over (t)}>0} Eq. (22)
For arbitrary ƒ(zi,ui,{tilde over (t)}), the set Ri can be precomputed. For quadrotors freely moving in 3D space, an LQR controller can be used as the function ƒ(zi,ui,{tilde over (t)}). In the present description, poison control and local trajectory are decoupled and ƒ(zi,ui,{tilde over (t)}) is given by Eq. (40). If the agents were omnidirectional, the set Ri would be given by a sphere centered at zero and of radii vmax (no continuity in velocity). For the case of study of this description, given a maximum tracking error, ∈i, and an initial state, zi, the limits can be obtained from the computed data with:
To reduce computational complexity, δRi,∈
With regard to collision avoidance constraints, each agent or vehicle should avoid colliding with static obstacles, other controlled agents or vehicles, and dynamic obstacles. A maximum time horizon, τ, can be considered for which collision-free motion is guaranteed. A static obstacle is given by a region O ⊂3. The vehicles or agents are modeled by an enclosing vertical cylinder. The disk of radius r centered at c is denoted by Dc,r ⊂2, and the cylinder of radius r and height 2h centered at c is denoted by Cc,r,h ⊂ 3.
The collision avoidance constraints are computed with respect to an agent or vehicle following the reference trajectory. Since the agents are not able to perfectly track the reference trajectory, their enveloping cylinder should be enlarged by the maximum tracking error, ∈i. For simplicity, the following notation is used: ri:=ri+∈i and hi:=hi+∈i for the enlarged radius and the cylinder height with pi being the current position of an agent, i. Then a second constraint (“Constraint 2”) regarding avoidance of static obstacles can be stated as: for every agent i∈A and neighboring obstacle O, the constraint is given by the reference velocities for which the intersection between O and the agent is empty for all future time instances, i.e., pi+ui{tilde over (t)} ∉ (O ⊕ C0,r
This constraint can be given by a cone in the 3D candidate reference velocity space generated by O ⊕ C0,r
A third constraint (“Constraint 3”) for inter-agent collision avoidance can be stated: for every pair of neighboring agents i,j∈A, the constraint is given by the relative reference velocities for which the agents' enveloping shape does not intersect within the time horizon, i.e., ui−uj ∉ VOijτ=∪{tilde over (t)}=0τ((Cp
Turning now to approximation by linear constraints, since the volume of reference velocities leading to a collision is convex, the aforementioned collision avoidance constraints are non-convex. To obtain a convex optimization, they can be linearized. For increased control over the avoidance behavior and topology, each non-convex constraint can be approximated by five linear constraints representing to the right, to the left, over and under the obstacle or another agent, and a head-on maneuver, which remains collision-free up to t=τ. In particular, for the feasible space 3\VOijτ consider five (l∈[1,5]) linear constraints (half-spaces) Hijl of the form nijl(ui−uj)≦bijl, with nijl∈3 and bijl∈, and verifying that if ui−uj satisfies any of the linear constraints, it is feasible with respect to the original constraint that:
∪l=15Hijl⊂3\VOijτ Eq. (23)
An example of this process is provided in
Now, with regard to selection of a linear constraint, each collision avoidance constraint can be linearized by selecting one of the five linear constraints. First, one may choose a fixed side for avoidance. If agents are moving towards each other, (vij·pij<0), one may wish to avoid on a predefined side such as on the left (l=1). If agents are not moving towards each other, the constraint perpendicular to the apex of the cone (l=5) can be selected to maximize maneuverability. Second, the linear constraint chosen may be maximum constraint satisfaction for the current relative velocity:
This selection maximizes the feasible area of the optimization, when taking into account the motion continuity constraints discussed above.
A third choice may be maximum constraint satisfaction for the preferred velocity:
if centralized and
if distributed. This selection may provide faster progress towards the goal position, but the optimization may become quickly infeasible if the agent greatly deviates from its preferred trajectory. The first choice or option provides the best coordination results as it incorporates a social rule. The second choice or option maximizes the feasible area of the optimization, and the third choice or option may provide faster convergence to the ideal trajectory. An experimental evaluation of these options is given below. Without loss of generality, consider nij·uij≦bij the selected linearization of 3\VOijτ, which can be directly added in the centralized optimization of Algorithm 1 discussed below.
With regard to partition, in the distributed case, the reference velocity of an agent, j, is typically unknown, and only its current velocity, vi, can be inferred. With the assumption that every agent follows the same optimization algorithm, a partition should be found such that if both agents select new velocities independently their relative reference velocity, ui−uj, is collision free. The idea of reciprocal avoidance can be followed. The change in velocity is denoted by Δvi=ui−vi and the relative change of velocity by Δvij=Δvi−Δvj. In the distributed case, all agents are considered as independent decision makers solving their independent optimizations. To globally maintain the constraint satisfaction and avoid collisions, an assumption on an agent's, j's, velocity can be made.
Variable sharing of avoidance effort might be considered in some cases leading to Δvi=λΔvij with the assumption that Δvj=−(1−λ)Δvij. For collaborative agents that equally share the avoidance effort, λ=0.5. If it is considered that an agent, j, ignores another agent, i, and continues with its current velocity, then λ=1 and full avoidance is performed by the agent, i. With the assumption that both agents share the avoidance effort (change in velocity) for λ∈[0,1], the linear constraint can be written as:
which leads to:
nij·ui≦bi=λbij+nij·((1−λ)vi+λvj Eq. (25)
which is a fair partition of the velocity space for both agents and provides avoidance guarantees.
At this point, it may be useful to provide exemplary algorithms for use in convex optimization. To achieve low computational time, the inherently non-convex optimization can be approximated by a convex optimization with quadratic cost and linear and quadratic constraints. The following two algorithms can be used in this regard with one being for use in the centralized implementation and one in the distributed implementation.
The first algorithm (“Algorithm 1”) for centralized convex optimization can be stated as: a single convex optimization is solved where the optimal reference velocity of all agents, u*1:m=[u*1, . . . , u*m], are jointly computed as:
where the optimization by Eq. (21).
The second algorithm (“Algorithm 2”) for distributed convex optimization can be states as: each agent, i∈A, independently solves an optimization where its optimal reference velocity, u*i, is computed as:
where the optimization cost is given by Eq. (20). These 2m and 2-dimensional quadratic optimizations with linear and quadratic constraints can be solved efficiently using presently available solvers.
If the optimization is feasible, a solution is found that guarantees collision-free motion up to a predefined time horizon, τ. If the optimization is infeasible, no solution exists for the linearized problem. In this case, the involved agents drive or fly their last feasible trajectory (obtained at time tƒ) with a time re-parameterization given by:
γ(t)=(−tƒ2+(2τ+2tƒ)t−t2)/(2τ),t∈[tƒ,tƒ+τ],γ(t)=tƒ+τ/2,t<tƒ+τ Eq. (28)
This time re-parameterization guarantees that the involved agents reach a stop on their previously computed paths within the time horizon of the collision avoidance algorithm. Since this computation is performed at a high frequency, each individual agent is able to adapt to changing situations.
With these two convex optimization algorithms in mind, it may now be useful to discuss several theoretical guarantees. A first theoretical guarantee or remark pertains to computational complexity and can be stated as follows: linear in the number of variables and constraints although the centralized is of higher complexity. If distributed, for each agent, the optimization utilizes three variables, one quadratic constrain, and a maximum of m linear constraints for collision avoidance (in practice, limited to a constant value, Kc). If centralized, the optimization uses 3n variables, n quadratic constraints, and less than
linear constraints (usually limited to nkc).
A second theoretical guarantee or remark pertains to safety guarantees and can be stated as: safety is preserved in normal operation. If feasible, collision-free motion is guaranteed for the local trajectory up to a particular time, τ (optimal reference trajectory is collision-free for agents of radii enlarged by the maximum tracking error, ∈, and agents stay within ∈ of it) with the assumption that all interacting agents either continue with their previous velocity or compute a new one following the same algorithm and assumptions. If one lets pi(t) denote the position of an agent, i, at time t≧tk and recalls that {tilde over (t)}=t−tk, then:
∥pi(t)−pj(t)∥=∥ƒ(zik,ui,{tilde over (t)})−ƒ(zjk,uj,{tilde over (t)})∥≧∥(pik+ui{tilde over (t)})−(pjk+uj{tilde over (t)})∥−∈i−∈j≧ri+∈i+rj+∈j−∈i−∈j=ri+rj Eq. (29)
where the first inequality holds from the triangular inequality and Constraint 1 (ui∈Ri and uj∈Rj). The second inequality holds from Constraint 3 (ui−uj ∉ VOijτ).
If infeasible, no collision-free solution exists that respects all the constraints. If the time horizon is longer than the required time to stop, passive safety is preserved if the last feasible trajectories of all agents are driven with a time re-parameterization to reach stop before a collision arises (see Eq. 28). This time re-parameterization implies a slowdown of the agent, which in turn may render the optimization feasible in a later time step. Since this computation is performed at a high frequency, each individual agent can adapt to changing situations.
A third theoretical guarantee or remark pertains to infeasibility and can be stated as: the optimization can be infeasible due to several causes. First, there may not be enough time to find the solution within the allocated time. Second, there may be differences between the model and the real vehicle. Third, there may be noise in localization and estimation of the agents' states. Fourth, the optimization can be infeasible due to the limited local planning horizon together with an over simplification of motion capabilities by reducing them to the set of local motions. Fifth, if distributed and given the use of pair-wise partitions of velocity, space with either the assumption of equal effort in the avoidance or constant speed, the optimization may be infeasible when not all world constraints are taken into account for the neighboring agents. Thus, an agent may have conflicting partitions with respect to different neighbors rendering its optimization infeasible.
A fourth theoretical guarantee or remark pertains to deadlock-free guarantees and can be stated as: as for other local methods, deadlocks may still appear. With this in mind, a global planner for guidance may be useful in complex scenarios.
Now, it may be useful to provide an exemplary algorithm for use in non-convex optimization. Earlier, a method for local motion planning was described for heterogeneous groups of aerial vehicles (or agents), which was formulated as a convex optimization problem. The following presents an extension to a non-convex optimization where the global optimum can be found but at an increased computational cost. As taught earlier in this description, each non-convex constraint can be approximated by five linear constraints of which only one is introduced in the optimization, which leads to a local optimum. The full solution space can be explored by the addition of binary variables and reformulating the problem as a mixed integer quadratic program (MIQP).
One binary variable, ξcl={0,1}, is added for each linear constraint, l∈[1:5], approximation of a nonconvex constraint, c∈C. Only one out of the five linear constraints is active, thus Σl=15ξcl=4,∀c∈C. An algorithm (“Algorithm 3”) for non-convex optimization (or for centralized mixed-integer optimization) can now be formulated. A single MIQP optimization is solved where the optimal reference velocity of all agents, as well as the active constraints,
where N>>0 is a large constant of arbitrary value. The linear cost vector f
This optimization can be solved via branch and bound using state of the art solvers. Although the number of variables and constraints can be bounded to be linear with the number of agents, the number of branches to be explored increases exponentially. In practice, a bound in the resolution time or number of explored branches (avoidance topologies) should be set. A distributed MIQP optimization can also be formulated, but collision avoidance guarantees would be lost since this can lead to a disagreement in the avoidance side and reciprocal dances appear.
With all the above teaching in mind, a control framework can be presented that is useful with aerial vehicles, and the inventors implemented and experimentally evaluated the control framework for providing shared control over quadrotor helicopters as the aerial vehicles. Particularly,
Motion planning may be divided into global planning and local planning. With regard to global planning, in complex environments, a global planner may be desirable for convergence, but in this discussion a fixed trajectory or goal position may be considered or used (as shown at 1720). With regard to local planning, the controller 1700 may compute (e.g., with a 10 Hz frequency) a collision-free local motion given a desired global trajectory or a goal position. To this end, a guidance system 1730 may output a preferred velocity, ū, given a desired global trajectory or a goal position and the current state of the vehicle. Additionally, a collision avoidance module 1740 receives the preferred velocity, ū, and computes the optimal reference velocity, u*, i.e., the closest collision-free candidate reference velocity that satisfies the motion continuity constraints.
With regard to control, a position controller 1760 controls the position of the quadrotor helicopter (or other aerial vehicle) to the local trajectory and runs at a desired frequency such as 50 Hz offboard in a real-time computer. The local trajectory interpreter 1750 receives the optimal reference velocity, u*, at a number of time instances, tk. The module 1750 then outputs the control states, q, {dot over (q)}, {umlaut over (q)}, to satisfy motion continuity at a particular time, and asymptotic convergence to the optimal reference trajectory as may be given by pk+u*{tilde over (t)}. The position controller 1760 receives setpoints in position, q, velocity, {dot over (q)}, and acceleration, {umlaut over (q)}, and outputs the desired force, f.
The vehicle 1710 may be configured to provide a processor, memory, and software to provide a low-level controller that runs onboard (e.g., a 200 Hz or the like) and provides accurate attitude control at high frequency. The low-level controller may abstract the nonlinear quadrotor model as a point-mass for the high-level control. It may be composed of the following three submodules: (1) an attitude controller 1780 that receives a desired force, f (collective thrust and desired orientation) and outputs desired angular rates,
A quadrotor is a highly dynamic vehicle. To account for delays, the state used for local planning may not be the current state of the vehicle but, instead, a predicted one (e.g., as may be provided by forward simulation of the previous local trajectory) at the time when the new local trajectory is sent. A quadrotor is an under-actuated system in which translation and rotation are coupled. Its orientation (attitude) is defined by the rotation matrix, RIB=[ex, ey, ez]∈SO(3), which describes the transformation from the inertial coordinates frame, I, to the body fixed coordinate frame, B. The following paragraphs provide an overview of the model and control.
With regard to a quadrotor model (vehicle model), each of the four motors of the quadrotor has an angular speed, wi, and produces a force, Fi=ctwi2, and a moment, Mi=cywi2, where ct and cy are model-specific constants. Since the motor dynamics are fast, it is assumed that the desired force, Fi, is reached instantaneously. The control input, o=[T, o1], with o1=[Mx, My, Mz], T being the total thrust, and Mx, My, and Mz being the moments around the body axis, is given by:
where l is the length from the center of gravity of the quadrotor to the rotor axis of the motor and ca is a model constant.
The angular velocity, w, and its associated skew anti-symmetric matrix, w, are denoted by:
The angular acceleration around the center of gravity is given by:
{dot over (w)}=J−1[−w×Jw+o1] Eq. (33)
with J being the moment of inertia matrix reference to the center of mass along the body axes of the quadrotor. If RBI(RIB)−1 denotes the transformation from the body to the inertial frame, the rotational velocity of the quadrotor's fixed body frame with respect to the inertial frame can be given by {dot over (R)}BI=wRBI.
A point-mass model can be used for the position dynamics. The accelerations of the quadrotor's point-mass in the inertial frame, {umlaut over (p)}I, are given by:
where two forces are applied, gravity in the z direction in the inertial frame is given by FG=mg, and T is the thrust in the negative z direction in the quadrotor body-frame.
With regard to attitude control, an approach may be followed that involves directly converting differences of rotation matrices into body angular rates. A desired force setpoint,
and
where {circumflex over (R)}** indicates the estimated rotation matrices.
Now, with regard to position control, according to the double integrator dynamics of Eq. (34), the discrete LTI model can be written as:
Xk+1=Aχk+Bu y=Cχk Eq. (37)
with state vector χ:=[px, vx, py, vy, pz, vz]T, input vector ρ=[Fx, Fy, Fz]T, and system matrices given as:
with I being the identity matrix,
and ⊕ being the Kronecker product. To have an integral action for the LQR controller, the system given in Eq. (37) is augmented to the discrete LTI system:
The control input, ρ, is then given with the state feedback law ρ=−K{tilde over (X)} with K being the solution to the Discrete Algebraic Riccati equation (DARE) with state and input cost matrices.
With regard to motion primitives, a point-mass M-order integrator model can be used or considered, decoupled in each component:
q(M+1)({tilde over (t)})=v Eq. (38)
Then, one can let v∈3 be piecewise continuous and (.)(M+1) represent the derivative of order M+1 with respect to time. The resulting solution q({tilde over (t)}) is CM differentiable at the initial point. For M>0 the state feedback control law is given by:
v({tilde over (t)})=−aMq(M)− . . . −a2{umlaut over (q)}+a1(u−{dot over (q)})+a0(pk+u{tilde over (t)}−q) Eq. (39)
It has been shown that the under-actuated model of a quadrotor can be written as a differentially flat system, with its trajectory parameterized by its position and yaw angle and their derivatives up to forth degree in position and second degree in yaw angle. In the following, the yaw angle is considered constant. Although the method presented herein extends to M=4, in order to simplify the formulations given the difficulty of accurately estimating the high order derivatives and their marginal effect, only continuity in acceleration (M=2) is imposed in the following discussion. For M=2 (continuity in initial acceleration), the motion primitive is:
q({tilde over (t)})=pk+u{tilde over (t)}+Fe−w
with F=({umlaut over (p)}k+2w0({dot over (p)}k−v∞))/(w0−w1)2 and w0, w1 being design parameters related to the decay of the initial velocity and acceleration, which can potentially be different for each of the three axes.
At this point in the discussion, it may be useful to describe results of experiments performed by the inventors with the shared control method of aerial vehicles. Particularly, experiments were performed both in simulation and with physical quadrotors. Simulation utilized a model of the dynamics and enabled experiments for larger groups of vehicles. As an example,
The experimental space utilized was a rectangular space that was approximately 5 m by 5.5 m by 2 m in height. Quadrotor position was measured by an external motion tracking system running at 250 Hz. Each vehicle was modeled with the following parameters: w0=3, w1=3.5, ri=0.35 m, hi=0.5 m, vmax=2 m/s, amax=2 m/s2, ∈=0.1 m, and τ=3 s. The control system 1700 was used to control the quadrotors. A low-level attitude estimator and controller were run on each quadrotor's IMU, and the quadrotor was abstracted as a point-mass for the position controller. The position controller was run on a real-time computer and communication between the real-time computer and the quadrotor was done with a 50 Hz UART bridge. The overlying collision avoidance was run in a normal operating system environment and sent a collision-free optimal reference trajectory to the position controller over a communication channel. The position controller ensured that the quadrotors stayed on their local trajectories as computed by the collision avoidance algorithm. This hard real-time control structure ensured the stability of the overall system and kept the quadrotors on given position references.
The benchmark performance of the system was evaluated for a single quadrotor running the full framework, including local planning in which continuity up to acceleration was imposed.
Position swaps were performed with sets of two and four quadrotors, with about fifty transitions to different goal configurations (both antipodal and random). The centralized algorithm (Algorithm 1) and the distributed algorithm (Algorithm 2) were both tested, with the three different linearization options for the collision avoidance constraints described above. The joint MIQP optimization (Algorithm 3) was also tested, but it proved too slow for real-time performance at 10 Hz in the experimental implementation. However, in simulation, this approach provided good results.
Two experiments were performed in which four quadrotors were transitioned to antipodal positions while also changing height. Two different linearization strategies were employed, with one being the imposition of avoidance and with one being a case when collision avoidance constraints are linearized with respect to the measured velocity.
For the case when avoidance to the right was imposed, the behavior was highly predictable and smooth, with the rotation of all vehicles to one side being imposed in the linearization. This shows the control method provided good results both in the centralized case and the distributed case. Although this characteristic is effective for cases of symmetry, in arbitrary configuration, it can lead to longer trajectories. For the case of linearization with respect to the current velocity, the behavior can be different for different runs and emerges from non-deterministic characteristics of the motion. It was observed that in this particular example that two quadrotors chose to pass over each other using the control method. Although this linearization option presents very good performance in not too crowded environments, its performance may degrade in the case of high symmetry and very crowded scenarios (for example, due to disagreements in avoidance side especially in the distributed case).
Now, it may be useful to discuss the results of experiments performed to test the method for trajectory tracking, e.g., in the case of a moving goal. Two representative scenarios are described, and vertical motion is not displayed in the results because, without loss of generality, motion was approximately in a horizontal plane.
At this point in the description, it may be useful to provide a discussion of the overall behavior of the control techniques over all the performed experiments based on statistics collected and processed by the inventors. Deadlock was not observed in the experiments, but it is possible from a theoretical standpoint and can occur either due to a stop condition being the optimal solution with respect to the optimization objective or due to the optimization problem being infeasible. The optimization can be infeasible for two main reasons: (1) for experimental reasons such as the unmodeled high-order dynamics of the vehicle and other uncalibrated parameters such as delays and (2) for reasons inherent in the method (and the trade-off between richness of planning versus low computational time) such as the limited horizon of the local planning or the convexification of the optimization via the linearization of the collision avoidance constraint.
The local planning optimization, which was computed at 10 Hz, was infeasible in under nine percent of the of the cumulated time steps over all experiments (over 40,000 data points at 10 Hz over more than 50 experimental runs). On encountering this condition, the speed of the quadrotor is reduced at a higher than normal rate following Eq. (28), and the quadrotor can, if necessary, be brought to a halt before a collision due to the finite local planning horizon. In all cases, due to the reduction in speed, the optimization became feasible after a further one or more time steps, e.g., below 0.5 seconds.
With regard to evaluation of the algorithm, good performance was observed throughout the experiments especially for the centralized convex optimization. The distributed optimization performs well in simulation and in experiments with two quadrotors. If the collision avoidance constraints are linearized with respect to the current velocity, the distributed approach suffers from inaccuracies in the velocity estimation, which can lead to performance degradation due to disagreements in the linearization. The centralized, non-convex optimization performs well in simulation but may be too slow in some cases for real-time control/performance.
In support of the above discussion, it may be useful to provide equations or pseudocode used for determining repulsive velocity with the controller modules of the present shared control system. For the repulsive velocity field of
where Vr is the maximum repulsive force and Dr, and Dh are the preferred minimal inter-agent distances in the X-Y plane and the Z component, respectively.
where Hij1 and Hij2 represent avoidance to the right/left, Hij3 and Hij4 above/below and Hij5 represents a head-on maneuver, which remains collision-free up to t=τ.
The reader may also benefit from being aware of the equations for linearization of VO utilized in one implementation of the control system by the inventors. One can denote hij=hi+hj and rij=ri+rj. The non-convex constraint 3\VOijT is linearized to obtain a convex problem. For an approximation with five linear constraints, as in
where Hij1 and Hij2 represent avoidance to the right/left, Hij3 and Hij4 represent avoidance above/below, and Hij5 represents a head-on maneuver, which remains collision-free up to t=τ.
Control systems, such as system 100 of
Throughout the following paragraphs, x denotes scalars, x vectors, M matrices, and X sets. The Minkowsky sum of two sets is denoted by X⊕y and x=∥x∥ denotes the Euclidean norm of vector x. The super index *k indicates the value at time tk and {tilde over (t)}=t−tk the appropriate relative time. Subindex *i indicates agent i, and relative vectors are denoted by xij=xi−xj. For ease of exposition, no distinction is made between estimated and real states.
The control problem can be stated as a group of n agents freely moving in 2D space for which shared control is required or desired. For each agent i, pi∈3 denotes its position, vi={dot over (p)}i its velocity, and ai={umlaut over (p)}i its acceleration. The shape of each agent is modeled by an enveloping circle of radius ri, centered at the agent's position pi, and denoted by D(pi,ri). One goal of the control method is to compute for each agent, at high frequency (e.g., 10 Hz), a local motion that respects the kinematics and dynamic constraints of the ego agent and that is collision free with respect to neighboring agents for a short time horizon (typically, 2 to 5 seconds or the like), and the problem may be labeled simply as “collision avoidance.” As discussed above, centralized and distributed scenarios can be considered and addressed with local motion planning using the same terminology (e.g., global trajectory, preferred velocity, local trajectory, candidate local trajectory, candidate trajectory, and optimal reference trajectory) as discussed with reference to
With regard to definition of candidate local trajectories, each candidate is that of an omnidirectional agent moving at a constant velocity, u, and starting at the current location, pk, of the agent:
pref(t)=pk+u(t−tk),t∈[tk,∞) Eq. (41)
A trajectory tracking controller, respecting the kinematics and dynamic constraints of the agent, p({tilde over (t)})=ƒ(zk,u,{tilde over (t)}) is considered that is continuous in the initial state zk=[pk,{dot over (p)}k,{umlaut over (p)}k, . . . ] of the agent and converging to the candidate reference trajectory. This defines the candidate local trajectories given by p({tilde over (t)}) and a bijective mapping from candidate reference velocities u.
One focus of the inventors' work is on local motion planning using a preferred velocity, ū, computed by a guidance system in order to track a global trajectory. An optimal reference trajectory can be computed such that its associated local trajectory is collision free. This provides a parameterization of local motions given by u and allows for an efficient optimization in candidate reference velocity space, (3), to achieve collision-free motions.
Velocity prediction involves consideration that in each time instance a probability distribution over the future velocities of the ego agent and its neighbors is available or, due to the structure of the problem, it can be relaxed to a utility over the possible future velocities that represents their individual likelihood. This distribution can be computed in an environment around the agent's current velocity. Given a candidate reference velocity, ui, for an agent, i, its utility is defined by a probability distribution:
Utility,φi:2→[0,1]uiφi(ui) Eq. (42)
such that ∫u
In multi-agent applications where several robots/agents/vehicles are controlled, a prediction over the future velocities of the neighboring agents may not be suitable. In that case, a cost can be defined:
Cost,χi:2→[0,∞)uiχi(ui) Eq. (43)
which is then transformed into a utility distribution with a mapping [0,∞)→[0,1]. To maintain the linear characteristics for low cost, one can employ the following (but, note, other mappings are possible and may be used):
φi,aux(ui)=max(0,1−χi(ui)/K0) Eq. (44)
where K0 is the cut-off cost, and this can be normalized to
In the following paragraphs, three cases are discussed: (1) a simplified prediction; (2) utility computed from a motion prediction; and (3) utility computed from a cost distribution.
Using a simplified prediction may be useful in some implementations of a controller or control system. Simplifying assumptions can be made with respect to the neighboring agents such as the following: (1) static agent to provide a zero-order approximation (φj(uj)=1, for uj=0; φj(uj)=0 otherwise); (2) constant velocity for a first-order approximation (φj(uj)=1, for uj=vj; φj(uj)=0); and (3) constant preferred velocity to provide a first-order approximation (φj(uj)=1, for uj=ūj; φj(uj)=0). In these formulations of the velocity predictions, ūj is the preferred velocity of agent j.
In other cases, the velocity prediction may be provided using a utility given a motion prediction. As a step towards accounting for uncertainties and predictions, one may consider the case of having a prediction of the future motion of the neighboring agent, i, given by a Gaussian Process (GP) over its future positions and velocities. The utility of each reference velocity can then be obtained by computing the posterior probability given the Gaussian Process, with different levels of complexity and using the following: (1) reference velocity (φj(uj)=P(uj,GP)); (2) candidate reference trajectory (φj(uj)=P(pkk+uj(t−tk),GP(t))); and (3) candidate local trajectory (φj(uj)=P(ƒ(zjk,uj,t),GP(t))).
In still other cases, the velocity prediction may be provided using a utility given by a cost from the environment. The environment characteristics can be taken into account when constructing the weights over the reference velocities, uj. This approach then provides information about the maneuverability of the neighboring agents and their motion preferences and can later be incorporated in the local motion planning step.
As an example, the cost, χj(uj), can be given by the sum of several terms, which may include the following and should be saturated: (1) deviation from current velocity, where the agent is expected to continue with its current velocity (K1∥uj−vj∥2); (2) deviation from preferred velocity, where velocities close to its preferred velocity incur lower cost (K2∥uj−ūj∥2); (3) cost to go, determined as being proportional to the distance from the desired goal to the expected position after the time horizon of the local planner and as may be determined with a search algorithm or from a pre-computed distance transform map, and, with K3 being a design constant, the term can be normalized to [0,K3vmaxτ] by K5max (0, cost2go(pjk+ujτ)−C0) Eq. (46) where C0=max (0, cost2go(pjk)−vmaxτ); (4) static obstacles, where a high cost, K4→∞, is added for reference trajectories in collision with a static obstacle such that pjk+uj{tilde over (t)}∈Or for some {tilde over (t)}∈[0,τ]; (5) congestion, where a cost, K5<<K4, can be added for reference trajectories in collision with another agent for {tilde over (t)}∈[0,τ]; for this weight, the other agents can be considered static or moving at constant speed, and the intuition behind this is that an agent would prefer to move away from other neighbors but, since they are mobile, the cost of a potential collision is not too high; note, this term penalizes movement towards potentially congested areas and may only be used for guidance as hard collision avoidance constraints are included in the local motion planning framework; and (6) motion velocity field, where the case is considered where a motion velocity field of the environment is available given by a velocity, V(p), for each position p, and this is the case, for example, in multi-lane navigation, with intuition indicating that velocities against the flow field are unlikely; this could be encoded by K5∫0τ∥V(pjk+ujt)−uj∥dt for areas where the velocity field is defined and 0 in free areas (but alternative formulations are possible).
The different cost teams represent the belief the agent has over the environment and their relative weights are design parameters. Costs 1 and 4 to 6 can be obtained by all agents with common observations. On the other hand, costs 2 and 3 are agent specific and cannot be observed, although an estimation may be available and used. These terms are given as an example and can be modified or simplified. One contribution of this description is that of incorporating the velocity prediction in the planning.
Now, turning to determination of preferred velocity, in each time step, each agent, computes a preferred velocity, ūi, which would be its desired reference velocity in the absence of other agents. It can be obtained to achieve different objectives. A first objective may be goal directed. In this case, the preferred velocity, ūi, can be computed from the “cost to the goal” map (e.g., obtained by an F1 expansion as discussed above). Its orientation is that of the shortest path to the goal from the current position of the agent, and its magnitude equals the preferred speed,
A second objective may be trajectory following. In this case, the preferred velocity, ūi, is given by a trajectory tracking controller for omnidirectional agents, such as a controller in position with feed forward velocity, such as:
ū=Kp(qi−pi)+{dot over (q)}i Eq. (47)
where qi is the ideal position on the guidance trajectory at the current time and Kp is a design constant. A third objective may be semi-autonomous driving. In this case, the preferred velocity, ūi, can be specified by a driver, with the preferred velocity, ūi, being proportional to the driving wheel or joystick position (in the relative frame of the vehicle).
In practice, the preferred velocity, ūi, often will be corrected with a repulsive velocity, i, that pushes the vehicle away from both static and dynamic obstacles. Collision-free motion is guaranteed by the constraints that are directly included in the local planning framework. Optimal solutions that minimize deviation with respect to the preferred velocity do produce solutions where agents come arbitrarily close to each other. This behavior can be unsafe in some real world scenarios with uncertainties in sensing and kinematic model. The repulsive velocity is, thus, included to help maintain a minimum distance to other agents and walls, and it moves the ego-agent (controlled vehicle) away when it is too close.
A linear function weighting the distance to other vehicles can be used that is adapted to specify the maximum velocity allowed towards the obstacles or other vehicles as shown by:
where Vr is the maximum repulsive velocity, D and
Turning now to consideration of motion constraints, for an agent, i, the candidate local trajectories can be given by a tracking controller, pi(t)=ƒ(zik,ui,{tilde over (t)}), towards a constant velocity reference trajectory parameterized by u. The set of feasible local trajectories given the current state is defined by:
i(zik)={ui∈2,such that the trajectory ƒ(zik,ui,{tilde over (t)}) respects all dynamic constraints} Eq. (51)
And, in the following discussion, the super-index, *k, is dropped to simplify the notation.
With regard to a first constraint regarding dynamic restrictions (“Constraint 1”), in order to guarantee collision-free motions, it should be guaranteed that each agent remains within a maximum tracking error, ∈i, of its reference trajectory. For a maximum tracking error ∈i and a current state, zi=[pi,{dot over (p)}i,{umlaut over (p)}i], the set of reference velocities ui that can be achieved with position error below ∈i is given by Ri:=R(zi,∈i) and then:
Ri:={ui∈i(zi)|∥(pi+{tilde over (t)}ui)−ƒ(zi,ui,{tilde over (t)})∥≦∈i,∀{tilde over (t)}>0} Eq. (52)
If a local trajectory that is continuous in acceleration is used (such as one generated by one of the trajectory controllers taught herein), the set of reachable reference velocities, Ri, depends on {dot over (p)}i, {umlaut over (p)}i, and ∈i. A mapping, γ, from the initial state, {dot over (p)}i, {umlaut over (p)}i, and a reference velocity, ui, to maximum tracking error is precomputed and stored in a look up table as:
A bounding box given by four linear constraints of the form Hl={ui|nlui≦bl} is computed such that Ri ⊂ ∪l=14Hl. These linear constraints can then be included in the algorithms to reduce the search space.
Although some examples of agents, vehicles, or robots have been provided earlier, it may be useful at this point in the description to provide several additional examples of typical robot models used by the inventors to implement and/or test their control methods (and controllers and control systems). Below, five different robot kinematics and dynamics models are described that have been implemented and tested with this framework, and these cover most of the typical robot platforms presently in use.
First, one may model a robot, vehicle, or agent as a simply unicycle that may take the form of a small, differently-drive robot. In this case, the local trajectories can be defined by two sections including a circumference arc followed by a straight line path at a constant speed, ui. The angular velocity over the arc was, in one model, defined such that the correct orientation was achieved within a fixed time, T, and the linear velocity minimized the tracking error of the reference trajectory. A closed form solution was obtained for the set Ri, but, in this simple formulation, the robots had no constraints in acceleration and the linear and angular velocities presented a discontinuity.
Second, one may model the controlled agent as a bicycle. For example, this modeling may be applied to car-like robots with bicycle kinematics. A trajectory controller can be employed to track the referenced trajectory given by a constant speed, ui, and was obtained by applying full-state linearization via dynamic feedback to the non-linear system. Additional constraints such as maximum steering angle and velocity and linear velocity and acceleration were added as saturation limits. This framework was useful as it guaranteed continuity in both linear velocity and steering angle.
Third, a point mass vth-order integrator may be used to model the controlled vehicle. A controller for point mass vth-order integrator dynamics can and has been described, which is continuous up to the vth derivative of the initial state. For v>0, the state feedback control law was given by:
p(v+1)({tilde over (t)})=−avp(v)− . . . −a2{umlaut over (p)}+a1(u−{dot over (p)})+a0(pk+u{tilde over (t)}−p) Eq. (54)
In particular, for v=2 (continuity in initial acceleration), the trajectory is given by:
p({tilde over (t)})=pk+u{tilde over (t)}+(({dot over (p)}k−u+(ω1−ω0)F){tilde over (t)}−F)e−ω
with F=({umlaut over (p)}k+2ω0({dot over (p)}k−v∞))/(ω0−ω1)2 and ω0, ω1 being design parameters related to the decay of the initial velocity and acceleration.
Fourth, the robot may be modeled as a unicycle with dynamic constraints. For example, as shown in
[
with
{dot over (p)}=ξ,v1=ξ·[cos δ, sin δ]T,v2=[−sin δ, cos δ]T/Δ Eq. (57)
and the system is then controllable about p with holonomic velocity inputs given by ξ∈2 and a local trajectory may be applied at this point. A circle with a radius, r, is considered as the enveloping shape of the vehicle. Equation (55) defines the local motion of the vehicle's holonomic point. Additional constraints can be added such as maximum linear and angular velocity (|v1|≦v1,max, |v2|≦v2,max) and maximum linear and angular accelerations (|{dot over (v)}1|≦{dot over (v)}1,max, |{dot over (v)}2|≦{dot over (v)}2,max).
Fifth, the vehicle may be modeled as an omnidirectional boat with dynamic constraints. The methods taught herein have also been applied to control a team of omnidirectional boats controlled by three rotating propellers. An LQR controller in position in velocity was employed to track the reference trajectory. Additional saturation limits were included that accurately model the boat dynamics.
In the control method, static obstacles are typically given by a grid map, O0, of occupied space. The dilated obstacle map of the positions, p, where a disk, D(p,r), is in collision with a static obstacle is denoted by Or. Its complement defines the positions for which the agent is not in collision. A constraint (“Constraint 2” in some of this description) can be stated regarding collision avoidance with static obstacles. For agent i≦m this constraint is given by the reference velocities such that the new positions are not in collision with the enlarged obstacle map, pi+ui{tilde over (t)} ∉ Or
The constraint can be kept in its original grid-based representation or convexified. Grid-based forms can be used due to the mapping between velocities and future positions. This provides a vehicle with full navigation capability including being able to move close to the static obstacles and being able to go inside small openings. Particularly, given a known grid map, O0, the enlarged map Or
With regard to pair-wise collision avoidance, the following discussion teaches the constraint for inter-agent avoidance (or avoidance of dynamic obstacles), and it may be based on work on velocity obstacles, reciprocal velocity obstacles, and/or their convex counterpart. The constraint is computed for agents of radius {circumflex over (r)}i=ri+∈i following the reference trajectory. First, the constraint is described in its relative velocity space formulation, which can be employed in a centralized optimization. Then, three different approaches to obtain a distributed version are described.
Regarding a velocity obstacle and relative velocity space, the inter-agent collision avoidance constraint can be obtained for the reference trajectory following or using work on velocity obstacles. The velocity obstacle is formed by the relative reference velocities that lead to a collision between the two agents within the given time horizon τ. This constraint (“Constraint 3” or a constraint to provide inter-agent collision avoidance) may be stated: for every pair of neighboring agents i≦m, j∈[i+1,n], the constraint is given be the candidate reference velocities such that ∥pi−pj+(ui−uj){tilde over (t)}∥≧{circumflex over (r)}i+j={circumflex over (r)}i+{circumflex over (r)}j, for all {tilde over (t)}∈[0,τ]. This constraint can be rewritten as ui−uj ∉ VOijτ=∪{tilde over (t)}=0τ((D(pj,{circumflex over (r)})) ⊕ D(pi,{circumflex over (r)}i)/{tilde over (t)}), representing a truncated cone, which is only computed if the distance between the two agents is below a threshold (pij<Kd). Since this constraint is non-convex (free space is the complement of VOijτ), it should be linearized to incorporate it into a convex optimization. Alternatively, for homogeneous teams of robots, the control obstacle (or CN) can be employed as a substitute of the velocity obstacle. Then, CN-CO considers all interacting agents or robots as linear systems with order N dynamics. After linearization of the CN-CO, the method described herein proceeds analogously.
An approximation can be made using linear constraints. Particularly, the non-convex constraint 2\VOijτ (inter-agent collision-free reference velocities) can be approximated by three linear constraints of the form nijl·uij−bijl, with l∈{1,2,3} and given by:
where γ+=α+β, γ−=α−β, α=a tan 2(−pij) and β=a cos(ri+j/pij). The first and last constraints represent avoidance to the right and to the left, respectively, and the middle constraint represents a head-on maneuver, which remains collision free up to {tilde over (t)}=τ.
Turning now to the selection of a linear constraint, 2\VOijτ is linearized by selecting one of the three linear constraints. Sensible choices include the following: (1) fixed side for avoidance: if agents are moving towards each other (vij·pij<0) avoid on a predefined side, on the left (l=1) or on the right (l=3) and if the agents are not moving towards each other the constraint perpendicular to the apex of the cone (l=2) is selected to maximize maneuverability; (2) maximum constraint satisfaction for the current relative velocity,
this selection maximizes the feasible area of the optimization when taking into account the motion continuity constraints; and (3) maximum constrain satisfaction for the preferred velocity,
if centralized and
if distributed: this selection may provide faster progress towards the goal position but the optimization may become quickly infeasible if the agent greatly deviates from its preferred trajectory. Option 1 provides the best coordination results as it incorporates a social rule, Option 2 maximizes the feasible area of the optimization, and Option 3 provides (or may provide) faster convergence to the ideal trajectory. Without loss of generality, consider nij·uij≦bij the selected linearization of 2\VOijτ, which can be included in a centralized convex optimization.
Pair-wise collision avoidance may also include avoidance of a velocity obstacle in the distributed case. The new reference velocity of the neighboring agent, uj, is unknown. One of the following assumptions can be made regarding this velocity: (1) static agent: the assumption uj=0 is made, and the constraint is then given by ui∈2\VOijτ and its linearization by nij·ui≦bij, with this approach being prone to collisions in highly dynamic environments; and (2) constant velocity: the assumption uj=vj is made, and the constraint is then given by ui∈2\VOijτ ⊕ vj and its linearization by nij·ui≦bij+nij·vj, with this approach typically not considering the case where the other agent is also decision-making.
Pair-wise collision avoidance may further include velocity obstacles (or a distributed reciprocal). In the following discussion, the assumption is made that all agents compute collision-free motions following identical algorithms. One attempt towards taking into account decision making can be considered the work on the reciprocal velocity obstacle, where the velocity obstacle was slightly translated and the assumption was made that both agents share the avoidance effort with a known weight. To avoid reciprocal dances, to obtain stronger guarantees, and to coin the problem as a convex optimization, the idea was later extended to optimal reciprocal collision avoidance (ORCA).
The goal with regard to the velocity obstacle is to obtain two sets Fi|j ⊂2 and Fj|i ⊂2 such that for every velocity ui∈Fi|j and for every velocity uj∈Fj|i the relative velocity is collision free, thus:
equivalent to Fi|j ⊕ (−Fj|i) ⊂2\VOijτ. Therefore, agent i can freely select a velocity within Fi|j and agent j can freely select a velocity within Fj|i.
An avoidance topology can be fixed by first linearizing the constraint 2\VOijτ. In this case, the partition is reduced to obtaining a value bi|j such that:
where the implication holds for any bi|j∈. For reciprocal collision avoidance, bi|j can be defined such that if the minimum required change in relative velocity is denoted by Δuij then the minimum required change in velocity for agent i is Δui=λi|jΔuij, with λi|j being a constant to indicate how the avoidance effort is shared between both agents. For agent j, it is then assumed Δuj=−(1−λi|j)Δuij. The implication of Equation (60) then holds, and, for agent i, the linear constraint is computed from:
which implies:
nij·ui≦bi|j=λi|jbij+nij·((1−λi|j)vi+λi|jvj) Eq. (62)
Although decision making is encoded, the parameter λi|j defining the collaboration effort is fixed and known, typically considering a known partition between agents that equally cooperate, λi|j=0.5, and dynamic obstacles, λi|j=1.
At this point in the description, it may be useful to provide more explanation of cooperative pair-wise collision avoidance. Reciprocal methods are a first step towards modeling inter-agent decision making, but they consider all agents equal or a known cooperation effort. Velocity predictions and external factors such as static obstacles were not taken into account. In contrast, the following provides a method to incorporate a velocity prediction or utility function over future velocities in the selection of the sets Fi|j and Fj|i of the velocity partition for distributed collision avoidance. This produces a truly cooperative partition that takes into account the state of each agent and can improve vehicle navigation. A cooperation measure is defined as a function of the utility functions φi(ui) and φj(uj) of both interacting agents and the partition is found such that the cooperation measure is maximized.
With regard to the cooperation measure, to bound the problem, one can consider two disks, Di and Dj, that are centered at the current velocity of each agent and of known radius, such as τaimax representing the velocities that can be achieved within the time horizon. For each linearization option, l∈{1,2,3}, of the collision-free relative velocities 2\VOijτ and for each value of bi|j defining the partition, the cooperative measure Y(l,bi|j) is given by: (1) Y(l) which can favor a certain avoidance topology, for example to encode preferences for avoidance to the right; and (2) Y(i,l,bi|j) which is a function of the utility of the collision-free velocities within the partition Fi|j of agent i (analogously for agent j). One can then define each agent's term by:
with Fi|j and Fj|i defined by Equation (60). Then, two alternative cooperation measures are proposed in the form of a joint value and a min-max value. The joint value is given by:
Y1(l,bi|j)=Y(l)+Y(i,l,bi|j)+(j,l,bi|j) Eq. (64)
The min-max value is given by:
Y2(l,bi|j)=Y(l)+min(Y(i,l,bi|j),Y(j,l,bi|j)) Eq. (65)
Note that, for each agent and fixed l, Y(i,l,bi|j) is a monotonically increasing/decreasing function of bi|j taking values from 0 to 1.
With regard to an optimal partition and linearization, for a cooperative measure Yr(l,bi|j), the optimal partition and linearization are then given by:
obtained by first computing the value Yr(l,bi|j) for each l∈{1,2,3} and then selecting the maximum among the three. The reciprocal partition discussed earlier is obtained, for λ=0.5, by employing the cooperation measure:
Yr(l,bi|j)=max(nijl·vi−bi|j−nijl·vi−(bij−bi|j)) Eq. (67)
The following discussion teaches another exemplary algorithm or method for performing local motion planning (e.g., as may be implemented with the shared control system 100 of
For a first optimization technique (“Optimization 1” or distributed optimization), each agent, i, independently solves an optimization where its optimal reference velocity, u*i, is computed as follows, with the position pj and velocity vj of neighboring agents being known:
For a second optimization technique or algorithm (“Optimization 2” or centralized optimization), a single optimization is solved where the optimal reference velocities of all vehicles, u*1:m=[u*1, . . . , u*m], are jointly computed as follows:
Note that in the centralized optimization the partitions, Fi|j, for distributed avoidance are not required or utilized.
In the following, four algorithms are described to compute the optimal reference velocities. First, a distributed, convex optimization may be used. Optimization 1 is solved with quadratic cost C(ui) and linearized constraints. Computational complexity is very low and scalability is very good, but the solution space is greatly reduced due to the linearization of the constraints and the partition of the reference velocity space. Second, a centralized, convex optimization may be used to compute the optimal reference velocities. In this case, Optimization 2 is solved with quadratic cost, C(ui:m), and linearized constraints. Computation complexity is low and scalability is relatively good, but the solution space is partially reduced due to the linearization of the constraints. Third, a centralized, non-convex optimization may be utilized. With this algorithm, Optimization 2 is solved with quadratic cost, C(ui:m), and non-convex constraints. Computational complexity is high and scalability is poor, but the complete solution space is explored. Fourth, a distributed, non-convex search may be performed within the convex region to compute the optimal reference velocities. With the use of this algorithm, Optimization 1 is solved with quadratic cost, C(ui:m), or arbitrary utilities φi(ui), linearized inter-agent collision avoidance constraints, non-convex motion constraints, and static obstacle avoidance constraints. Computational complexity is low and stability is good, but the solution space is reduced due to the linearization of the collision-avoidance constraints and the partition of the reference velocity space to guarantee safety in the distributed case.
These algorithms provide either collision-free reference inputs that respect all the dynamic constraints (u*i if distributed or u*1:m if centralized) or an infeasible flag. If the optimization is infeasible, the time horizon might be decreased or a different linearization might be selected. If the problem remains infeasible, agent i (distributed) or agents 1:m (centralized) decelerate on their last feasible local trajectory (obtained at time tƒ) following the time remap given by:
γ(t)=−tƒ2/(2τ)+(1+tƒ/τ)t−t2/(2τ),t∈[tƒ,tƒ+τ],γ(t)=tƒ+τ/2,t>tƒ+τ Eq. (68)
The re-parameterization of the trajectories guarantees that the vehicles stop within the time horizon of the local planner, unless the optimization becomes feasible in a later time step.
Turning first to distributed convex optimization, the optimization cost, C(ui) is defined by a quadratic function given by two terms. The first one is a regularizing term penalizing changes in velocity, and the second one minimizes the distance to the preferred velocity. The optimization cost may be determined as:
C(ui):=Ko∥ui−vi∥2+∥ui−ūi∥2 Eq. (69)
where Ko is a design constant.
Typically, pedestrians prefer to maintain a constant velocity in order to minimize energy. Hence, preference was given to turning instead of changing speed. Furthermore, penalizing changes in speed leads to a reduction of deadlock situations. This idea can be formalized as an elliptical cost, higher in the direction parallel to ūi and lower perpendicular to it. One can then let Λ<0 represent the relative weight and let γi represent the orientation of ūi. Then, the relative weighting is defined by:
And the optimization cost is redefined as:
C(ui):=Ko∥ui−vi∥2+(ui−ūi)TDiTLDi(ui−ūi) Eq. (70)
For each neighboring agent j, a collision-avoidance constraint is computed, linearized, and partitioned (ui∈Fi|j).
The constraints for avoidance of static obstacles are also linearized, and the set, Ri, defining the motion constraints is approximated by a convex polygon (formed by a union of linear constraints). If Ri is given by two convex regions, only one is selected such as either the closest one to the previous reference velocity or the closest one to the preferred reference velocity. Note that at high speeds, Ri is formed by a single region. The set of all linear constraints may be denoted by Clin. Then, Optimization 1 may be formulated as a quadratic program with linear constraints as:
Turning now to centralized convex optimization, the optimization cost, C(u1:m), is defined by a quadratic function given by a weighted sum of the cost functions, C(ui), of each agent as:
C(u1:m):=Σi=1mωiC(ui) Eq. (72)
where ωi represents the weight of each individual agent in the avoidance effort and can be used to define characteristics such as shy versus aggressive behavior.
For each neighboring pair of neighboring agents, i and j, a collision-avoidance constraint is computed and linearized (e.g., as described above). The set of linear constraints for inter-agent collision avoidance (one for each pair of neighboring agents) is denoted by Crel. The constraints for avoidance of static obstacles are also linearized and the sets Ri are approximated by a convex polygon. The set of these linear constraints for agent i is denoted by Ci,abs. Then, Optimization 2 can be formulated as a quadratic program with linear constraints as:
Turning now to centralized non-convex optimization, to coin the problem as a convex optimization, the inter-agent collision avoidance constraints were linearized, which led to a sub-optimal solution due to the reduction of the solution space (an avoidance topology was fixed). To obtain the global optimum, a non-convex problem should be solved. The solution may be obtained via sampling, which may use a large amount of samples in this high dimensional space with many constraints. Instead, the inventors propose that one of the two following algorithms may be utilized to provide shared control of vehicles.
First, one may utilize a mixed integer quadratic program. The optimization that is formulated in this way uses a program where three binary variables are added for each inter-agent collision avoidance constraint and that specifies which one of the three linear constraints is active. Another two binary variables are added for each Ri set that needs to be approximated by two convex regions. One can then let b be the vector of all binary variables, where each binary variable, β, is zero if the constraint is active (to be satisfied) and one otherwise.
The optimization cost of Equation (72) is kept, with an optional modification. Since an avoidance topology is not defined beforehand, a side preference for the collision avoidance can now be added by appropriately selecting a penalty for the binary variables. The three binary variables associated with each VO constraint, s∈Crel, are denoted by βsl, l∈[1,3]. For example, the rule “prefer to avoid on the right” can be added by penalizing βsl=1 with a weight w. The vector of linear cost with respect to the binary variables is then given by:
where I2 is the set of agents whose set Ri is approximated by two convex regions, I1=[1,m]\I2, and |X| the cardinality of set X. The set of linear constraints for static obstacle avoidance is denoted by Ci,obst and the set of linear constraints defining the polygonal approximation of Ri by Ci,dyn, and the sets verify Ci,abs=Ci,obst ∪ Ci,dyn.
Optimization 2, in this case, can then be formulated as a quadratic program with linear constraints as:
where M>>0 is a large enough constant and extra constraints have been added to impose that only one linear constraint is active for each original non-convex constraint. This optimization can be solved via branch-and bound, where the maximum number of explored nodes defines the ratio between optimality and computational time.
To reduce the computational time of the algorithm, an initial point may be specified. The solution of the convex optimization discussed herein may be well suited for this purpose since it typically returns a feasible (although not optimal) solution with very low computational time. Optimality of the solution increases with time as more nodes are explored. The algorithm can exit at any time to cope with run-time constraints, and, hence, this method features anytime properties.
Second, centralized, non-convex optimization may be formulated as message passing optimization. For example, this non-convex optimization can be solved using an improved version of the alternating direction method of multipliers (ADMM) algorithm known to those skilled in the art. This approach was naturally parallelizable and allowed for incorporating different cost functional with only minor adjustments, and binary constraints were no longer required and/or used. None the less, convergence to the global optimum was not guaranteed.
Turning now to distributed non-convex search within a convex region, convex (linear) and non-convex (grid-based) constraints are combined to explore a larger solution space while keeping the computational cost low. A convex cost, C(ui), is given by Eq. 70. Additionally, a non-convex cost can be defined, for example, by {tilde over (C)}(ui)=C(ui)+χi(ui). The set of convex constraints is formed by the inter-agent linearized collision avoidance constraints (i.e., set Fi|j), and the bounding box of the set Ri is denoted by C. For agent i, the set of non-convex constraints is formed by the static obstacle collision avoidance constraint and the set Ri accounting for the motion constraints and is denoted by {tilde over (C)}i. Both non-convex constraints are given by grid representations of identical resolution.
To efficiently find a solution, the optimization is divided in two parts. First, a convex subproblem is solved resulting in uic, and, second, this is followed by a search within the grid-based constraints restricted to the convex area defined by the linear constraints. The quadratic cost defined in Equation (70) is maintained, and the algorithm proceeds as follows:
In this algorithm, function feasibleDynamics(ui), checks in a precomputed grid if the tracking error is below ∈i, given the initial state of the vehicle (if ui∈Ri then return ‘true’; else ‘false’). Function feasibleMap(ui) checks if ui leads to a trajectory in collision with static obstacles given by the grid map Or
An example of implementation of the distributed non-convex search within a convex region algorithm is shown in
Now, it may be useful to turn to an analysis of the above-described algorithms providing collision avoidance guarantees. With regard to radius enlargement, variable maximum tracking error, ∈i, and associated radius enlargement is desirable for feasibility. To guarantee feasibility of the computation of the inter-agent collision avoidance constraint, VOi|jτ, ri+rj+∈i+∈j≦d(pi,pj) should be satisfied, i.e., the extended radii of the robots should not be in collision. This might happen for fixed ∈i but is assuredly avoided by having ∈i and ∈j stepwise decreasing when robots are close to each other.
The planned local trajectories are collision free up to time τ, with the assumption that other vehicles follow the same algorithm or maintain a constant velocity. Passive safety is guaranteed if the assumptions hold. In the case where optimization is feasible, collision-free motion is guaranteed for the local trajectory up to time τ (optimal reference trajectory is collision free for agents of radii enlarged by ∈ and agents stay within ∈ of it) with the assumption that all interacting agents either continue with their previous velocity or compute a new one following the same algorithm and assumptions. One can let pi(t) denote the position of agent i at time t≧tk and recall {tilde over (t)}=t−tk, then:
∥pi(t)−pj(t)∥=∥ƒ(zik,ui{tilde over (t)})−ƒ(zjk,uj,{tilde over (t)})∥≧∥(pik+ui{tilde over (t)})−(pjk+uj{tilde over (t)})∥−∈i−∈j≧ri+∈i+rj+∈j−∈i−∈j=ri+rj Eq. (75)
where the first inequality holds from the triangular inequality and Constraint 1 (ui∈Ri and uj∈Rj). The second inequality holds from Constraint 3 (ui−uj ∉ VOTijτ and, if distributed ui∈Fi|j and uj∈Fj|i). To guarantee feasibility of the velocity obstacles computation and, thus, completion of the method, the discussion regarding radius enlargement should also hold. Due to the time-discrete implementation, after each time step, a new collision-free trajectory is computed. Therefore, the trajectories of all agents, given as concatenation of segments, are collision free.
If the optimization is infeasible, no collision-free solution exists that respects all constraints. If the time horizon is longer than the required time to stop, safety is preserved if all involved vehicles drive their last feasible trajectory with a time re-parameterization to reach stop before a collision arises:
{dot over (γ)}(tƒ+τ)=(1+tƒ/τ)−(tƒ+τ)/τ=0 Eq. (76)
This time re-parameterization implies a slowdown of the agent, which in turn may render the optimization feasible in a later time step. Since this computation is performed at a high frequency, each individual agent is able to adapt to changing situations. With regard to dynamic obstacles, the feasibility of the optimization indicates if the dynamic obstacles can be avoided (assuming they adhere to their predicted reference velocity) or a collision is imminent. A fast control loop is able to handle small deviations in the prediction.
Although the invention has been described and illustrated with a certain degree of particularity, it is understood that the present disclosure has been made only by way of example, and that numerous changes in the combination and arrangement of parts can be resorted to by those skilled in the art without departing from the spirit and scope of the invention, as hereinafter claimed. For example, in addition to the test cases described herein, a test case may involve arbitrary robots moving among people such as in a theme park or on a busy street (e.g., either with passenger providing input (such as a Segway-type vehicle or a wheelchair) or with autonomous control (e.g., a robotic character, a cleaning robot, a surveillance device, and so on)).
It will be understood that the methods taught herein are also useful for autonomous driving and not only semi-autonomous. For example, if no input from a driver/guest is available, the vehicle can be controlled to follow a desired path or navigate toward a predefined goal by itself. Thus, the driver input is optional. One can imagine a case where a driver/guest sits in/on the vehicle but does not command anything so as to just go along for a ride provided by the autonomously controlled vehicle. This one more reason that the guidance trajectories can be beneficial.
Several means are available to implement the systems and methods discussed in this specification. These means include, but are not limited to, digital computer systems, microprocessors, application-specific integrated circuits (ASIC), general purpose computers, programmable controllers and field programmable gate arrays (FPGAs), all of which may be generically referred to herein as “processors” (such as represented by processors 112 and 122 in the system 100 of
This application claims the benefit of U.S. Provisional Application No. 61/878,308, filed Sep. 16, 2013, which is incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
5006988 | Borenstein | Apr 1991 | A |
20090079839 | Fischer | Mar 2009 | A1 |
20130325244 | Wang | Dec 2013 | A1 |
Number | Date | Country | |
---|---|---|---|
20150284010 A1 | Oct 2015 | US |
Number | Date | Country | |
---|---|---|---|
61878308 | Sep 2013 | US |