The invention relates generally to a system and method for motion control of a machine, and more particularly to a system and a method for data-driven reference generation for constraint satisfaction in machines with closed-loop control systems and uncertain dynamics or parameters.
Motion control systems are used in number of positioning applications, e.g., single-axis positioning, and multiple-axis positioning. For example, a simple single-axis positioning motion control system generally includes sensors, controller, amplifier, and actuator motor. The actuator follows a predetermined trajectory subject to input and output constraints, e.g., dynamics, acceleration, velocity.
Reference governors (RGs) are add-on schemes for enforcing pointwise-in-time input and output constraints by modifying the reference input to closed-loop feedback control systems. For ensuring repeated constraint satisfaction, RGs leverage particular invariant sets known as constraint admissible sets. Due to their ability to enforce constraints without requiring a full re-design, and with a relatively low computational burden, RGs have proven useful in multiple application domains, including vehicles, aerospace, manufacturing, and energy systems; see, for example, the survey for an extensive list of contributions. When the parameters of the underlying systems are uncertain, robust formulations of RG may be applied based on robust set invariance. On the other hand, such robust formulations are based on the assumption that the uncertain parameters are constantly varying, and as such, they are overly conservative in the cases where the unknown parameters are constant or slowly-varying.
Despite being relatively common in real-world applications, there are relatively few designs for RGs in this case. To the best of our knowledge, the conventional load governor approach proposed is the only parameter-adaptive reference governor (PARG) formulation in the literature. A major reason for the dearth of PARG frameworks is that the computation of robust constraint admissible sets under parameter uncertainty is extremely difficult due to complex geometries of these sets and inherent non-convexity, even for linear systems. It is common to circumvent this issue by local linearization, but such an approximation tends to still be conservative. In addition, several methods involve simulating multiple trajectories on-line for various references and unknown parameter values, and checking constraint satisfaction, all of which leads to an increase in computational load. Recently proposed sampling-driven machine learning approaches may provide computationally tractable and efficient frameworks for estimating these robust invariant sets on-line by offloading simulation and trajectory generation off-line.
For linear systems with unknown parameters, Kalman-type approaches have been widely reported to be effective. For nonlinear systems, these approaches are generally intractable, and particle filtering provides an effective alternative, where the state space is estimated by predicting state trajectories (particles) and weighting them according to the likelihood of the measurements. A general framework for parameter estimation with particle filters can be also be found.
Since recursive parameter estimators result in time-varying confidence intervals, determining the robust parameter invariant sets is a dynamic learning problem. Specifically, for a fixed set of samples, the time-varying confidence intervals result in time-varying labeled/target sets for the learner. Hence, the learner has to be updated on-line in order to provide improved estimates of the robust constraint admissible sets based on the time-varying confidence intervals.
Consequently, there is a requirement for a PARG framework that is capable of enforcing constraints in parameter-uncertain closed-loop systems without modifying the control algorithm directly.
Some embodiments of the present disclosure are based on recognition that a PARG framework is capable of enforcing constraints in parameter-uncertain closed-loop systems without modifying the control algorithm directly. As a specific realization of this PARG framework, we consider two components: (i) a recursive parameter estimator for generating confidence intervals around a point estimate of the unknown parameter; and (ii) a supervised learning algorithm that dynamically learns constraint admissible sets by combining off-line data based on sampling, and on-line data provided by the parameter estimator. An advantage of using interval-based estimates rather than point estimates of the parameter, is that the intervals can exhibit certain properties such as monotonicity that are crucial to ensure performance guarantees on the PARG. Another advantage of our proposed method is that learners with good approximation properties, such as with universal kernels, can be employed to efficiently represent highly non-convex robust constraint admissible sets for black-box systems using simulations and systematic sampling; this is extremely challenging via analytical methods.
Some embodiments provide sampling-based approaches to generate features and labels off-line, iteratively update the robust constraint admissible sets on-line as more data becomes available and the confidence in the parameter estimates improves. In order to construct robust constraint admissible sets, one requires not only the point estimates of the parameters themselves, but also regions in the parameter space within which the true parameter lies with high confidence, ascertained by a probability. To this end, some embodiments provide the use of recursive parameter estimators that can demonstrate excellent performance in a wide range of estimation problems.
Some embodiments of the present disclosure are based on recognition that the RGs provide an effective method for ensuring safety via constraint enforcement in closed-loop control systems. When the parameters of the underlying systems are unknown, but constant or slowly-varying, robust formulations of RGs that consider only the worst-case effect may be overly conservative and exhibit poor performance. This disclosure discloses a PARG architecture that is capable of generating safe trajectories in spite of parameter uncertainties without being as conservative as robust RGs. Some embodiments leverage on-line data to inform algorithms for robust parameter estimation. Subsequently, confidence bounds around parameter estimates are fed to supervised machine learners for approximating robust constraint admissible sets leveraged by the PARG. While initially, due to the absence of on-line data, the PARG may be as conservative as a robust RG, as more data is gathered and the confidence bounds become tighter, and conservativeness reduces.
Further, according to some embodiments of the present disclosure, a controller is provided for operating a system under admissible states. The controller may include an interface configured to connect the system storing a set of measured system states, a set of reference inputs and a set of system parameters in a storage arranged inside or outside the system; a memory storing measured system states, admissible reference inputs and admissible parameter sets and computer-executable programs including a parameter estimator and an adaptive reference governor (ARG); a processor, in connection with the memory, configured to perform the ARG and the parameter estimator, wherein the parameter estimator extracts a pair of a reference input and the system state and compute a system parameter estimate based on the reference input and system state, wherein the ARG is configured to update the reference input and compute a parameter-robust constraint admissible set based on the updated reference input and the system states, wherein the ARG generates and transmits a reference input to the system based on the parameter-robust constraint admissible set.
Some embodiments are based on realization that shows a control system for controlling a vehicle moving on a road, e.g., controlling the vehicle according to a reference trajectory of desired vehicle positions and velocities along the road. This embodiment is based on the recognition that the stiffness determined for normal driving can be used to select from a memory one of the multiple parameters of friction functions. The method can be implemented using a processor of the vehicle.
To that end, the control system may include a controller of claim 1 including an adaptive reference governor (ARG), wherein the ARG generates the control commands to vehicle controllers of the vehicle. In this case, the ARG may include a sensing system configured to measure inertial components of the vehicle; a signal conditioner configured to output an estimate of a longitudinal acceleration; and a state-of-stiffness estimator configured for determining parameters of a state of a stiffness.
Another embodiment discloses a motion control system for controlling a motor. The motion control system may include one or more combination of motion controllers configured to transmit control signals to a motor amplifier; an adaptive reference governor (ARG)-trajectory generator, wherein the ARG-trajectory generator includes a controller of claim 1 including an adaptive reference governor (ARG), wherein the ARG-trajectory generator generates and transmits control commands to the one or more combination of the motion controllers.
Yet another embodiment discloses a method for controlling a system. The method may include acquiring system states, reference inputs and system parameters from a set of measured system states, a set of admissible reference inputs and a set of admissible system parameters, wherein the set of measured system states, the set of admissible reference inputs and the set of admissible system parameters have been collected during past operations and current operations of the system; providing a system state estimate, a reference input, a system parameter estimate and a desired reference to an adaptive reference governor (ARG); transmitting a reference input generated by the ARG to a system; extracting and providing a pair of the reference input and the system state estimate to a parameter estimator; computing a boundary interval of parameters using the parameter estimator; updating the reference input using the ARG; and computing a parameter-robust constraint admissible set using the ARG based on the updated reference input and system states.
Further, another embodiment discloses a non-transitory computer readable storage medium embodied thereon a program executable by a processor for performing a method, the method includes acquiring system states, reference inputs and system parameters from a set of measured system states, a set of admissible reference inputs and a set of admissible system parameters, wherein the set of measured system states, the set of admissible reference inputs and the set of admissible system parameters have been collected during past operations and current operations of the system; providing a system state estimate, a reference input, a system parameter estimate and a desired reference to an adaptive reference governor (ARG); transmitting a reference input generated by the ARG to a system; extracting and providing a pair of the reference input and the system state estimate to a parameter estimator; computing a boundary interval of parameters using the parameter estimator; updating the reference input using the ARG; and computing a parameter-robust constraint admissible set using the ARG based on the updated reference input and system states.
The presently disclosed embodiments will be further explained with reference to the attached drawings. The drawings shown are not necessarily to scale, with emphasis instead generally being placed upon illustrating the principles of the presently disclosed embodiments.
While the above-identified drawings set forth presently disclosed embodiments, other embodiments are also contemplated, as noted in the discussion. This disclosure presents illustrative embodiments by way of representation and not limitation. Numerous other modifications and embodiments can be devised by those skilled in the art which fall within the scope and spirit of the principles of the presently disclosed embodiments.
The following description provides exemplary embodiments only, and is not intended to limit the scope, applicability, or configuration of the disclosure. Rather, the following description of the exemplary embodiments will provide those skilled in the art with an enabling description for implementing one or more exemplary embodiments. Contemplated are various changes that may be made in the function and arrangement of elements without departing from the spirit and scope of the subject matter disclosed as set forth in the appended claims.
Specific details are given in the following description to provide a thorough understanding of the embodiments. However, understood by one of ordinary skill in the art can be that the embodiments may be practiced without these specific details. For example, systems, processes, and other elements in the subject matter disclosed may be shown as components in block diagram form in order not to obscure the embodiments in unnecessary detail. In other instances, well-known processes, structures, and techniques may be shown without unnecessary detail in order to avoid obscuring the embodiments. Further, like reference numbers and designations in the various drawings indicated like elements.
Also, individual embodiments may be described as a process, which is depicted as a flowchart, a flow diagram, a data flow diagram, a structure diagram, or a block diagram. Although a flowchart may describe the operations as a sequential process, many of the operations can be performed in parallel or concurrently. In addition, the order of the operations may be re-arranged. A process may be terminated when its operations are completed, but may have additional steps not discussed or included in a figure. Furthermore, not all operations in any particularly described process may occur in all embodiments. A process may correspond to a method, a function, a procedure, a subroutine, a subprogram, etc. When a process corresponds to a function, the function's termination can correspond to a return of the function to the calling function or the main function.
Furthermore, embodiments of the subject matter disclosed may be implemented, at least in part, either manually or automatically. Manual or automatic implementations may be executed, or at least assisted, through the use of machines, hardware, software, firmware, middleware, microcode, hardware description languages, or any combination thereof. When implemented in software, firmware, middleware or microcode, the program code or code segments to perform the necessary tasks may be stored in a machine-readable medium. A processor(s) may perform the necessary tasks.
The operation of the machine 201 can be modeled by a set of equations representing changes of the output over time as functions of current and previous inputs and previous outputs. During the operation, the machine 201 can be defined by a state of the machine. The state of the machine is any set of information, in general time varying, that together with the model and future inputs, can define future motion. For example, the state of the machine 201 can include an appropriate subset of current and past inputs and outputs.
The legacy tracking controller 205 can be implemented in hardware or as a software program executed in a processor (hardware processor), e.g., a microprocessor, which at fixed or variable control period sampling intervals receives the estimated state 203 of the machine 201 and the adjusted reference input 107 and determines, using this information, the inputs to the actuator 207 used for operating the machine. The sensor 203, at fixed or variable period sampling intervals, receives the outputs of the machine 201 and sends this output data 209 to the ARG 103 via the interface 105. Examples of the interface 105 includes network control interface configured to accept data transmitted thorough wired or wireless communication channel. These outputs can be one or a combination of measurements, states, and references of the legacy control system 101 at the current time or stored in memory at some previous time instants.
In some embodiments, the ARG 103 has the following modules: a parameter estimation module 231, a constraint-admissible invariant set (CAIS) learning module 221, and a reference generation module 211.
The parameter estimator 231 estimates the parametric uncertainty in the machine using the output data 209 or some transformation of this data. The parameter estimator 231 usually consists of estimation algorithms deployed on a processor 235 that uses stored data in memory 233, including stored previous state estimates, parameter estimates, covariance matrices, and so on. Examples of parameter estimator 231 can include: linear and nonlinear observers or Kalman filters with state augmentation, and recursive least squares filters when full state feedback is available. Estimated parameters could include a mass of a vehicle, an inertia of a vehicle, a tire friction coefficient of at least one tire mounted on a vehicle, or viscous damping coefficients in servomotors.
The machine 201 and control system is designed to satisfy constraints 227 that arise from safety considerations, physical limitations, and/or specifications, limiting the range where the outputs, the inputs, and also possibly the states of the machine are allowed to operate. The constraints 227 are defined in continuous space. For example, state constraints are defined in continuous state space of the machine and reference input constraints are defined in continuous reference input space. The CAIS learning module 221 takes outputs of the parameter estimator 231, along with features and labels stored in the memory 223 to generate constraint-admissible sets using the CPU/GPU or combination thereof in the processor 225 for use in the reference generator module 211.
Given a desired reference input 109 such as a desired velocity to be tracked in cruise control, or a desired torque in a servomotor, the reference generator unit 211 adjusts this desired reference input r to an adjusted reference input v using the CAIS generated by the CAIS learner 221. The adjusted reference input belongs to a set of admissible references 217, such as limits on velocities in highway driving between 50 and 80 miles per hour. The reference generator accesses its memory 213 to find the adjusted reference in the previous time instant and updates that to compute the new adjusted reference using computations in the processor 215. The adjusted reference input is connected with the legacy controlled machine via the interface 107 and is guaranteed to enforce constraints 227 without altering the structure of the legacy control system 101.
According to some embodiments, the output of the legacy control system 103 must satisfy constraints described by the set for each instant of time, that is yt ∈⊂n
In the unconstrained setting, that is =n
In one embodiment, the machine 201 with the uncertain dynamics may be given by
x
t+1
=f(xt,vt)+θTg(xt,vt),yt=xt (1)
An efficient way of determining such confidence intervals for machine with this class of uncertain dynamics is by using Kalman filters and adaptive particle filters. We do this by reformulating the dynamics (1) in a probabilistic framework where θ is treated as an unknown disturbance with stochastic properties. In the current embodiment, since xt is known, we exploit the linearity of the system (1) with respect to θ and use a Kalman filter for estimating θ and its confidence interval Θt. Note that the approach can be extended to the case when the state vector is not completely known and has to be estimated together with the parameter. Specifically, we reformulate (1) as
θt=θt−1+wt, (2)
t
=g(xt−1,vt−1)Tθt+et, (3)
where
The reason to address the parameter estimation problem in a Bayesian framework is that even if the state xt is known, for instance, from measurements, such knowledge is typically imperfect due to inherent noise in the sensors measuring the state, even though we do not model the uncertainties explicitly in (1) for simplicity. Furthermore, a Bayesian framework provides a systematic approach to work with confidence intervals in recursive estimators. In a Bayesian context, we update the uncertainty θt+1:p(θt) and
p(θt|
using the measurement history
where P(
{circumflex over (Θ)}tj=[{circumflex over (θ)}tj−βPtjj,{circumflex over (θ)}tj+βPtjj]
for each element j in the parameter vector θt and β>0.
In order to provide theoretical guarantees on the PARG, some embodiments ensure that the confidence intervals do not expand with more available data, that is, {circumflex over (Θ)}t+1⊆{circumflex over (Θ)}t. While this is a natural consequence of applying Kalman filters to linear-in-parameter systems such as (1), in general, exploration using nonlinear filters such as in particle filters could result in a violation of this condition. In such scenarios, one embodiment explicitly enforces contraction of confidence intervals. Specifically, if the filter computes an updated confidence interval {tilde over (Θ)}t+1, we set
This forces non-expansion of {circumflex over (Θ)}t for all t≥0.
If the state vector is available at every t, one can use a linear estimator to provide the confidence intervals, and, therefore, a more general approach using Bayesian recursions is not needed. However, if the state is unavailable, the updates (5-6) can be employed to generate joint estimates of states and parameters via nonlinear recursive estimators.
A tool used to satisfy constraints despite parametric uncertainty is a parameter-robust constraint admissible invariant set, referred to here as CAIS for brevity. More formally, let H={(x, v)∈X×V: h(x, v)∈Y} denote the set of state and reference inputs for which the output y satisfies the constraints. We present the following definition for parameter-robust constraint admissible sets: The set O({circumflex over (Θ)})⊂H is a parameter-robust constraint admissible set for the closed-loop system if, for every initial condition (x,v)∈O({circumflex over (Θ)}) when x0=x and vt=v for all t≥0, (xt,vt)∈H for every θ∈{circumflex over (Θ)} and for all t>0. The set O({circumflex over (Θ)}) is invariant. In order to generate estimates of parameter-robust constraint admissible sets, we will adopt an off-line sampling-driven approach to collect data for learning the sets on-line as operational data becomes available.
An estimate of a parameter-robust constraint admissible set can subsequently be used to evaluate the control law
by solving for
at each time instant t. Note that Vε({circumflex over (Θ)}t) denotes the set of references v such that a ball of radius ε>0 centered at the corresponding steady state xss(v,θ) and v lies inside O({circumflex over (Θ)}t).
Some embodiments simulate trajectories of the legacy system off-line, from different initial states sampled from , reference inputs sampled from , and parameters within Θ. At the end of each off-line simulation, if an initial condition xi ∈ tracks a desired reference input vi ∈ without violating the constraint (3) at any time in the simulation, for a parameter θi sampled within Θ, then the feature (xi, vi) is labeled ‘+1’ to indicate it resides within the parameter-robust constraint admissible set (θi). Contrarily, if the constraint is violated at any time point in the simulation, the feature is labeled ‘−1’ to indicate it resides outside O(θi). This sets up a binary classification problem which can be solved via supervised machine learning.
At the end of this off-line data generation procedure, the method 500 includes a fixed collection of initial {xi}i=1N
from which a labeled set will be generated on-line for robust invariant set estimation using supervised learning. Note that every element in li is either +1 or −1 by (9).
In order to solve the reference generation problem, some embodiments require an estimate of the set O({circumflex over (Θ)}t), which can be obtained using machine learning.
In some embodiments, the learning problem is set up as follows. At time instant t, consider a {circumflex over (Θ)}t provided by the parameter estimator. Then, for each vj∈{tilde over (V)}t described in (10), and each xi ∈{xi}i=1N
is the index set of parameters contained in the current confidence interval {circumflex over (Θ)}t. Taking the minimum ensures that the estimated set is robust to all parameters within {circumflex over (Θ)}t. That is, if even one θt is infeasible for the particular vj and xi, then xi does not belong to the robust parameter invariant set corresponding to {circumflex over (Θ)}t.
With the training data D:={(xi, vj), zi,j}, we construct classifiers ψj, where j=1, . . . , |{tilde over (V)}t|. For each vj, a classifier is trained on features {xi} and their corresponding labels {zi,j}. These classifiers need to represent inner approximations of the robust parameter invariant sets; to this end, one may select sub-level sets of the decision boundary ψk=0 of the classifier until no infeasible sample is contained in the interior of the sub-level set.
The figure provides some learning algorithms 521 that can compute the set O({circumflex over (Θ)}t) up to arbitrary accuracy with careful selection of kernels/activation functions. These include bi-classifiers such as support vector machines 531, neural networks 534, extreme learning machines 535, or probabilistic classifiers such as Bayesian classifiers 532 or Gaussian process classifiers 533.
As an exemplar classification algorithm, consider a 2-norm soft margin support vector machine (SVM) classifier 531 trained on a dataset D by solving the optimization problem
Here, c>0 is a regularization constant, w quantifies the margin of separation, b is a bias term, ξ are slack variables, and φ is a feature map into a reproducing kernel Hilbert space for a kernel function K. The decision function of the SVM is given by
ψj(x)=sign((wjå)Tφ(x)+bjå),
where the inner product (wjå)T φ(x) can be expressed efficiently by the kernel function K. Since the classifiers may not be a true inner approximation of a robust parameter invariant set without an infinite number of features, a heuristic that can be employed to ensure constraint satisfaction is by choosing a small ε>0 and checking that ψj
In some embodiments, the learner can be updated on-line or incrementally, rather than solving the entire learning problem (11) at each time step.
According to one embodiment, a solution to (8) is computed efficiently using machine learning and gridding V. Placing a grid on V, along with the constraints, imply that the solution to (8) is contained within the sub-grid of V defined by
V
t:=[min{rt,k,vt,k},max{rt,k,vt,k}]. (10)
Then, we can recast the problem (8) as a grid search,
Solving the grid search then becomes identical to selecting the node vj on the grid {tilde over (V)}t that minimizes the cost while ensuring that ψj(xt)>0; that is, the current state is predicted by the j-th classifier to belong to the robust parameter invariant set induced by {circumflex over (Θ)}t.
It is noted that since the system is nonlinear, the invariant sets are non-convex and require local radial basis function kernels to represent their geometries. In subplot [A], we see that, for a fixed vt, the sets expand with time; each updated state shares the same colored dot as the corresponding invariant set. The expansions occur because the intervals {circumflex over (Θ)}t contract and so the invariant sets can be less conservative for the same vt. When vt changes, as in subplot [B], the shapes of the sets alter according to how close they are to the constraints. However, as expected, the states always lie within the invariant sets, which is why constraints are never violated.
Some embodiments of the disclosure use the ARG 103 for enforcing constraint when the coefficient between tire and road is uncertain.
During normal driving 825, in which case the slip is small, the friction function includes an initial slope defining a stiffness of the tire. As used herein, the normal driving is defined as regular driving, e.g., everyday driving in urban areas, where the vehicle avoids emergency braking and evasive steering maneuvers. The normal driving can be contrasted with aggressive driving when extensive force is applied on the wheels of the vehicle. As used herein, the aggressive driving is defined as driving where braking/acceleration and/or steering torque is large enough such that the vehicle operates close to the tire adhesion limits of material of the tires. For example, while the validity of the linear region of the tire force function varies between different surfaces, approximating the tire force function with a linear function is valid for accelerations up to roughly 4 m/s2 on asphalt surfaces, i.e., approximately 40% of the total available force on asphalt. As an example, production-type electronic stability control systems (ESC) measure a deviation from a predicted measurement, using a steering wheel angle and a longitudinal velocity, to a vehicle model using the tire force function as a linear approximation. When the deviation is more than a threshold, safety braking is activated. Thus, a measure of normal driving is driving well below these activation thresholds. In other words, if the driving is not aggressive, the driving is normal.
During the aggressive driving 835 the wheel slips more, which causes a larger force/friction variation. This variation is highly non-linear. For example, regardless of the extent of the force and type of the road, there is a maximum friction for almost all surfaces, which occurs at a point from which the force decreases when applying more braking/steering torque. After this point the increase in wheel slip results in smaller tire forces. For large wheel slip beyond the maximum force it becomes more difficult to maintain a desired wheel slip, since the dynamics becomes unstable beyond the point of maximum force. Therefore, vehicles are often controlled such that the wheel slip stays small enough such that the peak is not exceeded.
Hence, the friction function includes a linear part roughly corresponding to normal driving and a non-linear part corresponding to aggressive driving. During the normal driving, the friction function changes slowly and predictably. In addition, usually, the vehicle is controlled with force consistent with the normal driving. Such a control is safer and gives time and sufficient data allowing learning the linear part of the friction function during the control of the vehicle and to use the learned part of the friction function for vehicle control. For example, some embodiments use a filter configured to determine the current state of the stiffness of the tire by comparing a current state of the vehicle estimated using the stiffness of the tire with measurements of the current state of the vehicle. In other words, it is possible and safe to learn linear part of the friction function in real time during the control of the vehicle.
In contrast, the aggressive driving changes the friction function rapidly and non-linearly. Hence, controlling the vehicle using values of the linear part of the friction function can jeopardize accuracy and safety of vehicle control. In addition, non-linear variations of the friction function during the aggressive driving and relatively short time when a vehicle is driven under a specific style of the aggressive driving make the learning of the non-linear part of the friction function impractical. Hence, there is still a need for a method that can rapidly estimate non-linear part of the friction function during a real-time control of the vehicle.
The method 901 determines 910a a current state of stiffness 915a of at least one tire of the vehicle and accesses parameters 909a of multiple friction functions stored in a memory. Each friction function describing a friction between a type of surface of the road and a tire of the vehicle as a function of slippage of the vehicle, the parameters of each friction function include an initial slope of the friction function defining a stiffness of the tire and one or combination of a peak friction, a shape factor and a curvature factor of the friction function.
Next, using the determined stiffness 915a, the method selects 920a a set of parameters 925a corresponding to a particular friction function. Using a model of motion of the vehicle 927a including the selected parameters 925a, the method 901 determines 930a, using an ARG 103, a modified reference command 935a, and submits the reference command 935a to a controller of the vehicle to move the vehicle 940a on the road.
The control system 899 includes a signal conditioner 920 that receives information 290 and produces estimates of the wheel speed for some or all wheels 921. The information 990 can include wheel-speed measurements from ABS, engine torque and rotation speed, and/or brake pressure. The control system 899 can also include a sensing system 930 that measures inertial components of the vehicle, such as rotation rate of the vehicle and acceleration of the vehicle, using an inertial measurement unit (IMU). For example, the IMU can comprise 3-axis accelerometer(s), 3-axis gyroscope(s), and/or magnetometer(s). The IMU can provide velocity, orientation, and/or other position related information to other components of the control system 899. The sensing system 930 can also receive global position information from a global positioning system (GPS) or equivalent.
The control system 899 also includes a state-of-stiffness estimator 940 for determining parameters of the state of the stiffness. In some embodiments, the state-of-stiffness estimator includes a filter that iteratively determines the state of the vehicle and the state of stiffness, from a state of the vehicle and a state of stiffness determined during previous iterations. In some implementations, a state of the vehicle includes velocity and heading rate of the vehicle, but can also include a position, heading, and additional quantities related to the motion of the vehicle.
The state-of-stiffness estimator 940 uses information 931 from the sensing system and wheel-speed estimates 921 from the signal conditioner 920. If the sensing system 930 is equipped with an IMU for measuring the longitudinal acceleration of the vehicle, the measurements from the IMU can be used to determine parameters related to the longitudinal friction of the tire. However, if the sensing system 930 does not possess information about longitudinal acceleration, the signal conditioner 920 can output an estimate 921 of the longitudinal acceleration based on the wheel-speed estimates and other quantities according to other embodiments. Additionally, or alternatively, the state-of-stiffness estimator 940 can determine an estimate of the longitudinal acceleration based on the wheel-speed information 921.
In one embodiment, the states of the vehicle and parameters determining the tire to road interaction are estimated iteratively by combining wheel-speed and IMU information. In another embodiment, the friction-estimation system only includes lateral components. In such a case, the information 921 can include necessary information for the longitudinal motion. The state-of-stiffness estimator 940 can also receive information 961 about the vehicle motion from the vehicle-control units 960. The information can include a state of the vehicle, such as position, heading, velocity, and is received either from hardware or software, connected directly or remotely to the machine.
For example, the state-of-stiffness estimator can output state of stiffness 941 including friction values, tire-stiffness values, certainty levels of the tire stiffness, or combinations thereof. The control system 899 also includes a parameter selector 970 that uses the state of stiffness 941 to determine a set of parameters 971 describing a tire-friction function, where the parameters for multiple tire-friction functions are stored in a memory.
The control system 899 includes vehicle controllers 960 that use the selected parameters 971 to generate control commands to one or multiple actuators of the controlled vehicle. For example, in one embodiment, the parameters are used in a motion model of the vehicle to control the vehicle using a model predictive controller (MPC). The vehicle controllers 960 can include stand-alone components, such as ABS, ESC, or ADAS, or a combination of vehicle controllers that enable autonomous driving features. For example, the selected parameters can output 972 a friction coefficient corresponding to the parameters to be displayed on a display 910 of the vehicle as supervisory components to a driver of the vehicle.
To avoid determining the current friction coefficient and the entire tire friction function, one implementation stores in a database the parameters. In one embodiment, the motion model is modeled as a single-track model with nonlinear
tire force as, where the nominal forces are modeled using the Pacejka tire model as
F
0,i
x=μixFiz sin(Cix arctan(Bix(1−Eix)λi+Eix arctan(Bixλi))),
F
0,i
y=μiyFiz sin(Ciy arctan(Biy(1−Eiy)αi+Eiy arctan(Biyαi))),
where the nominal forces are the forces under pure slip, i.e., when one of the longitudinal slip and lateral slip are zero.
Some embodiments, instead of determining the tire parameters in the Pacejka model, to be used in the tire friction function, use a linear approximation of the tire forces as Fx≈Cxλ, Fy≈Cyα, for the longitudinal and lateral tire force, where the C constants are the stiffness components. Consequently, one embodiment estimates the stiffness component using a stochastic model of the stiffness as a disturbance to the motion model, Cx=C,nx+ΔCx, Cy=C,ny+ΔCy where C,n is the nominal stiffness value, for example, a priori determined on a nominal surface, and ΔCs is a time-varying, unknown part which is estimated according to one embodiment. One implementation estimates the mean value and variance of the tire stiffness.
Additionally or alternatively, one embodiment estimates the tire stiffness and compares to the tire parameters stored in memory according to a linear approximation of the Pacejka model, Fy≈μiyFizCiyBiiαi, which gives that the tire stiffness and parameters should be equal, μiyFizCiyBii=Ciy. However, since measurement and estimation errors, and nonperfect parameters stored in memory, give tire stiffness estimates that deviate from the ones stored in memory, one embodiment uses the estimated variance of the tire stiffness to determine the best fit according to the parameters, by selecting the parameters maximizing the likelihood of the parameters,
Yet another embodiment uses a test statistic to determine whether the estimated stiffness can be regarded as outliers or inliers from the parameters. Consequently, one embodiment choose the parameters θ1 corresponding to the lowest friction surface if T(μi,1yFizCi,1yBi,1y)>xnr(1), where xn2(1) is the Chi-squared distribution with one degree of freedom and some significance level η. Otherwise, the embodiment proceeds in order of increasing peak friction until a parameter set is found.
In some embodiments, constraints are imposed on the motion of the vehicle. For instance, one embodiment the constraints model a maximum allowed deviation from the middle lane of the road or a maximum heading rate of the vehicle, and one embodiment models a constraint as a maximum steering rate of the steering wheel of the vehicle.
Such constraints and the ability of a controller to satisfy such constraints are heavily dependent on the friction coefficient between at least one tire of the vehicle and the road. For instance, having a controller tuned for the friction coefficient corresponding to asphalt, if that controller is used to control a vehicle on snow, constraints such as maximum allowed deviation from the middle lane of the road are likely to be violated.
In some embodiments, the state-of-stiffness estimator 940 and subsequent parameter selector 970 is used to adapt the reference such that exemplified constraints are satisfied.
In one embodiment, the reference r is the velocity profile and steering profile to reach a desired point on the road. In other embodiments, the reference r is a timed path leading to a desired motion on the road. The reference is subject to various constraints, e.g., the velocity should obey speed limits, and the path should not deviate too much from the middle of the desired lane. For example, the steering profile should not except physical limits of the actuators.
In addition, for the reference governor to be able to make sensible decisions, the state-of-stiffness estimator outputs the determined state of stiffness of the vehicle, and the confidence of such stiffness. Referring to Eq. (7), the state-of-stiffness estimator provides the confidence bounds used by the reference governor. In one embodiment the state-of-stiffness estimator ensures contractivity according to other embodiments of the disclosure.
As shown in
In some embodiment, the trajectory generator module 1005 determines the trajectory 1005 by minimizing a cost function subject to the constraints 1004. The cost function is determined based on an energy model 1016 of the system 1000 and a function of a tracking time 1015, as described below. The motion controller can be implemented using a processor 1011.
The control module 1020 determines a control signal 1006 based on the trajectory 107 and feedback signal 1008, and outputs the control signal 1006 to the amplifier 1002 which determines and outputs a current or voltage 1007 to the motor 1003. The motor is mechanically coupled with a load 1030, and drives the load to achieve specified tasks. In one embodiment, the feedback signal 1008 describes a current state of the motion control system. In another embodiment, the feedback signal 1008 describes a motion of the motor 1003. For example, in one variation of this embodiment, the feedback signal 1008 is a position of the motor.
In some embodiment, the constraints 1004 include a dynamic constraint that the motion system has to satisfy; velocity constraints limiting the speed that the motion system can operate; acceleration constraints limiting the acceleration that the motion system can have; and a control constraint limiting the voltage or current that the motor can accept.
In some embodiment, the constraints 1004 include a dynamic constraint that the motion system has to satisfy; velocity constraints, and acceleration constraints.
In some embodiments, the trajectory generation module 1005 determines trajectories according to a dynamic model of motor 1003 and physical constraint 1004. For example, the dynamics of the motor can be defined according to
where x=(x1, x2)=(θ, ω) is a state of the motor state representing a position of the motor position and an angular velocity respectively, u is the control input to the motor, d, c are the viscous and coulomb friction coefficients of the motor, respectively, b is a constant coefficient, a single dot indicates the first derivative, and double dots the second derivate, x2 is a second component of the state of the motor x representing an angular velocity of the motor. The embodiment of the control input to the motor depends on the type of the motor. In one embodiment, the control input to the motor includes a current input into the motor. Additionally, or alternatively the control input can include control signal of a voltage into the motor.
In some embodiment, values of parameter d, b, c, in the motor model of the motion control system, reflecting characteristics such as inertia and friction coefficient, of the load 1030, are typically unknown in advance. The trajectory generator 1005 computes a trajectory 109 according nominal values of these parameters. Because of the mismatches between true parameter values of the nominal values used in 1005, the trajectory 109 is infeasible for the motion system to follow. For example, control input constraint, for example voltage required be applied to the motor exceeds its rated voltage, and thus endangers the safety of the entire motion system, or shorten its lifespan. Adaptive reference governor 103 estimates point values of parameters d, b, c and their confidence level from signal 1008, then reshape the infeasible trajectory 109 in time domain to produce a new trajectory 107 by slowing down acceleration or deceleration to enforce that all constraints with estimated parameters d, b, c are satisfied by 107.
In some embodiment, the trajectory generator computes a time optimal trajectory satisfying velocity and acceleration constraints to maximize productivity of the motion control system, where the velocity and acceleration constraints restrict the maximal velocity and acceleration that the motor can operate. For example, the velocity constraint is
|x2|≤vmax
where vmax is the maximum velocity. The acceleration constraint is
dx
2
+c+bu−a
max≤0
−amax−dx2−c−bu≤0,
where amax is a constant that defines the maximum acceleration. The time optimal trajectory computation does not require the knowledge of values of parameters d, b, c. This is expedient for embedded platforms with limited computing power, and thus the time optimal trajectory 109 could violate control constraint. ARG 103 incorporates estimated model parameters and reshapes 109 to produce a trajectory 107 with all constraints satisfied.
The motor is typically subject to other physical constraints such as maximal current that can flow into the motor. In some embodiment, the physical constraint on the maximal current is given by
|u|≤umax
In some embodiment, trajectory generator determines trajectories satisfying velocity, acceleration, and physical current/voltage constraints. The trajectory generation algorithm is computational expensive, as well as require a good knowledge of motor parameter values ensure constraint satisfaction and operation efficiency. According to some embodiments of the present disclosure, the adaptive reference governor can estimate model parameter values which can be fed into trajectory generator.
The embodiments of the invention may be embodied as a method, of which an example has been provided. The acts performed as part of the method may be ordered in any suitable way. Accordingly, embodiments may be constructed in which acts are performed in an order different than illustrated, which may include performing some acts simultaneously, even though shown as sequential acts in illustrative embodiments.
Use of ordinal terms such as “first,” “second,” in the claims to modify a claim element does not by itself connote any priority, precedence, or order of one claim element over another or the temporal order in which acts of a method are performed, but are used merely as labels to distinguish one claim element having a certain name from another element having a same name (but for use of the ordinal term) to distinguish the claim elements.
Although the invention has been described by way of examples of preferred embodiments, it is to be understood that various other adaptations and modifications can be made within the spirit and scope of the invention.
Therefore, it is the object of the appended claims to cover all such variations and modifications as come within the true spirit and scope of the invention.