Method for real time optimization and parallel computing of model prediction control based on computing chart

CROSS-REFERENCE TO RELATED APPLICATION

This application claims the priority benefit of China application serial no. 202110344736.0, filed on Mar. 31, 2021. The entirety of the above-mentioned patent application is hereby incorporated by reference herein and made a part of this specification.

BACKGROUND
Technical Field

The present invention relates to the technical field of real time optimization for model prediction control and in particular, relates to a method for real time optimization and parallel computing of model prediction control based on a computing chart.

Description of Related Art

Since model prediction control is featured by rolling optimization, feedback adjustment and explicit considerations of a system restraint, thus increasing application fields, especially a rapid and dynamic system (such as electronic power, mechatronics engineering, automobile electronics and the like) urgently need model prediction control to process its complicated restraint optimization problem and improve a control property. However, a current control action of model prediction control is obtained by solving an optimal control problem of an open loop in a limited time domain through a model at each sampling moment, relating to numerous computing amount and time. Therefore, large computing amount for online optimization of model prediction control is a main bottleneck of limiting its application.

In the recent years, many valuable achievements have been made for study of rapid computing for prediction control. In a control policy aspect, a design is optimized and a solution process of prediction control is simplified through a controller structure, which effectively reduce a computing complexity. However, the existing methods are mostly serial computing policies in a time domain, with limited speed upgrading space. In an aspect of solving an optimization problem, the existing methods mostly adopt a standard or improved planning algorithm for iterative computing of solution. However, direct solution of a nonlinear optimization problem relates to a large number of complicated computing of gradient and a computing of a matrix inversion, resulting in a very large computing amount. Meanwhile, a general iterative logic for optimization is rather complicated and it belongs to a serial iterative algorithm mostly. The characteristic determines that they are not equipped with a large parallel accelerating space, such that a speed of each iteration can be accelerated as far as possible. This point depends on increasing speed of a processor to a great extent and creates disadvantages for parallel implementation of hardware.

SUMMARY

The objective of the present invention is to overcome the existing defect of the prior art by providing a method for real time optimization and parallel computing of model prediction control based on a computing chart.

The objective of the present invention can be realized through the following technical solution:

A method for real time optimization and parallel computing of model prediction control based on a computing chart comprises the following steps:

S1: building a prediction model of a system state amount and building a target function of a system;

S3: solving and computing a gradient with a manner of back propagation and using a gradient descent method to optimize a control amount of the system and realize real time optimal control of the system.

Preferably, in the parallel computing architecture for model prediction control in the step S2, a symbol indicating that solution of the prediction model and the target function in a present step has been completed is used as a symbol of starting a prediction computing at a next step, thereby realizing parallel computing of the prediction model and the target function.

Preferably, a recurrence relationship between the prediction model and the target function is:

$\begin{matrix} i = 0 & J_{0} = \sum_{i = 0}^{N - 1} Δ u_{k + i ❘ k}^{T} R Δ u_{k + i ❘ k} & x_{k + 1 ❘ k} = f (x_{k ❘ k}, u_{k ❘ k}) \\ i = 1 & J_{1} = J_{0} + x_{k + 1 ❘ k}^{T} {Qx}_{k + 1 ❘ k} & x_{k + 2 ❘ k} = f (x_{k + 1 ❘ k}, u_{k + 1 ❘ k}) \\ ⋮ \\ i = N - 1 & J_{N - 1} = J_{N - 2} + x_{k + N - 1 ❘ k}^{T} {Qx}_{k + N - 1 ❘ k} & x_{k + N ❘ k} = f (x_{k + N - 1 ❘ k}, u_{k + N - 1 ❘ k}) \\ i = N & J = J_{N - 1} + x_{k + N ❘ k}^{T} {Px}_{k + N ❘ k} \end{matrix}$

- wherein J is a target function, ƒ is a prediction model of a system, x_k+i|kis a system state amount in step i at moment k and _k+i|kis a system control amount in step i at moment k.

Preferably, the step S4 specifically comprises:

S41: building a plurality of computing nodes, and setting one storage unit for each computing note, the storage unit storing a related computing parameter;

S42: obtaining a gradient of a target function for an input amount based on back propagation according to the computing parameter in the plurality of computing nodes; and

S43: using a gradient descent method to optimize a control amount of the system, and obtaining an optimal control sequence, thereby realizing parallel prediction control of the system.

Preferably, the step S43 specifically comprises:

- using a gradient descent method to optimize a control amount:
  
  _k|k,_k−1|k. . . _k+N−1|k,
- wherein _k|k,_k+1|k. . . _k+N−1|kis a control amount in step 0, 1 . . . N−1 within moment k,
- completing an optimization process when one of optimization conditions is satisfied, thereby obtaining an optimal control sequence U*^k:
  
  U*_k=[*_k|k,*_k+1|k. . . *_k+N−1|k],
- wherein *_k|k,*_k+1|k. . . *_k+N−1|kis a desired value of a control amount in step 0, 1 . . . N−1 within moment k,
- using a first element *_k|kin the obtained optimal control sequence U*_kas a control amount at moment k, and a new control sequence consisting of a zero element added after an element subsequent to the first element as an initial value of a prediction control input matrix at moment k+1, i.e. U_0|k+1=[*_k+1|k,*_k+2|k. . . *_k+N−1|k,0], and ending an prediction and optimization process at moment k, and
- repeating the above steps to complete a prediction and optimization process at moment k+1.

Preferably, a computing formula of the optimal control sequence is:

$\begin{matrix} u_{k ❘ k}^{*} = u_{k - 1 ❘ k - 1}^{*} - \frac{\partial J}{\partial u_{k ❘ k}} Δ t \\ u_{k + 1 ❘ k}^{*} = u_{k ❘ k}^{*} - \frac{\partial J}{\partial u_{k + 1 ❘ k}^{*}} Δ t \\ ⋮ \\ u_{k + N - 1 ❘ k}^{*} = u_{k + N - 2 ❘ k}^{*} - \frac{\partial J}{\partial u_{k + 1 ❘ k}^{*}} Δ t \end{matrix}$

- wherein *_k⇄i|kis a desired value of a control amount of step i at moment k, *_k−1|k−1is an optimal control amount of the previous moment and Δt is a control step size.

Preferably, the optimization condition is that a difference value between a target function of a present iterative step size and a target function of a previous step is smaller than a set value or reaches limited optimization times or a changing amount of a target function is 0.

Preferably, the system is a vehicle model and the prediction model is a vehicle path and speed tracking model.

Preferably, the control objective of the vehicle path and speed tracking model is:

- rapidly and accurately tracking a vehicle longitudinal velocity v_sand a lateral displacement Y, and setting a control time domain and a prediction time domain both as N and a target function as

$\min_{U} J = \sum_{i = 0}^{N - 1} (x_{i}^{T} {Qx}_{i} + u_{i}^{T} {Ru}_{i}) + x_{N}^{T} {Px}_{N}$

- wherein x_iis a state amount of step i at a present moment, _iis a system control amount of step i at the present moment, Q is a positive weight matrix, P is a positive terminal penalty matrix and R is a positive control amount penalty matrix.

Preferably, the target function is decomposed as follows upon being computed:

$\begin{matrix} J_{0} = g (U) = \sum_{i = 0}^{N - 1} u_{i}^{T} {Ru}_{i} \\ J_{i} = h (y_{i}, J_{i - 1}) = J_{i - 1} + {(y_{i} - y_{r})}^{T} Q (y_{i} - y_{r}) \\ = {(x_{ji} - x_{jr})}^{T} Q (x_{ji} - x_{jr}), j = 1, 6, i = 1, 2, \dots, N \end{matrix}$

- wherein g and h are respectively a decomposition computing function.

Compared with the prior art, the present invention proposes an architecture for parallel computing with a forward solution and a target function solution based on a multi-instruction and multi-data parallel computing concept and in considering coupling of parallel task data combined with a triggering parallel computing manner, sequence of data computing number is ensured and the objective of shortening the computing time is finally achieved. Meanwhile, based on the concept of the computing chart, the architecture uses a process of solving a prediction state with a forward propagation as a node, and the node comprises an input amount, an output amount and a partial derivative of the input amount to the output amount. Through the manner of back propagation, the partial derivative of input and output stored in the node is taken out in order and multiplied to compute a gradient. Generally, a dimension of a control sequence matrix in model prediction control is higher than a dimension of a target function matrix. Therefore, relative to forward computing, inverse computing can reduce operation times to further improve operation efficiency greatly, ensure real time property of a model prediction controller and expand application fields of model prediction control, such that the inverse computing is highly practical and applicable.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flow chart of the present invention;

FIG. 2 is a computing model diagram of a recursive process for a future state of a prediction system of the present invention;

FIG. 3 is a diagram for a parallel computing architecture of the present invention;

FIG. 4 is a schematic diagram of a computing manner for forward propagation;

FIG. 5 is a schematic diagram of a computing manner for back propagation;

FIG. 6 is a partial computing chart of a gradient descent method;

FIG. 7 is a curve diagram of a reference speed;

FIG. 8 is a curve diagram of path tracking;

FIG. 9 is a curve diagram of speed tracking;

FIG. 10 is an error curve diagram of displacement tracking; and

FIG. 11 is an error curve diagram of speed tracking.

DESCRIPTION OF THE EMBODIMENTS

The present invention is described in detail below with reference to the accompanying drawings and specific embodiments. It should be noted that the following descriptions of embodiments are merely illustrative in substance, as the present invention does not intend to limit its applicable objects or functions and the present invention does not limit the following embodiments:

Embodiments

A method for real time optimization and parallel computing of model prediction control based on a computing chart, as shown in FIG. 1, comprises the following steps:

S1: building a prediction model of a system state amount and building a target function of a system;

In the embodiment, the following control system is considered:

A system discrete kinetic equation is: x_k+1=ƒ(x_k, custom character _k);

A system output amount is equal to a system state amount, i.e. y=χ;

A system desired output is a zero matrix, i.e. y^r=0;

A control objective is to minimize a quadratic sum of a prediction error and a quadratic sum of a changing rate in a control sequence simultaneously; and

A control time domain is equal to a prediction time domain, as N.

According to the above system combined with a chart model concept, the computing model of the recursive process for the future state of the following prediction system as shown in FIG. 2 can be obtained, wherein x_k|kis a system amount at moment k, custom character _k|kis an input amount at moment k, ƒ is a recurrence function (a prediction model), J is a target function to be optimized, P is a weight matrix of a terminal, Q is a state weight matrix and R is a weight matrix of a control amount.

A recurrence relationship between the prediction model and the target function is:

- wherein J is a target function, ƒ is a prediction model of a system, x_k+i|kis a system state amount in step i at moment k and _k+i|kis a system control amount in step i at moment k.

S2: building a parallel computing architecture for model prediction control of a prediction model and the target function and employing a triggering parallel computing method by the parallel computing architecture to synchronously compute the prediction model and the target function. In the parallel computing architecture for model prediction control in the step S2, a symbol indicating that solution of the prediction model and the target function in a present step has been completed is used as a symbol of starting a prediction computing at a next step, thereby realizing parallel computing of the prediction model and the target function.

According to the above forward computing process, a computing program is programmed. Each node built corresponds to a storage unit of a single chip microcontroller and comprises the following five parts. With a node at x_k+N−1|kas an example, when forward computing is finished, output of each node and a partial derivative corresponding to the node are stored in a corresponding storage space.

$\begin{matrix} x_{k + N - 1 ❘ k} \\ u_{k + N - 1 ❘ k} \\ x_{k + N ❘ k} = f (x_{k + N - 1 ❘ k}, u_{k + N - 1 ❘ k}) \\ \frac{\partial f}{\partial x} ❘ (x_{k + N - 1 ❘ k}, u_{k + N - 1 ❘ k}) \\ \frac{\partial f}{\partial u} ❘ (x_{k + N - 1 ❘ k}, u_{k + N - 1 ❘ k}) \end{matrix}$

In the process of forward recurrence, computing of the target function is performed synchronously, as shown in FIG. 3. Since data in parallel tasks is not completely independent, a coupling exists. However, for a prediction process of each step, data is independent. In this regard, the present invention combines a triggering parallel computing manner, i.e. using a symbol indicating that solution of the forward inference in step N (i.e. No. 1 parallel computing content) and the target function (i.e. No. 2 parallel computing content) as a symbol of starting computing for prediction in step N+1, so as to ensure sequence of the data computing number and finally achieve the purpose of the shortening the computing time.

Through a calculating gradient of back propagation, i.e. partial derivatives stored correspondingly by needed nodes are taken out in order and multiplied to obtain a gradient of the target function to the input amount. Since N control amounts exist in an entire control time domain, one control amount outputs a control model of a variable. For forward solution, partial derivatives of the target function to each input can only be computed by traversing all nodes N times. For inverse solution, it is only necessary to traverse all nodes once, to thus calculate the partial derivative of the target function to each input. The solution processes thereof are respectively shown in FIG. 4 and FIG. 5.

The step S3 specifically comprises:

S31: building a plurality of computing nodes, and setting one storage unit for each computing note, the storage unit storing a related computing parameter;

S32: obtaining a gradient of a target function for an input amount based on back propagation according to the computing parameter in the plurality of computing nodes; and

S33: using a gradient descent method to optimize a control amount of the system, and obtaining an optimal control sequence, thereby realizing parallel prediction control of the system.

The step S33 specifically comprises:

- using a gradient descent method to optimize a control amount:
  
  _k|k,_k+1|k. . . _k+N−1|k,
- wherein _k|k,_k+1|k. . . _k+N−1|kis respectively a control amount in step 0, 1 . . . N−1 within moment k,
- completing an optimization process when one of optimization conditions is satisfied, thereby obtaining an optimal control sequence U*_k:
  
  U*_k=[*_k|k,*_k+1|k. . . *_k+N−1|k],
- wherein *_k|k,*_k+1|k. . . *_k+N−1|kis respectively a desired value of a control amount in step 0, 1 . . . N−1 within moment k,
- using a first element *_k|kin the obtained optimal control sequence U*_kas a control amount at moment k, and a new control sequence consisting of a zero element added after an element subsequent to the first element as an initial value of a prediction control input matrix at moment k+1, i.e. U_0|k+1=[*_k+1|k,*_k+2|k. . . *_k+N−1|k, 0], and ending an prediction and optimization process at moment k, and
- repeating the above steps to complete a prediction and optimization process at moment k+1.

A computing formula of the optimal control sequence is:

- wherein *_k+i|kis a desired value of a control amount of step i at moment k, *_k−1|k−1is an optimal control amount of the previous moment and Δt is a control step size.

In the embodiment, the system is a vehicle model, and the prediction model is a vehicle path and speed tracking model for parallel computing in considering the following three degrees of freedom (DOFs) vehicle nonlinear model:

${\begin{matrix} {\dot{υ}}_{x} = υ_{y} r + a_{x} \\ {\dot{υ}}_{y} = υ_{x} r + \frac{2 (C_{f} + C_{r}) υ_{y}}{{m υ}_{x}} + \frac{2 r ({aC}_{f} - {bC}_{r})}{m υ_{x}} - \frac{2 C_{f}}{m} δ_{f} \\ \dot{φ} = r \\ \dot{r} = \frac{2 υ_{y} ({aC}_{f} - {bC}_{r})}{I_{z} υ_{x}} + \frac{2 r (a^{2} C_{f} + b^{2} C_{r})}{I_{z} υ_{x}} - \frac{2 {aC}_{f}}{I_{z}} δ_{f} \\ \dot{X} = υ_{x} \cos φ - υ_{y} \sin φ \\ \dot{Y} = υ_{x} \sin φ + υ_{y} \cos φ \end{matrix}$

Wherein v_xis a vehicle longitudinal speed; v_yis a vehicle horizontal speed, a_xis a longitudinal acceleration, r is a yaw rate, C_ƒis a front wheel cornering stiffness, C_ris a rear wheel cornering stiffness, m is a total weight, a,b is a distance from a centroid to a front shaft and a rear shaft, δ_ƒa front wheel steering angle, I_xis a rotational inertia of a vehicle centroid about shaft z and φ is a heading angle of a vehicle.

Therefore, a vehicle continuous nonlinear model can be rewritten as:

{dot over (x)}=ƒ(x, custom character )

- system state prediction (forward propagation).

Firstly, the continuous system models are discretized. In order to improve discretization accuracy, a three-order three-section Runge-Kutta formula is used for discretization to obtain a system discrete model.

${\begin{matrix} k_{1} = T_{s} f (x_{k}, u_{k}) \\ k_{2} = T_{s} f (x_{k} + \frac{1}{2} k_{1}, u_{k}) \\ k_{3} = T_{s} f (x_{k} - k_{1} + 2 k_{2}, u_{k}) \\ x_{k + 1} = x_{k} + \frac{1}{6} (k_{1} + 4 k_{2} + k_{3}) \end{matrix}$

Then, an initial value is given for control input and it is N in a control time domain and a prediction time domain both. Due to a dual-input system, the initial value is set as a column vector of row 1 in line 2N.

U=[ custom character (0)^T(1)^T. . . (N−1)^T]^T

An initial state is x(0) and state prediction is computed according to formula (7).

The control objective of a vehicle path and speed tracking model is:

- rapidly and accurately tracking a vehicle longitudinal velocity v_xand a lateral displacement Y, and setting a control time domain and a prediction time domain both as N and a target function as

$\min_{U} J = \sum_{i = 0}^{N - 1} (x_{i}^{T} {Qx}_{i} + u_{i}^{T} {Ru}_{i}) + x_{N}^{T} {Px}_{N}$

- wherein x_iis a state amount of step i at a present moment, is a system control amount of step i at the present moment, Q is a positive weight matrix, P is a positive terminal penalty matrix and R is a positive control amount penalty matrix.

The target function is decomposed as follows upon being computed:

- wherein g and h are respectively a decomposition computing function.

According to the computing chart model of FIG. 2, it is divided into N layers and one layer is computed each time upon forward prediction and inverse derivation.

With

$\begin{matrix} A_{k} = \frac{\partial x_{k + 1}}{\partial x_{k}} & and & B_{k} = \frac{\partial x_{k + 1}}{\partial u_{k}}, \end{matrix}$

the following can be computed according to the system discrete model:

A_k=I+T_sA_c,k+½T_s²A_c,k²+⅙T_s³A_c,k³
B_k=T_sB_c,k+½T_s²A_c,kB_c,k+⅙T_s³A_c,k²B_c,k

Wherein A_c,kis a Jacobi matrix of ƒ to x at (x_k, custom character _k) and B_c,kis a Jacobi matrix of ƒ to at (x_k,_k).

According to the computing chart model of FIG. 2, a local partial derivative of each layer can be computed in order as follows:

$\frac{\partial J}{\partial x_{N}} = 2 x_{N}^{T} P, \frac{\partial J}{\partial J_{N - 1}} = 1$

For the N−1th layer:

$\frac{\partial x_{N}}{\partial x_{N - 1}} = A_{N - 1}, \frac{\partial x_{N}}{\partial u_{N - 1}} = B_{N - 1}$

$\frac{\partial J_{N - 1}}{\partial x_{N - 1}} = 2 x_{N - 1}^{T} B, \frac{\partial J_{N - 1}}{\partial J_{N - 2}} = 1$

- after obtaining the local partial derivative, the following can be solved according to FIG. 6 and a chain rule:

$\begin{matrix} \frac{\partial J}{\partial x_{N - 1}} = \frac{\partial J}{\partial x_{N}} \frac{\partial x_{N}}{\partial x_{N - 1}} + \frac{\partial J}{\partial J_{N - 1}} \frac{\partial J_{N - 1}}{\partial x_{N - 1}} \\ = \frac{\partial J}{\partial x_{N}} A_{N - 1} + 2 x_{N - 1}^{T} Q \end{matrix}$

$and$

$\begin{matrix} \frac{\partial J}{\partial u_{N - 1}} = \frac{\partial J}{\partial x_{N}} \frac{\partial x_{N}}{\partial u_{N - 1}} + \frac{\partial J}{\partial J_{N - 1}} \frac{\partial J_{N - 1}}{\partial u_{N - 1}} \\ = \frac{\partial J}{\partial x_{N}} B_{N - 1} + 2 u_{N - 1}^{T} R \end{matrix}$

It can be seen that

$\frac{\partial J}{\partial x_{N}}$

solved at the Nth layer is used for solution of both

$\frac{\partial J}{\partial x_{N - 1}} and \frac{\partial J}{\partial u_{N - 1}},$

and the computing of the two does not relate to each other, such that it can performed in parallel.

With the previous layer similar to the N−1th layer, recurrence can be made by combining FIG. 2 to obtain

$\frac{\partial J}{\partial x_{i}}$

and the needed

$\frac{\partial J}{\partial u_{i}}$

$\frac{\partial J}{\partial x_{i}} = \frac{\partial J}{\partial x_{i + 1}} A_{i} + 2 x_{i}^{T} Q, i = 1, 2, \dots, N - 1$

$\frac{\partial J}{\partial u_{i}} = \frac{\partial J}{\partial x_{i + 1}} B_{i} + 2 u_{i}^{T} R, i = 0, 1, \dots, N - 2$

To sum up, by inputting formulas of weight matrices R, Q and P, and the Jacobi matrix of the system, A_iand B_icorresponding to the future x_ican be solved according to the prediction state of the Runge-Kutta formula and solution formulas of A_kand B_k, and computing can be performed according to the recurrence formula to obtain the gradient of the target function to the optimization variable.

$\nabla J (U) = {[{(\frac{\partial J}{\partial u_{0}})}^{T} {(\frac{\partial J}{\partial u_{1}})}^{T} \dots {(\frac{\partial J}{\partial u_{N - 1}})}^{T}]}^{T}$

Each iteration updates control input along a direction opposite to the gradient.

U^(k+1)=U^(k)−s∇J(U^(k))

Simulation Experiment Result

The prediction time domain and the control time domain: N=10 and s=0.01, and weight matrices R=S=0 and P=Q=diag(0.5, 0, 0.2, 0.2, 0, 0.5) are taken. The reference path y_refto be tracked is:

$y_{ref} = {\begin{matrix} 0, & X_{p r e} < 200 m \\ 3 \sin (\frac{π}{1 0 0} X_{pre}), & 200 ⩽ X_{p r e} < 600 m \\ 0, & X_{p r e} > 600 m \end{matrix}$

- wherein X_preis a lateral prediction displacement.

An initial vehicle speed is 15 m/s and a reference speed v_x,refis:

$v_{x, ref} = {\begin{matrix} 25, & X_{pre} < 400 m \\ \sqrt{625 - 2 (X_{pre} - 400)}, & 200 ⩽ X_{pre} < 600 m^{dand} \sqrt{625 - 2 (X_{pre} - 400)} ⩾ 20 \\ 20, & 200 ⩽ X_{pre} < 600 m^{dand} \sqrt{625 - 2 (X_{pre} - 400)} < 20 \\ \sqrt{400 - X_{pre} - 600)}, & X_{pre} > 600 m^{dand} \sqrt{400 - 2 (X_{pre} - 600)} ⩾ 15 \\ 15, & X_{pre} > 600 m^{dand} \sqrt{400 - 2 (X_{pre} - 600)} < 15 \end{matrix}$

The reference speed curve is shown in FIG. 7 and the acceleration at a speed reducing stage is set as −1 m/s. The result of the vehicle path and speed tracking curve is shown in FIGS. 8 and 9 and the displacement and speed tracking error curve is shown in FIGS. 10 and 11.

The embodiments are merely illustrative and do not limit the scope of the present invention. These embodiments can also be implemented in other various manners and various omissions, replacements and modifications can be made without departing from the scope of the technical concept of the present invention.

Number	Name	Date	Kind
20210276588	Kabzan	Sep 2021	A1
20220324484	Hruschka	Oct 2022	A1

Method for real time optimization and parallel computing of model prediction control based on computing chart

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

Field of Search

CPC

International Classifications

Term Extension

Abstract

Description

Claims

Priority Claims (1)

US Referenced Citations (2)

Non-Patent Literature Citations (1)

Related Publications (1)