The present disclosure belongs to the technical fields of on-orbit services, spacecraft control, and model predictive control, and in particular relates to a multi-arm spacecraft model predictive control method based on the mixture of Gaussian processes, equipment, and a medium.
In recent years, an on-orbit service technology has received widespread attention from countries around the world. Major space agencies such as China, the United States, ESA, and Japan have conducted a large number of ground and space experiments, achieving various types of on-orbit service tasks such as satellite capture, refueling, component replacement, and space debris cleanup. However, the spacecraft on-orbit service tasks are all described in a task/Cartesian space, while the traditional controller design is mostly expressed in a joint space, which requires additional inverse kinematics solution, so that besides the lack of intuitiveness, there are problems such as mismatch of control ability and large amount of calculation. Therefore, the present disclosure proposes a task space controller design method based on model predictive control (MPC), which is simple, direct, and suitable for processing high-dimensional nonlinear models of a multi-arm spacecraft.
Furthermore, there are various disturbances in the process of conducting experiments on the ground and executing tasks in space for an on-orbit service spacecraft, including parameter uncertainty, spatial and environmental torque disturbances, and input and measurement noise, which have a significant impact on the performance of on-orbit service tasks. Traditional solutions have their own advantages and disadvantages: the method based on a robust design needs to assume that the total disturbance has an upper bound; the method based on an adaptive disturbance observer is capable of estimating disturbances online, but it has higher requirements for initial values and is poor in performance at initial moments; and the method for constructing disturbance models based on neural networks and fuzzy networks requires offline training and is better in initial performance, but it requires a large amount of training data. Considering the actual workflow of the on-orbit service spacecraft, it is promising to design an off-line disturbance model construction method with small data demand and fast training speed by using the Mixture of Gaussian Processes (MGP).
Finally, the traditional on-orbit service spacecrafts are generally in a free-floating mode, which do not control the satellite platform, or only rely on flywheels for platform attitude adjustment, resulting in weak maneuverability and limited operating range, which affects the efficiency and performance of on-orbit services. Aiming at this problem, the present disclosure employs jet thrusters to adjust the spacecraft platform pose and designs a corresponding thruster control distribution method.
The purpose of the present disclosure is to propose a multi-arm spacecraft model predictive control method based on the mixture of Gaussian processes, equipment and a medium for the deficiencies of the existing multi-arm spacecraft control methods.
The present disclosure is realized by the following technical solution, and the present disclosure proposes a multi-arm spacecraft model predictive control method based on the mixture of Gaussian processes, which specifically includes:
Further, for a multi-arm spacecraft with n manipulators, each of which has m degrees of freedom, and with a platform equipped with l jet thrusters, the dynamic model of the multi-arm spacecraft is first described in a joint space:
M{umlaut over (Q)}+C+d=u (1)
where M represents an inertial parameter of the multi-arm spacecraft; C represents a nonlinear term; d is the disturbance term; Q=[r0T, qT]T represents information about the platform pose and joint angle, r0 is the 6-degree-of-freedom platform pose, and q=[q1, . . . , qn]T is the joint angle; and u=[F0T, FqT]T represents a control force and a control torque, F0 represents a platform control force/torque, and Fq represents a joint torque.
Further, based on a mapping relationship between the end-effectors of the manipulators and the states of joints and a platform, the description of the end-effector poses of the on-orbit service spacecraft is obtained by using a DH parameter method:
where a homogeneous transformation matrix nTm-1m is defined based on DH parameters θm,dm,am,αm:
finally, a state space expression is built:
where X=[QT, {dot over (Q)}T]T, ξ=[r0T, relT, . . . , renT]T represents the platform pose and the end-effector poses of the manipulators, and {circumflex over (d)} represents the estimated value of the disturbance term; and
Further, the recorded p groups of data sets are used for training of the mixture of Gaussian processes, where an input x contains the multi-arm spacecraft pose Q, speed {dot over (Q)}, and the increment Δu of a control variable, and an output y is a difference value between an actual angular velocity and an angular acceleration calculated by a nominal model, that is, the residual dynamics, which is used to capture the impact {circumflex over (d)} of the total disturbance on the system;
y(x)˜N (μ(x), k (x,x′)) (6)
where μ (x) represents a mean function; k (x, x′) represents a covariance function, and an expression thereof is as follows:
where σn is equal to 0.3, σf and l are hyperparameters to be solved, and
the covariance function is solved by using an optimal maximum likelihood estimation method:
after the hyperparameters of the Gaussian processes are obtained via training, the corresponding output value y* can be solved by using the input information x* of a prediction point; a joint probability density function of the samples and a test set is firstly determined:
y
*
|y(x)˜N(K*K−1y, K**−K*K−1K*T) (13)
a predicted value of the Gaussian processes is accordingly obtained:
*
=K
*
K
−1
y (14).
Further, a local Gaussian process method is utilized in the Gaussian processes of the multi-arm spacecraft. The local Gaussian process method includes: clustering based on a Gaussian mixture model method, performing regression on each of Gaussian components, and finally fusing local predicted values during prediction;
where πi is a mixing coefficient, which represents the probability that the sample comes from the ith Gaussian component; p(x|μi, Σi) represents the probability that the sample is generated by the ith Gaussian component; and {μi, Σi} is the hyperparameter of the mixture of Gaussian processes.
Further, after the training samples are obtained, the model parameters are first initialized, and then the model parameters are updated by means of the continuous iteration of steps E and M of expectation-maximization until the model converges,
in step M, the posterior probability ri of the training samples is calculated through step E, and the model parameters are updated by maximizing a logarithmic likelihood function:
for each group of Gaussian components obtained by clustering, a Gaussian regression training method is used to obtain the corresponding Gaussian process models and the predicted value y*i of the prediction point x* under the M local Gaussian process models, and the prediction point generated by the clustering is subjected to weighted fusion by the posterior probability r*i generated by the ith Gaussian component so as to calculate the final predicted value y*:
Further, a performance index in the following form is constructed to solve the control input of the system:
where Qr and Qt represent weighting matrices of a control error and an input penalty term, respectively; and
Further, a generalized thruster installation matrix is given firstly, that is, there are l jet thrusters installed on the satellite platform, where the installation position of each of the thrusters is divided into [xTi, yTi, zTi]T, and the included angles between thrust vectoring and a +x direction and between the thrust vectoring and a x-y plane are respectively βi and γi so that a configuration matrix of a thruster thrust and a torque under the satellite platform system is obtained:
thus, a relationship between the platform control force and the control torque as well as thrust of the jet thrusters is obtained:
F0=CIBAT (21)
where CIB is a conversion matrix from the platform system to an inertial system, and T=[T1, T2, . . . , Ti]T is the thrust of the jet thrusters;
where Tlb and Tub represent thrust saturation constraints for the thrusters;
where Ti* represents the continuous thrust obtained by the distribution algorithm, Tio represents the start-up thrust of the thrusters, and Δt represents a PWM frequency; and
The present disclosure provides electronic equipment, including a memory and a processor, where the memory stores a computer program, and the processor implements the steps of the multi-arm spacecraft model predictive control method based on the mixture of Gaussian processes when executing the computer program.
The present disclosure provides a computer-readable storage medium for storing computer instructions, where the steps of the multi-arm spacecraft model predictive control method based on the mixture of Gaussian processes are implemented when the computer instructions are executed by the processor.
The present disclosure has the following beneficial effects.
The technical solutions in the examples of the present disclosure will be clearly and completely described below in conjunction with the accompanying drawings in the examples of the present disclosure. Apparently, the described examples are only a part rather than all of the examples of the present disclosure. Based on the examples of the present disclosure, all other examples obtained by those of ordinary skill in the art without making creative efforts shall fall within the protection scope of the present disclosure.
Aiming at the deficiencies of the existing multi-arm on-orbit service spacecraft control methods, the present disclosure proposes a multi-arm spacecraft task space control method based on the mixture of Gaussian processes and model predictive control (MGP-MPC). The model predictive control has excellent performance in dealing with complex nonlinear systems such as multi-arm spacecrafts with various constraints, and is widely applied to ground robots, unmanned aerial vehicles, autonomous driving and other practical scenarios. Therefore, a task space controller is designed based on the model predictive control. Besides, in order to enhance the anti-interference capability of the present disclosure, an interference model is established and compensation is carried out in the model predictive control by utilizing the characteristics of small training data volume and high training speed in the mixture of Gaussian processes. Finally, a thrust distribution method is designed to complete platform control. The method provided by the present disclosure is convenient and intuitive in design and has relatively high practicability.
Referring to
M{umlaut over (Q)}+C+d=u (1)
where M represents an inertial parameter of the multi-arm spacecraft; C represents a nonlinear term; d is the disturbance term; Q=[r0T, qT]T represents information about the platform pose and joint angle, r0 is the 6-degree-of-freedom platform pose, and q=[q1, . . . qn]T is the joint angle; and u=[F0T, FqT]T represents a control force and a control torque, F0 represents a platform control force/torque, and Fq represents a joint torque.
Based on a mapping relationship between the end-effectors of the manipulators and the states of joints and a platform, the description form of the end-effectors poses of the multi-arm spacecraft is obtained by using an improved DH parameter method:
where a homogeneous transformation matrix nTm-1m is defined based on DH parameters θm,dm,am,αm:
finally, a state space expression is built:
where X=[QT, {dot over (Q)}T]T, ξ=[r0T, relT, . . . , renT]T represents the platform pose and the end-effectors poses of the manipulators, and {circumflex over (d)} represents the estimated value of the disturbance term; and
y(x)˜N (μ(x), k (x,x′)) (6)
where μ(x) represents a mean function; k (x, x′) represents a covariance function, which has more forms and is generally a Gaussian kernel function, and an expression thereof is as follows:
where σn is equal to 0.3, σf and l are hyperparameters to be solved, and
and
therefore, the Gaussian processes are actually processes of solving the hyperparameters in the covariance function through sample data. The present disclosure adopts an optimal maximum likelihood estimation method to solve the covariance function:
after the hyperparameters of the Gaussian processes are obtained via training, the corresponding output value y, can be solved by using the input information x, of a prediction point; a joint probability density function of the samples and a test set is firstly determined:
y
*
|y(x)˜N(K*K−1y, K**−K*K−1K*T) (13)
a predicted value of the Gaussian processes is accordingly obtained:
*
=K
*
K
−1
y (14).
On the basis of the above-mentioned Gaussian processes of the multi-arm spacecraft, in order to speed up training and predicting, the present disclosure utilizes a local Gaussian process method in the Gaussian processes of the multi-arm spacecraft. The local Gaussian process method includes: clustering based on a Gaussian mixture model method, performing regression on each of Gaussian components, and finally fusing local predicted values during prediction;
where πi is a mixing coefficient, which represents the probability that the sample comes from the ith Gaussian component; p(x|μi, Σi) represents the probability that the sample is generated by the ith Gaussian component; and {μi, Σi} is the hyperparameter of the mixture of Gaussian processes.
EM (Expectation-Maximization) is an effective algorithm for learning mixture models, which is theoretically equivalent to a maximum likelihood estimation method. After the training samples are obtained, the model parameters are first initialized, and then the model parameters are updated by means of the continuous iteration of steps E and M of expectation-maximizationuntil the model converges,
in step M, the posterior probability ri of the training samples is calculated through step E, and the model parameters are updated by maximizing a logarithmic likelihood function:
for each group of Gaussian components obtained by clustering, a Gaussian regression training method is used to obtain the corresponding Gaussian process models and the predicted value y*i of the prediction point x* under the M local Gaussian process models, and the prediction point generated by the clustering is subjected to weighted fusion by the posterior probability r*i generated by the ith Gaussian component so as to calculate the final predicted value y*:
where Qr and Qt represent weighting matrices of a control error and an input penalty term, respectively; and
A generalized thruster installation matrix is given first, that is, there are l jet thrusters installed on the satellite platform, where the installation position of each of the thrusters is divided into [xTi, yTi, zTi]T, and the included angles between thrust vectoring and a +x direction and between the thrust vectoring and a x-y plane are respectively βi and γi so that a configuration matrix of a thruster thrust and a torque under the satellite platform system is obtained:
thus, a relationship between the platform control force and the control torque as well as thrust of the jet thrusters is obtained:
F0=CIBAT (21)
where CIB is a conversion matrix from the platform system to an inertial system, and T=[T1, T2, . . . , Ti]T is the thrust of the jet thrusters;
where Tlb and Tub represent thrust saturation constraints for the thrusters;
where Ti* represents the continuous thrust obtained by the distribution algorithm, Tio represents the start-up thrust of the thrusters, and Δt represents a PWM frequency; and
The present disclosure provides electronic equipment, including a memory and a processor, where the memory stores a computer program, and the processor implements the steps of the multi-arm spacecraft model predictive control method based on the mixture of Gaussian processes when executing the computer program.
The present disclosure provides a computer-readable storage medium for storing computer instructions, where the steps of the multi-arm spacecraft model predictive control method based on the mixture of Gaussian processes are implemented when the computer instructions are executed by the processor.
The multi-arm spacecraft model predictive control method based on the mixture of Gaussian processes, the equipment, and the medium which are proposed by the present disclosure have been described above in detail. Specific examples are used herein to illustrate the principle and implementation of the present disclosure. The description of the above examples is only used to help understand the method and its core idea of the present disclosure. Furthermore, for those skilled in the art, according to the idea of the present disclosure, there may be changes in the specific examples and the scope of application. To sum up, the content of the Description should not be construed as limiting the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
2022105822346 | May 2022 | CN | national |