The disclosure relates to the field of intelligent control and naval architecture and ocean engineering technologies, and more particularly to an intelligent course planning method and controller for unmanned surface vehicle.
With rapid development of unmanned technology, unmanned surface vehicles are receiving increasing attention in recent years. Compared with unmanned aerial vehicles and unmanned ground vehicles, research of unmanned surface vehicles is still in an exploratory stage. As a small water surface platform with an autonomous planning and autonomous course capability, movement of an unmanned surface vehicle is non-linear and interference of the unmanned surface vehicle is uncertain in a complex and ever-changing water environment. Thus, a course planning and control technology of the unmanned surface vehicle is particularly important.
For example, a Chinese patent application NO. CN202110533417.4 (corresponding to patent publication NO. CN113341953A) discloses a course control method for a gondola-type unmanned surface vehicle, in which a time-delay problem can be reduced and a requirement for course deviation control of an inland unmanned surface vehicle is met. A Chinese patent application NO. CN202111549519.1 (corresponding to patent publication NO. CN114371701A) discloses a course control method for unmanned surface vehicle, a controller, an autopilot, and an unmanned surface vehicle, in which a course control method for unmanned surface vehicle based on a fuzzy adaptive proportional derivative (PD) control is achieved. A Chinese patent application NO. CN202110306691.8 (corresponding to patent publication NO. CN112947477A) discloses a course active-disturbance-rejection control system and control method for unmanned surface vehicle, in which a course active-disturbance-rejection control system and control method for unmanned surface vehicle based on a fuzzy neural network is achieved. A Chinese patent application NO. CN202210591518.1 (corresponding to patent publication NO. CN114894187A) discloses a navigation method for unmanned surface vehicle, and a difficult navigation problem of a course angle drifting of the unmanned surface vehicle is solved.
However, most of the above methods belong to traditional methods, which are used to solve a single issue related to course control for the unmanned surface vehicle; the above methods do not yet have an ability of intelligent course planning for unmanned surface vehicle, and do not have an effect of global optimization for intelligent planning.
Therefore, in an aspect of course planning and controlling for the unmanned surface vehicle, a signal technology is difficult to apply to a complex water environment and meet a global planning requirement. Various intelligent control technologies and intelligent algorithms can be fused to design a course controller for unmanned surface vehicle, which can ensure superior planning and control effectiveness in different scenarios.
A purpose of the disclosure is to provide an intelligent course planning method and controller for unmanned surface vehicle to overcome defects of the above related art.
The purpose of the disclosure can be implemented through the following technical solutions.
An intelligent course planning method for unmanned surface vehicle is provided, which includes:
In an embodiment, the dynamic system of intelligent course planning for unmanned surface vehicle is constructed based on an equation as follows:
where θ(t) represents a course deflection angle of an unmanned surface vehicle; {dot over (θ)}(t) represents a first order partial derivative of the course deflection angle of the unmanned surface vehicle with respect to time; {umlaut over (θ)}(t) represents a second order partial derivative of the course deflection angle of the unmanned surface vehicle with respect to time; M represents a payload mass of the unmanned surface vehicle; g represents a gravitational acceleration; L represents a length of the unmanned surface vehicle; J represents a moment of inertia of the unmanned surface vehicle; D represents a viscous frictional resistance of the unmanned surface vehicle; ν(t) represents a disturbance signal of an actual sea condition; u(t) represents a target course.
In an embodiment, the transforming a form of the dynamic system of intelligent course planning for unmanned surface vehicle includes:
In an embodiment, a formula of the designing a cost function of intelligent course planning for unmanned surface vehicle based on the dynamic system of intelligent course planning for unmanned surface vehicle after transforming the form is as follows:
γ(t)=γ[x(t),u(t),t]=a1Δx12(t)+a2Δx22(t)+a3u2(t);
In an embodiment, formulas for designing, based on the cost function, the performance index Q(t) of intelligent course planning for unmanned surface vehicle are as follows:
In an embodiment, N sampling data pairs of Φ(t) and Γ(t) at adjacent sampling times are collected by the RBF neural network model, and Φ(t) is related to the basic function of the RBF neural network, Γ(t) is related to the continuous approximation difference of the performance index of intelligent course planning for unmanned surface vehicle, the N sampling data pairs are aligned and arranged to obtain Φ(t) and Γ(t), and formulas of θ(t) and Γ(t) are as follows:
In an embodiment, the obtaining a target course expression of intelligent course planning for unmanned surface vehicle based on the function P(t) in the performance index, includes: performing approximate dynamic planning on a target formula based on a quadratic form, where a formula of the target formula is as follows:
In an embodiment, the intelligent course planning method for unmanned surface vehicle further includes: calculating, based on the target course expression and the performance index, a second order partial derivative of the performance index Q(t) of intelligent course planning for unmanned surface vehicle on each state variable x(t) and the target course u(t) to construct the Hessian matrix, where a formula of the Hessian matrix is as follows:
An intelligent course planning controller for unmanned surface vehicle includes a memory and a processor, the memory is configured to store a computer program, the processor is configured to call the computer program to execute the above intelligent course planning method for unmanned surface vehicle.
Compared with the related art, the disclosure has the following beneficial effects.
A control goal of intelligent course planning for unmanned surface vehicle is implemented of the disclosure based on an optimization principle of an approximate dynamic programming, that is, an overall optimization from a starting point to an end point of the course deflection angle and a step-by-step optimization can be implemented under a certain payload mass and sea condition disturbance of the unmanned surface vehicle, and a high-accuracy course planning can be implemented during an entire process of navigation of the unmanned surface vehicle.
Minimization of both the course deflection angle and a control signal can be achieved based on a quadratic form cost function and the performance index of intelligent course planning for unmanned surface vehicle by specially designing a virtual RBF neural network and the least square method, so as to reduce an overshoot of the course deflection angle and the control signal, achieve an optimization of flight and steering energy consumption, reduce a navigation power consumption for unmanned surface vehicle, and achieve completely data-driven for intelligent course planning for unmanned surface vehicle and a high-accuracy feedback adjustment.
The following provides a detail explanation of the disclosure in conjunction with drawings and embodiments. The embodiment is implemented based on technical solutions of the disclosure and the embodiment provides detailed implementation methods and specific operation processes. However, a protection scope of the disclosure is not limited to the following embodiments.
The disclosure provides an intelligent course planning method for unmanned surface vehicle, an overall optimization from a starting point to an end point of a course deflection angle and a step-by-step optimization can be implemented based on an optimization principle of an approximate dynamic programming. Minimization of both the course deflection angle and a control signal can be achieved based on a quadratic form cost function and a performance index of an intelligent course planning for unmanned surface vehicle by specially designing a virtual radial basis function (RBF) neural network and a least square method.
As shown in
In step 1, a dynamic system of intelligent course planning for unmanned surface vehicle is constructed.
In step 2, a form of the dynamic system of intelligent course planning for unmanned surface vehicle is transformed.
In step 3, a cost function of intelligent course planning for unmanned surface vehicle is designed based on the dynamic system of intelligent course planning for unmanned surface vehicle after transforming the form.
In step 4, a performance index and a continuous approximation difference of the performance index of intelligent course planning for unmanned surface vehicle are designed based on the cost function.
In step 5, a neural network model is designed based on the performance index and the continuous approximation difference of the performance index, and the neural network model is a virtual RBF neural network model.
In step 6, a function P(t) in the performance index of intelligent course planning for unmanned surface vehicle is equivalently obtained based on the virtual RBF neural network model.
In step 7, a target course expression of intelligent course planning for unmanned surface vehicle is obtained based on the function P(t) in the performance index.
In step 8, a Hessian matrix is constructed based on the target course expression and the performance index, and a positive definiteness of the Hessian matrix is determined.
In step 9, in a situation that the positive definiteness of the Hessian matrix meets a preset requirement, the target course expression is output and the target course expression is input into a preset feedback network; in a situation that the positive definiteness of the Hessian matrix does not meet the preset requirement, then return to step 1 to repeat each step in sequence.
The disclosure provides an intelligent course planning controller for unmanned surface vehicle, and the intelligent course planning controller for unmanned surface vehicle includes a memory and a processor, the memory is configured to store a computer program, the processor is configured to call the computer program to execute the above intelligent course planning method for unmanned surface vehicle.
For step 1, the dynamic system of intelligent course planning for unmanned surface vehicle is constructed in an actual sea condition, in the embodiment, a dynamic process of the dynamic system of intelligent course planning for unmanned surface vehicle can be designed based on a second-order dynamic equation as follows:
Where θ(t) represents a course deflection angle of an unmanned surface vehicle, a unit of θ(t) is degree (°); {dot over (θ)}(t) represents a first order partial derivative of the course deflection angle of the unmanned surface vehicle with respect to time, a unit of {dot over (θ)}(t) is degree per second (°/s); {umlaut over (θ)}(t) represents a second order partial derivative of the course deflection angle of the unmanned surface vehicle with respect to time, a unit of {umlaut over (θ)}(t) is degrees per square second (°/s2); M represents a payload mass of the unmanned surface vehicle, a unit of M is kilogram (kg); g represents a gravitational acceleration, a unit of g is meter per square second (m/s2); L represents a length of the unmanned surface vehicle, a unit of L is meter (m); J represents a moment of inertia of the unmanned surface vehicle, a unit of J is kilogram·square meter (kg·m2); D represents a viscous frictional resistance of the unmanned surface vehicle a unit of D is newton per (meter per second)(N/(m/s)); ν(t) represents a disturbance signal of an actual sea condition; and u(t) represents a target course.
For step 2, the form of the dynamic system of intelligent course planning for unmanned surface vehicle is transformed, in the embodiment, formulas θ(t)=x1(t) and {dot over (θ)}(t)=x2(t) are defined, and formulas for transforming the form of the dynamic system of intelligent course planning for unmanned surface vehicle are as follows:
Where, {dot over (x)}1(t) represents a first order partial derivative of x1(t) with respect to time, {dot over (x)}2(t) represents a first order partial derivative of x2(t) with respect to time.
In the form, a state variable of the dynamic system of intelligent course planning for unmanned surface vehicle is x(t)=[x1(t),x2(t)]T.
In the embodiment, parameters of the dynamic system of intelligent course planning for unmanned surface vehicle are set as: M=1, g=9.8, L=0.5, j=1, and D=2, an added disturbance signal of the actual sea condition 2 is: ν(t)=x2(t) sin(x1(t)), an initial state of the unmanned surface vehicle is set as: x(0)=[1, −0.5], and an initial target course is set as: u(0)=−0.36T.
For step 3, the designing a cost function of an intelligent course planning for unmanned surface vehicle based on the dynamic system of intelligent course planning for unmanned surface vehicle after transforming the form, which is used to achieve an optimization goal of minimization of both the course deflection angle and a control signal, so as to reduce an overshoot of the course deflection angle and the control signal, and achieve an optimization of flight and steering energy consumption. In the embodiment, a formula of the cost function can be designed as follows:
γ(t)=γ[x(t),u(t),t]=a1Δx12(t)+a2Δx22(t)+a3u2(t).
Where, both γ(t) and γ[x(t),u(t), t] represent cost functions of intelligent course planning for unmanned surface vehicle at time t; Δx1(t) represents a difference between a course deflection angle at the time t and a course deflection angle of a planning route at the time t; Δx2(t) represents a partial derivative of the course deflection angle at the time t with respect to the time t; a1, a2, and a3 represent optimization weight coefficients of Δx1(t), Δx2(t) and u(t), respectively, in the embodiment, all of a1, a2, and a3 can be valued as 1, and values of a1, a2, and a3 can be changed according to a specific requirement in other embodiments.
For step 4, in order to achieve intelligent course planning for unmanned surface vehicle in a form of data-driven and direct feedback, formulas of the performance index Q(t) of intelligent course planning for unmanned surface vehicle are designed based on the cost function are as follows:
Where, P(t) is an upper triangular weight matrix of a quadratic performance index, and elements p11(t), p12(t), p13(t), p22(t), p23(t), p33(t) are variables of time t.
The continuous approximation difference of the performance index Q(t) of intelligent course planning for unmanned surface vehicle is recorded as γ(t+1) based on the linear quadratic form function and the cost function, γ(t+1) represents a difference between performance indexes Q(t) and Q(t+1) at adjacent sampling times t and t+1, and an expression of γ(t+1) is as follows:
Q(t)−αQ(t+1)=Q(x(t),u(t))−αQ(x(t+1),u(t+1))=γ[x(t+1),u(t+1),t+1]=γ(t+1).
Where, α represents a discount factor, and 0<α<1, in the embodiment, α is valued as 0.95 (i.e., α=0.95), and a value of α can be changed according to a specific requirement in other embodiments.
For step 5, the neural network model is designed based on the performance index and the continuous approximation difference of the performance index, in the embodiment, the neural network model is a virtual RBF neural network model, and an RBF neural network represents a radial basis function neural network. In the embodiment, a target value of the performance index should be consistent with a target result approximated based on a virtual RBF neural network.
A basic function of the RBF neural network is designed as a 6×1 matrix ϕ(x(t),u(t)), and a formula of the 6×1 matrix ϕ(x(t), u(t)) is as follows:
ϕ(x(t),u(t))=[x12(t),2x1(t)x2(t),2x1(t)u(t),x22(t),2x2(t)u(t),u2(t)]T.
In the embodiment, number 6 is a number of the basic function of the virtual RBF neural network.
A connection weight parameter of the RBF neural network is designed as a 6×1 matrix w(t), and a formula of the 6×1 matrix w(t) is as follows:
w(t)=[p11(t),p12(t),p13(t),p22(t),p23(t),p33(t)]T.
Where, each variable in the matrix w(t) is consistent with a corresponding variable in the matrix P(t) in the performance index of intelligent course planning for unmanned surface vehicle, that is, a variable pij(t) in the connection weight parameter w(t) happens to be a variable in i-th row and j-th column in the matrix P(t) in the performance index of intelligent course planning for unmanned surface vehicle. Meanwhile, a form conforming to subsequent matrix calculations is considered in a design of the above formula.
For step 6, the virtual RBF neural network is configured with an evaluation module in the embodiment, and the evaluation module is configured to collect N sampling data pairs of Φ(t) and Γ(t) at adjacent sampling times, among them, Φ(t) is related to the basic function of the RBF neural network model, Γ(t) is related to the continuous approximation difference of the performance index of intelligent course planning for unmanned surface vehicle, the N sampling data pairs are aligned and arranged to obtain Φ(t) and Γ(t), and formulas of Φ(t) and Γ(t) are as follows:
Where, Φ(t)∈6×N, and Γ(t)∈N×1, N is a number of the sampling data pairs, the number N of the sampling data pairs is larger than or equal to a number of the basic function of the RBF neural network, in the embodiment, N≥6; and a calculation relationship of the N sampling data pairs of Φ(t) and Γ(t) is as follows:
ΦT(t)w(t)=Γ(t).
A least square method is used to obtain w(t), and a formula of w(t) is as follows:
w(t)=(Φ(t)ΦT(t))−1Φ(t)ΓT(t).
Due to consistency between elements in w(t) and elements in P(t), the function P(t) in the performance index of intelligent course planning for unmanned surface vehicle is equivalently obtained.
For step 7, the obtaining a target course expression of intelligent course planning for unmanned surface vehicle based on the function P(t) in the performance index, includes the following steps.
An approximate dynamic panning is performed on a target formula based on a quadratic form, and a formula of the target formula is as follows:
A partial derivative of the performance index Q(t) of intelligent course planning for unmanned surface vehicle on the target course u(t) is calculated to obtain an expression of the target course u(t) of intelligent course planning for unmanned surface vehicle of the form of direct feedback, and the expression is as follows:
For step 8, a second order partial derivative of the performance index Q(t) of intelligent course planning for unmanned surface vehicle on each state variable x(t) and the target course u(t) is calculated, based on the target course expression and the performance index, to construct the Hessian matrix, and a formula of the Hessian matrix is as follows:
For step 9, the preset requirement of the positive definiteness of the Hessian matrix is formulated by those skilled in the art, in the embodiment, the Hessian matrix is a positive-definite matrix. Under a positive-definite constraint of the positive-definite matrix, each time step of intelligent course planning for unmanned surface vehicle is a target time step, and the each time step is also convergent, that is, an intelligent planning effect is achieved. Finally, the target course expression is output, and the target course expression is input into a feedback network.
The above describes in detail the embodiments of the disclosure. It should be understood that, various modifications and changes according to ideas of the disclosure can be made by those skilled in the art without labor work. Therefore, any technical solutions obtained by those skilled in the art based on the ideas of the disclosure and the related art through logical analysis, reasoning, or limited experiments shall be within the protection scope determined by claims.
Number | Date | Country | Kind |
---|---|---|---|
202211158293.7 | Sep 2022 | CN | national |
Number | Name | Date | Kind |
---|---|---|---|
11274929 | Afrouzi | Mar 2022 | B1 |
20190295003 | Dronen | Sep 2019 | A1 |
20210003404 | Zeng | Jan 2021 | A1 |
20220307849 | Chattopadhyay | Sep 2022 | A1 |
Number | Date | Country |
---|---|---|
102400 | Feb 2019 | CN |
110018687 | Jul 2019 | CN |
110018687 | Jul 2019 | CN |
113900372 | Jan 2022 | CN |
Entry |
---|
Tang Chong et al., Application of lm-proved LM-RBF Neural Network in ARDC; Ordnance Industry Automation; Jul. 2020, vol. 39, pp. 11-15 and 28. |
Number | Date | Country | |
---|---|---|---|
20240118694 A1 | Apr 2024 | US |