The present invention belongs to the technical field of soft sensing based on neural networks and relates to a spatiotemporal dynamic system soft sensing method for automatically determining a partial differential equation (PDE) structure.
With the continuous improvement of the industry level, high-end industrial equipment such as an aero-engine are becoming more complex and larger, and the possibility of failure has also become higher, so the safety and reliability of industrial equipment is crucial. During actual operation, complex industrial equipment have numerous key variables containing information with respect to the health status of the equipment, and the acquisition for key variable values is the basis to ensure the safe and stable operation of the equipment. The measurements for key variables are difficult to acquire through sensors due to the limitation of extreme environment, such as high temperature, high pressure, and installation sites. Accordingly, a huge threat to the reliable, safe, and stable operation of high-end industrial equipment are posed.
As a virtual sensing technology, soft sensing is used for estimating key variables by utilizing easy-to-measure process variables and mathematical models. Therefore, for key process variables of high-end industrial equipment, soft sensing technology plays a key role in ensuring the safety and reliability of the equipment when sensors are not available. The classical observer-based methods use time series data to implement soft sensing. However, the methods cannot easily handle spatial information. PDEs are used for describing industrial processes involving multiple variables. For example, heat equations are used for modeling heat diffuses through a given region; wave equations are used for describing various wave phenomena; Maxwell's equations are used for describing how changes in charges, currents and fields produce electric fields and magnetic fields; and Navier-Stokes (NS) equations are used for describing the motion of viscous fluid substances. Therefore, PDE is a natural idea as a candidate for a soft sensing mathematical model for industrial processes with spatiotemporal dependence. An appropriate PDE structure is of great significance in optimization, control, and prediction in real industry. It should be noted that the above idealized PDE focuses on a single phenomenon or mechanism. However, complex industrial processes often involve multiple disciplines. For example, internal combustion engines involve thermodynamics and fluid mechanics. The integration of several idealized PDEs into a system of equations is a intuitive physic-informed method in industrial processes. However, prior-determined PDE structures cannot always model practical industrial processes well. Therefore, constructing PDEs with proper structures for practical situations is crucial for developing soft sensors.
In recent years, it is an effective method to use machine methods to develop a data-physics-hybrid model to solve forward and inverse problems in PDEs. A physics-informed neural network (PINN) is introduced to solve PDEs by sparse measurement and prior physical knowledge (Raissi M, Perdikaris P, Karniadakis G E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations [J]. Journal of Computational physics, 2019, 378:686-707.). In the PINN framework, the physics-informed loss is used for regularizing the data-loss. Accordingly, an unknown PDE structure including differential operators or unknown driving sources makes it infeasible to solve PDEs using PINN. Therefore, some works focus on learning PDEs and solving PDEs with unknown driving sources in recent years. The investigation shows some recent advances in solving PDEs with unknown driving sources in the PINN framework. Literature (M. Yang and J. T. Foster, “Multi-output physics-informed neural networks for forward and inverse pde problems with uncertainties,” Computer Methods in Applied Mechanics and Engineering, p. 115041, 2022.) shows that the functional forms of both a solution and a driving source are assumed to be unknown, where the measurement of the driving source can be obtained separately from that of the solution. Recent literature (H. Gao, M. J. Zahr, and J.-X. Wang, “Physics-informed graph neural galerkin networks: A unified framework for solving pde-governed forward and inverse problems,” Computer Methods in Applied Mechanics and Engineering, vol. 390, p. 114502, 2022) proposes a graph neural network to solve PDEs with unmeasurable driving sources, where the driving sources are assumed to be constant. Despite some success in the above research, the independent measurement of the driving sources and the use of constant sources to describe dynamic systems are infeasible for practical situations. Other advances show that the PINN is a promising framework for identifying PDEs from spatiotemporal data. Recent work (H. Xu, H. Chang, and D. Zhang, “Dl-pde: Deep-learning based data-driven discovery of partial differential equations from discrete and noisy data,” arXiv preprint arXiv:1908.04463, 2019. and Z. Chen, Y. Liu, and H. Sun, “Physics-informed learning of governing equations from scarce data,” Nature communications, vol. 12, no. 1, p. 6136, 2021.) combine deep learning through neural networks with data-driven PDE identification through sparse regression. Work (Z. Long, Y. Lu, and B. Dong, “Pde-net 2.0: Learning pdes from data with a numeric-symbolic hybrid deep network,” Journal of Computational Physics, vol. 399, p. 108925, 2019.) identifies PDEs by a combination of numerical approximation of differential operators by convolution and symbolic multi-layer neural networks for model discovery. Although the abovementioned works have achieved some success in PDE identification, the form of PDEs driven by unknown driving sources is not taken into account. Literature (Sun Ximing, Wang Aina, Qin Pan, “Coupled Physics-informed Neural Networks for Solving Vibration Displacement Distribution of Finite Line Under the Action of Unknown External Driving Forces” [P]. Liaoning Province: CN116050247A, 2023-5-2) uses idealized PDEs to realize effective soft sensing of industrial processes with spatiotemporal dependence under the conditions that the form of driving sources with spatiotemporal dependence is unknown and the independent measurement are not available, and the gaps often exist between idealized PDEs and practical situations. Therefore, an effective soft sensing algorithm for industrial processes with spatiotemporal dependence that have completely unknown PDE structure and driving sources is a problem under investigation.
In view of the existing problems, the present application proposes a soft sensing technology (CPINNRP-AIC) combining a coupled physics-informed neural network (CPINN) with a recurrent prediction (RP) mechanism and the Akaike's information criterion (AIC), which is used for obtaining a PDE structure for describing industrial processes with spatiotemporal dependence, so as to obtain a solution and a driving source. The method provided by the present invention can obtain an appropriate soft sensing result to ensure optimization, control and prediction of the system, which is of great significance to ensure the actual, safe and stable operation of high-end industrial equipment such as an aero-engine. The CPINNRP-AIC proposed by the present invention is used for soft sensing of industrial processes with spatiotemporal dependence that have completely unknown PDE structure and is verified using heat equations describing heat diffusion phenomena and data sampled from an aero-engine involute spline couplings fretting wear experiment platform, and the effectiveness of the proposed method is verified using evaluation criteria of a root mean square error (RMSE) and a Pearson correlation coefficient (CC).
To achieve the above purpose, the present invention adopts the following technical solution:
A spatiotemporal dynamic system soft sensing method for automatically determining a PDE structure is a soft sensing technology combining a CPINN with an RP mechanism and the AIC, comprising the following steps: firstly, constructing a loss function for training a CPINN with an RP mechanism to obtain a solution and a driving source which satisfy a PDE used for describing spatiotemporal industrial processes; secondly, obtaining differential operator candidates using an automatic differentiation technology; thirdly, selecting an appropriate PDE structure from the differential operator candidates using the AIC to accurately describe the spatiotemporal industrial processes; and finally, verifying that the method proposed by the present invention can obtain a feasible soft sensing result by using heat diffusion phenomena and actual vibration processes. The method comprises the following specific steps:
A PDE used for describing industrial processes with spatiotemporal dependence has the following general form:
x is a space variable,/is a time variable and in the initial state when t=0, u:d×
→
is solution to formula (1), i.e., a variable to be measured of industrial processes with spatiotemporal dependence, and the variable has spatiotemporal dependence; and g:
d×
→
is a driving source with a general form, i.e., the driving source has spatiotemporal dependence, including linearity, nonlinearity and steady state or dynamic state. Ω is open set space to which the space variable belongs, and ωΩ is the boundary thereof; and N is a series of differential operators.
The industrial processes with spatiotemporal dependence can be described by the following PDE with a general form when the exact PDE structure is unknown, i.e., the proposed CPINNRP-AIC is used for solving the following PDE:
Φ[·] is differential operator candidates of a series of partial differential operators, for example, ut,ux,uxx,utx, which are assumed to be sufficient to fully include all the differential operators obtained by means of automatic differentiation, i.e., a series of states of industrial processes with spatiotemporal dependence changing over time and space.
The present invention is to learn to obtain a solution u and a driving source g that can approximately satisfy formula (1) from formula (2); and formula (2) can be written in a form of the following residual function:
When an external driving force g(x,t) is completely known, fN(x,t) can be obtained directly by means of sparse regression. However, the unknown external driving force g(x,t), i.e., industrial processes under the action of the unknown external driving force, will cause the method for obtaining fN(x,t) by means of sparse regression to be infeasible.
Therefore, the present invention proposes the CPINNRP-AIC to obtain a neural network-based solution satisfying the PDE in formula (1). The proposed CPINNRP-AIC comprises two phases: 1) the CPINNRP is for approximating u and g satisfying the PDE in formula (1); and 2) the AIC is for selecting an approximate combination of differential operators satisfying the PDE in formula (1). The CPINNRP is composed of NetU, NetG and NetU-RP, wherein NetU is for approximating u satisfying formula (1), NetG is for approximating g satisfying formula (1), and NetU-RP is for compensating for information loss caused by the discretization strategy with respect to t. The specific steps are as follows:
Step 1: constructing a loss function for training the CPINNRP.
Mountable hardware sensors in a whole domain
wherein MSEDN and MSEPN represent a data loss function and a physics-informed loss function with an undetermined exact PDE structure in the given formula (2), respectively, and the physics-informed loss function contains a physics-informed part with an undetermined combination of differential operators.
The MSEDN is obtained by the following formula:
wherein û(x,t;ΘU) is a function of the network NetU with ΘU being a set of parameters to approximate u satisfying formula (1), and card(·) is the cardinalty of the set ·.
The MSEPN is obtained by the following formula:
Further, {circumflex over (f)}N(x,t;ΘU,λ) is obtained from Φ[û(x,t;ΘU)]·λ with an undetermined combination of differential operators, as shown in formula (7):
wherein ĝ(x,t;ΘG) is a function of the network NetG with ΘG being a set of parameters to approximate g satisfying formula (1); λ represents a parameter vector; and {circumflex over (f)}N(x,t;ΘU,λ) represents a residual function form of formula (3) parameterized by (ΘU,λ).
Step 2: optimizing the coupled CPINNRP-AIC with an undetermined exact PDE structure using a hierarchical training strategy.
Considering the interdependence between the network NetU and the network NetG in formula (4), the hierarchical training strategy is proposed to train the CPINNRP-AIC by means of iterative transmission of parameters, wherein {circumflex over (Θ)}U(k+1) and {circumflex over (Θ)}G(k+1) at are used for approximating u and g, respectively. k represents the number of iteration steps. The purpose of the hierarchical training strategy is to solve the following two coupled optimization problems.
wherein MSEPN(ΘG;{circumflex over (Θ)}U(k),λ(k)) and MSEPN(ΘU;λ,{circumflex over (Θ)}G(k+1)) are the physics-informed parts with an undetermined combination of differential operators, and MSEPN(ΘG;{circumflex over (Θ)}U(k),{circumflex over (λ)}(k)) is obtained based on ({circumflex over (Θ)}U(k),{circumflex over (λ)}(k)); and MSEPN(ΘU,λ;{circumflex over (Θ)}G(k+1)) is obtained from {circumflex over (Θ)}G(k+1));
To compensate for the information loss caused by discretization strategy with respect to t, the delayed prediction through time of the CPINN output û(x,t;{circumflex over (Θ)}CPINN) and the hardware sensors are used as part of the NetU-RP input, i.e., the input of NetU-RP comprises three parts: x, t and the delayed prediction through time of û(x,t;{circumflex over (Θ)}CPINN) or the delayed prediction through time of hardware sensors. û(x,t;{circumflex over (Θ)}CPINN) and the output of the hardware sensors are selected in an either-or way, i.e., when the hardware sensors can be installed at the collocation point (x,t), the output delay of the measurements is used as the input of the collocation point; and when the measurements of the hardware sensors cannot be obtained at the collocation point (x,t), the delay of û(x,t;{circumflex over (Θ)}CPINN) is used as the input of the collocation point. Therefore, Φ[û(x,t;{circumflex over (Θ)}U)]·{circumflex over (λ)} with an undetermined combination of differential operators and g satisfying formula (1) are obtained.
Step 3: using the AIC to select an appropriate combination of differential operators satisfying formula (1).
For idealized models, such as heat equations, wave equations, Maxwell's equations and NS equations, many interference factors will exist in complex industrial processes. Therefore, a gap often exists between the ideal situation and the practical situation. For this purpose, the AIC is considered for evaluating and selecting Φ[û(x,t;{circumflex over (Θ)}U)]·{circumflex over (λ)} containing an undetermined combination of differential operators.
wherein p is the number of differential operators contained in evaluation of Φ[û(x,t;{circumflex over (Θ)}U)]·{circumflex over (λ)}, n is the size of the data set, and {circumflex over (σ)}2=MSEDN. The selected combination of differential operators achieves a appropriate PDE structure, i.e., a combination with the minimum AIC value is selected from all the candidates Φ[û(x,t;{circumflex over (Θ)}U)]·{circumflex over (λ)} as a PDE model that is ultimately used for approximating formula (1). The details of the hierarchical optimization coupling strategy and model selection of the CPINNRP-AIC are shown in algorithm 1.
Algorithm 1: CPINNRP-AIC for hierarchical optimization coupling strategy and model selection:
Judging whether the stop criterion is met: if yes, ending the loop, and returning to u(x,t;{circumflex over (Θ)}CPINN); if no, returning to step (a1), and conducting an iterative transmission process of parameters ΘU and ΘG to solve the optimization problems (8) and (9);
Step 4: evaluating the performance of the CPINNRP-AIC method in the spatiotemporal dynamic system soft sensing method for automatically determining a PDE structure, i.e., evaluating the u and g performance of {circumflex over (Θ)}U(k+1) and {circumflex over (Θ)}G(k+1) obtained using the trained CPINNRP-AIC for approximating formula (1).
Using the RMSE and the CC as evaluation criteria to evaluate the performance of the CPINNRP-AIC.
wherein the RMSE is used for evaluating the distance between the ground truth and the prediction, |T| is the cardinality of the test collocation point set (x,t)∈T, and u(x,t) and û(x,t) represent the ground truth and the corresponding solution based on the neural network, respectively. The smaller the value of the RMSE is, the better the performance of the CPINNRP-AIC is.
wherein CC is used for evaluating the similarity between ground truth and the prediction. The CC is the correlation coefficient of u(x,t) and û(x,t), and cov(u(x,t), û(x,t)) is covariances of u(x,t) and û(x,t). Var u(x,t) and √{square root over (Var û(x,t))} are variances of u(x,t) and û(x,t), respectively. The closer the value of the CC is to 1, the better the performance of the CPINNRP-AIC is. For the RMSE and the AIC, snapshots as well as values of the evaluation criteria for the full time period are given.
Without loss of generality, the industrial processes with spatiotemporal dependence which have an unknown exact PDE structure and action of the driving sources can be described by formula (2), typically including: (a) heat equations describing heat diffusion phenomena: the heat of the interior of the finite line is subjected to heat diffuses under the drive of the heat source, and heat equations with sources describing the actual temperature distribution of the interior of the finite line, if the measurements of the heat source generated inside an aero-engine in the actual operation process cannot be obtained and the specific heat diffusion phenomena cannot be described using idealized heat equations, heat equations with an unknown exact PDE structure are solved in order to obtain the temperature distribution at any point; and (b) wave equations describing wave phenomena: vibration displacement distribution of the finite line, which is described by wave equations, if the spline shaft of an aero-engine vibrates during operation and is affected by various driving sources, a variety of industrial processes with spatiotemporal dependence are involved, and the vibration process cannot be precisely modeled by using the idealized wave equations, it is necessary to solve wave equations with an unknown exact PDE structure to obtain vibration displacement distribution at any point for spline shaft.
The present invention has the following beneficial effects:
The present invention proposes a new PINN, called CPINNRP-AIC, which is used for approximating a solution and a driving source which satisfy the PDE used for describing industrial processes with spatiotemporal dependence. In the present invention, firstly, the CPINNRP is used to approximate the solution satisfying the PDE, and the driving source satisfying the PDE is obtained. Then, a data-physics-hybrid loss function for training the CPINNRP is proposed, wherein the physics-informed par involves an undetermined combination of differential operators. Therefore, AIC is used for selecting an appropriate combination of differential operators. The proposed CPINNRP-AIC is a data-driven method which is used for achieving an appropriate PDE structure and a neural network-based solution satisfying PDE. Finally, the feasibility and effectiveness of the CPINNRP-AIC for soft sensing of industrial processes with spatiotemporal dependence that have a completely unknown PDE structure are verified using heat equations and the data sampled from aero-engine involute spline couplings fretting wear experiment platform. The performance of the CPINNRP-AIC is evaluated by using RMSE and CC, and the results tend to 0 and 1, respectively, indicating that the CPINNRP-AIC has good performance as a soft sensing method. This method is soft sensor for dynamic systems with spatiotemporal dependence that have an unknown PDE structure. The proposed CPINNRP-AIC is suitable for soft sensing technologies of multi-class dynamic systems with spatiotemporal dependence, including heat diffusion phenomena in practical situations and wave phenomena in actual vibration processes. The CPINNRP-AIC proposed by the present invention can achieve the effective acquisition of key variable values of high-end complex equipment such as an aero-engine in operation processes, so as to provide guarantee for the good control performance of high-end industrial equipment and the stability of safe operation, thus having practical engineering application significance and economic values.
The present invention is further described below in combination with specific embodiments.
The present invention provides a spatiotemporal dynamic system soft sensing method for automatically determining a PDE structure. The method solves a PDE by using the CPINN with an RP mechanism and the AIC, the PDE is used for describing industrial processes with spatiotemporal dependence, and the exact PDE structure comprises a combination of differential operators and unknown driving sources. Specific embodiments discussed are only used for illustrating the implementation mode of the present invention, not used for limiting the scope of the present invention. The implementation mode of the present invention is described below in detail in conjunction with the drawings, specifically comprising the following steps:
To theoretically demonstrate the effectiveness of the CPINNRP-AIC, embodiment 1 first illustrates the implementation mode of the present invention using numerical simulation of one-dimensional heat equations.
Embodiment 1: heat diffusion phenomena of a one-dimensional finite line under the impact of an unknown heat source with one end being a heat insulation end and the other end being a heat dissipation end in practical industrial systems, i.e., the measurement is the temperature distribution at any point (x,t). Therefore, embodiment 1 uses the CPINNRP-AIC to solve the one-dimensional heat equation with Neumann boundary conditions and an unknown exact model structure, to obtain a soft sensing result.
To obtain a training set and a testing set, the PDEs of the following forms are given to describe heat diffusion phenomena of the one-dimensional finite line under the impact of a heat source with one end being a heat insulation end and the other end being a heat dissipation end.
The heat diffusivity is α=1, u(x,t) is the temperature in any point (x,t), the length of the finite line is L=π, the initial temperature is
the heat source is
and the analytical expression of temperature distribution is
Considering the heat diffusion phenomena in the actual industrial system, the exact PDE structure and the heat source are unknown, and the measurement of the heat source cannot be obtained. A heat equation describing the temperature distribution of the one-dimensional finite line and having an unknown exact model and driving source is converted to a PDE in the residual form shown in formula (3).
Accordingly, CPINNRP-AIC comprises two phases: 1) the CPINNRP is for approximating u and g satisfying formula (13); and 2) the AIC is for selecting appropriate differential operators ut,uxx. The CPINNRP is composed of NetU, NetG and NetU-RP, wherein NetU is for approximating u satisfying formula (13), Net (is for approximating u satisfying formula (13), and NetU-RP is for compensating for information loss caused by the discretization strategy with respect to t.
1. Constructing a loss function:
A training set is obtained from uniform random sampling in a system controlled by formula (13). In the present embodiment, the training set is obtained from uniform random sampling in [0,π]×[0,10], and contains 130 boundary and initial training data (x,t,u)∈DB, including 10 initial condition training data, 60 left boundary condition training data, 60 right boundary condition training data, and 20 internal training data (x,t,u)∈D1. 20 collocation points (x,t)∈E, and the structure of formula (13) is guaranteed by the collocation points. CPINNRP-AIC is trained using the loss function of formula (4). MSEDN and MSEPN represent data loss (5) and physics-informed loss (6), respectively, wherein MSEDN is obtained by formula (5), û(x,t;ΘU) is a function of the network NetU with ΘU being a set of parameters; and MSEPN is obtained by formula (6), ĝ(x,t;ΘG) is a function of the network NetG with ΘG being a set of parameters. MSEPN corresponds to the physics-informed loss of formula (13) on a finite collocation point set (x,t)∈E, and Φ[û(x,t;Θ)]·λ involves an undetermined combination of differential operators to regularize u in the network NetU to satisfy formula (13).
2: Optimizing the coupled CPINNRP-AIC with an undetermined exact PDE structure using the hierarchical training strategy;
Considering the interdependence between the network NetU and the network NetG in the loss function MSE, the hierarchical training strategy is proposed. In many cases of practical application, for example, the exact expressions of the heat equations used for describing the heat diffusion phenomena of the interior of the aero-engine and the heat source g(x,t) cannot be obtained even by sparse measurement, the CPINNRP-AIC can be used to obtain {circumflex over (Θ)}U and {circumflex over (Θ)}G based on sparse measurement data D, to approximate u and g satisfying formula (13), respectively. Considering the interdependence between ΘU and ΘG, the estimation is made by means of interactive iteration. Assuming k as the existing number of iteration steps, the purpose of the hierarchical training strategy is to solve two optimization problems (8) and (9).
3. Using the AIC to select an appropriate combination of partial differential operators satisfying formula (13);
A gap often exists between the idealized heat equation model describing heat diffusion phenomena and the practical situation. For this purpose, the AIC is considered for carrying out model evaluation and selection, and for the heat diffusion phenomena, a first derivative with respect to the time variable is considered. The candidates are constructed as follows: Φ=[ut,ux,uxx,uxt]. The AIC in formula (10) is subsequently used for evaluating and selecting an appropriate combination from Φ.
Judging whether the stop criterion is met: if yes, ending the loop, and returning to u(x,t{circumflex over (Θ)}CPINN); if no, returning to step (a1), and conducting an iterative transmission process of parameters ΘU and ΘG to solve the optimization problems (8) and (9);
(2) Initialization: letting k=0, and initializing the network NetU-RP using {circumflex over (Θ)}CPINN;
2.1) Besides x and t, inputting the output u(x,t;{circumflex over (Θ)}CPINN) of the CPINN and measurement to NetU-RP in an either-or way by recurrently delayed through time, depending on whether the hardware sensors are available;
2.2) Conducting the following loop when k<M:
(a2) Training NetU-RP using formula (4);
2.3) Judging whether the stop criterion of loop is met: if yes, ending the loop, and returning to differential operator candidates Φ[û(x,t;{circumflex over (Θ)}U)]·{circumflex over (λ)} obtained by means of automatic differentiation; if no, returning to step (a2).
2.4) Calculating the AIC value from the combination Φ[û(x,t;{circumflex over (Θ)}U)]·{circumflex over (λ)} of differential operator candidates using formula (10);
2.5) Selecting a combination of differential operators with the minimum AIC value as a final model for approximating formula (13).
4. Evaluation of Performance
The RMSE of formula (11) is used to evaluate the soft sensing error of the proposed CPINNRP-AIC method in the heat diffuse system of the one-dimensional finite line with one end being a heat insulation end and the other end being a heat dissipation end, i.e., evaluating the performance of the CPINNRP-AIC method in solving a one-dimensional heat equation with Neumann boundary conditions and an unknown exact PDE structure. The RMSE of fN(x,t) for the full time period is 5.658725e-03, and u(x,t) and û(x,t) represent an actual temperature value and a corresponding soft sensing temperature value, respectively. To further verify the performance of the CPINNRP-AIC, CC of formula (12) is used to further illustrate the similarity between the soft sensing temperature of the CPINNRP-AIC method and the actual temperature.
In the present embodiment, the relevant setting for the CPINNRP-AIC is that the number of hidden layers is 3 and each layer has 30 neurons. The magnitude of prediction is shown in
RMSE tends to 0 and the CC tends to 1 in Table, indicating that the CPINNRP-AIC in the present embodiment has better soft sensing performance based on the unknown exact heat equations in the heat diffuse system of the one-dimensional finite line.
Embodiment 2: vibration phenomena of a one-dimensional finite line under the impact of an unknown driving force in practical industrial systems, i.e., the measurement is the vibration displacement distribution at any point (x,t). Therefore, embodiment 2 uses the CPINNRP-AIC to solve a one-dimensional wave equation with an unknown exact model structure, so as to obtain a soft sensing result. The feasibility and effectiveness of soft sensing with the CPINNRP-AIC are verified using actual data sampled from the aero-engine involute spline couplings fretting wear experiment platform. The training and testing data sets are sampled from displacement sensors 1-4 installed on the acro-engine involute spline couplings fretting wear experiment platform. The sampling conditions of the datasets shown in
Considering the vibration process in the actual industrial system, the exact PDE structure for describing the vibration process is unknown, and the driving force cannot be measured. Accordingly, the CPINNRP-AIC can be considered as a soft sensors contains two phases: 1) the CPINNRP is for approximating the displacement u and the driving force g satisfying a PDE; and 2) the AIC is for selecting an appropriate PDE structure. The CPINNRP is composed of NetU, NetG and NetU-RP, wherein NetU is for approximating the displacement u satisfying the PDE, NetG is for approximating the driving force g satisfying the PDE, and Net-RP is for compensating for information loss caused by the discretization strategy with respect to t.
1. Constructing a loss function.
MSEDN and MSEPN represent data loss (5) and physics-informed loss (6) respectively. û(x,t;ΘU) is a function of the network NetU with ΘU being a set of parameters. MSEPN is obtained by formula (5), ĝ(x,t;ΘG) is a function of the network NetG with ΘG being a set of parameters MSEPN corresponds to the physics-informed loss on a finite collocation point set (x,t)∈E, and Φ[û(x,t;Θ)]·λ involves an undetermined combination of differential operators to regularize u in the network NetU to satisfy the displacement of PDE describing the vibration process.
2: Optimizing the coupled CPINNRP-AIC with an undetermined exact PDE structure using the hierarchical training strategy
Considering the interdependence between the network NetU and the network NetG in the loss function MSE, the hierarchical training strategy is proposed. The wave equation structure that actually describes the vibration phenomena of the acro-engine involute spline shaft cannot be obtained even by sparse measurement of the driving force, the CPINNRP-AIC can be used to obtain {circumflex over (Θ)}U and {circumflex over (Θ)}G based on sparse measurement data DI to approximate the vibration displacement and the driving force respectively. Considering the interdependence between {circumflex over (Θ)}U and {circumflex over (Θ)}G, the estimation is made by means of interactive iteration. Assuming k as the existing number of iteration steps, the purpose of the hierarchical training strategy is to solve two optimization problems (8) and (9).
3. Using the AIC to select an appropriate combination of differential operators satisfying the PDE structure describing actual vibration processes;
A gap often exists between the idealized wave equation model describing vibration phenomena and the practical situation. Accordingly, the AIC is considered for carrying out model evaluation and selection, and for the wave phenomena, a second derivative with respect to the time variable is considered. The candidates are constructed as follows: Φ=[utt,ux,uxx,uxt]. The AIC in formula (10) is subsequently used for evaluating and selecting an appropriate combination from Φ.
RMSE of formula (11) is used to evaluate the measurement error of the proposed CPINNRP-AIC method in soft sensing of the vibration displacement distribution of the aero-engine involute spline shaft, i.e., evaluating the performance of the CPINNRP-AIC method in solving an unknown exact one-dimensional wave equation structure. RMSE of fN(x,t) for the full time period is 1.053161e-04, and RMSEs of the training set and the test set fN(x,t) are 9.945489e-05 and 1.059776e-04, respectively. u(x,t) and û(x,t) represent the measurement of an actual hardware vibration displacement sensor and a corresponding soft sensing vibration displacement value, respectively. To further verify the performance of the CPINNRP-AIC, CC of formula (12) is used to further illustrate the similarity between the soft sensing vibration displacement of the CPINNRP-AIC method and the measurement of the actual hardware vibration displacement sensor. In the present embodiment, the relevant setting for the CPINNRP-AIC is that the number of hidden layers is 3 and each layer has 10 neurons. Table 2 shows the soft sensing performance of the model obtained by the CPINNRP-AIC for the actual vibration displacement of the aero-engine spline shaft.
Reference example: early work of the research group (Sun Ximing, Wang Aina, Qin Pan, “Coupled Physics-informed Neural Networks for Solving Vibration Displacement Distribution of Finite Line Under the Action of Unknown Driving Forces” [P]. Liaoning Province: CN116050247A, 2023 May 2)
The following idealized wave equation is used to describe the actual vibration processes:
The vibration wave speed is α=1, the length is L=π, the vibration wave propagation time is t=6, the driving force in the area(x,t) is
and the vibration displacement is
The wave equation representing displacement distribution is converted to a PDE in the residual form as shown in formula (3).
Therefore, in the benchmark example, the idealized wave equation (15) is used to describe the vibration process of the aero-engine spline shaft. The benchmark example does not involve the processes of evaluating and selecting undetermined differential operators using the AIC of formula (10). The benchmark example uses a CPINN with an RP mechanism to solve the idealized wave equation (15), and the obtained solution û is the soft sensing result of vibration displacement to be obtained. Table 2 shows comparison of the soft sensing performance of the idealized wave equation describing the vibration displacement distribution of the aero-engine involute splines and the model obtained by the CPINNRP-AIC, and the optimal values in Table 2 are shown in bold.
2.613778e−02
9.999967e−01
3.944919e−01
3.937696e−01
9.998993e−01
9.998996e−01
1.088167e−02
9.999997e−01
3.799071e−01
9.999220e−01
9.480886e−03
9.999998e−01
4.015018e−01
4.011793e−01
9.999433e−01
9.999440e−01
9.999441e−01
2.144921e+00
6.975732e+00
6.968462e+00
9.748884e−01
9.749325e−01
RMSE tends to 0 and the CC tends to 1 in Table 2. The soft sensing result in the benchmark example is not as good as the CPINNRP-AIC method proposed by the present invention due to the idealized wave equation is not completely suitable for controlling the actual vibration processes. The early work of the research group was to use the idealized wave equation to describe the actual vibration processes, and the idealized wave equation cannot completely accurately describe the actual vibration processes due to uncertain factors involved in the actual vibration processes. The CPINNRP-AIC method proposed by the present invention adopts the vibration displacement data obtained from the actual vibration processes to obtain the wave equation model that can accurately describe the actual vibration processes by evaluating the AIC of the combination of differential operators. Therefore, the CPINNRP-AIC method proposed by the present invention has higher soft sensing accuracy.
The above embodiments only express the implementation of the present invention, and shall not be interpreted as a limitation to the scope of the patent for the present invention. It should be noted that, for those skilled in the art, several variations and improvements can also be made without departing from the concept of the present invention, all of which belong to the protection scope of the present invention.
| Number | Date | Country | Kind |
|---|---|---|---|
| 202310974079.7 | Aug 2023 | CN | national |
| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/CN2023/113498 | 8/17/2023 | WO |