The present invention relates to the technical field of fault diagnosis of electromechanical systems, and particularly to an aero-engine bearing fault diagnosis method based on variational mode decomposition and residual network.
Nearly 40% of the navigation accidents in a year are caused by mechanical problems such as equipment system failures, faults, and wear and tear of key parts and components. The aero-engine is the key aircraft component with the most mechanical parts and the most complex working environment. Any accidental damage to an aero-engine can cause major accidents and economic losses.
Bearings, as aero-engine rotor supports, work in high-temperature, high-pressure and highly corrosive environments, and are prone to damage such as wear, spalling, and ablation under alternating impact loads. A bearing fault may increase system noise and vibration and, at worst, cause serious damage to the whole engine and its accessories. If the fault is not detected promptly and accurately, the safety and efficiency of aerial operations are put at great risk. Therefore, how to monitor the operating status of an aero-engine, diagnose its faults promptly and accurately, and predict the occurrence of faults is of great research significance for ensuring flight safety.
Mechanical faults in an engine traditionally manifest as vibration. Some existing studies address the fault diagnosis of rotating parts such as aero-engine bearings. Most of them rely on vibration signal analysis, that is, they collect the vibration acceleration signal of the engine casing and extract the time-domain and frequency-domain characteristics of faults through traditional manual signal analysis. Although its accuracy is assured, bearing fault diagnosis based on signal processing depends heavily on signal-processing knowledge and on manual work during the complex process of feature extraction. In recent years, as artificial intelligence technology has matured, research on aero-engine fault diagnosis based on machine learning and deep learning has sprung up. On the one hand, a large amount of vibration data is stored during the service of aero-engines and needs to be analyzed and mined. On the other hand, improved computer hardware makes it possible to process larger volumes of data.
As a consequence, how to diagnose and analyze bearing faults in the rotating mechanical parts of aero-engines, and to accurately identify the fault category, is an urgent problem for those skilled in the art.
In view of the above, the present invention provides an aero-engine bearing fault diagnosis method based on variational mode decomposition and residual network, which is to collect acceleration signals at different positions of the body under different fault states, and diagnose and analyze the faults of the bearings of the rotating mechanical parts in aero-engines based on variational mode decomposition and one-dimensional residual network, which improves the diagnostic accuracy.
In order to achieve the above purpose, the present invention provides an aero-engine bearing fault diagnosis method based on variational mode decomposition and residual network, which comprises the following steps:
Collect signals in different positions and directions with vibration acceleration sensors, which will be used as sample data;
Convert the said sample data into the target data type through normalization, slicing, variational mode decomposition and labeling, to get a training sample set;
Build a 1D-Resnet model, input the said training sample set into the said 1D-Resnet model for training and save the model parameters when the model converges;
Diagnose the aero-engine bearing fault with the trained 1D-Resnet model, to get the diagnostic results.
Optionally, the said normalization is maximum and minimum normalization, with the expression of:
$$X_{norm} = \frac{X - X_{min}}{X_{max} - X_{min}} \tag{1}$$
Wherein, $X$ is the sample data, $X_{max}$ is the maximum value of the sample data, $X_{min}$ is the minimum value of the sample data, $X_{norm}$ is the normalized result, and [0,1] is the numerical interval of the normalized result.
Optionally, the specific operation of the said slicing is to divide the acceleration signals of the long signal wave every N points to get multiple pieces of short signal wave data of the same length.
Optionally, the specific operation of the said slicing is to amplify the said sample data by overlapping sampling, and segment it every M step length. There is overlap between adjacent sliced data.
Optionally, the specific operation of performing variational mode decomposition on the sliced data is as follows:
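Decompose the sliced original one-dimensional signal f(t) into k intrinsic mode functions with limited bandwidth, and extract the frequency-domain characteristics of the signal. The expression of the constrained variation is:
$$\min_{\{u_k\},\{\omega_k\}} \left\{ \sum_{k=1}^{K} \left\| \partial_t \left[ \left( \delta(t) + \frac{j}{\pi t} \right) * u_k(t) \right] e^{-j\omega_k t} \right\|_2^2 \right\} \tag{2}$$
$$\text{s.t.} \quad \sum_{k=1}^{K} u_k(t) = f(t) \tag{3}$$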
The expression of the intrinsic mode functions is:
$$u_k(t) = a_k(t)\cos(\varphi_k(t)) \tag{4}$$
Wherein, $k$ is the index of the decomposed mode, $\{u_k\}=\{u_1, \ldots, u_K\}$ represents the $K$ intrinsic mode functions, $\{\omega_k\}=\{\omega_1, \ldots, \omega_K\}$ is the center frequency of each function, $\delta(t)$ is the Dirac delta function, $*$ is the convolution operation, $t$ is the time, $a_k(t)$ is the non-negative envelope, $\varphi_k(t)$ is the phase, $\partial_t$ represents the partial derivative with respect to time $t$, $K$ is the total number of modes, and $j$ is the imaginary unit.
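Introduce the quadratic penalty factor α and the Lagrange multiplier λ, and transform the constrained variational problem into an unconstrained variational problem. The augmented Lagrange expression is:
$$L(\{u_k\},\{\omega_k\},\lambda) = \alpha \sum_{k=1}^{K} \left\| \partial_t \left[ \left( \delta(t) + \frac{j}{\pi t} \right) * u_k(t) \right] e^{-j\omega_k t} \right\|_2^2 + \left\| f(t) - \sum_{k=1}^{K} u_k(t) \right\|_2^2 + \left\langle \lambda(t),\, f(t) - \sum_{k=1}^{K} u_k(t) \right\rangle \tag{5}$$
Wherein, λ(t) represents the Lagrange multiplier.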
Optionally, the specific operation of the said labeling processing is to add corresponding fault labels of 0-i to the data after variational mode decomposition, where i is the total number of categories.
Optionally, the constructed 1D-Resnet model comprises an input layer, 5 residual modules, a Dropout layer, a Flatten layer and an output layer;
The first residual module comprises a one-dimensional convolutional layer and a one-dimensional maximum pooling layer;
The second residual module comprises two identity modules; for each identity module, the main road consists of two one-dimensional convolutional layers connected in series, and the branch is an identity mapping channel;
The third, fourth and fifth residual modules consist of an identity module and a convolutional downsampling module connected in series; the main road of the convolutional downsampling module consists of two one-dimensional convolutional layers connected in series, and the branch is a convolution layer with a convolution kernel size of 1.
Optionally, the training of the said 1D-Resnet model specifically comprises the following steps:
Input a multi-channel one-dimensional vector through the said input layer and input the said multi-channel one-dimensional vector into the residual module; the number of channels=the number of sensors*the number of intrinsic modes k after variational mode decomposition;
Convolute the output of the upper layer through the convolutional layer in the said residual module, and extract the spatial features of a local area by using the nonlinear activation function. The mathematical model is expressed as:
$$y_i^{l+1}(j) = w_i^l \cdot x^l(j) + b_i^l \tag{6}$$
$$z_i^{l+1}(j) = f\left(y_i^{l+1}(j)\right) \tag{7}$$
Wherein, $y_i^{l+1}(j)$ represents the input of the jth neuron in layer l+1, that is, the output of layer l; $w_i^l$ represents the weight of the ith filter kernel in layer l, the symbol · represents the dot product of the kernel and the local area, $x^l(j)$ represents the input of the jth neuron in layer l, $b_i^l$ represents the bias of the ith filter kernel in layer l, $z_i^{l+1}(j)$ represents the result of the ith filter kernel in layer l+1 under the action of the nonlinear activation function, and $f(\cdot)$ represents the activation function. The logical value output of each convolution is transformed nonlinearly.
Reduce network parameters through the maximum pooling layer in the residual module, and lessen the data length through the said convolutional downsampling module;
Randomly discard the parameters trained by the residual module through the said Dropout layer;
Integrate the local information distinguished by the residual module through the said Flatten layer to get single channel data;
Back-propagate the data output from the output layer with the softmax function to optimize the 1D-Resnet model until the model converges. The trained 1D-Resnet model is obtained.
Optionally, the specific operation of getting the diagnostic results is as follows:
Convert the acceleration signal of the aero-engine to be detected into the target data type and input it into the trained 1D-Resnet model, to get the probability value of each fault category, and take the fault label corresponding to the maximum probability value as the final fault category identification result.
According to the above technical scheme, compared with the prior art, the present invention provides an aero-engine bearing fault diagnosis method based on variational mode decomposition and residual network, which has the following beneficial effects:
To better describe the embodiment of the present invention or the technical scheme of the prior art, a brief introduction of the accompanying drawings to be used in the descriptions of the embodiment or the prior art is made hereby. Obviously, the drawings below are only the embodiment of the present invention, and for those ordinarily skilled in the art, other drawings based on such drawings can be obtained without making creative endeavors.
The technical scheme in the embodiment of the present invention is clearly and completely described below in combination with the drawings of the embodiment of the present invention. Obviously, the embodiment is just a part of embodiments of the present invention, not all of them. Based on the embodiment of the present invention, all the other embodiments obtained by those ordinarily skilled in the art without making creative endeavors shall fall into the scope of protection of the present invention.
The embodiment of the present invention discloses an aero-engine bearing fault diagnosis method based on variational mode decomposition and residual network, as shown in
Collect signals in different positions and directions with vibration acceleration sensors according to the needs, and use such signals as sample data;
Specifically, in this embodiment, data is collected from a deep groove ball bearing tested on the main reducer test bench of a helicopter transmission system. The faulty bearing under test is installed where the drive shaft enters the gearbox, and the acceleration sensor is located on the gearbox casing. The speed sensor collects the (constant) output speed of the motor, at a sampling frequency of 10,000 Hz. Data is collected for one minute in each of three periods: startup, smooth operation and coasting to a stop, and the data of each minute is set as a group. Bearing faults include rolling element, inner ring, outer ring, and combined faults. Single-point faults (single-point holes with a diameter of 0.1 mm) are set in the corresponding parts with EDM technology.
Convert the sample data into the target data type through normalization, slicing, variational mode decomposition and labeling to get a training sample set, which specifically comprises the following steps:
First, normalization is maximum and minimum normalization, with the expression of:
$$X_{norm} = \frac{X - X_{min}}{X_{max} - X_{min}} \tag{1}$$
Wherein, $X$ is the sample data, $X_{max}$ is the maximum value of the sample data, $X_{min}$ is the minimum value of the sample data, $X_{norm}$ is the normalized result, and [0,1] is the numerical interval of the normalized result.
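The maximum and minimum normalization described above can be illustrated with a minimal Python sketch. The function name `min_max_normalize` and the handling of a constant signal are assumptions made for illustration and are not taken from the embodiment.

```python
def min_max_normalize(samples):
    """Scale a sequence of numbers linearly into the interval [0, 1]."""
    x_min, x_max = min(samples), max(samples)
    span = x_max - x_min
    if span == 0:
        # A constant signal has no range; mapping it to zeros is an assumption.
        return [0.0 for _ in samples]
    return [(x - x_min) / span for x in samples]


normalized = min_max_normalize([2.0, 4.0, 6.0])
print(normalized)  # [0.0, 0.5, 1.0]
```

In practice the minimum and maximum would typically be taken per sensor channel, but the text does not state this, so the sketch leaves that choice to the caller.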
Further, the specific operation of data slicing is to divide the long signal wave every N points to get multiple pieces of short signal wave data of the same length. If a small amount of fault data is collected, the sample data can be amplified by overlapping sampling. The data is divided every M step lengths, and there is overlap between adjacent sliced data.
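As an illustration of the slicing step, the following sketch cuts a long signal into windows of N points and, when a step length M smaller than N is given, produces overlapping slices. The function name `slice_signal` and the choice to drop a trailing remainder shorter than N are assumptions, not details from the embodiment.

```python
def slice_signal(signal, n, step=None):
    """Cut `signal` into windows of length `n`, advancing `step` points each time.

    step == n (the default) gives non-overlapping slices; step < n gives
    overlapping slices. A trailing remainder shorter than `n` is dropped
    (an assumption made for this sketch).
    """
    if step is None:
        step = n
    if n <= 0 or step <= 0:
        raise ValueError("n and step must be positive")
    return [signal[start:start + n] for start in range(0, len(signal) - n + 1, step)]


signal = list(range(10))
print(slice_signal(signal, 4))     # [[0, 1, 2, 3], [4, 5, 6, 7]]
print(slice_signal(signal, 4, 2))  # [[0, 1, 2, 3], [2, 3, 4, 5], [4, 5, 6, 7], [6, 7, 8, 9]]
```

With overlap, the same ten points yield four slices instead of two, which is how overlapping sampling amplifies a small fault data set.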
Further, the variational mode decomposition of the sliced data is performed with the VMD method in the vmdpy library in Python. VMD is a new self-adaptive and completely non-recursive mode variation and signal processing method, which avoids the influence of signal length selection on decomposition results. The decomposition is essentially a process of finding the optimal solution of the constrained variational problem. The original one-dimensional signal f(t) is decomposed into k intrinsic mode functions (IMF for short) with limited bandwidth, under the constraint that the sum of the estimated bandwidths of the modes is minimized and the sum of all modes is equal to the original signal. The expression of the constrained variation is:
$$\min_{\{u_k\},\{\omega_k\}} \left\{ \sum_{k=1}^{K} \left\| \partial_t \left[ \left( \delta(t) + \frac{j}{\pi t} \right) * u_k(t) \right] e^{-j\omega_k t} \right\|_2^2 \right\} \tag{2}$$
$$\text{s.t.} \quad \sum_{k=1}^{K} u_k(t) = f(t) \tag{3}$$
The expression of the intrinsic mode functions is:
$$u_k(t) = a_k(t)\cos(\varphi_k(t)) \tag{4}$$
Wherein, $k$ is the index of the decomposed mode, $\{u_k\}=\{u_1, \ldots, u_K\}$ represents the $K$ intrinsic mode functions, $\{\omega_k\}=\{\omega_1, \ldots, \omega_K\}$ is the center frequency of each function, $\delta(t)$ is the Dirac delta function, $*$ is the convolution operation, $t$ is the time, $a_k(t)$ is the non-negative envelope, $\varphi_k(t)$ is the phase, $\partial_t$ represents the partial derivative with respect to time $t$, $K$ is the total number of modes, and $j$ is the imaginary unit.
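Introduce the quadratic penalty factor α (to reduce the interference of Gaussian noise) and the Lagrange multiplier λ, and transform the constrained variational problem into an unconstrained variational problem. The augmented Lagrange expression is:
$$L(\{u_k\},\{\omega_k\},\lambda) = \alpha \sum_{k=1}^{K} \left\| \partial_t \left[ \left( \delta(t) + \frac{j}{\pi t} \right) * u_k(t) \right] e^{-j\omega_k t} \right\|_2^2 + \left\| f(t) - \sum_{k=1}^{K} u_k(t) \right\|_2^2 + \left\langle \lambda(t),\, f(t) - \sum_{k=1}^{K} u_k(t) \right\rangle \tag{5}$$
Wherein, λ(t) represents the Lagrange multiplier.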
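For orientation, in the commonly published VMD formulation the saddle point of the augmented Lagrangian is sought by alternating updates in the frequency domain. The embodiment does not spell out the solver, so the updates below are an assumption about how the unconstrained problem is typically solved. Hats denote Fourier transforms, n is the iteration counter, and τ is a dual step size that is not named in the text:

```latex
\hat{u}_k^{\,n+1}(\omega) =
  \frac{\hat{f}(\omega) - \sum_{i \neq k} \hat{u}_i(\omega) + \hat{\lambda}(\omega)/2}
       {1 + 2\alpha\,(\omega - \omega_k)^2}

\omega_k^{\,n+1} =
  \frac{\int_0^{\infty} \omega\,\lvert \hat{u}_k(\omega) \rvert^2 \, d\omega}
       {\int_0^{\infty} \lvert \hat{u}_k(\omega) \rvert^2 \, d\omega}

\hat{\lambda}^{\,n+1}(\omega) =
  \hat{\lambda}^{\,n}(\omega) + \tau \Bigl( \hat{f}(\omega) - \sum_{k} \hat{u}_k^{\,n+1}(\omega) \Bigr)
```

The first update acts as a Wiener-like filter centered on the current center frequency, which shows why a larger α yields narrower-band modes; the second places each center frequency at the center of gravity of the mode's power spectrum.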
When performing the variational mode decomposition, it is necessary to define the number of decomposition modes k and the bandwidth limit α, where k is generally 5 or 7, and the empirical value of α is 1.5-2.0 times the length of the slice sample.
In the fault diagnosis of rotating parts such as aero-engine bearings, VMD can be used to decompose the vibration acceleration signal containing Gaussian white noise and then preliminarily extract the frequency-domain features of the signal, which enhances the frequency representation of fault features in the signal and improves the effect of bearing fault diagnosis.
Further, the specific operation of labelling is to add corresponding fault labels of 0-i to the data after variational mode decomposition, where i is the total number of categories.
Further, an aero-engine fault database management system is established with SQL Server database technology to realize data interaction and effective storage.
In this embodiment, the data collected in the first part is preprocessed as above (see
Take the data collected in the first and second minutes as the training set and test set, which are respectively used for training model iteration and testing the model accuracy change during training. The data collected in the third minute is set as the verification set, used for testing the generalization effect of the model.
According to the principle of 1D-Resnet neural network, the specific structure of the aero-engine bearing fault diagnosis model proposed by the present invention is shown in
The network model in this embodiment consists of an input layer, 5 residual modules, a Dropout layer, a Flatten layer and an output layer. The input data is a multi-channel one-dimensional vector with a length of 600 and a number of channels of 20 (the number of channels=the number of sensors*the number of intrinsic modes k after variational mode decomposition).
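The channel count can be checked with a small sketch. It assumes 4 sensor channels and k = 5 modes, which is consistent with the 20 channels stated above but is not given explicitly at this point in the text; the function name `stack_channels` and the dummy data are hypothetical.

```python
def stack_channels(modes_per_sensor):
    """Flatten [sensor][mode][time] data into a [channel][time] list.

    The resulting number of channels equals
    (number of sensors) * (number of intrinsic modes k).
    """
    channels = []
    for sensor_modes in modes_per_sensor:
        channels.extend(sensor_modes)
    return channels


# Assumed shapes: 4 sensors, k = 5 modes per sensor, 600 points per slice.
n_sensors, k, length = 4, 5, 600
dummy = [[[0.0] * length for _ in range(k)] for _ in range(n_sensors)]
sample = stack_channels(dummy)
print(len(sample), len(sample[0]))  # 20 600
```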
The first residual module (Conv1) consists of a one-dimensional convolutional layer (with a size of 3, 64 convolution kernels, a sliding step length of 2, and 3 units of zero-padding) and a maximum pooling layer (with a pooling area size of 3, a sliding step length of 2, and 1 unit of zero-padding). The second residual module (Conv_2x) consists of two identity modules. For each identity module, the main road consists of two one-dimensional convolutional layers connected in series, and the branch is an identity mapping channel. The convolutional layers consist of 64 convolution kernels with a size of 3, a sliding step length of 1, and 1 unit of zero-padding. The third, fourth and fifth residual modules each consist of an identity module and a convolutional downsampling module connected in series. The main road of the convolutional downsampling module consists of two one-dimensional convolutional layers connected in series, and the branch is a convolution layer with a convolution kernel size of 1, a sliding step length of 2, and no zero-padding.
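To see how these settings shorten the data, the sketch below traces the signal length through the layers. It assumes symmetric zero-padding and the floor convention, out = floor((L + 2p - k) / s) + 1; the framework used in the embodiment may pad differently, so the intermediate lengths are illustrative only. The helper name `conv_out_len` is hypothetical.

```python
def conv_out_len(length, kernel, stride, padding):
    """Output length of a 1D convolution or pooling layer (floor convention)."""
    return (length + 2 * padding - kernel) // stride + 1


length = 600
length = conv_out_len(length, kernel=3, stride=2, padding=3)  # Conv1 convolution
print(length)  # 302
length = conv_out_len(length, kernel=3, stride=2, padding=1)  # Conv1 max pooling
print(length)  # 151
length = conv_out_len(length, kernel=3, stride=1, padding=1)  # Conv_2x layers keep the length
print(length)  # 151
for _ in range(3):  # three downsampling modules, each with a stride of 2 on the branch
    length = conv_out_len(length, kernel=1, stride=2, padding=0)
print(length)  # 19
```

Because a residual addition requires the main road and the branch to have equal lengths, the main road of each downsampling module must apply the same overall stride of 2 as its branch.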
The convolution kernel of the convolutional layer in the residual module convolves the output of the previous layer to extract the spatial features of the local area and obtain a feature map with a width of W, a height of 1 and a depth of D. In this process, a nonlinear activation function is usually used to construct the output features, and its mathematical model is expressed as:
$$y_i^{l+1}(j) = w_i^l \cdot x^l(j) + b_i^l \tag{6}$$
$$z_i^{l+1}(j) = f\left(y_i^{l+1}(j)\right) \tag{7}$$
Wherein, $y_i^{l+1}(j)$ represents the input of the jth neuron in layer l+1, that is, the output of layer l; $w_i^l$ represents the weight of the ith filter kernel in layer l, the symbol · represents the dot product of the kernel and the local area, $x^l(j)$ represents the input of the jth neuron in layer l, $b_i^l$ represents the bias of the ith filter kernel in layer l, $z_i^{l+1}(j)$ represents the result of the ith filter kernel in layer l+1 under the action of the nonlinear activation function, and $f(\cdot)$ represents the activation function. The logical value output of each convolution is transformed nonlinearly, and the original linearly inseparable multidimensional features are transformed into another space to enhance the linear separability of features.
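Equations (6) and (7) can be traced by hand with a single-kernel sketch. The function names are hypothetical, and ReLU is an assumed choice of activation, since the text only refers to a nonlinear activation function.

```python
def conv1d_single_kernel(x, weights, bias, activation):
    """Slide one filter kernel along x (stride 1, no padding), then activate.

    y(j) = weights . x[j : j + len(weights)] + bias   (Equation 6)
    z(j) = activation(y(j))                           (Equation 7)
    """
    width = len(weights)
    outputs = []
    for j in range(len(x) - width + 1):
        local_area = x[j:j + width]
        y = sum(w * v for w, v in zip(weights, local_area)) + bias
        outputs.append(activation(y))
    return outputs


def relu(value):
    # ReLU is an assumption; any nonlinear activation could be substituted.
    return max(0.0, value)


z = conv1d_single_kernel([1.0, 2.0, 3.0, 4.0], weights=[-1.0, 0.0, 1.0], bias=0.5, activation=relu)
print(z)  # [2.5, 2.5]
```

A real convolutional layer applies many such kernels across all input channels; the sketch keeps one kernel and one channel so the dot product in Equation (6) stays visible.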
The maximum pooling layer reduces the network parameters, and the convolutional downsampling module shortens the data length and reduces the amount of data. Generally, either maximum pooling or average pooling is used; with maximum pooling, the maximum value within the receptive field is taken as the output feature map.
The Dropout layer randomly discards the previously trained parameters. Generally, the retention rate is set to 0.8, which means 20% of the parameters are discarded to prevent excessive model parameters and training resource over-consumption.
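A retention rate of 0.8 can be illustrated with a minimal sketch of inverted dropout. This assumes the usual reading in which dropout zeroes activations during training and rescales the survivors, which may differ from the wording above; the function name and the seeded generator are hypothetical.

```python
import random


def dropout(values, keep_prob=0.8, rng=None):
    """Inverted dropout: zero each value with probability 1 - keep_prob, rescale the rest."""
    if not 0.0 < keep_prob <= 1.0:
        raise ValueError("keep_prob must be in (0, 1]")
    rng = rng or random.Random()
    return [v / keep_prob if rng.random() < keep_prob else 0.0 for v in values]


activations = [1.0] * 10
dropped = dropout(activations, keep_prob=0.8, rng=random.Random(0))
print(dropped)  # surviving entries are rescaled by 1 / 0.8, dropped entries are 0.0
```

Rescaling by 1 / keep_prob keeps the expected value of each activation unchanged, so no correction is needed at inference time, when dropout is switched off.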
The Flatten layer is a fully connected layer. It expands the output of the last residual module into a one-dimensional vector, establishes a fully connected network between the input and output, integrates the local information distinguished by the residual module, compresses the multi-channel one-dimensional data into single-channel one-dimensional data, and then transfers the data to the Softmax classifier for classification.
A Softmax classifier is usually used in the output layer to distinguish labels, and its outputs are the probability values of the categories. The label corresponding to the largest probability value is taken as the recognition result.
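The Softmax step and the selection of the most probable label can be sketched as follows. The function names and the five example scores are hypothetical and serve only to show the computation.

```python
import math


def softmax(logits):
    """Convert raw output scores into probabilities that sum to 1."""
    shift = max(logits)  # subtracting the maximum avoids overflow in exp
    exps = [math.exp(v - shift) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_label(logits):
    """Return (label, probabilities); label is the index of the largest probability."""
    probs = softmax(logits)
    label = max(range(len(probs)), key=probs.__getitem__)
    return label, probs


label, probs = predict_label([1.0, 3.0, 0.5, 2.0, 0.1])  # hypothetical class scores
print(label)  # 1
```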
Next, the method proposed in this study is tested against several other methods, see
Specifically, for the problem of aero-engine bearing fault diagnosis, in this embodiment the original noise-added data (4×600) that is not decomposed by VMD is selected and input into 1D-Resnet, VMD&1D-CNN, and the single 1D-CNN method as a comparison. For each comparison method, the structure and parameters that give the best recognition performance are used. To control the learning rate of the network, the Adam (Adaptive Moment Estimation) optimization algorithm is used to update the network parameters, and the initial learning rate is set to 0.0001. The Dropout regularization method is introduced in the fully connected layer to avoid overfitting the training data, and the retention rate is 0.8. The neural network training parameters are set as follows: maximum number of iterations epoch=500, mini-batch size Batch size=64. The total number of network parameters of the model is 3,936,709, each iteration takes 4.001 s, and the total training time is 33.342 min.
This embodiment was implemented on a computer configured with an NVIDIA GeForce GTX1650 and 16 GB of RAM, using the Python programming language in the Spyder integrated development environment with TensorFlow 2.1.1 and Keras 2.3.1, which are open-source deep learning platforms or software libraries used to develop the proposed model.
According to
After converging to the optimal accuracy rate of the model, the method proposed by the present invention remains stable until the 500th round, while the method using only 1D-CNN causes repeated oscillations of the accuracy rate, which adversely affects the final diagnosis effect.
Generally, accuracy, precision and recall rate are the evaluation criteria for model identification. Accuracy refers to the ratio of the number of samples correctly classified by the classifier to the total number of samples for a given test set, which is presented by the visualization tool that comes with the model; precision (P) refers to the ratio of the number of samples correctly classified as label A to the total number of samples classified as label A; recall rate (R) refers to the ratio of the number of samples correctly classified as label A to the number of samples that actually belong to category A. The calculation formulas are as follows:
$$P = \frac{TP}{TP + FP} \tag{8}$$
$$R = \frac{TP}{TP + FN} \tag{9}$$
Wherein, TP is the number of samples that are correctly classified as A, FP is the number of samples that are classified as A but are actually not A, and FN is the number of samples that are actually A but are incorrectly classified.
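Precision and recall for one label can be computed directly from these counts. The sketch below uses made-up label lists for illustration; the function name and the convention of returning 0.0 when a denominator is zero are assumptions.

```python
def precision_recall(y_true, y_pred, label):
    """Compute precision and recall for one class label from paired label lists."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if p == label and t == label)
    fp = sum(1 for t, p in zip(y_true, y_pred) if p == label and t != label)
    fn = sum(1 for t, p in zip(y_true, y_pred) if p != label and t == label)
    # Returning 0.0 for an empty denominator is a convention chosen for this sketch.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


y_true = [0, 0, 1, 1, 1, 2]  # made-up ground-truth labels
y_pred = [0, 1, 1, 1, 0, 2]  # made-up predictions
print(precision_recall(y_true, y_pred, label=1))  # (0.6666666666666666, 0.6666666666666666)
```

For label 1 in this example, TP = 2, FP = 1 and FN = 1, so both precision and recall equal 2/3.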
In this embodiment, each model is trained five times in succession. The specific values of the accuracy of the diagnosis methods are shown in Table 2. The VMD and 1D-Resnet method achieved 100% identification accuracy, while the other algorithm models show certain errors. Aero-engine bearing fault diagnosis methods shall ensure high precision; otherwise there will be a great safety hazard for aloft workers.
To further test the effectiveness of the method proposed by the present invention, five verification sets are sequentially input into the trained model for fault diagnosis. The diagnosis accuracy and identification speed are shown in Table 3.
Due to different acquisition time, there is a certain difference in data distribution between the validation set and the training set, but after variational mode decomposition, the original vibration acceleration signal is decomposed into multiple intrinsic modes with different center frequencies. The high-frequency impact characteristics reflecting the fault characteristics are amplified thereby, so the overall identification accuracy is significantly improved to nearly 100%. At the same time, in this model, the identification time of 1,000 pieces of data in each group is 1.911 s. As a result, in case of a sudden failure or potential fault during high-altitude operation, the workers have enough time to adjust the operating status of the equipment, so as to avoid serious consequences. According to the confusion matrix of the classification results of group 1 in
In the practical application scenario, the workers can install the acceleration sensors at the designated positions of the aero-engine, to collect its vibration signal during operation, which will be integrated and preprocessed, and put into the fault diagnosis model proposed by the present invention. Consequently, any fault in the current equipment and its category can be identified, providing an accurate and reliable basis for the maintenance workers.
The above description of the disclosed embodiment enables those skilled in the art to practice or use the present invention. Modifications to the embodiment will be apparent to those skilled in the art, and the general principles defined herein can be implemented in other embodiments without departing from the essence or scope of the present invention. Accordingly, the present invention will not be limited to the embodiment described herein, but will cover the widest scope consistent with the principles and novel features disclosed herein.
Number | Date | Country | Kind |
---|---|---|---|
202111663014.8 | Dec 2021 | CN | national |
This application is the national stage entry of International Application No. PCT/CN2022/073740, filed on Jan. 25, 2022, which is based upon and claims priority to Chinese Patent Application No. 202111663014.8, filed on Dec. 30, 2021, the entire contents of which are incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2022/073740 | 1/25/2022 | WO |