This application is based upon and claims priority to Chinese Patent Application No. 202010226934.2, filed on Mar. 27, 2020, the entire contents of which are incorporated herein by reference.
The present invention belongs to the technical field of rolling bearing fault diagnosis, and more specifically, to a deep partial transfer method weighted by domain asymmetry factors for rolling bearing fault diagnosis.
The rolling bearing is a major and key component in large rotating machinery. The bearing faults will cause substantial economic loss, and even seriously endanger people's lives and property. It is, therefore, crucial to perform in-service condition monitoring on the rolling bearings. Intelligent fault diagnosis utilizes advanced machine learning technology to build a mapping relationship between bearing monitoring data and health states, which significantly reduces the excessive reliance on experts' prior knowledge in the diagnostic process. With the rapid development of deep learning technology in recent years, the intelligent level and diagnostic accuracy of intelligent fault diagnosis have been dramatically improved. This has become an important means to ensure the safe operation of bearings. The intelligent fault diagnosis requires a large number of labeled samples to sufficiently train the diagnostic model. However, in engineering practice, the scarcity of labeled samples severely limits the practical application of the intelligent fault diagnosis. Transfer learning, by establishing a transfer diagnostic model, can utilize fault diagnosis knowledge of the source rolling bearing to solve the fault diagnosis problem of the target rolling bearing, which promotes the practical application of the intelligent fault diagnosis of rolling bearings.
Existing transfer diagnostic techniques for rolling bearings have significant limitations: namely, the diagnostic knowledge domains of the source bearing and the target bearing need to be symmetrical, which requires (1) the data of the target bearing are evenly balanced across every health states, and (2) the size of the label space of the source bearing monitoring data is equal to the size of the label space of the target bearing data. In engineering practice, however, these two requirements generally cannot be satisfied due to the following problems. The target bearing is in the normal state for a long time during the in-service monitoring. As a result, the fault state is significantly less frequent compared with the normal state. Therefore, the collected data are imbalanced to include a large amount of normal information and a small amount of fault information. Additionally, the fault state generated by the source bearing may not occur on the target bearing The label space of the source rolling bearing data generally covers the label space of the target bearing. This causes asymmetrical diagnostic knowledge domains between the source bearing and the target bearing.
Due to the influences of the asymmetry of the diagnostic knowledge domain, the existing transfer diagnostic techniques are difficult to effectively use the diagnostic knowledge of the source bearing to identify the imbalanced health states of the target bearing.
In order to overcome the shortcomings of the prior art, an objective of the present invention is to provide a deep partial transfer method weighted by domain asymmetry factors for rolling bearing fault diagnosis, which improves the transfer diagnostic accuracy of the rolling bearing under the domain asymmetry constraint, and promotes the practical application of intelligent diagnostic techniques.
To achieve the above-mentioned objective, the present invention adopts the following technical solution:
A deep partial transfer method weighted by domain asymmetry factors for rolling bearing fault diagnosis, including the following steps:
step 1: obtaining a vibration signal sample set
of a source rolling bearing in R types of health state, where xms∈N×1 represents the mth health state sample of the source rolling bearing and includes N vibration data points, the sample label of the health state sample is yms ∈{1, 2, 3, . . . R}, Ms represents the total number of vibration signal samples of the source rolling bearing, and s represents the source rolling bearing; obtaining a vibration signal sample set
of a target rolling bearing, where xnt ∈N×1 represents the nth unlabeled health state sample of the target rolling bearing and includes N vibration data points, Mt represents the total number of vibration signal samples of the target rolling bearing, and t represents the target rolling bearing;
step 2: building a domain-shared deep residual network, wherein the parameter to be trained in the network is θResNet, and extracting the deep transfer fault features
from the vibration signal sample set of the source rolling bearing and the vibration signal sample set of the target rolling bearing, respectively, where xms,F
step 3: building a parameter-shared domain confusion network, wherein the parameter to be trained in the domain confusion network is θadv, the input of the domain confusion network is the deep transfer fault features
and the output of the domain confusion network is the domain confusion features
where xms,adv represents the domain confusion feature of the mth health state sample of the source rolling bearing, xnt,adv represents the domain confusion feature of the nth health state sample of the target rolling bearing, and adv represents the domain confusion network; and maximizing the following objective function to update the parameter θadv of the domain confusion network:
wherein, after being updated in each iteration, the parameter θadv to be trained in the domain confusion network is truncated within the range of {−ξ, ξ};
step 4: after the parameter θadv to be trained in the domain confusion network is iteratively updated nadv times in step 3, calculating the domain asymmetry factor ρms for the deep transfer feature of the mth health state sample of the source rolling bearing;
step 5: extracting the fault features
of an adaptation layer of the F2 layer of the deep residual network, where xms,F
where, k(⋅,⋅) represents a polynomial kernel function; au represents a slope of the uth polynomial kernel function; U represents the number of the implanted polynomial kernel functions; βu represents a weighting coefficient of the maximum mean discrepancy implanted by the uth polynomial kernel, and βu∈β*, where β* represents the optimal weighting coefficient and is obtained by solving the following optimization problem:
step 6: predicting the probability distribution
of the F3 layer feature of the deep residual network belonging to the health state of the source and target rolling bearings by a Softmax activation function, where Pms,F
where, λ represents a tradeoff parameter for the training of the deep residual network; and
step 7: repeating steps 3-6 in sequence to train the partial transfer diagnostic model combined by the domain confusion network and the deep residual network; after the training of partial transfer diagnostic model is done, inputting the nth unlabeled health sample xnt of the target rolling bearing into the deep residual network of the partial transfer diagnostic model; selecting a health label corresponding to the maximum probability value in the probability distribution Pnt,F
The advantages of the present invention are as follows. The present invention provides a deep partial transfer method weighted by a domain asymmetry factor for rolling bearing fault diagnosis. The method (i) constructs the domain confusion network for adaptive learning of the domain asymmetry factor, (ii) utilizes this factor weighting to suppress the influence of outlier deep transfer fault features of the source rolling bearing on the feature distribution adaption, and (iii) identifies the imbalanced health state of the target rolling bearing by using the partial diagnostic knowledge in the source rolling bearing. The method thus overcomes the limitations of the domain asymmetry on current transfer diagnostic techniques in practical engineering, and improves the transfer diagnostic accuracy of rolling bearing fault under the constraint of the domain asymmetry factor.
The present invention is further described hereinafter with reference to the drawings and embodiments.
As shown in
Step 1: The vibration signal sample set
of the source rolling bearing in R types of health state is obtained, where xms∈N×1 represents the mth health state sample of the source rolling bearing and includes N vibration data points, and the sample label of the health state sample is yms∈{1, 2, 3, . . . R}; Ms represents the total number of vibration signal samples of the source rolling bearing; s represents the source rolling bearing. The vibration signal sample set
of the target rolling bearing is obtained, where xnt∈N×1 represents the nth unlabeled health state sample of the target rolling bearing and includes N vibration data points; Mt represents the total number of vibration signal samples of the target rolling bearing, and t represents the target rolling bearing.
Step 2: Referring to
from the vibration signal sample set of the source rolling bearing and the vibration signal sample set of the target rolling bearing, where xms,F
Step 3: Referring to
obtained in step 2, and the output of the domain confusion network is the domain confusion features
where xms,adv represents the domain confusion feature of the mth health state sample of the source rolling bearing, xnt,adv represents the domain confusion feature of the nth health state sample of the target rolling bearing, and adv represents the domain confusion network. The following objective function is maximized to update the parameter θadv of the domain confusion network:
After being updated in each iteration, the parameter θadv to be trained in the domain confusion network is truncated within the range of {−ξ, ξ}.
Step 4: After the parameter θadv to be trained in the domain confusion network is iteratively updated nad times in step 3, the domain asymmetry factor ρms for the deep transfer feature of the mth health state sample of the source rolling bearing is calculated by the following formula:
represents a Sigmoid function.
Step 5: Referring to
of the adaptation layer are extracted, where xms,F
where, k(⋅,⋅) represents the polynomial kernel function; au represents the slope of the uth polynomial kernel function; U represents the number of the implanted polynomial kernel functions; βu represents the weighting coefficient of the maximum mean discrepancy implanted by the uth polynomial kernel, and βu∈β*, where β* represents the optimal weighting coefficient and is obtained by solving the following optimization problem:
Step 6: Referring to
of the F3 layer feature of the deep residual network belonging to the health state of the source and target rolling bearings is predicted by the Softmax activation function, where Pms,F
where, λ represents a tradeoff parameter for the training of the deep residual network.
Step 7: Steps 3-6 are repeated in sequence to train the partial transfer diagnostic model combined by the domain confusion network and the deep residual network. After the training of partial transfer diagnostic model is done, the nth unlabeled health sample xnt of the target rolling bearing is input into the deep residual network of the partial transfer diagnostic model. The health state corresponding to the maximum probability value in the probability distribution Pnt,F
Embodiment: The identification of the health state of the locomotive wheelset bearing is taken as an example to verify the feasibility of the present invention.
The vibration signal sample set A of the source rolling bearing is derived from the University of Paderborn, as shown in Table 1, the data contain three types of bearing health state: normal state, inner race fault, and outer race fault. The vibration signal samples are obtained in four different working conditions (including 900 r/min, 0.7 N·m, 1 kN; 1500 r/min, 0.1 N·m, 1 kN; 1500 r/min, 0.7 N·m, 1 kN; 1500 r/min, 0.7 N·m, 0.4 kN). The sampling frequency of the vibration signal is 64 kHz during the testing process. 2559 samples are obtained at the end of the test, each type of health state contains 853 samples, and each sample contains 1200 sampling points.
The vibration signal sample set B of the target rolling bearing is derived from the locomotive wheelset bearing, as shown in Table 1, the data set contains two types of bearing health state: normal state and spalling of the outer race surface. The vibration signal samples are collected under the working condition of a 500 r/min rotational speed of the bearing outer race (the inner race is fixed) and a 680 kg radial load at the sampling frequency of 76.8 kHz. The data set contains 832 samples with the normal state and 147 samples with the outer race fault. Each sample contains 1200 sampling points.
A transfer diagnostic task A←B is constructed based on the data sets A and B shown in Table 1 to verify the feasibility of the present invention, in order to identify the health state of the locomotive wheelset bearing by using the knowledge of rolling bearing fault diagnosis accumulated in the laboratory environment. In addition to the diagnostic accuracy, two imbalance classification metrics including the F-score and area under the curve (AUC) are employed to quantify the effect of the present invention on the transfer diagnostic task in consideration of the imbalanced samples in the vibration signal sample set B of the target rolling bearing. The experiment is repeated 10 times to calculate the statistical value of the diagnostic result. As shown in Table 2, the present invention uses partial diagnostic knowledge in the source rolling bearing to obtain the diagnostic accuracy of 97.48% on the vibration sample set of the target locomotive bearing and the statistical standard deviation of 2.03%. In addition, the indices F-score and AUC obtained by the present invention are 0.949 and 0.973, respectively, close to 1, which indicates that the method is of high diagnostic accuracy, and proves the feasibility of the present invention in solving the problem of domain imbalance transfer diagnosis in practical engineering.
The MPK-ResNet and the standard ResNet are additionally selected and compared with the method of the present invention. The MPK-ResNet directly minimizes the multiple polynomial kernel induced maximum mean discrepancy of the fault features of the adaptation layer of the source rolling bearing and the target rolling bearing, and then uses the diagnostic model of the source rolling bearing to identify the health state of the target rolling bearing. Since the MPK-ResNet does not employ the domain asymmetry factor weighting of the present invention, the diagnostic accuracy of the MPK-ResNet is affected by the domain asymmetry and is only 30.58%, the standard deviation is 4.89%, the F-score is significantly lower than that of the present invention, and the AUC is close to 0.5, indicating that the performance of the traditional MPK-ResNet method is close to the random diagnostic model. The standard ResNet method uses the vibration signal sample set of the source rolling bearing to train the deep residual network, and then to directly identify the health state of the target rolling bearing. This method has a diagnostic accuracy of only 15.79%, the standard deviation is relatively high and is 9.83%, and the F-score and AUC indices are significantly lower than those of the present invention.
The comparison of the present invention with the conventional transfer diagnostic method (MPK-ResNet) and the standard deep intelligent diagnostic method (ResNet) indicates that the present invention effectively overcomes the influence of the domain asymmetry on the diagnostic knowledge transfer, thus improving the performance of the transfer diagnostic model.