The present application is based on, and claims priority from JP Application Serial Number 2023-007364, filed Jan. 20, 2023, the disclosure of which is hereby incorporated by reference herein in its entirety.
The present disclosure relates to a state classification method, a state classification device, and a state classification program.
Failure diagnosis of a rotary machine used in a factory or the like is currently an important task. When a machine having a high degree of importance such as a machine serving as one end of a production line is stopped, a very large loss occurs. Since it is difficult to quickly stop a failure after the failure is diagnosed, it is important to find the failure earlier and diagnose a failure mode. The failure of the rotary machine includes many modes such as bearing damage, unbalance, and misalignment. Since influence of most failure modes appears in vibration, vibration data is often used for diagnosis.
In recent years, studies on failure diagnosis by vibration using deep learning have been conducted. According to deep learning, the need for specialized knowledge of machines for feature engineering is reduced. In addition, it is possible to more accurately grasp complicated vibration features related to vibration of a plurality of shafts. Currently, CNN-based methods provide excellent results in many abnormality diagnosis data sets. In order to handle time-series data such as vibration data by CNN, it is necessary to convert vibration into image information, and the vibration data is converted into a spectrogram by preprocessing by many methods. The spectrogram represents an intensity of a frequency component at each time in color information. For example, in a method of Tao, Hongfeng, et al. “An unsupervised fault diagnosis method for rolling bearing using STFT and generative neural networks”. Journal of the Franklin Institute 357.11 (2020): 7286-7307, vibration time-series data is converted into a spectrogram by short time Fourier transform and processed by the CNN, a GAN accompanied by clustering is used, and a failure class is clustered while unsupervised learning is performed by a generative basis.
Tao, Hongfeng, et al. “An unsupervised fault diagnosis method for rolling bearing using STFT and generative neural networks”. Journal of the Franklin Institute 357.11 (2020): 7286-7307 is an example of the related art and is hereinafter referred to as Tao, Hongfeng, et al.
In the method of Tao, Hongfeng, et al., the spectrogram is handled by the CNN, so that an intensity change in each frequency component of the vibration can be grasped. However, the CNN-based method using the spectrogram as an input cannot grasp a phase change in vibration. This is because only an amplitude intensity is reflected in the spectrogram, and phase information is deleted during conversion. Since it is considered that a failure of a machine appears as a change in the phase before the failure becomes apparent as a change in a vibration intensity, a change in an initial stage of the failure cannot be grasped by the CNN-based method.
A state classification method according to an aspect of the present disclosure includes:
A state classification device according to an aspect of the present disclosure includes:
A state classification program according to an aspect of the present disclosure causing a computer to:
Hereinafter, preferred embodiments of the present disclosure will be described in detail with reference to the drawings. The embodiments to be described below do not unduly limit contents of the present disclosure described in the claims. All configurations described below are not necessarily essential components of the present disclosure.
As shown in
A special solution of Equation (1) is expressed by Equation (2). φ is given by Equation (3), ωn is a resonance angular frequency of a system, and ζ is a damping constant.
When Equation (3) is solved for ζ, Equation (4) is obtained.
When an external force F is expressed in Equation (5), vibration in a steady state is given by a sum of corresponding special solutions as in Equation (6).
In Equation (4), a relationship of Equation (7) is obtained by replacing φ with φm, replacing ω with mω, and replacing ζ with ζm.
When considering a special case where the external force F is known, if phases φk and φ1 for two types of harmonic vibrations in a steady state are known, a relationship of Equation (8) is obtained by setting ζk=ζ1 under an assumption that a frequency dependence of the damping constant ζ can be ignored. Since ω is known, ωn can be calculated from Equation (8), and the damping constant ζ can be calculated from Equation (7).
Generally, it is known that deterioration of components of a rotary device appears as a change in the spring constant k or the damping constant ζ before the deterioration becomes apparent as an increase in a vibration intensity. As described above, the damping constant ζ can be calculated by acquiring two or more phases for a plurality of vibration peaks corresponding to a fundamental wave and a harmonic wave. This suggests that a vibration phase includes information corresponding to a state of the rotary device, and phase information is useful for classification of a state of the rotary device. Here, in a state classification method according to the embodiment, a state of a device is classified by using time-series data measured for a vibrating device such as a rotary device as it is without converting the time-series data into information of a frequency domain such as a spectrogram.
As shown in
A type of the vibrating device is not particularly limited, and the vibrating device may be various devices such as a motor having a rotary mechanism or a vibration mechanism, a structure such as a bridge or a building that vibrates due to an external force, or an electric circuit that generates a signal having periodicity. Types of the physical quantity related to the vibration are not particularly limited, and for example, first to N-th physical quantities may be an acceleration, an angular velocity, a velocity, a displacement, a pressure, a current, and a voltage. Hereinafter, the vibrating device that is a state classification target may be referred to as a “target device”.
The measurement data may be time-series data of a digital signal output from a sensor, or time-series data of a digital signal obtained by converting an analog signal output from the sensor by an analog front end. The measurement data may be measurement data of a plurality of channels. For example, the measurement data of the plurality of channels may be measurement data of physical quantities corresponding to an x-axis, a y-axis, and a z-axis orthogonal to one another. The sensor that outputs the measurement data may be, for example, a sensor using a quartz crystal vibrator, a sensor using a MEMS, or an IMU. The MEMS is an abbreviation for micro electro mechanical systems, and the IMU is an abbreviation for an inertial measurement unit.
The state classification device 100 may acquire measurement data measured by the sensor in real time, or may read and acquire measurement data measured in the past from a storage medium in which the measurement data is stored.
The vacuum pump 1 includes a housing 3. The housing 3 includes a motor case 4, a coupling portion 5, a pump case 6, and a gear case 7 disposed from a −X direction side toward a +X direction side. The housing 3 includes a first side wall 8 as a bearing casing between the coupling portion 5 and the pump case 6. The housing 3 includes a second side wall 9 between the pump case 6 and the gear case 7.
An intake pipe 11 is coupled to a surface of the pump case 6 on a +Z direction side. An exhaust pipe 12 is coupled to a surface of the pump case 6 on a −Z direction side.
The coupling portion 5 includes a first leg portion 13 and a second leg portion on the base 20 side. The first leg portion 13 is disposed on a −Y direction side, and the second leg portion is disposed on a +Y direction side. The gear case 7 includes a third leg portion 14 and a fourth leg portion on the base 20 side. The third leg portion 14 is disposed on the −Y direction side, and the fourth leg portion is disposed on the +Y direction side. The first leg portion 13 to the fourth leg portion are fastened to the base 20 by first bolts 15.
A sensor unit 17 is attached to the housing 3. The sensor unit 17 is attached to, for example, the coupling portion 5. For example, the sensor unit 17 is attached such that the x-axis direction, the y-axis direction, and the z-axis direction respectively coincide with the +X direction, the +Y direction, and the +Z direction. The sensor unit 17 outputs measurement data of three channels, that is, measurement data of three axes including the x-axis, the y-axis, and the z-axis. For example, the state classification device 100 acquires the measurement data of the three channels output from the sensor unit 17 in the step S10.
As shown in
Next, in a step S30, the state classification device 100 classifies a state of the target device using information based on the intermediate feature output from the deep learning model in the step S20. For example, the state classification device 100 may classify the state of the target device into a normal state or an abnormal state, or may classify the state into the normal state and any one of a plurality of types of abnormal states.
The state classification device 100 repeatedly performs the steps S10 to S30 until state classification processing is completed (N in step S100).
The sensor 200 detects a physical quantity generated by vibration of the target device and outputs a signal corresponding to the detected physical quantity. An output signal of the sensor 200 is input to the analog front end 210.
The analog front end 210 performs amplification processing, A/D conversion processing, or the like on each output signal of the sensor 200, and outputs measurement data which is a digital time-series signal.
The processing circuit 110 acquires the digital time-series signal output from the analog front end 210 as the measurement data of the physical quantity related to the vibration of the target device, and performs processing of classifying the state of the target device. Specifically, the processing circuit 110 executes a state classification program 121 stored in the storage circuit 120 and executes various types of calculation processing on the measurement data. In addition, the processing circuit 110 executes various types of processing according to an operation signal from the operation unit 130, processing of transmitting a display signal for causing the display unit 140 to display various types of information, processing of transmitting a sound signal for causing the sound output unit 150 to generate various sounds, processing of controlling the communication unit 160 to perform data communication with an external device (not shown), or the like. The processing circuit 110 is implemented by, for example, a CPU or a DSP. The CPU is an abbreviation for a central processing unit, and the DSP is an abbreviation for a digital signal processor.
The sensor 200 may output the measurement data which is the digital time-series signal, and in this case, the processing circuit 110 may output the measurement data output from the sensor 200, and the analog front end 210 is not necessary.
The processing circuit 110 functions as a measurement data acquisition unit 111, an intermediate feature output unit 112, and a state classification unit 113 by executing the state classification program 121. That is, the state classification device 100 includes the measurement data acquisition unit 111, the intermediate feature output unit 112, and the state classification unit 113.
The measurement data acquisition unit 111 acquires the measurement data of the physical quantity related to the vibration measured for the vibrating device. N is a predetermined integer of 1 or more. That is, the measurement data acquisition unit 111 executes the step S10 in
The intermediate feature output unit 112 is a deep learning model including an encoder and a decoder using a recurrent neural network, and outputs, based on the measurement data acquired in the step S10, an intermediate feature of vibration based on an output of the encoder. That is, the intermediate feature output unit 112 executes the step S20 in
The state classification unit 113 classifies the state of the target device based on the intermediate feature output from the intermediate feature output unit 112. That is, the state classification unit 113 executes the step S30 in
As described above, the state classification program 121 is a program that causes the processing circuit 110, which is a computer, to execute the step S10, the step S20, and the step S30 in
The storage circuit 120 includes a ROM and a RAM (not shown). The ROM is an abbreviation for a read only memory, and the RAM is an abbreviation for a random access memory. The ROM stores various programs such as the state classification program 121 and predetermined data, and the RAM stores data generated by the processing circuit 110. The RAM is also used as a work area of the processing circuit 110, and stores programs and data read from the ROM, data received from the operation unit 130, and data temporarily generated by the processing circuit 110.
The operation unit 130 is an input device including an operation key, a button switch, or the like, and outputs an operation signal corresponding to an operation of a user to the processing circuit 110.
The display unit 140 is a display device implemented by an LCD or the like, and displays various types of information based on a display signal output from the processing circuit 110. The LCD is an abbreviation for a liquid crystal display. The display unit 140 may be provided with a touch panel functioning as the operation unit 130. For example, the display unit 140 may display a screen including at least a part of various types of data stored in the storage circuit 120 based on a display signal output from the processing circuit 110.
The sound output unit 150 is implemented by a speaker or the like, and generates various sounds based on a sound signal output from the processing circuit 110. For example, the sound output unit 150 may generate a sound indicating the start or end of the state classification based on the sound signal output from the processing circuit 110.
The communication unit 160 performs various types of control for establishing data communication between the processing circuit 110 and an external device. For example, the communication unit 160 may transmit at least a part of various types of data stored in the storage circuit 120 to the external device, and the external device may display the received information on a display unit (not shown).
At least one of the measurement data acquisition unit 111, the intermediate feature output unit 112, and the state classification unit 113 may be implemented by dedicated hardware. The state classification device 100 may be a single device or may be implemented by a plurality of devices. For example, the sensor 200 and the analog front end 210 may be provided in a first device, and the processing circuit 110, the storage circuit 120, the operation unit 130, the display unit 140, the sound output unit 150, and the communication unit 160 may be provided in a second device separate from the first device. For example, the processing circuit 110 and the storage circuit 120 may be implemented by a device such as a cloud server, and the device may classify states of the target device and transmit information indicating the classified states to a terminal including the operation unit 130, the display unit 140, the sound output unit 150, and the communication unit 160 via a communication line.
The encoder 31 is implemented using a recurrent neural network, and in an input step t, a measurement value xt included in the measurement data is input, an intermediate feature ht is output, and t is an integer of 1 to T, and corresponds to a time when the measurement value xt is measured. The measurement value xt is an m-dimensional vector including m elements. For example, when the measurement data is three-axis acceleration data, the measurement value xt is a three-dimensional vector. The intermediate feature ht is an n-dimensional vector including n elements. A total number T of measurement values and the integer n are set to be appropriate values by a creator of the deep learning model 30. In the embodiment, the recurrent neural network used for the encoder 31 is an LSTM. The LSTM is an abbreviation for a long short term memory.
The attention 32 weights and adds the intermediate features h1 to hT output from the encoder 31 in the input steps 1 to T by attention scores ai, 1 to ai, T in a prediction step i, and creates a context vector ci in the prediction step I, and i is an integer of 1 to p. The context vector ci is an n-dimensional vector and is an input vector to the decoder 33 in the prediction step i.
The decoder 33 is implemented using a recurrent neural network, and in the prediction step i, the context vector ci output from the attention 32 is input, and a prediction value fT+i of the measurement data is output. The prediction value fT+i is an m-dimensional vector including m elements. In the embodiment, the recurrent neural network used for the decoder 33 is an LSTM.
For example, the intermediate features h1 to hT are input to the machine learning model 40 functioning as the state classification unit 113, and the machine learning model 40 classifies the state of the target device using the intermediate features h1 to hT. The machine learning model 40 may be, for example, a support vector machine (SVM).
The LSTM cell 311-t processes the measurement value xt and outputs a processing result to the LSTM cell 311-(t+1) at the subsequent stage, and t is an integer of 1 to T.
In
The cell state C1t is calculated by Equation (13), and the hidden states hit is calculated by Equation (14). An operator “*” in Equation (13) means a Hadamard product, and an operator “*” in Equation (14) means a product of elements.
Returning to the description of
As shown in
As described above, in the encoder 31, a bidirectional LSTM including a forward direction LSTM corresponding to the LSTM cells 311-1 to 311-t and a reverse direction LSTM corresponding to the LSTM cells 312-1 to 312-t is used, and the adders 313-1 to 313-t add up an output of the forward direction LSTM and an output of the reverse direction LSTM at the same time. The intermediate features h1 to hT after the addition are subjected to orthogonalization described later. The bidirectional LSTM can extract features of the first half and the second half of the measurement data in a balanced manner. As a result of the deep learning, information extracted from the measurement data is aggregated in the intermediate features h1 to hT, and the machine learning model 40 classifies the state of the target device using the intermediate features h1 to hT.
In
As shown in
The LSTM cell 331-i processes the context vector ci output from the attention 32 and outputs a processing result to the LSTM cell 331-(i+1) at the subsequent stage, and i is an integer of 1 to p. The LSTM cell 331-1 receives the context vector ci and a hidden state h0′, and outputs a cell state C1′ and a hidden state h1′. The hidden state h0′ is the intermediate feature hT. The LSTM cell 331-i excluding the LSTM cell 331-1 receives the context vector ci, a cell state Ci−1′ and a hidden state hi−1′ output from the LSTM cell 331-(i−1), and outputs a cell state Ci′ and a hidden state hi′. The cell state Ci′ is an m-dimensional vector, and the hidden state hi′ is an n-dimensional vector. Since a configuration of the LSTM cell 331-i is the same as that of
The context vector ci output from the attention 32 is calculated by Expression (15). In Equation (15), αi, t is an attention score to the input step t in the prediction step i, and is calculated by Equation (16). In Equation (16), ei, t is calculated as an inner product of the hidden state hi−1′ and the intermediate feature ht according to Equation (17). According to Equation (16) and Equation (17), the attention score αi, t is larger as the hidden state hi−1′ and the intermediate feature ht are approximate to each other. According to Expression (15), the attention score αi, t is larger as a degree of contribution of the intermediate feature ht to the context vector ci increases.
The cell state Ci′ output from the LSTM cell 331-i is input to the prediction unit 332-i, and an m-dimensional prediction value fT+i is output from the prediction unit 332-i. For example, the prediction unit 332-i may convert the cell state Ci′ into the prediction value fT+i using a ReLU function. The ReLU is an abbreviation for a rectified linear unit.
The LSTM cell 311-t, the LSTM cell 312-t, and the LSTM cell 331-i are not limited to the configuration of
As described above, the deep learning model 30 is a learning-completed model obtained by performing the deep learning for predicting the future value of the measurement data for the target device. The intermediate features h1 to hT output from the encoder 31 include information related to the vibration of the target device. The information is information reflecting the state of the target device or time information independent of the state of the target device. Further, the information reflecting the state of the target device is considered to be independent for each failure mode. In general learning, various pieces of information related to the vibration of the target device are mixed in each element of the intermediate features h1 to hT, and further, independent information of the input vibration is distributed to a plurality of elements of the intermediate features h1 to hT. On the other hand, in the embodiment, the deep learning model 30 is caused to learn while applying orthogonalization pressure to the intermediate features h1 to hT. Specifically, as shown in
It is easier to store independent information related to the vibration in a small number of different elements by the orthogonalization. Therefore, the intermediate features hi to hT in which an influence of confusion of different failure modes or vibration noise is reduced are obtained, and accuracy of the state classification of the target device by the machine learning model 40 can be improved.
A loss function L optimized during learning is expressed by Equation (19) by combining an orthogonalization loss Lorth and an MSE loss Lpred. The MSE is an abbreviation for a mean square error. In Equation (19), A is a hyper parameter for determining trade-off of the orthogonalization loss Lorth and the MSE loss Lpred. A combination coefficient W of the encoder 31 and the decoder 33 is learned in a direction in which the loss function L decreases.
The orthogonalization loss Lorth is expressed by Equation (20). The orthogonalization loss Lorth is a loss function defined such that orthogonalization proceeds among a plurality of elements ct1 to ctn included in the intermediate feature ht, and learning is performed in a direction in which the orthogonalization loss Lorth decreases. As a value of an autocorrelation of the plurality of elements ct1 to ctn increases, a value of the orthogonalization loss Lorth decreases. As a value of a cross-correlation of the plurality of elements ct1 to ctn decreases, the value of the orthogonalization loss Lorth decreases. In other words, by reducing the orthogonalization loss Lorth, an absolute value of each covariance Cov [Xi, Xj] expressing the autocorrelation decreases, and an absolute value of each variance Var [Xi] expressing the cross-correlation increases. As a result, learning is performed such that temporal behaviors of the elements ct1 to ctn of the intermediate feature ht become independent.
The MSE loss Lpred is expressed by Equation (21). In Equation (21), xi is the i-th prediction value, and yi is the i-th actual measurement value. The MSE loss Lpred is a loss function for predicting a future value of the measurement data, and the learning is performed in a direction in which the MSE loss Lpred decreases. By reducing the MSE loss Lpred, the future value can be accurately predicted, and useful information expressing the vibration state of the target device is extracted as the intermediate feature ht.
The deep learning can be performed by unsupervised learning without requiring a label related to a failure. Since the encoder 31 and the decoder 33 are LSTMs, the time-series data can be input to the deep learning model 30 as it is. When the deep learning model 30 performs the deep learning before an operation of the target device, the target device is supposed to operate normally. Therefore, basically, normal time-series data is used as the time-series data of the physical quantity related to the vibration of the target device. When the deep learning model 30 is caused to perform learning using only the normal time-series data, the deep learning model 30 does not learn a pattern of abnormal time-series data. Therefore, information included in the intermediate features h1 to hT is greatly different between a case where the measurement data is normal and a case where the measurement data is abnormal, and the machine learning model 40 can classify whether the target device is in a normal state or an abnormal state.
On the other hand, when the deep learning model 30 performs the deep learning using not only the normal time-series data but also the abnormal time-series data, prediction accuracy of the deep learning model 30 is improved, and it is expected that useful information including an abnormal vibration state of the target device is extracted as the intermediate feature ht. Therefore, for example, the deep learning model 30 may perform learning using first time-series data of a physical quantity measured for a device in a normal state and second time-series data in which at least one of a phase and an amplitude of a signal component of a specific frequency of the first time-series data is changed. For example, the deep learning model 30 may perform learning using the first time-series data of the physical quantity measured for the device in the normal state and the second time-series data of the physical quantity measured for the device whose state changes with time after the first time-series data is measured. In any example, the first time-series data is normal time-series data, and the second time-series data is abnormal time-series data.
In the above description, the machine learning model 40 that is the state classification unit 113 classifies the state of the target device using the intermediate features h1 to hT, and may classify the state of the target device using context vectors c1 to cp that are information based on the intermediate features h1 to hT.
The inventors conducted experiments using a data set of vibration data obtained from two different devices, that is, a dry pump and a rotor kit, and verified validity of the method according to the embodiment. The inventors also created failure data, in which the phase is shifted by signal processing, as abnormal data in the data set of the dry pump. The inventors reproduced a change in state due to unbalance by changing a weight or the presence or absence of a weight fixed to a rotor in the data set of the rotor kit.
The dry pump to be measured is a dry pump AA70W manufactured by Ebara Corporation, and includes a main pump and a mechanical booster pump. Each pump includes a bearing and a gear. In order to measure vibration generated during operation of the dry pump, a six-axis digital output IMU sensor M-354 manufactured by Seiko Epson Corporation was fixed to an upper surface of the main pump such that a sensor X-axis was in an axial direction, a sensor Y-axis was in a left-right direction, and a sensor Z-axis was in an up-down direction. For the fixation, a strong adhesive thin double-sided tape confirmed to have no influence on the measurement was used. An UART was used as a communication protocol for acquiring data, and XYZ axis acceleration data and angular velocity data, and temperature data were acquired at 2000 samples per second for 5.5 seconds by dedicated logger software installed in a personal computer. In the verification of the validity of the method according to the embodiment, acceleration data among these pieces of data was used.
The inventors artificially created failure data by shifting a phase of an X-axis acceleration. Specifically, the inventors created failure data of three modes by shifting phases of frequency components of spectra of the measurement data having peaks at 85 Hz, 314 Hz, and 398 Hz. At the time of input to the deep learning model, each piece of data was cut out and used with a width of 64 points while being shifted by one point. Learning data and test data were divided at a ratio of 8:2 and used.
The deep learning model learned at a maximum of 2000 epochs using the learning data before the phase shift. The test data was input to the model after learning, and the obtained context vectors were classified by a classifier. Elements of the context vectors were input to the classifier one by one and then compared by the highest classification accuracy. The inventors evaluated a detection performance for data of each failure mode using a one-class SVM as the classifier. An AUC when a distance from a discrimination boundary of the one-class SVM was taken as a degree of abnormality was used as an evaluation index. Table 1 shows a result of the detection performance in a method of a comparative example without orthogonalization as the base and a method according to the embodiment with orthogonalization added to the base. As shown in Table 1, the method according to the embodiment exhibited an AUC higher than the method of the comparative example for all of the three failure modes. The AUC is an abbreviation for an area under the curve. From this result, it can be seen that information related to a failure is aggregated into a smaller number of elements by orthogonalization, and detection is facilitated.
The inventors evaluated a classification performance among different failure modes by using the SVM as a classifier. Based on this evaluation, accuracy of estimating any one of the failure modes after failure detection is compared. Table 2 shows the classification accuracy among the failure modes in the method of the comparative example without orthogonalization as the base and the method according to the embodiment with orthogonalization added to the base. As shown in Table 2, according to the method of the embodiment, the classification accuracy of the failure data obtained by shifting the 85 Hz component and the failure data obtained by shifting the 314 Hz component, and the classification accuracy of the failure data obtained by shifting the 314 Hz component and the failure data obtained by shifting the 398 Hz component were greatly improved. From these results, it is considered that the aggregation of the information of the respective failure modes into different elements by the orthogonalization contributes to the improvement of the accuracy. Although the classification accuracy of the failure data obtained by shifting the 85 Hz component and the failure data obtained by shifting the 398 Hz component is lower than that of the method according to the comparative example, it is considered that this is because the pieces of information related to the respective failures are aggregated into the same element.
In order to perform verification using labeled actual data, the inventors collected data using small-sized rolling bearing rotor kits AA31-020 manufactured by Shinkawa Electric Co., Ltd as measurement targets. A three-axis digital output vibration sensor M-A342 manufactured by Seiko Epson Corporation was fixed to an upper surface of a bearing such that a sensor Y-axis was in an axial direction, a sensor X-axis was in a left-right direction, and a sensor Z-axis was in an up-down direction. The vibration sensor was developed for a purpose of measuring vibration of a rotary device. By a configuration in which a one-axis vibration sensor having the same characteristics is mounted on three axes to perform digital signal processing, a flat frequency response characteristic in a use band of 10 Hz to 1000 Hz and excellent synchronization accuracy of 10 μs or less during acquirement of three-axis data are implemented. The vibration sensor has a feature that the vibration sensor is less likely to be affected by induction noise or the like by minimizing an arrangement of an analog wiring.
In order to fix the sensor, a strong adhesive thin double-sided tape confirmed to have no influence on the measurement was used. A UART was used as a communication protocol for acquiring data, and XYZ axis velocity data and temperature data were acquired at 3000 samples per second by dedicated logger software installed in a personal computer. For a purpose of detecting and classifying a change in vibration due to a slight change in unbalance, for components fixed to a rotor, the rotor, in which the presence or absence of two types of washers, that is, a thick washer and a thin washer changes, was rotated at 1200 rpm, and data in which respective fixed states were used as labels was used for verification. Six classes of data having different rotor states depending on the presence or absence of the washer were prepared.
The inventors confirm that there is no significant difference in vibration RMS value depending on the presence or absence of the washer in the classes. At the time of input to the deep learning model, each piece of data was cut out and used with a width of 64 points while being shifted by 32 points. Learning data and test data were divided at a ratio of 3:1 and used.
The deep learning model learned at a maximum of 500 epochs using learning data of all classes. The inventors input the test data to a model after learning, and evaluated a separation performance between the classes using the obtained context vector. A silhouette coefficient was used as an index for the evaluation. The silhouette coefficient reflects a distance between classes and an aggregation degree between classes, and indicates a larger value as the separation performance is higher. The silhouette coefficient when each element of the context vector was used was calculated and then compared by the highest value. Table 3 shows silhouette coefficients when the method of the comparative example as a base line is used. Table 4 shows silhouette coefficients when the method of the embodiment is used. In Tables 3 and 4, the silhouette coefficients between the two classes are shown in all combinations. An average of the silhouette coefficients in all the combinations was 0.9071 in the method of the comparative example, and 0.9239 in the method of the embodiment. The method of the embodiment was superior to the method of the comparative example. From Tables 3 and 4, there is a tendency that the method of the embodiment is excellent when the change in state is small, that is, when a thin washer is attached.
According to the state classification method of the embodiment described above, since the deep learning model 30 includes the encoder 31 and the decoder 33 using the recurrent neural network, the measurement data can be used for learning as time-series data. That is, it is possible to use the time-series data storing the phase information as it is for learning without converting the measurement data into a frequency spectrum or a spectrogram in which the phase information is missing. Therefore, the intermediate features h1 to hT output from the encoder 31 of the deep learning model 30 also include the phase information of the vibration of the target device. Therefore, according to the state classification method of the embodiment, the machine learning model 40 can more accurately classify the state of the target device by using the information based on the intermediate features h1 to hT to grasp a phase change of the vibration of the target device that cannot be detected by the RMS value of the frequency spectrum or the spectrogram.
According to the state classification method of the embodiment, by using the measurement data of the plurality of channels, the phase information included in the measurement data is relatively increased, and the phase change of the vibration of the target device is easily grasped, and thus the state of the target device can be more accurately classified.
According to the state classification method of the embodiment, it is possible to grasp a feature including non-linearity of the measurement data which is the time-series data by using the deep learning model 30. Further, according to the state classification method of the embodiment, the deep learning model 30 performs learning for predicting future measurement data, and thus the information of the measurement data can be efficiently compressed and aggregated into the intermediate features h1 to hT.
According to the state classification method of the embodiment, since the recurrent neural network used for the encoder 31 and the decoder 33 is an LSTM, it is possible to accurately classify the state of the target device using the deep learning model 30 with high prediction accuracy obtained by inputting and learning time-series data having a large data length.
According to the state classification method of the embodiment, by using the deep learning model 30 in which learning is performed such that the cross-correlation among the plurality of elements ct1 to ctn included in the intermediate feature ht is small by orthogonalization, the information of each feature of the measurement data is stored and easily separated in separate elements of the intermediate feature ht. By using such intermediate features h1 to hT, classification and interpretation of the state of the target device becomes easy.
According to the state classification method of the embodiment, the classification performance of the state of the target device is improved by using the deep learning model 30 in which learning is performed using the first time-series data measured for the target device and the second time-series data that simulates the abnormal state of the target device by changing at least one of the phase and the amplitude of the signal component of the specific frequency of the first time-series data. In this case, in the learning of the deep learning model 30, actual data corresponding to the abnormal state of the target device which is difficult to be collected is not required.
According to the state classification method of the embodiment, the classification performance of the state of the target device is improved by using the deep learning model 30 in which learning is performed using the first time-series data and the second time-series data measured before and after the state of the target device changes with time.
The above embodiment and modifications are examples, and the present disclosure is not limited thereto. For example, in the above embodiment, the LSTM is adopted as the neural network, but instead of the LSTM as the neural network, a CNN capable of time-series processing such as a 3D convolutional neural network (3DCNN), a recurrent neural network (RNN) including the LSTM, or a transformer used for language translation may be adopted. In the classification of the state of the target device, for example, when a load coupled to the target device changes or when an operation mode of the target device changes, the state of the target device may be further classified into a plurality of types of states as a normal state. Although the one-class SVM is used as the classifier, other methods may be used, or abnormal data may be learned. For example, abnormality detection may be performed using a k class SVM classified according to the types of abnormality, and the types of abnormality may also be detected. A state change or a degree of the state change may be detected. In this case, for the detection of the state change and the degree of the state change, similar to the abnormality detection, the one-class SVM or the like may be used. The embodiments and the modifications may be combined as appropriate.
The present disclosure includes substantially the same configurations (such as a configuration having the same function, method, and result and a configuration having the same object and effect) as the configurations described in the embodiments. The present disclosure includes a configuration in which a non-essential portion of the configuration described in the embodiments is replaced. The present disclosure may include a configuration capable of achieving the same operation and effect or a configuration capable of achieving the same object as the configuration described in the embodiments. The present disclosure includes a configuration obtained by adding a known technique to the configuration described in the embodiments.
The following contents are derived from the embodiments and modifications described above.
A state classification method according to an aspect includes:
According to the state classification method, since the deep learning model includes the encoder and the decoder using the recurrent neural network, the measurement data can be used for learning as time-series data. That is, it is possible to use the time-series data storing the phase information as it is for learning without converting the measurement data into a frequency spectrum or a spectrogram in which the phase information is missing. Therefore, the intermediate feature output from the encoder of the deep learning model also include the phase information of the vibration of the device. Therefore, according to the state classification method, the state of the device can be more accurately classified by using the information based on the intermediate feature to grasp a phase change of the vibration of the device that cannot be detected by an RMS value of the frequency spectrum or the spectrogram.
According to the state classification method, it is possible to grasp a feature including non-linearity of the measurement data which is the time-series data by using the deep learning model. Further, according to the state classification method, the deep learning model performs learning for predicting future measurement data, and thus the information of the measurement data can be efficiently compressed and aggregated into the intermediate feature.
In the state classification method according to an aspect,
According to the state classification method, it is possible to accurately classify the state of the device using the deep learning model with high prediction accuracy obtained by inputting and learning time-series data having a large data length.
In the state classification method according to an aspect,
According to the state classification method, by using the deep learning model in which learning is performed such that the cross-correlation among the plurality of elements included in the intermediate feature is small, the information of each feature of the measurement data is stored and easily separated in separate elements of the intermediate feature. By using such intermediate feature, classification and interpretation of the state of the device becomes easy.
In the state classification method according to an aspect,
According to the state classification method, it is possible to accurately classify the state of the device using the deep learning model in which learning is performed such that the cross-correlation is small among the plurality of elements included in the intermediate feature.
In the state classification method according to an aspect,
According to the state classification method, since the phase information included in the measurement data is relatively increased, the phase change of the vibration of the device can be easily grasped, and the state of the device can be more accurately classified.
In the state classification method according to an aspect,
According to the state classification method, a classification performance of the state of the device is improved by using the deep learning model in which learning is performed using the time-series data simulating an abnormal state of the device. In the learning of the deep learning model, actual data corresponding to the abnormal state of the device which is difficult to be collected is not required.
In the state classification method according to an aspect,
According to the state classification method, the classification performance of the state of the device is improved by using the deep learning model in which learning is performed using the time-series data before and after the state of the device changes with time.
A state classification device according to an aspect includes:
According to the state classification device, the state of the device can be more accurately classified by using the information based on the intermediate feature including phase information of the vibration of the device to grasp a phase change of the vibration of the device that cannot be detected by an RMS value of a frequency spectrum or a spectrogram.
A state classification program according to an aspect causing a computer to:
According to the state classification program, the state of the device can be more accurately classified by using the information based on the intermediate feature including phase information of the vibration of the device to grasp a phase change of the vibration of the device that cannot be detected by an RMS value of a frequency spectrum or a spectrogram.
Number | Date | Country | Kind |
---|---|---|---|
2023-007364 | Jan 2023 | JP | national |