The present invention relates to the technical field of fault detection and diagnosis, in particular to a method for detecting abnormal working conditions of multi-view data based on feature regression.
With the development of science and technological progress, industrial systems have become increasingly large and complex. Once these systems fail, they will cause huge property damage and casualties. Therefore, improving reliability and maintainability of complex dynamic systems has become a primary issue for enterprise development. A data-based fault diagnosis technology refers to establishment of appropriate mathematical models for industrial processes through various data processing methods, statistical modeling methods, etc. based on massive data acquired in actual industrial production processes, thereby achieving the purpose of detecting and diagnosing whether faults occur in the processes. Research on this technology not only has much theoretical significance, but also has a wide range of application backgrounds, and thus it is an important development direction for a future fault diagnosis technology.
An industrial production process is a complex and changeable process, and occurrence of faults is caused by multiple reasons. Data from a single data source can only detect a single abnormal working condition during fault detection, and cannot comprehensively and accurately detect the entire production process. Therefore, collaborative modeling should be performed on multi-source heterogeneous data generated in the production process, and a comprehensive mathematical model at a data structure level is established to diagnose the production process comprehensively and improve its automation level. However, inconsistency of data categories poses significant difficulties for modeling.
The technical problem to be solved by the present invention is to provide a method for detecting abnormal working conditions of multi-view data based on feature regression to address shortcomings of the prior art. By collecting data that can be acquired in production processes, a big data pool is established. By analyzing the data in the data pool, a general collaborative modeling algorithm based on multi-source heterogeneous data is established to achieve comprehensive diagnosis of the production processes.
In order to solve the above technical problems, the present invention adopts the following technical solutions: the method for detecting abnormal working conditions of multi-view data based on feature regression includes the following steps.
Step 1: acquiring sample data under different working conditions in an actual industrial production process, denoted as {Xji|i=1, 2, . . . , w; j=1, 2, . . . , n}, where Xji is jth sample data in an ith view, w is the number of views, and n is the number of samples under different views.
Step 2: preprocessing the acquired sample data under different working conditions.
If the sample data Xji in the ith view is image data, firstly performing graying processing and normalization processing on the image data, then obtaining an average value {
If the sample data Xji∈iq×1 in the ith view is vector data, the preprocessed sample data is:
Wherein Xjri represents the preprocessed sample data of the sample data Xji, xjqi represents qth variable of the sample data Xji,
Step 3: after preprocessing the sample data, establishing the method for detecting abnormal working conditions based on the multi-view data by the feature regression method, wherein an objective function model is:
Wherein Xli∈id×m is lth preprocessed sample data in the ith view, d and m are dimensions of the sample data Xli, and uij∈il×d and vij∈im×1 are left and right projection vectors, respectively; Xfi and Xgi are fth preprocessed sample data and gth preprocessed sample data in the ith view, respectively; pj is a regression center of the sample data in all views under a jth working condition; ylij is a label of lth sample data in the ith view, if Xli belongs to the jth working condition, ylij=1, otherwise, ylij=0; λ1 and λ2 are both coefficient parameters.
Wherein
for i=1, 2, . . . , n, largest t data labels corresponding to ωl constitute a set {Ω}, {Cfi} is a label set of sample data in a same category as the preprocessed sample data Xfi.
Step 3.1: solving the objective function model.
The above objective function model (3) is decomposed into w×r optimization problems, namely rewritten as:
Step 3.2: fixing vij and pj, and solving uij.
Letting
Taking a partial derivative of formula (8) with respect to uij and letting the partial derivative be equal to 0 to obtain:
Where I1 is a unit matrix of d×d.
Step 3.3: fixing uij and pj, and solving vij.
Taking a partial derivative of formula (8) with respect to vij and letting the partial derivative be equal to 0 to obtain:
Where I2 is a unit matrix of m×m.
Step 3.4: fixing uij and vij, and solving pj.
Further rewriting the rewritten objective function model (8) as:
Taking a partial derivative of formula (12) with respect to pj and letting the partial derivative be equal to 0 to obtain:
Step 3.5: using the obtained uij, vij and pj for next iteration and performing loop iteration until the objective function model converges to a minimum value or satisfies termination conditions.
Step 3.5.1: solving ωl, {Ω} and {Gfi}, respectively, through formulas (4), (5) and (6).
Step 3.5.2: solving uij through formula (10).
Step 3.5.3: solving vij through formula (11).
Step 3.5.4: solving pj through formula (13).
Step 3.5.5: determining whether the termination conditions are satisfied, if yes, outputting uij, vij and pj, or else, repeating Steps 3.5.1-3.5.4.
Step 4: performing on-line abnormal working condition detection by the method for detecting abnormal working conditions based on the multi-view data.
Acquiring sample data {Xnewi|i=1, 2, . . . , w} at a certain time, and performing preprocessing by the data preprocessing method in Step 2; performing dimensionality reduction on the sample data at the certain time by using the left projection vector uij and the right projection vector vij obtained from Step 3, and constituting a vector ynew after the dimensionality reduction; selecting data category vectors y* according to requirements, wherein for the data category vectors, since sample point data at every time can only belong to one category, for each view, only one data category vector is 1, and the rest are 0; therefore, for the data category vectors y*, w 1s exist, all other w×(r−1) variables are 0; wherein
Based on similarity between ynew and y*, determining a category of working conditions at the certain time, and using cosine similarity as a detection index, wherein a calculation method of the detection index is:
By comparing values of detection indexes of different data category vectors y* and ynew, determining working conditions at a current time, wherein the smaller the values of the detection indexes, the greater a correlation between y* and ynew; due to different data category vectors y* under different working conditions, working conditions corresponding to the data category vector with a smallest value of the detection index are selected as working conditions at the current time.
The beneficial effect of adopting the above technical solutions is that in the method for detecting abnormal working conditions of multi-view data based on feature regression provided by the present invention, a new data preprocessing method is adopted, and historical data information is fully utilized, thereby making differences between different categories of data greater after preprocessing, and providing more reliable information for subsequent working condition recognition. A general mathematical model is established for the preprocessed data acquired by different sensors. The left and right projection vectors solved by the model can make similar samples have better clustering effects in low dimensional space. By comparing a correlation between vectors after dimensionality reduction and various category vectors, the production working conditions at the current time can be quickly and accurately recognized.
In drawings, 1: transformer; 2: short mesh; 3: electrode lifting apparatus; 4: electrode; 5: furnace shell; 6: vehicle body; 7: electric arc; and 8: furnace burden.
The further detailed description of embodiments of the present invention will be provided in combination with the accompanying drawings and embodiments. The following embodiments are used to illustrate the present invention, but are not intended to limit the scope of the present invention.
A production process of an electric melting magnesium oxide furnace (referred to as electric melting magnesium furnace) is shown in
In the embodiment, the method for detecting abnormal working conditions of multi-view data based on feature regression, as shown in
Step 1: sample data under different working conditions are acquired in an actual industrial production process, denoted as {Xji|i=1, 2, . . . , w; j=1, 2, . . . , n}, wherein Xji is jth sample data in an ith view, w is the number of views, and n is the number of samples under different views.
In the embodiment, sensors are used for acquiring the sample data of the electric melting magnesium oxide furnace under two working conditions, wherein w=2, n=52, and the number of sample points under normal working conditions is 36.
Step 2: the acquired sample data under different working conditions are preprocessed.
If the sample data Xji in the ith view is image data, firstly graying processing and normalization processing are performed on the image data, then an average value {
If the sample data Xji∈iq×1 in the ith view is vector data, the preprocessed sample data is:
Wherein Xjri represents the preprocessed sample data of the sample data Xji, xjqi represents a qth variable of the sample data Xji,
In the embodiment, original image data are 188×344, and a dimension of the physical variable data is 3×1, β1=4, β2=2.
Step 3: after preprocessing the sample data, the method for detecting abnormal working conditions based on the multi-view data is established by the feature regression method, where an objective function model is:
Wherein Xli Σid×m is lth preprocessed sample data in the ith view, d and m are dimensions of the sample data Xli, and uijΣil×d and vij∈im×1 are the left and right projection vectors, respectively; Xfi and Xgi are fth preprocessed sample data and gth preprocessed sample data in the ith view, respectively; pj is a regression center of the sample data in all views under a jth working condition; ylij is a label of lth sample data in the ith view, if Xli belongs to the jth working condition, ylij=1, otherwise, ylij=0; λ1 and λ2 are both coefficient parameters.
Wherein
for 1=1,2 . . . , n, largest t data labels corresponding to ωl constitute a set {Ω}, {Cfi} is a label set of sample data in a same category as the preprocessed sample data Xfi.
Here, ωl is a solution of the lth sample data. The total number of acquired sample data is n, that is to say, a total of n ωs exist, with different subscripts. The largest t ωls selected above are selected sample points with poor effects in the modeling process. These sample points constitute a set {Ω}, it is assumed that fis the sample point included in {Ω}, that is to say, the preprocessed sample data Xfi is the sample points with poor effects in the modeling process, but the preprocessed sample data Xgi is the sample points belonging to the same category as the preprocessed sample data Xfi. These sample points constitute a set {Cfi}, the meaning of the {Gfi}={Cfi}−{Ω} is selecting the data belonging to the same category as the preprocessed sample data Xfi and the sample points that are not included in the set {Ω} to constitute the set. Mi is a data matrix constructed for preventing overfitting.
A first term of the objective function model is to find projection vectors uij and vij that enable each sample data to approach the label of a sample after dimensionality reduction; a second term uses the square of a Frobenius norm of the matrix Mi as a regularization term to reduce model complexity and minimize overfitting risks; a third item is to find t sample data with unclear classification effects, and increase a distance weight of the sample data with other similar sample data to continuously approach correct category labels, while also have the effect of clustering the data after being mapped by the data of the same category.
In the embodiment, λ1=λ2=1, t=10.
Step 3.1: the objective function model is solved.
The above objective function model (3) is decomposed into w×r optimization problems, namely rewritten as:
Step 3.2: vij and pj are fixed, and uij is solved.
A partial derivative of formula (8) with respect to uij is taken and the partial derivative is let to be equal to 0 to obtain:
Where Il is a unit matrix of d×d.
Step 3.3: uij and pj are fixed, and vij is solved.
A partial derivative of formula (8) with respect to vij is taken and the partial derivative is let to be equal to 0 to obtain:
Where I2 is a unit matrix of m×m.
Step 3.4: uij and vij are fixed, and pj is solved.
The rewritten objective function model (8) is further rewritten as:
A partial derivative of formula (12) with respect to pj is taken and the partial derivative is let to be equal to 0 to obtain:
Step 3.5: the obtained uij, vij and pj are used for next iteration and loop iteration is performed until the objective function model converges to a minimum value or satisfies termination conditions.
Step 3.5.1: ωl, {Ω} and {Gfi} are solved respectively, through formulas (4), (5) and (6).
Step 3.5.2: uij is solved through formula (10).
Step 3.5.3: vij is solved through formula (11).
Step 3.5.4: pj is solved through formula (13).
Step 3.5.5: whether the termination conditions are satisfied is determined, if yes, uij, vij and pj are output, or else, Steps 3.5.1-3.5.4 are repeated.
In the embodiment, the termination conditions are that a difference value between objective functions of two adjacent iterations is less than a set threshold (less than 10−15) or the number of iterations reaches an upper limit value, and in the embodiment, it is 50 iterations.
Step 4: on-line abnormal working condition detection is performed by the method for detecting abnormal working conditions based on the multi-view data.
Sample data {Xnewi|i=1, 2, . . . , w} at a certain time are acquired, and preprocessing is performed by the data preprocessing method in Step 2; dimensionality reduction is performed on the sample data at the certain time by using the left projection vector uij and the right projection vector vij obtained from Step 3, and a vector ynew after the dimensionality reduction is constituted; data category vectors y* are selected according to requirements, wherein for the data category vectors, since sample point data at every time can only belong to one category, for each view, only one data category vector is 1, and the rest are 0; therefore, for the data category vectors y*, w 1s exist, all other w×(r−1) variables are 0; wherein
Based on similarity between ynew and y*, a category of working conditions is determined at the certain time, and cosine similarity is used as a detection index, where a calculation method of the detection index is:
By comparing values of detection indexes of different data category vectors y* and ynew, working conditions at a current time are determined, wherein the smaller the values of the detection indexes, the greater a correlation between y* and ynew; due to different data category vectors y* under different working conditions, working conditions corresponding to the data category vector with a smallest value of the detection index are selected as working conditions at the current time.
In the embodiment, in the sample data, a total of 498 sample points are acquired in the continuous production process of electric molten magnesium oxide, wherein, anomalies exist in 96th-110th sample points and 202nd-357th sample points of the image data, and anomalies exist in the 99th-122nd and 348th-433rd sample points of physical variable data. The image data and preprocessed image data are shown in
Finally, it should be noted that the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit it. Although the present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that it is still possible to modify the technical solution recorded in the aforementioned embodiments, or to equivalently replace some or all of the technical features thereof, and these modifications or replacements do not make the essence of the corresponding technical solution deviate from the scope limited by the claims of the present invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2022/071326 | 1/11/2022 | WO |