This application claims priority to Korean Patent Application No. 10-2018-0112873, filed on 20 Sep. 2018, the entire content of which is incorporated herein by reference.
The present invention relates to a machine learning apparatus and a method based on multi-feature extraction and transfer learning, on which signal characteristics measured from a plurality of sensors are reflected. This invention also relates to an apparatus for performing leak monitoring of plant pipelines using the same.
Recently, as deep learning technologies that imitate the workings of the human brain have evolved greatly, machine learning based on deep learning technologies has been actively applied in various applications such as image recognition and processing, automatic voice recognition, video behavior recognition, natural language processing, etc. It is necessary to construct a learning model specialized to perform machine learning for receiving measured signals from particular sensors for each application and reflecting signal characteristics specific to the corresponding application of these signals.
Meanwhile, cases have been steadily reported that aging of the plant pipelines installed at the time of initial construction has progressed to show symptoms of corrosion, wall thinning, leaks, etc., and accordingly, there is a growing demand for early detection of leaks in such aging pipelines. Relatively inexpensive acoustic sensors have been used as a means to detect such leaks, and currently, equipment for determining leaks based on an experimental result that an acoustic signal in the high frequency range is detected when a leak occurs is commercialized and commonly used.
However, there is difficulty in determining truth of fine leaks due to various mechanical noises or noisy environments occurring in a plant. In addition, because these methods do not allow remote monitoring at all times, there are limitations on early detection of leaks. Accordingly, for early detection of leaks in aging plant pipelines, a data signal processing technique and a continuous leak detection technology using the same that make it possible to detect fine leaks even in noisy environments such as machine operations, etc. is very important. However, development of a methodical system capable of continuously/constantly monitoring leak detection based on signal processing for detection of fine leaks is not sufficient yet.
Therefore, it is an object of the present invention to propose apparatus and method for extracting multiple features from time series data collected from a plurality of sensors and for performing transfer learning on them.
Further, it is another object of the present invention to solve the problems mentioned above by using such apparatus and method proposed in the present invention to perform leak detection in plant pipelines.
In order to solve the problems mentioned above, an aspect of the present invention provides an apparatus/method for performing machine learning based on transfer learning for the extraction of multiple features, which are robust to mechanical noises and other noises, from time series data collected from a plurality of sensors. In particular, there is provided a machine learning apparatus based on multi-feature extraction and transfer learning comprising: a multi-feature extraction unit for extracting multiple features from a data stream for each sensor inputted from the plurality of sensors, wherein the multiple features comprise ambiguity features that have been ambiguity-transformed from characteristics of the input data and multi-trend correlation features extracted for each of multiple trend intervals according to a number of packet sections constituting the data stream for each sensor; a transfer-learning model generation unit for extracting useful multi-feature information from a learning model which has finished pre-learning for the multiple features, for forwarding the extracted multi-feature information to a multi-feature learning unit below so as to generate a learning model that performs transfer learning for each of the multiple features; and the multi-feature learning unit for receiving learning variables from the learning model for each of the multiple features and for performing parallel learning for the multiple features, so as to calculate and output a loss.
According to an embodiment of the machine learning apparatus, the multi-feature extraction unit may comprise an extractor for extracting the ambiguity features. The extractor for ambiguity features may be configured to convert characteristics in a form of sensor data from the data stream transmitted from each of the sensors into an image feature through ambiguity transformation using the cross time-frequency spectral transformation and the 2D Fourier transformation.
Here, the ambiguity feature may comprise a three-dimensional volume feature generated by accumulating two-dimensional features in a depth direction.
Further, according to an embodiment of the machine learning apparatus, the multi-feature extraction unit may comprise a multi-trend correlation feature extraction unit for extracting the multi-trend correlation features. The multi-trend correlation feature extraction unit may be configured to construct column vectors with data extracted during multiple trend intervals consisting of different numbers of packet sections in the data stream for each sensor, and to extract data for each trend interval so that sizes of the column vectors for each trend interval are the same, so as to output the multi-trend correlation features.
Moreover, according to an embodiment of the machine learning apparatus, the learning model generated in the transfer-learning model generation unit may comprise a teacher model for extracting and forwarding information which has finished pre-learning and a student model for receiving the extracted information. Here, the student model may be configured in the same number as the multiple features, and the useful information of the teacher model that has finished pre-learning may be forwarded to a number of student models for the multiple features so as to be learned. As an alternative, the learning model generated in the transfer-learning model generation unit may comprise a teacher model for extracting and forwarding information which has finished pre-learning and a student model for receiving the extracted information. Here, the student model may be configured as a single common model, and the useful information of the teacher model that has finished pre-learning may be forwarded to the single common student model so as to be learned.
In addition, according to an embodiment of the machine learning apparatus, the useful information extracted from the teacher model may be a single piece of hint information corresponding to an output of feature maps comprising learning variable information from a learning data input to any layer. The forwarding of this single piece of hint information may be performed such that a loss function for the Euclidean distance between an output result of feature maps at a layer selected from the teacher model and an output result of feature maps at a layer selected from the student model is minimized.
Furthermore, an embodiment of the machine learning apparatus may further comprise a means for periodically updating the learning models generated in the transfer-learning model generation unit.
Moreover, an embodiment of the machine learning apparatus may further comprise a multi-feature evaluation unit for finally evaluating learning results by receiving results that have been learned from the multi-feature learning unit. And in this case, the machine learning apparatus may further comprise a multi-feature combination optimization unit for repetitively performing combination of the multiple features until an optimal combination of the multiple features according to a loss is acquired based on the learning results inputted in the multi-feature evaluation unit.
In order to solve the problems mentioned above, another aspect of the present invention provides a machine learning method based on multi-feature extraction and transfer learning from data streams transmitted from a plurality of sensors. The method comprises: a multi-feature extraction procedure for extracting multiple features from a data stream for each sensor inputted from the plurality of sensors, wherein the multiple features comprise ambiguity features that have been ambiguity-transformed from characteristics of the input data and multi-trend correlation features extracted for each of multiple trend intervals according to a number of packet sections constituting the data stream for each sensor; a transfer-learning model generation procedure for extracting useful multi-feature information from a learning model which has finished pre-learning for the multiple features, for forwarding the extracted multi-feature information to a multi-feature learning procedure below so as to generate a learning model that performs transfer learning for each of the multiple features; and a multi-feature learning procedure for receiving learning variables from the learning model for each of the multiple features and for performing parallel learning for the multiple features, so as to calculate and output a loss.
Further, in order to solve the problems mentioned above, yet another aspect of the present invention provides an apparatus for detecting fine leaks using a machine learning apparatus based on multi-feature extraction and transfer learning from data streams transmitted from a plurality of sensors.
The apparatus comprises: a multi-feature extraction unit for extracting multiple features from a data stream for each sensor inputted from the plurality of sensors, wherein the multiple features comprise ambiguity features that have been ambiguity-transformed from characteristics of the input data and multi-trend correlation features extracted for each of multiple trend intervals according to a number of packet sections constituting the data stream for each sensor; a transfer-learning model generation unit for extracting useful information from a learning model which has finished pre-learning for the multiple features, for forwarding the extracted useful information to a multi-feature learning unit below so as to generate a learning model that performs transfer learning for each of the multiple features; a multi-feature learning unit for receiving learning variables from the learning model for each of the multiple features and for performing parallel learning for the multiple features, so as to calculate and output a loss; and a multi-feature evaluation unit for finally evaluating whether there is a fine leak by receiving results that have been learned from the learning model generated in the multi-feature learning unit.
The configuration and operation of the present invention mentioned above will be even clearer through specific embodiments described later with reference to accompanying drawings.
The advantages of the present invention may be better understood by those skilled in the art with reference to the accompanying drawings, in which:
Advantages, features, and methods for achieving these will be apparent by referring to embodiments described in detail below as well as the accompanying drawings. However, the present invention is not limited to embodiments described below but may be implemented in various different forms. The embodiments described make the present invention complete and are provided to let a person having ordinary skilled in the art fully understand the scope of the invention, and accordingly, the present invention is defined by what is set forth in the claims.
On the other hand, the terms used herein are to describe various embodiments but not to limit the present invention. Singular forms herein may cover plural forms as well, unless otherwise explicitly mentioned. The term “comprise” or “comprising” used herein is not intended to preclude the existence or addition of one or more further components, steps, operations, and/or elements, in addition to the components, steps, operations, and/or elements preceded by such terms.
Below, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. The embodiment to be described now relates to a method for multi-feature extraction and transfer learning from the information acquired from a plurality of sensors, and to an apparatus for detecting fine leaks in plant pipelines using the multi-feature extraction and transfer learning. When it comes to designating reference numerals for components of each drawing, like numerals are assigned to like components if possible, though they may be shown in different drawings. Further, in describing the present invention, specific descriptions on related known components or functions will not be provided if such descriptions may obscure the subject matter of the present invention.
The multi-feature extraction unit 20 comprises an ambiguity feature extractor 22 and a plurality of multi-trend correlation feature extractors 24, and receives time series data from the plurality of sensors 10 to extract image features on which the characteristics for detecting fine leaks are well reflected and which are suitable for deep learning.
In this case, the output P of the cross time-frequency spectral transformer 223 in
P=X′⊗conj(Y′) Eq. 1
where ⊗ represents the element-wise multiplication of two-dimensional matrices, and conj(*) represents the complex conjugate calculation.
It can be observed that: a chirp signal (
On the other hand, in the case of collecting data from the sensor 10 in a very noisy environment such as mechanical noises, other noises, etc., the feature of fine leaks in the shape a point may not appear in an image even in the case of detection of a fine leak, and accordingly, a recognition error may occur when applying to machine learning.
In order to solve such a problem, a plurality of two-dimensional ambiguity image features 231 of W (width)×H (height) extracted from each sensor pair S(#1,#2), . . . , S(# i,# j) are accumulated in the depth (D) direction and combined to extract a three-dimensional image feature as can be seen in
Next, in a stream in which data for each sensor is configured to have a predetermined packet period 241 and a packet section 243, that is, in an mth order sensor data stream 245 (m=1, 2, . . . , M) as shown in
Eq. 2
where <●, ●> represents an inner product of two vectors, ai represents each vector of the matrix A, and gij represents each element of the matrix G. Therefore, the matrix G representing the multi-trend correlation image feature 247 according to Equation 2 presents correlation information for each trend by each sensor, in an image.
When creating the multi-trend correlation image feature 247, a plurality of multi-trend correlation image features 247 may be extracted (feature #2˜feature # N) by performing various signal processing processes, such as: 1) the original data inputted for each trend may be used as they are to create an image feature by applying the resampling and Gramian operation described above thereto; 2) the original data inputted for each trend are converted to RMS (root mean square) data, followed by applying the resampling and Gramian operation described above thereto to create an image feature; 3) the original data inputted for each trend are converted to frequency spectral data, followed by applying the resampling and Gramian operation described above thereto to create an image feature, etc.
Referring back to
The multi-feature transfer learning proposed in the present invention may be configured such that, as shown in
More specifically, for example, the useful information extracted from the teach model 32 which has finished pre-learning may be defined as a single piece of hint information corresponding to an output of feature maps 323 including learning-variable (weights) information from input learning data 320 to any particular layer 321, as shown in
A transfer learning method for forwarding such a single piece of hint information is performed, referring to
The extraction of a single piece of hint information and learning method in
Meanwhile, along with the hint information described above, matrix G′ representing the hint correlation using the Gramian operation for the output of the feature maps as in Equation 3 below may be used as the extracted information for the teacher model.
where F presents a matrix obtained by reconstructing the feature map output into a two-dimensional matrix, and gij represents each element of the matrix G′.
Therefore, when forwarding the extracted information from the teacher model to the student model, the hint information described with reference to
On the other hand, for the learning data used for transfer learning, N number of volume features 233 extracted in the multi-feature extraction unit 20 shown in
A method for selecting a plurality of layers 321 from the teacher model 32 which has finished pre-learning and for extracting multiple pieces of hint information corresponding to the layers 321, so as to forward such multiple pieces of hint information to the multi-feature learning unit 40 shown in
The simultaneous learning method for multiple pieces of hint information is a method for learning simultaneously such that for L number of multi-layer pairs 321-1, 321-2, . . . , 321-L and 341-1, 341-2, . . . , 341-L selected from the teacher model 32 and the student model 34 as shown in
The sequential learning method for multiple pieces of hint information is a method for sequentially forwarding hint information one by one from the lowest layer to the highest layer for the L multi-layer pairs selected in the same way as in
Meanwhile, for the information extracted from the teacher model 32 when extracting the multiple pieces of hint information, both the hint information and hint correlation information may be applicable to the extraction of multiple pieces of hint information as described with respect to the extraction and forwarding of a single piece of hint information, and also when forwarding the multiple pieces of hint information, the hint information may be forwarded alone for each layer, the hint correlation information may be forwarded alone for each layer, or weights may be added to the two pieces of information to be forwarded for each layer.
The learning data used for transfer learning in this case may also use the N volume features 233 extracted in the multi-feature extraction unit 20 shown in
On the other hand, the above teacher model 32 and the student model 34 for transfer learning may periodically (according to a period defined by the user) collect learning data so as to perform updates. More specifically, the existing teacher model 32 may further learn using additional data for a corresponding period to update, and the existing student models 34 may also further learn using the transfer learning technique described in the present invention to perform a new update. Another method of updating is that if there is a change in the data distribution to be collected, the data which have changed may be collected to perform further learning and to update models. Moreover, if the data distribution to be collected departs from the range defined by the user, the above update procedure may be performed. In an embodiment, a similarity may be measured using the Kullback-Leibler divergence for the histogram distribution of the image features to be inputted to the transfer-learning model generation unit 30, so as to perform a model update through transfer learning.
More specifically, in the case of performing transfer learning in the above method shown in
Accordingly, the N volume features outputted from the multi-feature extraction unit 20 described above are received, and parallel learning is performed with the N learners resulting from transfer learning for each volume feature to calculate the loss. At this time, the loss can be calculated using results such as the learning model, accuracy, and complexity that have been learned in the learner.
As described above, the present invention may be implemented in an aspect of an apparatus or a method, and in particular, the function or process of each component in the embodiments of the present invention may be implemented as a hardware element comprising at least one of a DSP (digital signal processor), a processor, a controller, an ASIC (application specific IC), a programmable logic device (such as an FPGA, etc.), other electronic devices and a combination thereof. It is also possible to implement in combination with a hardware element or as independent software, and such software may be stored in a computer-readable recording medium.
The description provided above relates to multi-feature extraction and transfer learning from the information acquired from a number of sensors, and hereinafter, an apparatus for detecting fine leaks in plant pipelines using such multi-feature extraction and transfer learning will be described.
Returning to
Last,
According to multi-feature extraction and transfer learning of the present invention, optimal performance can be achieved by collecting time series data from a plurality of sensors, performing multi-feature ensemble learning based on transfer learning after extracting image features for fine leaks from the time series data, and evaluating it. In particular, according to the apparatus and method for detecting fine leaks based on such multi-feature extraction and transfer learning, early detection of fine leaks and thus, optimum performance can be achieved. Specifically, even if there are mechanical noises, or other ambient noises in a plant environment, it is possible to greatly improve the reliability of leak detection by extracting image/volume features on which the signal characteristics of fine leaks are well reflected through the imaging signal processing technique proposed in the present invention. In addition, by extracting image features of fine leaks suitable for deep learning in pattern recognition, early detection and continuous monitoring of fine leaks based on data is possible through from the step of collecting data using a plurality of sensors, extraction of features, and ensemble optimization learning based on transfer learning.
In the above, though the configuration of the present invention has been described in detail through the preferred embodiments of the present invention, it will be appreciated by those having ordinary skill in the art to which the invention pertains that the present invention may be implemented in other specific forms that are different from those disclosed in the specification without changing the spirit or essential features of the present invention. It should be understood that the embodiments described above are exemplary in all aspects, and are not intended to limit the present invention. The scope of protection of the present invention is to be defined by the claims that follow rather than by the detailed description above, and all changes and modified forms derived from the claims and its equivalent concepts should be construed to fall within the technical scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
10-2018-0112873 | Sep 2018 | KR | national |