The present disclosure concerns apparatus and methods for predicting fault occurrence in mechanical systems and electrical systems.
Mechanical systems, such as gas turbine engines, usually include sensors for sensing the condition of the mechanical system. The sensor data may be used within a machine learning method to automatically distinguish between faulty conditions and non-faulty conditions to predict when a fault may occur in the mechanical system. The prediction of fault occurrence may be used to predict the remaining useful life of the mechanical system.
In some mechanical systems, the dataset provided by the sensors is imbalanced in that the majority of data points relate to non-faulty conditions and a minority of data points relate to faulty conditions. For example, in gas turbine engines, the vast majority of data points relate to non-faulty conditions, and a small minority of data points relate to faulty conditions.
Machine learning methods usually perform poorly on imbalanced datasets since they tend to identify all data as belonging to the majority class (that is, non-faulty for gas turbine engines). This may lead to an inaccurate prediction of fault occurrence of the mechanical system. In order to compensate for the inaccuracy of the prediction, the operator of the mechanical system may increase the frequency at which the mechanical system is maintained. However, this may increase the maintenance cost to the operator and may result in the mechanical system being placed out of use more frequently than necessary.
According to various, but not necessarily all, embodiments there is provided a method of predicting fault occurrence in a mechanical system, the method comprising: receiving a first dataset of mechanical system condition data, the first dataset being imbalanced by having more data points in a first category than in a second category; generating a plurality of chromosomes from the second category data points in the first dataset; the plurality of chromosomes including information to enable the creation of new datasets; generating a second dataset using the plurality of chromosomes and an evolutionary algorithm, the second dataset being less imbalanced than the first dataset; and predicting fault occurrence in the mechanical system using the second dataset and a machine learning algorithm.
According to various, but not necessarily all, embodiments there is provided a method of balancing a dataset, the method comprising: receiving a first dataset, the first dataset being imbalanced by having more data points in a first category than in a second category; generating a plurality of chromosomes from the second category data points in the first dataset; the plurality of chromosomes including information to enable the creation of new datasets; generating a second dataset using the plurality of chromosomes and an evolutionary algorithm, the second dataset being less imbalanced than the first dataset.
Generating the second dataset may include: iteratively generating a plurality of datasets using the evolutionary algorithm and the plurality of generated chromosomes; and selecting the second dataset from the plurality of iteratively generated datasets.
The method may further comprise: generating a plurality of second datasets from a subset of the plurality of chromosomes; training a plurality of classifiers using the plurality of second datasets; combining the plurality of classifiers to form an ensemble; and wherein predicting fault occurrence in the mechanical system uses the ensemble.
The evolutionary algorithm may be a single objective evolutionary algorithm.
The evolutionary algorithm may be a multi-objective evolutionary algorithm.
The information to enable the creation of new datasets may include an interpolation factor.
The information to enable the creation of new datasets may include information for the number of new data points to be generated within a hypervolume.
The information to enable the creation of new datasets may include a probability landscape to enable generation of new data points.
The information to enable the creation of new datasets may only encode parameters for defining clusters and a data generation method.
The first category may be a non-faulty condition of the mechanical system and the second category is a faulty condition of the mechanical system.
The method may further comprise controlling presentation of the predicted fault occurrence in the mechanical system.
The mechanical system may comprise a gas turbine engine.
According to various, but not necessarily all, embodiments there is provided apparatus for predicting fault occurrence in a mechanical system, the apparatus comprising: processor circuitry configured to: receive a first dataset of mechanical system condition data, the first dataset being imbalanced by having more data points in a first category than in a second category; generate a plurality of chromosomes from the second category data points in the first dataset; the plurality of chromosomes including information to enable the creation of new datasets; generate a second dataset using the plurality of chromosomes and an evolutionary algorithm, the second dataset being less imbalanced than the first dataset; and predict fault occurrence in the mechanical system using the second dataset and a machine learning algorithm.
According to various, but not necessarily all, embodiments there is provided apparatus for balancing a dataset, the apparatus comprising processor circuitry configured to: receive a first dataset, the first dataset being imbalanced by having more data points in a first category than in a second category; generate a plurality of chromosomes from the second category data points in the first dataset; the plurality of chromosomes including information to enable the creation of new datasets; generate a second dataset using the plurality of chromosomes and an evolutionary algorithm, the second dataset being less imbalanced than the first dataset.
The processor circuitry may be configured to iteratively generate a plurality of datasets using the evolutionary algorithm and the plurality of generated chromosomes; and select the second dataset from the plurality of iteratively generated datasets.
The processor circuitry may be configured to: generate a plurality of second datasets from a subset of the plurality of chromosomes; train a plurality of classifiers using the plurality of second datasets; combine the plurality of classifiers to form an ensemble. Predicting fault occurrence in the mechanical system may use the ensemble.
The evolutionary algorithm may be a single objective evolutionary algorithm.
The evolutionary algorithm may be a multi-objective evolutionary algorithm.
The information to enable the creation of new datasets may include an interpolation factor.
The information to enable the creation of new datasets may include information for the number of new data points to be generated within a hypervolume.
The information to enable the creation of new datasets may include a probability landscape to enable generation of new data points.
The information to enable the creation of new datasets may only encode parameters for defining clusters and a data generation method.
The first category may be a non-faulty condition of the mechanical system and the second category may be a faulty condition of the mechanical system.
The processor circuitry may be configured to control an output device to present the predicted fault occurrence in the mechanical system.
The mechanical system may comprise a gas turbine engine.
According to various, but not necessarily all, embodiments there is provided a system comprising: a mechanical system; apparatus as described in any of the preceding paragraphs; and one or more sensors configured to sense a condition of the mechanical system, and to provide the first dataset to the apparatus.
According to various, but not necessarily all, embodiments there is provided a computer program that, when read by a computer, causes performance of the method as described in any of the preceding paragraphs.
According to various, but not necessarily all, embodiments there is provided a non-transitory computer readable storage medium comprising computer readable instructions that, when read by a computer, causes performance of the method as described in any of the preceding paragraphs.
The skilled person will appreciate that except where mutually exclusive, a feature described in relation to any one of the above aspects of the invention may be applied mutatis mutandis to any other aspect of the invention.
Embodiments of the invention will now be described by way of example only, with reference to the Figures, in which:
In the following description, the terms ‘connected’ and ‘coupled’ mean operationally connected and coupled. It should be appreciated that there may be any number of intervening components between the mentioned features, including no intervening components between the mentioned features.
In more detail,
The mechanical system 12 may be any apparatus or device that includes mechanical components and may also include electrical components. For example the mechanical system 12 may be (but is not limited to) a gas turbine engine, an internal combustion engine, a wind turbine, a hydro-electric turbine. The mechanical system 12 may be a module of an apparatus or a device. As used herein, the wording ‘module’ refers to a device or apparatus where one or more features are included at a later time, and possibly, by another manufacturer or by an end user. For example, where the mechanical system 12 is a module of a gas turbine engine, the mechanical system 12 may be a turbine module or a compressor module. The mechanical system 12 may be a part or a component of an apparatus or device. For example, where the mechanical system 12 is a component of a gas turbine engine, the mechanical system 12 may be (for example) a shaft, a fan, a disc, or a blade of the gas turbine engine. In other examples, the system 10 may include an electrical system 12 that comprises or consists of electrical components. For example, the electrical system 12 may be the electrical system of a gas turbine engine, the electrical system of an aircraft, or may be the electrical system of a power plant.
The apparatus 14 includes processor circuitry 18, a user input device 20, and an output device 22. In some examples, the apparatus 14 may be a module. Where the apparatus 14 is a module, the apparatus 14 may only include the processor circuitry 18, and the remaining features may be added by another manufacturer, or by an end user.
The processor circuitry 18, the user input device 20, the output device 22 and the sensors 16 may be coupled to one another via a wireless link and may consequently comprise transceiver circuitry and one or more antennas to enable wireless communication. Additionally or alternatively, the processor circuitry 18, the user input device 20, the output device 22 and the sensors 16 may be coupled to one another via a wired link and may consequently comprise interface circuitry (such as a Universal Serial Bus (USB) socket). It should be appreciated that the processor circuitry 18, the user input device 20, the output device 22, and the sensors 16 may be coupled to one another via any combination of wired and wireless links.
The processor circuitry 18 may comprise any suitable circuitry to cause performance of the methods described herein and as illustrated in
By way of an example, the processor circuitry 18 may comprise at least one processor 24 and at least one memory 26. The memory 26 stores a computer program 28 comprising computer readable instructions that, when read by the processor 24, causes performance of the methods described herein, and as illustrated in
The processor 24 may be located on the mechanical system 12, or may be located remote from the mechanical system 12, or may be distributed between the mechanical system 12 and a location remote from the mechanical system 12. The processor 24 may include at least one microprocessor and may comprise a single core processor, multiple processor cores (such as a dual core processor or a quad core processor) or may comprise a plurality of processors (at least one of which may comprise multiple processor cores).
The memory 26 may be located on the mechanical system 12, or may be located remote from the mechanical system 12, or may be distributed between the mechanical system 12 and a location remote from the mechanical system 12. The memory 26 may be any suitable non-transitory computer readable storage medium, data storage device or devices, and may comprise a hard disk and/or solid state memory (such as flash memory). The memory 26 may be permanent non-removable memory, or may be removable memory (such as a universal serial bus (USB) flash drive).
The computer program 28 may be stored on a non-transitory computer readable storage medium 32. The computer program 28 may be transferred from the non-transitory computer readable storage medium 32 to the memory 26. The non-transitory computer readable storage medium 32 may be, for example, a USB flash drive, a compact disc (CD), a digital versatile disc (DVD) or a Blu-ray disc. In some examples, the computer program 28 may be transferred to the memory 26 via a wireless signal 34 or via a wired signal 34.
The user input device 20 may comprise any suitable device for enabling an operator to at least partially control the apparatus 14. For example, the user input device 20 may comprise one or more of a keyboard, a keypad, a touchpad, a touchscreen display, and a computer mouse. The processor circuitry 18 is configured to receive signals from the user input device 20.
The output device 22 may be any suitable device for conveying information to a user. For example, the output device 22 may be a display (such as a liquid crystal display, or a light emitting diode display, or an active matrix organic light emitting diode display, or a thin film transistor display, or a cathode ray tube display), and/or a loudspeaker, and/or a printer (such as an inkjet printer or a laser printer). The processor circuitry 18 is configured to provide a signal to the output device 22 to cause the output device 22 to convey information to the user.
The at least one sensor 16 is configured to sense at least one condition of the mechanical system 12. The processor circuitry 18 is configured to receive the data from the at least one sensor 16 and may store the data as a dataset in the memory 26 (where the dataset is a collection of data received from the at least one sensor 16 over a period of time). The sensors 16 may comprise any suitable sensor or combination of sensors. For example, the sensors 16 may be configured to sense temperature, pressure, velocity, acoustic emissions, electromagnetic emissions, of a part of the mechanical system 12.
It should be appreciated that the methods illustrated in
The operation of the system 10 according to various examples is described in the following paragraphs with reference to
At block 36, the method includes receiving a first dataset of mechanical system condition data, the first dataset being imbalanced by having more data points in a first category than in a second category. For example, the processor circuitry 18 may receive a first dataset from the at least one sensor 16 that includes condition data of a gas turbine engine 12. The received first dataset is imbalanced and includes a greater number of data points that represent a non-faulty condition than data points that represent a faulty condition.
At block 38, the method includes generating a plurality of chromosomes from the second category data points in the first dataset. The plurality of generated chromosomes includes information to enable the creation of new datasets. For example, the information to enable the creation of new datasets may include an interpolation factor, information for the number of new data points to be generated within a hypervolume, or a probability landscape to enable generation of new data points. The length of the chromosomes may not be restricted to be equal to the number of data points within the received first dataset.
In more detail, the information enables the apparatus 14 to generate new data points from the second category data points in the first dataset. For example, the processor circuitry 18 may separate the received first dataset into first category data points and second category data points. The processor circuitry 18 may then generate a plurality of chromosomes from the second category data points in the first dataset received from the sensors 16.
At block 40, the method includes generating a second dataset using the plurality of chromosomes and the evolutionary algorithm 31, the second dataset being less imbalanced than the first dataset by comprising more second category data points than the first dataset. In some examples, block 40 may include iteratively generating a plurality of datasets using the evolutionary algorithm 31 and the plurality of generated chromosomes, and then selecting the second dataset from the plurality of iteratively generated datasets. For example, the processor circuitry 18 may generate the second dataset using the plurality of chromosomes generated in block 38 and the evolutionary algorithm 31.
The evolutionary algorithm may be a single objective evolutionary algorithm where the second dataset is optimized for a single evaluation metric (for example, accuracy, precision, recall, Area Under the Curve (AUC), Geometric Mean (G-Mean), and so on. The evolutionary algorithm may alternatively be a multi-objective evolutionary algorithm (for example, multi-objective evolutionary algorithm based on decomposition (MOEA/D), non-dominated sorting genetic algorithm-11 (NSGA-II), and so on.
At block 42, the method includes predicting fault occurrence in the mechanical system 12 using the second dataset and the machine learning tool 30. For example, the processor circuitry 18 may predict fault occurrence in the mechanical system 12 using the second dataset generated at block 40 and the machine learning tool 30. In some examples, the machine learning tool 30 may use the second dataset generated at block 40 to train a classifier to obtain better accuracy in predicting fault occurrence in the mechanical system 12.
Where the mechanical system 12 is a gas turbine engine, the processor circuitry 18 may predict fault occurrence in the gas turbine engine. Where the mechanical system 12 is a module of a gas turbine engine (such as a turbine module), the processor circuitry 18 may predict fault occurrence in the module. Where the mechanical system 12 is a component of a gas turbine engine (such as a fan blade for example), the processor circuitry 18 may predict fault occurrence in the component.
The fault occurrence of the mechanical system 12 predicted in block 42 using the second dataset may be more accurate than where fault occurrence is predicted using the first dataset. The second dataset being more balanced than the first dataset enables the machine learning tool 30 to more accurately categorise a data point as being in the first category (for example, a non-faulty condition) or in the second category (for example, a faulty condition).
At block 44, the method includes controlling presentation of the predicted fault occurrence in the mechanical system. For example, the processor circuitry 18 may control a display of the output device 22 to display the predicted fault occurrence of the mechanical system 12 to an operator. By way of another example, the processor circuitry 18 may control a printer of the output device 22 to print the predicted fault occurrence of the mechanical system 12 on a printing medium (such as paper) for viewing by an operator. The operator may then schedule maintenance of the mechanical system 12 to replace or repair the mechanical system 12.
At block 36, the processor circuitry 18 receives an imbalanced first dataset from the sensors 16. For example, the processor circuitry 18 may receive the first dataset illustrated in
In more detail,
At block 56, the method includes splitting the first dataset received at block 36 into a training dataset and a validation dataset. For example, the processor circuitry 18 may split the first dataset into a training dataset and a validation set using random sampling, stratified sampling, K-fold cross validation, or stratified K-fold cross validation. The use of stratified sampling to split the first dataset may advantageously retain the original ratio of second category data points to first category data points.
At block 58, the processor circuitry 18 generates a plurality of chromosomes from the second category data points in the first dataset. The chromosomes include information to enable the creation of new datasets. Performance of block 58 results in the random generation of an initial population of chromosomes. The total number of chromosomes generated is dependent on the population size defined by the operator (for example, the operator may input the population size using the user input device 20). The chromosomes in the initial population may be any one of, or combination of, the chromosomes illustrated in
At block 59, the processor circuitry 18 determines fitness values of individual chromosomes in the population using the initial population of chromosomes generated in block 58, the validation dataset, and the training dataset from block 56. The processor circuitry 18 may first generate new datasets from the chromosomes as described in the following paragraphs.
In some examples, the processor circuitry 18 may generate new second category data points that compensate for the deficit in the number of second category data points in the first dataset relative to the number of first category data points. In other examples, the processor circuitry 18 may generate new second category data points that replace the original second category data points in the first dataset and have the same number (or a similar number) of data points as the first category data points. An operator may operate the user input device 20 to select one of the above mentioned options for generating second category data points.
In the following example, the processor circuitry 18 is configured to generate new second category data points that compensate for the deficit in the number of second category data points in the first dataset relative to the number of first category data points (that is, the original second category data points are retained in the new dataset).
x
i,new
=x
i,1+αi(xi,2−xi,1)
In examples where the feature can only take either a range of values (for example, integers, binary, or real numbers within a range), interpolation between two existing data points may result in an invalid new data point. In such examples, an additional method block may be required to ensure the new data point complies with the expected value type. This may be achieved by rounding and thresholding.
By way of an example with reference to
In the following example, the processor circuitry 18 is configured to generate new second category data points that replace the original second category data points in the first dataset and have the same number (or a similar number) as the number of first category data points.
As described in the preceding paragraphs (and as illustrated in
x
1,new
=x
i,1+αi(xi,2−xi,1)
In some examples, all chromosomes from each population generation is translated into a balanced training dataset. Each of these balanced training datasets are used to train a machine learning algorithm. The performance (for example, accuracy) of the machine learning algorithm on the validation set is then assigned as the fitness value of the respective chromosome. After iterating through a number of generations, the fitness values gradually improve. At the end of the method, the chromosome with the best fitness value is used to generate a balanced dataset.
x
i,new
=x
i,1+αi(xi,2−xi,1)
New second category data points may be generated from the third chromosome 84 using any suitable method. For example, the processor circuitry 18 may use random resampling or synthetic minority oversampling technique (SMOTE) to generate new second category data points. The processor circuitry 18 may then generate a new balanced training dataset using the original first category data points and the new second category data points.
As illustrated in
After generating the new balanced training dataset, the processor circuitry 18 uses the new balanced training dataset to train the machine learning computer program 30. It should be appreciated that any learning method may be used by the machine learning computer program 30 at block 59.
The processor circuitry 18 then uses the trained machine learning computer program 30 to determine a fitness value of the chromosomes in the initial population of chromosomes by using the validation dataset (from block 56), an evaluation metric (for example, accuracy, Area Under the Curve (AUC), and so on), and the new datasets generated from the initial population of chromosomes.
At block 90, the processor circuitry 18 performs a mating selection using the initial population of chromosomes generated at block 58, the fitness values determined at block 59, and a pre-defined number of chromosomes in the mating pool (which may be defined by the operator using the user input device 20). In more detail, the processor circuitry 18 selects a subset of chromosomes from the initial population of chromosomes (the subset size being defined by the mating pool size) based on their fitness values (where chromosomes having higher fitness values have a higher probability of being selected into the mating pool by the processor circuitry 18).
At block 92, the processor circuitry 18 performs crossover and mutation on the chromosomes in the mating pool (that is, the subset of chromosomes selected at block 90) to generate offspring chromosomes. It should be appreciated that any suitable crossover algorithm may be used by the processor circuitry 18 (such as the crossover algorithm illustrated in
At block 94, the processor circuitry 18 performs a fitness evaluation on the offspring chromosomes generated at block 92 to determine a fitness value for the offspring chromosomes. The processor circuitry 18 may perform block 94 as described above with reference to block 59.
At block 96, the processor circuitry 18 performs survivor selection to select one or more chromosomes for passing on to the next generation of chromosomes. It should be appreciated that any suitable survivor selection method may be used. For example, the processor circuitry 18 may use an ‘elitism’ selection method where the chromosomes having the highest fitness values are passed onto the next generation of chromosomes. In more detail, the chromosomes from the original population and the offspring chromosomes may be pooled together and sorted based on their fitness values. The chromosomes having the highest fitness values (that is, the fittest chromosomes) are then chosen to survive to the next generation. The number of chromosomes chosen to survive is dependent on the elitism percentage. In some examples, only a certain percentage of the next generation of chromosomes are ‘elite’ chromosomes and the rest of the population of chromosomes are then randomly selected from the remaining pool of chromosomes.
At block 98, the processor circuitry 18 determines whether a termination condition has been fulfilled. The processor circuitry 18 may use any suitable termination condition and may use, for example, maximum number of generations, the maximum fitness value, and/or the average fitness value of the population of chromosomes. If the termination condition has not been fulfilled, the processor circuitry 18 returns to block 90 and the next generation of chromosomes selected at block 96 forms at least part of the mating pool for mating selection. If the termination condition has been fulfilled, the processor circuitry 18 proceeds to block 100.
At block 100, the processor circuitry 18 selects the chromosome having the highest fitness value (that is, the processor circuitry 18 selects the fittest chromosome) and then generates an optimized balanced dataset from the selected chromosome. The optimized balanced dataset may then be used at block 42 as the second dataset in order to predict fault occurrence in the mechanical system.
In some examples, the processor circuitry 18 may perform the method illustrated in
At block 102, the method includes determining a subset of the chromosomes from the plurality of chromosomes. The subset of chromosomes includes those chromosomes that are closest to the chromosome selected at block 100 (and may or may not include the selected chromosome). For example, subsequent to block 100, the processor circuitry 18 may select a subset of chromosomes having fitness values above a threshold fitness value. By way of another example, the processor circuitry 18 may select a predetermined number of chromosomes that have fitness values closest to the fitness value of the chromosome selected at block 100.
At block 104, the method includes generating a plurality of balanced datasets using the subset of chromosomes determined at block 102.
At block 106, the method includes training a plurality of classifiers using the plurality of balanced datasets generated at block 104. For example, the processor circuitry 18 may train the plurality of classifiers using the plurality of generated balanced datasets to train the machine learning algorithm 30.
At block 108, the method includes combining the plurality of trained classifiers to form an ensemble which may then be used to predict fault occurrence in the mechanical system 12 at block 42 (illustrated in
The apparatus 14 and methods illustrated in
First, the methods illustrated in
Second, when compared with non-wrapper methods, the methods illustrated in
Third, it has been found by the inventors that the methods illustrated in
Fourth, since the length of the chromosomes may not be restricted to be equal to the number of data points within the original dataset (that is, the received first dataset), this may advantageously enable effective optimisation by the evolutionary algorithm.
Fifth, the method illustrated in
It will be understood that the invention is not limited to the embodiments above-described and various modifications and improvements can be made without departing from the concepts described herein. For example, the at least one sensor 16 may be configured to sense condition data of a human or an animal instead of the mechanical system 12. In these examples, the apparatus 14 is configured to balance a dataset for the condition of the human or animal to enable a diagnosis to be determined from the balanced dataset.
By way of a further example, a chromosome may only encode; parameters required during clustering; and a data generation method. Such a chromosome may have a reduced length and thus enable more efficient optimisation to be achieved. The clustering may be used to define regions optimal for synthetic oversampling. By tuning the parameters, the regions for synthetic oversampling may be changed accordingly.
‘Nnew’ represents the number of new synthetic data points to be generated. ‘Nnew’ may be set to be of any integer value. For example, ‘Nnew’ may be allowed to range between 0 and the number of majority datapoints, ‘cls’ represents the clustering method used and a variety of clustering methods may be selected by a user. ‘k’ represents the number of clusters. ‘k’ may have an allowable range of integers between 0 and Nmin/2 to ensure that at least two data points are within each cluster. ‘nn’ represents the number of nearest neighbours within each cluster for oversampling. This parameter has the effect of either limiting the data generation to be close to the cluster centre or enabling data generation between clusters. ‘type’ specifies the data generation method (for example, type 1, type 2 and so on).
In more detail,
In more detail,
In more detail,
Where an imbalanced dataset includes more than two categories, the dataset may be balanced by sequentially pairing categories together and using the methods described in the preceding paragraphs for each pair of categories.
Except where mutually exclusive, any of the features may be employed separately or in combination with any other features and the disclosure extends to and includes all combinations and sub-combinations of one or more features described herein.
Number | Date | Country | Kind |
---|---|---|---|
1508032.8 | May 2015 | GB | national |