The invention relates to methods and devices for monitoring or analyzing power network assets. The invention relates in particular to methods and devices that perform a condition classification of power network assets, such as power transformers.
A power system comprises a network of electrical components or power system equipment configured to supply, transmit, and/or use electrical power. For example a power grid comprises generators, transmission systems, and/or distribution systems. Generators, or power stations, are configured to produce electricity from combustible fuels (e.g., coal, natural gas, etc.) and/or non-combustible fuels (e.g., such as wind, solar, nuclear, etc.). Transmission systems are configured to carry or transmit the electricity from the generators to loads. Distribution systems are configured to feed the supplied electricity to nearby homes, commercial businesses, and/or other establishments. Among other electrical components, such power systems may comprise one or more power transformers configured to transform electricity at one voltage (e.g., a voltage used to transmit electricity) to electricity at another voltage (e.g., a voltage desired by a load receiving the electricity).
Monitoring and analysis of power network assets, such as power transformers, is an important task, because it can mitigate the risk of power system failure and ensure that actions are taken in a timely manner to ensure reliable operation of the power network assets, before failure occurs.
The identification of a condition that indicates that a power network asset requires attention is a considerable challenge. For illustration, power transformers are complex high-cost assets that are subject to ageing and other phenomena that may affect their reliability and operation. Various tools have been developed to assist an engineer in identifying conditions of power network assets that require some action to be taken.
WO 2014/078830 A2 discloses a method that comprises predicting an oil temperature of a transformer of a power system for a desired load based upon a profile of the transformer developed via a machine-learning algorithm.
CN 102 735 760 A discloses a method of predicting transformer oil chromatographic data based on an extreme learning machine.
CN 102 944 796 A discloses a fault diagnosis method for a power transformer that is based on an extreme learning machine.
The accuracy of a tool that automatically processes parameters of a power network asset for monitoring, diagnosis, or analysis is expected to increase when it is capable of taking into consideration values of a larger number of parameters. Various limitations are conventionally associated with tools that process a large number of inputs. For illustration, when a tool is capable of automatically processing a large number of parameter values associated with a power network asset, the performance may be good when all of those parameter values are available for a given power network asset. However, the tool may be incapable of analyzing the condition of a different power network asset for which not all of the required parameter values are available, or may be capable of analyzing the condition only partially. The lack of information on the expected reliability of the tool when not all of the required parameter values are available may also present an obstacle.
Missing parameter values for a power asset may have various reasons and may be caused, e.g., by the absence of certain sensors or by the lack of information on parameters such as age of the power transformer.
It may be challenging to adequately train a tool capable of automatically processing a large number of parameter values, because historical data that can be used for the training process may include all the parameter values for just a small number of power network assets. Analysis tools that use a smaller number of parameter values may be easier to train, but may not provide adequate reliability.
It is an object of the invention to provide improved methods, devices, systems, and computer-readable instructions that perform a condition classification of a power network asset. It is in particular an object to provide improved methods and devices that are capable of reliably performing a condition classification even if not all input parameter values required by an automatic classification procedure are available.
According to embodiments, methods and devices are provided which are capable of performing a condition classification for a power network asset. The methods and devices combine an automatic classification procedure that requires a set of parameter values as inputs with a missing data replacement procedure. The missing data replacement procedure provides substitute values for each required parameter value that is not available for a given power network asset. The missing data replacement procedure may be invoked when training the automatic classification procedure (e.g., to provide substitute values for those portions of the historical data that lack parameter values) and when using the automatic classification procedure for online or offline condition classification of a power network asset (e.g., by invoking the missing data replacement procedure to provide substitute values when some of the required parameter values are not available for the power network asset for which condition classification is performed).
According to an aspect of the invention, a method of monitoring or analyzing a power network asset of a power network comprises: performing, by an electronic device, an automatic classification procedure for a condition classification of the power network asset, wherein the automatic classification procedure performs the condition classification using a set of parameter values as inputs, and wherein only a subset of the set of parameter values is available for the power network asset and at least one parameter value of the set is not available for the power network asset. The method further comprises performing, by the electronic device, a missing data replacement procedure to determine at least one substitute parameter value, and using the subset of parameter values and the at least one substitute parameter value in combination as inputs for the automatic classification procedure to obtain the condition classification of the power network asset.
According to another aspect of the invention, an electronic device comprises an interface to receive data associated with a power network asset, and a processing device configured to perform an automatic classification procedure for a condition classification of the power network asset, wherein the automatic classification procedure is operative to use a set of parameter values as inputs, and wherein only a subset of the set of parameter values is available for the power network asset and at least one parameter value of the set is not available for the power network asset. The processing device is further configured to perform a missing data replacement procedure to determine at least one substitute parameter value, and use the subset of parameter values and the at least one substitute parameter value in combination as inputs for the automatic classification procedure to obtain the condition classification of the power network asset.
According to another aspect of the invention, there is provided a power network which comprises a power network asset and an electronic device. The electronic device comprises an interface to receive data associated with the power network asset, and a processing device configured to perform an automatic classification procedure for a condition classification of the power network asset, wherein the automatic classification procedure is operative to use a set of parameter values as inputs, and wherein only a subset of the set of parameter values is available for the power network asset and at least one parameter value of the set is not available for the power network asset. The processing device is further configured to perform a missing data replacement procedure to determine at least one substitute parameter value, and use the subset of parameter values and the at least one substitute parameter value in combination as inputs for the automatic classification procedure to obtain the condition classification of the power network asset. The power network asset may be a transformer, in particular a power transformer, or a generator, without being limited thereto.
According to another aspect of the invention, there is provided a set of machine-readable instructions that cause a processor of an electronic device to perform the following steps: performing an automatic classification procedure for a condition classification of a power network asset, wherein the automatic classification procedure performs the condition classification using a set of parameter values as inputs, wherein only a subset of the set of parameter values is available for the power network asset and at least one parameter value of the set is not available for the power network asset; performing a missing data replacement procedure to determine at least one substitute parameter value; and using the subset of parameter values and the at least one substitute parameter value in combination as inputs for the automatic classification procedure to obtain the condition classification of the power network asset.
According to another aspect of the invention, there is provided a method of providing an automatic classification procedure for a condition classification of a power network asset. The method comprises training a machine learning algorithm that uses a set of parameter values as inputs to perform a condition classification, wherein the training is performed using training data associated with a plurality of power network assets; and performing a missing data replacement procedure when training the machine learning algorithm, the missing data replacement procedure generating substitute parameter values where at least one of the parameter values of the set is missing in the training data.
According to another aspect of the invention, there is provided a set of machine-readable instructions that cause a processor of an electronic device to perform the following steps for providing an automatic classification procedure for a condition classification of a power network asset: training a machine learning algorithm that uses a set of parameter values as inputs to perform a condition classification, wherein the training is performed using training data associated with a plurality of power network assets; and performing a missing data replacement procedure when training the machine learning algorithm, the missing data replacement procedure generating substitute parameter values where at least one of the parameter values of the set is missing in the training data.
The methods, devices, and machine-readable instruction code according to embodiments of the invention mitigate missing data problems that are conventionally encountered for automatic condition classification when the number of inputs of the automatic condition classification is so large that it is likely that one or several ones of the parameter values required by the automatic condition classification may not be available for a power network asset.
Embodiments of the invention may be used for determining whether a transformer, in particular a power transformer, or another power network asset operates normally or whether the transformer requires attention, without being limited thereto.
Embodiments of the invention may be used for performing an automatic condition classification with good reliability, even when part of the parameter values used as inputs by the automatic condition classification are not available for a given power network asset. Embodiments of the invention may be particularly useful in cases where one or several of the parameter values required as inputs by the automatic condition classification are not monitored online for a power network asset, for example without being limited thereto.
The subject-matter of the invention will be explained in more detail with reference to preferred exemplary embodiments which are illustrated in the attached drawings, in which:
Exemplary embodiments of the invention will be described with reference to the drawings in which identical or similar reference signs designate identical or similar elements. While some embodiments will be described in the context of power transformers, the methods and devices described in detail below may be used for a performing a condition classification of a wide variety of different power network assets. The features of embodiments may be combined with each other, unless specifically noted otherwise.
Overview
In view of their importance for power network operation and reliability, an assessment of the condition of power network assets is performed. In order to assist an engineer in this task, a condition classification device 30 may automatically perform a condition classification of one or several power network assets. For illustration, and without being limited thereto, the condition classification device 30 may perform a condition classification of the power transformer 20 and, optionally, of one or several additional power transformers 25 or other power network assets.
The condition classification device 30 may be operative to output a condition classification that may have at least two different values. The at least two different values may represent
The condition classification device 30 receives data from sensor(s) 21, 22, 26, 27 that capture operational data associated with the power transformer(s) 20, 25 or other power network assets for which condition classification is to be performed. The condition classification device 30 may have an interface 33 for receiving data from the sensors that capture operational data associated with the power transformer(s) 20, 25 or other power network assets. The condition classification device 30 may be configured to process a wide variety of different parameter values to perform the condition classification, as will be explained in more detail below.
Additional parameter values associated with the power transformer(s) 20, 25 or other power network assets for which condition classification is to be performed may be stored in a data storage device 34. Such additional parameter values may include information on age, importance rating, construction type, nameplate data, or other information associated with the power network assets that is unlikely to change during ongoing operation of the power network asset. An engineer may input this information via a user interface 35, for example for storing in the data storage device 34 when a power network asset is installed, for example.
The condition classification device 30 may comprise an automatic classification module 31. The automatic classification module 31 may be configured to perform an automatic condition classification in response to a set of parameter values. The total number of values of different parameters in the set that are required as inputs by the automatic classification module 31 may be designated by N.
A subset of the set of parameter values may be available for a power network asset 20, 25 for which condition classification is to be performed. The total number of values of different parameters in the subset that is available for the power network asset for which condition classification may be designated by L.
The subset of parameter values that is available for the power network asset may include parameter values that are provided by sensors 21, 22, 26, 27 installed on the power network asset for which condition classification is to be performed. L1 parameter values for the power network asset for which condition classification is to be performed may be received at the interface 33, while L2 parameter values may be retrieved from a data storage device 34, wherein L1+L2=L.
The condition classification device 30 according to embodiments is configured in such a way that it can perform an automatic condition classification even when the number L of values for different parameters that is available for a power network asset is less than the total number N of values of different parameters that is required by the automatic classification module 31 as inputs. In order to accommodate the missing parameter values, the condition classification device 30 comprises a missing data replacement module 32. The missing data replacement module 32 may provide substitute values for those M=N−L values for parameters that are required by the automatic classification module 31 to perform a condition classification, but which are not available for the power network asset for which condition classification is to be performed.
The missing data replacement module 32 may use any one of a variety of different missing data replacement techniques, as will be explained in more detail below.
The missing data replacement module 32 may determine at least one of the substitute parameter values required as input for the automatic classification module 31 as a function of the L parameter values that are available for the power network asset.
The automatic classification module 31 and missing data replacement module 32 may be implemented by hardware, firmware, software, other machine-readable instruction code, or a combination thereof. The condition classification device 30 may comprise at least one integrated semiconductor circuit to implement the function of the automatic classification module 31 and the missing data replacement module 32. The at least one integrated semiconductor circuit may comprise one or several of a microprocessor, a processor, a microcontroller, a controller, an application specific integrated circuit, or any combination thereof.
While the operation of the condition classification device 30 will mainly be described with reference to the condition classification of one power network asset, such as power transformer 20, it will be appreciated that in a realistic operation scenario the condition classification device 30 can normally perform a condition classification for a plurality of power network assets that operate in the same power network 10 or that may even be installed in different power network. For illustration, the condition classification device 30 may perform a condition classification for a plurality of power transformers 20, 25, either simultaneously or sequentially.
When the condition classification device 30 performs a condition classification for plural power network assets, different parameter values may be missing for different power network assets, even when the power network assets are all of the same type (such as power transformer). In this case, the missing data replacement module 32 may provide substitute values for different missing parameter values of the different power network assets. For illustration, a substitute value for a first parameter of the power transformer 20 may be provided by the missing data replacement module 32 for a condition classification of the power transformer 20. A substitute value for a second parameter of the power transformer 25 may be provided by the missing data replacement module 32 for a condition classification of the power transformer 25. The missing data replacement module 32 may use different missing data replacement procedures depending on which one of the parameter values required by the automatic classification module 31 is not available for the respective power network asset.
In the condition classification, the automatic classification module 31 may process an input it receives in the same way, irrespective of whether the input is an actual (for example measured) parameter value of the power network asset or whether the input is a substitute value generated by the missing data replacement module 32. For illustration, a gas concentration relating to a dissolved gas in an insulating oil of a power transformer may be processed in the same way by the automatic classification module 31 irrespective of whether the value for this parameter is measured at the power transformer 20 or whether it is generated by the missing data replacement module 32.
The condition classification device 30 according to an embodiment may combine automatic classification, which may be implemented by a machine learning algorithm, with missing data replacement. This allows the condition classification device 30 to perform an automatic classification procedure that uses a comparatively large number N of parameter values as inputs, rendering the automatic classification procedure reliable, while autonomously compensating for missing parameter values that may not be available for some of the power network assets. Those parameter values that are not available may be replaced by substitute values that are determined by performing a missing data replacement procedure.
It will be appreciated that there may be various reasons that can cause at least one of the parameter values required by the automatic classification module 31 to be not available for a power network asset. Exemplary scenarios include the following:
Without limitation, the automatic classification module 31 may be configured such that it processes a number N of different parameter values, with N being greater than 10, greater than 50, or even greater than 90, to perform the condition classification. At least part of the number N of different parameter values required as inputs by the automatic classification module 31 may be generated by the missing data replacement module 32.
While the generation of a substitute value for each parameter value that is not available for a power network asset, but which is required as input by the automatic classification module 31, allows the condition classification to be performed, it may affect the accuracy of the obtained condition classification. The condition classification device 30 may be operative to determine an indicator for the accuracy, e.g., a confidence level, of the condition classification in dependence on how many substitute values have been inputted to the automatic classification module 31 and/or in dependence on which parameters are affected by the inputting of the substitute values.
The condition classification device 30 may output a result of the condition classification. Optionally, the condition classification device 30 may output information on the accuracy, e.g., a confidence level, of the result of the condition classification in dependence on which parameter value(s) is/are not available for the power network asset.
The condition classification device 30 may comprise a user interface 35 for outputting the result of the condition classification and, optionally, information on the accuracy. Alternatively or additionally, the condition classification device 30 may comprise a data network interface for outputting data indicating the result of the condition classification and, optionally, the information on the accuracy. This allows the results of the condition classification to be accessed by a terminal device that may be remote from the condition classification device 30, as will be explained in more detail with reference to
The automatic classification procedure performed by the condition classification device 30 may comprise a machine learning technique. The machine learning technique may be trained using historical data or other training data previously acquired for a plurality of power network assets of the same asset type as the power network asset for which condition classification is to be performed. For illustration, in order to perform a condition classification of a power transformer or of several power transformers installed in a power network 10, the automatic classification procedure 31 may comprise one or several machine learning algorithms that have been trained with historical data for a plurality of power transformers. Missing data replacement procedures may be performed not only for determining a condition classification of a power network asset during operation, but also when training a machine learning technique.
Exemplary methods and scenarios in which the combination of an automatic classification procedure with missing data replacement may be used will be described with reference to
The automatic classification procedure that has been trained for power network asset condition classification in the method 50 may comprise a machine learning algorithm or plural different machine learning algorithms. The machine learning algorithm(s) may comprise linear algorithms, nonlinear algorithms, and ensemble algorithms. The method 50 of adapting an automatic classification procedure to a type of power network asset (such as power transformer) may comprise training a plurality of different machine learning algorithms and selecting one or some of the machine learning algorithms as a function of a performance evaluation. The plurality of different machine learning algorithms that is trained in the method 50 may comprise at least one linear algorithm selected from a group consisting of general linear regression (GLM) and linear discriminant analysis (LDA). Alternatively or additionally, the plurality of different machine learning algorithms that is trained in the procedure 50 may comprise at least one nonlinear algorithm selected from a group consisting of classification and regression trees (CART), a Naïve Bayes algorithm (NB), Bayesian networks, K-nearest neighbor (KNN), and a support vector machine (SVM). Alternatively or additionally, the plurality of different machine learning algorithms that is trained in the method 50 may comprise at least one ensemble algorithm selected from a group consisting of random forest, tree bagging, an extreme gradient boosting machine, and artificial neural networks.
The method 50 may also comprise performing a missing parameter replacement procedure. For illustration, the training data may comprise historical data associated with a plurality of power network assets. While it is desirable to provide an automatic classification procedure that can take advantage of a comparatively large number of inputs, the number of parameter values that is available for each one of the power network assets in the training data may be fairly small or even zero. For illustration, the training data may include a large number of data sets. Each data set may be associated with historical data of a real power transformer or other power network asset. In some or all of the data sets (which may be thought of as lines or columns in a large table of training data), at least one parameter value may be missing. Therefore, missing data replacement may be used also during the training, so as to replace those parameter values that are not available for a power network asset in the training data by substitute values in each one of the data sets.
In the method 60, the use of the automatic classification procedure may involve performing a missing data replacement procedure. Missing data replacement in the methods 50 and 60 serve somewhat different, albeit related purposes: missing data replacement in the method 50 at least partially compensates for the fact that not all parameter values that can be input to the various machine learning algorithms during the training may be available for all data sets of the training data. The missing data replacement in method 60 at least partially compensates for the fact that not all parameter values that are required as inputs by the trained automatic classification procedure may be available for a power network asset for which condition classification is to be performed.
At step 52, it is determined whether a parameter value that should be input into a machine learning algorithm during the training is missing in the training data for at least one of the data sets in the training data. If a parameter value is missing, the missing data replacement procedure is performed at step 53 to generate a substitute value for the missing parameter value. If no parameter value is missing for a data set associated with a power network asset in the training data (which is a very unlikely scenario if the number of parameter values input into the machine learning algorithm is large), the method may directly proceed from step 52 to step 54.
At step 54, supervised learning may be performed. The supervised learning may be based on the training data, supplemented by substitute values generated by the missing data replacement procedure performed at step 53 where required. The supervised learning may use the training data in combination with an assessment of the power network condition that is given by a human expert.
As will be appreciated by the skilled person, “machine learning” involves a semi-automated process of knowledge extraction from data using algorithms that are not explicitly programmed. The process is semi-automated because machine learning requires human-data interaction (e.g., for data cleansing, etc.). Machine learning generally refers to a vast set of tools that can be utilized to extract knowledge from data. Various algorithms known to the skilled person may be put to use for condition classification of a power network asset. Examples of such algorithms include, without limitation, linear algorithms, such as general linear regression (GLM) and linear discriminant analysis (LDA); nonlinear algorithms, such as classification and regression trees (CART), a Naïve Bayes algorithm (NB), Bayesian networks, K-nearest neighbor (KNN), and a support vector machine (SVM); and ensemble algorithms, such as random forest, tree bagging, an extreme gradient boosting machine, and artificial neural networks.
In the supervised learning performed at step 54, machine learning maps the complex relationship between the feature space and the condition classification, which is the output variable of the machine learning algorithm. The output provided by the machine learning algorithm is compared to the human expert classification, to improve and enhance the accuracy of the classification obtained by the machine learning algorithm. In unsupervised learning, the machine learning searches for hidden structures in data.
While only general steps of the training method 50 are shown in
At step 61, parameter values are received for a power network asset. The parameter values may be received from the sensors associated with the power network asset and/or from a data repository, as has been explained with reference to
At step 62, it is determined whether one of the N parameter values required by the automatic classification procedure as inputs is not available for the power network asset, i.e., whether L=L1+L2<N. For an automatic classification procedure that uses a fairly large number of inputs (e.g., more than 50 inputs), at least one parameter value is likely to be not available for any power network asset in the power network. Different parameter values may be missing for different power network assets (e.g., different transformers) for which a condition classification is performed.
At step 63, a missing data replacement procedure is performed to generate a substitute value for the at least one parameter value that is not available for the power network asset.
At step 64, an automatic classification procedure is performed. The automatic classification procedure uses the received parameter values as inputs, supplemented by the substitute values generated at step 63 for those inputs for which no data is available for a power network asset.
While only the general steps of the training method 60 are shown in
At least one parameter value of the set 41 is neither included in the set of parameter values 36 nor in the other known parameter values 37. Substitute values 43 are determined by the missing data replacement module 32. The substitute values 43 are input as substitutes for actual (measured or otherwise known) parameter values that are included in the set 41 because they are required as input by the automatic classification module 31, but which are neither available as sensed parameter values 36 nor otherwise known for the power network asset.
At least one of the substitute values 43 may depend on the subset 42 of parameter values that is available for the power network asset. For illustration, and as will be explained in more detail below, correlations between different parameters or information on statistical distributions of parameter values may be used in the missing data replacement procedure to determine one or several of the substitute values.
While the concepts disclosed herein are applicable to a wide variety of different power network assets, the methods, devices, and computer programs may be used for a condition classification of transformers. The parameter values used as inputs of the automatic classification procedure may include parameter values for which online monitoring is performed during operation of the power network asset and other parameter values for which no online monitoring is performed during operation of the power network asset. The parameter values used as inputs of the automatic classification procedure may include a parameter value that has been incorporated into the inputs of the automatic classification procedure after manufacture or installation of the power network asset, such that no information on this parameter value is available for the power network asset.
The techniques may be used, e.g., for a condition classification of a power transformer, a distribution transformer, or a high voltage transformer, which may be operative with voltages of at least 69 kV or at least 34.5 kV.
An automatic classification procedure may use various parameter values associated with the transformer as inputs.
The following provides non-limiting examples for parameter values that may be required (individually or in any combination) as inputs of the automatic classification procedure for a power network asset:
It will be appreciated that values of alternative or additional parameters may be used as inputs of the automatic classification procedure for other power network assets, such as generators.
Missing Parameter Values and Missing Data Replacement
Missing Parameter Values
The problem that parameter values required as inputs for an automatic classification procedure may not be available may occur both when training the automatic condition classification for subsequent use in power network asset condition classification and when using the trained automatic classification procedure. In either case, missing data replacement procedures may be performed to provide substitute values when part of the required inputs are missing for a power network asset. The missing data replacement procedures disclosed herein may be used in association with any one of the methods, devices, systems, and machine-readable instruction codes disclosed herein.
The table includes exemplary columns, which are provided for illustration and which are not exhaustive. For illustration, a data column 71 may include an identifier for each power network asset. A data column 72 may include a parameter value representing a class of the respective power network asset. A data column 73 may include a parameter value representing an importance rating of the respective power network asset. A data column 74 may include a parameter value representing an age of the respective power network asset. A data column 75 may include a parameter value representing a voltage class of the respective power network asset. A data column 76 may include a parameter value representing a ThruFault of the respective power network asset. Data columns 77-85 may include parameter values representing dissolved gas concentrations in insulating oil of an oil insulation system for the gases H2, CH4, C2H2, C2H4, C2H6, CO, CO2, O2, and N2.
Parameter values are missing in areas 86, 87, and 88 of the table. For illustration, the age information is missing for a transformer in area 86 of the table. ThruFaults are missing for transformers in area 87 of the table. Gas concentrations are missing for various other transformers in area 88 of the table.
If the missing data is encountered during operation of a condition classification device in accordance with an embodiment, substitute values may be determined that are used where real data is missing in areas 86, 87, and 88 of the table. The determination of substitute values may be performed automatically and autonomously by the condition classification device.
If the missing data is encountered during the adaptation of the automatic classification procedure, e.g., during training a machine learning algorithm, substitute values may be input to the supervised learning procedure.
As will be appreciated from
The missing data replacement procedure(s) which may be used may include the following:
Exemplary implementations of such missing data replacement procedures will be explained below.
For brevity, parameter values that are required as inputs for a machine learning algorithm or a trained automatic classification procedure will be referred to as “missing parameter values” in the following. It will be appreciated that a missing parameter value always is to be understood with reference to a respective power network asset or data set of the training data. I.e., while a given parameter value may not be available for a power transformer 20, the respective parameter value may be available for another power transformer 25 in the power network. Similarly, while a given parameter value may not be available for a data set in the training data, the respective parameter value is available for another data set in the training data.
Determining a Substitute Value for a Missing Parameter as a Default Value
A substitute value for a missing parameter value may be a default value. The default value may be fixed. The default value may depend on which parameter value is missing. The default value may also depend on the parameter values that are available for the power network asset.
Determining a Substitute Value for a Missing Parameter as a Mean or Median Value of a Statistical Distribution
A substitute value for a missing parameter value may be determined as a mean or median value of a statistical distribution for this parameter value. The statistical distribution may be determined, e.g., from those data sets in the training data that include the parameter value that is missing in another data set.
As illustrated in
Determining a Substitute Value for a Missing Parameter as a Random Number Selected According to a Statistical Distribution
A substitute value for a missing parameter value may be determined as a random value in accordance with a statistical distribution for this parameter value. I.e., the substitute value may be a random number that is selected in accordance with a statistical distribution for that parameter value. The statistical distribution may be determined, e.g., from those data sets in the training data that include the parameter value that is missing in another data set. The statistical distribution may alternatively be determined by physical models or by experiments.
Determining a Substitute Value for a Missing Parameter by Hard Value Imputation
A substitute value for a missing parameter value may be determined by hard value imputation. The substitute value may be based on an educated guess. During training of the automatic classification procedure, the educated guess may be provided by a human expert. During operation of the condition classification device 30, when hard value imputation is used, information on the educated guess may be retrieved from a storage device. The storage device may store educated guess values for a plurality of different parameter values.
Determining a Substitute Value for a Missing Parameter Based on Parameter Correlations
A substitute value for a missing parameter value may be determined by using correlations between parameters. For illustration, even when some parameter values are missing in most or all of the data sets of the training data, as illustrated in
Correlated parameters may be identified based on the correlation matrix. Exemplary islands of higher correlation are reproduced separately in
The correlation matrix may be used to determine substitute value(s) for one or several missing parameter values of a power network asset, using those parameter values of the power network asset that are known and by combining this information with the correlations 110a, 110b determined from a large set of power network assets. Multivariate regression or Pearson correlations may be used to determine the substitute value(s) for one or several missing parameter values of a power network asset in this way.
Other Missing Data Replacement Procedures
Other missing data replacement procedures may also be used. For illustration, a Probabilistic Belief Propagation Algorithm that uses Conditional Probability Tables (CPTs) may be employed to determine substitute values for missing parameter values, taking into consideration those parameter values of the power network asset that are known.
Selection of a Missing Data Replacement Procedure
Some missing data replacement procedures may outperform other missing data replacement procedures. The best-performing missing data replacement procedure may depend on which parameter value is missing and/or which machine learning technique is used to implement the automatic classification procedure.
The condition classification device 30 according to an embodiment may be configured to perform at least one missing data replacement procedure. More than one missing data replacement procedure may be supported, such as single imputation (educated guess, mean or even median value of a distribution), feature correlation (i.e., making the missing data a function of all other parameters), multiple imputation (i.e., finding the probability distribution function that best adhere to the data), and use of probabilistic belief propagation algorithms (such as in Bayesian Networks). One or several suitable missing data replacement procedures may be implemented in the condition classification device 30. Depending on the parameter values that are available for a power network asset and/or depending on the missing parameter for which a substitute value is to be determined, one of the missing data replacement procedures may be invoked to determine the substitute value.
For illustration, a missing data replacement procedure that uses parameter correlations may be used if there is a sufficient, but not too strong correlation or anti-correlation between the parameter value for which the substitute value is to be determined and other parameter value(s) that are known for the power network asset. If the correlation has a magnitude that is close to 1 (i.e., perfectly correlated or anti-correlated parameters), the missing data replacement procedure that uses parameter correlations may not add information when it is used to determine the substitute value for the missing parameter value.
For further illustration, if a good educated guess is available for a given parameter value, the educated guess may be used.
During the method of adapting an automatic classification procedure to a training set (method 50 in
Machine learning algorithms may be used to evaluate the impact of different types of missing data replacement strategies on the accuracy of the best trained machine learning algorithm.
Automatic Classification Procedure and Machine Learning Algorithms
The automatic classification procedure performed by the condition classification device 30 may be or may comprise a machine learning algorithm that has previously been trained with training data associated with a plurality of power network assets. Different machine learning algorithms may be used, as has been explained above.
Generally, training an automatic classification procedure for use with power network assets may include:
(a) Selecting a candidate technique (e.g., Linear Regression, Logistic Regression, ANN, Classification Trees, etc.).
(b) Select a training dataset with the attributes of the power network asset to be classified (e.g., transformer nameplate data, H2, CH4, etc.).
(c) Training the machine learning algorithm with the “labeled data”—for example the classification “good” or “bad”. or a classification comprising three or more classes.
(d) The machine learning algorithm “learns” the relationship between the attributes (or features) and the outcome. After the training, the machine learning algorithm can make predictions on new data for which there is no outcome, i.e., no class imposed by humans.
For illustration rather than limitation,
In the exemplary CART of
Node 1: Main tank
Node 2: Corrosion
Node 3: Leaks
Node 4: Main cabinet
Node 5: Oil quality
Node 6: Oil aging
Node 7: Acidity
Node 8: Power factor
Node 9: Interfacial tension
Node 10: Dielectric susceptibility
Node 11: Moisture
Node 12: Contaminants
Node 13: Gas level
Node 14: Gas trend
Node 15: Dissolved Gas Analysis (DGA)
Node 16: Electrical tests
Node 17: Thru Fault
Node 18: Noise Level
Node 19: Winding temperature
Node 20: Active part
Node 21: Cooling system
Node 22: Oil preservation system
Node 23: Load tap changer
Node 24: Bushings
Node 25: Accessories
Node 26: Operational data
Node 27: Load
Node 28: Sister failures
Node 29: Design issues
Node 30: History
Node 31: Probability health
During training of a Bayesian network, the conditional probability values in the conditional probability tables of the Bayesian Network may be learned. The learning process (method 50 in
Selection of Suitable Machine Learning Algorithm and Missing Data Replacement Procedure
In order to provide a reliable and accurate condition classification by the automatic classification procedure of the condition classification device 30, training may be performed for one or several machine learning algorithms and/or one or several missing data replacement procedures.
Certain missing data replacement strategies may work better for some parameter than for others. Machine learning classification algorithms may be used to assess a power network asset condition after the machine learning classification algorithms have been properly trained using training data captured from real power network assets (such as plural power transformers with multiple operational data like nameplate, load, gas in oil, oil quality, bushing power factor and capacitance, load tap changer operations, type, gases, etc.). The best machine learning algorithm(s) (i.e., those that provide best accuracies in the classification process) can be tested against the same data but using a different missing data replacement procedure, until the optimum machine learning algorithm and data replacement procedure are found.
At step 131, plural different machine learning algorithms are trained using the training data. The training may include supervised learning. A missing data replacement procedure may be used to provide substitute values where parameter values are missing in a data set of the training data.
The plural different machine learning algorithms that are trained at step 131 may comprise at least one linear algorithm selected from a group consisting of general linear regression (GLM) and linear discriminant analysis (LDA). Alternatively or additionally, the plurality of different machine learning algorithms that is trained at step 131 may comprise at least one nonlinear algorithm selected from a group consisting of classification and regression trees (CART), a Naïve Bayes algorithm (NB), Bayesian networks, K-nearest neighbor (KNN), and a support vector machine (SVM). Alternatively or additionally, the plurality of different machine learning algorithms that is trained at step 131 may comprise at least one ensemble algorithm selected from a group consisting of random forest, tree bagging, an extreme gradient boosting machine, and artificial neural networks.
At step 131, the machine learning algorithms may learn the statistical mapping between inputs (a set of parameter values) and output (a condition classification) through typically a large number of examples provided in the training phase (number of cases available in the training data), in which each example generally contains a large number of parameter values (for example transformer age, dissolved gas analysis history, load, etc.). Supervised learning may take place through a comparison between the output of each individual machine learning algorithm and the condition classification given by a human expert. An error function can be defined and a statistical process can be employed to minimize the error function so that each algorithm will provide the best possible accuracy based on its implementation.
At step 132, a performance evaluation may be performed. The performance evaluation is preferably performed based on test data that is not included in the training data. The performance evaluation may comprise testing the condition classification output by the trained machine learning algorithms and comparing the results against the classification provided by a human expert.
At step 133, at least one of the machine learning algorithms and, optionally, at least one of plural missing data replacement procedures used at step 131 is selected for use in the condition classification device 30. The selecting step 133 may comprise selecting the machine learning algorithm and missing data replacement procedure that, in the performance evaluation, had a maximum number of condition classifications that matched those of the human expert.
Alternative or additional criteria may be employed for selecting a machine learning algorithm and/or a missing data replacement procedure from a plurality of candidates. For illustration, the so-called confusion matrix may be evaluated that compares the results given by the trained machine learning algorithm to those given by the human expert. The selecting step 133 may comprise selecting the machine learning algorithm and missing data replacement procedure that had a maximum number of condition classifications that matched those of the human expert, but which did not incorrectly classify any power network asset that required attention as being in a normal operation state and/or that had the lowest number of incorrect classifications in which a power network asset that required attention was classified as being in a normal operation state.
For the exemplary training data used in
The Machine Learning algorithms showed an impressive accuracy when analyzing complex power transformer data, even without the use of any engineering model. In other words the algorithms do not need to be provided with reference levels or flags to indicate that a given parameter was within acceptable range or outside “normal” levels. The twelve machine learning models were only provided with the final classification between the above-mentioned classes (i), (ii), (iii) previously established by transformer human experts.
The best performing algorithm (xGBM1) presented near 97% accuracy when analyzing the 200 new test cases unseen during training. It missed one class (i) case that was “wrongly” but conservatively classified as class (iii), three class (ii) cases that were wrongly classified as class (i) and three class (ii) cases that were wrongly classified class (iii). No class (iii) case was wrongly classified. The significant number of misses in practical terms is three class (ii) cases classified as class (i) cases (i.e., classified as normal power transformers although the human expert considered those power transformers to require some attention) out of 200 total, leading to 3/200=1.5% real miss since the other misses were conservative and would not lead to any unfavorable situation like a possible failure.
Using an Automatic Classification Procedure and Missing Data Replacement Procedure for Automatic Online or Offline Condition Classification by a Condition Classification Device
The results of the adaptation of the automatic classification procedure (which may involve training plural machine learning algorithms using one or several different data replacement procedures) may be used for performing condition classification of a power transformer or of another power network asset. For illustration, the automatic classification procedure executed by the automatic classification module 31 may depend on which one of several trained machine learning techniques showed the best performance. Additional information obtained in the adaptation of the automatic classification procedure for power network asset condition classification may be used by the condition classification device.
The different missing data replacement procedures 171, 172, 173 may respectively be selected from a group consisting of
Which one of the missing data replacement procedures 171, 172, 173 is invoked for a given parameter may depend on the performance of the different missing data replacement procedures 171, 172, 173 and/or on which other parameters are available. For illustration, feature correlation may exhibit good performance for some parameter values, but may not be a viable option if several highly correlated parameter values are not available for a power network asset, with these highly correlated parameter values having little or no correlation with those parameter values that are available for the power network asset.
Additionally or alternatively, the methods and condition classification devices according to embodiments may be operative to provide information on the expected accuracy of a condition classification, in dependence on which parameter values are not available for a given power network asset. The expected accuracy, or confidence level, may be output via a user interface or a network interface.
De-Centralized Condition Classification System
Results of a condition classification performed by the condition classification device 30 may be output locally at a user interface 35 of the condition classification device 30, as has been explained above. The techniques disclosed herein may also be used in systems that involve plural spatially separated computing device that communicate with each other via a wide area network or the internet 37.
Use of the Missing Data Replacement Procedure for Accommodating Changes in the Automatic Classification Procedure
As has already been explained above, the need to generate substitute values for one or several parameter value(s) that are not available for a power network asset may have various reasons, including the absence of sensors for a parameter value required as input by the automatic classification procedure.
One exemplary scenario in which the missing data replacement procedure may be applied to generate a substitute value for the same parameter value for all, or at least a large fraction, of the power network assets that are being monitored is that the automatic classification procedure is enhanced, possibly long after installation of the power network assets, to use a new parameter value as input. For illustration, a new parameter value may be discovered to be of relevance to the condition classification, long after power transformers or other power network assets have been built and installed. It may not be possible to retrofit the installed power network assets with a sensor that would be capable of measuring this new parameter value. In this case, the missing data replacement procedure may be used to generate the substitute value for this new parameter value that has subsequently been incorporated into the inputs of the automatic classification procedure.
A suitable missing data replacement procedure for such a new parameter may be obtained by laboratory experiments or physical modeling, even when little empirical information may be available for the effect of the new parameter on the condition classification.
The following embodiments are also disclosed:
A method for a power network, comprising:
performing, by an electronic device, an automatic classification procedure for a condition classification of a power network asset,
wherein the automatic classification procedure performs the condition classification using a set of parameter values as inputs,
wherein only a subset of the set of parameter values is available for the power network asset and at least one parameter value of the set is not available for the power network asset;
performing, by the electronic device, a missing data replacement procedure to determine at least one substitute parameter value; and
using the subset of parameter values and the at least one substitute parameter value in combination as inputs for the automatic classification procedure to obtain the condition classification of the power network asset.
The method of embodiment 1,
wherein the missing data replacement procedure is performed to determine a substitute value for a parameter value for which no online monitoring is performed during operation of the power network asset.
The method of embodiment 1 or embodiment 2,
wherein the missing data replacement procedure is performed to determine a substitute value for a parameter value that has been incorporated into the inputs of the automatic classification procedure after manufacture or installation of the power network asset.
The method of any one of the preceding embodiments,
wherein the missing data replacement procedure is performed to determine a substitute value for a parameter value that is independent of an operation condition of the power network asset.
The method of any one of the preceding embodiments,
wherein the missing data replacement procedure is performed to determine a substitute value for an age of the power network asset.
The method of any one of the preceding embodiments,
wherein the missing data replacement procedure is performed to determine a substitute value for a voltage class, a power, or an importance rating of the power network asset.
The method of any one of the preceding embodiments,
wherein the missing data replacement procedure is performed to determine a substitute value for a ThruFault of the power network asset.
The method of any one of the preceding embodiments,
wherein the power network asset comprises an insulation system, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter relating to the insulation system.
The method of embodiment 8,
wherein the insulation system comprises an oil insulation system.
The method of embodiment 9,
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter selected from a group consisting of: an oil interfacial tension, an oil dielectric strength, an oil power factor, moisture in insulating oil of the oil insulation system, and a system type of the oil insulation system.
The method of embodiment 9 or embodiment 10,
wherein the missing data replacement procedure is performed to determine a substitute value for a concentration of at least one dissolved gas in insulating oil of the oil insulation system.
The method of embodiment 11,
wherein the at least one gas is selected from a group consisting of: H2, CH4, C2H2, C2H4, C2H6, CO, CO2, O2, and N2.
The method of any one of embodiments 8 to 12,
wherein the insulation system comprises a gas insulation system.
The method of any one of the preceding embodiments,
wherein the power network asset comprises a winding, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter of the winding.
The method of embodiment 14,
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter selected from a group consisting of: a winding power factor, a winding capacitance, and a winding temperature.
The method of any one of the preceding embodiments,
wherein the power network asset comprises a bushing, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter of the bushing.
The method of embodiment 16,
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter selected from a group consisting of: a bushing power factor, a bushing capacitance, and a bushing type of the bushing.
The method of any one of the preceding embodiments,
wherein the power network asset comprises a cooling system, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter of the cooling system.
The method of embodiment 18,
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter selected from a group consisting of: a condition of the cooling system and a cooling system type of the cooling system.
The method of any one of the preceding embodiments,
wherein the power network asset comprises a load tap changer, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter of the load tap changer.
The method of embodiment 20,
wherein the missing data replacement procedure is performed to determine a substitute value for a condition of the load tap changer or a load tap changer type of the load tap changer.
The method of any one of the preceding embodiments,
wherein the missing data replacement procedure is performed to determine a substitute value for a load coupled to the power network asset.
The method of any one of the preceding embodiments,
wherein the power network asset is a transformer.
The method of embodiment 23,
wherein the transformer is a power transformer.
The method of embodiment 23,
wherein the transformer is a distribution transformer.
The method of embodiment 25,
wherein the transformer is a high voltage transformer.
The method of any one of embodiments 1 to 22,
wherein the power network asset is a generator.
The method of any one of the preceding embodiments, further comprising:
determining confidence information indicative of an accuracy of the condition classification when the missing data replacement procedure is performed; and outputting the confidence information.
The method of any one of the preceding embodiments, further comprising:
selecting, by the electronic device, the missing data replacement procedure from a plurality of missing data replacement procedures.
The method of embodiment 29,
wherein the missing data replacement procedure is selected as a function of which ones of the set of parameter values are not available for the power network asset.
The method of embodiment 29 or 30,
wherein at least two different missing data replacement procedures are performed for at least two different parameter values of the set that are not available for the power network asset.
The method of any one of embodiments 29 to 31,
wherein the one of the plurality of missing data replacement procedures is selected which maximizes accuracy of the condition classification of the power network asset.
The method of any one of the preceding embodiments,
wherein a first parameter value and a second parameter value from the set of parameter values are not available for the power network asset,
a first missing data replacement procedure is performed to automatically determine a first substitute parameter value for the first parameter value, and
a second missing data replacement procedure is performed to automatically determine a second substitute parameter value for the second parameter value, the second missing data replacement procedure being different from the first missing data replacement procedure.
The method of embodiment 33,
wherein an accuracy of the condition classification is increased by performing the second missing data replacement procedure to determine the second substitute parameter value, as compared to a case in which the first missing data replacement procedure is used to determine both the first substitute parameter value and the second substitute parameter value.
The method of any one of the preceding embodiments,
wherein the automatic classification procedure comprises a machine learning algorithm.
The method of any one of the preceding embodiments, wherein the automatic classification procedure is selected from a plurality of automatic classification procedures.
The method of embodiment 36,
wherein the plurality of automatic classification procedures comprises procedures selected from a group consisting of linear algorithms, nonlinear algorithms, and ensemble algorithms.
The method of embodiment 36 or embodiment 37,
wherein the plurality of automatic classification procedures comprises a linear algorithm selected from a group consisting of general linear regression (GLM) and linear discriminant analysis (LDA).
The method of any one of embodiments 36 to 38,
wherein the plurality of automatic classification procedures comprises a nonlinear algorithm selected from a group consisting of classification and regression trees (CART), a Naïve Bayes algorithm (NB), Bayesian networks, K-nearest neighbor (KNN), and a support vector machine (SVM).
The method of any one of embodiments 36 to 39,
wherein the plurality of automatic classification procedures comprises an ensemble algorithm selected from a group consisting of random forest, tree bagging, an extreme gradient boosting machine, and artificial neural networks.
The method of any one of the preceding embodiments,
wherein the missing data replacement procedure is selected from a group consisting of the following procedures:
The method of any one of the preceding embodiments,
wherein the missing data replacement procedure comprises determining the at least one substitute parameter value using a multivariate regression or using a Pearson correlation.
The method of any one of the preceding embodiments, further comprising:
receiving, by the electronic device, all or part of the subset of parameter values for the power network asset from a plurality of sensors.
The method of embodiment 43,
wherein the data are received during operation of the power network asset and the automatic classification procedure is performed online during operation of the power network asset.
The method of any one of the preceding embodiments,
wherein the automatic classification procedure is operative to assign the power network asset to one of at least three different classes.
The method of embodiment 45,
wherein the at least three different classes comprise
An electronic device, comprising:
an interface to receive data associated with a power network asset; and
a processing device configured to perform an automatic classification procedure for a condition classification of the power network asset, wherein the automatic classification procedure is operative to use a set of parameter values as inputs,
wherein only a subset of the set of parameter values is available for the power network asset and at least one parameter value of the set is not available for the power network asset, and
the processing device is further configured to
The electronic device of embodiment 47,
wherein the processing device is further configured to output a result of the condition classification of the power network asset over a wide area network or the internet.
The electronic device of embodiment 47 or 48,
wherein the electronic device is configured to perform the method of any one of embodiments 1 to 46.
A power network, comprising:
a power network asset; and
the electronic device of any one of embodiments 46 to 48 to perform a condition classification of the power network asset.
The power network of embodiment 50,
wherein the power network asset is a transformer, in particular a power transformer, a distribution transformer, or a high voltage transformer.
The power network of embodiment 50,
wherein the power network asset is a generator.
Machine-readable instruction code comprising instructions which, when executed by a processor of an electronic device, cause the electronic device to perform the method of any one of embodiments 1 to 46; optionally wherein the machine-readable instruction code is stored in a tangible storage medium.
A method of providing an automatic classification procedure for a condition classification of a power network asset, the method comprising:
training a machine learning algorithm that uses a set of parameter values as inputs to perform a condition classification,
wherein the training is performed using training data associated with a plurality of power network assets; and
performing a missing data replacement procedure when training the machine learning algorithm, the missing data replacement procedure generating substitute parameter values where at least one of the parameter values of the set is missing in the training data.
The method of embodiment 54,
wherein training the machine learning algorithm comprises training a plurality of machine learning algorithms using the training data, and the method further comprises:
performing a performance evaluation after the training; and
selecting, based on the performance evaluation, at least one of the plurality of machine learning algorithms for use in the condition classification.
The method of embodiment 54 or embodiment 55,
wherein performing the missing data replacement procedure comprises performing a plurality of missing data replacement procedures when training the machine learning algorithm and the method further comprises:
performing a performance evaluation after the training; and
selecting, based on the performance evaluation, at least one of the plurality of different missing data replacement procedures for use in the condition classification.
The method of embodiment 55 or embodiment 56,
wherein the performance evaluation is performed using test data different from the training data.
The method of any one of embodiments 54 to 57,
wherein the machine learning algorithm is trained using supervised learning.
The method of any one of embodiments 54 to 58,
wherein the missing data replacement procedure is performed to determine a substitute value for an age of at least one power network asset of the plurality of power network assets.
The method of any one of embodiments 54 to 59,
wherein the missing data replacement procedure is performed to determine a substitute value for a voltage class, a power, or an importance rating of at least one power network asset of the plurality of power network assets.
The method of any one of embodiments 54 to 60,
wherein the missing data replacement procedure is performed to determine a substitute value for a ThruFault of at least one power network asset of the plurality of power network assets.
The method of any one embodiments 54 to 61,
wherein at least one power network asset of the plurality of power network assets comprises an insulation system, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter relating to the insulation system.
The method of embodiment 62,
wherein the insulation system comprises an oil insulation system.
The method of embodiment 63,
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter selected from a group consisting of: an oil interfacial tension, an oil dielectric strength, an oil power factor, moisture in oil of insulating oil of the oil insulation system, and a system type of the oil insulation system.
The method of embodiment 63 or embodiment 64,
wherein the missing data replacement procedure is performed to determine a substitute value for a concentration of at least one gas dissolved in insulating oil of the oil insulation system.
The method of embodiment 65,
wherein the at least one gas is selected from a group consisting of: H2, CH4, C2H2, C2H4, C2H6, CO, CO2, O2, and N2.
The method of any one of embodiments 62 to 66,
wherein the insulation system comprises a gas insulation system.
The method of any one of embodiments 54 to 67,
wherein at least one power network asset of the plurality of power network assets comprises a winding, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter of the winding.
The method of embodiment 68,
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter selected from a group consisting of: a winding power factor, a winding capacitance, and a winding temperature.
The method of any one of embodiments 54 to 69,
wherein at least one power network asset of the plurality of power network assets comprises a bushing, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter of the bushing.
The method of embodiment 70,
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter selected from a group consisting of: a bushing power factor, a bushing capacitance, and a bushing type of the bushing.
The method of any one of embodiments 54 to 71,
wherein at least one power network asset of the plurality of power network assets comprises a cooling system, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter of the cooling system.
The method of embodiment 72,
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter selected from a group consisting of: a condition of the cooling system and a cooling system type of the cooling system.
The method of any one of embodiments 54 to 73,
wherein at least one power network asset of the plurality of power network assets comprises a load tap changer, and
wherein the missing data replacement procedure is performed to determine a substitute value for at least one parameter of the load tap changer.
The method of embodiment 74,
wherein the missing data replacement procedure is performed to determine a substitute value for a condition of the load tap changer or a load tap changer type of the load tap changer.
The method of any one of embodiments 54 to 75,
wherein the missing data replacement procedure is performed to determine a substitute value for a load coupled to at least one power network asset of the plurality of power network assets.
The method of any one of embodiments 54 to 76,
wherein the machine learning algorithm is selected from a group consisting of linear algorithms, nonlinear algorithms, and ensemble algorithms.
The method of embodiment 77,
wherein the machine learning algorithm is a linear algorithm selected from a group consisting of general linear regression (GLM) and linear discriminant analysis (LDA).
The method of embodiment 77,
wherein the machine learning algorithm is a nonlinear algorithm selected from a group consisting of classification and regression trees (CART), a Naïve Bayes algorithm (NB), Bayesian networks, K-nearest neighbor (KNN), and a support vector machine (SVM).
The method of embodiment 77,
wherein the machine learning algorithm is an ensemble algorithm selected from a group consisting of random forest, tree bagging, an extreme gradient boosting machine, and artificial neural networks.
The method of any one of embodiments 54 to 80,
wherein the missing data replacement procedure is selected from a group consisting of the following procedures:
The method of any one of embodiments 54 to 81,
wherein the missing data replacement procedure comprises determining the at least one substitute parameter value using a multivariate regression or using a Pearson correlation.
The method of embodiment 82, further comprising
determining a multivariate correlation or the Pearson correlation based on the training data.
The method of any one of embodiments 54 to 83,
wherein the plurality of power network assets comprises a plurality of transformers.
The method of embodiment 84,
wherein the plurality of transformers comprises power transformers, distribution transformers, or high voltage transformers.
The method of embodiment 84 or embodiment 85,
wherein the training data comprise historical operational parameters of the plurality of transformers.
Machine-readable instruction code comprising instructions which, when executed by an electronic computing device, cause the computing device to perform the method of any one of embodiments 54 to 86; optionally wherein the machine-readable instruction code is stored in a tangible storage medium.
The methods, devices, power networks, and computer-readable instruction code according to embodiments of the invention addresses the need for condition classification tools that can process a large number of inputs, while providing good classification results for a condition classification of a power network asset for which not all of the required parameter values are available. The methods, devices, power networks, and computer-readable instruction code according to embodiments also allow information to be provided on how a missing data replacement strategy affects the confidence level of the obtained condition classification result.
While exemplary embodiments have been explained with reference to the drawings, modifications and alterations may be implemented in other embodiments. The methods, devices, power networks, and computer-readable instruction code may be used for condition classification of power network assets other than power transformers. Machine learning models and/or missing data replacement procedures different from the ones discussed herein in detail may be used in further embodiments.
As will be understood by the skilled person, the embodiments disclosed herein are provided for better understanding and are merely exemplary. Various modifications and alterations will occur to the skilled person without deviating from the sprit and scope of the invention.
While the invention has been described in detail in the drawings and foregoing description, such description is to be considered illustrative or exemplary and not restrictive. Variations to the disclosed embodiments can be understood and effected by those skilled in the art and practicing the claimed invention, from a study of the drawings, the disclosure, and the appended claims. In the claims, the word “comprising” does not exclude other elements or steps, and the indefinite article “a” or “an” does not exclude a plurality. The mere fact that certain elements or steps are recited in distinct claims does not indicate that a combination of these elements or steps cannot be used to advantage, specifically, in addition to the actual claim dependency, any further meaningful claim combination shall be considered disclosed.
Number | Date | Country | Kind |
---|---|---|---|
18173698.4 | May 2018 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2019/051421 | 1/22/2019 | WO | 00 |