This application claims the priority benefit of Indian application No. 202341039719, filed Jun. 9, 2023, which is incorporated herein, in its entirety, by reference.
This disclosure generally relates to forecasting transformer fault based on dissolved gases concentration and their rate of change in a transformer.
Transformer oil dissolved gas analysis is a useful, predictive, and effective way for evaluating transformer health. The breakdown of electrical insulating material and related components inside a transformer may generate gases that may be indicative of transformer faults, so detecting the gases generated may allow for transformer maintenance.
A method for using forecasting power transformer faults may include: receiving, by at least one processor of a device, dissolved gas data of power transformers; receiving, by the at least one processor, features of the power transformers; generating, by the at least one processor, based on the features and the dissolved gas data, clusters of transformers, wherein each respective cluster of the clusters comprises transformers exhibiting similarities to each other; selecting, by the at least one processor, for each respective cluster and for each respective gas of the dissolved gas data, from among multiple machine learning models trained to minimize a difference between a forecast gas concentration and an actual gas concentration, a machine learning model associated with forecasting concentration of the respective gas of the respective cluster; generating, by the at least one processor, using a selected machine learning model, a forecasted concentration of the respective gas of the respective cluster; estimating, by the at least one processor, using forecasted gas concentration, a ROC of the respective gas of the respective cluster; predicting, by the at least one processor, based on a comparison of the forecasted gases and their ROC to their alarm threshold, a future fault of a transformer; and generating, by the at least one processor, an alert indicating the transformer maintenance is required enabling proactive maintenance.
A device for using forecasting power transformer faults, the device comprising memory coupled to at least one processor that may: receive dissolved gas data of power transformers; receive features of the power transformers; generate, based on the features and the dissolved gas data, clusters of transformers, wherein each respective cluster of the clusters comprises transformers exhibiting similarities to each other; select, for each respective cluster and for each respective gas of the dissolved gas data, from among multiple machine learning models trained to minimize a difference between a forecast gas concentration and an actual gas concentration, a machine learning model associated with forecasting concentration of the respective gas of the respective cluster; generate, using a selected machine learning model, a forecasted gas concentration of the respective gas of the respective cluster; estimate, using forecasted gas concentration, a ROC of the respective gas of the respective cluster; predict, based on a comparison of the forecasted gases and their ROC to their alarm threshold, a future fault of a transformer; and generate an alert indicating the transformer maintenance is required enabling proactive maintenance
A system for using forecasting power transformer faults may include: a dissolved gas analyzer device; and memory coupled to at least one processor, the at least one processor may: receive, from the dissolved gas analyzer device, dissolved gas data of power transformers; receive features of the power transformers; generate, based on the features and the dissolved gas data, clusters of transformers, wherein each respective cluster of the clusters comprises transformers exhibiting similarities to each other; select, for each respective cluster and for each respective gas of the dissolved gas data, from among multiple machine learning models trained to minimize a difference between a forecast gas concentration and an actual gas concentration, a machine learning model associated with forecasting concentration of the respective gas of the respective cluster; generate, using a selected machine learning model, a forecasted gas concentration of the respective gas of the respective cluster; estimate, using forecasted gas concentration, a ROC of the respective gas of the respective cluster; predict, based on a comparison of the forecasted gases and their ROC to their alarm threshold, a future fault of a transformer; and generate an alert indicating the transformer maintenance is required enabling proactive maintenance
To easily identify the discussion of any particular element or act, the most significant digit or digits in a reference number refer to the figure number in which that element is first introduced.
Certain implementations will now be described more fully below with reference to the accompanying drawings, in which various implementations and/or aspects are shown. However, various aspects may be implemented in many different forms and should not be construed as limited to the implementations set forth herein; rather, these implementations are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art. Like numbers in the figures refer to like elements throughout. Hence, if a feature is used across several drawings, the number used to identify the feature in the drawing where the feature first appeared will be used in later drawings.
Power transformer requires various maintenance tasks that include preventive maintenance and breakdown maintenance. Dissolved Gas Analysis (DGA) is useful in detecting and predicting transformer faults determining the maintenance on ad-hoc basis that is as required. However, DGA may generate false positive alarms that incorrectly indicate a transformer fault based on the presence of gases in transformers. Alarm thresholds used to detect transformer fault based on gas levels often are set manually based on established standards. For example, the IEEE and IEC standards organizations have set fixed alarm thresholds for gas concentration and their rate of change (ROC), such as IEEE-C57.104-2019. For example, the IEEE-C57.104-2019 standard recommends static alarm thresholds that are generic based on its own transformer network or fleet historical data instead of those that are specific to a transformer. The IEC 60599-1999 standard provides the following alarm thresholds for transformer gas ROC, as shown in Table 1.
However, the static alarm thresholds may result in false positives and may not be robust for grids with penetration of distributed energy resources, decarbonization, different types of loads being added, different loading characteristics of transformers, and different transformer manufacturing. Fixed alarm thresholds used in the standards may be transformer-agnostic and based on the 90th/95th percentile of thousands of same types of transformers connected in a network. As a transformer ages or experiences significant or frequent load changes, the generated gas concentrations tend to rise naturally. A possible result is a false fault alarm based on the fixed alarm thresholds resulting in transformer forced shutdown for maintenance and repair
To avoid fault conditions and transformer breakdown, proactive maintenance may be performed based on forecasting the dissolve gases concentrations and their rate of change. Proactive maintenance may provide duration for which transformers may operate reliably and indicates the time for their servicing or replacement.
Transformer gas concentration may depend on a variety of factors such as seasonality (e.g., environmental factors such as temperature, humidity, etc.), historical operation and maintenance of the transformer, degradation severity, operational variations, and the like. Such factors should be accounted for to ensure accurate transformer condition estimation and fault prognosis, likely requiring extensive historical and repetitive gas concentration data. There are approaches that may include exponential smoothing with a seasonal effect, autoregressive integrated moving average (ARIMA), seasonal ARIMA (SARIMA), and the like to forecast time-dependent features such as gas concentrations. Additionally, machine learning models like Prophet and DeepAR may also be used to forecast the dissolved gases concentrations. ROC of dissolved gases concentration may not be measured directly and may be calculated by using linear approximation or any other technique. It may vary significantly based on the number of gas concentration samples in a time window (i.e. chosen to compute ROC) and may have no seasonality for any time window duration, so forecasting ROC directly is not possible. However, absolute gas concentration shows seasonality, so forecasting absolute gas concentrations and estimating gas concentration ROC from the forecasted gas concentrations may provide an accurate fault prediction.
Time-series forecasting may include time-series components such as trend, cyclical, irregular, and seasonal (SARIMA). ARIMA may be represented by (p, d, q) (P, D, Q)m where (p, d, q) is the non-seasonal portion of the model, and (P, D, Q)m is the seasonal portion of the model, m is the number of periods per season, p, d, q are non-negative integers, P is autoregression, d is integrated (e.g., differencing of observations), and 4 is a moving average. Error metrics for evaluating a mode may include mean absolute error, mean square error, and root-mean-square error (RMSE), where
where y is the real value of the test data, and y′ is the predicted value from the forecast. Other Time-series forecasting approaches may comprise of Prophet model, which is a statistical model capable of modelling trends and different seasonality components and DeepAR model which is based on recurrent networks.
In one or more embodiments, the enhanced techniques herein consider other approaches to forecast (e.g., predict) future gas concentrations in transformers by identifying trends in gas concentrations and comparing the forecasted gas values to transformer alarm thresholds. Machine learning models such as Prophet and DeepAR may be trained and tested to forecast the gas concentrations. Linear approximation may estimate gas concentration ROC. The proposed methodology herein forecasts the dissolved gas concentrations and their gradient (i.e. ROC) and predicts fault by comparing them with thresholds values from standards and/or adaptive thresholds. The enhancements herein enable proactive maintenance of transformers and reliable grid operation, may provide a duration for which transformers can operate reliably, and may indicate a time for transformer servicing or replacement.
As a result of the enhanced techniques herein, fault protection may be improved, which may optimize scheduled transformer maintenance and may extend the life of the transformer. In addition, maintenance may be reduced and grid reliability may be increased.
In one or more embodiments, a respective transformer gas may use a best model selected from among multiple models that may forecast gas concentration. The performance of the models may be assessed across one or more transformers, and the best model for the gas may be selected for use in forecasting the gas concentration. A system may pre-process historical gas concentration data from transformers (e.g., by removing singular values, transforming unequal interval series to equal interval series by linear interpolation, normalizing the data). The system may cluster transformers by identifying groups of transformers exhibiting similar behavior to test the machine learning models. The transformer clustering may be based on transformer features such as operation, oil condition, transformer state, and dissolved gas concentration history. The system also may match a newly added transformer (e.g., newly added to a grid) to an already-identified cluster. Transformer clustering also may be based on a transformer configuration (e.g., nameplate information), a daily load profile, and oil temperature.
In one or more embodiments, for the respective transformer clusters, the system may train and test the machine learning models for respective gases, and may evaluate the models for their performance in forecasting gas concentration. The model that is the most accurate at forecasting for a given gas may be selected for the cluster. The machine learning models may include statistical algorithms and deep learning models, with the different models exhibiting different strengths. For example, the models may include persistence models and time-series models. The machine learning models may include statistical models like Prophet, capable of modeling trends and seasonality components, autoregressive deep learning models, recurrent networks like DeepAR for time-series forecasting, and ARIMA (e.g., for an initial assessment).
In one or more embodiments, the transformer clustering may be based on principal components analysis (PCA) of the transformers plus k-means clustering. For example, based on a selection of principal components, transformer PCAs may exhibit a cumulative sum of variance ratio, which may be used to identify similar transformers to be included in a given cluster. For nine transformer gases, for example, a PCA may be applied to reduce the dimensions from nine to three (the dimension reduction size is not limited, as this is an example). The transformer clusters may be generated within the three-dimensional space from the PCA dimension reduction to cluster similar transformers. The number of clusters may vary, so the clustering may result in an optimal number of clusters based on the transformer similarities.
In one or more embodiments, to select the best model for a cluster for a respective gas, the system may evaluate the number of times a model has the best performance (e.g., lowest root-mean-square deviation RMSE) across the transformers in the cluster. The best performance analysis may be based on a comparison of a model's forecast to actual gas concentration data for the same time period (e.g., lower RMSE for a model indicates stronger performance). A model may generate a forecast for the gas concentration, which the system may use to forecast the ROC of the gas concentration. To estimate ROC, a time interval is selected based on statistics and heuristics. The time interval may be a sliding time window that could be fixed or variable. The system may determine an optimal next scheduled maintenance time for a transformer using the forecasted gas concentration and their ROC estimation over the time interval. Moreover, the system may estimate the optimal window size using an auto-feedback mechanism for any gas and cluster type to fine-tune the estimated ROC accuracy. Both forecasted gas parts per million (PPM) and their ROC may be compared with their alarm thresholds (that could be static or adaptive), respectively, to accurately forecast and monitor transformer condition in the future. The forecasting of critical transformer parameters (i.e. gas concentration and their ROCs) and their comparison with alarm threshold enables determination of accurate time available for maintenance/time to possible fault occurrence/time to over-heating condition/time to replace. The system may use the feature forecast from the selected model to forecast if and when a future fault (e.g., transformer fault related to high gas concentration) may occur.
Forecasting gas ROC data may be affected by ROC time window duration and the minimum number of samples. The sample interval may determine the ROC time window duration. The lower the ROC time window, the more false alarms there may be. The ROC time window therefore may be set to at least 48 hours for less ROC volatility. For a sample interval greater than one day, the ROC forecast may be in ppm/month or ppm/week (e.g., to account for multiple days/samples). The sample interval may be selected to provide at least 48 samples (n in the ROC formula), so the ROC time window may depend on the sample interval.
In one or more embodiments, the gas concentration forecasting may use a direct approach or a recursive approach. In the direct approach, the predicted gas concentration may not be dependent on a previous prediction. In the recursive approach, the predicted gas concentration for a time interval may depend on a prediction for another time interval.
In one or more embodiments, the transformer fault alarms may use adaptive alarm thresholds for ROC of gas for transformers in operation. The present disclosure provides multiple new techniques for setting and using adaptive thresholds and for determining a gas concentration ROC. A dissolved gas analyzer may provide a gas concentration in parts-per-million (ppm), and from the gas concentration, the gradient (ROC) may be calculated. Adaptive alarm thresholds may be used for both the gas concentration and the ROC. The gas concentrations from DGA may be compared to static or adaptive alarm thresholds, and when the gas concentration exceeds a threshold, a probable fault condition may be hypothesized. To confirm the fault condition, the ROCs may be compared to the adaptive alarm thresholds to minimize false alarms. In this manner, the fault detection and confirmation may be based on comparing both gas concentration and ROC of gas concentration to alarm thresholds, with at least the ROC alarm thresholds being adaptive.
The benefits of the enhanced techniques of the present disclosure include proactive maintenance by predicting next optimal maintenance schedule, determination of accurate time available for maintenance/time to possible fault occurrence/time to over-heating condition/time to replace, improved anomaly detection, reducing the need to set alarm thresholds manually for transformers, and reducing computational expense (e.g., allowing for the techniques to be applied on a variety of devices). In contrast with some techniques that use an established alarm threshold (e.g., 95th percentile), the enhanced adaptive alarm thresholds herein may use prior knowledge from the fixed thresholds of the standards to determine the adaptive thresholds more adaptively. For reference, a fixed-size window tw may slide over a device data frame and its 95th percentile generates a threshold τ that may vary as the window slides: τ=f(β, tw) 95th percentile of βt
In one or more embodiments, a DG analyzer may sample gases in a transformer at any sampling rate (e.g., the enhanced techniques are agnostic to sampling rate). The DG analyzer may determine gas concentrations of respective gases and their ROCs. The ROC calculation may use Equation (1): p=βt+α, where p is the concentration (ppm) of a gas target measurement, β is the ROC formula, t is the time from initial zero start time, and α is a vertical axis intercept. After determining the ROC for a gas, the DG analyzer may apply one of the data-driven techniques disclosed herein to generate ROC-adaptive alarm thresholds. The DG analyzer may apply the ROC-adaptive alarm thresholds and the standard-based thresholds to the data stream of gas data to determine whether a subsequent ROC exceeds a ROC-adaptive alarm threshold.
In one or more embodiments, the DG analyzer may use ROC limits from the standard, that is prior knowledge of thresholds and may add them to an adaptive threshold. Using the example of Table 1 above, the standard-based Hydrogen alarm threshold is 5 mm/day. Alternatively, the DG analyzer may operate with no prior knowledge of thresholds from the standard to an adaptive threshold (e.g., ignore the 5 mm/day threshold for Hydrogen). With no prior knowledge of thresholds from the standard, the DG analyzer may generate an adaptive threshold for a gas ROC. The DG analyzer may use some previous ROC values for a gas using a static or variable sliding time window of ROC values (e.g., use the last n ROC values, or use a variable window of ROC values at each sampling rate). The variable time window reduces ROC computations unlike the case of fixed time window where a new ROC value is calculated at every sampling rate, The DG analyzer may increase or reduce the sliding time window size (i.e., the number of ROC values) when the DG analyzer detects no change or a change in gas concentration respectively. When the window of ROC samples is variable or fixed, the DG analyzer may use the L2 norm (or another norm) of the ROC or difference of two consecutive sorted ROCs referred to as Delta ROC, mu+α*SD, where mu is the mean and SD is standard deviation of the ROC or the Delta ROC, and α is the multiplication factor with a default value of 2, or mu_w+SD_w with a weighted mu and weighted standard deviation. Some of the methodologies with a fixed window of ROC samples are shown below in Table 2.
One approach shown in Table 2 is to ignore the prior knowledge from the standard and directly apply the estimated thresholds to detect fault. However, this may not help in achieving robustness and may still result in false alarms. Therefore, another approach includes the prior knowledge in adaptive threshold estimation. The threshold values from the standard may be added to the estimated thresholds. With prior knowledge of thresholds from the standard, the threshold may be set according to: τ=τt
The statistical methods in Table 2 may be applied on ROC, beta and Delta ROC, and deltabeta time series whose length is determined by tw that could be fixed or variable. The delta ROC may be calculated by taking the difference between two consecutive sorted ROC either in ascending or descending order as shown in the equations: βt
The enhanced adaptive alarms herein add a methodology for computing gas concentration gradient by selecting a time window (e.g., n=1-96 hours), computing adaptive thresholds for gas concentration gradient using one of the adaptive methods above, some of which may use prior knowledge of standards-based thresholds, and improve the fidelity of fault detection for transformers based on gas concentration and their gradients.
The above descriptions are for purposes of illustration and are not meant to be limiting. Numerous other examples, configurations, processes, etc., may exist, some of which are described in greater detail below. Example embodiments will now be described with reference to the accompanying figures.
Referring to
Still referring to
Referring to
Referring to
Referring to
Still referring to
Referring to
Referring to
Referring to
may be less than or equal to 96 hours as shown, where alpha may be represented by
In one or more embodiments, the system (e.g., the diagnostic modules 112) may identify a sample interval, start with a window size of 10× the sample interval, identify a 3-sigma spread for the window size, and repeat the process until 3-sigma varies within 0.1 to select the ROC time window. The ROC time window size may one that has more than 20 samples (e.g., to minimize volatility such as shown in the plot 702).
In a direct approach 802, the predicted gas concentration may not be dependent on a previous prediction (e.g., yt+n may not depend on yt+2, which may not depend on yt+1, etc.). In a recursive approach 804, the predicted gas concentration for a time interval may depend on a prediction for another time interval (e.g., yt+2 may depend on yt+1, yt+n may depend on yt+2, etc.). The time-series forecasting of
Referring to
and alpha may be represented by
where n=number of data points, and it is determined by time window of the gas concentrations to calculate the ROC.
Still referring to
At block 1002, a device (or system, e.g., using the diagnostic modules 112 of
At block 1004, the process 1000 may continue at
At block 1006, when there is prior knowledge of the standard-based alarm thresholds, the process 1000 may continue at block 1008. At block 1008, the device may determine ROC alarm limits for dissolved gas concentration from the IEEE/IEC standards.
At block 1010, the device may apply the standard-based alarm thresholds as an initial estimation, and at block 1012, the device may add the standard-based alarm thresholds to an adaptive alarm threshold from the ROC. At block 1014, the device may update the adaptive alarm threshold based on subsequent insights from received DGA data. At block 1016, when the window is variable, the process 1000 may return to block 1002. When the window is fixed, the process 1000 may return to block 1004.
At block 1018, the device may determine the gas concentration ROC based on a sliding time window. At block 1020, the device may use the gas concentration ROC or may calculate a delta ROC.
At block 1022, the device may apply the norm on the ROC or delta ROC to generate the ROC alarm threshold.
Alternatively, at block 1024, the device may apply a mean and standard deviation on the ROC or delta ROC to generate the ROC alarm threshold.
Alternatively, at block 1026, the device may apply weighted mean and standard deviation on the ROC or delta ROC to generate the ROC alarm threshold.
The graph 1100 of
I/O device 1226 may also include an input device (not shown), such as an alphanumeric input device, including alphanumeric and other keys for communicating information and/or command selections to the processors 1202-1206. Another type of user input device includes cursor control, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to the processors 1202-1206 and for controlling cursor movement on the display device.
Machine 1200 may include an adaptive storage device, referred to as main memory 1210, or a random access memory (RAM) or other computer-readable devices coupled to the processor bus 1208 for storing information and instructions to be executed by the processors 1202-1206. Main memory 1210 also may be used for storing temporary variables or other intermediate information during execution of instructions by the processors 1202-1206. Machine 1200 may include a read only memory (ROM) and/or other static storage device coupled to the processor bus 1208 for storing static information and instructions for the processors 1202-1206. The system outlined in
According to one embodiment, the above techniques may be performed by machine 1200 in response to processor 1204 executing one or more sequences of one or more instructions contained in main memory 1210. These instructions may be read into main memory 1210 from another machine-readable medium, such as a storage device. Execution of the sequences of instructions contained in main memory 1210 may cause processors 1202-1206 to perform the process steps described herein. In alternative embodiments, circuitry may be used in place of or in combination with the software instructions. Thus, embodiments of the present disclosure may include both hardware and software components.
A machine readable medium includes any mechanism for storing or transmitting information in a form (e.g., software, processing application) readable by a machine (e.g., a computer). Such media may take the form of, but is not limited to, non-volatile media and volatile media and may include removable data storage media, non-removable data storage media, and/or external storage devices made available via a wired or wireless network architecture with such computer program products, including one or more database management products, web server products, application server products, and/or other additional software components. Examples of removable data storage media include Compact Disc Read-Only Memory (CD-ROM), Digital Versatile Disc Read-Only Memory (DVD-ROM), magneto-optical disks, flash drives, and the like. Examples of non-removable data storage media include internal magnetic hard disks, SSDs, and the like. The one or more memory devices may include volatile memory (e.g., adaptive random access memory (DRAM), static random access memory (SRAM), etc.) and/or non-volatile memory (e.g., read-only memory (ROM), flash memory, etc.).
Computer program products containing mechanisms to effectuate the systems and methods in accordance with the presently described technology may reside in main memory 1210, which may be referred to as machine-readable media. It will be appreciated that machine-readable media may include any tangible non-transitory medium that is capable of storing or encoding instructions to perform any one or more of the operations of the present disclosure for execution by a machine or that is capable of storing or encoding data structures and/or modules utilized by or associated with such instructions. Machine-readable media may include a single medium or multiple media (e.g., a centralized or distributed database, and/or associated caches and servers) that store the one or more executable instructions or data structures.
Embodiments of the present disclosure include various steps, which are described in this specification. The steps may be performed by hardware components or may be embodied in machine-executable instructions, which may be used to cause a general-purpose or special-purpose processor programmed with the instructions to perform the steps. Alternatively, the steps may be performed by a combination of hardware, software and/or firmware.
Various modifications and additions can be made to the exemplary embodiments discussed without departing from the scope of the present invention. For example, while the embodiments described above refer to particular features, the scope of this invention also includes embodiments having different combinations of features and embodiments that do not include all of the described features. Accordingly, the scope of the present invention is intended to embrace all such alternatives, modifications, and variations together with all equivalents thereof.
Number | Date | Country | Kind |
---|---|---|---|
202341039719 | Jun 2023 | IN | national |