The invention relates to determining recommended maintenance intervals and particularly maintenance intervals for a fleet of units based on the operational profile of each unit.
Power generation units, such as the units responsible for creating electrical power for utility companies, are regularly tasked with operating continuously for extended periods at high levels. The units need to toe considered to have a high level of reliability. Maintaining reliability under these requirements may be difficult since the components of power generation units are prone to deterioration with use, which may include wear, fatigue, cracking, oxidation, and other damage, and require regular maintenance. Deterioration that can lead to failure of a unit may be referred to as a failure mode.
Power generation units generally include large components, many of which rotate in use and operate under extreme conditions, and are also subjected to large mechanical and electrical loads. Without proper maintenance, these units may degrade due to wear and will ultimately fail if not properly maintained. To avoid failure, the power generation units are taken off-line for periodic repair and maintenance under a regular schedule.
Scheduling maintenance and repair of power generation units often includes setting an operational period during which a unit is continuously operated, the operational period lasting from a first event which takes the unit off-line for repair and/or maintenance until a second event which also takes the unit off-line for repair and/or maintenance. Determining the length of the operational period usually involves balancing the requirement for reliable operation of the unit and a need for continuous and extended operation.
To perform repair and/or maintenance work on a power generation unit, the unit must be taken off-line. While off-line, the unit is not generating power for a power grid. If the operational period is allowed to be too long, the power unit may unexpectedly fail during operation. Such unexpected failure results in an unplanned outage of power generation, which reduces power output of a plant relying on the unit and may have a high cost in both monetary and human capital due to the suddenness and immediacy of the need for repair. On the other hand, shortening the operational period reduces the amount of time during which the unit is gene-rating power between repair and maintenance events and therefore reduces the total amount of power generated by the unit over an extended period such as the life of the power generation unit due to the more frequent off-line sessions. Another way to describe taking a unit offline is reducing “availability,” which may be described mathematically. Mathematically, “1” represents being available for an entire calendar year, which is 365 days. “0” would represent a unit which is offline for an entire year and is thus available for 0 days out of a year. To mathematically represent when a particular unit is available, a calculation may be made in the form of (1-(total downtime divided by total calendar time)), where total downtime is the sum of planned and unplanned downtime. Conventionally, a calculated likelihood of failure is used to determine an appropriate operational period for an industrial power generation unit that extends between planned off-line maintenance and repair events, The calculated likelihood of failure is then used to balance the requirement of maintaining a reliable power unit with the need for generated power.
The calculated likelihood of failure may be characterized as a likelihood of the power generation unit suffering an unplanned outage. A power unit suffers an unplanned outage when the unit is taken off-line at a time other than a scheduled off-line period. The unplanned outage is typically due to a failure of a power generation unit operating in the field such as in a power generation plant.
Unplanned outage likelihood is traditionally determined based on historical data of actual field failures of power generation units. Actual field failures are useful for estimating the likelihood of failure, but do not accurately account for all potential failure modes of the power generation unit. Some failure modes are not reflected in the historical failures of units.
To model these other failure modes, a “lurking model” has conventionally been used. For example, the likelihood of previously unseen failure modes may be estimated using the lurking model. The lurking model approach is crude and is based on a hypothetical analysis of unforeseen and unseen modes of failures. A lurking model takes into account the likelihood associated with those unknown failure mode(s) of the system or system components which may occur in the future if the system or component is allowed to operate beyond the current operating experience. It is usually estimated using a Weibayes model, which is a type of Weibull model with no failure points, and in which a shape parameter (also known as “beta”) is assumed. A value of beta may be relatively high and may be in the range of 3 to 4. Generally, beta may be in the range of 1 to 4. The hypothetical analysis used in the lurking model may not anticipate actual unforeseen failure modes of a power generation unit.
Conventionally, maximum intervals of operation may also be based on either the maximum amount of hours of operation or a maximum number of starts of a particular unit. The use of a maximum interval for hours or a maximum interval for starts results in an operational metric varying as a function of the operational profile of a specific unit in the fleet. The further a specific unit is from the maximum hours or starts limit, the less optimally it is being serviced. In other words, the unit could be operated for more hours or cycles than is allotted under current recommendations.
Therefore, at a fleet level, units are being prematurely serviced due to conventionally recommended maintenance intervals. Conventional maintenance intervals, which are fixed, do not take into account the behavior of desired operational metrics as a function of unit operation. In some conventional solutions, an elliptical relationship is assumed between number of starts of systems and hours a system is running. In those elliptical solutions, units are often recommended to be serviced prematurely for part of the operational profile spectrum and conversely are recommended to be serviced later than the appropriate interval.
There is a long felt and unsolved need for enhanced systems and methods to accurately assess the correct operational period for each individual unit within a fleet of units so as to maximize their value and minimize losses due to their being taken off-line unexpectedly.
A method for determining recommended maintenance intervals for a fleet of units is based on the operational profile of each unit. Each unit in the fleet is run to a constant value of an operational metric regardless of operational profile. For example, operational metrics may include at least a constant likelihood of unplanned maintenance, costs of unplanned maintenance, reliability, availability, or even total lifecycle costs (including unplanned outage costs, repair costs, and fallout costs, etc.). Failure mode and effects analyses (FMEA) may be leveraged to ensure that all known and hypothetical failure modes are properly accounted for.
A method for determining recommended maintenance intervals for a fleet of units is based on each unit in the fleet being run to the same value of an operational metric. As noted above, operational metrics may include a likelihood of unplanned maintenance, costs of unplanned maintenance, reliability of an individual unit, availability of replacement parts or replacement units, and total lifecycle costs for an individual unit or a fleet of units.
Operational metrics may be directly linked to reliability models and/or data at the failure mode level for each component or subsystem of interest.
As part of determining an appropriate operational period, models are created at the failure mode level for each component of interest. Typically, these models would result in the probability of an unplanned maintenance event for a given failure mode as a function of an operational parameter. These models may also include data for the consequences of the failure mode if it occurs, e.g., an event duration or cost of repair, etc.
Models for all of the individual failure modes are combined to result in an operational metric of interest. An operational, metric “of interest” is a metric which a business wishes to maintain constant for each unit in the fleet. Examples of an operational metric of interest include, but are not limited to, a likelihood of unplanned maintenance, a cost of unplanned maintenance, reliability of a unit, availability of a unit, and a total lifecycle cost.
Based on the models, the maintenance interval, i.e., the operational period, may be adjusted such that each operational profile results in the same value of the operational metric. For example, a curve may be generated which compares a number of starts with hours in operation for a particular unit or a fleet of units. Such a curve may indicate a maintenance interval for any given hours to starts ratio.
Such a system may be re-evaluated within a boundary embodied in the above-discussed curve. Re-evaluation may be accomplished using a design evaluation tool such as Failure Modes and Effects Analysis (FMEA). Such a re-evaluation may determine if any new, hypothetical failure mode models should be added, given the new expanded boundary for the maintenance interval (relative to conventional methods). If any new failure models are identified, then their corresponding models are added and new models are recreated for all of the individual failure modes as discussed above.
As a result, a recommended operational interval for each unit in a fleet of units is generated based on its individual operating profile, the individual operating profile maintaining the same value of an operating metric across all units in the fleet.
Regarding reliability, empirical data of actual wear, degradation, and damage occurring in the machine may be used to enhance the modeling of machine failure modes. The model may be used to calculate the reliability of the machine and determine an optimal period between off-line maintenance and repair sessions. The model combines data from historical field failures and potential, also referred to as unforeseen, failures based on monitoring the machine, such as by boroscopic inspections.
Reliability data may come in different forms, each of which nay require a different tool to measure, capture, and/or collect. Reliability data may include at least; A) Operational data, which may be automatically collected from each unit and/or may be transmitted to a central monitoring system; B) Cost data, which may be related to unplanned downtime and may be captured in a financial accounting system; and C) Field engineering reports, which may describe failure modes, including those that have occurred in the past. These examples are illustrative and non-limiting.
Reliability data may be recorded on a variety of storage devices, which may function as intermediate storage devices prior to the data being consolidated into a final storage device. The intermediate and/or consolidated data may be used to create reliability models at the failure mode level. Storage devices may include at least: A) a first database system which may be used to store operational data automatically collected from each unit and/or transmitted to a central monitoring system; B) a second database system which may foe used to store financial accounting data; C) a server which may be used to store field reports that may have been created using conventional office software programs, such as word processing programs, and the reports may be created and/or stored as individual files; and D) a third database system may store reliability data compiled from multiple databases and/or storage devices. The databases may foe intermediate databases. These examples are illustrative and non-limiting.
Analysis devices may be used to create models of data which has been collected regarding units within a fleet and/or the fleet as a whole. The analysis devices may be used to create models at the failure mode level for each component of interest from the reliability data. After models have been created, one or more analysis devices may be used to apply the created model(s) to predict future reliability, availability, unplanned cost, and other operational considerations. One or more analysis devices may also be used to evaluate those models in order to calculate a recommended fleet maintenance interval. For example, analysis devices may include at least: A) statistical analysis software used by a reliability engineer to iteratively create models at the failure mode level for each component of interest; B) spreadsheet-based software tools; C) stochastic simulation software; and D) personal computer, server, or cloud-based simulation software. These examples are illustrative and non-limiting.
A power generation unit may include a gas turbine, a steam turbine, or another power generation device.
With the target operational metric obtained in step 103, the operational profile may then be incremented in step 104. In the incrementation of step 104, the maintenance interval may be adjusted such that each operational profile results in the same value of the operational metric. After the incrementation of step 104, an operational metric may be calculated at the new operational profile In step 105. After the new operational metric is calculated in step 105, the operational metric is compared with a target operational metric in step 106. If the operational metric calculated in step 105 is not equal to a target operational metric, the system may enter an optimization loop, whereby the maintenance interval is adjusted in step 107 followed by a new calculation of an operational metric at the newest operational profile in step 105.
If, on the other hand, the operational metric is found to be equal, within a desired tolerance (for example, +/−0.1%), to the target operational metric in step 106, the system may then consider whether other operational profiles need to be evaluated as part of the process of determining maintenance intervals in step 108. If the system or an operator of the system determines that other operational profiles need to be evaluated, the operational profile may again be incremented as in step 104. If the operational profile is again incremented in step 104, a new operational metric is again calculated at the newest operational profile in step 105, followed by the comparison in step 106.
If, on the other hand, it is determined that no other operational profiles need to be evaluated in step 108, the system may then be evaluated within a new recommended maintenance interval using, e.g., FMEA procedures, in step 109.
After an evaluation of the system in step 109, the system or an operator of the system determines whether any other failure modes need to be modeled due to extended operation in step 110. If step 110 determines that yes, additional failure modes need to be modeled, the new additional models are combined with ail the other models already considered in step 102. The system then proceeds through the steps outlined above with the new additional model(s) combined in step 102. If, however, step 110 determines that no other failure modes need to be modeled or considered, the system is completed and a maintenance interval is determined in step 111.
Line 208 illustrates an example “unit A” being considered for when to take unit A offline for scheduled maintenance. Point 209 illustrates the conventional point at which unit A would be taken offline, while point 210 illustrates a point according to the present technology when unit A would be taken offline. According to conventional methods, as soon as unit A had been started 1200 times, the unit must be taken offline, regardless of how many hours the unit had actually been in operation.
Line 211 illustrates, another example “unit B” being considered for when to take unit B offline for scheduled maintenance. According to conventional methods, as soon as unit B reached 32000 hours in service, regardless of the number of fired starts, unit B would be taken offline. Similarly, point 212 illustrates a point at which unit B would be taken offline for maintenance according to conventional methods, while point 213 illustrates a point according to the present technology when unit B would be taken offline.
As illustrated in
Similar to
While the invention has been described in connection with what is presently considered to be the most practical and preferred embodiment, it is to be understood that the invention is not to be limited to the disclosed embodiment, but on the contrary, is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.