The present disclosure relates generally to abnormal situation prevention in a process plant. More particularly, the disclosure relates to automatically determining the order of a polynomial regression model modeling a process control variable as a function of one or more other process control variables.
Process control systems, like those used in chemical, petroleum or other processes, typically include one or more centralized or decentralized process controllers communicatively coupled to at least one host or operator workstation and to one or more process control and instrumentation devices such as, for example, field devices, via analog, digital or combined analog/digital buses. Field devices, which may be, for example, valves, valve positioners, switches, transmitters, and sensors (e.g., temperature, pressure, and flow rate sensors), are located within the process plant environment, and perform functions within the process such as opening or closing valves, measuring process parameters, increasing or decreasing fluid flow, etc. Smart field devices such as field devices conforming to the well-known FOUNDATION™ Fieldbus (hereinafter “fieldbus”) protocol or the HART® protocol may also perform control calculations, alarming functions, and other control functions commonly implemented within the process controller.
The process controllers, which are typically located within the process plant environment, receive signals indicative of process measurements or process variables made by or associated with the field devices and/or other information pertaining to the field devices, and execute controller applications. The controller applications implement, for example, different control modules that make process control decisions, generate control signals based on the received information, and coordinate with the control modules or blocks being performed in the field devices such as HART® and fieldbus field devices. The control modules in the process controllers send the control signals over the communication lines or signal paths to the field devices, to thereby control the operation of the process.
Information from the field devices and the process controllers is typically made available to one or more other hardware devices such as, for example, operator workstations, maintenance workstations, personal computers, handheld devices, data historians, report generators, centralized databases, etc. to enable an operator or a maintenance person to perform desired functions with respect to the process such as, for example, changing settings of the process control routine, modifying the operation of the control modules within the process controllers or the smart field devices, viewing the current state of the process or of particular devices within the process plant, viewing alarms generated by field devices and process controllers, simulating the operation of the process for the purpose of training personnel or testing the process control software, diagnosing problems or hardware failures within the process plant, etc.
While a typical process plant has many process control and instrumentation devices such as valves, transmitters, sensors, etc. connected to one or more process controllers, there are many other supporting devices that are also necessary for or related to process operation. These additional devices include, for example, power supply equipment, power generation and distribution equipment, rotating equipment such as turbines, motors, etc., which are located at numerous places in a typical plant. While this additional equipment does not necessarily create or use process variables and, in many instances, is not controlled or even coupled to a process controller for the purpose of affecting the process operation, this equipment is nevertheless important to, and ultimately necessary for proper operation of the process.
As is known, problems frequently arise within a process plant environment, especially a process plant having a large number of field devices and supporting equipment. These problems may take the form of broken, malfunctioning or underperforming devices, plugged fluid lines or pipes, logic elements, such as software routines, being improperly configured or being in improper modes, process control loops being improperly tuned, one or more failures in communications between devices within the process plant, etc. These and other problems, while numerous in nature, generally result in the process operating in an abnormal state (i.e., the process plant being in an abnormal situation) which is usually associated with suboptimal performance of the process plant. Many diagnostic tools and applications have been developed to detect and determine the cause of problems within a process plant and to assist an operator or a maintenance person to diagnose and correct the problems once the problems have occurred and been detected. For example, operator workstations, which are typically connected to the process controllers through communication connections such as a direct or a wireless bus, an Ethernet, a modem, a phone line, and the like, have processors and memories that are adapted to run software or firmware, such as the DeltaV™ and Ovation™ control systems, sold by Emerson Process Management, wherein the software includes numerous control module and control loop diagnostic tools. Likewise, maintenance workstations, which may be connected to the process control devices, such as field devices, via the same communication connections as the controller applications, or via different communication connections, such as OPC connections, handheld connections, etc., typically include one or more applications designed to view maintenance alarms and alerts generated by field devices within the process plant, to test devices within the process plant and to perform maintenance activities on the field devices and other devices within the process plant. Similar diagnostic applications have been developed to diagnose problems within the supporting equipment within the process plant.
Thus, for example, the AMS™ Suite: Intelligent Device Manager application (at least partially disclosed in U.S. Pat. No. 5,960,214 entitled “Integrated Communication Network for use in a Field Device Management System”) sold by Emerson Process Management, enables communication with and stores data pertaining to field devices to ascertain and track the operating state of the field devices. In some instances, the AMS™ application may be used to communicate with a field device to change parameters within the field device, to cause the field device to run applications on itself such as, for example, self-calibration routines or self-diagnostic routines, to obtain information about the status or health of the field device, etc. This information may include, for example, status information (e.g., whether an alarm or other similar event has occurred), device configuration information (e.g., the manner in which the field device is currently or may be configured and the type of measuring units used by the field device), device parameters (e.g., the field device range values and other parameters), etc. Of course, this information may be used by a maintenance person to monitor, maintain, and/or diagnose problems with field devices.
Similarly, many process plants include equipment monitoring and diagnostic applications such as, for example, Machinery Health™ applications provided by CSI, or any other known applications used to monitor, diagnose, and optimize the operating state of various rotating equipment. Maintenance personnel usually use these applications to maintain and oversee the performance of rotating equipment in the plant, to determine problems with the rotating equipment, and to determine when and if the rotating equipment must be repaired or replaced. Similarly, many process plants include power control and diagnostic applications such as those provided by, for example, the Liebert and ASCO companies, to control and maintain the power generation and distribution equipment. It is also known to run control optimization applications such as, for example, real-time optimizers (RTO+), within a process plant to optimize the control activities of the process plant. Such optimization applications typically use complex algorithms and/or models of the process plant to predict how inputs may be changed to optimize operation of the process plant with respect to some desired optimization variable such as, for example, profit.
These and other diagnostic and optimization applications are typically implemented on a system-wide basis in one or more of the operator or maintenance workstations, and may provide preconfigured displays to the operator or maintenance personnel regarding the operating state of the process plant, or the devices and equipment within the process plant. Typical displays include alarming displays that receive alarms generated by the process controllers or other devices within the process plant, control displays indicating the operating state of the process controllers and other devices within the process plant, maintenance displays indicating the operating state of the devices within the process plant, etc. Likewise, these and other diagnostic applications may enable an operator or a maintenance person to retune a control loop or to reset other control parameters, to run a test on one or more field devices to determine the current status of those field devices, to calibrate field devices or other equipment, or to perform other problem detection and correction activities on devices and equipment within the process plant.
While these various applications and tools are very helpful in identifying and correcting problems within a process plant, these diagnostic applications are generally configured to be used only after a problem has already occurred within a process plant and, therefore, after an abnormal situation already exists within the plant. Unfortunately, an abnormal situation may exist for some time before it is detected, identified and corrected using these tools, resulting in the suboptimal performance of the process plant for the period of time before which the problem is detected, identified and corrected. In many cases, a control operator will first detect that some problem exists based on alarms, alerts or poor performance of the process plant. The operator will then notify the maintenance personnel of the potential problem. The maintenance personnel may or may not detect an actual problem and may need further prompting before actually running tests or other diagnostic applications, or performing other activities needed to identify the actual problem. Once the problem is identified, the maintenance personnel may need to order parts and schedule a maintenance procedure, all of which may result in a significant period of time between the occurrence of a problem and the correction of that problem, during which time the process plant runs in an abnormal situation generally associated with the sub-optimal operation of the plant.
Additionally, many process plants can experience an abnormal situation which results in significant costs or damage within the plant in a relatively short amount of time. For example, some abnormal situations can cause significant damage to equipment, the loss of raw materials, or significant unexpected downtime within the process plant if these abnormal situations exist for even a short amount of time. Thus, merely detecting a problem within the plant after the problem has occurred, no matter how quickly the problem is corrected, may still result in significant loss or damage within the process plant. As a result, it is desirable to try to prevent abnormal situations from arising in the first place, instead of simply trying to react to and correct problems within the process plant after an abnormal situation arises.
One technique collects data that enables a user to predict the occurrence of certain abnormal situations within a process plant before these abnormal situations actually arise or shortly after they arise, with the purpose of taking steps to prevent the predicted abnormal situation or to correct the abnormal situation before any significant loss within the process plant takes place. This procedure is disclosed in U.S. patent application Ser. No. 09/972,078, now U.S. Pat. No. 7,085,610, entitled “Root Cause Diagnostics” (based in part on U.S. patent application Ser. No. 08/623,569, now U.S. Pat. No. 6,017,143). The entire disclosures of both of these applications/patents are incorporated herein by reference. Generally speaking, this technique places statistical data collection and processing blocks or statistical processing monitoring (SPM) blocks, in each of a number of devices, such as field devices, within a process plant. The statistical data collection and processing blocks collect, for example, process variable data and determine certain statistical measures associated with the collected data, such as a mean, a median, a standard deviation, etc. These statistical measures may then be sent to a user interface or other processing device and analyzed to recognize patterns suggesting the actual or future occurrence of a known abnormal situation. Once a particular suspected abnormal situation is detected, steps may be taken to correct the underlying problem, thereby avoiding the abnormal situation in the first place.
Many abnormal situation prevention algorithms rely on some type of regression to model a certain monitored variable as a function of some other load variable. The regression model may be calculated during a training phase in which a set of training data comprising a number of corresponding samples of the load variable and the monitored variable are analyzed to derive a function or curve that best fits the data in the training set. Once the regression model has been calculated the model may be used to predict values of the monitored variable based on measured values of the load variable received during a monitoring phase. The predicted values of the monitored variable may be compared to corresponding measured values of the monitored variable. An abnormal situation may be detected when a predicted value of the monitored variable differs from a corresponding measured value of the monitored variable by more than a predetermined amount.
In many abnormal situation prevention applications, a polynomial regression model is used. The polynomial function modeling the data may be a first order polynomial (linear), a second order polynomial (quadratic), a third order polynomial (cubic), a fourth order polynomial, and so forth. In theory there is no upper limit on the order or degree of the polynomial function. However, practical considerations such as processing time may impose some upper limit on the order of the polynomial. For example, in many applications it may be desirable to set an upper limit pmax on the order of the polynomial. Pmax may be, for example, an integer value between 5 and 10.
In general, the higher the order of the polynomial, the better the polynomial regression model will fit the data in the training set. It is possible, however, to “over fit” the data. In this case, the resulting polynomial function may describe the data in training set very accurately, but may otherwise miss broader more important trends in the data and may not accurately predict values of the monitored variable based on data received in the future monitoring phase. In this case, a lower order polynomial regression may actually be more accurate in predicting values of the monitored variable based on future sampled values of the load variable.
Typically the order of the polynomial regression is a configurable parameter that may be determined by the person setting up the abnormal situation prevention function. Selecting an improper value for the order of the polynomial regression may result in poor results. If the polynomial regression model fails to accurately predict values of the monitored variable based on received samples of the load variable, abnormal situations may be detected when none exists, or abnormal situations that do exist may not be detected. Thus, the person setting up the abnormal situation prevention function must have a thorough understanding of the process and of the statistical analysis underlying the abnormal situation prevention algorithm. If not, it is likely that a polynomial regression of a less than optimal order will be selected, resulting in sub-optimal performance of the abnormal situation prevention system. To avoid this situation, systems and methods are needed for automatically determining an optimal order of a polynomial regression. Such systems and methods should be capable of analyzing a set of training data and determining the proper order of a polynomial regression for the best results in predicting future values of the monitored variable based on future samples of the load variable.
The present disclosure relates to abnormal situation prevention in a process plant. Polynomial regression models are generated to model various monitored process variables as a function of one or more load variables. The models may be used to predict values of a monitored variable based on measured values of a corresponding load variable. An abnormal situation may be detected if a measured value of the monitored variable differs from a corresponding predicted value of the monitored variable by more than a predetermined amount. The regression model is calculated based on a set of training data comprising a plurality of data samples including measured values of the monitored variable and the load variable. The regression model may comprise a polynomial function of a particular order. For example, the regression curve may be linear function (1st order), and quadratic function (2nd order), a cubic function (3rd order), and so forth. The present disclosure relates to systems and methods for automatically determining the optimal order or degree of a polynomial regression for modeling a monitored or dependent variable as a function of a load or independent variable.
According to an embodiment, a process control system includes at least one field device. At least one field device is adapted to measure a first process control variable and a second process control variable. A processor is adapted to determine the optimal order of a polynomial regression for modeling the second process control variable as a function of the first process control variable based on a plurality of corresponding values of the first and second process control variables contained in training data set. According to an embodiment the processor may execute a cross-validation algorithm for determining the optimal order of the polynomial regression. In another embodiment the processor may execute a penalty function for determining the optimal order of the polynomial regression. In still another embodiment, the processor may execute a forward selection algorithm for determining the optimal order of the polynomial regression. In yet another embodiment the processor may be adapted to determine the optimal order of the polynomial regression by calculating a polynomial regression for all polynomial orders between and including a first order polynomial and a predetermined maximum polynomial order. The processor may then calculate an R2 value for each of the polynomial regression. The R2 values indicate how well each polynomial fits the process control data. The optimal polynomial order may be determined by comparing the R2 values of the various polynomials. Finally, the processor may utilize a support vector machine model to determine the optimal order of a polynomial regression. However the optimal order of the polynomial regression is determined, the processor adapted to determine the optimal order of the polynomial regression may be implemented in a process control field device, in a field device interface module, within a F
Another embodiment may comprise a method of creating a polynomial regression model of process control data for preventing abnormal situations in a controlled process. In this case, the method includes receiving a set of training data. The training data comprises a plurality of first process control variable values and a plurality of corresponding second process control variable values. The method further includes determining an optimal order of a polynomial regression for modeling the second process control variable as a function of the first process control variable based on the values of the first process control variable and the second process control variable included in the received set of training data. Finally, the method calls for creating a polynomial regression model of the determined order for modeling the second process control variable as a function of the first process control variable. The step of determining the optimal order of the polynomial regression may comprise performing a cross validation algorithm on the received set of training data. Alternatively, the optimal order of the polynomial regression may be determined by minimizing a penalized risk function, such as a Ridge Regression. In another alternative, the optimal order of the polynomial regression may be determined by executing a forward selection algorithm, or by comparing R2 values for a plurality of polynomial regressions of every order from a first order polynomial regression to a predetermined maximum order polynomial regression.
Further aspects and advantages will be apparent to those of ordinary skill in the art from a review of the following detailed description, taken in conjunction with the drawings. While the compositions and methods are susceptible of embodiments in various forms, the description hereafter includes specific embodiments with the understanding that the disclosure is illustrative, and is not intended to limit the invention to the specific embodiments described herein.
Referring now to
Still further, maintenance systems, such as computers executing the AMS application or any other device monitoring and communication applications may be connected to the process control systems 12 and 14 or to the individual devices therein to perform maintenance and monitoring activities. For example, a maintenance computer 18 may be connected to the controller 12B and/or to the devices 15 via any desired communication lines or networks (including wireless or handheld device networks) to communicate with and, in some instances, reconfigure or perform other maintenance activities on the devices 15. Similarly, maintenance applications such as the AMS application may be installed in and executed by one or more of the user interfaces 14A associated with the distributed process control system 14 to perform maintenance and monitoring functions, including data collection related to the operating status of the devices 16.
The process plant 10 also includes various rotating equipment 20, such as turbines, motors, etc. which are connected to a maintenance computer 22 via some permanent or temporary communication link (such as a bus, a wireless communication system or hand held devices which are connected to the equipment 20 to take readings and are then removed). The maintenance computer 22 may store and execute known monitoring and diagnostic applications 23 provided by, for example, CSI (an Emerson Process Management Company) or other any other known applications used to diagnose, monitor and optimize the operating state of the rotating equipment 20. Maintenance personnel usually use the applications 23 to maintain and oversee the performance of rotating equipment 20 in the plant 10, to determine problems with the rotating equipment 20 and to determine when and if the rotating equipment 20 must be repaired or replaced. In some cases, outside consultants or service organizations may temporarily acquire or measure data pertaining to the equipment 20 and use this data to perform analyses for the equipment 20 to detect problems, poor performance or other issues effecting the equipment 20. In these cases, the computers running the analyses may not be connected to the rest of the system 10 via any communication line or may be connected only temporarily.
Similarly, a power generation and distribution system 24 having power generating and distribution equipment 25 associated with the plant 10 is connected via, for example, a bus, to another computer 26 which runs and oversees the operation of the power generating and distribution equipment 25 within the plant 10. The computer 26 may execute known power control and diagnostics applications 27 such a as those provided by, for example, Liebert and ASCO or other companies to control and maintain the power generation and distribution equipment 25. Again, in many cases, outside consultants or service organizations may use service applications that temporarily acquire or measure data pertaining to the equipment 25 and use this data to perform analyses for the equipment 25 to detect problems, poor performance or other issues effecting the equipment 25. In these cases, the computers (such as the computer 26) running the analyses may not be connected to the rest of the system 10 via any communication line or may be connected only temporarily.
As illustrated in
Generally speaking, the abnormal situation prevention system 35 may communicate with abnormal operation detection systems (not shown in
The portion 50 of the process plant 10 illustrated in
In any event, one or more user interfaces or computers 72 and 74 (which may be any types of personal computers, workstations, etc.) accessible by plant personnel such as configuration engineers, process control operators, maintenance personnel, plant managers, supervisors, etc. are coupled to the process controllers 60 via a communication line or bus 76 which may be implemented using any desired hardwired or wireless communication structure, and using any desired or suitable communication protocol such as, for example, an Ethernet protocol. In addition, a database 78 may be connected to the communication bus 76 to operate as a data historian that collects and stores configuration information as well as on-line process variable data, parameter data, status data, and other data associated with the process controllers 60 and field devices 64 and 66 within the process plant 10. Thus, the database 78 may operate as a configuration database to store the current configuration, including process configuration modules, as well as control configuration information for the process control system 54 as downloaded to and stored within the process controllers 60 and the field devices 64 and 66. Likewise, the database 78 may store historical abnormal situation prevention data, including statistical data collected by the field devices 64 and 66 within the process plant 10, statistical data determined from process variables collected by the field devices 64 and 66, and other types of data.
While the process controllers 60, I/O devices 68 and 70, and field devices 64 and 66 are typically located down within and distributed throughout the sometimes harsh plant environment, the workstations 72 and 74, and the database 78 are usually located in control rooms, maintenance rooms or other less harsh environments easily accessible by operators, maintenance personnel, etc.
Generally speaking, the process controllers 60 store and execute one or more controller applications that implement control strategies using a number of different, independently executed, control modules or blocks. The control modules may each be made up of what are commonly referred to as function blocks, wherein each function block is a part or a subroutine of an overall control routine and operates in conjunction with other function blocks (via communications called links) to implement process control loops within the process plant 10. As is well known, function blocks, which may be objects in an object-oriented programming protocol, typically perform one of an input function, such as that associated with a transmitter, a sensor or other process parameter measurement device, a control function, such as that associated with a control routine that performs PID, fuzzy logic, etc. control, or an output function, which controls the operation of some device, such as a valve, to perform some physical function within the process plant 10. Of course, hybrid and other types of complex function blocks exist, such as model predictive controllers (MPCs), optimizers, etc. It is to be understood that while the Fieldbus protocol and the DeltaV™ system protocol use control modules and function blocks designed and implemented in an object-oriented programming protocol, the control modules may be designed using any desired control programming scheme including, for example, sequential function blocks, ladder logic, etc., and are not limited to being designed using function blocks or any other particular programming technique.
As illustrated in
As illustrated in
Each of one or more of the field devices 64 and 66 may include a memory (not shown) for storing routines such as routines for implementing statistical data collection pertaining to one or more process variables sensed by sensing device and/or routines for abnormal operation detection, which will be described below. Each of one or more of the field devices 64 and 66 may also include a processor (not shown) that executes routines such as routines for implementing statistical data collection and/or routines for abnormal operation detection. Statistical data collection and/or abnormal operation detection need not be implemented by software. Rather, one of ordinary skill in the art will recognize that such systems may be implemented by any combination of software, firmware, and/or hardware within one or more field devices and/or other devices.
As shown in
Generally speaking, the blocks 80 and 82 or sub-elements of these blocks, collect data, such a process variable data, within the device in which they are located and perform statistical processing or analysis on the data for any number of reasons. For example, the block 80, which is illustrated as being associated with a valve, may have a stuck valve detection routine which analyzes the valve process variable data to determine if the valve is in a stuck condition. In addition, the block 80 includes a set of four statistical process monitoring (SPM) blocks or units SPM1-SPM4 which may collect process variable or other data within the valve and perform one or more statistical calculations on the collected data to determine, for example, a mean, a median, a standard deviation, a root-mean-square (RMS), a rate of change, a minimum, a maximum, etc. of the collected data. The specific statistical data generated, nor the method in which it is generated is not critical. Thus, different types of statistical data can be generated in addition to, or instead of, the specific types described above. Additionally, a variety of techniques, including known techniques, can be used to generate such data. The term statistical process monitoring (SPM) block is used herein to describe functionality that performs statistical process monitoring on at least one process variable or other process parameter, and may be performed by any desired software, firmware or hardware within the device or even outside of a device for which data is collected. It will be understood that, because the SPMs are generally located in the devices where the device data is collected, the SPMs can acquire quantitatively more and qualitatively more accurate process variable data. As a result, the SPM blocks are generally capable of determining better statistical calculations with respect to the collected process variable data than a block located outside of the device in which the process variable data is collected.
It is to be understood that although the blocks 80 and 82 are shown to include SPM blocks in
It is to be understood that although the blocks 80 and 82 are shown to include SPM blocks in
The block 82 of
The model 112 includes inputs for receiving values of an independent variable x and a dependent variable y. The model 112 may be trained using a plurality of (x, y) data sets to model the dependent variable y as a function of the independent variable x. The model 112 may include a regression model. The regression model utilizes a function to model the dependent variable y as a function of the independent variable x over some range of x. The regression model may be a linear regression model, or some other regression model. A linear regression model may comprise a first order function of x (e.g., y=a0+a1x), a second order function of x (e.g., y=a0+a1x+a2x2), or a polynomial of some other order p (e.g., y=a0+a1x+a2x2+ . . . +apxp).
After it has been trained, the model 112 may be used to generate a predicted value (yP) of a dependent variable y based on a given input value of the independent variable x. The predicted value of the dependent variable output yP is provided to a deviation detector 116. The deviation detector 116 receives the predicted value of the dependent variable yp as well as the actual value of the dependent variable y corresponding the to the input value of the independent variable x. The deviation detector 116 compares the actual value of the dependent variable y to the predicted value of the dependent value yP to determine whether the value of the dependent variable y varies significantly from the predicted value of the dependent variable yP. If the value of the dependent variable y is significantly different from the predicted value of the dependent variable yP, an abnormal situation may have occurred, is occurring, or may occur in the near future. In these circumstances the deviation detector 116 may generate a deviation indicator indicating the presence of an abnormal situation. In some implementations, the indicator may comprise an alert or alarm.
The abnormal situation prevention system 100 could be implemented wholly or partially in a field device. As just one example, the SPM blocks 104 and 108 could be implemented in a field device 66 and the model 112 and/or the deviation detector 116 could be implemented in the controller 60 or some other device. In one particular implementation, the abnormal situation prevention system 100 could be implemented as a function block, such as a function block to be used in a system that implements the fieldbus protocol. Such a function block may or may not include the SPM blocks 104 and 108. In another implementation, each of at least some of the blocks 104, 108, 112, and 116 may be implemented as a function block.
As described above, many abnormal situation prevention algorithms rely on a linear regression to model a monitored variable as a function of a corresponding load variable. The regression model is fashioned from a set of training data containing a number of corresponding samples of the load and monitored variables measured from the controlled process. The regression model comprises a function or curve that best fits the data in the training set. The regression model may comprise a polynomial function of a specified order p. For example, the regression model may comprise a linear, second order (quadratic), third order (cubic), or fourth-order polynomial function, and so forth.
In general, the higher the order of polynomial function, the better the regression model will fit the training data. However, it is possible that a polynomial regression model may “over-fit” the training data. A model that over-fits the training data fits the points in the training set very well, but may not prove to be particularly accurate at predicting values of the monitored variable during the monitoring phase. At some point there is a trade-off between generating a model that accurately represents the data in the relatively small training set and a model that represents the data in the training set in a more general way that is more accurate in making predictions regarding a much larger data set that will be gathered during the monitoring phase.
For example,
The sixth order polynomial regression 148 shown in
Comparing the first order regression model 128 of
In many abnormal situation prevention systems employing a polynomial regression model the order of the polynomial regression is a user-specified parameter. However, because improper selection of the polynomial order may result in inaccurate predictions, and therefore inaccurate detection of abnormal process events, it may be desirable to automatically select the order of the polynomial regression based on the nature of the data in the training set. As will be described below, a number of different methods may be employed for determining an appropriate order of a polynomial regression for modeling the data in a set of training data. An embodiment of an abnormal situation prevention system may employ substantially any algorithm or method for determining an appropriate order of a polynomial regression model.
A first method that may be employed for determining the order of a polynomial is known as cross-validation. Consider a training set of n (x, y) data samples, where x comprises the independent or load variable and y comprises the dependent or monitored variable. The training set may be divided into two subsets, a training subset comprising nt samples, and a validation subset comprising n, samples. The training samples and the validation samples comprise all of the samples in the training set such that nt+nv=n. The sample in the training subset may be identified as (xt,i, yt,i), i=1, 2, . . . , nt. The samples in the validation subset may be identified as (xv,i, yv,i), i=1, 2, . . . , nv. The samples in the training subset may be used to calculate the polynomial regression model, and the samples in the validation subset may be used to determine how well the calculated regression model will predict future data.
The regression coefficients a0, a1, . . . , ap of a polynomial regression model of the order p may be calculated from the samples in the training subset according to the formula:
Once the coefficients a0, a1, . . . , ap have been calculated for a polynomial regression of order p, the model may be applied to the samples in the validation subset to determine how well the polynomial regression predicts future values of the dependent (monitored) variable y based on the received values of the independent (load) variable x. For every sample in the validation set (xv,i, yv,i), i=1, 2, . . . , nv, a predicted value of the dependent variable ŷv,i may be calculated according to the formula:
ŷ
v,i
=a
0
+a
1
x
v,i
+ . . . a
p
x
v,i
p for i=1, 2 . . . nv.
Furthermore, the prediction error for every data point in the validation subset may be determined by calculating the difference between the predicted value of the dependent variable ŷv,i and the corresponding actual value of the dependent variable yv,i according to the formula:
E
v,i
=ŷ
v,i
−y
v,i.
The overall accuracy of the model may be evaluated based on the mean square error for all data points in the validation subset. The mean square error for all data points in the validation subset may be calculated according to the formula:
According to this first method for determining the optimal order of a polynomial regression, a polynomial regression model may be calculated for every polynomial order from 1 to pmax using the training subset. The corresponding mean square error (MSEv) may be calculated for each polynomial to determine how well each polynomial fits the validation data. The order of the polynomial regression resulting in the smallest mean square error may be selected as the optimal order for the polynomial regression for the abnormal situation prevention application.
In order to improve results and to ensure that all of the available training data are used to determine the optimal order of the regression polynomial, the above described procedure may be repeated several times, each time using different portions of the training data as the training and validation subsets. For example, the k-fold cross-validation technique may be employed to divide the training data into a number of different subsets that may be recombined in various combinations to form the training and validation subsets employed for calculating a polynomial regression model and testing its validity as described above. According to the k-fold cross-validation technique, the training data set may be divided into a number k of approximately equal subsets. The number k may equal, for example, 5 or some other number. For each polynomial regression order p (p=1, 2, . . . , pmax) the cross-validation technique is repeated k times. Each time a different one of the k subsets of the training data is used as the validation subset and the remaining k-1 subsets of the training data are used for calculating the regression model. In this case, the mean square error (MSEv) for a given order polynomial regression is equal to the sum of all k MSEv calculations (i.e., one component from each of the k subsets of validation data). An alternative validation technique that may be employed is leave-one-out cross-validation. This is essentially a special case of the k-fold cross-validation technique in which k equals the total number of samples in the training set (k=n) and the size each subset is equal to 1.
Additional methods for determining the optimal order of a polynomial regression may be based on analytical methods of complexity control. Such methods may be based on minimizing a penalized risk functional of the form:
R
PENALTY(ω)=REMP(ω)+λφ[f(x,ω)]
where
The Ridge function is a popular penalty function that may be readily applied to polynomial regression. Employing the ridge function,
The penalized risk functional for a polynomial regression of order P becomes
This is known as a Ridge Regression. At this point it is still necessary to determine an appropriate value for the complexity control factor λ. This could be done empirically based on numerous trials with a plurality of different training data sets. Once an appropriate value of λ has been determined it will likely apply across many different situations. For a given order polynomial regression, the Ridge Regression can be evaluated numerically. In order to determine the optimal order of a polynomial regression for a particular set of training data, the Ridge Regression must be evaluated for each value of p=1, 2, . . . , pmax. The optimal order p of the polynomial regression will be the value of p for which the penalized risk functional is a minimum.
Yet another method for automatically determining the optimal order of a polynomial regression model is to calculate an R2 value for polynomial regressions of every polynomial order from p=1 to pmax, for a particular training data set. R2 is a measure of how well a polynomial regression fits the data in a training set. The R2 value is calculated by dividing the sum of the square of the difference between the predicted value of the dependent variable and the average of the actual values of the dependent variable for every point in the data set by the sum of the square of the difference between the actual value of the dependent variable and the average value of the dependent variable for every point in the data set. Thus, the R2 value may be calculated according to the formula:
The average value of the dependent variable
And each predicted value of the dependent variable yi is given by:
According to this method, a polynomial regression may be performed for all polynomial orders p=1, 2, . . . , pmax, and a corresponding R2 value may be calculated for each polynomial regression. As mentioned above, R2 is a measure of how well a polynomial regression fits the training data set. If the R2 value is near 1, there is a good fit between the regression curve and the data. If R2 is near 0, there is little or no fit between the regression curve and the data.
The optimal order of the polynomial regression is determined by comparing the R2 values associated with each polynomial regression calculated for each possible polynomial order p=1, 2, . . . , pmax. The optimal order for a polynomial regression model may be identified as the order of the polynomial regression for which polynomial regressions of a higher order show no significant improvement in their corresponding R2 values. “No significant improvement” may be defined in many different ways. For example, there may be no significant improvement between a polynomial regression of an order p and a polynomial regression of the next highest order p+1 if the difference between the R2 value calculated for the polynomial regression of order p and the R2 value of the polynomial regression of order p+1 is less than a predefined threshold
In this case, the order of the lower order polynomial may be selected as the optimal order of the polynomial regression model for the abnormal situation prevention application. Another way to define “no significant improvement” might be to look at the changes in the R2 values from each successive polynomial regression (in the order of increasing values of p) and identifying the step from one polynomial regression to the next in which the difference in R2 values is significantly less (e.g., less than 10%) of the difference in R2 values from the previous two successive polynomial regressions. For example, assume the R2 value calculated for a second order polynomial regression is 0.4, the R2 value of a 3rd order polynomial regression is 0.7, and the R2 value of a 4th order polynomial regression is 0.72. The difference in the R2 value between the third and fourth order polynomial regressions, 0.02, is only 6% of the difference between the R2 values of the 2nd and 3rd order polynomial regressions. Since the difference in the R2 value going from a third order polynomial regression to a fourth order polynomial is less than 10% of the difference in the R2 values going from a 2nd order polynomial regression to a 3rd order polynomial repression, the 3rd order polynomial regression may be considered the optimal order of the polynomial regression for the particular abnormal situation prevention application for which the polynomial regression model is being developed.
Support vector machines are another method by which regression models may be developed for abnormal situation prevention applications. Support vector machines are based on statistical learning theory and were originally developed for classification problems, but may also be extended to regression. Support vector regression (SVR) uses what is known as an ε-insensitive loss function. If the error between a model and a data point is less than some predetermined value ε then the error is considered 0 and the point does not contribute to the calculation of an error penalty. For errors greater than ε, however, the error penalty increases linearly according to the ε-insensitive loss function.
An example of a first order SVR model is shown in
In performing a support vector regression the expression:
is minimized. The term
represents the sum of all input weights. When applied to polynomial regression, these comprise the coefficients a0, a1, . . . ap. The term
is the sum of all of the losses outside the ε-insensitive region (i.e., all of the losses for points outside the ±ε band 212 surrounding the regression curve 210). C is a complexity control factor that determines the relative importance between minimizing the complexity of the model input weights and the total error of points outside the ε-insensitive region. The complexity control factor C must be user specified for each data set. Thus, there are two parameters that must be specified for implementing a support vector regression, ε and C. Methods may be employed for automatically selecting the values of these parameters based on the training data. These methods could be implemented as part of an algorithm for automatically determining the order of a polynomial regression.
SVR is simply another method for calculating the coefficients of a regression model. It does not, in and of itself, determine the optimal order p of a polynomial regression model for modeling a particular data set and making predictions regarding future data. Using SVR, the optimal order p of a polynomial regression may be determined by performing the SVR algorithm for all possible values of p, (p=1, 2, . . . , pmax), in other words, creating a polynomial regression model for every possible polynomial order p from 1 to pmax, The total error
as determined by the ε-insensitive loss function may be evaluated for each polynomial regression model. The polynomial regression of the optimal order p is either the polynomial regression for which the total error is 0, or the polynomial regression beyond which there is no significant improvement in the total error for polynomial regressions of higher orders. Again “no significant improvement” may be defined as situations in which the total error
changes by less than some predetermined amount between successive polynomial regressions, or when the total error improves by less than a certain percentage, and so forth.
In addition to the methods described above, there may be many other methods for automatically determining the optimal order of a polynomial regression that may be applied to abnormal situation prevention in a process plant. By automatically determining the optimal order of a polynomial regression, a user, such as a plant operator or other personnel, need not determine the optimal order and enter the value as a configuration parameter when setting up an abnormal situation prevention system. By determining the order of the polynomial regression automatically it is less likely that an inappropriate value will be selected, and the resulting polynomial regression model will be more likely to accurately predict the value of monitored process variables for purposes of detecting and/or preventing abnormal process events.
The methods for determining the order of a polynomial regression described herein may be implemented on many process control platforms. In fact, the methods may be implemented on substantially any intelligent process control device that has the processing power to implement the computationally intensive methods described above. A non-exclusive list of process control devices in which the various methods may be implemented includes field devices, Foundation™ fieldbus function or transducer blocks, field device interface modules, handheld communicators, controllers and control systems, stand alone software applications and so forth.
Thus, while the present disclosure has been described with reference to specific examples, which are intended to be illustrative only and not to be limiting, it will be apparent to those of ordinary skill in the art that changes, additions or deletions may be made to the disclosed embodiments without departing from the spirit and scope of the disclosure.