This invention relates to the control of the time varying trajectories of key process variables in industrial batch and transitional processes and more particularly to a system and method for the model predictive control of batch processes using variable dynamic models.
A batch process may be defined as a process which transitions from some initial or starting state to some final state over a finite duration of time to produce some product with desirable properties at the final end point. In a pure batch process all materials may be charged at the start, processed through to the final time, and the product then is discharged. A semi-batch process is similar to a batch process, but may have one or more streams of materials being charged to the batch over time and/or one or more streams being discharged over time. The present disclosure refers to both of these simply as batch processes. Batch processes are used in the production of chemicals, polymers, pharmaceuticals, and biological products in batch reactors; the processing of semi-conductor products via lithography, vapor deposition, and etching; the processing of foods in batch vessels; the injection molding of polymers; batch distillation; and batch crystallization.
A transitional process is a process that may be run continuously but undergoes occasional transitions from one operating point to another, such as might occur during start-up (or shut-down) of a process or during product grade transitions whereby the process transitions over a finite period of time from one operating state to another.
The prior art discloses batch processes controlled in industry using simple univariate proportional-integral-derivative (PID) controllers which operate separately on each controlled variable (CV) using a single manipulated variable (MV) to force the controlled variable to track a desired set-point trajectory. This approach has been taken to ensure that industrial processes remain within an acceptable operating window with respect to safety and with respect to the production of high quality product. This control allows for general control where CV's associated with the industrial process are held within a relatively tight operating window. This general control may experience difficulties when there is a significant variance that ensues such as an introduction of a raw material with significantly difference characteristics. There is a need for more advanced control of batch processes especially when the process characteristics change significantly during the course of the batch. There is a further need for multivariable control of trajectories in batch processes to achieve this advanced control.
The model predictive controllers (MPC's) applied in industry (by vendors such as Honeywell, Aspen Technologies, Emerson, Rockwell) are generally based on linear input-output models or linear state space models identified from plant data. A quadratic objective function penalizing the deviation of the CV's from their set-points and penalizing excessive manipulated variable (MV) movement (move suppression) is optimized on-line using Quadratic Programming (QP), subject to various operating and safety constraints on the variables. The result of the optimization is a set of MV trajectories over a specified control horizon. Only the specified MV solutions to be implemented for the next time interval are then usually written out as set points to be implemented by the plant control layer (such as a Distributed Control System (DCS) or Programmable Logic Controller (PLC)). The MPC algorithm is then run again at the next control interval and new MV trajectories computed, and the process repeated. The linear models used generally relate only the set of manipulated variables to the set of controlled variables and ignore other measured process variables (xme) that are not directly controlled or manipulated. However, some MPC systems do allow explicitly measured disturbance variables to be included in the model, provided explicit models for their effects on the controlled variables are available.
Nonlinear MPC's have recently been developed and are available from several control vendors (see additional related information accounting for the nonlinear behavior of the process through the use of the nonlinear model). Some of them, based on fundamental, non-linear models of the process are, or could be, applicable to the multivariable control of batch processes.
The disadvantages are that (i) a good theoretical model of the process is necessary and such models are very time consuming and costly to develop; (ii) the theoretical models often do not include a description of the behavior of all of the measured variables available on the system (e.g. agitator power) that are useful in providing information on the disturbances in the system; and (iii) the nonlinear MPC's are very computationally intensive and the optimization may not be able to be completed in the required time interval, especially for a short-duration batch process.
Prior art empirical modeling methods have used a form of regression to build models relating past inputs (manipulated variables (u) and sometimes measured disturbance variables) to the future controlled variables (y). These methods may be satisfactory if the process is a continuous one operating about a fixed point so that the model is valid for every time point in the past and the future, ie. the model procedure uses the data to find one fixed model that is valid at all times.
In batch and transitional processes the process is time varying throughout the batch or transition and a fixed model may not be adequate. Therefore, there is a need for a MPC formulation aimed at transitions in continuous processes or batch processes that is based on nonlinear theoretical models (nonlinear differential equation models built for fundamental mass and energy balance equations) that can model this nonlinear time varying behavior.
There is also a need for empirical latent variable models for batch processes, which are built using data collected from the process and are able to model the time varying, nonlinear behavior of these batch and transitional processes. Thus they may accomplish what the fundamental differential equation models do but with all the ease of model building and computational speed advantages of the linear regression models. These models may also have an advantage over other regression-based models in this problem because they are models that may extract all the useful information from the data into very low dimensional spaces (ie. into latent variables) thereby giving very low dimensional, parsimonious models that are not over-parameterized and thus are more robust (less sensitive to slight variations in the data used to build them). This also allows these models to use all the measured variables available and not be forced to use just the manipulated inputs (variables) (MV's) and controlled variables (CV's). The result may be a more accurate prediction of the future behavior of the process.
U.S. Pat. No. 6,826,521 issued to Hess et al. discloses the standard practice for advanced industrial process control in processes of the type described above is to use linear, multivariable, model predictive controller (MPC) software. Other prior art publications further expand on this practice such as: the Setpoint, Inc. product literature dated 1993 entitled “SMC-Idcom: A State-of-the-Art Multivariable Predictive Controller”; the DMC Corp. product literature dated 1994 entitled “DMC.TM.: Technology Overview”; the Honeywell Inc. product literature dated 1995 entitled “RMPCT Concepts Reference”; and Garcia, C. E. and Morshedi, A. M. (1986), “Quadratic Programming Solution of Dynamic Matrix Control (QDMC)”, Chem. Eng. Commun. 46: 73-87. The typical MPC software allows for model scheduling (i.e. changing the model gains and/or dynamics) to improve control performance when operating on a nonlinear and/or time-varying process. The controller uses new models that are generally calculated in an off-line mode, or may be calculated by an adaptive algorithm that uses recent operating data.
The prior art includes many examples of the use of model-based control systems employing both linear and nonlinear methodologies for control. Prior art MPCs refers to linear controllers, see for example: U.S. Pat. No. 4,349,869 to Prett et al.; U.S. Pat. No. 4,616,308 to Morshedi et al.; U.S. Pat. No. 5,351,184 to Lu et al.; and U.S. Pat. No. 5,572,420 to Lu. To handle nonlinear, time-varying processes, these controllers may use gain scheduling, adaptive model estimation, or robust controller tuning. These approaches typically encounter implementation problems and/or performance degradation for the types of processes and operating conditions described previously.
There have been a few patents issued for nonlinear model-based control methodologies. In particular, U.S. Pat. No. 5,260,865 to Beauford et al. describes a nonlinear model-based control strategy for a distillation process which employs a nonlinear model to compute liquid and vapor flow rates required for composition control. Sanner and Slotine (U.S. Pat. No. 5,268,834) employ a neural network together with other nonlinear control strategies to provide adaptive control of a plant. Bartusiak and Fontaine (U.S. Pat. No. 5,682,309) developed a reference synthesis technique in which the controller attempts to make a nonlinear plant follow a specified reference trajectory. U.S. Pat. No. 5,740,033 to Wassick et al. describes an MPC that employs a real-time executive sequencer and an interactive modeler to find the optimized set of control changes for a nonlinear process. Large, nonlinear control problems are difficult to solve in an on-line operating environment. The solver must be fast and robust.
One prior art publication, Flores-Cerillo, J. and J. F. MacGregor, (2005) “Latent variable MPC for Trajectory Tracking in Batch Processes, J. Process Control, 15, 651-663, discloses an industrial applications on MPC's for batch processes based on Latent Variable methods. That publication is related to one of the algorithms in this methodology in that it uses a simpler version of the observation-wise unfolding with time-lagging approach.
However, there is a need for the control method and related algorithm to eliminate errors in the handling of external disturbances throughout the batch. There is a further need for the control algorithm to allow for the use of multiple models, one for each different phase of the batch. In one aspect of the present invention, the Model Predictive Controllers based on Latent Variable Models (LVM) may allow one to achieve essentially the equivalent control of non-linear batch processes as is possible with the use of non-linear MPC based on non-linear fundamental models of the process. However it may do so with linear LVM's that allow for fast on-line solution and that are easily identified from data collected from the industrial batch process. The MPC calculations may also be computed very rapidly on-line with very modest solvers, thereby making it a powerful practical solution to batch MPC.
There is a further need to differentiate between high level control and low level control in the control of a batch process. High level control refers to controlling the process from the perspective of meeting specific performance targets, measured upon completion. The process may be analyzed from the perspective of whether it will result in performance within a specific window based on data upstream, and if not then making midpoint adjustments. Usually this type of control is made possible by extracting a wealth of information based on timed histories.
Low level control refers to controlling the timed history of factors such as temperature, pressure etc., and tracking these trajectories. Prior art discloses proportional-integral-derivative (PID) controls where control may work well in some period but not in others. Prior art discloses MPC based on certain types of linear models that are applied to continuous processes. Prior art also discloses non-linear model predictive control based on theoretical models applied to batch processes. There is a need for a much simpler approach that will also enable tight control over the trajectories.
There is further a need to have a predictive model with wide applicability to the control of variables such as temperatures, pressures and concentrations in batch processes for the manufacture of chemicals, pharmaceuticals, processed foods, semi-conductors, etc.
The invention will be better understood and objects of the invention will become apparent when consideration is given to the following detailed description thereof. Such description makes reference to the annexed drawings wherein:
In the drawings, embodiments of the invention are illustrated by way of example. It is to be expressly understood that the description and drawings are only for the purpose of illustration and as an aid to understanding, and are not intended as a definition of the limits of the invention.
The present invention provides a method for process modeling and control, and a software implementation of this method which includes an empirically identified latent variable model of a batch or transitional process, and, based on that model, a predictor for the future trajectories and an optimal control method (and related algorithm, in one implementation thereof a latent variable model predictive control, LV-MPC) for trajectory tracking of specified variables. This model may be based on Latent Variable models (linear or non-linear) that efficiently model the time varying non-linear behavior of the batch evolution, that easily incorporate into the model all measured variables, that are easily identified from process data, and that require modest computational effort and computing time for their use in the prediction of future trajectories and for the computation of the controls. These advantages make this approach uniquely suited for real-time application to industrial batch and transitional processes.
In one implementation of the present invention LV-MPC may consist of a model predictive controller that is designed to manipulate and/or control process variables in the batch or transitional process. The controller interfaces with or may be included in commercially available process control systems that are known in the art such as Emerson, Aspen Technologies, Honeywell, Rockwell Automation and others. The controller may also be implemented or included as a toolkit or hardware component designed to interface with one or more computers.
In one aspect of the present invention, a computer implemented method for modeling and controlling batch or transitional processes comprises the steps of: (a) collecting, or initiating the collection of measurements on a plurality of process variables; (b) creating, or initiating the creation of, a latent variable model predictive controller based on the collected measurements; and (c) applying or initiating the application of, the model predictive controller to predict and control at least one of the process variables to track a desired trajectory, by operation of at least one computer including one or more computer processors.
Without limiting application of the invention, the method has a wide range of potential industrial applications in the batch manufacturing industry. Some examples of the present invention may be but are not limited to: (i) the control of temperature, pressure and concentration trajectories in the batch manufacture of chemicals and polymers, (ii) the control of pH, dissolved oxygen, and nutrient concentration trajectories in the batch manufacture of biological materials, (iii) the control of temperature, supersaturation and particle growth trajectories in batch crystallization, (iv) the control of temperature, mixing intensity and viscosity trajectories in food processing, (v) the control of partial pressure and temperature trajectories in vapor phase deposition or etching in the semi-conductor industry.
The present invention also provides for applying the latent variable model predictive controller imputing unmeasured further values of at least one process variable of batch or transitional processes using a missing data imputation method for a latent variable model (such as Projection to the Model Plane (PMP), Trimmed Score Regression (TSR), and Conditional Mean Replacement (CMR) methods), or some variations of these including those which weight the data in some manner to improve the imputation and the conditioning of the imputation.
In another aspect of the present invention, the method uses Latent Variable (LV) Models which may be based on one of the latent variable estimation methods such as Principal Component Analysis (PCA), Partial Least Squares (PLS), Independent Component Analysis (ICA), Reduced Rank Regression (RRR), Canonical Correlation Analysis (CCA) or other subspace methods related to these.
In another aspect of the present invention, the latent variable models (LVM) may be formulated in different possible ways, but preferably it will depend upon the particular batch or transitional process and problem being considered. These possibilities include: (i) a LVM based on unfolding the batch data in a batch-wise manner (i.e. with all the measurements for a batch in one row of the matrix); (ii) a LVM based on unfolding the data in a time-lagged observation-wise manner (i.e. with each row of the data matrix containing a vector of lagged observations over a finite time period on all the variables); (iii) a LVM based on a combination of the above two unfolding methods; and (iv) multiple LVM's whereby different LV models, based on any of the above unfolding methods, are used to model different phases of the batches. Non-linear versions of these Latent Variable models may also be used, but appear to provide no or modest improvement over the linear algorithms and lead to additional computational effort and complexity of the computer program of the present invention.
In a further aspect of the present invention, the models may be identified from data collected from the batch or transitional process while it is in operation under closed-loop control. Data from both historical batches operating with the existing controllers active and from some batches with designed experimental variations added on top of the manipulated variables are used for the model identification.
In an implementation of the present invention, model predictive controllers (MPCs) may be based on a quadratic optimization solution (quadratic programming if constraints are present or unconstrained least squares if there are no constraints) in either the space of the manipulated variables or in the space of the latent variables. In the latter case the manipulated variables may then be calculated from the optimized values of the latent variables using the latent variable model.
In a further aspect of the present invention, the predictions of the future trajectories of the process variables and the calculation of the model predictive controller are, in some cases, formulated to use a growing time history of past data and a shrinking forward horizon for future controls as time progresses through the batch from the start to the end.
The present invention also provides for a system for modelling and controlling batch or transitional processes comprising: (a) one or more computers including or being linked to a computer program, the computer program including computer instructions which when made available to the one or more computers, is operable to provide: (i) a control layer for collecting or initiating the collection of measurements on a plurality of process variables, and further for creating or initiating the creation of a latent variable model predictive controller based on the collected measurements, wherein the control layer is operable to apply or initiate the application of the model predictive controller for predicting and controlling at least one of the process variables to track a desired trajectory.
The present invention further provides for a computer program comprising computer instructions, which when made available to one or more computers, are operable to define on the one or more computers: (a) a control layer configured to collect or initiate the collection of measurements on a plurality of process variables, and to create or initiate the creation of a latent variable model predictive controller based on the collected measurements, wherein the control layer is operable to apply or initiate the application of the model predictive controller predict and control at least one of the process variables to track a desired trajectory.
The Latent Variable MPC technology discussed in the present invention has several unique aspects that provide it with advantages over both the linear and the nonlinear MPC technologies described above. In one aspect of the present invention, it may use efficient, low dimensional Latent Variable (LV) models to describe the time evolution of all the variables in the batch or transitional process. This modeling approach offers several advantages in the control of these processes as:
In another aspect of the present invention, the LV-MPC technology may provide the following advantages over alternative nonlinear MPC methods when applied to batch or transitional processes such as:
A schematic of the information and calculation flow of the method of the present invention, in one embodiment thereof, is illustrated in
In one implementation of the present invention the general workflow may consist of building the model, developing the model predictive controllers and implementing the technology. The first step may provide for data from a plurality of past batches operated under normal operating conditions with an existing control system being extracted from the data historian. The data may consist of the time histories on all measured variables, sampled every second or so, depending upon the sampling interval to be used for control: often between 1 to 10 seconds but it may depend on how fast control is needed. It will be understood that in a batch process that is very fast then faster sampling is needed and vice versa.
In a further aspect of the present invention, some additional batches are run with some designed experimental signals (random binary signals (ie. either high or low values) added on top of the manipulated variable signals being sent to the control actuators (eg. valves for flow control). Other binary signals and control actuators are considered. These signals are added throughout the duration of the batch. Data from these designed experimental batches may help to give a better model. The data collected from all these batches may be combined and then unfolded in one of the ways explained in further detail below. A latent variable model may then be built using these data. The latent variable model used may be based on a Principal Component Analysis or on other latent variable methods.
In another aspect of the present invention, the PCA or other model may then be used in the manner described below to predict the future behavior of new batches and to compute the optimal LV-MPC control actions over the future. The control may then be performed on a new batch using the data flow. The control actions computed for application at the current time interval are then sent to the different layers of the implemented method to the final control elements (eg. valves). This step may then be repeated at each time interval.
In one embodiment of the present invention the first step consists of collecting the data (100). Once the data is collected it may optionally be preprocessed (101) which may include mean centering and scaling the data. Once the data is preprocessed, there may be outlier and abnormal situation detection (103). Following that step, prediction of future trajectories (103) may be computed followed by optimal control calculations (104). After the optimal control calculations (104) have been completed output of control actions to actuators (105) may be instituted.
The present invention contemplates various formulations of the latent variable models, as particular embodiment thereof may have certain advantages and disadvantages. The choice of the LVM formulation to use may depend upon the particular features of the batch process involved. Possible formulations of the LVM's arise from different ways of unfolding the data arrays collected from the batch process: (i) batch-wise unfolding of the data (as illustrated in
Data Requirements and Rearrangement of the Data
In one aspect of the present invention, the training data needed to build the LV-MPC controller may consist of data from historical batches executed under standard manufacturing conditions with existing controllers in operation, augmented with data from some batches in which designed experiments have been implemented to obtain causal information between the manipulated and controlled variables. One approach may be to add a random binary sequence (RBS) dither to each manipulated variable output under closed-loop conditions. The choice of closed-loop identification may be used for minimizing disruption to the batch recipe.
In one implementation of the present invention, the identification approach may be sequential in nature. A preliminary data set may be collected using the methods described above. This may enable the first-generation model to be estimated and the corresponding LV-MPC system commissioned. Once commissioned, data from subsequent batches, under the new control regime, may be collected and added to the training data. The model may then be re-estimated and a revised controller commissioned. This iterative modeling process may be repeated several times.
One form of batch process modeling was presented and popularized by prior art publications such as: Nomikos, P. and MacGregor, J. F., (1994), “Monitoring of Batch Processes Using Multi-Way Principal Components Analysis”, Amer. Inst. Chem. Eng. J., 40, 1361-1375; and P. Nomikos and J. F. MacGregor, (1995). “Multi-Way Partial Least Squares in Monitoring Batch Processes”, Chemometrics & Intell. Lab. Systems, 30, 97-108. This method has seen many industrial applications for process data analysis and process monitoring.
One advantage of this invention may be the use of this modeling approach for the control of batch trajectories. The models may be identified from data collected on past batches under feedback control and data collected from some batches in which a designed input sequence is superimposed upon the existing feedback controller output. The batch data (X) may be unfolded as illustrated in
In one implementation of latent variable models built on this batch-wise unfolded data, the advantage is provided of enhanced ability to model the nonlinear behavior of batch processes. By subtracting the mean or some reference trajectory from the raw measurements on each of the variables in the preprocessing step (101 in
In one example of the present invention, the first step in formulating the Controller may be to build a Latent Variable model on training data from the batch process. Any one of the latent variable methods may be used, but PCA may be preferable and is illustrated here. The data may be arranged into the matrix, X. Each row of X, the row vector, xT, contains all the relevant data from a particular batch, arranged as,
xT=[ζ1T, ζ2T, . . . , ζkT, . . . , ζKT] (1)
where,
ζjT=└xmeTycvTyspTucT┘j (2)
and,
subscript j=the jth time point
Then,
The matrix, X, has dimensions I×S, where I is the number of batches and S=M×K, M being the number of variables and K the total number of time points. In this example, each principal component of the PCA model is captured by a loading vector, p, of length S.
In another aspect of the present invention the time-lagged, observation-wise unfolding may be used. This form of unfolding of the batch data (
A PCA model on this unfolded data may provide a fixed (non time-varying) model of the batch over each window of time lags used. It may therefore not model time-varying behavior within this window, but by using multiple time windows it may provide time-varying models between windows. This approach may have the advantage of requiring data from fewer batch runs for identifying the latent variable model, since a simpler, fixed (non-time varying) model is being identified within each phase.
In another alternative implementation of the present invention a combination of batch-wise and observation-wise unfolding may be used. This form of unfolding of the batch data unfolds the data as in batch-wise unfolding (
A latent variable model built on this unfolded data matrix will have some of the benefits of both of the two previous approaches. In particular, it may allow the model to capture some of the nonlinearities within each phase (an advantage of batch-wise unfolding) and also require less batch data for the model identification (advantage of observation-wise, time lagged unfolding).
In one example of the present invention, the unfolded matrices of
X=TP
T
+E (6)
T=XP (7)
Where T is a (a×A) matrix (A≦b) of latent variable scores that summarizes the major differences among the batch trajectories, and P is a (b×A) matrix of loadings that show how the latent variable scores are related to the trajectory data (X). The score values of the A latent variables for each batch summarize the time varying behavior of its trajectories relative to all the other batches.
Various Latent Variable regression methods including Partial Least Squares (PLS), Redundancy Analysis (RA) (sometimes called Reduced Rank Regression (RRR)), and Canonical Correlation Analysis (CCA) may be used to estimate a latent variable space as disclosed in prior art A. J. Burnham, R. Viveros-Aquilera and J. F. MacGregor, 1996. “Objective Function Frameworks for Comparing Latent Variable Methods for Multivariate Regression”, J. Chemometrics, 10, 31-46 (Burnham et al.). These latent variable regression methods differ primarily by breaking out the variables to be predicted (Y) from the others (X) and have the latent variable model structure:
X=TP
T
+E (8)
Y=TC
T
+F (9)
T=XW* (10)
The latent variable regression models differ only by slight variations in the objective function used to estimate the reduced set of latent variables (Burnham et al.). Any of them may be used to estimate a suitable set of latent variables for control and for prediction and then be used in the methods outlined in this present invention.
In one implementation of the present invention, latent variable modeling of the batch using the batch-wise unfolding approach (
In one aspect of the present invention, the multiphase modeling approach is based on identifying multiple phases within the batches, partitioning of the dataset according to these phases, considering overlap between two adjacent phases and building latent variable models for each phase. The proper selection of the number and location of the phases may be more important in the time-lagged observation-wise unfolded approach where it is important to select phases during which the covariance structure of the data is reasonably constant. In the batch-wise unfolded approaches non-linear time varying covariance structures are already accounted for and hence the location of phases may be less important and the number of phases may be selected primarily to simply reduce the phase size so as to improve the local predictability of the models, minimize the ill-conditioning, and reducing the computation time.
Once the phases have been determined, latent variable models are built for each phase and MPC may be applied in each phase based on the model for that phase. To build the latent variable models in each phase it is desirable to use not only the data from that phase, but also overlapping data between any adjacent phases to guarantee the smooth switching of models between phases (bumpless transfer) during the trajectory tracking control. The delineation of phases and the overlapping of data between phases according to one aspect of the present invention are illustrated in
One may use data over as many sample times as the selected model future horizon (fh) from the next phase and the data of as many sample times as the selected model past horizon (ph) from the previous phase as shown in
Other models besides PCA may also be used. Independent Component Analysis may be used similarly to PCA. Some of the other latent variable regression based methods may alter things somewhat. Instead of putting all the variables together in one X matrix one may break out the variables to be predicted in the future (ie. the future controlled variables (y) and the other future measured variables xme) into a Y matrix. PLS, CCA and RRR (LV regression methods) may still find a LV model as equations (8), (9) and (10) as shown above and the prediction and control calculations may have to be modified accordingly, but may result in very similar equations and would optimize the same objective function. The prediction accuracies of other models may also be similar. It will be understood that the essential issue is not the specific type of LV modeling method used, but the use of one of these efficient estimation methods to get a reduced dimensional latent variable model.
Prediction plays an important role in the MPC algorithm since the optimization problem embedded in the MPC needs future prediction of the outputs up to the prediction horizon. The prediction method depends on the type of model being utilized in the MPC. For most linear and nonlinear dynamic models used for existing MPC algorithms, the future prediction may be calculated using integration of the dynamic model over the prediction horizon (fh) and adapting it assuming a simple random walk type disturbance model on the controlled variable (CV).
As mentioned above the LV models may be able to easily incorporate all the measured variables throughout the batch (not just manipulated variables, MV's, and CV's) without being over-parameterized. These measured variables (eg. agitator power, coolant temperature, other temperature/pressure/force measurements around the process, etc) contain within them valuable information on disturbances that may affect the future behavior of the important variables being controlled (y's=CV's). Latent variable models then use not only the manipulated variables (u) to predict the future controlled variables (y) but all of these ancillary variables that contain important information on the future. Then during on-line LV-MPC calculations for a new batch the model may use efficient missing data imputation methods to simultaneously use all this information to predict the future in any batch phase.
Several missing data imputation methods have been proposed for latent variable models in the literature. For batch processes the aim is to predict the final latent variable scores at the end of any phase and then, from the PCA model of the X-space (Equation (6)) or the latent variable regression model of the Y-space (Equation (9)), the values of all the missing trajectories over the remainder of the phase can be estimated. Nelson et al. (1996) presented an analysis of several methods including the Single Component Projection (SCP) method, the Projection to the Model Plane (PMP) method, and the Conditional Mean Replacement (CMR) method. Arteaga, F., Ferrer A., (2002), discussed the methods proposed by Nelson et al. (1996) as well as some other methods including a Trimmed Score Regression (TSR) method. Since the PMP and TSR are the methods found to have greatest promise in this study, they are briefly discussed below but it should be understood that the application of the present invention is not limited to these models.
The PMP method was used by Nomikos and MacGregor, in both 1994 and 1995 prior art publications above, in their original batch monitoring methodology. It projects the new vector of observations with missing data onto the plane defined by the latent variable model (Equations (6) and (7)) to obtain an estimate of the missing part of the data vector that is consistent with the model.
In one example of the present invention, a new observation (z) may be divided in two parts as shown in
zT=[z*Tz#T] (11)
Where, z* are the known data and z# are the missing data. For the batch process analysis, z* corresponds to the past data and z# corresponds to the future data. The loading matrix may also be divided into two parts in the same way as z.
P
T
=[P*
T
P
#T] (12)
Thus, the PCA model can be partitioned as:
where τ is the vector of latent variable scores, τT=[t1, t2, tA]. If the known part of the data is used for score estimation, the following relation is obtained (Neslon et al., 1996):
τ=(P*1:ATP*1:A)−1P*1:ATz* (14)
Subscript 1:A means that A principal components are considered in the PCA model. The estimates of the trajectories of all the variables for the remainder of the batch phase (z#) are then obtained from equation (13).
In another alternative of the present invention TSR may be used. For this method the same partitioning may be applied to the data. The score may be computed based on the assumption that the known part of the data is the complete data in the observation. Thus,
τ*=P*Tz* (15)
And the real scores are calculated by regressing the real scores (τ) on the fake scores (τ*). Finally, the score estimation formula is (Arteaga and Ferrer, 2002):
τ=Θ1:AP1:A*TP1:A*(P1:A*TP1:Q*Θ1:QP1:Q*TP1:A*)−1P1:A*Tz* (16)
Where, Θ is the covariance matrix of the scores (Θ=(TTT)/I)) in the PCA model, where I is the total number of batches in the dataset and T is the matrix of scores from all batches. The number of scores considered in Θ(Q) can be more than or equal to A.
This section illustrates several variations of the control methodology based on different combinations of latent variable models, obtained from different types of unfolding, and on solutions of the optimization problem in different variable spaces (solution in the latent variable space and solution directly in the manipulated variable space). Other variations of the proposed methods arising from using combined batch-wise and observation-wise unfolding, different variants of the missing data imputation methods, or different latent variable estimation approaches could be easily made by a person skilled in the art.
Control in the LV Space
In one aspect of the present invention, a multi-phase PCA model is developed based on a batch-wise unfolded dataset. The objective of the control may be to run a new batch to track certain trajectories and compensate for the effects of disturbances entering the batch. Assume a new batch is currently at sample time k. For any phase the information of each sample time is included in ζk as defined by:
ζkT=[xme,kT,ycv,kT,uc,kT,ysp,kT] (17)
Where xme, ycv, uc, and ysp are measured variables, controlled variables, manipulated variables, and set point variables, respectively. The existing information in the current batch phase can be separated as follows according to whether it is known past or present data or unknown future values:
where, xP1T=(ζjT|j=1:k−1, xme,kT, ycv,kT, ysp,kT) and xP2T=(ysp,jT|j=k+1:K) are vectors of the known information at time k, while xf1T=(uc,kT, uc,jT|j=k+1:k+K−1, xme,jT|j=k+1:k+K) and xf2T=(ycv,jT|j=k+1:K) are future data that are not known yet and K is the total duration of the batch or phase. Separating the loading vectors in the corresponding manner to the division of x, we have:
P=[PP1;PP2;Pf1;Pf2] (19)
Note that since the algorithm is presented for online application, all of the variables mentioned in Equations (17) to (19) change over time and must have an index “k”. However, for the sake of brevity the index “k” may be omitted in the following derivations.
Under MPC control at the current time (k), the phase is not complete and the projected scores at the end of the batch assuming no further control moves are to be taken, must be estimated from only the data available up to and including the time step k using a missing data imputation method. A correction to the score (Δ{circumflex over (τ)}k) is then estimated by optimizing the MPC objective function and the corrected final score can be calculated as:
τkc={circumflex over (τ)}k+Δ{circumflex over (τ)}k (20)
The objective function of the optimal control can be represented as follows:
The first term penalizes the deviation of the controlled variables (ŷcv) from their setpoint trajectories (ysp), while the second term is a move suppression term that penalizes the amount of movement allowed in the manipulated variables computed by the controller (ûf). Define:
xpT=[xP1TxP2T],
xfT=[xf1Txf2T]
PpT=[PP1TPP2T],
PfT=[Pf1TPf2T] (22)
From the PCA model (equation (7):
{circumflex over (τ)}k+Δ{circumflex over (τ)}k=PpT{circumflex over (x)}p,k+PfT{circumflex over (x)}f,kPfT{circumflex over (x)}f,k={circumflex over (τ)}k+Δ{circumflex over (τ)}k−PpT{circumflex over (x)}p,k (23)
Then, in this example of the present invention, using the same analysis presented in Flores-Cerrillo and MacGregor (2004) the future values of the variables may be obtained to be consistent with the past information in a PLS model. This analysis may be modified to be used with a PCA or other model. Following this consideration and after some rearrangements, the output and input variables can be written in terms of scores of the batch:
{circumflex over (x)}
f2
=P
f2(PfTPf)−1({circumflex over (τ)}k+Δ{circumflex over (τ)}k−PpTxp,k) (24a)
u
f
=P
uf({circumflex over (τ)}k+Δ{circumflex over (τ)}k) (24b)
Substituting Equations (24a) and (24b) into the objective function, Equation (21), and solving the optimization problem, we get the optimal correction to the scores which can be used along with Equation (20) to get the optimal score of the batch and then optimal ûf. If there is no constraint, it is straightforward to find the analytical solution for the above optimal problem. If there are linear inequality constraints that must be respected on any of the variables, the optimization can be posed as a general quadratic programming problem and solved subject to the constraints. Constraints on the manipulated variables may be projected into the latent variable space and explicitly considered in this space as functions of the latent variable scores.
Control in the Manipulated Variable Space
In another alternative of the present invention control can be in the manipulated variable space. The data for the current batch may be partitioned in a more explicit way with respect to the manipulated variable:
Where PH and CH are (Model) Prediction and Control horizons, respectively xP1T=(ζjT|j=1:k−1,xme,kT,ycv,kT,ysp,kT), xP2T−(ysp,jT|j=k+1:k+PH), xf1T=(xme,jT|j=k+1:k+PH), xf2T=(ycv,jT|j=k+1:k+PH), and uf=(uc,kT,uc,jT|j=k+1:k+CH). A key point of this method is to formulate the problem in terms of future manipulated variables, uf. At the sample time k, the known data are xP1, xP2, the unknown data are xf1, xf2, and the future decision variable is uf. The term uf will be determined through the optimization process. As a result, to develop the control algorithm, the score estimation method has to be defined first. The method used in this study is the TSR method but use of other methods is contemplated, Equation (16).
Once the scores are estimated the future output variables may be estimated as well:
{circumflex over (x)}f2=Pf2{circumflex over (τ)}k (26)
One may use the Equation (26) in the optimization problem, Equation (21), with the modification of considering uf as the decision variable instead of Δ{circumflex over (τ)}, to obtain the optimal uf as the solution to the optimization problem. If there are hard constraints that must be respected on any of the variables, the optimization can be posed as a general quadratic programming problem and solved subject to the constraints.
Control Based on a PCA Latent Variable Model Using Observation-Wise Unfolded Data with Time-Lagging
In another embodiment of the present invention, the control algorithm for a PCA model based on time-lagged observation-wise unfolded data (
In another alternative of the present invention, the control based on combined batch-wise and observation-wise unfolded data (
In one implementation of the present invention, the latent variable model predictive control (LV-MPC) technology for transitional processes (such as batch processes or continuous processes during grade transitions, startups and shutdowns) may be readily implemented within the existing hardware and software environments provided by many of the existing control vendors, such as Emerson, Aspen Technologies, Honeywell, Rockwell Automation and others. ProSensus also has access to its own in-house software platform for this purpose.
In one embodiment of the present invention as illustrated in
Alternatively, the technology of the present invention may be implemented as a computer program, operable on a computer to implement the method of the present invention, and constitute a computer system operable to provide the functionality described in the present invention. The computer system or computer program of the present invention may be configured to interoperate the third party hardware or software systems described above.
Aziz, N., Hussain, M. A., Mujtaba, I. M., (2000), Performance of different types of controllers in tracking optimal temperature profiles in batch reactors, Comput. Chem. Eng. 24, 1069-1075, presented a nonlinear model of a batch reactor. This case study was originally proposed by Cott, B. J., Macchietto, S., (1989), Temperature control of exothermic batch reactors using generic model control, Ind. Eng. Chem. Res. 28 (8), 1177-1184, as a case study for a temperature control problem in a batch reactor. The schematic figure of the reactor is shown in
The objective is to control the reactor (800) temperature to track a desired trajectory and simultaneously reject disturbances entering the process. The manipulated variable is the set point of the jacket (801) temperature (
The following figures show the performance of the proposed variations of the methodology used for both the tracking of a complex trajectory and for non-stationary disturbance rejection where a random walk disturbance has been added to the calculated reactor temperature.
a. Control Studies Using PCA Models Based on Batch-Wise Unfolded Data:
These figures illustrate the ability of these algorithms to simultaneously track the temperature set-point trajectory and compensate for non-stationary disturbances.
b. Control Studies Using PCA Models Based on Time-Lagged Observation-Wise Unfolded Data
The present invention may have a wide range of potential industrial applications in the batch manufacturing industry. To illustrate the nature and range of some possible industrial applications the following examples are presented, but are not intended to be exhaustive:
The following are several additional example applications from other batch material processing industries that are very similar in nature to the application described above:
Other examples may be as follows:
The following are examples of where the proposed LV-MPC may be applied to transitional operation of continuous processes:
This application claims the benefit of U.S. Patent Application 61/052,992 filed May 13, 2008
Number | Date | Country | |
---|---|---|---|
61052992 | May 2008 | US |