This invention relates to optimizing commercial ethylene oxide (EO) production in a reactor and more particularly to the acquiring of real time operating data of an ethylene oxide reaction process and generating recommendation by optimizing the process conditions to improve reactor performance.
Business environment has encountered a drastic transformation in this decade due to stiff global competition. Chemical process industries are facing a perpetual shrink in profit margin due to the rise in competition in the turbulent market, thanks to globalization. Technological innovation to improve process and energy efficiency, yield maximization, and environmental impact reduction are the only path to mitigate these issues successfully. Chemical companies across the globe are striving to find new innovative ways to reduce production cost and maximize profit. Application of artificial intelligence-based techniques to generate value from large amount of plant operating data by data mining and knowledge discovery is one of the most attractive innovative ways to explore. In search of innovative ways to generate more profit, chemical reactor attracts the focus of plant managers and researchers. In chemical plants the reactor is the only large piece of equipment that converts raw materials into finished goods and adds real value. In this regard, reactor optimization has an incredible potential impact on overall plant profitability. [1] To optimize the commercial reactor, it is required to model complex systems of industrial chemical reactions as a first step. Industrial chemical reactions involved complicated reaction kinetics and thermodynamics. Building a credible phenomenological model for commercial reactors is considered as a very time-consuming difficult task as it demands a deep knowledge of reaction kinetics and understanding of industrial heterogeneous catalytic behavior characterized by mass diffusion, catalyst deactivation, etc. Lack of sufficient knowledge in chemical reaction kinetics actually hinders the optimization of the reactor. In most chemical plants, the reactors remain as black-box, and operation engineers do not try experimentation in running plants due to safety and reliability reasons. This drives non-optimum operation of industrial reactors, which leads to loss of huge money and greatly affects the profitability of the plant. However, a small increment of catalyst selectivity and reaction yield has large potential to significantly impact the raw material consumption and plant profitability in large-scale operation.
Data is considered as new oil in current time, and certainly, artificial intelligence based data driven modeling techniques like Artificial neural network (ANN), support vector machine (SVM), genetic programing (GP) etc. are considered new IC engine (internal combustion engine) of current age. As huge reactor inlet and outlet historic plant operation data are collected and stored every minute for most chemical plants, the challenge is to utilize this wealth of data to make more profit. Modelling and optimization of commercial Ethylene oxide (EO) reactor and also of ethylene glycol production has not yet received sufficient attention in the prior art.
Due to poor understanding and complexity of multiphase catalytic reactions occurring in commercial ethylene oxide reactor, a credible first principle-based model is not available in literature. Availability of large amount of reactor operating parameters data of commercial plants make data driven modelling techniques as a viable alternative approach. The current invention presents an effort to utilize the large amount of process data to find an online EO reactor digital twin framework to convert the information from this data to maximize profit.
EO reactors use silver based catalyst and Ethylene Glycol plant economics greatly depends on catalyst selectivity and activity. Catalyst's selectivity and activity is very sensitive to chloride and other process parameters. It is very crucial to run the EO reactor at optimum operating conditions all the time to maximize the catalyst performance. A minor change in chloride can reduce the catalyst selectivity drastically. Plant engineers could not always detect the optimum chloride in real time basis and thus loses 0.5-1% selectivity on an average throughout the catalyst life. This leads to erosion of potential profits in terms of million USD per year for a moderate size glycol plant. Due to the above reasons, full potential of catalyst performance could not be realized in actual plant. This resulted in huge monetary loss of client plants and catalyst manufacturer.
To tackle the above challenges U.S. Pat. No. 9,892,238 B2 found a technique to detecting abnormal events in an ethylene oxide reaction process using multivariate statistical techniques and artificial neural network. Detecting abnormal events in a running commercial ethylene oxide reactor at incipient stage is very important but optimizing the process parameters to drive the process towards better performance during normal operation is more crucial. In this context, current invention may be considered having been made on the basis of the above U.S. patent as closest prior art. Instead of detecting abnormal event in EO reaction process, present invention focuses on how to optimize the reactor process conditions in real time to achieve higher selectivity of catalyst. One major objective of present invention is to support an engineer in controlling a process of ethylene oxide production. This objective is achieved by finding a real time optimizer of ethylene oxide (EO) reaction system which collects the real time plant operating data from data historian, carry out appropriate analysis of present and future status of the EO reaction system with the help of its in-built model and recommend to the plant engineers appropriate corrective and preventive actions so that performance improvement of reactor is realized in actual plant. The present invention proposes a method of generating a signal for adjusting a parameter of a process for ethylene oxide (also EO) production. In other words, a signal is generated by means of a computer implemented invention. The invention is related to a process performed in a chemical reactor in which ethylene oxide is produced. The ethylene oxide reactor can be configured as known from the prior art. Generally, the hardware and the functional parts, the sensors and actuators of the reactor fully correspond to setups of known ethylene oxide reactors. According to the invention, in a first step process data of the process for EO production are acquired. The process data can be measured or calculated data. They can refer to historical states and modes of the reactor and alternative or additionally to current process data. In other words, the process data can be retrieved from a data base or from sensors arranged in a real ethylene oxide reactor, or both. In a second step, based on the process data, an output value for the parameter is predicted. In other words, a computer program, which will be defined further below, is used for automatically predicting the future value for the key performance parameters based on the process data. The Key performance parameters for EO reactor can refer to catalyst selectivity, reactor temperature, ethylene oxide production rate or the like. In a third step, the-output value is compared to a predefined reference. In other words, the predicted output value is evaluated based on a certain threshold value or with respect to a certain mathematical function. As a result of the comparison, a difference between the output value and the predefined reference can be calculated or estimated. The difference can refer to a static difference or one-value-difference or can take into account a change in the output value over time. Such change can be reflected by the change of future output value from its present value over time (e.g. max./min. 1 hour ahead, max./min. 2 hours ahead or the like). Based on the result of the comparison the signal for adjusting the parameter is generated in an ultimate step. The signal can represent a recommendation or a control signal for adjusting the parameter of the process in order to adapt the process for ethylene oxide production in a real reactor. In other words, the signal is automatically taking effect on the parameter of the control unit of the reactor or at least represents a recommendation to staff of a reactor for adjusting the parameter. As a result, the process can be kept at best possible conditions for an optimum security, best possible performance and low costs. All historical information regarding a certain reactor can be regarded in real time and the process can be controlled even by non-experienced engineers who only have to follow the generated signal for adjusting the parameter or may have to agree/admit application of the automatically adjusted parameter to the process.
The dependent claims are related to advantageous embodiments of the present invention. The prediction of the output value may be performed by applying the acquired data to a genetic programming model and/or a kinetic based detail phenomenological model. In other words, the current or historical process data can be fit into a mathematical model of the genetic programming algorithm and in addition or alternatively to a kinetic based detail phenomenological model. Still in other words, the second step of predicting an output value for the parameter based on the process data is performed by applying a genetic programming model. The genetic programming model predicts the output value, which is compared to the predefined reference in order to generate the signal. Similar considerations apply for the kinetic based detail phenomenological model. It goes without saying, that the mathematical model can involve all available sensor data which represent the current (actual) EO production process as well as the historical process data derived from a data historian (data storage device). This way, all available data which appear relevant for adjusting the parameter can be regarded efficiently, 24 hours at 7 days a week with no skilled engineer being necessary for routine control actions.
Preferably, a combination of a genetic programming model and a second model (in particular a first principal-based kinetic model) are used for prediction of the output value. The genetic programming model and the second model of the process can preferably be adapted to a concrete reactor of a certain size and/or type and/or state. In other words, a given reactor, which is currently operating, can be simulated by the combination of the genetic programming model and the first principal-based kinetic model in order to run a digital twin of the reaction performed by the reactor. This way, simulation results based on concrete sensor data can represent beneficial control steps for optimizing the real-world reactor, if deemed necessary. It goes without saying, that the generated signal can automatically take effect on the control of the real world reactor, if needed. Several steps of checks and balances can make sure that the generated recommendation is applicable to the real world reactor and its parameters.
In case of a first principal-based kinetic model, a prediction error of the model is minimized by a data driven model, which in turn is based on artificial intelligence. In other words, the data driven model can be trained based on the relevant plant data stored in plant data historian—and further information such as current sensor data can be applied for minimizing the prediction error of the model. It goes without saying that the data historian of the real world reactor in question are best suited for training the mathematical model. However, even if plant data of different reactor are available, this model can be retrained to suit the different reactor and thus make it more beneficial.
The signal can be generated in a control unit (personal computer, large computer, microcontroller or the like) which is run on-site of the EO reactor which actually performs the chemical reaction. From the processing unit, the signal can wirelessly or wired be transferred to the control unit of the reactor. The control unit can in turn adjust a heating, open or close a valve (in part), add a certain starting material, reduce supply of a certain starting material or the like. This way, automatic adjustment of the process and its parameters is possible. However, it goes without saying, that the signal can be used for outputting a message to the plant production engineer, which represents a recommendation for adjusting a parameter, adding a certain starting material, opening a certain valve (or the like), in order to facilitate taking effect on the process performance and safety. Moreover, it goes without saying that certain checks and thresholds can be applied in order to evaluate a severity of possible effects of predicted output values or recommendation/control signals for adjusting the parameter and based on the severity outputting the signal to a first and/or a second real-planned engineer. Alternatively or in addition to that the confirmation of several plant engineers may be requested and waited for, before the signal is propagated to the control unit for taking effect on the parameter of the real world reaction. This way, the process safety and profitability can be assured in connection with the method underlying the present invention.
The acquired process data can comprise one or an arbitrary combination of one of the following exemplary process data: internal reactor data can comprise an age of a catalyst (e.g. in hours, days, years or the like), a selectivity of a catalyst (e.g. in terms of a certain percentage or a single value between zero and one), a temperature within the reactor or the temperature of a certain substance/starting material in the reactor, a pressure within the reactor, inlet moisture, inlet ethylene oxide (EO) and ethane concentration of the reactor. In particular, the ethylene oxide concentration can be fed back ethylene oxide originating from the reactor itself or from other reactors. Additionally or alternatively, external reactor data, such as material in a scrubber and/or in a CO2-removal unit can be used as acquired process data (derived from a data storage device or sensor data derived from a real-world reactor. These process data constitute a profound basis of information for surveilling the real-world reactor by means of the present invention and facilitate best possible recommendations/signals for adjusting the parameter.
In particular, if genetic programming, artificial intelligence and other similar mathematical models are involved, the inventive method can comprise a step of training the respective model by using sensor data of a process for ethylene production, which is simultaneously performed, in real time and in the real world. In other words, a continuous or iterative process of providing the process with information can be established, based on which information the process can improve its performance, accuracy and reliability. It goes without saying that training the mathematical model can be performed as an initial step before even applying the mathematical model to the method according to the present invention as discussed above. This way, the model will provide best possible performance right from the beginning of applying its results to the real world reactor.
Preferably, the step of predicting an output value for the parameter involves using a partial correlation co-efficient methodology and/or a combination of artificial neural network and genetic programming on the acquired process data. As discussed above, the acquired process data can be derived from a data storage means and/or from sensors surveilling the real world reactor.
Further preferably, the present invention involves a self-learning or auto correction functionality, according to which second process data of the process for EO production are measured/acquired at a second (and later) point in time, a second output value is predicted for the parameter based on the second process data and the second process data are analyzed with respect to the second future value. Subsequently, a result of said analysis is compared to a second predefined reference. In other words, the four essential steps of the inventive method are repeated at a later point in time and involve a second predefined reference. The second predefined reference can basically correspond to the first predefined reference or be identical to the first predefined reference. Based on a result of said comparison of said analysis to the second predefined reference the signal for adjusting the parameter is amended (or maintained, respectively) and alternatively or additionally, a model used for modelling the process for ethylene oxide production, which has been established at the first point in time, is invalidated. In other words, the repetition of the four essential steps of the method is used for updating the model used for modelling the process for EO production. Consequently, the updated model is used for modelling the process.
Optionally, the prior model and the recently found model can be compared to each other in order to refine the modelling technique and improve the model building process. Thus, evolution of the digital twin EO reactor model is positively affected.
Further details, advantages and features of the present invention are disclosed in the following description of embodiments according to the drawing. Therein,
Specifically, against the predicted selectivity %:
Specifically, against the predicted temperature:
Specifically,
In the present invention a digital twin application of the commercial EO reactor is built to mimic the real plant. This application will run in real time on a digital processing unit (e.g. a desktop computer) and collect plant data historic data (also named “historian” in this disclosure) e.g. at a regular interval of e.g. one hour. The software will collect the data in real time and carry out all the analysis in order to collect information for efficiently controlling the plant. With the help of its in-built AI based model of any other suitable algorithm, it will assess the current status of the plant, carry out root cause analysis for performance deterioration of catalyst (if any), predict the future performance and optimize the reactor operating parameters to increase catalyst performance. Moreover, this application will generate and communicate the decision to change chloride and other process parameters to respective control devices (valves, dosage systems, heating or pumping devices etc.) in order to establish a best performance for the process immediately or in order to communicate suggestions and/or target values for the parameters (e.g. on a dashboard and/or infographics) to an engineer.
The EO reaction process of commercial glycol plant in this disclosure can be applied to the hardware and implemented as software in a plant or process exemplary described in US patent 2018/9, 892, 238 B2. Oxidation of ethylene to produce ethylene oxide (also named “EO” in the following) is an important reaction in the petrochemical industry for synthesis of glycol, for instance—Commercially EO is produced in shell and tube type EO reactors by reacting oxygen and ethylene at high temperature and pressure in presence of silver based catalyst. The oxidation of ethylene involves a main reaction producing EO (epoxidation reaction) and an undesirable side reaction producing carbon dioxide or CO2 (combustion reaction).
Ethylene+oxygen→Ethylene oxide
Ethylene+oxygen→carbon dioxide+water
The performance of the reaction is measured by selectivity, which is calculated by the percentage of ethylene used to produce EO as compared to total ethylene used to produce EO and CO2. Indirectly, selectivity measures the extent of the first reaction as compared to the second reaction. Selectivity has a profound effect on the efficiency and hence the overall economics of the glycol plant.
An EO reactor may be built like a shell and tube heat exchanger. Silver catalyst may be put as a fixed bed in a tube side. Water is circulated through the shell side to remove the heat of reaction as both the reactions are exothermic. The conversion of ethylene to EO is very low, therefore ethylene and oxygen are recycled back as illustrated in
There are two associated processes, namely, cycle water process and CO2 removal process which are interconnected with EO reaction process. The EO rich cycle water after absorbing EO from cycle gas in EO scrubber fed to EO stripper. EO is stripped off from cycle water in EO stripper by applying heat and then EO free cycle water then again recycled back to scrubber to absorb EO again. This complete the cycle water closed loop. Similarly, CO2 rich carbonate solution from CO2 absorber fed to regenerator where CO2 is removed from carbonate solution by application of heat. CO2 free carbonate solution then recycled back to CO2 scrubber again to absorb CO2 from cycle gas and thus completes the closed carbonate flow loop.
Typically, there may be in the region of 20 independent and 35 dependent variables associated with such a process, not all of which are shown in this example. Independent variables measured include the cycle gas inlet composition (nine components, such as oxygen, ethylene, methane, ethane, carbon-dioxide, water, ethylene oxide, nitrogen and argon measured by online analyser 18), cycle gas flow 17, pressure 20, coolant temperature 19, chloride concentration at reactor inlet gas i.e., different chloride species such as ethylene di-chloride, ethyl chloride, vinyl chloride, methyl chloride, allyl chloride measured by online chloride analyser 34, methane flow 27, ethylene flow 28, oxygen flow 29, EDC flow 30, EO scrubber top temperature 26, wash tower top temperature 25 etc. Examples of dependent variables include cycle gas outlet composition 9 components as specified above measured by analyser 32, selectivity of catalyst 31, EOE production (calculated), steam generation in steam drum 33 etc. All the sensors and meters are interfaced to an online real time data historian 15 as shown in
The basic structure of the EO reactor digital twin (DT) application software is as follows:
In the following, the detail function of each module and related calculations are discussed.
It starts with data collection module. Purpose of this module to collect real-time data from plant data historian. Usually data collector module directly collect data at an interval of one hour from plant data acquisition systems usually installed in glycol plants (Like IP21, PI systems etc.). Data acquisition system usually connected with plant DCS, which in turn connected with different flow, temperature, level, pressure transmitters and also online analyzers in actual plant. Key process parameters in and around EO reactor, wash tower, cycle water system and carbon-dioxide removal system are identified in-priori and data collection system is continuously collecting real time data of these process parameters in fixed time intervals, e.g. every hour.
As this digital twin application module is collecting real time data from data historian and do a background model-based calculations; it is very critical that it collects and interprets the accurate data. Data accuracy is very crucial in such type of online optimization module.
However, industrial EO plant data usually can have following concerns:
Hence, for a real world running glycol plant, inaccuracies, outliers, spike and drifting of measured data is very common and expected. This inaccurate data collection is dealt with following ways in present invention:
If data are drifted and not beyond their average by a certain value, e.g. ±3 standard deviations level, then data are put in a detailed validation calculation module to check its accuracy.
In glycol plants, most of the process parameters are interrelated and they are moved together in certain fashion. Also, EO reactor system has many redundant measurements, i.e. there may be two or more transmitters measuring the same flow. These redundant transmitters are kept strategically during design phase so that redundant transmitters readings can be used in case original transmitters malfunctions or stop working. This interdependence of various process parameters and redundant measurements forms the basis of data validation calculations.
Mainly 3 performance parameters namely selectivity, temperature and equivalent ethylene oxide (EOE) production from reactor need to be verified and validated. All other reactor parameters can be calculated from them.
Where Delta represents difference between reactor outlet and reactor inlet concentrations for the particular component. Equations S1-S6 represent same catalyst selectivity calculated in different ways. All the above six equations are derived from reaction stoichiometry and based on fundamental chemistry.
S11, S33 and S44 are catalyst selectivity calculated by different flows.
Where Net Ethylene flow and Net Oxygen flow are the net oxygen and ethylene consumption in reaction and calculated as follows:
3. Validity Based on Heat Balance Across Reactor
Both the epoxidation and combustion reaction occurred in EO reactor are exothermic and generate heat. This heat is taken away by water circulation and steam generation in shell side of reactor and sensible heat by cycle gas. Since the heat of reaction of Co2 reaction is much more than EO reaction, heat generation varies as per selectivity. This fundamental heat balance concept is utilized to calculate the selectivity from steam generation data and cycle gas temperature rise across reactors.
Following guiding equations are used to calculate steam selectivity.
All the 10 selectivity mentioned above namely [S1-S6, S11, S33, S44 and steam selectivity] ideally should have same value if all the measurements of mass spectrometer [or gas chromatograph], oxygen, ethylene and steam flow meters are correct. In real plant, always there are some measurement errors and all ten selectivity are slightly different and their accuracy can be measured by a term called selectivity spread and defined as follows:
In online digital twin application, a solver was run to minimize the selectivity spread by varying all independent parameters namely delta concentrations of individual components and various flows. Oxygen flow and delta oxygen are taken at their measured value to reduce the multiplicity of solutions.
In this way the true value of concentrations and flows are calculated irrespective of slight error in their measurements.
These calculations involved reading from flow, temperature, pressure transmitters, analyzer readings etc. So even one or two transmitter showing wrong results or drifted then that could be identified by validation calculations involving all other transmitters.
Based on above validation calculations all the EO reactor process parameters are cross checked, validated and recalculated. These re-calculated values are then used for model building and subsequent calculations. Application of detailed validation calculation for EO reactor inside an online optimization model is new.
To optimize the commercial EO reactor, it is required to model complex systems of industrial chemical reactions as a first step.
Following sections describes the details of model building.
Industrial chemical reactions involved complicated reaction kinetics and thermodynamics. Building a credible phenomenological model for commercial reactors is considered as a very time-consuming difficult task as it demands a deep knowledge of industrial heterogeneous catalytic behavior characterized by mass diffusion, catalyst deactivation, etc. Lack of sufficient knowledge in chemical reaction kinetics actually hinders the optimization of the reactor. In most chemical plants, the reactors remain as black-box, and operation engineers do not try experimentation in running plants due to safety and reliability reasons. This drives non-optimum operation of industrial reactors, which greatly affects the profitability of the plant. So, reactors are considered as unexplored territory of chemical industries. However, a small increment of catalyst selectivity and reaction yield significantly impact the raw material consumption and plant profitability in large-scale operation. [1]
Hence, an easy alternate approach is applying data-driven effective computational technique to build approximate reactor models for these complex reaction systems, which can be subsequently used to optimize the reactor and increase profit.
As huge reactor inlet and outlet historic plant operating data are collected and stored every minute for most chemical plants, the real challenge is to utilize this wealth of data to make more profit. Data is new oil, and certainly, data-driven modelling techniques like artificial neural network (ANN) and support vector machine (SVM) are considered new IC engine of current age. However, as to the chemical industry, ANN and SVM models are not favored by the plant engineers due to lack of their expandability and black-box nature. ANN modelling provides an equation consisting of complex sigmoidal function with several tuning constants, known as weights and biases, and that is why the model suffers from the limitation of explain ability though the prediction capability of ANN model is excellent. Usually, process engineers want manageable equations in differential/algebraic form relating output variables with input features so that they can get more insight to derive benefit. Hence, it is needed to explore alternate computational methods. Genetic programming (GP), which is a branch of evolutionary modelling techniques, has been recognized by the inventors to have the capability to remove the above drawbacks of ANN and SVM models among chemical problems. Genetic programming (GP), automatically generates nonlinear structured models as closed-form equations relating input and output of the system from available data. Not only GP identify the structure of the equation, but also it estimates the different parameters of the equations so that it accurately predicts the output. At the time of building up MISO (multiple input, single output) models by using GP, the probability of survival of a particular model to its next generations depends on its prediction accuracy and followed ‘survival of fittest’ principles. Recombination of components of survived models continuously takes place to form new model aiming at increasing predictability in each generation.
In literature, the breakthrough in GP can be seen in the late 1980s with the experiments of Koza [2] on symbolic regression. The versatility of GP algorithm is proved by Koza and Rice [3] by applying in diverse fields of robotics, games, control, etc. Both dynamic and steady-state modelling of binary distillation column and twin-screw cooking extruder respectively were performed by Willis et al. [4] with the help of GP. Gray et al. [5] implemented GP-assisted dynamic modelling to generate continuous discrete models using MATLAB Simulink combination with sets of equations. Mckay et al. [6] demonstrated the application of GP in solving nonlinear test problems of vacuum distillation column, reaction system. Lakshminarayanan et al. [7] combined principal component analysis with GP to build up nonlinear models and applied the same in product design. Grossman [8] applied GP for the generation of empirical models in process system. Grossman and Lewin [9] presented an approach to automate the design of nonlinear MPC (NMPC) using a GP-driven, model-based control approach.
Multi-gene genetic programming (MGP) [10, 11] is one of the robust variants of GP and claims to be more effective than GP in nonlinear modelling. MGP is designed to generate mathematical models of predictor response data that are “multi-gene” in nature, i.e., linear combinations of low-order nonlinear transformations of the input variables. The conventional GP is mainly on the basis of evaluation of a single tree (model) expression. In multi-gene approach, several individual genes are combined to construct a single one. [11] It has been proved that MGP regression is capable of providing more accurate and computationally efficient model in comparison to conventional GP. [10, 11] It can be seen in many cases that MGP performed better than other machine learning methods, viz. ANN, SVM in terms of predictability and model simplicity. [12-15]
Despite the remarkable prediction capabilities of the MGP approach, applications of this method to chemical reaction engineering tasks in commercial reactors are conspicuous by their near absence. The reason—as discussed above—is to be seen in the clear preference of the engineers and stakeholders for deterministic approaches and control algorithms. Moreover, complexity of reactors make modelling a challenging task. Additionally, impurities in the starting material could have had disastrous consequences, if not adequately reflected in and considered by the model. Finally, the question of responsibility and liability for economical damage and personal injury may have prevented GP from being considered in chemical reaction engineering tasks. In the present invention MGP modelling approach is applied to build model commercial chemical ethylene oxide (EO) reactor.
One major objective of present invention was to find an accurate commercial EO reactor model which is in closed form equation, portable, explainable, deployable in real time application and can be used by the plant engineers to gain insight of the process and valuable support for adjusting parameters and concentrations.
Due to poor understanding and complexity of multiphase catalytic reactions occurring in commercial ethylene oxide reactor, a credible first principle-based model is not available in common chemical handbook and similar literature. Availability of large amount of reactor operating parameters data of commercial plants make data driven modelling as a viable alternative approach. The current work presents an effort to utilize the large amount of process data to find a framework to convert the information from this data to maximize profit.
Because of catalyst poisoning and sintering effect, the catalyst selectivity decreases gradually with the age of the catalyst, and the catalyst selectivity remains around 90% at the starting of operation (SOR) and becomes 80% at the end of the operation (EOR) (usually after 2-4 years). The average catalyst life of present generation catalyst is around 24-48 months. To maintain the plant production at target while selectivity is decreasing, the temperature of the reactor is gradually increased over the age of the catalyst life (e.g., from 225° C. in SOR to 275° C. in EOR) to increase the reaction rate. Catalyst selectivity determines the profitability of plant as higher selectivity means less consumption of expensive ethylene and oxygen to produce same amount of EO i.e., less production cost. On the other hand, reactor temperature determines catalyst age. Faster rise of reactor temperature means loss of catalyst age and incurring fixed cost of catalyst sooner by replacing the catalyst and taking plant shutdown for that. In commercial plant, two prime production objectives are optimizing the reactor operating parameters in such a way so that selectivity maximization and minimization of reactor temperature (i.e., maximization of catalyst longevity) are achieved simultaneously while maintaining plant production target.
GP is a branch of metaheuristic symbolic optimization technique which generates equations for solving a problem using the ‘survival of the fittest principle’ of Darwinian natural selection.
The general form of the reactor model to be obtained is given as below (Equation 4)
where y indicates the process output variable (Selectivity or catalyst temperature); X is the N-dimensional vector of input variables like flow, pressure, inlet concentration of various raw materials etc. (X=[x1, x2, . . . , xn, . . . , xN]T), and f denotes a nonlinear function whose parameters are defined in terms of a P-dimensional vector, β[β1, β2, . . . , βK]P. If industrial data of input and output variables are given, GP algorithm tries to best fit the data by changing its functional form and parameter vector β.
The algorithm of GP has been illustrated in
The reproduction, crossover, and mutation steps are illustrated in
The conventional GP is generally used for symbolic regression of a given input output datasets but often, conventional GP suffers from limitation of lesser accuracy. On the contrary, multigene approach enhances the accuracy of the model in higher extent. Multigene Genetic Programming (MGP) makes weighted linear combination of smaller GP trees or genes in order to maximize fitness. In MGP, predicted output variable is the summation of bias term and weighted output of individual trees (genes). The predicted variables in MGP can be expressed as follows (Equation 5):
where ypred is the predicted output, bo is the bias term, gi is the genes or trees and wi is the corresponding weightages. Similar to linear regression, the weightages and bias are determined by least squares method. Therefore, MGP captures the nonlinear behavior through the small trees and also applies classical linear regression methodology.
A typical MGP model has been pictorially represented in
Since selectivity and reactor temperature has a large impact on plant profitability, these two parameters are kept as output variables. All the reactor operating parameters which can affect the selectivity and reactor temperature are kept as ‘wish list’ of input variables. At first, all EO reactor operating data (hourly average) of nearly three years were collected. Then after consultation with plant expert, all the input variables that may influence the output variables were noted down. After that, the cross-correlation analysis was carried out. In this method, correlation coefficients of each of the input variables with each output variable and inter input cross-correlation coefficients were found out.
The following criteria are used to shortlist the input variables.
Based on the above criteria 11 input variables are finally shortlisted and tabulated in Table 1.
There are additional two input variables namely inlet EO concentration and inlet moisture concentration which are tackled separately after initial model building, according to a preferred embodiment of the present invention.
According to one embodiment of the present invention, it has been found that the Input Variables (x4), (x5) and (x7) are—in a relative view—less relevant in the context of the present invention and can thus be omitted, if desired.
The data ranges indicated in the Table 1 for the input variables below are exemplary and may slightly vary between production plants. It has been found that also the inlet concentration of saturated hydrocarbons, in particular the ethane concentration is an advantageous input variable (an example of the feature “process data” as used herein) within the context of the present invention. Preferably, the “total chloride concentration” is the total concentration of all chloride moderators present in the reactor inlet gas stream.
The inventors have found that the above input variables have a different degree of importance for advantageous model building. Thus, according to one aspect of the invention, the inlet variables (an example of the feature “process data” as used herein) used can be grouped as follows:
According to a particularly preferred embodiment of the invention, the input variables are a combination of one or more of the aforementioned input variables a) to f), more preferably comprising at least a), more preferably at least a) and b), more preferably at least a) to c), more preferably at least a) to d), more preferably at least a) to e), more preferably a) to f).
In commercial plants, data historian software is used to collect and store every operating parameter at every minute. EO reactor operating data (daily average) for all parameters given in Table 1 from commercial plant of nearly three years were collected. These raw data are then exposed to data cleaning and outlier removal software.
Data cleaning in one of the key tasks of modelling of industrial process as industry data contains noise, spikes, outliers, error etc. due to malfunction of different sensors, transmitters, analyzers, control systems and data historian software etc. It was found that quality of data driven model is determined by quality of data used to define the model. Noisy and bad data can seriously impact the model performance for data driven modelling, data quality is an important factor to be considered.
According to a preferred embodiment of the present invention, an automated effective data cleaning algorithm is part of this invention to avoid manual cleaning. According to this embodiment, multivariate Principal Component Analysis (PCA) was employed for pre-processing of data. An automated MATLAB based program was found which generate a multivariate statistical vector viz. t-squared from the plant operating dataset. Then, in the t-squared vector which have above 95th percentile, the corresponding rows were considered to be outliers and that outliers were removed from the dataset.
Selectivity and activity (measured by catalyst temperature) are two performance parameters of EO catalyst. There are 8 process parameters (see Table 2 below) which can affect the selectivity and activity of catalyst. However, in a continuous plant, there is always a delay time of these process parameters to have a visible effect on catalyst performance. For example, if a particular parameter (say CO2 concentration) increases now, we may see the impact on selectivity and activity after 12 hours. This is called time to impact. In short, change in input parameters will have some time delay to see its impact on catalyst performance. Moreover, this delay time or time to impact is different for different process parameters. For some process parameters such as reactor inlet ethylene concentration and oxygen concentration, the impact is immediate i.e., the delay time is very short. On the other hand, reactor inlet CO2 concentration and reactor inlet chloride concentration, the delay time is very large and can be 24 to 48 hours.
In practice, evaluating the response time of individual parameters is a very difficult task as all the parameters are continuously changing in a dynamic plant environment.
In this invention, a statistical methodology is found which can very accurately evaluate the response time of individual parameters from plant past data. Partial correlation co-efficient methodology is applied on EO reactor historical operating data, and the response time is calculated.
Partial correlation is a method used to describe the relationship between two variables whilst taking away the effects of another variable, or several other variables, on this relationship.
Partial correlation co-efficient is a statistical procedure which evaluate the correlation coefficient of an individual process parameters' impact on selectivity or activity while keeping all other variables constant.
Calculations related to partial correlation coefficient:
For simplicity it is assumed that
y=f(x1, x2)
where y is dependent variable and x1, x2 are two independent variable.
Bivariate (zero order) correlation between y and x1 (ryx1) can be calculated as
where ryx1 is bivariate (zero order) correlation between y and x1.
Similarly Bivariate (zero order) correlation between v and x2 (ryx2) can be calculated as
Bivariate (zero order) correlation between x1 and x2 (rx1x2) can be calculated as
These bivariate correlations are used to calculate partial correlations (first order) as follows:
Where ryx1x2 is the partial correlation between y and x1 while keeping x2 constant.
Similarly, partial correlation between y and x2 while keeping x1 constant can be calculated as follows:
Multivariate regression equation can be written as y=β0+β1x1+β2x2 where β1 =partial slope of the linear relationship between the first independent variable and y β1 indicates the change in y for one unit change in x1, controlling for x2 β2=partial slope of the linear relationship between the second independent variable and y β2 indicates the change in y for one unit change in x2, controlling for x1 β0=the y intercept, where the regression line crosses the Y axis.
The partial slopes indicate the effect of each independent variable on y, while controlling for the effect of the other independent variables and can be calculated as follows:
where σx1 and σx2 are standard deviations of y, x1 and x2 respectively. Once β1 and β1 have been calculated, Y intercept β0 can be calculated by
The above statistical procedure is used to evaluate the response time of each input variables on selectivity and temperature.
It is very crucial to evaluate the individual input parameters contribution to overall selectivity and temperature. In dynamic plant environment this type of data are not available where only one input variable varies and all other input parameters remain constant.
In other words, individual parameters are arranged in such a way that current selectivity is aligned with data of previous one hr data of input parameters. After all the time series data are shifted and realigned in above way, partial slope of each input parameters are calculated for selectivity in similar way.
Partial slopes of each variable in table 1 are evaluated for all these cases.
It is worth mentioning here that these delay times are not universal for any EO reactor, but they are dependent on catalyst type, plant size, catalyst age etc. However, this procedure to calculate delay time from actual plant data is generic and can be applied to any EO reactor plant data to evaluate delay time for that particular plant.
Accurate estimation of response time for individual process parameters as set out above is a very preferred embodiment of the present invention and allow an important improvement in quantitatively calculating the future value of performance parameters. This will help to manipulate the current value of controlled parameters i.e., to take corrective action so that performance parameters do not deteriorate in future. This also helps to take accurate calibrated corrective actions to improve the performance parameters in future.
Once the delay times of each parameter are calculated, input and output data set are shifted by their respective delay time and realigned. This realignment of data helps to predict selectivity and temperature with more accuracy. The cleaned realigned data was taken for MGP-based modelling. Random partitioning of the dataset comprising of eight input variables and two output variables (Table 1) into training set (80% of whole data) and test set (20% of whole data) was performed. The training set was used to compute the model by maximizing the fitness value whereas the test data set was used for cross-validation of the expression developed. The main objective of the cross-validation is to make the model more generalizable.
Code written in MATLAB 2019a was used to find the MGP-based model. In this invention, root-mean-squared error (RMSE) between actual and predicted outputs was taken as fitness function and the program was run in such a manner that the value of RMSE got minimized. Due to the stochastic nature of the MGP, the program was run 100 times to find the model.
Main objective of present invention is to generate a closed for model equation of EO reactor which is accurate, simple, portable and explainable.
The three years hourly average dataset were subjected to cleaning and after removal of outliers, 16000 data sets were qualified for model building. The values of MGP parameters required for modelling were found based on trial-and-error approach. For a real world reactor, the present invention has exemplary been verified in that basic arithmetic operators and functions, population size of 250, maximum generation of 500, maximum tree depth of 4 and maximum number of genes of 6 were taken for modelling. If these parameters are taken with high value, the model accuracy may increase but the complexity of the solutions also increases and also the program becomes computationally expensive.
MGP generates lot of promising model equations during its run. Table 2 and 3 summarizes the most prominent closed form equations generated by GP for selectivity and temperature. As seen in Table 2 and 3, there is always conflicting objective between model complexity (denoted by number of tree nodes) and model prediction accuracy (denoted by R2 in table). More accurate models are more complex and vice versa. Complex models are difficult to interpret and unnecessary over fit the data. For each run of MGP, two Pareto diagrams were found, one is for catalyst selectivity and another is for reactor temperature (
Use of multigene regression models frequently suffers from the phenomenon called ‘bloat’, vertical and horizontal bloat. Vertical bloat refers the tendency to evolve trees that contain terms that confer little or no performance benefit. In terms of model development, this is related to the phenomenon of overfitting. In present invention, by restriction on tree depth and use of Pareto tournament between expressional complexity and accuracy the vertical bloats were ameliorated. Tree depth was kept 4 by trial-and-error method and pareto diagrams are generated as shown in
Horizontal bloat is the tendency of multigene models to acquire genes that are either performance neutral (i.e., deliver no improvement in R2 (or “R2” as also used herein) on the training data) or offer very small incremental performance improvements. Horizontal bloat is essentially the same behavior exhibited by non-regularized models, where it is well known that the addition of model terms leads to a monotonically increasing R2 on training data even though the terms may not be meaningful (e.g., they are capturing noise) or allow the model to generalize well to testing or validation data sets. Ostensibly, the simplest way to prevent horizontal bloat in multigene regression is to limit the maximum allowed number of genes in a model. In this invention, maximum number of genes are kept 6 after trial-and-error method.
Out of potential candidates or representative model equations (refer Table 3 and 4) with varying degree of complexity and accuracy, for selection of optimum model, the following criteria were kept in mind:
All the ten equations in Table 2 and 3 are subjected to the above scrutiny to judge whether the developed equation is in agreement with plant observations. All the developed models are passed through rigorous testing. For example, ten test data set was generated where all the variables are kept at their 50-percentile value except oxygen inlet concentration which was varied from its minimum value to maximum value by equal 10 intervals. These test data were put in equations of Table 2 and Table 3 and respective 10 set selectivity and temperature data were generated. After that, first oxygen inlet concentration vs selectivity and oxygen inlet concentration vs temperature were plotted. From these plots, observation number 1 of Table 2 was verified. In same way all other plots were generated which are shown in
Model equations that do not follow the observations in Table 4 are rejected, as they do not capture the underlying physics of the EO reactor and not in agreement with plant observations.
They merely represent a complex data fitting equation without actual sense. From the shortlisted model equations, only one model equation for catalyst selectivity and one model for reactor temperature were finally selected which have been mentioned as below (Equation 8 and Equation 9): These two equations are considered the representative model equation for selectivity and temperature as they are highly accurate, obey the Table 4 observations and capture the internal physics of the underlying exemplary reactor.
The corresponding Coefficient of Determination (R2) and Average Percentage Error (APE) of the above models for training and test data are reflected in Table 5.
Looking at high values of R2 and low values of APE for both the models involving catalyst selectivity and reactor temperature (Table 5), it can be concluded that the predicted output values are at par with the actual output values and the models developed are reliable, fairly accurate and captures the inherent physics of EO reactor. The high R2 value on unseen test data and low APE also indicates about the model's generalizability and accurate learning on nonlinear input and output relationship.
Models' prediction performance on training and testing data are shown in
From Table 5 and
One of the major advantages of MGP modelling technique over ANN and SVR techniques is that it generates closed form of equation (like equation 8 and 9), which is explainable, portable and easily implantable in plant distributed control system. Though MGP generates a closed form of equation which has very high predictive capability, the equation found is large and complex and sometimes difficult to directly interpret. In present invention, a methodology is found to enhance the interpretability of the developed equations.
In short, found models capture the nonlinear relationship between selectivity and reactor operating parameters. These trend lines can be used by plant operating engineers to get the insights how a particular input parameter affects the catalyst selectivity. For example, from
The selectivity increases linearly with oxygen concentration with positive slope of 0.658, whereas the relation of selectivity with ethylene and CO2 are nonlinear and represented by second order polynomial. Now these trend line equations have been considered when finding the following equations (Equation 10 and 11).
Where x1, x2, . . . x8 are the actual value of the 8 input variables and x1, avg, x2, avg, x8, avg are the average (50 percentile) value of input variables respectively.
Each term in the equation 10 represents the change in selectivity if a particular parameter deviates from its average value. For example, the term, (x1−x1.) represents the deviation of inlet oxygen concentration from its average value and when it multiplied by co-efficient 0.658, represents the selectivity gain (or penalty) due to oxygen. In this way, all 8 parameters contribution is calculated in equation 8 and it is added with about 88.5% (average selectivity) to get the actual selectivity.
Main advantage of this equation (equation 10) over GP equation (equation 8) is that this equation is interpretable and easily explainable to plant engineer. Equation is simple, and contains terms or parametric co-efficient which throw lights relative importance of each parameter on the overall selectivity if they deviate from these base values. Also, it indicates whether the effect of each parameters is linear or non-linear.
Equation 10 is then used to predict selectivity of three-year actual data and predicted and actual selectivity is compared. Prediction error is 0.8% and R2 is 0.97. This low value of prediction error and high value of R2 signifies that developed equation (equation 10) is highly accurate and reliable.
Selectivity and activity are two major performance parameters of EO catalyst. There are 10 input parameters which have an influence on selectivity and activity. Parametric coefficients are defined as the change in selectivity or activity when one unit change of any individual parameter is done. For example, oxygen inlet concentration parametric coefficient of selectivity is defined as:
Parametric coefficients are very important to calculate the change in selectivity or activity of catalyst when an input parameter changes. It also provides a calculation basis to calculate the adjusted selectivity and activity to judge the real performance of catalyst in midst of ever changing dynamic plant environment.
At present, the value of parametric coefficient of limited number of parameters are available for Syndox-400 catalyst. These values are estimated in controlled laboratory environment and not in the plant environment. The validity of these parametric coefficients in plant environment is questionable as they were built in controlled R&D environment. It is very crucial to estimate the real life parametric co-efficient to enable them to apply in real time optimization.
As a part of the present invention, a calculation methodology is found to estimate the parametric coefficient of different input variables from actual plant data. The above mentioned Genetic programming based modelling methodology is used to calculate the parametric co-efficient of each parameters as expressed in equation 10 and 11. The whole 3 years plant hourly data is utilized to arrive at equation 10 and 11.
Parametric co-efficient of each individual inputs are derived from equation 10 and 11 and summarized in Table 6:
The value of parametric coefficients changes with age of catalyst. The sensitivity at different input parameters on catalyst selectivity and activity changes in a non-linear fashion over the age of catalyst. The current set of available parametric coefficients does not take this in account. It is beneficial to estimate the parametric co-efficient at different age of catalyst.
Whole set of 3 years' data are divided in 5 sets with 6 months each. Same modelling methodology is applied to each 5 sets data separately and parametric co-efficient are evaluated each time. In other words, same calculations are repeated in different catalyst age at fixed time intervals, say for every 6 months and age sensitivity coefficient of each parameter are calculated from historical data.
In this way, parametric co-efficient at different age of catalyst can be estimated.
There are few instances in plant when inlet moisture and/or inlet EO has increased significantly in plant due to some disturbances in downstream plant or during summer time etc. It is well known that increase of inlet moisture and/or inlet EO can have a very detrimental effect on catalyst selectivity and activity.
Parametric co-efficient of inlet EO and inlet moisture are separately calculated due to the fact the data related to very high inlet EO and inlet moisture are very sparse and few.
Following calculations procedures are followed:
The parametric coefficient of inlet moisture, inlet EO are not available for Syndox-400 catalyst and this procedure gives a methodology to evaluate them from real life plant data.
For an online digital twin application, it is very crucial to understand where the process stands currently in terms of catalyst performance. This will subsequently help to determine the preventive and corrective actions.
Catalyst performance status is detected online by 3 key performance indexes (KPI) namely adjusted selectivity, adjusted temperature and calculated EOE production.
In real plants process parameters and above 3 KPIs have noise coming from various measuring transmitters. To avoid influence of noise for online automated decision making following calculations are followed:
A trend line of cubic polynomials is automatically calculated and drawn to smoothen the noisy data. A pipeline was drawn around this trend line with radius of one standard deviations. The smooth trend lines are calculated and drawn for each parameters and KPIs around reactor.
These smooth lines are then taken for decision making. A decision making truth table related to selectivity, temperature and production rate based on first principle can be prepared to decide on current chloride process status.
During operation of EO catalyst in actual plant, there are times when catalyst performance (selectivity and/or activity) gets deteriorated in spite of full effort of engineers and operators running the plant. Production engineer are very curious to know why this performance deterioration happened.
This online module will identify all the responsible parameters for selectivity and/or activity decline in every hour. It will help to identify the root cause for selectivity and activity drop.
All responsible parameters will be shown in a comprehensive infographic
The contribution of each parameter on selectivity and activity trend will be calculated based on parametric coefficients of table 3. A replica of GP EO reactor model will run in the background to evaluate individual contributions. Chloride contribution to selectivity and activity will be calculated as remaining delta after allocating contributions with other parameters
GP based model are also built similarly for the following two systems:
This online module will Identify root cause outside EO reactor area, if any with the help of above two models. This will scan process parameters in EO scrubber and EO stripper, wash tower, CO2 absorber and regenerator etc. and identify root cause for high EO, CO2 and Moisture at reactor inlet. Based on Root Cause Analysis it provides recommendation for remedial action in real time basis.
Few ppm chloride is added at cycle gas at reactor inlet to improve catalyst selectivity and activity. For SynDox 400 series high selectivity catalyst, selectivity and activity follows a curve as shown in figure. As seen from the inverted U shaped curve, there is an optimum chloride value which gives highest selectivity. Selectivity drops significantly if the chloride level is below or above this optimum value. It is very crucial to identify and run the catalyst at optimum chloride level.
The decision of current chloride (chlorination) state detection can be based on the chloride curve as shown e.g. in
Based on the current status of the plant, future selectivity, temperature and EOE will be predicted for e.g. next 24 hours with this hybrid model in real time basis. The artificial intelligence based GP model will be used to predict the selectivity.
This GP model has superior prediction capability and explain ability. It will capture different parametric co-efficient at different age.
This future prediction capability of digital twin helps the plant engineer to see e.g. 24 hrs or even farther ahead in future due to their past action. Any future selectivity, temperature and EOE production deterioration trend is shown in dashboard with infographics in real time basis. These future predictions are recalculated at fixed or variable time intervals, e.g. every hour based on plant current data and the trends are updated.
Purpose of optimization module is to analyze the current situations of the plant, evaluate the best move of manipulated variables to maximize catalyst selectivity and activity.
Based on GP model prediction an online metaheuristic optimization (multi objective genetic algorithm) is carried out to prescribe operators the best operating condition in current situation to achieve the best selectivity and temperature. This metaheuristic optimization will explore thousands of possible combinations of corrective and preventive action that can be taken and prescribe the best among them to the operators. During calculations of best move, optimization modules need to honor the different constraints of process parameters, which is given by plant engineers. This optimization module exploits any margins available in ethylene and oxygen concentrations, cycle gas pressure and flow, EDC flow etc. which can be changed/manipulated by plant engineers. This online optimizer will guide and recommend the optimum process parameters which need to be maintained in the plant to the panel operators. The effect of optimization will be shown in dashboard and operators will clearly see what benefits of selectivity and activity plant is going to get in future if he implements the recommended actions.
Once a reliable model is developed, the second objective of present invention is to utilize the developed model to increase profit of the ethylene oxide plant and simultaneously reduce its environmental impact. This done by applying nature inspired metaheuristic optimization algorithm on the model to optimize the input process parameters which simultaneously maximize selectivity (i.e. reactor performance) and minimize reactor temperature (i.e. maximize catalyst longevity). Multi-objective genetic algorithm (MOGA) is chosen to optimize the input space of EO reactor MGP model to generate pareto optimal solutions which simultaneously achieve both the objectives in the best possible manner. Once the reliable models were successfully found, the models were subjected to verification and further optimization. Purpose of optimization is to find the optimum value of reactor operating parameters to achieve maximum selectivity and minimum temperature simultaneously. One of the critical tasks for optimization of any process is fixing of search space at which the optimal process conditions are to be found out. Therefore, before running the optimization a lower bound and upper bound of the process variables were fixed in consultation with plant engineers. As plant conditions are dynamic in nature, the total operational time was divided into three periods [start of run (SOR), middle of run (MOR) and end of run (EOR)] and at each running period, optimizations were performed. The lower bounds (LB) and upper bounds (UB) considered in these three cases have been depicted in Table 7.
With the help of MOGA algorithm, a pareto diagram was developed for case 2 (MOR) (
Pareto optimal solutions on the curve are considered equally good. As seen from the curve, if someone try to increase selectivity, one has to sacrifice reactor temperature. Reverse is also true, i.e., lower temperature can be achieved at the cost of lower selectivity. Other than these solutions, there is no process condition which can increase selectivity without sacrificing temperature and vice versa. Some of the pareto optimal solutions have been also tabulated in Table 7 and indicted by black circles in
The main advantage of such an invention is that it gives the operation engineer/DCS panel operators a strategy to run the reactor in optimum condition in real-time. In a running plant, since operators have no idea about Pareto optimal solution, operators try to optimize the plant heuristically based on their experience and knowledge. One such real-life operating point is plotted and named as ‘A’ in
Of course, it goes without saying that the invention is capable of outputting control data for running the reactor in optimum condition in real-time. In other words, the operation engineer/DCS panel operators do not have to input the model's output data into the control unit of the EO reactor, but the model's output data are fully automatically input into the control unit of the EO reactor. This way, the EO reactor is immediately fed with optimum control data, which prevents reading errors when interpreting the model's output data, typos when entering the data to the control unit as well as sabotage.
As a further option, an engineer can be prompted for applying the data recently output by the model. In other words, the engineer is asked whether to apply the model's output data to the control unit of the EO reactor as soon as same have been calculated. E.g. such prompt can be transferred to a personal computer or to a handheld device (smart phone, tablet, smart watch or the like) of a competent engineer via land line or wirelessly. The prompt as well as the engineers answer (accept or decline) can be transmitted via the companies Ethernet and/or the internet).
In particular, the prompt can depend on a difference between the currently applied values and the values recently output by the model. Only in case the difference exceeds a certain threshold value, a prompt and request for acceptance is sent to the engineer, whereas if not exceeded, the values recently output by the model are adopted automatically.
The threshold can be set by a user/engineer in order to match his faith in the model (e.g. based on his experience/skill level). The threshold can be defined in dependency of the current temperature of the process and/or the current selectivity of the catalyst.
Online recommendation can be generated by digital twin application as proposed by the present invention using feedforward and feedback action.
This online digital twin application provides online recommendation by using combination of feedforward and feedback action (refer
If the improvement of selectivity and temperature is positive, then feedback action will not take any action. Program will continue and give recommendation through FFM in next cycle. But at any point in time, when the feedback action detects that performance deteriorates and not as per expected, feedback action then generates a recommendation to revert back last action and go to original position before recommendation.
This feedback recommendation will override the feedforward recommendation in current cycle. This combination of feedback & feedforward action will autocorrect any wrong move generated out of model inaccuracies or unforeseen disturbance in plant.
In reality, catalyst behavior may change with its age while running continuously inside the plant. Reasons are plenty and not fully known. Some of the reasons are (but not fully known):
This behavior change can be detected by slope change of selectivity and temperature curve. As catalyst behavior was changed during operation, any model built from past historical data may not be valid or model's prediction capability may reduce drastically. Hence, online retraining of model is necessary to keep the model prediction accurate all the time throughout the catalyst life.
In this application, auto retraining of model is incorporated. With this feature, the model gets retrained with e.g. last 15 days' data and always remain accurate. Even catalyst behavior is changed during operation, such change is detected by prediction error (error between selectivity (or temperature) and model provided selectivity).
As prediction error remain high for continuous about 15 days, model retrains itself automatically with recent fresh data of last 15 days. Even if such behavior change does not happen, then also the present model is retrained in every 15 days, by default.
This auto retraining facility make the model up to date all the time and improve prediction accuracy in most of catalyst change.
If this auto-retraining facility is absent from this application, the static model may become invalid, or its prediction accuracy may deteriorate significantly with time if catalyst changes its behavior.
In the following, an example is given on how auto retraining is done in a process according to the present invention:
In this application, a database is maintained for e.g. last 15 days' data. These data are collected from plant data historian system.
In each cycle, the program according to the present invention detects its own prediction accuracy, which means it calculates the error between actual selectivity seen in the plant and model predicted selectivity (same applies for temperature).
If prediction error increase significantly, or exemplary 15 days elapsed, this application runs an auto retraining of model. In this auto retraining facility, an algorithm takes the last 15 days' recent data and perform following operation to update the model coefficients.
Exemplary steps performed by the auto retraining module:
Summarizing, and with no limiting effect in light of the enclosed set of claims, the present invention deviates from the prior art at least as follows:
The present invention claims the benefit of U.S. Provisional Patent Application No. 63/463,628 filed May 3, 2023, the entire content and disclosure of which is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
63463628 | May 2023 | US |