The present invention relates generally to control of technical processes and machines.
A process control or automation system is used to automatically control an industrial process such as chemical processes, oil refineries, power plants, timber industry, mineral processing (e.g. rock-crushing, screening, grinding, etc.), and paper and pulp factories. The process automation system often uses a network to interconnect sensors, controllers, operator terminals and actuators. Process automation involves using computer technology and software engineering to help power plants and factories operate more efficiently and safely. An automatic controller must be able to facilitate the plant operation over a wide range of operating conditions. Also many machines, such as crushers, screens, wood harvesters, etc., are very complicated systems with a plurality of sensors, controllers, and actuators.
The industrial processes maintain continuous operations in facilities that represent substantial start-up costs and time, but can be interrupted or disrupted by minor disturbances. If the product stream is disrupted or the process is not run optimally, lost productivity and lost product can create a large financial burden. Therefore it is important for the operator to monitor and supervise the process in real-time. Control rooms are central facilities used to manage large systems such as industrial processes. As the operators of large systems are asked to perform more efficiently, use more sophisticated control systems, and take on more duties and responsibilities, developers of control room equipment have sought to improve operators' ability to interact effectively with their systems. Various approaches have been proposed, which are often divided, according to the level of knowledge, into model-based and statistical methods.
Behaviour of a process can be predicted using simulation models. Process simulation is a model-based representation of industrial processes and unit operations in software for studying and analyzing the behaviour and performance of actual or theoretical systems. Simulation studies are performed, not on the real-world system, but on a (usually computer-based) model of the system created for the purpose of studying certain system dynamics and characteristics. The purpose of any model is to enable its users to draw conclusions about the real system by studying and analysing the model response. The major reasons for developing a model, as opposed to analysing the real system, include economics, unavailability of a “real” system, and the goal of achieving a deeper understanding of the relationships between the elements of the system. In process automation, the simulator may use measurements to show not only how the plant is working but to simulate different operating modes and find the optimal strategy for the plant.
Modern process plants generate a vast amount of real-time data that often has to be stored and processed efficiently for later analysis. All measurements, setpoints, controller outputs, device statuses, motor starts, alarms, operation tracking etc. may be stored into so called historian data-bases. A historian database may be integrated with other parts of a control system, such as controls, engineering and control room functionalities. By means of the history data collected from the process plant over time, trend charts can be created that show process trends in data over time. In modern control rooms, long history trends are usually available for the users. As all processes vary, single point measurements can be misleading. Displaying historical data over time increases understanding of the real performance of a process.
US 2002177909 discloses a multi-variable control loop assessment method and apparatus for analyzing, assessing and troubleshooting control loops in complex industrial processes such as papermaking operations. A multivariable analysis tool applies an orthogonal decomposition method such as the Partial Least Squares (PLS) to a disturbance model relating the known loop disturbances to the loop model residue. The tool first extracts the most dominant correlation to the loop model residue and then uses the residue to search for secondary dominant correlation in an orthogonal space. This process is repeated until no further output variation can be significantly attributed by the next dominant correlation. In this way, the analysis tool is able to estimate the performance potential of each control loop under different disturbance conditions and provide a control performance index by comparing the achieved performance to the performance potential in a multi-variable environment. This index indicates whether or not a given loop is under optimal operation and, in fact, shows the variance of the loop from the best achievable loop performance. A problem with PLS (Partial Least Squares Regression), as well as with PCA (Principal Component Analysis) and Factor Analysis is that these methods are linear methods. Moreover, latent variables are quite abstract and therefore difficult to explain for many purposes, and it is difficult to select proper variables to the models used.
In WO2008040845 the quality of the paper produced by a paper machine is monitored by profile measurements of the paper web. The profile-measurement information is combined with other process measurements, with the aid of correlation calculation. In the calculation, the period of time in which some parameter has changed significantly is examined. By calculating the correlations between parameter time series and the process measurements, the process measurements are found that correlate most strongly with the parameter and which thus have most probably influenced the profile change. A problem related to such correlation matrix method is that it is a linear method and may provide misleading results for nonlinear relationships. Moreover, if a correlation matrix is used as criteria for modelling, wrong or confusing results may be obtained.
Non-linear models have not been employed because they are complicated and require very much processing capacity and extremely long processing times even if there are only a handful of variables.
Real-life management and up-keeping challenges of a process monitoring and diagnosis are also discussed in “Managing Adaptive Process Monitoring: New Tools and Case Examples”, Mats Friman and Harri Happonen, Proceedings of the 15th Mediterranean Conference on Control and Automation, July 27-29, Athens, Greece, 2007.
There is a need for a monitoring and diagnosis tool which allows an easy identification of process parameters affecting on process problem, or productivity or efficiency of the process, and making appropriate control actions based on the identified parameters. There is also need for a method which allows the operator to evaluate non-linear relationships of parameters with the monitoring and diagnosis tool.
An object of the present invention is to provide a new operating tool which can assist an operator of a process or machine in decision making. This object of the invention is achieved by the subject matter of the attached independent claims. The preferred embodiments of the invention are disclosed in the dependent claims.
An aspect of the invention is a method of operating a process or a machine, comprising
obtaining operating data from an operating historian database, said operating data containing a plurality of variables collected from a process or machine during operation,
determining, by non-real time processing, statistical models, each of the statistical models describing non-linear causal interrelations between two or more of said variables,
organizing, by a non-real time processing, a plurality of operating variables into a hierarchical presentation of said statistical models according a statistical criterion, in which hierarchical presentation the models are connected to each other through said variables, and storing said hierarchical presentation of statistical models in a database,
receiving, via a graphical human-machine interface of the process or machine, a selection of one of said plurality of variables as a variable of interest for real-time analysis,
selecting, by real-time processing based on said stored hierarchical presentation of statistical models, among said plurality of variables of the collected process data at least one explanatory variable which has the largest impact on said variable of interest in terms of said statistical criterion, and
analyzing and displaying on said graphical human-machine interface relationships between said variable of interest and said at least one explanatory variable.
Another aspect of the invention is a method of operating a process or a machine, comprising
obtaining operating data from an operating historian database, said operating data containing a plurality of variables collected from a process or machine during operation,
determining, by non-real time processing, a collection of statistical models, each of the models describing non-linear causal interrelations between two or more of said variables, and storing said collection of statistical models in a database,
receiving, via a graphical human-machine interface of the process or machine, a selection of one of said plurality of variables as a variable of interest for real-time analysis,
dynamically organizing, by a real time processing, variable-specific hierarchical presentation of statistical models from the statistical models of the stored model collection for analysis of said variable of interest,
selecting, by real-time processing based on said variable specific hierarchical presentation of statistical models, among said plurality of variables of the operating data at least one explanatory variable which has the largest impact on said variable of interest in terms of said statistical criterion, and
analyzing and displaying on said graphical human-machine interface relationships between said variable of interest and said at least one explanatory variable.
According to an embodiment of the invention said analyzing of said relationship information comprises displaying, at a graphical human-machine interface, a presentation of said variable of interest in function of said at least one explanatory variable.
According to an embodiment of the invention said analyzing of said relationship information comprises displaying, at said graphical human-machine interface, a hierarchical variable presentation having said variable of interest at a root and said at least one explanatory variable at a branch.
According to an embodiment of the invention the method comprises the operator manually adjusting, during the analysis at said graphical human-machine interface, said at least one explanatory variable by one or more of: removing one or more of said at least one explanatory variable from the analysis; locking one or more of said at least one explanatory variable to a certain value or range of values; and setting an allowed direction of change of one or more of said at least one explanatory variable and said variable of interest.
According to an embodiment of the invention said non-real time processing comprises automatically building several hierarchical presentations of statistical models, wherein variables to said statistical models are selected among said plurality of variables based on a page hierarchy of the graphical human-machine interface.
According to an embodiment of the invention the method comprises
analyzing a predicted impact of said at least one explanatory variable and said variable of interest on each other using said hierarchical presentation or presentations of statistical models,
showing the impact in a numerical and/or graphical form at the graphical human-machine interface.
According to an embodiment of the invention said analyzing of said relationship information comprises displaying, at said graphical human-machine interface, a predicted value of the variable of interested, calculated with said hierarchical presentation or presentations of statistical models, when said at least one explanatory variable varies in value.
According to an embodiment of the invention said analyzing of said relationship information comprises
varying, via said graphical human-machine interface, one or more of said at least explanatory variables,
displaying, at said graphical human-machine interface, a predicted value or range of values of the process output variable calculated using said hierarchical presentation or presentations of statistical models.
According to an embodiment of the invention the method further comprises
selecting an explanatory variable that, together with already selected explanatory variable or variables, if any, best describe the selected process output variable in terms of said statistical criterion of the variable of interest, and
sequentially repeating the selection step until a predetermined stopping criterion is met.
According to an embodiment of the invention the method further comprises
calculating the statistical criterion of the variable of interest,
calculating, for each of said plurality of variables, the impact between the respective variable alone and the variable of interest in terms of the statistical criterion of the variable of interest,
selecting as a primary explanatory variable the one of said plurality of variables that best describes the variable of interest in terms of the statistical criterion,
calculating, for each remaining one of said plurality of variables, the impact between the respective variable, together with already selected primary explanatory variable, and the variable of interest in terms of the statistical criterion,
selecting as a further explanatory variable the one of said plurality of variables, if any, that together with already selected primary explanatory variable, best describes the variable of interest in terms of the statistical criterion,
sequentially repeating the selection step until a predetermined stopping criterion is met.
According to an embodiment of the invention said statistical criterion is entropy or relative entropy.
According to an embodiment of the invention said statistical models comprise multi-dimensional histogram models.
According to an embodiment of the invention the method further comprises
controlling, through said graphical human-machine interface, said variable of interest in the industrial process by adjusting one or more of said at least one explanatory variable based on information obtained from said analysis.
A further aspect of the invention is a system for implementing any one of method embodiments of the invention.
A further aspect of the invention is a computer program comprising program code for performing any one of method embodiments of the invention, when said program is run on a computer.
A further aspect of the invention is program code means stored on a computer readable medium for performing any one of method embodiments of the invention, when said program product is run on a computer.
In the following the invention will be described in greater detail by means of exemplary embodiments with reference to the attached drawings, in which
An exemplary embodiment of the invention is illustrated in
Referring again to
A process 2 that is controlled by an automation system 4 may typically include a high number of field devices, such as actuators, valves, pumps and sensors, in a plant area (field). There are various alternative ways to arrange an interconnection between an automation system 4 and a process 2 (e.g. field devices), such as two-wire twisted pair loop (4 to 20 mA analog signal), HART (Highway Addressable Remote Transducer) protocol, and other field buses (e.g. Fieldbus and Profibus). However, it is to be appreciated that the type or implementation of an interconnection between an automation system 4 and a process 2, e.g. between a control room and field devices, may be based on any one of the alternatives described above, or on any combination of the same, or on any other implementation. A practical plant configuration may, and typically does, include several types of automation lines or field buses in parallel, since the plant is updated and extended gradually over a long period of time.
Process operating data may include process measurements y(k) process controls u(k), set points r(k), etc., for example. The process measurements y(k) may include any measurement of any desired variables or properties in a process 2, such as a flow rate, a pressure, a temperature etc. These process variables can be measured with dedicated sensors arranged in the field of a process plant. Process controls (inputs) u(k) from an automation system 4 to a process 2 may include, but are not limited to, control inputs to field devices.
An operator or user may monitor and control the automation system 4 via the user interface 6 of the automation system. Through the user interface user may monitor displayed measurement data y(k) and process controls u(k) received from the automation system 4 and provide actions and setpoints to the automation system 4. In the exemplary embodiment shown in
The operator user interface 6 in a control room may be based on a human-machine interface (HMI) running in the operator console and allowing the user to monitor, supervise and control the process. The aim of the operator user interface is effective operation and control of the plant systems, typically by means of keyboard and mouse controlled displays on which a vast amount of process information is graphically presented (graphical user interface, GUI). The primary aspects of this interface are graphics animated with feedback from the process which aids the operator in making operational decisions, e.g. make actions and change setpoints to operate the process. The operator user interface may also include data browsing and analyzing utilities, configuration tools and alarm display. In steady operation, the operator interface displayed in the main control room may show the synthetic and functional state of the plant system. At one glance, the operator may be able to know the overall state of the plant system—i.e. the global state of all the components. If a problem occurs, the controlled component appears in error and the operator may access its detailed state, for instance opening a display that is specific to the supervised component. This interface may help the operator to locate the origin of the fault and shall suggest a corrective action if possible. In the operator user interface, it is desired that the displayed information should be organized purposefully, in meaningful and useful ways based on clear, consistent models that are apparent and recognizable to operators, putting related things together and separating unrelated things, differentiating dissimilar things and making similar things resemble one another.
In order to provide an improved user interface environment for an operator in a process industry, according to exemplary embodiments of the invention, the user interface is provided with an interactive tool which allows selecting any variable (called “key” variable herein), which is of special interest, at the user interface and determining or selecting which other variables, or combination of variables, out of a large number of variables, have the largest impact on (non-linear relationship with) the key variable. This interactive tool may be used, for example, in a continuous troubleshooting, in order to reveal root causes for quality and runnability problems.
An example of the operation of the tool is illustrated in a flow diagram of
However, in many situations and application, such trial-and-error control is not desirable, but the operator would prefer to have more information on how to adjust the explanatory variable(s) obtained. Therefore, in some exemplary embodiments, the impact of one variable to another may be demonstrated by a prediction method and shown in a numerical and/or graphical form at the user interface (step 26). After being satisfied with the result of the analysis, the operator may decide on which is the preferred parameter change(s) in order to overcome the problem in question or achieve the desired improvement, and then implement the corresponding change(s) on the real system via the same user interface (step 28).
According to an exemplary embodiment of the invention, variables related to the selected variable of interest are automatically determined with an entropy-based statistical selection method, i.e. with entropy, or more specifically, relative entropy as a statistical criterion. However, the invention is not limited to use of entropy as a statistical criterion but another statistical criterion may be used as well.
For example, in monitoring of fan efficiency it is difficult to select variables that best describes the operating point. All candidate variables, i.e. Head, Flow and Wing Angle correlate strongly with each other. With entropy, the following relative entropy changes may be obtained: Head 7.1%; Flow 1.6%; and Wing Angle: 2.2%. The head has the best relative entropy and, therefore, is the candidate variable in this comparison and should be included in the list of explanatory variables of the fan efficiency. With this variable included in the list and in a next iteration round, the next iteration round gives (with the head set to the 7.1%) following relative entropy changes: Flow 16.7% and Wing Angle 12.6%. Thus, the parameter Flow would be beneficial as a secondary explanatory variable.
Example implementation of a generic procedure for determining a list of explanatory variables for one dependent variable may be:
indicates data missing or illegible when filed
Exemplary embodiments of the invention provide a possibility to statistically suggest variables that cause problems, such as emissions and energy consumption, or runnability problems. In selection of variables a difficulty evolves from the strong mutual correlation between explanatory variables. According to an exemplary embodiment of the invention, only one of these variables with strong mutual correlation is selected, and instead variables with small mutual correlation are selected to a list of explanatory variables. The variable selection by using relative entropy as a measure has the purpose is to select the most representative variables that explain the variation in the variable of interest. Thus, with exemplary embodiments of the invention, in variable selection a large correlation from inputs to output and a small mutual correlation between inputs can be achieved. Exemplary embodiments of the invention also provide a simple method for selection of variables out of numerical data with nonlinear relations. In other words, the selection method is nonlinear, which is an advantage to compared to commonly used multivariable statistical methods like PCA or PLS. First of all, the method is nonlinear. From the user's point of view, no abstract and difficult-to-interpret latent variables are required, but real variables are selected to represent a set of variables with mutual correlation. A selection method according to exemplary embodiments of the invention also allow selection of different types of variables out of numerical data, such as continuous (e.g. temperature, pressure), discrete (e.g. number of shutdowns), and classification variables (e.g. type of product). A selection method according to exemplary embodiments of the invention also enables selection of a large number of variables, while requiring only a restricted calculation capacity. No pre-knowledge of the automation system or no pre-configuration is not necessarily needed, but it is possible to initiate a selection according to exemplary embodiments for any variable of interest at a user interface during online operation of a process.
In exemplary embodiments of the invention, the interactive tool of the user interface may further be provided with a statistical predictor (explorer) for exploring non-linear statistical relationships of variables, which allows the operator to operate the user interface of the automation system against a statistical model. The operator may also initiate an automatic building the statistical predictor, based on a history trend data and an existing configuration data, for example.
The interactive predictor tool may be based on self-organizing stochastic variables to a hierarchical structure, i.e. into a network of models connected through variables, such as a tree, for example. The hierarchical structure may be organized such that variables with high mutual information are closely connected. In exemplary embodiments of the invention, in addition to using a statistical method, e.g. entropy-based method, for variable selection for models as described above, a statistical criterion, such as the entropy, is also used as a basis for construction of the hierarchical structure. In other words, the way the statistical models are attached to the hierarchical structure, such a tree, is based on a statistical criterion, such as entropy.
Various statistical methods are suitable for modelling non-linear interrelations between variables in the exemplary embodiments of the invention. These are referred to as statistical models herein. As examples of simple statistical models, multi-dimensional histograms will be discussed below. A histogram is a well-known way of presenting data from one variable. A multi-dimensional histogram is just a histogram in multiple dimensions. As an example data set we will use the daily average outdoor temperature measured in Helsinki during 1951-2004.
An exemplary construction of a tree structure based on information entropy is illustrated in
The next task before building the tree is to decide in which order the submodels are connected. The tree is built by connecting submodels, consisting of 2-3 variables to the tree. A model that would cause a cycle cannot be attached to the tree. Therefore, the order of connecting submodels, is important. During phase 1 described above, we will get (in addition to the explanatory variables) the entropy based goodness of the model. As discussed before, the variable which results in the largest change of relative entropy is selected as explanatory variable. This value may also be a measure of the goodness of the model. In this case, when we use two explanatory variables, we get two goodness values: 1) goodness of a model with one explanatory variable (hr1), and 2) goodness of a model with two explanatory variables (hr2). The values, sorted according to hr2, are:
Since cycles are not tolerated, we may be forced to leave out explanatory variables during connection. It is therefore useful to consider using only one explanatory variable whenever possible. If a submodel is well modelled with only one explanatory variable, we can add this submodel to the tree at a later stage. For example, the variable variable2 is described with the following explanatory variable with goodness (hr1):
EV1:variable3, hr1=0.76499
With two explanatory variables we will get (variables and goodness hr2):
EV1:variable3, EV2:variable1, hr2=0.85101
As seen, the goodness does not improve much with two explanatory variables as compared to one explanatory variable. For this submodel it would hence be reasonable to connect this submodel at a later stage, and use the second explanatory only if it has not already been connected, i.e. it will not add cycles to the tree.
The justification of using 2 explanatory variables is calculated as
where hr1 and hr2 are the relative entropy changes (goodness of models) with one and two explanatory variables, respectively.
To summarize, the connection order may be decided based on
1) The goodness of models (with two explanatory variables) hr2.
2) The improvement of using two explanatory variables instead of one e2.
3) A combination of 1 and 2, f(hr2, e2).
In this example we will use put a little more weight on the model goodness and use the following criteria for the selection order
We will hence build the tree, by selecting the submodels in the following order:
Next we will build the tree by connecting submodels in the order discussed above. We will start with the first model (corresponding to steps 42 and 44 in
With the construction algorithm of the hierarchical structure of models, it is not guaranteed that all models and variables will end up in a single hierarchical structure. This is illustrated in the example of
Sometimes it is convenient to connect all hierarchical substructures, e.g. subtrees to a single hierarchical structure, e.g. tree. Two hierarchical structures may be connected such that the variable pair with the highest mutual correlation (e.g. information gain) is connected. In the example of
It can be seen that the single tree can be merged through variables 3 and 4 with a new model (Model3) because variables 3 and 4 have the highest Information gain, as illustrated in
In an exemplary embodiment of the invention, in place of or in addition to building a general hierarchical structure of models which contains all variables, a collection of models may be built. An example of a model collection is shown in
A variable-specific hierarchical structure of models (e.g. a tree) may be dynamically built from the models (submodels) of the model collection. The variable-specific construction of hierarchical structure of models may start from a variable of interest provided by the operator through the interactive user interface. All models of the model collection with a connection to the variable of interest are considered in the construction of the hierarchical structure. All considered models that will preserve the hierarchical structure (e.g. tree), are attached to the hierarchical structure. Models are attached to the hierarchical structure in a specific order determined by the model affinity (relative entropy), for example. In other words, the model having the best relative entropy, for example, is added in each iteration round, in similar manner as in the method described with reference to
An example of a dynamic construction of a variable-specific tree is illustrated in
In inference engine block 83 represents a prediction or exploring function that allows the operator to analyse and demonstrate the impact of one variable to another in tree. The operator may manually adjust the related variables shown on the interface: for example, by removing a related variable from analysis, by locking a related variable to an actual value, or by providing the favourable direction of change of the value of a related variable (up or down). The analysis may be shown in a numerical and/or graphical form at the user interface 6. After being satisfied with the result of the analysis, the operator may decide on which is the preferred parameter change(s) in order to overcome the problem in question or achieve the desired improvement, and then implement the corresponding change(s) on the real system via the same user interface 6. The tree is modified each time evidence has been presented, i.e. when the operator changes a value of one of the variables (called an evidence variable herein) in the tree through the user interface 6. Each model connected to the presented evidence variable may be modified. This may modify the distribution of the other variables in the model. These variables are treated as new evidence to the rest of the tree. Each time evidence is entered to a variable, the evidence will propagate through the entire tree in a single pass. Let us consider the inference of the tree shown in
As an example of an inference method a Bayesian inference may be employed for the models. More details about Bayesian inference can be found e.g. in Pearl, Judea (1982). “Reverend Bayes on inference engines: A distributed hierarchical approach”, Proceedings of the Second National Conference on Artificial Intelligence. AAAI-82: Pittsburgh, Pa. Menlo Park, Calif.: AAAI Press. pp. 133-136.
According to an exemplary embodiment of the invention, the calculation may comprise two main stages 1) a background calculation stage 86, which makes required calculation-intensive preparations, and 2) an on-line stage 87, which involves rapid calculations. As the calculations, which are made in response to the operator's action during the analysis, can be performed more rapidly, a prompt user experience can be achieved. In the example shown in
As noted above, the tags in the historian database of the automation system may be used for purposes of the invention. However, there are usually too many tags in a system for one single statistical model, typically 1000's of tags. Usually, when the number of tags exceeds 100, the modelling phase will slow down.
In an exemplary embodiment of the invention, several sub-models are used. One tag may exist in several submodels. The page hierarchy of an automation system user interface (UI) may be utilized as criteria to select tags of sub-models. This is based on an assumption that variables shown on same page can be assumed to be related by default. All tags on one page, as well as tags on a parent page (the page on higher level of the page hierarchy), may be included to the same model. As a result, the model size will be moderate since the number of variables shown on one page is limited. This approach is illustrated in
Exemplary embodiments of the invention may provide various advantages. The hierarchical structure can be efficiently built for large networks (more than 100 variables). The inference method is one-pass, i.e. it is fast even for large hierarchical structures.
Exemplary embodiments allow improved automation system user interfaces with one or more of following features: the existing user interface, used in daily operation, can be used; easy to use, since the user interface is familiar to the operator; same displays/views can be used for both a normal process operation and for deeper analysis, prediction and decision making; existing configuration data is sufficient; no manual configuration is needed; user can modify or lock any variable (input/output/parameter) which is part of the model; when the user modifies a value, he/she can observe changes made by the prediction without any additional fields (e.g. numbers). User's decision making is supported in various scenarios: what has possibly caused a problem; what is relevant in this particular situation; how to correct the situation, what-if experimentation; has this happened earlier; how to inform others, etc. As further potential advantages: opening the analysis tool from a target with point and click is possible; analysis tool readily shows what the system has concluded relevant; dependencies and their strengths are shown in a simple way; also a more thorough analysis is available if needed; the operator's verification in a particular situation is supported with curves and their manipulation methods; prediction and experimentation is supported with direct manipulation methods. Similar situations can be shown on timeline and they can be stepped (this could also be automatic and no button to start this is needed). Informing others with diary entry can be made easy with one click access and the situation may be automatically stored.
Let us now examine example embodiments of user interfaces according to the invention.
In this example, an exemplary interactive user interface according to the invention is used to maximize Kappa reduction in a bleaching plant.
1. The operator selects Kappa Reduction as the variable of interest at the user interface. The analysis is opened from an initial tag. A number of tags, which are related to the initial tag, are automatically added to the analysis. In this example, the tree shown in
2. Operator knows that “Kappanumber In” cannot be modified and locks that value (illustrated by the horizontal arrow in a “box” in
3. Based on this information, the operator opens branch under “Kappanumber Out”, which results in the user interface view shown in
4. The operator clicks (e.g. with a right button of a mouse) at the variable “Stock temp. in reactor 2”, and a context menu shown in
5. In
Some numerical examples, which demonstrate the use of entropy in variable selection in connection of Example 1, are discussed below. Kappa number estimates the amount of chemicals required during bleaching of wood pulp to obtain a pulp with a given degree of whiteness. Since the amount of bleach needed is related to the lignin content of the pulp, the Kappa number can be used to monitor the effectiveness of the lignin-extraction phase of the pulping process. It is approximately proportional to the residual lignin content of the pulp. In the following example, the “Kappa reduction” is a variable of interest. Let us assume that trend data collected from a pulp mill will give histogram of the Kappa reduction shown in
Entropy of a single variable can be calculated from its histogram as follows
wherein h=entropy and pi=the probability, which is estimated from the histogram by dividing it with total number of observations.
The histogram shown in
1. The number of bins is usually quite low (3-6). With few bins we can speed up the calculations and usually get more reliable results, especially when we have small number of observations.
2 The bins are not necessarily equally spaced. Instead we may select the bins such that we get a similar number of observations in each bin. With this arrangement the initial entropy is maximized.
For simplicity reasons, we will use only 3 bins in this example. We will select the discretization intervals such that we will get the number of hits almost equally distributed between bins. This is illustrated in Table 1.
The entropy (h0) of this distribution is
Entropy, h=05849622
Entropy of a Variable with One Input
Let us first consider entropy of Kappa reduction with one input, i.e. one explanatory variable. Variable selection based on entropy may be done as follows. Each candidate input value is divided into bins in the same way as the output variable above, and for each input bin the corresponding output entropy is calculated. In our case where we use 3 bins, we will get 3 output distributions, and hence 3 entropy values. Since we only need one entropy value, we may calculate the weighted average of the individual entropy values
The weighted entropy is compared to the original entropy value, i.e. h0. The input variable, which produces the largest change in output entropy, is the best explanatory variable. In order to easier understand the numerical value, we may use the relative change in entropy calculated as
In our example, the best input variables in terms of relative entropy change are shown in Table 2.
Let us now study how the relative entropy may be calculated for the best explanatory variable distributions shown in Table 3.
From the Output Entropy and Number of Hits, we may calculate the weighted average of entropy
Average entropy, havg=1.1975
The relative entropy change (h0-havg)/h will then be
Relative Entropy change, hr=0.2444
In other words, with using this variable as an explanatory variable, we would be able to reduce the disorder by almost 25%.
Let us next consider adding a second explanatory variable. When the first explanatory variable has been selected, we can add more explanatory variables. It is however important to take into account the variables that already have been selected. Also here, we will test all input variables and calculate the relative change in entropy. The best explanatory variables and their entropy values, in addition to the already selected ones, are illustrated in Table 4.
As can be seen from Table 4, with two inputs, we can get over 50% reduction of disorder in data. Let us now study how the entropy value was calculated for the best inputs.
With 2 input signals, we will get 3×3=9 different input bin combinations. Consequently, we will have 9 Output Histograms and 9 Output Entropy values as illustrated in Table 5. As can be seen from Table 5, the Output Histogram is quite well organized with these two explanatory variables.
Also here the weighted average may be calculated to combine all entropy values to one single value. The weighted average of output entropy is
havg=6958
and the relative entropy change hr=(h0-havg)/h0 is
Relative Entropy Change, hr=0.561
In this example, an exemplary interactive user interface according to the invention is used for decision support, i.e. for selecting the right soot blowing instant in a power plant boiler.
1. The operator selects “Time since last soot blowing [h]” as the variable of interest at the user interface. As result, the tree shown in
2. The operator analyzes the tree by opening more branches as illustrated in
3. The tree with locked variables is shown in
4. The operator then initiates a prediction. In the prediction, “Thermal power SH3 [MW]” is configured as an output variable (y-axis in plot). This is made by clicking (e.g. with a right button of a mouse) at the variable “Thermal power SH3 [MW]”, such that a context menu shown in
5. Based on the x-y plot, the operator is able to justify the value and need of the soot blowing, and make the appropriate action in the real system.
In this example, an exemplary interactive user interface according to the invention is used for decision support, i.e. for analysing a temperature problem in a power plant boiler.
1. The operator has a normal process view at user interface, as illustrated in
2. A context menu appears, as shown in
3. An analysis tool opens, as shown in
4. The operator notices one measurement has no influence. He removes it from trends view, thereby obtaining a view shown in
5. The measurements as tree are opened, as shown in
6. The operator concludes that the variable “Flue gas before LUVO” has caused the problem and stars starts planning actions to correct the situation. The operator clicks Predict button in the view shown in
7. A section showing the future opens (the rectangular area on the right), as shown in
8. To continue the verification, the operator also wants to know if there are similar situations happened earlier. To analyse this, the operator crops by a rubber band zoom the relevant area from the curves and clicks “Trace similar situations” button.
9. As a consequence, arrow buttons above the “Trace similar situations” button and markers on time line slider appear, as shown in
10. The operator decides that the situation is what he has concluded and wants to inform others. He clicks “Write diary entry” button. A diary is opened (not shown) and the analysis tool with related data and screenshots in analysis tool are automatically stored in the diary entry.
In this example, a further exemplary interactive user interface according to the invention is illustrated for decision support, i.e. for analysing a power plant boiler.
1. The operator has a normal process view at user interface, as illustrated in
2. A prediction mode view appears which may be indicated by a coloured border, for example, or by some other suitable visual effect, as illustrated in
3. The operator may click mouse button on a tag in the UI view to open a context menu, as shown in
4. The operator may click the mouse button to open the context menu and select “Show” and “Predict analysis”. As a result a UI view shown in
5. The operator may click the mouse button to open the context menu and select “Show” and “Dependence tree”. As a result a UI view shown in
The techniques described herein may be implemented by various means. For example, these techniques may be implemented in hardware (one or more devices), firmware (one or more devices), software (one or more modules), or combinations thereof. For a firmware or software, implementation can be through modules (e.g., procedures, functions, and so on) that perform the functions described herein. The software codes may be stored in any suitable, processor/computer-readable data storage medium(s) or memory unit(s) and executed by one or more processors. The data storage medium or the memory unit may be implemented within the processor or external to the processor, in which case it can be communicatively coupled to the processor via various means as is known in the art. Additionally, components of systems described herein may be rearranged and/or complimented by additional components in order to facilitate achieving the various aspects, goals, advantages, etc., described with regard thereto, and are not limited to the precise configurations set forth in a given figure, as will be appreciated by one skilled in the art.
The description and the related figures are only intended to illustrate the principles of the present invention by means of examples. Various alternative embodiments, variations and changes are obvious to a person skilled in the art on the basis of this description. The present invention is not intended to be limited to the examples described herein but the invention may vary within the scope and spirit of the appended claims.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/FI2011/051112 | 12/15/2011 | WO | 00 | 6/10/2014 |