The invention relates to a method and arrangement for estimating disturbances. The invention relates also to a method and arrangement for giving recommendations for improving process performance. The process is water intensive industrial process, for example, a papermaking process, a board machine, a waste water treatment etc.
Many industrial processes like a paper/board machine, water treatment process etc. are complicated entities. They contain many sub processes which together form a whole entity, like a paper machine. The sub processes are controlled by process specific controllers. On the other hand, the variables may affect to each other in several individual control arrangements. Therefore, different systems have been created in order to better understand the whole process.
Machine learning (ML) algorithms are used with systems, which model, analyze and estimate behavior of a process like a paper making or a water treatment. The ML algorithms are used with processes having multivariable processes, so a great number of measurements are taken. Huge amounts of data are generated and processed especially when measurements are taken online and every day.
Machine learning provides systems with an ability to automatically learn and also to improve from experience without being explicitly programmed. Thus, machine learning (ML) utilities algorithms and statistical models that computer systems use to perform a specific task or tasks without using explicit instructions. There exist several ML algorithms. Here only some of them are mentioned: linear regression, logistic regression, K-means, feed-forward neural networks etc.
The outcomes of the ML algorithms are usually difficult to interpret, especially from complex processes. Therefore, explanation values are used to help user to interpret the outcomes of the ML. Thus, explanations values are used for explaining and also for classifying how the ML works. The explanation values are obtained by using, for example, SHAP (Shapley additive explanations) values, LIME method or DeepLIFT method.
Although the monitoring systems utilize measurements and ML values, the monitoring could utilize other data as well, and in an automatic way. The monitored info can be used for recommendations to adjust certain sub-process/sub-processes.
The object of the invention is to provide a new way to estimate disturbances and give recommendations. The object is achieved by means of independent claims. The dependent claims illustrate different embodiments of the invention.
An inventive method for estimating disturbances and giving recommendations for improving process performance has steps for measurement variables of a process and collecting process data 50, and pre-processing measurement data and collected data of the measuring and collecting step 51. The method further comprises steps for estimating disturbances 52, and forming recommendations 53.
For each disturbance estimation of a parameter of the process, the step of estimating 52 the disturbances comprise the sub steps of receiving, normalizing, operating and scaling. The receiving step 60 is for receiving the pre-processed measuring and process data from a pre-selected group of the variables of the process. The normalization step 61 is for normalizing the received pre-processed measuring and collected data. The operation step 62 is for operating the normalized data. The scaling step 63 is for scaling the operated normalized data. An output 64 of the scaling step is the disturbance estimation of the parameter of the process.
For each recommendation forming the step of forming 53 the recommendations comprise sub steps of receiving, mapping and forming. The receiving step 70 is for receiving the disturbance estimations from a pre-selected group of the outputs of the scaling step. The mapping step 71 is for mapping each received disturbance estimation to one of status categories. The forming step 72 is for forming each recommendation utilizing the mapped disturbance estimations.
An inventive arrangement estimates disturbances and gives recommendations for process performance. The arrangement has measurement devices and collecting interfaces 4 to measure variables of a process and to collect/receive process data. The arrangement also has a measurement and collected data pre-processing arrangement 5 to pre-process the measurement data and the collected data from the measuring devices and collecting interfaces.
The arrangement further comprises a first unit 8 to estimate disturbances, and a second unit 9 to form recommendations. The first unit 8 estimates the disturbances for each disturbance estimation of a parameter of the process, and it is arranged to receive the pre-processed measuring/collected data from a pre-selected group of the variables of the process, normalize the received pre-processed measuring data, operate the normalized data, and scale the operated normalized data. An output of the scaling is the disturbance estimation of the parameter of the process.
The second unit 9, for each recommendation forming, is arranged to receive the disturbance estimations from a pre-selected group of the outputs of the first unit, map each received disturbance estimation to one of status categories, and form each recommendation utilizing the mapped disturbance estimations. The same status value can be mapped to form more than one recommendation.
In the following, the invention is described in more detail by reference to the enclosed drawings, where
The arrangement has measurement devices and data collecting interfaces 4 to measure variables of a process and to collect data, and a pre-processing arrangement 5 to pre-process measurement/collected data from the measuring devices and the collecting interfaces. The collecting/receiving interfaces are for example data line connections (wire connections or wireless connections) to a central managing unit or managing units of the process having configuration data of the process. Therefore, the collected data comprises the configuration data of the process (like a final surface layer of the paper to be produced, a grade of the paper to be produced) and other possible process data.
The measurements can be directly used by a controller 3 (or controllers), which drives an actuator/s 4 in order to control a sub-process or sub-processes of the whole process. The measurement devices and collecting interfaces 4, the controllers 3 and the actuators 2 are local devices within the process. The data pre-processing arrangement 5 can be local or a cloud service. It is used to: clean unreliable values from the measurements, merge data and calculate statistical values, calculating new variables based on measurements and/or variables of process data by using mathematical formulas/equations and so on. The calculated new variable may, for example, relate to amounts of raw materials, chemical efficiency, cost efficiency etc. The unreliable values are caused by sensor faults, noise etc.
The arrangement further comprises a first unit 8 to estimate disturbances, and a second unit 9 to form recommendations. The first unit 8 estimates the disturbances for each disturbance estimation of a parameter of the process, and it is arranged to receive the pre-processed measuring and process data from a pre-selected group of the variables of the process, normalize the received pre-processed measuring/process data, operate the normalized data, and scale the operated normalized data. An output of the scaling is the disturbance estimation of the parameter of the process.
The second unit 9, for each recommendation forming, is arranged to receive the disturbance estimations from a pre-selected group of the outputs of the first unit, map each received disturbance estimation to one of status categories and form each recommendation utilizing the mapped disturbance estimations.
The inventive arrangement may also comprise a third unit 6 to form machine learning values from the pre-processed measurement and process (collected) data. The machine learning values are outcomes of the ML algorithms. The ML algorithms and their learning are known as such. Further, the inventive arrangement may also comprise a fourth unit 7 to form explanation values from the machine learning values of the third unit. These units are discussed more below.
As said, the first unit 8 estimates the disturbances for each disturbance estimation of a parameter of the process. For example, in case of a board machine a parameter can be the level of a particle retention in the wet end, from which the disturbance estimation is made. Other parameter of the board machine can be a detrimental contaminant level in a process stream, the level of particle agglomeration in another process stream, energy balances, accumulated errors, material costs, etc. The parameter may also be a calculated parameter (calculated in the pre-processing unit 5). As can be noted, the parameters relate to the process in question. In
The first unit 8 is arranged to receive the pre-processed process and measuring data 20, 21, 22 from a pre-selected group of the variables of the process. The group of the variables relates to the parameter whose disturbance content is estimated. Knowledge from the process have been utilized when the pre-selection was made. Therefore, the group of the variables is usually different for each disturbance estimation.
The received pre-processed process and measuring data is normalized 23, 24, 25 individually for each variable specific process and measuring data 20, 21, 22. The normalization converts the input min-max range into another range, like 0 to 1, −1 to 1, or −2 to 2 etc, which is more convenient to use and to overcome skewness in data distribution among different variable specific measuring data. The normalization functions are specific for each received data or value.
The normalized data 26, 27, 28 is operated 29 by using an operator, like a sum, median, average, or min/max operation. The operation function may have one or more operators in order to form an output 201, i.e., an operated normalized data. The output 201 of the operation function is scaled 202 to a more suitable range in order to have an estimation which is convenient to use and easier to understand by users for possible examinations of the estimations. The scaled range, which forms the output of the scaling can, for example, be −100 to 100 or 0 to 100 etc. The output 203 of the scaling is the specific disturbance estimation of the specific parameter of the process. So, each disturbance estimation is parameter specific. The outputs, like 203, are named as status A, status B . . . status X in this description. The scaling is individual for each disturbance estimation.
As said above the disturbance estimation comprises a desired value (indicating a situation wherein there is no disturbance) and values deviating from the desired value. A disturbance relates to some parameter of the process, obtained from the measurements, process data, calculated parameters. Further, the disturbance estimation can for example be a cost function (reducing/elimination of an error), for example to energy balance, production volume, raw material cost, etc. In other words, the said operator can also be a function.
A disturbance and its estimation can describe a specific process related disturbance (chemical, mechanical, physical, microbiological) that has an impact on process performance or final product quality. It can also be said that a disturbance estimation can be an indicator which indicates the status of a certain part of process or chemistry. In other words, a disturbance can be an indicator which indicates the status of a chemical, chemistry, microbiological, mechanical or physical status of whole process or subprocess or a certain part of a process or sub process or a process stream. A disturbance estimation can be said to be a calculated performance index.
Examples of disturbances are COD load of fresh/raw water, COD load of waste water stream(s), the use or quality of a process stream, surface level of a storage tower/tank/silo, delay time in a tower or in a specific process part (too low or high), anionic trash/contaminants in a process stream, temperature (too low or high), hydrophobic contaminants, chemical inefficiency, detrimental compounds e.g. white pitch, wood pitch, extractives.
As a further example a disturbance can be hydrophobic contaminants in the wet end of a paper or board machine: high amount of hydrophobic contaminants in the wet end can cause deposits and further runnability problems like breaks in the production and low quality of end product (like defects, e.g. different kinds of spots and holes in the paper or board).
As a further example a disturbance can be chemical residual(s) or change in redox potential or pH in a process stream or in process stream in a storage tower/tank. This kind of disturbances may cause changes in the microbiological conditions in a process, such as bacterial endospore formation. Bacterial endospores in the process may e.g. cause low quality of end product (such as too high endospore content, e.g. in food packaging board). This may especially be case e.g. in when the process steam is an aqueous stream comprises natural fibers, such as fibrous suspensions or pulp suspensions.
So, the predefined variables for each predefined disturbance are the set of measurement(s), calculated measurement(s) or collected/received process data that correlates to the predefined disturbance or describe the predefined disturbance.
For example, a predefined process disturbance can be too high amount of wood extractives (=hydrophobic contaminants) in pulp A in pulping making process. Following measurements and a calculated variable are used for the calculation of the status of this disturbance: the used raw material, calculated washing efficiency of pulp A and pH of pulp A. The washing efficiency, pH and the raw material (e.g. softwood, hardwood) correlate to the amount of wood extractives in pulp A. The correlation can be found by traditional correlation analysis (using historical data, e.g. 6-12 months time period).
The grouping of input variables/parameters (measurements, process data, ML prediction, performance value, etc.) depends on the individual parameter's correlation with a specific disturbance.
The mapping function 33 maps each received disturbance estimation to one of status categories, and forms a recommendation 35 utilizing the mapped disturbance estimations. Threshold values can be used when mapping the disturbance estimations i.e. status A, status B, . . . status X into the status categories. For example, if the disturbance estimation range/status range is 0 to 100, an OK status category can be when the disturbance estimation is <50, and a warning status category can be for the estimations between 50-70, and an alerting status category for the estimations of >70. The threshold values relate to each individual recommendation. Categories can be also numeric values that describes the severity of the disturbance, in other words indicating how huge effect the disturbance has on process performance or product quality or any other target.
The forming of the recommendation can use a mapping logic having rules for different recommendations. The recommendation may relate to a dosing amount, like keeping the current dosing, decreasing the dosing, or increasing the dosing. The mapping function 33 may contain mapping parameters (thresholds and dosing parameters) or it is connected 36 to a library 34 of the mapping parameters.
The arrangement may comprise the third unit 6 to form machine learning values from the pre-processed measurement and process data, which machine learning values are also used with the pre-processed measuring and process data in the first unit 8. The machine learning values are outcomes of the ML algorithms. So, the first unit is arranged to also receive the machine learning values from a pre-selected group of the machine learning values, to also normalize the received machine learning values, to also operate the normalized machine learning values, and to also scale the operated normalized machine learning values. The output of the scaling is the disturbance estimation of the parameter of the process.
In order to understand the process the machine learning (ML) values are formed. The machine learning is used for extracting information and patterns in large datasets. The matching learning algorithms are usually based on statistical models, which a computer can use to perform a certain task without having exact instructions but relies instead on recognizing patterns. The recognized patterns can be obtained by building a mathematical model based on a training dataset. Simulations and pattern recognition can be made by feeding new data to the mathematical model.
The arrangement may also comprise the fourth unit 7 to form explanation values from the machine learning values of the third unit. The explanation values are also used with the pre-processed measuring and process data and the machine learning values in the first unit. So, the first unit 8 is arranged to also receive the explanation values from a pre-selected group of the explanation values, to also normalize the received explanation values, to also operate the normalized explanation values, and to also scale the operated normalized explanation values. The output of the scaling is the disturbance estimation of the parameter of the process.
Because it is hard to see what is going on in the process from the output (predictions/simulations) of ML, the explanation values, like SHAP values, are used to track how ML predictions link back to the input variables. For each prediction a rating number is calculated for each input variable indicating how the variable is contributing to the final predictions. These rating numbers can be seen as explanation values indicating the significance of an input value at a given point in time.
The explanation values of machine learning are, for example, SHAP values, values from a LIME method, values from a DeepLIFT method or any other possible explanation values. The LIME method interprets individual model predictions, which are based on locally approximation the model around a given prediction. LIME refers to simplified inputs x as interpretable inputs. The mapping x=hx(x) converts a binary vector of interpretable inputs into the original input space. Different types of hx mappings are used for different input spaces.
DeepLIFT is a recursive prediction explanation method. It attributes to each input xi a value CAxiAy that represents the effect of that input being set to a reference value as opposed to its original value. It means that DeepLIFT mapping x=hx(x) converts binary values into the original inputs, where 1 indicates that an input takes its original value, and 0 indicates that it takes the reference value. The reference value represents a typical uninformative background value for the feature.
The SHAP (SHapley Additive explanation) explanation values attribute to each feature the change in the expected model prediction when conditioning on that feature. The values explain how to get from a base value an expectation E[f(z)] that is going to be predicted if we did not know any features to the current output f(x). The order how features are added in the expectations matters. However, this is taken into account in SHAP values.
The estimation step 52 is illustrated in more detail in
The recommendation forming step 53 is illustrated in more detail in
Further, the normalization step may comprise normalization functions, which are specific for each received data or value. The operation step may comprise one or more operations. The operation is a sum, median, average, min/max operation etc. The scaling of the scaling step is individual for each disturbance estimation. The formed recommendations are used for adjusting setpoints of different control arrangements of the process and/or for changing raw material of the process, to adjust one or more upstream process steps (e.g. raw material handling and processing). The recommendations can also be overall targets for energy balance, production volume, raw material cost, etc. in order to minimize disturbances, minimize cost, minimize energy, enhance production targets, stabilize operational processes, etc. Changing raw material in a process may be e.g. changing a contaminated raw material from to a less or non-contaminated raw material. An example is a fibrous suspension (e.g. pulp suspension) which is microbiologically contaminated to an unacceptable level.
The formed recommendations can be used for adjusting setpoints of chemical dosing (e.g. retention chemicals, sizing agents, deposit control chemicals, charge control chemicals, strength chemicals, defoamers, dispersing agents, biocides, coagulants, flocculants). Further, the recommendations can be used for adjusting setpoints for tower levels/tower filling/emptying, for adjusting the amount of dilution water to pulp washers, for improving washing efficiency of pulp(s), for adjusting pH value of a process stream(s), mentioning some examples.
The formed recommendations can be used for adjusting setpoints for dosing of chemicals, such as retention chemicals, sizing agents, deposit control chemicals, charge control chemicals, strength chemicals, defoamers, dispersing agents, biocides, coagulants, flocculants; for tower levels/tower filling/emptying, for adjusting the amount of dilution water to pulp washers, for improving washing efficiency of pulp(s), for adjusting pH value of a process stream(s), for delay times in storage towers, surface level(s) in storage towers, or aeration, circulation or mixing of a process stream in in storage towers, e.g storage towers of fibrous suspensions.
The invention provides a generic method and arrangement for estimating disturbances and giving recommendations for a water intensive industrial process having several sub-processes and a large number of variables, which affect to each other. The arrangement and method are taught before it is used. Expert knowledge and process knowledge can be used during the teaching phase. E.g. expert knowledge or process knowledge obtained by correlation analysis or iteration. The above said pre-selections and determinations of the threshold values, normalization selections, operator selections, scaling selections etc. are made also during the teaching phase. One possible way is the correlation analysis based on historical data. The idea is to find out the process disturbances that causes problems in process performance (e.g. breaks in papermaking production, insufficient solid-liquid separation in waters to be treated) and/or weakens the product quality (e.g. weak strength, the number of defects increases). Domain and application experts have also a lot of tacit information, which can be collected by interviewing. The needed information can be also collected by doing tests in the lab or real process (e.g. by changing chemical dosages (the amount, dosing points, type). The teaching comprises also iteration steps.
When teaching the process disturbances, which have an effect to a certain target, are identified and selected. The target is, for example, maintaining a good process condition, improve product quality, etc. The identifying and selection of disturbances can be done by domain and application experts, visual inspection of data, calculations (like correlation calculations), ML performance values and other statistical calculations. A group of the variable of the process is selected (pre-selection) for each disturbance.
The grouping of the variables (measurements process data, ML prediction, performance value, etc.) depends on the individual variable's correlation with a specific disturbance. Thus the group of variables for each predefined disturbances are the set of measurement(s) or calculated measurement(s)/variables(s) that correlates to the predefined disturbance or describe the predefined disturbance. As said the selection of variables for each disturbance can be based on expert knowledge, process knowledge and/or analysis of historical data (e.g. correlation analysis, statistical analysis).
Regarding the normalization (61) and operation (62) steps, the parameters for normalization can be for example minimum and maximum values of variables or 25% and 75% quartiles from historical data (e.g. minimum value corresponds to 0 and maximum value correspond to 1). The selection of mathematical operator for operation step 62 is based on the causal dependences/relationship of a disturbance and preselected variables (based on process expert(s) knowledge and/or analysis of historical data).
Regarding scaling step 63, the parameters of scaling function is based on the causal dependences/relationship of a disturbance and preselected variables (based on process expert(s) knowledge and/or analysis of historical data).
When providing pre-selected groups of the outputs of the scaling steps, correlation information is used. Thus, for each recommendation a pre-selected set of inputs are identified based on their correlation with the recommendation. The correlation information may contain correlation calculations, ML performance values and other statistical calculations. The knowledge for providing recommendations is based on historical data/information on process performance.
The method and the arrangement can be mainly provided as a cloud service and on line, but it can also be a local method and arrangement within the process. So, said units can be provided in a server/servers in the could or locally. In more detail, the units can be realized as circuit boards, software or their combination, or computers. It also clear that the said library/libraries is a database/memory. As described above, the invention provides a new method and arrangement to estimate disturbances and give recommendations. The idea is to use said pre-selected groups of variables (inputs) and pre-selected groups of disturbance estimates in several phases.
By using combination (group) of the selected disturbances, a recommendation that optimizes the effect of each disturbance in the group, can be provided.
Partly because of the groupings, the method and arrangement according to the invention is providing more reliable recommendations. Using the method and arrangement according to the invention, the process becomes more stable, i.e. it is possible to mitigate efficiently the disturbances before they cause larger disturbances to the process.
The pre-selection of the variables of a process/process data to a group may be done by expert knowledge or process knowledge, e.g. by finding variables that correlate with a certain process variable. The pre-selection the group of the disturbance estimations of the parameters of the process may be done so that they form input to a recommendation. In other words, what group of disturbance estimations can be affected with a recommendation.
Further, using combination of recommendations can be used to further stabilize the overall process by mitigating combined set of disturbances.
This is especially the case when the water intensive process has several hundreds or even more than one thousand inputs into the arrangement for estimating disturbances and for giving recommendations. This kind of processes are a pulp making process, papermaking process, tissue making process, board making process, and waste water treatment process and raw water treatment process. This kind of processes are specifically a pulp making process, papermaking process, board making process, and waste water treatment process.
When using prior art methods or arrangements, the obtained recommendations would be less accurate or unreliable.
Further, it is worth to mention that the inputs for said method and arrangement according to the invention can be measurement/process data, ML values, and/or explanation values.
The process is a water intensive industrial process, such as a pulp making process, papermaking process, board making process, tissue making process, paper machine, pulp mill, tissue machine, board machine, water treatment process, waste water treatment process, raw water treatment process, water re-use process, any industrial water treatment process, municipal water, municipal waste water treatment process, sludge treatment process, mining process, oil recovery process or any other water intensive industrial process. The process may be e.g. a pulp making process, papermaking process, tissue making process, board making process, and waste water treatment process and raw water treatment process. Examples of processes are also a pulp making process, papermaking process, board making process, and waste water treatment process.
It is evident from the above that the invention is not limited to the embodiments described in this text but can be implemented utilizing many other different embodiments within the scope of the independent claims.
Number | Date | Country | Kind |
---|---|---|---|
20215456 | Apr 2021 | FI | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/FI2022/050252 | 4/14/2022 | WO |