The present application is related to and claims the priority benefit of German Patent Application No. 10 2022 117 436.0, filed on Jul. 13, 2022, the entire contents of which are incorporated herein by reference.
The present disclosure concerns an outlier detection method, in particular a computer implemented outlier detection method, of detecting outliers in measured values of a measurand, and a method of determining and providing a measurement result of a measurand including the outlier detection method.
Measured values of measurands of interest are determined and subsequently employed for various purposes in a large variety of different applications including industrial applications, as well as laboratory applications. In many applications, measured values of a measurand are, e.g., determined and provided by a measurement device measuring the measurand and subsequently employed to monitor, to regulate and/or to control the measurand, an operation of a plant or facility, e.g., a production facility, and/or at least one step of a process, e.g., a production process, performed at the application. For example, in a chemical production process, concentrations of reactants used in the production process and/or the concentration of analytes contained in pre-products, intermediate products and/or educts produced by the process can be monitored and a sequence of process steps of the production process can be scheduled, regulated and/or controlled based on measured values of the measurands. As an example, liquid analysis measurement devices measuring measurands, such as a pH-value, a concentration of free chlorine and/or a turbidity of a medium, are, e.g., employed in swimming pools, as well as in drinking water supply networks and water purification plants to monitor, to regulate and/or to control the quality of the water.
Depending on the specific application, an efficiency and/or a productivity of a production process, a product quality of products produced, the safety of operation of facilities, industrial plants and/or laboratories and/or the quality of drinking water may depend on the accuracy and the reliability of the measured values.
Even when highly accurate and reliable measurement devices are employed to determine the measured values, there always remains the problem, that the time series of measured values may include outliers, which significantly deviate from the true value of the measurand at the time. Outliers can occur due to a multitude of root causes associated with the application and/or the measurement device determining the measured values. Examples of root causes include disturbances occurring at a measurement site, where the measurand is determined, disturbances of a process performed at the application, where the measurand is determined, as a well as adverse measurement conditions a measurement device determining the measured values is exposed to.
When outliers remain unnoticed, there is a risk, that wrong decisions may be made, and/or unsuitable actions may be performed based on outliers included in the measured values. This risk is high in applications, where monitoring, regulating and/or controlling is performed based on measured values in a semi- or fully automated manner. As an example, when a valve on a supply pipe is closed due to an outlier indicating a high level of a medium inside a container, even though the true level is low, this may impair the quality of a product produced in the container and/or may even constitute a safety hazard.
In consequence, there is a need to detect outliers included measured values to prevent them from being employed any further. Outlier detection has been widely discussed in literature, but outlier detection methods capable of detecting outliers in real time are rare. Another problem is that these methods regularly operate based on parameters. To enable an accurate and reliable detection of outliers the determination of these parameters normally requires an expert analysis of the properties of the measured values, in particular of the time dependency of the measured values, and of the properties of noise included in the measured values followed by a manual adjustment of the parameters. The properties of the measured values and the noise are normally not known upfront. This makes an accurate determination of the required parameters a demanding, time and cost intensive process.
It is an object of the present disclosure to provide an outlier detection method capable of detecting outliers included in time series of measured values of a measurand, that enables outliers to be detected in real time without requiring an expert analysis or prior knowledge about the properties of the measured values and/or the noise included in them.
This object is achieved by an outlier detection method, in particular a computer implemented outlier detection method, of detecting outliers in measured values of a measurand comprising the steps of: continuously or repeatedly recording data including measured values of the measurand and their time of determination, determining filtered values of the measured values by filtering the measured values, and, based on training data included in the recorded data, determining a combined distribution of differences between individual measured values and the filtered value of the measured value preceding the respective individual measured value to be expected in the specific application where the outlier detection method is applied. The method includes determining a difference distribution of first differences of the filtered values, based on the filtered values of the measured values included in the training data, determining a noise distribution of noise included in the measured values, and, based on the noise distribution and the difference distribution determining the combined distribution.
d) identifying outliers by for at least one, several or each new measured value performing the steps of:
e) providing a detection result by performing at least one of: indicating each new measured value that has been identified as an outlier, issuing a warning when an outlier has been identified, and issuing a notification or an alarm when a predetermined number of consecutively determined new measured values has been identified as outliers.
The present disclosure provides the advantage, that the determination of the combined distribution is performed in an autonomous entirely data driven manner, that neither requires an expert analysis of the data nor any prior knowledge of the properties of the measured values and the properties of the noise. Thus, it is neither based on any assumptions, parameters or other inputs that may turn out not be valid for the specific application, where the method is employed. Based on the empirically determined combined distribution, the method enables for outliers to be detected in real time with a high accuracy and reliability, and in a manner truly accounting for the properties of the measured values and the noise in the specific application, where the outlier detection method is used. Another advantage is, that the difference distribution of the first differences and the noise distribution can be of any kind. Thus, neither the difference distribution nor the noise distribution must be compliant to predetermined requirements. This enables for the method to be universally employed regardless of the properties of these distributions. As an example, employment of the outlier detection method neither requires for the distributions to be Gaussian, nor to be symmetric, nor to be stationary, nor to be compliant to any other requirement.
According to a first embodiment, the noise distribution is determined:
A second embodiment further comprises the steps of:
According to a third embodiment, the method step of filtering the measured values comprises:
According to an embodiment of the third embodiment, each iteration includes a method step of determining the decay of the fractal dimensions:
According to an embodiment of the method according the second and the third embodiment, the parametrization of the filter is updated when the combined distribution is updated.
According to a fourth embodiment the identification of outliers is performed in real time, and/or the training data is unlabeled data and/or includes a predetermined number of measured values and/or measured values that have been measured during an initial and/or a predetermined training time interval or an arbitrarily selected time interval of a predetermined duration.
The present disclosure further includes a method of using the outlier detection method in a method of determining and providing a measurement result of a measurand comprising the steps of:
According to certain embodiments of the method of using the outlier detection method:
In certain embodiments the method of using the outlier detection method further comprises at least one of the steps of:
performing the method of determining and providing the measurement result of the measurand for two or more measurands,
The present disclosure further includes a measurement device configured to perform the method of determining and providing a measurement result, comprising:
The present disclosure further includes a measurement system configured to perform the method of determining and providing a measurement result for at least one measurand, the measurement system comprising:
In certain embodiments of the measurement system:
The present disclosure further includes a computer program comprising instructions which, when the program is executed by a computer, cause the computer to carry out the outlier detection method, or the method of determining and providing a measurement result for at least one measurand including the outlier detection method based on the measured values provided to the computer.
The present disclosure further includes a computer program product comprising this computer program and at least one computer readable medium, wherein at least the computer program is stored on the computer readable medium.
The present disclosure and further advantages are explained in more detail below based on the example shown in the figures of the drawing, wherein:
The present disclosure concerns an outlier detection method, in particular a computer implemented outlier detection method, of detecting outliers in measured values my of a measurand m, as well as a method of determining and providing a measurement result MR of the measurand m using the outlier detection method.
The measurement device MD can be any device configured to determine the measurand m. In this respect, the measurement device MD is e.g. embodied in form of physical device installed at a measurement site repeatedly or continuously measuring the measurand m and determining and providing the corresponding measured values my. As an alternative, the measurement device MD may e.g. be embodied in form of a virtual or computer implemented device, e.g. in form of a soft sensor, repeatedly or continuously determining and providing measured values my of the measurand m based on data provided to the device.
The measurand m is e.g. a level, a pressure, a temperature, a density, a conductivity, a flow, a pH-value, a turbidity, or a spectral absorption of a medium, a concentration of an analyte comprised in a medium or another type of determinable variable. In certain embodiments, the measurand m is e.g. given by a variable of interest in a specific application, where the measurement device MD is employed, e.g. a process parameter related to a process performed at the measurement site and/or a property of a medium produced, processed and/or monitored at the measurement site. Examples of applications include industrial applications, e.g. production plants, chemical plants, water treatment or purification plants, as well as laboratory applications. Further examples include applications, wherein measurements are performed in a natural environment, as well as applications in medical diagnostics, e.g. applications wherein in-situ, in-vitro or in-vivo measurements are performed.
Even though the outlier detection method is described herein in context with the determination of measurement results MR, the field of use of the outlier detection method is not limited to this type of use. The outlier detection method can be employed in the same way in a multitude of other fields to detect outliers in time series of measured value my of a multitude of different types of measurands m. In this respect the term measurand m is used in a very broad sense to denominate a variable exhibiting variable values that are not completely random, and wherein at least some kind of dependency or relation between present and past variable values of the variable exists. This is e.g. the case when the variable values exhibit at least a certain level of (linear or non-linear) autoregression. As an example, signals exhibiting and/or representing a physical property evolving over time are, despite possible abrupt changes that may occur, are showing an autoregressive behavior. Regardless of the application, the outlier detection method is performed in the same way as described in more detail below based on the corresponding time series of measured values my and their time of determination t.
Regardless of the application and/or the field of use, the outlier detection is performed based on the combined distribution PDF(Δmf) representing the application-specific distribution of the differences Δmf between individual measured values mvi and the filtered value fyi−1 of the preceding measured value mvi−1 preceding the respective individual measured value mvi to be expected in the specific application where the outlier detection method is applied.
As mentioned above, the combined distribution PDF(Δmf) is determined based on training data included in the data D. The training data is e.g. unlabeled data and/or e.g. includes a predetermined number of measured values my and/or measured values my, that have been determined, e.g. measured, during an initial and/or a predetermined training time interval or during an arbitrarily selected time interval, e.g. a time interval of a predetermined duration.
To illustrate the outlier detection method,
Determining the combined distribution PDF(Δmf) includes based on the filtered values fv of the measured values my included in the training data determining a difference distribution PDF(Δfv) of the first differences Δfv of the filtered values fv. As shown in
Next, in method step 320, the difference distribution PDF(Δfv) is determined based on the first differences Δfv. This is illustrated in
Assuming, that the filtered values fv constitute a good approximation of the true value of the measurand m, the difference distribution PDF(Δfv) represents the distribution of the changes of the true value of the measurand m to be expected in the specific application, where the method is applied.
In applications, where the measured values my are determined at a constant rate, the time differences Δti:=ti−ti−1, between consecutively determined measured values mvi−1, mvi, and thus also between consecutive filtered values fvi−1, fvi are given by a constant time unit Δti:=Δt. In this case, the difference distribution PDF(Δfv) represents the distribution of first differences Δfv to be expected to occur in one time unit Δt. The method is not limited to applications, where the measured values my are determined at a constant rate. It can be performed in the same way in applications, where the time differences Δti:=ti−ti−1 between consecutively determined measured values mvi−1, mvi vary provided that the properties of the distribution of the time differences remains approximately constant throughout the performance of the method. In this case, the first differences Δfv of the filtered values fv determined in method step 310 include the first differences Δfv that occurred during each of the different time differences Δti between the measured values my included in the training data. Correspondingly, the resulting difference distribution PDF(Δfv) represents the distribution of first differences Δfv to be expected between consecutive filtered values fvi−1, fvi when the time differences Δti between the consecutively measured values mvi−1, mvi are compliant to the approximately constant distribution of time differences.
Method step 300 of determining the combined distribution PDF(Δmf) further includes a method step 330 of determining a noise distribution PDF(N) of noise included in the measured values my. In this respect different methods of determining the noise distribution PDF(N) can be employed.
As an example, available in applications, where the measured values my are determined and provided by a measurement device MD, the noise distribution PDF(N) is e.g. determined based on a measurement uncertainty inherent to the measurement device MD. The measurement uncertainty of measurement devices MD is usually specified by the manufacturer of the device and thus constitutes readily available information. Based on the measurement uncertainty, the noise distribution PDF(N) is e.g. determined in form of a gaussian distribution having a standard deviation corresponding to the size of the standard measurement uncertainty of the measurement device MD. This type of determination provides the advantage, that it requires very little calculating power and is very well suited for applications, where the measurement device MD is exposed to favorable measurement conditions.
As another example, the noise distribution PDF(N) is e.g. determined based on the measured values my included in the training data and the corresponding filtered values fv obtained by filtering the measured values my. In this case, determining the noise distribution PDF(N) is e.g. performed by determining residues r between the measured values my and the corresponding filtered values fv, e.g. as ri:=mvi−fvi, followed by determining the noise distribution PDF(N) shown in
As another example, the noise distribution PDF(N) is e.g. determined in form of a combined noise distribution determined based on the distribution of the residues r and the measurement uncertainty inherent to the measurement device MD determining the measurand m. This provides the advantage, that a minimum noise caused by the measurement uncertainty is always accounted for. This is advantageous in applications where temporary noise reductions may occur, which may affect the training data. Accounting for a minimum noise due to the measurement uncertainty makes the outlier detection method more robust and reduces the number of false outlier detections, such as when the noise level rises following a temporary noise reduction.
In this embodiment, the noise distribution PDF(N) is e.g. determined based on the distribution of residues r between the measured values my included in the training data and the corresponding filtered values fv such, that the noise distribution PDF(N) represents a probability of occurrence of noise as a function of a noise amplitude, wherein for each noise amplitude covered by the noise distribution PDF(N) the probability of occurrence is larger or equal to a probability of occurrence of the respective noise amplitude due to the measurement uncertainty inherent to the measurement device MD.
Method step 300 of determining the combined distribution PDF(Δmf) further includes a method step 340 of based on the noise distribution PDF(N) and the difference distribution PDF(Δfv) determining the combined distribution PDF(Δmf) such, that it represents the distribution of differences Δmf between individual measured values mvi and the filtered value fvi−1 of the preceding measured value mvi−1 preceding the respective individual measured value mvi to be expected in the specific application due to the difference distribution PDF(Δfv) and the noise distribution PDF(N).
This is easily possible because by filtering of the measured values my performed in method step 200 a separation between the noise included measured values my and the filtered values fv constituting a good approximation of the true value of the measurand m is attained. Thus, each measured value my can be considered as the sum of the corresponding filtered value fv and a noise additive superimposed on the filtered value fv. Correspondingly, the difference between an individual measured value mvi and the filtered value fvi−1 of the preceding measured value mvi−1 can be interpreted as a sum of a first component and a second component. The first component corresponds to a first difference between two consecutive filtered values fv belonging to the difference distribution PDF(Δmf). The second component corresponds to a noise additive belonging to the noise distribution PDF(N). Thus, the combined distribution PDF(Δmf) is e.g. determined as or based on a convolution of the noise distribution PDF(N) and the difference distribution PDF(Δmf). Alternatively, the combined distribution is e.g. determined by Monte-Carlo simulations performed based on the noise distribution PDF(N) and the difference distribution PDF(Δmf).
This is illustrated in
Following the determination of the combined distribution PDF(Δmf) the method step 400 of identifying outliers included in the measured values my and of providing a corresponding outlier detection result DR is performed. As illustrated in
The new measured values mvj, are e.g. given by newly recorded measured values my, e.g. by new incoming measured values my and/or measured values my that have only just been provided by the same source as the training data.
As shown in method step 410, determining whether the respective new measured value mvj constitutes an outlier includes a method step 411 of determining the difference Δmfj:=mvj−fvj−1 between the respective new measured value mvj and the filtered value fvj−1 of the preceding
measured value mvj−1. Following this, it includes a method step 412 of determining a probability of occurrence P(Δmfj) of this difference Δmfj between the respective new measured value mvj and the filtered value fvj−1 of the preceding measured value mvj−1 according to the combined distribution PDF(Δmf).
This is illustrated in
In
value mvj−1 shown on the left-hand side of
The probability of occurrence P(Δmfj) of the difference Δmij between the respective new measured value mvj and the filtered value fvj−1 of the preceding measured value mvj−1 is e.g. determined as a probability of occurrence a difference Δmf of the size of the difference Δmfj between the respective new measured value mvj and the filtered value fvj−1 of the preceding measured value mvj−1 according to the combined distribution PDF(Δmf). As an example, the probability of occurrence P(Δmfj) is e.g. determined to be given by the minimum of a first probability P1 given by:
and a second probability P2 given by:
wherein C(x) is the combined distribution PDF(Δmf), wherein x is the difference between a measured value mvi and the filtered value fvi-1 of the preceding measured value mvi-1, and wherein the probability of occurrence P(Δmfj) is given by P(Δmfj):=min([P1, P2]).
Following this, in method step 413 the probability of occurrence P(Δmfj) the difference Δmfj determined for the respective new measured value mvj is compared to a predetermined level of confidence Pref and the respective new measured value mvj is identified as an outlier when the probability of occurrence P(Ain) is lower than the predetermined level of confidence Pref.
Based on the at least once, repeatedly or continuously performed outlier identification, the corresponding detection result DR is preferably determined and provided in a form that best suits the needs of the application, where the method is employed. To this extent providing the detection result DR e.g. includes indicating each new measured value mvj that has been identified as an outlier. This is advantageous in applications, wherein regulation and/or control of the measurand m, a process performed at the application and/or operation of a facility is performed in real time on the basis of the latest measured value(s) my, as well as in applications wherein decisions are made and/or actions are taken in real time based on the most recent measured value(s) my of the measurand.
In addition, or as an alternative providing the detection result DR e.g. includes issuing a wanting when an outlier has been identified and/or issuing a notification or an alarm when a predetermined number of consecutively determined new measured values mvj has been identified as outliers. This is advantageous in applications, wherein events may occur, that lead to an unexpectedly large and/or rather sudden change of the measurand m and/or the measured values my. Examples include events given by impairments of a process performed at the application, an impaired operation of a facility, as well as impairments of the measurement device MD determining the measured values my. In this case, the occurrence of the predetermined number of consecutively determined new measured values mvj that have been identified as outliers is an indicator that such an event has occurred, and this information is provided in form of the alarm or the corresponding notification. Thus, the corresponding notification or alarm enables to differentiate between individual outliers, that may e.g. be safely ignored or discarded, and the occurrence of a real event, that may require attention and/or actions to be taken. The information enabling or providing this differentiation is e.g. provided to a user by providing the corresponding detection result DR. In this context the user of the detection result DR is e.g. a person or a machine, e.g. a superordinate unit, a process automation system, or a programmable logical controller receiving the detection result DR.
When the outlier detection method is used in the method of determining and providing the measurement result MR of the measurand m shown in
As an example, determining and providing the measurement result MR e.g. includes providing the detection result DR and providing the measured values my, the filtered values fv of the measured values my, and/or processed measured values pmv determined based on the measured values my and/or the filtered values fv.
As another example, in certain embodiments, determining and providing the measurement result MR e.g. includes based on the detection result DR eliminating each outlier that has been identified and determining and providing the measurement result MR as or based on the remaining measured values mv′ remaining after the outliers have been eliminated. In this case, providing the measurement result MR e.g. includes providing the remaining measured values mv′, providing filtered values fv′ of the remaining measured values mv′, and/or providing processed measured values pmv′ determined based on the remaining measured values mv′ and/or filtered values fv′ of the remaining measured values mv′. As an option, in this embodiment, providing the measurement result MR may additionally include providing the detection result DR, e.g. by indicating each new measured value mvj that has been identified as an outlier, by issuing a warning when an outlier has been identified and/or by issuing a notification or an alarm when a predetermined number of consecutively determined new measured values mvj has been identified as outliers.
The present disclosure provides the advantages mentioned above. Individual steps of the outlier detection method and/or the method of determining the measurement result MR can be implemented in different ways without deviating from the scope of the present disclosure. Several optional embodiments are described in more detail below.
As an example, in certain embodiments, the outlier detection method may include an additional method step of at least once, repeatedly, or periodically updating the combined distribution PDF(Δmf). In this case each update is e.g. performed by repeating the method step 300 of determining the combined distribution PDF(Δmf) based on new training data included in the recorded data D. In this case, the new training data includes at least one measured value my, that has been determined after a training time interval during which the measured values my included in the training data employed to determine the previous combined distribution PDF(Δmf) have been determined.
Following each update of the combined distribution PDF(Δmf), the method step 400 of determining and providing the detection result DR is then performed as described above based on the updated combined distribution PDF(Δmf). Thus, following each update, each determination of the probability of occurrence P(Δmfj) of the difference Δmfj between the respective new measured value mvj and the filtered value fvj−1 of the preceding measured value mvj−1 preceding the respective new measured value mvj is subsequently performed based on the updated combined distribution PDF(Δmf).
Updating of the combined distribution PDF(Δmf) is advantages in applications, where properties of the measured values my and/or properties of the noise included in the measured values my may change over time. In this case, each update provides the advantage, that changes of these properties that may have occurred since the last determination of the combined distribution PDF(Δmf) are accounted for.
With respect to the respective new training data, the number of updates and/or the frequency of the updates of the combined distribution PDF(Δmf) various strategies can be pursued individually and/or in combination.
In certain embodiments, updating the combined distribution PDF(Δmf) is e.g. performed at least once, repeatedly, or periodically based on new training data including a given number larger or equal to one of measured values my that have been determined after the training time interval during which the measured values my included in the training data employed to determine the previously determined combined distribution PDF(Δmf) have been determined. Correspondingly frequent updates are advantages in applications, where the properties of the measured values my and/or the noise may change quickly.
In addition, or as an alternative, the combined distribution PDF(Δmf) is e.g. updated at least once, repeatedly, or periodically based on new training data including measured values my, that have been determined during a time interval of a predetermined duration preceding the determination of the respective updated combined distribution PDF(Δmf).
In addition, or as an alternative, the combined distribution PDF(Δmf) is e.g. updated after an event that may have an impact on the properties of the measured values my and/or the properties of the noise included in the measured values my has occurred. In context with the method of determining the measurement result MR events triggering an update of the combined distribution PDF(Δmf) to be determined e.g. include: a maintenance performed at the measurement site and/or on the measurement device MD, a repair, a modification or a replacement of the measurement device MD, a shutdown of the measurement site and/or an interruption of a process performed at the measurement site, and/or a change of the process application and/or a process performed at the application, where the measurement device MD is employed.
In addition, or as an alternative the combined distribution PDF(Δmf) is e.g. updated after an event given by a change of the constant time interval Δt between consecutively determined measured values mvi, mvi−1 or a change of at least one of the properties of the distribution of the time differences Δti between consecutively determined measured values mvi, mvi−1.
In certain embodiments, the combined distribution PDF(Δmf) is e.g. updated after an event given by a time difference between a new measured value mvj and the preceding measured
value mvj−1 exceeding a predetermined time limit. Such a situation may e.g. occur, when the measurement of the measurand m and/or a process performed at the measurement site is interrupted and/or when transmission and/or reception of the measured values my to be recorded is temporarily interrupted.
Regardless of the type of event triggering the update, the updated combined distribution PDF(Δmf) is e.g. determined based on new training data including at least a predetermined number of measured values my that have been determined after the event, and/or measured values my that have been determined during a time interval having a duration longer or equal to a minimum duration after the event occurred.
In addition, or as an alternative, updating the combined distribution PDF(Δmf) e.g. includes a method step of determining a degree of similarity between the new training data and the training data employed for the previous determination of the distribution PDF(Δmf). In this case, the combined distribution PDF(Δmf) is preferably only updated when the degree of similarity is below a predetermined threshold. In addition, or as an alternative, updating the combined distribution PDF(Δmf) is preferably postponed in case the degree of similarity exceeds the predetermined threshold. When the updating is postponed, it is e.g. postponed to a later time when sufficiently dissimilar new training data becomes available.
With respect to the filtering of the measured values my performed in method step 200 filters 13 performing filtering methods known in the art can be employed. Excellent filtering results are e.g. attained by the filtering method disclosed in German Patent Application DE 102022111387.6 filed on May 6, 2022, incorporated herein by reference.
When this filtering method is employed in the outlier detection method disclosed herein, the filtering method is performed based on the data D recorded in method step 100. As illustrated in the flow chart shown in
Regardless of the type of filter employed, the filter 13 is e.g. configured to operate based on parameter settings, that are adjustable in a manner that enables for the filtering strength S of the filter 13 to be set to a number of different predetermined filtering strength Sn. In certain embodiments, the filtering strength S is e.g. understood as a conceptual indication reflecting how much noise included in the measured values my will be taken out by the filter 13 being adjusted to have the respective filtering strength S.
As shown in
As illustrated in
In context of the filtering method, various methods of determining the decay of the fractal dimensions Δdn can be employed. As a first example, the decay of the fractal dimensions Δdn is e.g. determined for each iteration n individually based on the fractal dimension d0 of the measured values my included in training data. In this case each iteration n e.g. includes a step of determining the decay of the fractal dimensions Δdn as or based on a ratio of the fractal dimension dn determined during the respective iteration n and the fractal dimension d0 of the unfiltered measured values my included in training data, e.g. by Δdn:=dn/d0. As a second example, for each iteration n, the decay of the fractal dimensions Mn is e.g. determined based on the fractal dimension dn determined during the respective iteration n and the fractal dimension dn−1 determined during the previous iteration n−1. In this case each iteration n e.g. includes a step of determining the decay of the fractal dimensions Δdn as or based on a ratio of the fractal dimension dn determined during the respective iteration n and the fractal dimension dn−1 determined during the previous iteration n−1,
e.g. by Δdn:=dn/dn−1. As an alternative another method of determining the decay of the fractal dimensions Δdn at the end of each iteration n can be employed instead. Examples include a method of determining the decay of the fractal dimensions Δdn based on three or more of the previously determined fractal dimensions di, dj, dk, . . . ; i, j, k . . . ∈[0, 1, . . . , n]; i≠j≠k and/or based on a property of a function fitted to several or all of the previously determined fractal
dimensions d0, d1, . . . , dn.
Regardless of the method applied to determine the decays of the fractal dimensions Δdn the iterative process is terminated when the decay of the fractal dimensions Mn drops below the predetermined threshold Δdref. Following this, in method step F200 of the filtering method, the filter 13 is put into operation based on the parametrization corresponding the filtering strength Sn applied in the last iteration n. Subsequently, the measured values my are filtered and the corresponding filtered values fv are determined and provided by the thus parametrized filter 13.
The fractal dimensions dn of the filtered values [fv]n provide a quantitative measure of the complexity of the filtered values [fv]n. Correspondingly, the sequence of fractal dimensions dn determined during the iterations n provide a quantitative measure of the parameter-dependent capability of the filter 13 to eliminate the noise included in the measured values my. Thus, the parametrization determined based on the decays of the fractal dimensions Mn constitutes an optimum parametrization most capable of separating the main component of the measured values my representing the true value of the measurand m from the noise in view of the application specific properties of the measured values my and the application specific properties of the noise. Another advantage is, that this optimum parametrization is determined in an entirely data driven manner, that neither requires an expert analysis nor any prior knowledge of the properties of the measured values my and the noise.
Using this filtering method in the outlier detection method provides the advantage, that a very high degree of accuracy and reliability of the combined distribution PDF(Δmf) is attained. This is so, because the high degree of trueness of the filtered values fv to the true values of the measurand m ensures a correspondingly high degree of accuracy and reliability of the difference distribution PDF(Δfv) as well as of the noise distribution PDF(N) determined based on the residues r between the measured values my and the filtered values fv.
In certain embodiments, the outlier detection method may include an additional method step of at least once, periodically, or repeatedly updating the parametrization of the filter 13. In this case each update is e.g. performed by repeating the method step F100 of parametrizing the filter 13 based on new training data included in the recorded data D, that includes at least one measured value my that has been determined and/or recorded after the parametrization of the filter 13 has last been determined. Following each update of the parametrization, the filtered values fv of the measured values my are then determined with the filter 13 operating based on the updated parametrization. As an example, the parametrization of the filter 13 is e.g. updated each time the combined distribution PDF(Δmf) is updated. In this case, the new training data employed to determine the updated combined distribution PDF(Δmf) is e.g. also applied to determine the updated parametrization.
The outlier detection method and/or the method of determining the measurement result MR disclosed herein is preferably performed as a computer implemented method. In that case, the method steps of the respective method, in particular the method step 300 of determining the combined distribution PDF(Δmf) and the method step 400 of determining and providing the detection result DR based on the combined distribution PDF(Δmf) are performed by computing means 15 by means of a computer program SW based on the measured values my and their time of determination t provided to the computing means 15. Thus, the present disclosure is also realized in form of a computer program SW comprising instructions which, when the program is executed by a computer, cause the computer to carry out the respective method disclosed herein. In addition, the present disclosure further comprises a tangible computer program product comprising the computer program SW described above and at least one computer readable medium, wherein at least the computer program SW is stored on the computer readable medium.
In computer implemented embodiments, the filter 13 and/or the filtering method performed in method step 200 disclosed herein are e.g. implemented in software included in the computer program SW.
When the respective method is performed as a computer implemented method, the data D is e.g. transferred to and at least temporarily stored in a memory 17 associated to the computing means 15. The computing means 15 is e.g. embodied as a unit including hardware, e.g. one or more computing units or processors, a computer, or a computing system.
The present disclosure disclosed herein is also realized in form of the measurement device MD configured to perform the method of determining and providing the measurement result MR disclosed herein. In the example shown in
As an alternative option, the computing means 15 and the memory 17 may be located outside the measurement device MD. Thus, regardless of the location of the computing means 15 and the memory, the present disclosure disclosed herein is also realized in form of a measurement system MS comprising the measurement device MD determining and providing the measured values my, the computing means 15 configured to receive the measured values my and to provide the measurement results MR determined by the computing means 15, the memory 17 associated to the computing means and the computer program SW installed on the computing means 15 which, when the program is executed by the computing means 15, cause the computing means 15 to carry out the method of determining and providing the measurement result MR as described above based on the measured values my provided to the computing means 15 by the measurement device MD.
When the computing means 15 are located outside the measurement device MD, the measured values my determined by the measurement device MD are directly or indirectly provided to the computing means 15 or the memory 17 associated to the computing means 15. To this extent hard wired or wireless connections and/or communication protocols known in the art, like e.g. LAN, W-LAN, Fieldbus, Profibus, Hart, Bluetooth, Near Field Communication, TCP/IP etc. can be applied.
In certain embodiments, the measurement system MS, may include more than one measurement device MD. In this case, the computing means 15 are configured to receive the measured values my provided by each of the measurement devices MD and to provide the corresponding measurement results MR determined by the computing means 15 by executing the computer program SW for each of the measurands m determined or measured by the respective measurement device MD.
In the example shown in
In
Regardless of the number of measurands m, L, ρ, F1, F2 for which the method disclosed herein is performed and regardless of the location of the computing means 15 employed to determine the corresponding measurement result(s) MR, the measurement result(s) MR determined by the method disclosed herein provide the advantage, that outliers included in the measured values my are identified. This enables for the risk of wrong decisions to may be made, and/or unsuitable actions to be performed based on outliers to be eliminated. Correspondingly, the measurement result(s) MR provided by the method can be safely employed and/or are employed to monitor, to regulate and/or to control the respective measurand m, L, ρ, F1, F2, an operation of a plant or facility, e.g. a production facility, and/or at least one step of a process, e.g. a production process, performed at the application, where the measurement device(s) MD, M1, M2, M3, M4, is/are employed. In the example shown in
Number | Date | Country | Kind |
---|---|---|---|
10 2022 117 436.0 | Jul 2022 | DE | national |