This invention relates generally to a system for monitoring non-coincident, nonstationary process signals. More particularly, this invention relates to a system for monitoring non-coincident, nonstationary process signals used in detecting deficiencies in various stages of manufacturing processes, biological process and the like.
There is often a need or desire to monitor finite length, non-stationary signals that may include repetitive deterministic artifacts that are non-coincident in time. This phenomenon occurs, for example, in many engineering systems that contain moving parts that are monitored by digitizing sensors monitoring signals relevant to the quality of those parts.
For example, an assembly line where the thickness of manufactured plastic or metal components might be measured. In such an example, every component passing through the sensor produces a signal that has a shape that is substantially similar to the preceding signal—but the signal may be longer or shorter depending upon the speed of the conveyor belt. Another example would be the force applied to the die set in a metal stamping machine. Once again, a signal representing this force would possess a similar shape with every repetition of the machine's movement. The length of the force signal, however, may be longer or shorter depending upon how fast the machine is operating. Biological signals may also produce signals with repetitive deterministic artifacts. One such example includes the use of cardiac signals from a biological heart monitored from EKG traces.
In each of the foregoing cases, if one were to digitize and then plot the monitored signals, the length of the repetitive deterministic artifacts would vary from part to part or from cycle to cycle, depending upon the speed and variability of the system or organism being monitored. A reference signal can often be used to compare to these repetitive signal waveforms for detection of anomalies, but only if their lengths are exactly the same. If their lengths are not the same, large discrepancies between the reference signal and the input signal would be seen due to the signals not being coincident. Such discrepancies could result in an erroneous diagnosis.
It is therefore an object of the invention to develop an improved method for monitoring non-coincident and non-stationary process signals.
It is a further object of the invention to develop an improved system for monitoring non-stationary, non-coincident process signals of a definite length.
It is yet another object of the invention to develop an system for monitoring non-coincident, non-stationary process signals that correspond to a manufacturing process.
It is yet another object of the invention to develop an system for monitoring non-coincident, non-stationary process signals that correspond to a biological process, such as signals emanating from a biological heart.
In accordance with the above objects, a system is provided including a series of steps for developing a reference and for characterizing an input signal or signals for meaningful comparison with the reference. The first step includes the use of a training sequence for determining a mean and variance of a reference wave form and to define a reference wave form length. The leading and falling edges of the repetitive deterministic artifacts are determined in the monitored signal and to calculate the sample length. The monitored signal is then resampled to properly correlated with the reference signal, and the two signals are arranged such that they are coincident in time. The monitored signal is then shifted with respect to the sequence signal so that the monitored signal has the same number of samples as the reference length identified in the first step. The adjusted monitored signal is then compared to the stored reference signal.
These and other objects, advantages and features of the invention together with the organization and manner of operation thereof will become apparent from the following detailed description when taken into conjunction with the accompanying drawings wherein like elements have like numerals throughout the drawings described below.
In order to illustrate embodiments of the invention, wherein the monitored signal and the reference signal comprise a repetitive waveforms, an explanation is provided to generally describe the methodology and function for the systematic procedure of the invention and then the stepwise algorithmic approach is presented in detail. Although the manner in which the phenomena are described is one rigorous approach which explains the operation of the invention for those skilled in the art, other conventional mathematical and theoretical explanations can also be used to describe similar results which characterize embodiments of the invention. The invention is therefore not limited to the description of its operation by following illustrative mathematical explanations.
The present invention involves the use of a step-wise procedure for monitoring a plurality of repetitive signals.
After the training process completes steps 201-206, the data source for monitoring is selected, shown at 207. Again, the selection can be an on-line or real-time source, shown at 204, or it can be a storage media source, shown at 203. Next, data are acquired for analysis, shown at 208, from the chosen source. The acquired data are fed as input to the system monitoring module, shown at 209, which determines whether or not the input data are deviating from the trained normal conditions. The results from the monitoring module are directed at 210 to one or both of a data logging system, shown at 211, and/or a terminal display or monitoring indication mechanism, represented at 212.
The first data processing step, shown at 217, is a method for determining the leading and trailing edges of each individual signature in the input. An example of this procedure is illustrated in
The next step in the training procedure is to store a plurality of identified signatures in computer or embedded memory 218. As each signature is extracted from the training data set its sample length is measure and stored as well, shown at 219. Then a reference length Nref is calculated from all of the measured signature lengths at 220. The reference length can be determined from the minimum, maximum, median or mean of the plurality of measured signature lengths.
The reference length Nref is used to determine the re-sampling rate applied to each stored signature so that the lengths of all signatures are the same, represented at 221. The re-sampling is accomplished using a digital fractional re-sampling filter. The basic structure of the filter is shown in FIG. 6. If the raw input signature or data sequence representated by x(n) at 243 has an original length of N, then the signature is re-sampled using the re-sampling filter to produce a new signature of length Nref. First x(n) is fed through an expander at 244 that inserts Nref zeros between each original sample. Then a low-pass anti-aliasing filter 245, is applied to the resulting zero padded data sequence acting as an interpolator. The interpolated sequence is then decimated at 246 by a factor of N to produce the desired length of Nref for output signature y(n). In cases where N and Nref are large, may be more efficient to first simplify the ratio Nref/N to their equivalent ratio of smallest integer (i.e., 40/30=4/3).
In step 222 the re-sampled signatures are padded on both sides with a plurality of zeros. Each new re-sampled signature is compared with all previously processed signatures using a vector similarity calculation defined to be between 0 and 1 (1 for identical and 0 for no similarity) at step 223. The new signature is shifted forward and/or backward until the similarity is maximized, ensuring that the signatures optimally line up with one another. After the signatures have been lined up, the extraneous samples on both ends of the signature are removed at 224.
The next step in the training process is to calculate the mean and standard deviation for each sample in the Nref length signatures producing Nref mean values and Nref standard deviation values, shown at 225. The parameter Nref and the vectors of mean values and standard deviation values are stored for use during the monitoring phase of operation at 226 and the training is completed at step 227.
The leading and trailing edges of each signature present in the monitored data are identified sequentially at 233 using the procedure described during the training phase step 217. The signatures are re-sampled to equalize their lengths in step 234 using the same procedure as in step 221. The similarity optimization at 235, 236 and 237 is used to line the monitored signature with the reference mean, μ calculated during step 225 of the training phase. A number of similarity measurement techniques may be used. In one embodiment of the invention, a bounded angle ratio test (BART) is used as the similarity measurement technique. The BART system is discussed in detail in U.S. patent application Ser. No. 09/373,326, incorporated herein by reference. It is also possible to use other systematic methods for the third step 520. For example, one could measure the distance between two Euclidean vectors as a possible technique. The details of the most preferred BART measurement technique are described below.
The re-sampled and lined up signature is then differenced with the mean value vector to produce the residual vector R in step 238. In a particular embodiment of the invention, this is accomplished using a non-stationary sequential probability ratio test (SPRT). The SPRT system is discussed in detail in U.S. Pat. No. 5,223,207 and incorporated herein by reference. A SPRT decision ratio is then calculated to determine whether the monitored signal falls outside of normal operating conditions. This monitoring procedure can continue in real-time for the remainder of the operating run. Alternatively, the procedure can continue until a user decides to retrain the automated system.
Parameter settings for the detection engine 240 are set manually before monitoring begins or are loaded from a stored data file that can be used over and over at step 239. The results of the detection engine 240 are then processed in step 241 to determine the amount of deviation in the monitored signatures from the trained reference signature. The processing step produces an alert if the deviation is greater than a user specified amount (SFMp-positive deviation, SFMn-negative deviation, SFMs-standard deviation change) with a confidence level determined by specified false (α) and missed alarm (β) probabilities. The alert is then logged and/or displayed in the final step of the monitoring process at 242.
As described above, a non-stationary sequential probability ratio test (SPRT) is preferably used to compare the adjusted monitored signal to the stored reference signals. In one example of the method, SPRT teaches a expert system and method to determine the degradation of nuclear reactant coolant pumps and their respective sensors prior to failure.
FIG. 8. illustrates the architecture of the expert system for an online pump-surveillance system. The two coolant pumps 1 and 2 are each equipped with numerous sensors 3-6. A typical sensor arrangement is depicted in
The various recited SPRT modules monitor and compare the signals from two similar sensors which respond to a single parameter representing a physical condition associated with the pump. The purpose of this comparison is to identify subtle changes in the statistical quality of the noise associated with either signal when compared one to the other. In applications involving two or more reactor coolant pumps equipped with identical sensors, a SPRT monitor applied to the pumps will provide a sensitive annunciation of any physical disturbance affecting one of the pumps. If each of the pumps had only one sensor, it would be difficult for the SPRT technique to distinguish between a pump degradation event and a degradation of the sensor itself. However, when each pump is equipped with multiple, redundant sensors, the SPRT technique can be applied to pairs of sensors on each individual pump for sensor-operability verification.
As is illustrated in the logic diagram of
The processor 18, of module 13, first interrogates the signals N1 and N2, representing the mean shaft speed for the coolant pumps 1 and 2, respectively. The mean shaft speed signal is obtained by averaging the outputs of the three RPM sensors 3 on each of the pumps 1 and 2. If a problem is identified in the comparison of N1 and N2, a sequence of SPRT tests is invoked to validate the three sensors on the pump 1, signified by A1, B1, and C1. If one of those sensors is identified as degraded, an audible alarm 11 is actuated. If the three sensors on pump 1 are found to be operating within tolerance, then the three corresponding sensors on the pump 2 are tested. If all six sensors are confirmed to be operational, execution is passed to the next SPRT module which in this case is the SPRT module 14 which tests the vibration-level variable. If these sensors are found to be operational, then the testing is functionally shifted to the module 15 the power-signal variable, and then if it is found to be functioning properly to the module 16 the discharge-pressure variable. This sequential organization is illustrated in FIG. 7. If a problem is identified in any module, an audible alarm, 10, 11 or 17 is sounded in the reactor control room, and the operator can initiate a manual shutdown of the reactor to repair the identified problem.
The objective of the AI engine in the expert system is to analyze successive observations of a discrete process Y which represents a comparison of the stochastic components of two physical processes monitored by similar sensors. Let yk represent a sample from the process Y at time t. During normal operations with an undergraded physical system and with sensors that are functioning within specifications, the ykj should be normally distributed with means 0. If the two signals being compared do not have the same nominal means due, for example, to differences in calibration, then the input signals will be pre-normalized to the same nominal mean values during initial operation.
The specific goal of the A1 engine is to declare system 1 or system 2 degraded if the drift in Y is sufficiently large that the sequence of observations appears to be distributed about means +M or −M, where M is a preassigned system distribution magnitude. The SPRT provides a quantitative framework that enables us to decide between two hypotheses, H and H2, namely:
If it is supposed that H1 or H2 is true, we wish to decide for H1 or H2 with probability (1−β) or (1−α) respectively, where α and β represent the error (misidentification) probabilities.
From the theory described by Wald and Wolfowitz in “Optimum Character of the Sequential Probability Ratio Test, ” Ann. Math. Stat., 19,326 (1948), the most powerful test depends on the likelihood ratio 1n, where
Probability of observed sequence given H1 true.
Probability of observed sequence given H2 true.
After n observations have been made, the sequential probability ratio is just the product of the probability ratio is just the product of the probability ratios for each step:
where F(yi|H) is the distribution of the random variables y.
the Wald-Wolfowitz theory operates as follows:
Continue sampling as long as
A<1n<B (1)
Stop sampling and decide H1 as soon as 1n≧B, and stop sampling and decide H2 as soon as in 1n≦A. The acceptance thresholds are related to the error (misidentification) probabilities by the following expressions.
where
Assuming the random variable yk is normally distributed, the likelihood that H1 is true (mean M, variance σ2) is given by
Similarly for H2 (means o, variance α2),
The ratio of equations (3) and (4) gives the likelihood ratio 1n; where 1n is expressed as
combining equations 1, 2 and 5, and taking the natural logs, gives
then the sequential sampling and decision strategy can be concisely represented as
The SPRT analysis formulated here cannot be applied directly to non-Gaussian signals. For applications to nuclear system signals contaminated by non-Gaussian noise, an attempt should first be made to pretreat the input signals with a normalizing transformation.
For applications where (a) one requires a high degree of assurance that a system is functioning within specifications and (b) there is not a large penalty associated with false alarms, it is not uncommon to specify a B (missed alarm probability) that is much smaller than A (false alarm probability). In safety critical systems one may be more willing to incur a false alarm than a missed alarm. For applications where a large cost penalty is incurred with any false alarms, it is desirable to keep both A and B small.
The trade-off that must be considered before one specifies arbitrarily small values for A and B is the effect this may have on the sensitivity and maximum decision time needed by the SPRT to annunciate a disturbance. The desired sensitivity of the SPRT is fixed by specification of M, the system disturbance magnitude. For a given value of M, the average sample number required to reach a decision is influenced by A and B and also by the variance associated with the signals being monitored. It takes longer to identify a subtle change in a process characterized by a low signal-to-noise ratio than in one with a high signal-to-noise ratio.
The non-stationary version of the SPRT algorithm is a slightly modified version of Wald's SPRT. In the non-stationary case, the failure magnitude, M, reference signal
(or mean), μ, and the reference variance, , are sample dependent. Therefore, the non-stationary SPRT equation becomes
where n=1,2, . . . ,L and L is the length of the length equalized signals. In this case, y(n) is the length of the equalized monitored signal, μ(n) is the corresponding reference signal generated during the training phase and (n) is the variance of each point in μ(n).
The bounded angle ratio test (hereinafter BART) mentioned above is employed in systems with more than two variables, as shown in FIG. 9. For example, BART can be used on an actual sensor signal exhibiting non-white characteristics, such as for example, on sensor signals from the primary pump #2 of the EBR-II nuclear reactor at Argonne National Laboratory (West) in Idaho. In such a case, the signal can be a measure of the pump's speed over a given amount of time. In such a situation, one can use a nonlinear multivariate regression technique that employs an N Dimensional Space (known in vector calculus terminology as hyperspace) to model the relationships between all of the variables. This regression procedure results in a nonlinear synthesized estimate for each input observation vector based on the hyperspace regression model. The nonlinear multivariate regression technique is centered around the hyperspace BART operator that determines the element by element and vector to vector relationships of the variables and observation vectors, given a set of system data that is recorded during a time period when everything is functioning correctly.
In the BART method described in
During the BART monitoring phase, a sample vector is acquired at each time step t, that contains a reading from all of the sensors (or data sources) being used. Then the similarity angle (SA) between the sample vector and each sample vector stored in H is calculated. Next an estimate of the input sample vector Y is calculated using the BART estimation equations. The difference between the estimate and the actual sensor values is then used as input to the SPRT module. Each difference is treated separately so that a decision can be made on each sensor independently. This method is described in more detail hereinafter.
In this preferred embodiment of
In the most preferred form of BART an angle domain must be determined. The angle domain is a triangle whose tip is the reference point (R), and whose base is the similarity domain. The similarity domain consists of all scalars which can be compared with a valid measure of similarity returned. To introduce the similarity domain, two logical functional requirements can be established:
BART also requires some prior knowledge of the numbers to be compared for determination of the reference point (R). Unlike a ratio comparison of similarity, BART does not allow “factoring out” in the values to be compared. For example, with the BART methodology the similarity between 1 and 2 is not necessarily equal to the similarity between 2 and 4. Thus, the location of R is vital for good relative similarities to be obtained. R lies over the similarity domain at some distance h, perpendicular to the domain. The location on the similarity domain at which R occurs (Xmed) is related to the statistical distribution of the values to be compared. For most distributions, the median or mean is sufficient to generate good results. In a preferred embodiment the median is used since the median provides a good measure of data density and is resistant to skewing caused by large ranges of data.
Once Xmed has been determined, it is possible to calculate h. In calculating h, it is necessary to know the maximum and minimum values in the similarity domain. (Xmax and Xmin respectively) for normalization purposes the angle between Xmin and Xmax is defined to be 90°. The conditions and values defined so far are illustrated in FIG. 10. From this triangle it is possible to obtain a system of equations and solve for h as shown below:
c=Xmed−Xmin
d=Xmax−Xmed
a2=c2+h2 (19)
b2=d2+h2
(c+d)2=a2+b2
(c+d)2=c2+d2+2h2
h2=cd
h=√{square root over (cd)}
Once h has been calculated the system is ready to compute similarities. Assume that two points: X0 and X1 (X0≦X1) are given as depicted in FIG. 11 and the similarity between the two is to be measured. The first step in calculating similarity is normalizing X0 and X1 with respect to Xmed. This is done by taking the euclidean distance between Xmed and each of the points to be compared. Once X0 and X1 have been normalized, the angle <X0RX1 (hereinafter designated θ) is calculated by the formula:
θ=ArcTan(X1|h)=ArcTan(X0|h) (20)
After θ has been found, it must be normalized so that a relative measure of similarity can be obtained that lies within the similarity range. To ensure compliance with functional requirements (A) and (B) made earlier in this section, the relative similarity angle (SA) is given by:
Formula (21) satisfies both functional requirements established at the beginning of the section. The angle between Xmin and Xmax was defined to be 90°, so the similarity between Xmin and Xmax is 0. Also, the angle between equal values is 0° . The SA therefore will be confined to the interval between zero and one, as desired.
To measure similarity between two vectors using the BART methodology, the average of the element by element SAs are used. Given the vectors x1 and x2 the SA is found by first calculating Si for i=1,2,3. . . n for each pair of elements in x1 and x2 i.e.,
if x1=└X11X12X13 . . . X1n┘and
x2=└X21X22X23. . . X2n┘
The vector SA Γ is found by averaging over the Si's and is given by the following equation.
In general, when given a set of multivariate observation data from a process (or other source of signals), linear regression could be used to develop a process model that relates all of the variables in the process to one another. An assumption that must be made when using linear regression is that the cross-correlation information calculated from the process data is defined by a covariance matrix. When the cross-correlation between the process variables is nonlinear, or when the data are out of phase, the covariance matrix can give misleading results. The BART methodology is a nonlinear technique that measures similarity instead of the traditional cross-correlation between variables. One advantage of the BART method is that it is independent of the phase between process variables and does not require that relationships between variables be linear.
If there is a random observation vector y and a known set of process observation vectors from a process P, it can be determined if y is a realistic observation from a process P by combining BART with regression to form a nonlinear regression method that looks at vector SAs as opposed to euclidean distance. If the know observation vectors taken from P are given by
where H is k by m (k being the number of variables and m the number of observations), then the closest realistic observation vector to y in process P given H is given by
y=Hw (24)
Here w is a weighting vector that maps a linear combination of the observation vectors in H to the most similar representation of y. The weighting vector w is calculated by combining the standard least squares equation form with BART. Here ⊕ stands for the SA operation used in BART.
w=(H′⊕H)−1H′⊕y (25)
An example of use of the BART methodology was completed by using 10 EBR-II sensor signals. The BART system was trained using a training data set containing 1440 observation vectors. Out of the 1440 observation vectors, 129 of these were chosen to be used to construct a system model. The 129 vectors were also used to determine the height, h, of the angle domain boundary as well as the location of the BART reference point R for each of the sensors used in the experiment. To test the accuracy of the model 900 minutes of one minute data observation vectors under normal operating conditions were run through the BART system. The results of the BART system modeling accuracy are shown in
A second example shows the results of applying BART to ten sensors signals with three different types of disturbances with their respective BART estimates superimposed followed by the SPRT results when applied to the estimation error signals. The first type of disturbance used in the experiment was a simulation of a linear draft in channel #1. The drift begins at minute 500 and continues through to the end of the signal, reaching a value of 0.21% of the sensor signal magnitude and the simulation is shown in FIG. 22A. The SPRT (
While preferred embodiments have been shown and described, it should be understood that changes and modifications can be made therein without departing from the invention in its broader aspects. For example, it is possible that signals or waveforms could be measured from processes other than those in the manufacturing or biological fields. Additionally, there are many comparison techniques that could be used to correlate and compare the signals measured according to this invention. Various features of the invention are defined in the following claims.
This invention was made with government support under Contract No. W-31-109-ENG-38 awarded to the Department of Energy. The Government has certain rights in this invention.
Number | Name | Date | Kind |
---|---|---|---|
5223207 | Gross et al. | Jun 1993 | A |
5253186 | Lipner et al. | Oct 1993 | A |
5271045 | Scarola et al. | Dec 1993 | A |
5287390 | Scarola et al. | Feb 1994 | A |
5311562 | Palusamy et al. | May 1994 | A |
5351200 | Impink, Jr. | Sep 1994 | A |
5375150 | Scarola et al. | Dec 1994 | A |
5459675 | Gross et al. | Oct 1995 | A |
5485491 | Salnick et al. | Jan 1996 | A |
5517422 | Ilic et al. | May 1996 | A |
5528516 | Yemini et al. | Jun 1996 | A |
5629872 | Gross et al. | May 1997 | A |
5634039 | Simon et al. | May 1997 | A |
5689696 | Gibbons et al. | Nov 1997 | A |
5719796 | Chen | Feb 1998 | A |
5745382 | Vilim et al. | Apr 1998 | A |
5748496 | Takahashi et al. | May 1998 | A |
5761090 | Gross et al. | Jun 1998 | A |
5764509 | Gross et al. | Jun 1998 | A |
5774379 | Gross et al. | Jun 1998 | A |
5862054 | Li | Jan 1999 | A |
5971580 | Hall et al. | Oct 1999 | A |
5987399 | Wegerich et al. | Nov 1999 | A |
6049578 | Senechal et al. | Apr 2000 | A |
6066179 | Allan | May 2000 | A |
6107919 | Wilks et al. | Aug 2000 | A |
6134510 | Deco et al. | Oct 2000 | A |
6202038 | Wegerich et al. | Mar 2001 | B1 |
Number | Date | Country | |
---|---|---|---|
20030028349 A1 | Feb 2003 | US |