The present invention relates to a method and system for performing predictive modeling on more complex data, and particularly a system for achieving the predictive chaos analysis of non-linear data or events, and more preferably a system and method for analysis of EEG readings used to indicate the likely onset of epileptic seizures. More preferably, EEG readings data used to predict epileptic seizures are subjected to a further transformation to provide a model which is operable to predict or forecast the likely occurrence of an epileptic seizure that is about to occur in the future.
It has been recognized that long-term time series prediction has promise for many applications, such as prediction of earthquakes, financial market prediction, and the like, and where non-linear properties of a time series are evaluated and used for long-term prediction.
The prediction of complex time series future values is therefore a major concern for scientists with applications in various fields of science. Many natural phenomena such as variations in population, the orbit of astronomical objects and earth's seismic waves could be subject to a prediction algorithm. Prediction also has application in forecasting economic time series. Time series analysis of earth's seismic waves can be used for earthquake prediction. Prediction of other data such as population projections may be used to predict species extinction before they reach a tipping point may provide another application of time series prediction.
It has been shown that many data generated by such natural phenomena follow chaotic behavior. Various authors have proposed models for the predictive analysis of non-linear and chaotic events. Clements et al. in Forecasting Economic and Financial Time-Series with Non-linear Models, International Journal of Forecasting 20 (2004) 169-183 highlights the difficulties associated with conventional non-linear models used in the prediction of economic behavior and performance. Further, Yang et al. in Forecasting the Future: Is It Possible for Adiabatically Time-Varying Non-Linear Dynamical Systems? CHAOS 22, 033119 (2012) proposes a non-linear dynamical system in which parameters vary adiabatically with time where measured time series is used to predict future asymptotic attractors to the system. Wang et al. in Fuzzy Prediction of Chaotic Time Series Based on Fuzzy Clustering, Asian Journal of Control Vol. 13, No. 4, pp 576-581 (2011) also describes a process for time series prediction for use in weather forecasting, speech coding, noise cancellation and the like.
Most of the existing methods for complex time series prediction are based on modeling the time series to predict future values, although there are other types of methods like agent-based simulation that model the system generating the time series [Filippo Neri: Learning and Predicting Financial Time Series by Combining Natural Computation and Agent Simulation. Evo Applications (2) 2011: 111-119]. The model based approaches may be mainly classified in two main domains: linear models like ARIMA (AutoRegressive Integrated Moving Average) [G. Box and G. Jenkins, Time Series Analysis: Forecasting and Control, Holden-Day, San Francisco, 1976] and non-linear models like MLP [Zirilli, J.: Financial prediction using Neural Networks. International Thompson Computer Press (1997)] and GARCH [Bollerslev, Tim (1986). “Generalized Autoregressive Conditional Heteroskedasticity”, Journal of Econometrics, 31:307-327]. However, studies have concluded that there was no clear evidence in favor of non-linear over linear models in terms of forecast performance.
Although chaotic behaviors are deterministic, their complex properties make them hard to be distinguishing from random behavior. They are well known to be strongly dependent on initial conditions, small changes in initial conditions possibly leading to tremendous changes in subsequent time steps, and particularly difficult to predict. Since the exact conditions for many natural phenomena are not known and the properties of a chaotic time series are very complex, previously it is has proven difficult to model these systems.
Heretofore, however, there has been no robust procedure that can estimate an accurate model for chaotic time series. For conventional predictive methods, the prediction error increase dramatically with the number of time points predicted. As a result, most of existing methods focus on very short-term prediction to reach a reasonable level of accuracy. For example, for financial time series prediction a simple step ahead, may not prove overly helpful for acting against financial recession beforehand. Despite of the difficulties inherent to non-linear modeling, non-linear analysis has the potential for a variety of commercial applications.
The applicant has recognized that the prediction of non-linear health events provides significant health and/or social benefits. In medical science there are many applications for which an efficient prediction algorithm could save lives. By example, a large number of time series gained from the human body can be used as an origin of the decision making process to treat or prevent dangerous diseases such as heart attacks, cancers, epilepsy and Alzheimer's. By way of example, individuals who suffer from epilepsy may be prone to severe and unexpected seizures, which may prevent epileptics from performing routine tasks such as driving or operating heavy equipment, and which otherwise have the potential to result in the individual's injury should the onset of a seizure occur where the individual is physically in a vulnerable location, such as a stairwell or bathtub. It is recognized that by being able to predict the likely future onset of a seizure, an epileptic may be given advanced warning to provide for sufficient time to prepare for the seizure onset, and relocate to a physically safe environment.
The present invention provides a method and system for generating predictive models, and more particularly for undertaking the predictive analysis of non-linear historical data and/or real-time data to predict the likelihood of a selected event or future trend. More preferably, with the present invention, an appropriate method and system may be used in predicting long term variations across a variety of technologies, including without restriction, the prediction of medical events, the prediction of seismological or meteorological events or outcomes; the prediction of ecosystem trends; the prediction of health, pandemic and/or demographic events; as well as other macrogeographic events.
The present invention seeks to provide a simplified process and system for predictive analysis, and more particularly which may provide improved reliability for chaotic or event predictions.
In one non-limiting embodiment, the invention provides a system and method for time series prediction which analyzes of continuous electroencephalography (EEG) data. The system operates to output a user or other professional a signal or display indicating where in a further time series an epileptic seizure will occur. More preferably, the system operates whereby a predicted seizure which is later confirmed by recorded EEG data records showing the epileptic seizure to have occurred is updated, and used to establish future or a next predicted seizure event.
In a preferred construction, the invention provides a system for predicting an epileptic seizure which includes one or more EEG sensors for positioning on a user or subject's skull. The EEG sensors are operable to continuously record the user's brain electrical activity either continuously or over a selected period of time, and preferably reading fluctuations and output the user's EEG data. Most preferably, the sensors are adapted for electronic, and preferably wireless communication with a processor, which for example may be in the form of a device with computational capability such as a computer, tablet, smart phone, smart watch, personal digital assistant (PDA) or the like (hereinafter collectively “a personal digital assistant or PDA”). The PDA is configured to receive and process the output EEG readings and activities.
In undertaking the EEG recording, a threshold value in the processed data output of the computational device is determined as the level/threshold where the user's brain will experience at or about the onset of an epileptic seizure event. The threshold value is then stored in the PDA memory as a predetermined threshold value, which is indicative of an epileptic-seizure event. Alternately, the predetermined threshold value may be calculated and/or input independently, as for example by averaging empirical data collected from a number of third party subjects, and prestored in the PDA memory as part of a software component.
The PDA further operates to analyze the input EEG data received from the sensors. To initiate the system, initial EEG data is recorded continuously for a period of between 0.5 to 5 hours, and preferably about 60 minutes. The initial EEG data is preferably obtained directly from the user and used as a baseline upon which data processing may commence to produce reference values and start predicting values. EEG data is preferably logged and processed by the PDA in time interval ranging from 10 to 360 minutes, and typically between about 20 to 90 minutes, and most preferably about 60 minutes. Most preferably, the baseline recording is continuous.
In an exemplary mode, the system operates to output a warning to the user at least 16 minutes prior to the time the user is anticipated to experience the onset of an epileptic event such as a grand mal or Tonic-conic seizure.
In a preferred mode of operation the continuous data streaming into the computational device the device takes an EEG value every 10 to 60 seconds, and preferably about every 20 seconds. At the end of the 60 minutes of baseline data recording and in a preferred mode, the collection of 180 EEG data point an initial time series “SN”(x1, x2, . . . xN) is provided. The system then commences processing data and predicting values into the future. For the purpose of data analysis, a time window (L) of 10 to 60 minutes, and preferably 20 minutes is selected. The 20 second sampling interval selected; the 60 minute baseline selected and the 20 minute time window (L) are preferred and may be varied based on the input EEG data, the base data being used and the degree of accuracy to be achieved.
The processor is operated to convert or transform the collected EEG data value readings for individual time series over set sampling periods (x1, x2, . . . xN) to “V” value for selected time intervals of the recording period “L”. Most preferably, the processor transforms or converts the initial data set or series (SN) over the selected interval period (L) into a new data series SmN to calculate to a single value V(Si) using a Lyapunov characteristic exponent, and/or Fractional Dimension and/or P&H (“Poincaré Section and Higuchi Fractal Dimension”) methods. This may obtain a quantification of the rate of separation of events and/or fractal deviation to provide a statistical index of the complexity of the data over the selected time interval.
The “V” value is a measure of chaos in the data set (x1, x2, . . . xL,) and is plotted as “yL” the first point of the transformed time series at time position “L”. The window is moved one data point to the right (x2, x3, . . . xL+1,), and the “V” value is calculated and plotted as “yL+1” at time sequence position L+1. This is repeated until all data points from time position L to N have been converted.
The present invention provides in another non-limiting embodiment, a simplified processes and systems for predictive analysis, and more particularly which may provide improved reliability for chaotic or event prediction. More preferably, the invention provides a system and method for time series prediction which analyzes of continuous electroencephalography (EEG) data and can predict where in the future time series an epileptic seizure will likely occur, and which may be later confirmed by the recorded EEG data showing epileptic seizures.
In another non-limiting embodiment, the invention provides a system for predicting an epileptic seizure which includes one or more EEG sensors for positioning on a subject's skull, and which are operable to continuously record the wearer's brain electrical wave activity, and preferably electrical reading fluctuations and output the user's EEG data. The system may for example, be provided with wearable electronics and/or diagnostics such as a smart watch or glasses, sensor bands, and/or EEG sensors held to skin with electrically conductive adhesive. Most preferably, the sensors are adapted for wireless electronic communication with a processor or PDA. The PDA is configured to receive, store and process data representative of the output EEG oscillations and activity. In undertaking the EEG recording, a user-specific threshold value is preferably determined based on the processed continuous EEG signal over a period of time, and which identifies a level of the user's brain electrical activity above which the user experiences or is about to experience the onset of a selected epileptic seizure event. This threshold level value is then stored in the PDA memory as a predetermined threshold value, which is indicative of an epileptic seizure occurrence.
The PDA processor operates to analyze the input EEG data. To initiate the system, EEG data is preferably continuously recorded for at least 40 minutes, and more preferably about 60 minutes±15 to obtain directly from the user the baseline upon which data transformation is done. The transformation produces a new series of values upon which data processing may commence to produce reference values and provide predictive EEG data derived values into the future. The predicated data values are then preferably logged into the system; and are themselves then transformed; and processed by the PDA continuously for a recording period ranging from as little as 60 to 240 minutes; to days; weeks or monthly or semi-annual periods. The data analysis window is typically between about 15 to 30 minutes, and more preferably 20 minutes. The selection of the data processing window is selected based on data characteristics and the objective of the prediction. The level of confidence in the predicted data decreases as the predicted data window gets larger and also with data that is predicted into the future beyond the time interval of the processing window. Most preferably, the recording period continues through uninterrupted after the baseline data has been recorded. The system operates to output a warning to the user 5 to 120 minutes, and preferably at least 16 minutes prior to the time the user is expected to experience an epileptic event, such as a grand mal or Tonic-conic seizure.
In accordance with another possible preferred mode of operation, a selected number of data points N of the non-linear variable are monitored over a selected time sampling period T(LN), numbering from about one hundred or more, to ranges of several thousand 1000 to 2500. Where the system is used in predicting epileptic seizure events, 60 minutes of EEG data is preferably used as initial baseline sample. Longer or shorter periods based on the type of data observed and a patent-based assessment based experience with the system may, however, be used. The initial time interval period (L) is preferably selected at about a first 20 minutes, with individual sampling time intervals preferably selected at about 20 seconds with the result that 180 data points (3 for every minute) are chosen.
An initial baseline data series is SN (x1, x2, . . . xN) is generated which preferably includes 60 minutes of EEG data containing 180 data points.
The selections of 60 minutes of data and the 20 minute L time interval are selected based on reasonable values that would fit the case being evaluated and do not constitute fixed values to be used in every application. Using the data points, “Fractal dimension”, P&H and “Lyapunov exponent” calculation are used to achieve a single constant that characterizes a non-linear data reference value of the initial interval time series V(SN) for the monitored period.
The Data Series SN is transformed to the new data series SmN={yL, yL+1, . . . , yN} having new values “V”.
“Fractional Dimension” or “Lyapunov” or “P&H” calculate “a” value for data points (x1, x2, . . . xL) initially for a first series (i.e. at time interval of 20 minutes) of data. This “V” value is chosen as the new data point/value yL=V(x1, x2, . . . xL) at time position “L” in the new transformed time series “SmN”. The data interval is next shifted one data interval or time series to series (x2, x3, . . . xL+1), and the “V” value for this series is calculated which becomes the new data/point value at time position “yL+1=V(x2, x3, . . . xL+1)”. The process repeats by the continued shifting of the time interval one data point position toward xN and calculating “V” values which become the new data/point value at that time position until the xN value has been transformed. This completes the data transformation to this point in time/data readings providing the new data series SmN
For the data series SmN created above (and which by way of non-limiting example runs from the L or 20 minute time point to the 60 minute time point and having 120 data points) calculate a “V” value using the same “Fractional Dimension” or “Lyapunov” or “P&H”. This value V(SmN) is then used as reference “V” value for predicting next future data point yN+i values in the time series.
In the series yL, to yN+1 (which in an exemplary application is selected from the 20 minute time mark to the 60 minute time mark) the value vertical difference between each consecutive data point yL to yN is taken and the normal distribution of these values N(yi, σ2) is calculated. Using the calculated normal distribution N(yi, σ2) above centered on the Y-axis of yN, a series of new random data points or values are generated by a random number generator on the next time increment to be evaluated for the next point to be predicted on line at the time interval TN+1. Preferably between 5 and 25 and about 10 random data points are generated following the normal distribution curve N, however fewer or larger numbers may be used.
For an associated data sequence containing each of the randomly generated points, generated by the random number generator on line yN+1, a new “V” value is established using the data sequence (yL, yL+1, . . . yN, yjN+1), where yjN+1 represents the new points generated by the random number generator. As a result, where 10 random numbers are used Vj values separate V1 to V10 are calculated to develop V1=V(yi, yi+1, . . . yN, y1N+1), V2=V(yi, yi+1, . . . yN, y2N+1), . . . V10=V(yi, yi+1, . . . yN, y10N+1).
Each of the calculated Vj values are compared to the V(SmN) reference value, and the Vj value closest to V(SmN) is selected as the new predicted point yN+1=yjN+1.
The steps of randomly generating data points and establishing and comparing calculated Vj values and reference values are repeated until preferably at least sixteen minutes of data, and more preferably approximately one-third of the reference data is projected into the future.
In one aspect, the present invention resides in a monitoring system for providing a user with advance warning of a likely seizure event, and wherein the system comprising: a signaling mechanism operable to provide at least one of an audible, visual or sensory warning signal to said user indicative of a predicted seizure event; a sensor assembly having at least one sensor and preferably a wireless microsensor operable to sense and output sensed data values representative of the user's electroencephalographic (EEG) wave forms readings substantially continuously, a computing device having a processor and memory, the computing device electronically, and preferably wirelessly, communicating with said sensor assembly for receiving the output sensed data in said memory, the processor including program instructions operable to perform one or more of process steps described hereafter.
In another aspect, the PDA operates to select a spot value from the continuous data streaming into the PDA at a constant or same set time interval, and which is determined by the characteristics of the input data. In the case of EEG data, a preferred data sampling time interval of 20 seconds was determined, with a baseline data recording time of 60 minutes of user EEG data being used. An initial data analysis window (L) of 20 minutes was also selected to provide a 15 minute prediction target which falls within the window. The selected baseline data points for the entire 60 minute baseline data series may thus be represented as:
S
N=(x1,x2, . . . xN)
[i.e. in this embodiment xN is data point 180]
To predict future events from EEG data, a first subset of data over the first initial 20 minutes (SL) is chosen, and the data is preferably first transformed into a transformed data set SmN that is used in the data prediction model. The data transformation developed for EEG data to create a transformed data set, where the predictive model described can be applied is outlined in accordance with the following process, and wherein
S
L=(x1,x2, . . . xL)
[in this example xL is data point 60]
In a preferred operating mode in accordance with another aspect of the invention, the processor is operated to transform the collected EEG data value readings for the initial set sampling period SL=(x1, x2, . . . xL) for the data set over the selected recording period L to a single V value, V(SL), using one or more of Fractional Dimension, Lyapunov, and P&H. In particular, V(SL) is a measure of chaos in sample SL and is plotted a new value V(SL)=yL at time line point L. The data window “L” is then moved one data point right to form the new data set S(L+1)=(x2, x3, . . . x(L+1)). A new V(S(L+1)) is calculated as before, and the V(S(L+1)) value is set y(L+1) is plotted in the time line at position (L+1). This is then repeated until all the remaining EEG data points in the baseline data set SN are converted, providing new transformed data set of transformed data points (y) from time point L to N. This then provides the new transformed value time series for time points “L” through “N” as follows”
S
(LN)=(yL,yL+1,yL+2, . . . yN)
[in this embodiment SLN contains 120 data points from data points 60 to 180]
The transformed value time series (S(LN)) is then used as the data series upon which the predictive model can be applied.
Further each new EEG reading preferably goes through a corresponding transformation (from point xN+new to yN+new) before that data point is used as part of the time series upon which the predictive model is applied. The next EEG read data point therefore becomes xN+1 which is in turn transformed as above to become the new updated times series ending in data point yN+1 in the data series.
The prediction is carried out as follows on the new transformed data series SLN.
Pos(yN+i)={yjN+i,1≦j≦Nrand.}
S
j
N+1
=S
N+i−1
+y
j
N+1
={y
L
,y
L+1
,y
L+2
, . . . y
N
,y
j
N+i}
j
min=arg minj(|V(SmN+i−1+yjN+i)−V(SmN)|)
In another aspect the present invention resides in a seizure monitoring system having a signaling mechanism for providing a user with advance warning of a predicted epileptic seizure event, the system further comprising: a sensor assembly having a sensor operable to sense and output a plurality of data signal values representative of the user's electroencephalographic (EEG) activity over a monitored period of time as sensed data, a computing device having a processor and memory, the computing device electronically communicating with said sensor assembly for receiving the sensed data, the processor including program instructions operable to:
A. select and store in said memory said data signal values at a plurality of equally spaced time intervals over said monitored period as a first time series data sequence, and which for example maybe used to initiate the system and establish constants to be used in predicting future data points;
B. compute an initial base-line non-linear measure values to create an initial time series of data points (SN) which are then transformed into a new data time series (SmN) sequence using fractal dimension, Lyapunov exponent and/or P&H.
The system is preferably operable to output by a signaling device an alarm and/or warning signal to warn the user of the likelihood of an epileptic seizure or other medical event. While the foregoing describes an initial 60 minute monitoring period, and a 20 second data sampling internal interval window L selected at 20 minutes, longer and shorter monitoring interval windows, and data sampling intervals may be used.
In a further aspect, the present invention resides in a method of using an EEG monitoring device and/or system for providing a user with advance warning of an epileptic seizure event, the system comprising: a sensor assembly having a sensor operable to sense and output a plurality of data signals representative of the user's electroencephalographic (EEG) activity, a computing device having a processor and memory, the computing device electronically communicating with said sensor assembly for receiving the sensed data. The sensor assembly is operable, sensing and continuously outputting to said computing device data signal of said user's EEG activity over an extended period of time. Preferably, data signal values sampled and recorded are stored in the memory plurality of equally spaced time intervals over said initial monitored period as an initial time series data sequence. Where values in the predicted time interval of the new time series data sequence has at least one, and preferably 3 or more consecutive values that exceed a preselected threshold value and preferably in the case of Tonic-clonic seizures, a standard deviation value of 2.4, output by the signaling mechanism a warning signal to the user indicative of the likelihood of said epileptic seizure event.
In one preferred aspect, the forward prediction limit is one-third the baseline recording period N/3, and preferably should be less than the time interval L and at least a period of about 16 minutes.
The applicant has appreciated by setting a predetermined threshold value of transformed EEG signal readings, the PDA may advantageously compare the data value which is projected to occur in the future with the predetermined threshold value. If the projected value equals or exceeds the predetermined value for a selected number (i.e. 3 to 6 and preferably about 3) consecutive points, the PDA operates to provide the user with an advance warning that an epileptic event is likely to occur. In such cases, the advance warning will occur less than the predicted time interval in the future. In a simplified mode of operation the PDA operates to provide an audible and/or visual warning to the wearer and/or a third party. In an alternate configuration, the PDA may be operable to provide the user with a physical warning signal, such as a vibration or sensory warning and/or a countdown clock, counting down the time remaining to the likely epileptic event, to better assist the wearer in curtailing potentially hazardous activities and/or allowing him or her to relocate to a safe environment.
In addition to the above discussed general aspects, the invention further provides for various preferred, non-limiting aspects, and which include:
1. A device, system and/or method in accordance with any of the aforementioned aspects wherein the processor is operable or further includes program instructions to change baseline reading interval; EEG electrode used for data source; data sampling interval; algorithm used to set V values; number of random numbers generated for predicting new values; restart system; etc.; with build in diagnostics of the system verifying that all aspects of the system are functioning properly and providing user a green light signal confirming same. In the event that the system encounters a functional problem the user is alerted both by visual signal and audible signal of system malfunction along with screen displays as to the nature of the malfunction.
2. A system and/or method in accordance with any of the aforementioned aspects wherein the fixed data sampling interval is consistent and is selected based on data being used and desired prediction period into the future. This may preferably range between about 1.0 second to hours and/or days.
3. A system and/or method in accordance with any of the aforementioned aspects wherein the analytic data window/interval is selected with sufficient data points to provide sample size of statistical significance based on data being analysed. In the case of EEG reading a preferred window of about 20±5 minutes containing 60 data points was selected.
4. A device, system and/or method in accordance with any of the aforementioned aspects wherein the base data set from which to make predictions of future data points is at least two times the data window, more preferably at least three times the data window.
5. A device, system and/or method in accordance with any of the aforementioned aspects wherein the plurality of random data values used to predict the next data point and greater than 4, and more preferably from about 10 to upto several thousand, depending on required accuracy.
6. A system and/or method in accordance with any of the aforementioned aspects further including a random number generator for generating the random data values in the value range calculated for the data centered on the value of the last point in the series.
7. A system and/or method in accordance with any of the aforementioned aspects wherein a threshold value is established above which epileptic seizures were most likely to take place is selected at between about 1.2 and 4, and preferably about 2.4; and/or preferably where 3 to 10, and preferably 3 or 4 consecutive points are calculated as falling above the threshold value.
8. A system and/or method in accordance with any of the aforementioned aspects wherein said output signals comprise EEG readings over a plurality of constant time intervals, and the initial time period is selected as a time period consisting of one or more pre-seizure, seizure and post seizure events.
9. A system and/or method in accordance with any of the aforementioned aspects wherein the computing device comprises a personal digital assistant, and said signaling mechanism comprises at least one of a visual display and an audio output, and wherein the output warning signal comprises at least one of an audible signal emitted by said audio output and a visual signal visible on said visual display. The system further may include various wearable electronic components, including micro adhesive attached sensors, smart watches and/or glasses, and the like.
10. A system and/or method in accordance with any of the aforementioned aspects wherein the seizure event comprises a Tonic-clonic seizure.
11. A system and/or method in accordance with any of the aforementioned aspect, wherein the processor is operable to provide the wearer with a visual and/or audible indication which indicates either a percentage or relative likelihood and/or severity of the project seizure event. In a more preferred mode, the PDA may be operable to provide different coloured graphic warning, where for example a red warning signal appears where there is a high probability of a severe seizure, or a yellow warning indicator is provided where the risk of a seizure is moderate, increasing and/or decreasing.
12. A system and/or method in accordance with any of the aforementioned aspects, wherein in the event the predicted future event is determined from EEG signals to represent a critical value, as for example if determined to deviate from one or more previously selected and/or averaged levels of chaos of EEG signals by a threshold amount, the system may be used to effect the display of a warning signal of a likelihood of a seizure or other pending events; and/or further transmit audible or electronic instructions to the user to cease any potentially hazardous activities such as driving or machinery operation, and/or to relocate to a safe environment.
Reference may be had with the following detailed description, taken together with the accompanying drawings, in which:
Reference may be had to
As shown, the sensor 14 is adapted for placement in juxtaposed contact with a user's skull 18, and is operable to measure and record the electrical activity or electrical fluctuations of the user's brain. In one possible construction, the sensor assembly 12 may be provided as part of a smart glasses design, such as Google® glasses, or other such wearable technology. The sensor assembly 12 is operable to collect the EEG readings and wirelessly transmit them to the PDA 16 as a series of data readings or measurements taken over an initial sampling or monitoring period of from about ten to one-hundred and twenty minutes and preferably about twenty minutes. It is to be appreciated, that while
The PDA 16 is provided with an internal antennae (not shown) adapted to wirelessly receive signals from the sensor assembly 12, and includes an internal memory 20, a processor 22, an audio output speaker 24 and a visual display screen 26. As will be described, the audio output speaker 24 and display screen 26 are operable in the use of the system 10 to provide the user 8 with both an audio and visual warning of a predicted likelihood of an impending epileptic seizure or event.
In a preferred mode of operation, the sensor 14 is operated to collect and transmit to the PDA 16 the user's EEG data over an initial sampling or monitoring period as time series Tsample, where it is stored in the PDA memory 20 as measured relative EEG values. Preferably, EEG data is collected as a substantially continuous data file for the initial monitored period of time and thereafter, as for example is shown graphically in
As will be described, the processor 22 includes programme instructions which are stored in memory, and which are operable to identify any measure the threshold value of transformed EEG signal values. The determined threshold value is then stored in the memory 20 as a preset threshold value which, based on the transformed historical data, provides a value above which is indicative of the occurrence a seizure event.
Once the initial monitored data is input and stored in memory 20, the processor 22 is used to transform the sensed data into a series of data values taken at equally spaced time intervals (i.e. preferably every twenty seconds). In particular, as shown in
1. The smallest time interval taken in the illustrated time series is the sampling time between x1 and x2, and the horizontal distance/time between points is always equal.
2. In the data time series shown graphically, the point on the left is x1, with the final point or time interval on the right is xN (xN in the example shown in
Following the establishment of the initial measured time series data sequence SN=(x1, x2 . . . xN), over the sixty (60) minute period a first twenty (20) minute interval L of data (SL) is chosen SL (x1, x2, x3 . . . xL).
A non-linear measure value V(SL) is then determined for the measured time series data sequence (SL) as a reference value. Preferably, the processor 22 is used to calculate the non-linear reference value for the series SL using one or more of “Fractal Dimension” or “Lyapunov” or “P&H”. The value V(SL) represents the measure of chaos in sample SL is plotted as a new value V(SL)=yL at time point L. As such, for the first initial interval V(S1)=y1.
The data window L is then moved to the right one data point. A next time series for a next interval SL+1=(x2, x3 . . . xL+1) is then chosen and a non-linear measure V(SL+1) computed using fractal dimension, the P&H value and/or Lyapunov exponent to generate a new V(SL+1) [i.e. V(S2)]yL+1 value [i.e. y2].
The process is then repeated for all of the remaining “x” values in the measured time series SN to generate a transformed data time series SLN (yL, yL+1 . . . yN) For the initial transformed data series SmN, possible mapping may be required, forming the new time series SmN={yL, yL+1, . . . yN}:
y
i
=V(Si−L+1,i),L≦i≦N where Si−L+1,i={yi−L+1,yi−L+2, . . . yN};
otherwise SmN=SN
where 0<L<N is the size of a sliding window used to compute the local level of chaos measured by V( ). Therefore, when the mapping is applied, the new considered time series SmN corresponds to the variation in time of the local non-linear measure in the initial time series SN.
V(SmN) is then determined as a reference value that will be used for predicting the next k values of the time series:
y
N+i,1≦i≦k.
As will be described, based on the historical data collected during the initial 60 minute monitoring period Tsample, the processor 22 is operable to read actual EEG then transform these values into values upon which future data values can be predicted. These predicted values may then be compared against the preselected threshold value to identify a likely epileptic event. In a simplified embodiment of the system 10, the PDA 16 outputs to the user 8 an alert signal or other identifier on the PDA speaker 24 and/or display 26, and preferably if three or more consecutive predicted future values exceed the preselected threshold value. Preferably, the PDA 16 is operable to provide a different warning or visual output signals to the user 8. Output warning signals may vary depending upon the resultant value of the predicted from the transformed EEG readings, and as it may relate to the probability of the seizure.
In a preferred operating mode, following the determination of the transformed time series SmN, a reference non-linear measure value of the transformed time series data V(SmN) is determined using Lyapunov exponent, P&H and/or fractural dimensions. The PDA processor 22 analyzes the transformed series SmN=yL, yL+1, . . . yN taking the value difference between yL+1 and yL+2, yL+3 and yL+4, (
Preferably future time periods TL, TL+1, TL+2 . . . TN are chosen as equal constant intervals of time (s) over the same selected duration of between 5 and 60 seconds, and preferably about 20 seconds.
The processor 22 operates to generate and output predicted future data values, at such time periods based on transformed data values that are used to create predicted values for the time interval (xN+1) at point in time in the future, and preferably over a predicted future period of up to one third of the time covered by the measured baseline historical data points. Processor 22 performs a complex time series prediction based on an optimization process, whereby the processor 22 analyzes EEG data characteristics of the transferred time series SmN, and generates successively new predicted values yN+1 at successive points in time in the future, as continuing predicted time series. Further, as each new predicted data point is (yjN+i) generated, the processor 22 effects Lyapunov weighing and/or P&H methodology and/or fractal dimension to minimize the difference between the characteristic of the predicted new time series and the initial one.
A most preferred method for long-term time series prediction is shown graphically in
In particular, the parameter a of the normal distribution N(yi,σ2) 1≦i≦k of the transformed time series SmN is computed by computing the variation between every two consecutive values (i.e. yi to yi+1). This distribution represents the distribution of probability of value of yi, knowing yi−1 (
Next, the processor 22 is used to generate randomly a number of potential predicted future values for the next time interval. The processor 22 preferably operates to generate, at least five to thirty, and preferably about ten new random values. In a simplified mode, random numbers are generated by way of a random number generator program for the next and as well be described, each subsequent time interval (yN+i+1) to be evaluated at the next and each subsequent point to be predicted for time TN+1+1. For predicting yN+i+1 Pos(yN+i+1), each randomly generated valve of the set of r random values, are plotted following the normal distribution N(yN+i, σ2) (
Pos(yN+i+1)={yjN+i+1, 1≦j≦Nrand} is a parameter that can impact on the quality of the prediction, since having more values will increase the chance of finding an optimal value. However, it has been shown that in the analysis of EEG data significant improvement was not observed for the data when r was greater than ten.
For each of the random data values generated yjN+i+1, an associated extended generated time series is created (SmN+i+1=(yL, yL+1 . . . , yN, yN+1, . . . , yjN+i+1). The extended generated time series sequence in then used to compute an associated non-linear measure value V(SmN+i+1) using fractal dimension, P&H method and/or Lyapunov exponent. As such, for each separate data set containing each ten predicted point generated by the random number generator, a new “V” (i.e. V1, V2 . . . V10) value is established using the data sequence (yL, yL+1 . . . yN+i, yjN+i+1), where yjN+i+1 is one of the r new points generated by the random number generator.
The generated time series sequence having the associated non-linear measure value (V1, V2, V3 . . . V10) closest to the reference value V(SmN) is then chosen as the predicted next time series data sequence SmN+i+1=(yL, yL+1 . . . yN+i, yN+i+1). Further, the random data value yjN+i+1 for the selected next time series data sequence is assigned as the predicted data value for the next time interval TN+i+1.
yN+i+1 is thus computed by:
j
min=arg minj(|V(SmN+i−1+yjN+i)−V(SmN)|) with (SmN+i−1+yjN+i={y1,y2, . . . ,yN+i−1,yjN+1})yN+1=yjminN+i
[yjmin=minimum variance value between reference values and V values]
The value yjN+ii is chosen to make V(SmN+i+yjN+i+1) as close as possible to V(SmN).
Preliminary testing suggests that the present method and system may achieve a high degree of accuracy in providing epileptic patients with advance warning of the likely onset of a seizure.
In preliminary testing, 21 patients diagnosed with epilepsy were monitored. In particular, EEG (electroencephalography) data from each patient was acquired using a Neurofile NT™ digital video EEG system with 128 channels, 256 Hz sampling rate, and a 16 bit analogue-to-digital convert. For each of the patients, there were datasets celled “ictal” and “interictal”. As shown in
To evaluate performance of new method on prediction of epileptic seizure, the EEG time series measured by five electrodes, generating five different time series, for 21 patients were examined. For each EEG time series, the exact time of seizure was known and recorded. The P&H chaoticity values were predicted using GenericPred. The P&H chaoticity values were calculated on a constant-length (20 minutes) sliding window, with sliding time intervals of 20 seconds, of the EEG time series. During seizure, a peak in P&H values obtained from EEG time series appears. Based on the analysis of all 21 patients, a threshold for prediction of seizure determined at a preselected P&H value equal to 2.4 or greater (see
The selection of 60 minutes of baseline data and the 20 minute time interval L were selected based on an expectation of reasonable values that would fit the case being evaluated, and were not provided as fixed values to be used in every application. As such, larger or shorter baseline and/or time interval data may be used.
It is to be appreciated that establishing “V” values using “Fractal Dimension”, Lyapunov” and/or “P&H” are based on what is more appropriate for the application. It may also be acceptable to calculate “V” values using a combination of values of two or more such methods (“Fractal Dimension”, “Lyapunov” or “P&H”).
Using the P&H threshold value, the current method was shown to predict future epileptic seizures with a high degree of sensitivity and specificity up to 17 minutes in advance (see Table 1 below). Further, different ranges of EEG time series were considered before and after seizure (we considered 10 ranges during seizure-free part of EEG time series for each patient) and there was no peak predicted by the current method in any case.
For each patient, one positive and 10 negative samples were constructed. The positive sample contains one epileptic seizure event, and the 10 negative samples are seizure-free. Therefore, there are 21 positive and 210 negative samples in total that were used to compute the specificity and the sensitivity accuracy levels.
The same results were obtained by considering the data of any five electrodes independently. This is believed to represent an improvement over other predictive method, which typically achieves accuracy levels of 73% sensitivity and 67% specificity for 10 patients within a 1-19 minute range.
It is not anticipated that the current method will provide 100% sensitivity and specificity in all instances. Preliminary testing has, however, suggested that the system and method of the present invention shows strong promise in providing a good indicator of the likelihood of the onset on an epileptic event.
Although the detailed description describes the current method and system as most preferably being used for predicting epileptic seizures, the current system shows promise for a wide variety of different applications.
In an alternate mode, the method of the present invention may be used to predict the possible onset of a heart attack or stroke, as for example, by assessing chaotic variability of blood pressure changes, heart beat or heart arrhythmia. In yet another embodiment, the system 10 may be adapted for use as a medical warning device, as a predictor for the likelihood of the onset of seizure heart attack. The system 10 may include electro-cardiogram (ECG) in place of electroencephalogram (EEG) sensors to provide data representative of a patient's heart palpitations or arrhythmia over a historical or monitored time period.
In yet a further alternate possible application, the system 10 may be used as a predictor for future angina attacks. In particular, a patient's blood pressure data may be monitored over a selected period of time and input into the processor memory 20. By the aforementioned process, the processor 22 is activated to identify the future times where a potentially critical high blood pressure event is likely, and which correlates to a patient angina attack.
Again, on predicting the possible onset of such an occurrence, the system 10 could be used to provide either visual or audible warning to a user or medical practioner via the display 26. Alternately, if provided as part of an automatic drug dispensation system, the processor 22 could be used to output control signals to effect an adjustment of a pacemaker or an automated drug dispensation apparatus to alter the medical dosage of a patient's heart medication in anticipation of the possible angina event.
In a further non-limiting embodiment, the system 10 may be used to establish predictive environmental models. In one embodiment, data representing past measured amounts of vegetative growth of a particular plant or algae may be input for a selected historical time period. Using the foregoing method, the processor 22 may provide output data which is predictive of when a selected plant species may dominate or be subordinated relative to other species within a particular geographic area.
It is noted that establishing “V” values using “Fractional Dimension”, “Lyapunov” or “P&H” are based on what is more appropriate for the application. It may also be acceptable to calculate “V” values using the averaged values of one, two or all of these methods.
Each of the new non-linear data values (V1, V2 . . . VN) are compared with the earlier calculated (VmN) reference value, and generated time series with the closest corresponding value is selected, with its associated random data value chosen as the prediction for the next predicted time interval value in the time sequence. Using the generated time series sequence, the next subsequent predicted data value is determined by repeating steps of randomly generating and selecting data points by their approximation to the initial reference value. The process calculations may continue to be used to generate new predicted data values or points. Most preferably, number of new data points created in the sequence does not exceed one third of the total number of historic data points (N/3).
As a result, with the present method historical data may be rapidly updated. Most preferably, instead of making a shift of N data points at a time, a shift of a single data point is undertaken. That means that just one new real point value is measured (N+1) and then the new historical data to be taken into account are (2, 3, . . . , N+1), and the new prediction begin at N+2.
In the preferred mode, the reference value is always V(SmN), and which is obtained based on the value of the transformed non-linear data series from the original time interval. Therefore, with to the present method it is advantageous to keep the value of a transformed non-linear measure steady as much as possible during prediction (see
With the current system 10, prediction is performed using the complete time series whereas, in traditional approaches, after computation of the model, prediction is performed only using the model and no longer the original time series. Therefore, the current model allows for constant adjustment of information about the current time series, whereas classical predictive methods apply the model without taking into account the accordance between the original time series properties and the predicted ones. Moreover, the optimization step allows making choice among a set a potentially good predictive values, compared to the traditional models which only generate one value. Another advantage of the present invention is that it does not rely on a complex model of the original time series and it is therefore very general. Having no specialized model for prediction makes new method less restricted to a specific domain.
The system 10 most preferably incorporates built-in diagnostics software operable to verify that all aspects of the system 10 are functioning properly, and outputting via the display a green light signal confirming same. In the event that the system 10 encounters a functional problem, the user is alerted both by visual signal and audible signal of system malfunction, along with screen display as to the nature of the malfunction.
The present method shows a strong improvement compared to traditional methods over different situations and other chaotic time series in term of accuracy both for short and long term prediction. Moreover, the present method shows ability to predict the trend of evolution of other chaotic time series is much better than those of existing methods. Its performances are also more stable, with a standard deviation of the error measure appearing lower than those of the other methods. The method provides step toward an accurate and comprehensive time series long-term prediction.
It should be noted that preferred embodiment of the present method is not customized for a specific application, using a similar non-linear criterion may have the same function for a variety of applications. Further, by involving knowledge from other fields, it may be possible to provide a universal method for predicting a variety of non-linear time series. In another embodiment, the present method could utilize several non-linear measures simultaneously, instead of using just one measure, to identify and preserve the complexity of time series more efficiently.
Although the preferred embodiment describes the system and process for use in the predictive analysis of epileptic seizures, it is to be appreciated that the present process and system is equally applicable across a number of other possible applications. Such applications could include without restriction, applications in predicting macrogeographic events and trends; the predictive modeling of pandemics and pathogenic outbreaks; weather and meteorological modeling; and/or earthquake and geological event modeling. In addition, the system and method may further be used in the prediction and/or analysis of other complex data of non-linear events, including heart attack and/or stroke, as well as part of a health monitoring or warning system to provide an advance indication of other types of likely health events.
Although the disclosure describes and illustrates various preferred embodiments, the invention is not so limited. Many modifications and variations will now occur to persons skilled in the art. For a definition of the invention, reference may be had to the appended claims.
This application claims priority to and benefit of 35 USC §119(e) of U.S. Provisional Patent Application Ser. No. 62/042,535, filed 27 Aug. 2014, the disclosure of which is hereby incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CA2015/000476 | 8/26/2015 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62042535 | Aug 2014 | US |