The present invention is in the field of methodologies, systems, computer program products, and universal models for predicting glucose concentration in humans.
Within this application several publications are referenced by Arabic numerals within brackets. Full citations for these, and other, publications may be found at the end of the specification immediately preceding the claims. The disclosures of all these publications in their entireties are hereby expressly incorporated by reference into the present application for the purposes of indicating the background of the present invention and illustrating the state of the art. If however there are any conflicts between this disclosure and text incorporated by reference, then statements made in this document control and supersede the incorporated teachings.
Minimally invasive continuous glucose monitoring (CGM) devices are instruments utilized to measure and record a patient's glycemic state as frequently as every minute [1]. This information can be utilized to alter or improve the patient's lifestyle, to tighten their glycemic control, or to adjust therapy. These frequent measurements can also be used by data-driven models to forecast future values of subcutaneous glucose concentration and avoid undesired hypoglycemic or hyperglycemic episodes [1]-[4].
In contrast to intermittent measurements, CGM devices collect information frequently such that consecutive measurements retain a large degree of temporal correlation. This correlation is exploited by data-driven models to infer future values as a function of previous measurements [2]-[4]. However, because of the availability of glucose signals at high sampling rates, developers of data-driven models often implicitly assume that the models need to be tuned for a specific individual, thus increasing the burden of model development and reducing their practical applicability. For example, Sparacino et al. [3] uses an autoregressive (AR) model of order one, AR(1), which continuously adapts the model coefficients to the monitored individual to predict future glucose concentrations up to 30 minutes from the time of prediction. Although such a model can produce acceptable predictions, it needs to be continuously adapted for every individual. Additionally, in spite of the adaptive nature of the model, it introduces a significant delay between predicted and measured values. This delay is caused by the low order of the AR model, because a single AR model coefficient is not sufficient to capture the temporal variations of the time-series glucose signal. In another example, Dua et al. [4] employs a Kalman filter to predict future blood glucose levels by continuously adjusting parameters of a first-principles model. Although the first-principle model is significantly more flexible than the AR(1) model of Sparacino et al., the continuous adaptation also makes the Dua et al. model individual specific.
At least one embodiment of the invention provides a universal, data-driven model developed based on glucose data from one diabetic subject, which is subsequently applied to predict subcutaneous glucose concentrations of other subjects, even those with different types of diabetes. Three separate studies, each utilizing a different CGM device, were used to verify the model's universality. Two out of the three studies involved subjects with type 1 diabetes and the other study was for type 2 diabetes. The subcutaneous glucose concentration data are filtered (i.e., smoothed) by imposing constraints on their rate of change. Using the filtered data, data-driven autoregressive (AR) models of order 30 are developed and utilized to make short-term, 30-minute-ahead glucose-concentration predictions. Same-subject model predictions are utilized as a reference for comparisons against cross-subject and cross-study model predictions, which are evaluated using the root mean squared error (RMSE). For each studied subject, the average cross-subject and cross-study RMSEs of the predictions are small and indistinguishable from those obtained with the same-subject models. In addition, the predictive capability of the models is not affected by diabetes type, subject age, CGM device, and inter-individual differences. Thus, a stable, universal glucose models is developed that captures the invariant correlations in time-series signals of diabetic patients.
An embodiment of the invention provides a method for predicting at least one future glucose level in an individual. The method receives glucose signals from a glucose measuring device, wherein the glucose signals represent glucose levels obtained from an individual at fixed time intervals. The glucose signals are converted into numerical values representing the glucose levels obtained from the individual. The glucose signals and/or numerical values are stored in a memory unit housed in the glucose measuring device. In another embodiment, the memory unit is external to the glucose measuring device.
The method predicts one or more future glucose levels of the individual by weighing the glucose signals by model coefficients of a glucose prediction function. Weighing the previous glucose signals of the individual by the model coefficients reduces a time lag of the predicted future glucose levels. In at least one embodiment, the predicting of the future glucose level is performed with a processor (or programmable data processing apparatus) having code to perform calculations of the glucose prediction function. The glucose prediction function is a universal autoregressive model that is portable between individuals irrespective of health of the individuals. The health of the individual includes a diabetes type of the individual, age of the individual, and/or whether the individual is hospitalized. Moreover, the model coefficients are invariant between the individuals irrespective of the type of the glucose measuring device utilized to measure the glucose signals.
In addition, the method displays the predicted future glucose levels on a display and generates an alert when the future glucose level of the individual exceeds an upper glucose threshold and/or falls below a lower glucose threshold.
A method according to another embodiment of the invention obtains first glucose measurements (i.e., training data) via a glucose monitoring device. Current glucose levels are monitored at fixed time intervals in a plurality of individuals having type I and type II diabetes (i.e., test subjects). A programmed processor uses a portion of the first glucose measurements to train a glucose prediction function that is portable between individuals. The training of the glucose prediction function is independent of the type of glucose measurement device utilized to obtain the first glucose measurements, the ages of the individuals, and whether the individuals are hospitalized. The training creates model coefficients that are invariant between the individuals.
The method obtains second glucose measurements from the individual using the type of glucose monitoring device utilized to obtain the first glucose measurements, or using a type of glucose monitoring device that is different from the type of glucose monitoring device used to obtain the first glucose measurements. The glucose prediction function is used to predict future glucose levels in the individual. The predicted glucose levels represent glucose levels at least 5 minutes into the future, i.e., 5 minutes from the time that the second glucose measurement is obtained from the individual. Specifically, the model coefficients of the glucose prediction function are multiplied by the second glucose measurements obtained from the individual. Because the model coefficients are invariant between individuals, the predictions are independent of the type of glucose measurement device utilized to obtain the first and second glucose measurement. The predictions are also independent of the diabetes type of the individual, the age of the individual, and whether the individual is hospitalized. The glucose prediction function reduces a time lag of the future glucose levels.
Another embodiment of the invention provides a system for predicting future glucose levels in an individual. A glucose measuring device generates glucose signals representing glucose levels obtained from the individual at fixed time intervals. In at least one embodiment, a memory unit is housed in the glucose measuring device for storing the glucose signals.
A programmed processor housed within the glucose measuring device converts the glucose signals into numerical values representing the glucose levels obtained from the individual. The processor is programmed with a glucose prediction function that is portable between individuals irrespective of health of the individuals. The health of the individual includes the age of the individual, the diabetes type of the individual, and whether the individual is hospitalized. In at least one embodiment of the invention, the glucose prediction function is a universal autoregressive model.
The glucose prediction function includes model coefficients that are invariant between the individuals irrespective of the type of the glucose measuring device utilized to measure the glucose signals. The processor selects the model coefficients based on the sampling rate of glucose measuring device utilized to obtain previous glucose signals from the individual. The glucose prediction function outputs the future glucose levels by weighing the previous glucose signals obtained from the individual by the model coefficients.
The system further includes a display connected to the processor for displaying the future glucose levels. A threshold detector is also provided for generating an alert when a future glucose level of the individual exceeds an upper glucose threshold and/or falls below a lower glucose threshold.
A system according to yet another embodiment of the invention includes one or more glucose measuring devices for measuring current glucose levels in humans. One or more first types of glucose measuring devices are utilized to measure glucose levels from individuals (i.e, test subjects) at fixed time intervals (first output). A second type of glucose measuring device is utilized to measure glucose levels from the individual (second output). In at least one embodiment, the second type of glucose measuring device is different from the first types of glucose measuring devices.
The individuals from which the first output is obtained include individuals having type I and type II diabetes, individuals that are hospitalized, and individuals that are not hospitalized. The individuals range in age from 3 years old to 70 years old. In at least one embodiment of the invention, the average age of the individuals is different from the age of the individual (from which the second output is obtained).
A processor trains a glucose prediction function using the first output from the glucose measuring device. The glucose prediction function is a universal autoregressive model that is portable between individuals. The glucose prediction function includes model coefficients that are invariant between individuals.
In another embodiment, an analyzer uses the trained glucose prediction function and current output from the glucose measuring device to predict the future glucose levels in the individual. The predicted glucose levels represent glucose levels at least 5 minutes into the future, i.e., 5 minutes from the time that the second glucose measurement is obtained from the individual. Because the model coefficients are invariant between individuals, the glucose prediction function predicts the future glucose levels independent of the age of the individual, the diabetes type of the individual, and whether the individual is hospitalized.
The present invention is described with reference to the accompanying drawings. In the drawings, like reference numbers indicate identical or functionally similar elements.
Exemplary, non-limiting, embodiments of the present invention are discussed in detail below. While specific configurations are discussed to provide a clear understanding, it should be understood that the disclosed configurations are provided for illustration purposes only. A person of ordinary skill in the art will recognize that other configurations may be used without departing from the spirit and scope of the invention.
An embodiment of the invention utilizes similarities in the short-term (30-minute or less) dynamics of glucose regulation in different diabetic individuals to develop a single, universal autoregressive (AR) model for predicting future glucose levels across different patients. Data are collected from three different studies, involving subjects with both type 1 and 2 diabetes and using three different continuous glucose monitoring (CGM) (or glucose monitoring device) devices: iSense (iSense Corporation, Wilsonville, Oreg.), Guardian RT (Medtronic Inc., Northridge, Calif.), and DexCom (DexCom Inc., San Diego, Calif.). Data-driven AR models of a fixed order are developed for each subject; and, the AR models are tested on data from other subjects from the same and from different studies. The RMSE and prediction time lag are used as metrics to quantify the models' performance; and, the resulting AR coefficients from the different models developed for each subject are compared.
The developed AR models (i.e., the AR model coefficients) are not significantly dependent on a given individual, diabetes type, age, or CGM device. Thus, universal, individual-independent predictive models are developed, which reduces the burden of model development as one model can be used to predict future glucose levels in any individual using any CGM device. Such predictive models are utilized together with CGM devices for proactive regulatory therapy.
An embodiment of the invention provides a system for predicting future glucose levels in an individual. The system includes a glucose monitoring device for obtaining time-series data representing glucose levels measured at fixed time intervals from an individual patient. The time-series data is input into a universal AR model having a plurality of model coefficients. As described more fully below, the model coefficients are invariant among patients (i.e., patient/individual independent). In predicting future glucose levels, the model coefficients weight the importance of the previously measured glucose levels (e.g., a more recent measurement may be more important than an older measurement). Thus, each of the measured glucose levels input from the glucose monitoring device is multiplied by a respective model coefficient of the AR model. The models of the embodiments herein use the invariant model coefficients to develop a universal AR model that is portable from individual-to-individual.
The invention in at least one embodiment provides a prediction of a future glucose level. This embodiment uses a desired prediction horizon time for determining the number of times the model is used to process a sliding window of predicted and real glucose levels that advances one sample period per iteration. Each advance removes the oldest glucose level and slides the remaining glucose levels to the next coefficient.
The method predicts the individual's future glucose levels by weighing the stored glucose signals by model coefficients of a glucose prediction function (120). The predicting of the future glucose levels is performed with a processor having code to perform calculations of the glucose prediction function.
The glucose prediction function is a universal autoregressive model that is portable between individuals irrespective of health of the individuals. The health of the individual includes a diabetes type of the individual, age of the individual, and/or whether the individual is hospitalized. As described more fully below, the glucose prediction function in at least one embodiment is trained using test subjects that include children, adults, and the elderly having type I diabetes and type II diabetes. Moreover, the glucose levels of the test subjects were obtained using three different types of glucose measuring devices. Thus, the model coefficients of the glucose prediction function are invariant between the individuals irrespective of the type of the glucose measuring device utilized to measure the glucose signals.
In addition, the method displays the predicted future glucose levels on a display (130) and generates an alert (or other notification) when a future glucose level is predicted to exceed an upper glucose threshold and/or fall below a lower glucose threshold (140). As such, the method in at least one embodiment can be used to avoid hypoglycemic or hyperglycemic episodes. The predicted future glucose levels can be used to alter or improve the patient's lifestyle, to tighten their glycemic control, or to adjust therapy in a proactive manner before an episode occurs. As described more fully below,
A processor uses a portion of the first glucose measurements to train a glucose prediction function that is portable between individuals (120B). In at least one embodiment of the invention, the glucose prediction function is a universal autoregressive model. The training of the glucose prediction function is independent of the type of glucose measurement device utilized to obtain the first glucose measurements, the ages of the individuals, and whether the individuals are hospitalized. As described below in connection with a model training example, the glucose prediction function is trained using test subjects that included children, adults, and the elderly having type I diabetes and type II diabetes.
The training creates model coefficients that are invariant between the individuals. As described more fully below in connection with development of example coefficients for a 5-minute sampling period,
The method obtains second glucose measurements from the individual (130B). The second glucose measurements may be obtained using the type of glucose monitoring device utilized to obtain the first glucose measurements, or using a type of glucose monitoring device that is different from the glucose monitoring device used to obtain the first glucose measurements for training.
The glucose prediction function is used to predict future glucose levels in the individual (140B). The predicted glucose levels represent glucose levels at least 5 minutes into the future, i.e., 5 minutes from the time that the second glucose measurement is obtained from the individual. Specifically, the model coefficients of the glucose prediction function are multiplied by the second glucose measurements obtained from the individual. As described below, for example, for a glucose prediction function of order 30 and a 5-minute sampling interval, the most recently measured glucose level {tilde over (y)}n-1 obtained 5 minutes ago is weighed by the first model coefficient bn-1. Because the model coefficients are invariant between individuals, the predictions are independent of the type of glucose measurement device utilized to obtain the first and second glucose measurement. The predictions are also independent of the diabetes type of the individual, the age of the individual, and whether the individual is hospitalized.
The glucose prediction function reduces a time lag of the future glucose levels.
A processor 220 converts the glucose signals from the glucose measuring device 210 into numerical values representing the glucose levels obtained from the individual. In at least one embodiment, a memory unit 222 is housed in the processor 220 for storing the glucose signals. Although
The glucose prediction function includes model coefficients that are invariant between the individuals irrespective of the type of the glucose measuring device utilized to measure the glucose signals as described above and below.
The glucose prediction function outputs the future glucose levels by weighing the previous glucose signals obtained from the individual by the model coefficients. As described below, the model coefficients weight the importance of the previously measured glucose levels (e.g., a more recent measurement may be more important than an older measurement). Because the model coefficients describe the correlations in the time-series signal, their absolute values are a function of the sampling frequency of the data used to develop the model. The model coefficients are also dependent on the order of the model. Thus, models having different orders developed on glucose data sampled at different frequencies are expected to yield slightly different model coefficients. The combination of coefficients and order of the model dictate the accuracy of the glucose levels predictions.
The system 200 further includes a display 230 connected to the processor 220 for displaying the future glucose levels. A threshold detector 240 is also provided for generating an alert when a future glucose level of the individual exceeds an upper glucose threshold and/or falls below a lower glucose threshold. As such, the system 200 can be used to avoid hypoglycemic or hyperglycemic episodes. The predicted future glucose levels can be used to alter or improve the patient's lifestyle, to tighten their glycemic control, or to adjust therapy in a proactive manner. The system 200 in an alternative embodiment includes a receiver for communicating with the glucose measuring device 210 when the processor 220 and memory unit 222 are housed in an external unit separate from the glucose measuring device 210. This embodiment also allows the processor 220 to be used with different types of glucose measuring devices 210.
The analyzer 340 includes a glucose prediction function that processes the glucose signals (converted or unconverted) in order to predict future glucose levels across a prediction horizon. As described below, the prediction horizon may be input into the analyzer 340 by a user or retrieved from memory 370. In at least one embodiment of the invention, the glucose prediction function is optimized for predicting glucose levels 30 minutes into the future. In one embodiment, the signal converter 320 and the analyzer 340 are co-located in the same device. In another embodiment, the signal converter 320 and the analyzer 340 are integrally connected and present on the same processor or in circuitry.
The glucose prediction function is a universal autoregressive model that is portable between individuals irrespective of health of individuals. The health of the individual includes age of the individual, diabetes type of the individual, and whether the individual is hospitalized. The glucose prediction function includes a plurality of model coefficients that are invariant between individuals irrespective of a type of the glucose measuring device utilized to measure the series of glucose signals.
The glucose prediction function outputs the future glucose levels by weighing the current and previous glucose signals obtained from the individual by the model coefficients. As described more fully below, the glucose prediction function outputs a series of future glucose levels by omitting the oldest predicted or actual glucose level used in the last iteration of the glucose prediction function, multiplying a most recent predicted future glucose level by a first model coefficient, and multiplying a next most recent predicted or actual glucose level by a next model coefficient.
As illustrated in
Memory 370 is also included in the illustrative embodiment of
An embodiment of the invention provides a training system for predicting at least one future glucose level in an individual according to another embodiment of the invention. The system includes one or more glucose measuring devices for measuring current glucose levels in humans. One or more first types of glucose measuring device are utilized to measure glucose levels from individuals (i.e, test subjects) at fixed time intervals (first output). A second type of glucose measuring device is utilized to measure glucose levels from the individual (second output). In at least one embodiment, the second type of glucose measuring device is different from the first types of glucose measuring device.
A glucose prediction function is trained within the processor using the first output from the glucose measuring device. A filter is provided prior to or programmed into the processor for smoothing the first output. As described in more detail later, Tikhonov regularization which yields smoothed signals {tilde over (y)} by computing {tilde over (y)}=Udw, where Ud denotes the integral operator and w denotes estimates of the glucose signals' first derivatives. The estimates of the derivatives yield excellent data smoothing and do not introduce lag on the smoothed signal relative to the original raw signal.
The glucose prediction function is a universal autoregressive model that is portable between individuals. The glucose prediction function includes model coefficients that are invariant between individuals.
The training system includes, for example, a processor or an analyzer that uses the glucose prediction function and second output from the glucose measuring device to predict the future glucose levels in the individual. The predicted glucose levels represent glucose levels at least 5 minutes into the future, i.e., 5 minutes from the time that the second glucose measurement is obtained from the individual. Because the model coefficients are invariant between individuals, the glucose prediction function predicts the future glucose levels independent of the age of the individual, the diabetes type of the individual, and whether the individual is hospitalized.
Yet another embodiment of the invention provides a system for predicting future glucose levels, including means for receiving glucose signals from a glucose measuring device (e.g., a processor, an analyzer). The glucose signals represent glucose levels obtained from an individual at fixed time intervals (e.g., glucose measurements taken every 5 minutes or other sampling period). Means for storing the glucose signals is provided (e.g., a memory unit housed in the glucose measuring device). Means for converting the glucose signals into numerical values is also provided (e.g., a processor or analyzer with or without a filter being connected), wherein the numerical values represent the glucose levels obtained from the individual.
The system in at least one embodiment further includes means for predicting future glucose levels of the individual (e.g., an analyzer or a programmed processor including a computer). Specifically, the means for predicting future glucose levels performs a plurality of iterations of a glucose prediction function by iteratively weighing the glucose signals by model coefficients. The glucose prediction function is portable between individuals irrespective of the health of the individuals. Moreover, the model coefficients are invariant between the individuals.
The system includes means for generating an alert (e.g., a threshold detector with an alert feature) is also provided. The alert is generated when a predicted glucose level exceeds an upper glucose threshold and/or falls below a lower glucose threshold. Examples of the alert feature include an audio alarm, a vibration, a screen displaying or flashing an exemplary word notification, and/or other visual cue (e.g., a warning light).
An embodiment of the invention measures glucose levels in an individual at predetermined intervals to provide a moving window sample to be used to predict a future glucose level. The glucose prediction function is represented by
ŷ
n
={tilde over (y)}
n-1
b
1
+{tilde over (y)}
n-2
b
2
+{tilde over (y)}
n-3
b
3
. . . +{tilde over (y)}
n-m
b
m
where ŷn represents predicted glucose levels; {tilde over (y)}n-1 represents a previously observed glucose measurement; and, b1 represents a model coefficient. The order of the model is represented by m (i.e., 30 in the example embodiment below). Thus, {tilde over (y)}n-m represents the oldest observed glucose level used from the time series; and, {tilde over (y)}n-1 represents the last (or most recently) observed glucose level. The moving window sample will be of the last m readings received from the glucose measuring device. Each observed glucose level is then weighed (i.e., multiplied) by a respective model coefficient.
For example, if the current time is 12:00 pm, an AR model of order 30 taking glucose measurements in 5-minute intervals would need the first measurement ({tilde over (y)}n-30) at 9:30 am. Twenty-nine other measurements are taken until the most recent measurement ({tilde over (y)}n-1) is taken at 11:55 am. In order to predict a future glucose level at 12:00 pm, the thirty glucose measurements ({tilde over (y)}n-1-{tilde over (y)}n-30) are weighed by respective model coefficients (b1-b30). For instance, the most recent measurement {tilde over (y)}n-1 is multiplied by b1. In order to predict a future glucose level at 12:05 pm, the model weighs the twenty-nine most recent actual glucose measurements ({tilde over (y)}n-1-{tilde over (y)}n-29) by respective model coefficients (b2-b30) and the predicted future glucose level at 12:00 is weighed by model coefficient b1. Similarly, to predict a future glucose level at 12:10 pm, the model weighs the twenty-eight most recent actual glucose measurements ({tilde over (y)}n-1-{tilde over (y)}n-28) by respective model coefficients (b3-b30), the predicted future glucose level at 12:00 is weighed by model coefficient b2, and the predicted future glucose level at 12:05 is weighed by model coefficient b1.
In another example, if the model (or prediction function) provides a prediction of the glucose level in the future using an order of 30 with a sampling frequency of 5 minutes, the oldest observed glucose level will have been observed 150 minutes earlier (or at time equal 1 minute (i.e., {tilde over (y)}n-m) if the current time is the 146th minute of the sampling) is weighed (i.e., multiplied) by model coefficient b30 (i.e., bn-m). The observed glucose level taken 20 minutes ago ({tilde over (y)}n-4) is weighed by model coefficient b4; and, the observed glucose level taken at 5 minutes ({tilde over (y)}n-1) is weighed by model coefficient b1.
For a model using an order of 30 with a sampling frequency of 1 minute, the glucose prediction function would become
Y(30)=y(29)b1+y(28)b2 . . . +y(0)b30
where y(29) is the measurement taken at time 29 minutes, i.e., 1 minute ago; y(0) is the measurement taken at time 0 minutes, i.e., 30 minutes ago. To predict the glucose level in 60 minutes at Y(60), then 30 iterations of the equation above are required. For example, if the time is 12:30 pm, in order to predict a future glucose level at 1:00 pm using a model of order 30 and a sampling frequency of 1 minute, the model requires predicted glucose values for every minute between 12:30 and 12:59. However, 30 iterations of the equation are required to predict the future glucose value at 12:59. The above is an example of the functional processing performed by the means for predicting or suitably programmed processors, integrated circuits, chips, or computers.
The dataset from the Guardian RT study was retrieved from the Diabetes Research in Children Network (DirecNet) Web site, which makes continuous glucose data for six different studies involving children with type 1 diabetes publicly available, along with the corresponding protocols. Data was obtained from the DirecNet study entitled “A Pilot Study to Evaluate the Navigator Continuous Glucose Sensor in the Management of Type 1 Diabetes in Children,” which included 30 subjects. Subjects were included if they were between 3 and 7 years old or between 12 and 18 years old, had been diagnosed with type 1 diabetes for more than one year, had been using an insulin pump, and had HbA1c≦10.0%. Subjects were excluded if they had significant medical disorder, had severe hypoglycemic event resulting in seizure or loss of consciousness in the last month, had used systemic or inhaled corticosteroids in the last month, or had cystic fibrosis. Subjects were provided with the Guardian RT CGM system for home usage, which collected subcutaneous glucose concentration every 5 minutes for six days. 12 out of the 30 subjects were excluded from the training data because they did not possess consecutive 4,000-minute segments (i.e., 800 data points) without data gaps.
The DexCom study investigates the short- and long-term effectiveness and benefits of frequent CGM measurements versus infrequent CGM measurement (e.g., only before each meal and at bedtime, fingerstick blood glucose measurements). Seven subjects are studied, including an on-going investigation from an independent study. Subjects are included if they were older than 18 years of age, had been diagnosed with type 2 diabetes for at least three months and treated with insulin, and had HbAlc between 7% and 12%. Subjects are excluded if they had been taking glucocorticoids, amphetamines, anabolic, or weight-reducing agents. In addition, subjects were excluded if they were pregnant, lactating, or planning to become pregnant. Subjects continued to take all medications that had been prescribed for diabetes and other medical conditions, and followed their usual meal plans and activity schedules. Investigators of the DexCom study did not make any recommendations to the subjects regarding medications, weight, diet, or exercise at any time during the study. Subjects were instructed to contact their primary care provider for all treatment decisions and consultations. Subcutaneous glucose measurements with the DexCom CGM system were collected every 5 minutes for each of the seven subjects for approximately eight weeks on four two-week cycles.
A model was developed for each one of the 34 subjects that predicted their respective glucose concentrations for a future 30-minute period. To develop the models, glucose signals are obtained from one or more CGM devices. The glucose signals represent the glucose levels taken over a 4,000 minute period (i.e., 800 data points with a 5-minute sampling interval) from the 34 subjects. The glucose signals from each subject are filtered (i.e., smoothed) to remove high-frequency noise. The filtering constrains the glucose rate of change such that the first-order time derivative of the glucose signal is consistent with clinically observed values (i.e., ±0.2 mmol 1−1 min−1 (±4 mg d1−1 min−1)), while avoiding the introduction of time lags between the filtered and the original CGM signals.
An embodiment of the invention utilizes the Tikhonov regularization approach, which yields smoothed signals {tilde over (y)} by computing {tilde over (y)}=Udw, where Ud denotes the integral operator and w denotes estimates of the glucose signals' first derivatives. The estimates of the derivatives yield excellent data smoothing and do not introduce lag on the smoothed signal relative to the original raw signal. Through this approach, the first derivative or the rate of change of glucose in time is chosen to impose smoothness constraints in the glucose signal. In other words, the smoothed glucose signal {tilde over (y)} varies minimally from one value to another, thereby ensuring regularity in the underlying signal to be estimated.
To estimate the signal's derivatives w, the functional f(w) is minimized, given by
f(w)=∥y−Udw∥2+λd2∥Ldw∥2
where y denotes the N×1 vector of the raw CGM time-series signal, Ud denotes the N×N integral operator, w represents the N×1 vector of first-order differences (the rate of change of glucose with time), λd represents the data regularization parameter, and Ld denotes a well-conditioned matrix chosen to impose smoothness constraints on the derivative of the glucose signal.
For a chosen Ld, the quality of smoothing in the aforesaid formulation is determined solely by the regularization parameter λd. When λd=0, no regularization is performed, resulting in the original raw CGM data y. As λd increases, the solution w (and hence {tilde over (y)}) increasingly satisfies the imposed smoothness constraint, resulting, at the same time, in larger deviations from the raw data.
The first half of each subject's filtered data is utilized to develop an AR model. An AR model is a type of linear model that infers a future signal ŷn, at time n (n=m+1, N, where N denotes the total number of data samples available for modeling), based on a linear combination of antecedent samples {tilde over (y)}n-i weighted by a fixed set of coefficients bi,
where m denotes the order of the model, i.e., the number of previously observed and filtered glucose concentrations {tilde over (y)}n-i used to predict a future glucose concentration ŷn. This fixed set of coefficients bi, i=1, 2, . . . , m, which defines a model of order m, describes the correlations in the signal. The coefficients are calculated by the method of constrained least squares with an added smoothness constraint to insure physiologic plausibility of the obtained coefficients.
Accordingly, each AR coefficient bi reflects the degree of dependency between the corresponding previous sample {tilde over (y)}n-i and the predicted signal ŷn, providing a measure of the physiologic association of the time-series glucose data. Training of an AR model generates the coefficients b that best describe the dependencies in the entire time-series {tilde over (y)}. In the method of constrained least squares, b is estimated so that the functional ∥{tilde over (y)}−Ub∥2 is minimized, where U denotes the design matrix representing previous values of {tilde over (y)}.
For glucose concentrations to be predictable with AR models, the CGM data possesses “detectable structure” and the dynamics of the time series data is ideally stationary. By definition, a process is considered stationary when the sample mean and variance of the process measurements are constant with respect to time and the autocorrelation function (ACF) is independent of absolute time. Indication of the stationary nature of the underlying process is therefore sought before applying AR models.
To construct stable AR models, AR model coefficients are obtained through regularization. For a stationary process, the sequence of autocorrelation coefficients representing the ACF describes statistical dependencies between two measurements separated by fixed time intervals throughout the recorded observations. To force the AR coefficients to follow the same statistical dependencies of the ACF, a smoothness constraint is imposed on the method of constrained least squares of the coefficients bi, resulting in the regularized least squares functional g(b), given by
g(b)=∥{tilde over (y)}−Umb∥2+λm2∥Lmb∥2
where {tilde over (y)} denotes the (N−m)×1 vector of smoothed data, Um denotes the (N−m)×m design matrix, b represents the m×1 vector of regularized AR coefficients, λm represents the model regularization parameter, and Lm denotes a well-conditioned matrix chosen to impose smoothness on the AR coefficients. Accordingly, the minimization of the above formula results in regularized coefficients b.
Similar to the smoothing of the raw data, for a chosen Lm, the stability of the AR model in the above formulation is determined solely by the regularization parameter λm. When λm=0, no regularization is performed. As λm increases, the coefficients are constrained, resulting in more stable, regularized AR coefficients.
The optimal values of the regularization parameters, λd and λm, and the order m of the AR model are estimated. The optimum value of λd is found by minimizing the sum of the RMSE of the smoothed signal (i.e., the RMSE between the raw and the smoothed signal) and the RMSE of the prediction (i.e., the RMSE between the smoothed signal and its predictions). The RMSE of the smoothed signal is a monotonically increasing function of λd because the smoother the signal, the more it deviates from the original raw data. Conversely, the RMSE of the prediction is a monotonically decreasing function of λd because the smoother the signal, the more predictable it becomes. Therefore, by obtaining λd that minimizes the sum of these two RMSEs, a tradeoff between smoothness and predictability is effectively imposed, resulting in signals with good predictability without oversmoothing. λm is selected empirically and m through cross validation.
Once the coefficients are calculated, the models are subsequently used for predicting glucose concentrations, where model performance is quantified by computing prediction time lags and RMSEs. The RMSE is defined as the square root of the mean difference between the predicted signal ŷi and the filtered observed signal {tilde over (y)}i, i=1, 2, . . . , N,
and the prediction time lag is calculated based on the cross-correlation between the filtered and predicted signals. The lag, characterized by the peak of the cross-correlation function, provides an accurate estimate of the delay in the predictions.
In other words, model coefficients bi are derived from the training datasets of 34 subjects in three studies; and, because the derived model coefficients do not differ significantly from subject-to-subject (as demonstrated by the tightness and invariance of the line graphs in
The 34 subjects from the three studies are used to validate the model. The first 2,000 minutes of the filtered signals of each subject are used to train the AR models (training dataset) and the next 2,000 minutes are utilized to test the predictions (testing dataset). The three validation scenarios allow for the comparison of model performance on the same testing datasets by applying distinct models derived from different training datasets.
Validation scenario I tests the accuracy of the same-subject models (same subject, same CGM device). More specifically, for each of the 34 subjects, a model is trained on each subject's training dataset (i.e., first 2,000 minutes), resulting in 34 different models. For example, the training dataset for iSense subject #1 is used to derive a model for that subject, which is subsequently used to predict that subject's glucose levels. Each model is validated using the testing dataset (i.e., next 2,000 minutes) of that particular subject. For example, the testing dataset for iSense subject #1 (i.e., actual glucose measurements taken) is compared to the predictions for that subject.
Thus, as illustrated in
Validation scenario II tests the accuracy of the cross-subject models (different subjects, same CGM device). For each subject within a given study, the models developed in scenario I for the remaining subjects of that same study are applied to the testing dataset of the subject. For example, each of the models developed for iSense subjects #2-#9 are applied to the testing dataset of iSense subject #1.
As illustrated in
Validation scenario III tests the accuracy of the cross-study models (different subjects, different CGM devices). For each subject within a given study, the models developed in the other two studies are applied to the testing dataset of the subject. For example, the models developed for the eighteen subjects in the Guardian RT study and the seven subjects in the DexCom study are applied to the testing dataset of subject #1 of the iSense study.
As illustrated in
Similar tabulations are shown in
In an example to demonstrate the prediction power of the models, an embodiment of the invention selects a random subject: Guardian RT subject #5.
The glucose levels for Guardian RT subject #5 predicted utilizing the model developed using Guardian RT subject #13's training dataset (i.e., the first 2,000 measured glucose data points) illustrate model portability across different subjects within the same study (scenario II). Similarly, the glucose levels for Guardian RT subject #5 predicted utilizing the model developed using iSense subject #8's training dataset and DexCom subject #4's training dataset demonstrate portability across different studies and across different types of diabetes (scenario III). The same-subject predictions (model derived utilizing the training dataset for Guardian RT subject #5) in scenario I serve as a reference for comparison among the different models. Specifically, for validation scenarios I and II, the resulting RMSE's for Guardian RT subject #5 are 0.20 mmol/l and 0.21 mmol/l, respectively. For validation scenario III, using the iSense subject #8 model and DexCom subject #4 model, the resulting RMSE's are 0.22 mmol/l and 0.24 mmol/l, respectively.
To assess the utility of the glucose predictions using clinically acceptable metrics, a Clarke error grid analysis (EGA) is performed, which maps pairs of sensor-predicted glucose concentrations into five zones, A to E, of varying degrees of accuracy and inaccuracy of glucose estimation. Values in zones A and B are clinically acceptable; values in zone C may result in unnecessary corrections; values in zone D could lead to incorrect treatments and detections; and, values in zone E represent erroneous treatment.
The Clarke EGA is also performed for each of the three studies using the same-subject model predictions (scenario I). The composite result of each analysis is plotted on a separate graph (not shown). Of the 3,600 entries (400 data points×9 subjects) for the iSense study, 3,564 points (99.0%) lay in zone A, 35 in zone B, and 1 in zone D. Of the 7,200 entries (400×18) for the Guardian RT study, 7,150 points (99.3%), 32 points, and 18 points lay in zones A, B, and D, respectively. Similarly, of the 2,800 entries of the DexCom study, 2,787 (99.5%), 12, and 1 lay in zones A, B, and D, respectively. These results demonstrated the clinical utility of the predictive models.
To verify that the employed datasets do not correspond to well-treated diabetic patients with glucose levels mostly within the euglycemic range and that the filtering procedure does not over-smooth the raw data, the number of hypo- and hyperglycemic episodes in the raw, smoothed, and predicted data are calculated. A lower threshold of 3.9 mmol/1 (70 mg/dl) and an upper threshold of 10 mmol/1 (180 mg/dl) was adopted; and, an inter-episode separation of at least 30 minutes and a minimum of 30 minutes (seven consecutive data points) outside the euglycemic range were required to count the excursion as a hypo- or hyperglycemic episode.
For instance, for the iSense study, the average minimum glucose levels (in mmol/l) was 3.95, 4.38, and 4.28 for the raw data, smoothed data, and predicted data, respectively. The average maximum glucose levels (in mmol/l) were 15.81, 14.70, and 14.87 for the raw data, smoothed data, and predicted data, respectively. The average mean glucose levels (in mmol/l) were 8.72, 8.72, and 8.69 for the raw data, smoothed data, and predicted data, respectively; and the average standard deviations were 2.61, 2.52, and 2.55 for the raw data, smoothed data, and predicted data, respectively. The total number of hyperglycemic episodes were 25, 24, and 24 for the raw data, smoothed data, and predicted data, respectively; and, the total number of hypoglycemic episodes were 4, 3, and 3 for the raw data, smoothed data, and predicted data, respectively.
The portability properties demonstrated by the models herein are attributed to two factors: the conserved nature of the frequency content in the glucose signal of diabetic patients and the properties of the modeling approach. The dynamics in the blood glucose time-series signal of diabetic patients can be characterized by four distinct frequency ranges. These different frequency ranges characterize different physiologic mechanisms and are best described by the periodicity of their oscillations. The highest frequency range, with periods between 5 and 15 minutes, is generated by pulsatile secretion of insulin. The second highest, ultradian glucose oscillations, corresponds to periods between 60 and 120 minutes. Exogenous inputs, such as meals and insulin, generate oscillations with periods between 150 and 500 minutes; and, finally, circadian oscillations are responsible for the low-frequency range, with periods longer than 700 minutes.
Analysis of the time-series glucose signals of all subjects in the three studies supports these findings and shows that the frequency content in the signals is conserved across subjects.
This similarity in the frequency content of the glucose signals is exploited by the predictive AR models herein. Periodic signals, like glucose concentration, are characterized by three parameters: amplitude, frequency, and phase of the underlying oscillations. However, a property of AR models is their invariance with respect to a signal's amplitude and phase, and sole dependency on its frequency. The sequence of the AR model coefficients captures and represents the frequency content of a time-series signal. Therefore, the development of the predictive AR models from signals with similar frequency content produced similar (or portable) models, regardless that different time-series signals recorded from different subjects had different amplitudes and initial phases. This invariance of the AR model coefficients to the glucose signal's amplitude and phase affords model portability across subjects with type 1 and type 2 diabetes. Type 1 diabetes patients usually have larger glucose-level variations than type 2 patients. However, if these variations contain the same frequency information, the predictive AR models herein are portable across them. Moreover, because of the frequency-dependent nature of the AR model coefficients, information concerning exogenous inputs, such as meals and exercise, is automatically incorporated into the models if this information is present in the training data.
However, if some of the subjects from the training data are nondiabetic and fasting, the models' portability could be jeopardized because the glucose dynamics are different in this case. This is particularly relevant for the highest-frequency component of the glucose time-series signal, i.e., the shortest periods spanning between 5 and 15 minutes, because while these periods are prominent in nondiabetic, fasting individuals, they are absent in diabetic patients. In diabetic patients, insulin-generating cells responsible for pulsatile secretion of insulin are severely handicapped, essentially eliminating the 5-15 minute periods from the glucose signals. Moreover, the blood-to-interstitial transport acts as a low-pass filter, reducing the high-frequency dynamics in the CGM signals, which are further attenuated by the filtering procedure utilized herein.
The filtering procedure, used to attenuate any remaining high-frequency component in the signal to yield consistent AR coefficients and robust models, does not significantly impact the ability to capture hypo- and hyperglycemic episodes; and hence, the clinical usefulness of at least one embodiment of the invention.
Another contributing property for the predictive AR model portability relates to the limits imposed on the model coefficients by the constrained least squares method. Besides fitting the AR model to the data, the employed constrained least squares method also limits the curvature (i.e., the norm of the second derivative) of the AR coefficients. This is illustrated in
The results on model portability are valid for AR-type models. As discussed above, AR models capture the signal's frequency information and are invariant to the signal's phase and amplitude. The latter property is not shared by other modeling approaches, such as those based on ordinary differential equations or harmonic regression, which prevents their portability.
Accordingly, at least one embodiment of the invention develops stable, universal glucose models that capture the correlations in glucose time-series signals of diabetic patients. Given continuous glucose signals from a patient, such universal models are readily usable to make near-future glucose concentration predictions for other patients without any need for model customization.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the root terms “include” and/or “have,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The corresponding structures, materials, acts, and equivalents of all means plus function elements in the claims below are intended to include any structure, or material, for performing the function in combination with other claimed elements as specifically claimed. The description of the present invention has been presented for purposes of illustration and description, but is not intended to be exhaustive or limited to the invention in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the invention. The embodiment was chosen and described in order to best explain the principles of the invention and the practical application, and to enable others of ordinary skill in the art to understand the invention for various embodiments with various modifications as are suited to the particular use contemplated.
The invention can take the form of an entirely hardware embodiment or an embodiment containing both hardware and software elements. In at least one exemplary embodiment, the invention is implemented in a processor (or other computing device) loaded with software, which includes but is not limited to firmware, resident software, microcode, etc.
Computer program code for carrying out operations of the present invention may be written in a variety of computer programming languages. The program code may be executed entirely on at least one computing device (or processor), as a stand-alone software package, or it may be executed partly on one computing device and partly on a remote computer. In the latter scenario, the remote computer may be connected directly to the one computing device via a LAN or a WAN (for example, Intranet), or the connection may be made indirectly through an external computer (for example, through the Internet, a secure network, a sneaker net, or some combination of these).
It will be understood that each block of the flowchart illustrations and block diagrams and combinations of those blocks can be implemented by computer program instructions and/or means. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, application specific integrated circuit (ASIC), or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowcharts or block diagrams.
The invention has industrial applicability to predict future glucose levels in diabetic patients. The invention utilizes the predicted glucose levels to alter or improve the patient's lifestyle, to tighten their glycemic control, or to adjust therapy in a proactive manner. The universal AR models of the invention predict future glycemic states, which can be used to avoid undesired hypoglycemic or hyperglycemic episodes.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US09/52505 | 7/31/2009 | WO | 00 | 1/29/2011 |
Number | Date | Country | |
---|---|---|---|
61085295 | Jul 2008 | US |