METHOD FOR DETECTING A CONVERSION FROM MILD COGNITIVE IMPAIRMENT TO ALZHEIMER DISEASE

Description

FIELD OF THE INVENTION

The present invention relates to a method for detecting the conversion from mild cognitive impairment (MCI=Mild Cognitive Impairment) to Alzheimer disease (AD=Alzheimer's Disease).

PRIOR ART

Today 47 million people are suffering from dementia all over the world. It is estimated that this number will grow to 131 million by the year 2050, as a result of the increase in the average age of the population. Alzheimer's Disease (AD) accounts for about 60% of dementia cases (World Alzheimer Report 2016) and is usually diagnosed after age 65. AD patients survive on average only 4 to 8 years after diagnosis, as this condition is still incurable.

AD is a neurodegenerative disease characterized by a subtle onset, which is estimated to begin decades before cognitive and memory problems become visible, and by gradual progression. AD upsets the metabolic processes that keep healthy neurons and causes nerve cells to stop functioning, lose interconnections with other neurons, and eventually die. The death of nerve cells causes memory deficits, personality changes, problems in the performance of daily activities.

The group of experts from the National Institute on Aging and the Alzheimer Association (NIA/AA) has postulated that what is commonly considered “Alzheimer's disease” should rather be considered the stage of a more complex and long process of degeneration.

The experts of the NIA/AA have hypothesized three phases of progression of the AD:

1) Preclinical-AD: when the disease has already triggered the degeneration of the brain, but the clinical symptoms are not yet visible;

2) Mild Cognitive Impairment (MCI) caused by AD (MCI-AD or prodromal-AD): an intermediate phase in which symptoms related to the ability to think can start to be evident, but do not affect the daily life of the subject;

3) Dementia caused by AD (Dementia-AD): in the last phase of the evolution of the disease, disorders of memory, thought and behavior undermine a person's ability to live and act independently.

In fact, not all MCI subjects will develop dementia, since not all MCI subjects are inherently affected by AD. Only 10-15% of MCI patients “convert to AD” every year. Commonly there are two different types of AD: amnestic MCI (aMCI) and non-amnestic MCI. The first one refers to patients with memory deficits and the second one refers to patients with cognitive deficits but not memory deficits. aMCI subjects are more likely to develop AD.

Longitudinal studies (follow-up) on MCI patients are of fundamental importance to diagnose as soon as possible a possible progression of the MCI condition in AD, in order to be able to promptly take the treatment.

To this end, it is necessary to have tools for the quantitative assessment of the evolution of the state of health of the brain, in order to detect any progression towards dementia. The subjective clinical evaluation of the doctor must go alongside an instrument that quantifies the effects of the progression of the disease, so that the doctor can base his overall evaluation on different criteria, both subjective and objective.

Unfortunately, in the literature there are only a few longitudinal studies on MCI patients, with consequent lack of diagnostic tools to allow the neurologist to objectively monitor the progression of the disease.

SUMMARY OF THE INVENTION

A purpose of the present invention is to provide a method of objective assessment of the progression of MCI to AD, quantifying the effects it has on the patient's electroencephalogram (EEG).

The present invention achieves the above purpose by providing a method for detecting the conversion from mild cognitive impairment (MCI) to Alzheimer disease (AD), the method comprising the following stages:

a) providing as input data a plurality of first signals EEG (1, . . . , n) recorded at a first time T₀and defining a first tracing EEG-T₀of a patient with mild cognitive impairment, and a plurality of second signals EEG (1, . . . , n) recorded at a second time T₁and defining a second tracing EEG-T₁of the same patient, each first signal and each second signal corresponding to a respective electrode V (with V=1, . . . , n), the first tracing EEG-T₀and the second tracing EEG-T₁being divided into epochs w of equal duration;

b) for each epoch w of the first tracing EEG-T₀extracting at least two first sub-tracings EEG_sb^T0corresponding to respective frequency sub-bands (sb=delta, theta, alpha, beta), and for each epoch w of the second tracing EEG-T₁extracting at least two second sub-tracings EEG_sb^T1corresponding to respective frequency sub-bands (sb=delta, theta, alpha, beta);

c) for each epoch w and for each of the first sub-tracings EEG_sb^T0and second sub-tracings EEG_sb^T1, for each possible pair of signals x and y (with x=1, . . . , n; y=1, . . . , n and x≠y) calculating the Permutation Jaccard Distance PJD_X,Y^w(sb) between signal EEG_sb(x) and signal EEG_sb(y) at both time T₀and time T₁;

d) for each first sub-tracing EEG_sb^T0and each second sub-tracing EEG_sb^T1, performing a hierarchical clustering to divide into clusters the signals (and thus the respective electrodes) of the respective sub-tracing according to their mutual Permutation Jaccard Distances;

e) estimating the network density, when a fusion level FL varies, from the clusters obtained by the hierarchical Clustering (HC), defining two curves ND^T0(sb) and ND^T1(sb) for each frequency sub-band (sb=delta, theta, alpha, beta);

f) calculating the percentage variation ΔND(sb) % of the area subtended by the two curves ND^T1(sb) and ND^T0(sb) for each frequency sub-band with the formula ΔND(sb) %=(ND^T1(sb)−ND^T0(sb))*100/ND^T0(sb);

g) verifying that said percentage variation ΔND(sb) % is negative for each frequency sub-band in the transition from T₀to T₁to confirm the conversion from mild cognitive impairment (MCI) to Alzheimer disease (AD).

Advantageously, the method of the invention is based on the advanced processing of EEG signals since electroencephalography is a non-invasive neurophysiological evaluation technique, very well tolerated by patients, rapid, cost-effective and widespread on a large scale. The EEG is therefore the optimal candidate for the development of a system of early diagnosis of AD. The clinics in which the MCI and AD subjects are taken care of are normally equipped with EEG acquisition systems since the reporting of EEG tracing is part of the most widespread evaluation protocols of this category of patients.

The method of the invention is based on an innovative measure of synchronization of the EEG signals, called Permutation Jaccard Distance (PJD) and on its use as a measure of coupling between the electrodes. The electrodes are considered “nodes” of a complex network and the coupling between the nodes is estimated through the PJD. The network thus obtained is passed into input to the hierarchical clustering so that the electrodes are divided into clusters, according to the degree of coupling between them. The density of connectivity or network density (ND) between the electrodes, then between the corresponding brain areas, is estimated accordingly. Where the patient has progressed from MCI to AD, there is a significant increase in overall PJD and a significant decrease in ND because, due to cell death, phenomena of disconnection between cortical areas are triggered. However, this decrease is not observed in stable patients.

The proposed system thus provides an objective criterion for monitoring the brain health status of MCI subjects, which goes alongside the other criteria of neurological, psychological, clinical and cognitive assessment normally provided by the evaluation protocols of these patients.

Further features and advantages of the invention will appear more clearly from the detailed description of some exemplary but not exclusive embodiments thereof.

The dependent claims describe particular embodiments of the invention.

BRIEF DESCRIPTION OF THE FIGURES

In the description of the invention, reference is made to the accompanying drawings, which are given by way of non-limiting example, in which:

FIG. 1 shows a diagram of an embodiment of the method of the invention;

FIG. 2 shows a series of dendrograms, relating to a first patient, for each sub-band (sb) at time T₀and at time T₁;

FIG. 3 shows a series of dendrograms, relating to a second patient, for each sub-band (sb) at time T₀and at time T₁;

FIG. 4 shows the trend of the network density, relative to the first patient, as a function of the level of fusion, for each sub-band (sb) at time T₀and at time T₁;

FIG. 5 shows the trend of the network density, relative to the second patient, as a function of the level of fusion, for each sub-band (sb) at time T₀and at time T₁.

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS OF THE INVENTION

The method of the invention for detecting a conversion from mild cognitive impairment (MCI) to Alzheimer disease (AD), illustrated in the Figures, comprises the following stages:

a) providing as input data a plurality of first signals EEG (1, . . . , n), thus a first set of n signals, recorded at a first time T₀and defining a first tracing EEG-T₀of a patient with mild cognitive impairment, and a plurality of second signals EEG (1, . . . , n), thus a second set of n signals, recorded at a second time T₁and defining a second tracing EEG-T₁of the same patient, each first signal of said plurality of first signals and each second signal of said plurality of second signals corresponding to a respective electrode V (with V=1, . . . , n; thus, there are n electrodes), the first tracing EEG-T₀and the second tracing EEG-T₁being divided into epochs (or windows) w of equal duration;

c) for each epoch w and for each of the first sub-tracings EEG_sb^T0and of second sub-tracings EEG_sb^T1, for each possible pair of signals x and y (with x=1, . . . , n; y=1, . . . , n and x≠y) calculating the Permutation Jaccard Distance PJD_X,Y^w(sb) between signal EEG_sb(x) and signal EEG_sb(y) at both time T₀and time T₁;

d) for each first sub-tracing EEG_sb^T0and for each second sub-tracing EEG_sb^T1, performing a hierarchical clustering to divide into clusters the signals (and thus the respective electrodes) of the respective sub-tracing according to their mutual Permutation Jaccard Distances;

e) estimating the network density, when a fusion level FL varies, from the clusters obtained by the hierarchical Clustering, defining two network density curves ND^T0(sb) and ND^T1(sb) for each frequency sub-band (sb=delta, theta, alpha, beta); f) calculating the percentage variation ΔND(sb) % of the area subtended by the two curves ND^T1(sb) and ND^T0(sb) for each frequency sub-band with the formula ΔND(sb) %=(ND^T1(sb)−ND^T0(sb))*100/ND^T0(sb);

In other words, stage a) of the method provides, as input data, data of a plurality of first signals EEG (1, . . . , n) recorded at a first time T₀and defining a first tracing EEG-T₀of a patient with mild cognitive impairment, and data of a plurality of second signals EEG (1, . . . , n) recorded at a second time T₁and defining a second tracing EEG-T₁of the same patient, each first signal and each second signal corresponding to a respective electrode V (with V=1, . . . , n), and the first tracing EEG-T₀and the second tracing EEG-T₁being divided into epochs (w) of equal duration;

or stage a) includes dividing a first tracing EEG-T₀of a patient with mild cognitive impairment, defined by a plurality of first signals EEG (1, . . . , n) recorded at a first time T₀, and a second tracing EEG-T₁of the same patient, defined by a plurality of second signals EEG (1, . . . , n) recorded at a second time T₁, in epochs w of equal duration;

or, more simply, stage a) includes dividing a first tracing EEG-T₀, defined by a plurality of first signals EEG (1, . . . , n), and a second tracing EEG-T₁, defined by a plurality of second signals EEG (1, . . . , n), in epochs w of equal duration.

It should be noted that the whole method of the invention is based on an EEG tracing data processing. Carrying out the method of the invention never requires the presence of the human body. The whole method must be executed offline once the input data, i.e. the data of the two EEG tracings previously recorded at time T₀(baseline) and at time T₁(some months after T₀, for example 2 or 3 or 4 or 5 months after T₀) have been stored on a computer.

The data processing, provided for in the steps of the method of the invention, can be performed by any computer on which a software adapted to execute said steps is installed.

It is preferable that the EEG tracings, once memorized, are reviewed by an EEG expert in order to identify, preferably visually, and eliminate tracing segments contaminated by artifacts. The EEG tracings, thus cleaned up by the artifacts, will be subsequently processed according to the method of the invention.

In one embodiment of the invention, between stage c) and stage d) for each first sub-tracing EEG_sb^T0and each second sub-tracing EEG_sb^T1the following is provided—calculating the respective average values PJD^T0_X,Y(sb), PJD^T1_X,Y(sb) on all the epochs w for each possible pair of signals EEG_sb(x) and EEG_sb(y), said average values PJD^T0_X,Y(sb) and PJD^T1_X,Y(sb) defining the dissimilarities D_X,Y^T0(sb) and D_X,Y^T1(sb) between the signal EEG_sb(x) and the signal EEG_sb(y) of each possible pair, at time T₀and at time T₁, respectively;

- building two networks NET_sb(T_i), with i=0, 1, the node “x” of which represents the signal EEG_sb(x) (thus the electrode “x”) at time T_i, and the weight connecting the nodes “x” and “y” of the network NET_sb(T_i) represents the dissimilarity between the pair of signals EEG_sb(x) and EEG_sb(y) at time T_i, thus creating two dissimilarity matrices D^T0(sb) and D^T1(sb), the (x, y)-th element of which is equal to D_X,Y^T0(sb)=PJD_X,Y^T0(sb) and D_X,Y^T1(sb)=PJD_X,Y^T1(sb), respectively.

In stage d) the hierarchical Clustering is performed, starting from the dissimilarity matrices D^T0(sb) and D^T1(sb), outputting two dendrograms showing the connection between the first signals EEG_sb(1, . . . , n) at time T₀and the connection between the second signals EEG_sb(1, . . . , n) at time T₁, respectively, as a function of a fusion level FL, whereby for each dissimilarity matrix and for each fusion level FL, a set of clusters is determined.

Preferably, the hierarchical Clustering is performed by an agglomerative hierarchical Clustering algorithm, preferably a “complete linkage algorithm”, also referred to as “furthest neighbour”, which defines the distance or dissimilarity between two clusters by means of the maximum distance between a pair of signals, a signal belonging to a first cluster while the other signal of said pair belonging to the second cluster.

In stage e), the following is provided

for each fusion level FL, calculating the number of active connections AC_{F L}^T0(sb) and AC_FL^T1(sb) respectively by summing the number of possible pairs of first signals EEG_sb(1, . . . , n) at time T₀, and by summing the number of possible pairs of second signals EEG_sb(1, . . . , n) at time T₁, present within each cluster of the respective set of clusters;

- for each fusion level FL, estimating the network densities ND_FL^T0(sb) and ND_FL^T1(sb) by normalizing ACA_FL^T0(sb) and AC_FL^T1(sb), respectively, with respect to the total number of possible connections equal to [n*(n−1)/2], where n is the number of signals.

The network densities ND_FL^T0(sb) and ND_FL^T1(sb) are estimated for different fusion levels from 0 to 1, preferably but not necessarily with steps of 0.01.

In stage c), before calculating the Permutation Jaccard Distance PJD_X,Y^w(sb) for each possible pair of signals EEG_sb(x) and EEG_sb(y), the following is provided:

c1) for each possible pair of signals EEG_sb(x) and EEG_sb(y) which is mappable in a m-dimensional space, where m is the embedding dimension, each signal EEG_sb(x), EEG_sb(y) having N time samples (t, t+1, t+N−1) in said epoch w, detecting a plurality of symbols (patterns or motifs) π_i, π_j, with i, j=1, . . . , m!, occurring in said epoch w for each sample (t, t+1, t+N−1);

c2) for each sample (t, t+1, t+N−1), detecting the number of occurrences η_X(π_i) of each motif π_ialong the signal EEG_sb(x), the number of occurrences η_Y(π_j) of each motif π_jalong the signal EEG_sb(y), and the number of joint occurrences η_X,Y(π_i, π_j) of the two motifs π_i, π_jalong said signal EEG_sb(x) and said signal EEG_sb(y);

c3) once the signals EEG_sb(x), EEG_sb(y) have been fully processed, estimating the occurrence probability p_X(π_i) of the motif π_ialong the signal EEG_sb(x), the occurrence probability p_Y(π_i) of the motif π_ialong the signal EEG_sb(y) and the joint occurrence probability p_X,Y(π_i, πj) of the two motifs π_i, π_jalong said signal EEG_sb(x) and said signal EEG_sb(y).

In particular, the following is defined:

p
_X(π_i)=η_X(π_i)/[N−(m−1)L]

p
_Y(π_i)=η_Y(π_i)/[N−(m−1)L] and

p
_X,Y(π_i,π_i)=η_X,Y(π_i,π_j)/[N−(m−1)L]

where L is the time lag between a sample and the next, comprised between 1 and 10. In the embodiment of the method described herein, L=1 is assumed.

As known, the time lag represents the number of samples between a given sample selected from signals EEG_sb(x) and EEG_sb(y), where each signal EEG_sb(x) and EEG_sb(y) has N time samples (t, t+1, t+N−1), and the next sample to be selected. For example, starting from the first sample “t”, if m=3 and L=1, the three samples “t”, “t+1” and “t+2” will be selected; if, for example, L=5, the samples “t”, “t+5”, “t+10” will be selected.

For each epoch w the Permutation Jaccard Distance PJD_X,Y(sb) between the signal EEG_sb(x) and the signal EEG_sb(y) is defined by the following relation

PJD_X,Y(sb)=1−PMI(X,Y)/PJE(X,Y),

where PMI(X,Y) is the Permutation Mutual Information defined as

PMI(X,Y)=PE(X)+PE(Y)−PJE(X,Y)

where PE(X) is the Permutation Entropy of the signal EEG_sb(x)

$PE (X) = - \sum_{i = 1}^{m!} p_{X} (π_{i}) \log (p_{X} (π_{i}))$

PE(Y) is the Permutation Entropy of the signal EEG_sb(y)

$PE (Y) = - \sum_{i = 1}^{m!} p_{Y} (π_{i}) \log (p_{Y} (π_{i}))$

PJE(X,Y) is the Permutation Joint Entropy of the signals EEG_sb(x) and EEG_sb(y)

$PJE (X, Y) = - \sum_{i = 1}^{m!} \sum_{j = 1}^{m!} p_{X, Y} (π_{i}, π_{j}) \log (p_{X, Y} (π_{i}, π_{j}))$

and where log is the natural logarithm.

Preferably but not necessarily, in stage b) it is sufficient to extract, for each epoch w of the first tracing EEG-T₀, only two first sub-tracings EEG_sb^T0corresponding to the two frequency sub-bands delta and theta, and for each epoch w of the second tracing EEG-T₁only two second sub-tracings EEG_sb^T1corresponding to the two frequency sub-bands delta and theta.

To obtain the input data used in stage a), at time T₀and at time T₁, for example with T₁=(T₀+3 months), electroencephalography (EEG) is used, a technique used to measure and record brain electrical activity. A certain number of electrodes (1, . . . , x, y, . . . , n) are placed in contact with the patient's scalp. These electrodes are connected to the acquisition system, which amplifies and records the detected electrical potentials. The electrical potentials detected are the result of the overlap of the electrical activity of neuronal populations, where this overlap is of sufficient intensity to be detected by the scalp. Monitoring the spatial-temporal dynamics of the recorded signals allows deducing information about the neural activity that generated them and, consequently, allows obtaining diagnostic information.

By way of example, the electrodes are applied to the scalp according to the standard positioning called “10-20 International System”. The 10 and 20 refer to 10% and 20% with respect to 100% of the distance between two landmarks called “inion” (protuberance at the base of the occipital bone) and “nasion” (upper attachment of the nose). The electrodes are uniquely identified by a label that identifies the area of belonging (F=frontal, T=temporal, C=central, P=parietal, 0=occipital, A=auricular), the hemisphere (even numbers for the right, odd numbers for the left, “z” for the midline) and the exact position. There are different types of electrode positioning, called “montages”, which are set up to provide a uniform view of the distribution of the cortical electrical activity. In a certain time instant t, the value detected by the single electrode V represents the potential difference with respect to a reference electrode V(t)−Vref(t). In the proposed example, the EEG is recorded according to the 10-20 International System (montage: Fp1, Fp2, F3, F4, C3, C4, P3, P4, O1, O2, F7, F8, T3, T4, T5, T6, Fz, Cz e Pz), with linked ear-lobe reference (A1-A2). For example, the recording time is 3-7 minutes, preferably 5 minutes. Preferably, during the acquisition the patients sit comfortably, keep their eyes closed but remain awake (eye closed resting state).

EEG signals are filtered in the 0.5-30 Hz range, to include four sub-bands of interest, delta (0.5-4 Hz), theta (4-8 Hz), alpha (8-13 Hz), beta (13-30 Hz), and sampled with a predetermined sampling frequency, for example fs=256 Hz.

Preferably, the EEG tracing is then viewed by the EEG expert in order to find and exclude any sleep patterns and to label and eliminate segments that exhibit artifacts. If the sampling frequency is higher than 256 Hz, a 256 Hz downsampling will be performed.

The EEG tracing thus acquired is divided into epochs w of equal duration and not overlapping, of about 3-7 seconds, for example 5 seconds, and is then divided into four sub-tracings EEG_sb(stage b), each associated with one of the specific sub-bands of interest: EEG_delta, EEG_theta, EEG_alfa, EEG_beta.

The sub-tracings are extracted by filtering, in a known manner, each EEG channel through bandpass filters based on the Fast Fourier Transform (FFT) and on the reverse FFT (IFFT). By means of the FFT, each EEG signal is broken down into its different elementary frequency components; by means of the IFFT, the EEG signal is then reconstructed in the desired specific sub-band, that is: delta (0.5-4 Hz), theta (4-8 Hz), alpha (8-13 Hz) and beta (13-30 Hz).

Each of the sub-tracings EEG_sbthus obtained (EEG_delta, EEG_theta, EEG_alpha, EEG_beta), divided into epochs w, is then analyzed independently from the others.

The concepts underlying the invention are described below, including that of considering the scalp as a network where the electrodes represent the nodes. An appropriate measure of dissimilarity is defined between each pair of electrodes x, y (with x=1, . . . , n; y=1, . . . , n and x≠y), which can represent the coupling strength between the areas covered by the two electrodes x and y. The “inter-electrode” dissimilarity is quantified by estimating the coupling strength between the corresponding signals EEG_sb(x) and EEG_sb(y). In this way it is possible to associate a graph to the EEG recorded at time T₀and a graph to the EEG recorded at time T₁.

These dissimilarities between electrodes are then passed through a hierarchical clustering (HC), so as to group the electrodes according to the coupling strength between corresponding EEG signals. The clusters depend on the selected threshold of fusion level FL.

Given a threshold of fusion level FL, a set of clusters is determined and it is possible subsequently estimate the network density or connectivity density. By comparing the connectivity density of the two graphs, corresponding to time T₀and time T₁, it is possible to indirectly quantify how the brain connectivity varies.

Other concepts underlying the invention are explained in more detail hereinafter. Permutation Entropy (PE) was introduced by Bandt and Pompe (C. Bandt and B. Pompe—Permutation entropy: A natural complexity measure for time series—Phys. Rev. Lett., 88 (17), 2002) as a symbolic descriptor of dynamic complexity changes in time series. Thanks to the projection in symbols (motifs), the Permutation Entropy, or simply PE, estimates the randomness of a time series regardless of its amplitude, which plays a key role when analyzing the EEG. In fact, the amplitude of EEG, recorded through a given electrode, depends on the distance from the reference electrode. When processing EEG recordings using amplitude-dependent techniques, each EEG signal should first be normalized to cancel the effect of closeness to the reference electrode. Normalization is not necessary when a symbolic procedure such as PE is used. However, PE is a univariate descriptor that can only describe the randomness of a single time series, in this case an EEG signal, and cannot quantify the coupling strength between two or more time series, i.e. between two or more EEG signals.

Advantageously, the descriptor proposed in the method of the invention, namely the Permutation Jaccard Distance (PJD), is based on the same projection into symbols adopted by Permutation Entropy (PE), but is a multivariate descriptor that can quantify the coupling strength between two or more time series.

From the Information Theory, given a time series x, with N samples, and its probability density function p(x), the Entropy of the series x is defined as

$H (X) = - \sum_{i = 1}^{N} p_{X} (x_{i}) \log (p_{X} (x_{i}))$

Given two time series x and y, with N samples, and their joint probability density function p_X,Y(x, y), their Joint Entropy is defined as:

$H (X, Y) = - \sum_{i = 1}^{N} \sum_{j = 1}^{N} p_{X, Y} (x_{i}, y_{j}) \log (p_{X, Y} (x_{i}, y_{j}))$

Their Mutual Information is defined as MI(X;Y)=H(X)+H(Y)−H(X,Y).

The Variation of Information is defined as VI(X,Y)=H(X,Y)−MI(X;Y).

When normalized, VI(X,Y) becomes the Jaccard Distance between the time series x and y

JD(X,Y)=1−MI(X;Y)/H(X,Y)

which is a metric because it satisfies the properties of symmetry, positivity, boundedness (0≤JD (X, Y)≤1) and triangular inequality (A. Kraskov, H. Stogbauer, R. G. Andrzejak, and P. Grassberger—Hierarchical clustering based on mutual information—arXiv:q-bio/0311039).

The concept of Permutation Jaccard Distance (PJD) is then introduced by exploiting the properties of JD as well as the advantages of projecting time series into symbols (motifs), which are particularly useful when analyzing EEG signals.

As for the time series projection in symbols, given two time series x and y with N samples, they can be mapped into an m-dimensional space, where m is the embedding dimension [N. Packard, J. Crutchfield, J. Farmer and R. Shaw, “Geometry from time series”, Phys. Rev. Lett. 45, (1980) 712.]

Given an EEG epoch under analysis, starting from two given samples x(t) and y(t) and given a time lag L, two m-dimensional vectors, X_tand Y_tcan be constructed as follows:

X
_t=[x(t),x(t+L), . . . ,x(t+(m−1)]^T

and

Y
_t=[y(t),y(t+L), . . . ,y(t+(m−1)]^T

where the apex T indicates the transposed.

The methodology is illustrated schematically using as an example m=3 and L=1 (FIG. 1). X_tand Y_tare both vectors with three elements. The algorithm eliminates the absolute values of X_tand Y_tand takes into account only the relative amplitude of their elements: low, medium, high. If we consider three possible levels (m=3), six possible (m!=6) ordinal sequences (patterns or motifs) can be identified, that is, the permutations without repetition of the three levels low, medium and high. Motifs or patterns are indicated with π_i, where i=1, . . . , 6 (FIG. 1). The algorithm checks which motif occurs in X_t(motif π₄in the example shown in FIG. 1) and which motif occurs in Y_t(motif Tri in the example of FIG. 1). According to the example in FIG. 1, in the first iteration, the algorithm will increment the number of occurrences η_X(π₄) of the motif π₄in the time series x, and the number of occurrences η_Y(π₁) of the motif Tri in the time series y. The algorithm will also increment the number of joint occurrences η_X,Y(π₄, π₁) of the two motifs π₄and π₁. Then the algorithm moves to the following samples x(t+1), y(t+1), constructs two new vectors X_t+1and Y_t+1and reiterates the procedure just illustrated.

Once the iterations have been completed and the two time series have been fully processed, the algorithm estimates the overall probability that a given motif π_ioccurs (with i=1, . . . , 6) in the time series x and in the time series y, normalizing the number of occurrences n by the number of iterations:

p
_X(π_i)=η_X(π_i)/[N−(m−1)L]

p
_Y(π_i)=η_Y(π_i)/[N−(m−1)L]

as well as the probability that a couple of motifs occurs jointly:

p
_X,Y(π_i,π_j)=η_X,Y(π_i,π_j)/[N−(m−1)L].

Advantageously, by discarding the absolute amplitude of the elements of the vectors X_tand Y_tand matching them with the predetermined patterns, the procedure becomes amplitude independent. This feature is very useful when analyzing EEG signals, because a signal recorded through an electrode close to the reference electrode will inherently have a lower amplitude, compared to a electrode located farther away.

Therefore, given a time series x, with N samples and embedding dimension m, the Permutation Entropy of the series x is defined as

$PE (X) = - \sum_{i = 1}^{m!} p_{X} (π_{i}) \log (p_{X} (π_{i}))$

Given two time series x and y, with N samples and embedding dimension m, their Permutation Joint Entropy (PJE) is defined as

$PJE (X, Y) = - \sum_{i = 1}^{m!} \sum_{j = 1}^{m!} p_{X, Y} (π_{i}, π_{j}) \log (p_{X, Y} (π_{i}, π_{j}))$

Their Permutation Mutual Information is defined as

PMI(X,Y)=PE(X)+PE(Y)−PJE(X,Y).

Their Permutation Variation of Information (PVI) is defined as

PVI(X,Y)=PJE(X,Y)−PMI(X,Y).

Therefore the Permutation Jaccard Distance PJD between time series x and y is defined as

PJD(X,Y)=1−PMI(X,Y)/PJE(X,Y).

When the coupling strength between the series x and the series y increases, a decrease in PJD is expected, because the two time series become more synchronized. In fact, as the coupling strength increases, PMI increases and joint randomness (therefore PJE) decreases.

As a consequence of the definition of Jaccard Distance (JD), the PJD satisfies the properties of a metric and is bounded between 0 and 1.

The advantages of using PJD are numerous, since PJD is a symbolic methodology, it is less sensitive to artifacts because it projects the EEG time series into a set of symbols (motifs). In this way, the possible amplitude variation in the EEG signal, due to artifacts, would not alter the amplitude of the symbols, which are predetermined. Furthermore, PJD is nonlinear and could capture nonlinear dynamics in the EEG signal better than linear descriptors of the coupling strength, such as coherence (Wavelet Coherence).

The Permutation Jaccard Distance between each possible pair of signals (electrodes) is calculated, for each patient, in each frequency sub-band, for both EEG-T₀and EEG-T₁tracings. Given a sub-band sb (delta, theta, alpha or beta) and the corresponding sub-tracing EEG_sb, and given a generic epoch w under analysis, the Permutation Jaccard Distance is calculated between each pair of signals EEG_sb(x) and EEG_sb(y), i.e. between each pair of electrodes x and y (with x=1, . . . , n; y=1, . . . , n and x≠y), both for the measurement recorded at time T₀and for the measurement recorded at time T₁(stage c).

These values PJD_X,Y^w(sb) are therefore averaged over time, i.e. over all the epochs w, to obtain the respective average values PJD^T0_X,Y(sb), PJD^T1_X,Y(sb) for each possible pair of signals EEG_sb(x) and EEG_sb(y) in each sub-band sb.

In the method of the invention, a hierarchical Clustering HC is used to group the signals, and therefore the electrodes, according to their mutual Permutation Jaccard Distances (PJDs) and subsequently to estimate the connectivity density of the electrode network.

Hierarchical clustering partitions a sample dataset into clusters. There are two types of hierarchical clustering: agglomerative and divisive. The divisive hierarchical clustering assigns the entire dataset to a cluster and then iteratively splits it into groups until all groups are single clusters. However, this algorithm is computationally expensive. In the method of the invention, it is preferable to use an agglomerative hierarchical clustering which assigns an individual cluster to each data point and then, in an iterative manner, merges the two most similar clusters. The procedure is repeated until all subsets belong to a single cluster (Brian S. Everitt, Sabine Landau, Morven Leese, Daniel Stahl. Cluster Analysis, 5th Edition (2011). Wiley. ISBN: 978-0-470-74991-3).

Several agglomerative HC algorithms have been proposed in the literature (Brian S. Everitt, Sabine Landau, Morven Leese, Daniel Stahl. Cluster Analysis, 5th Edition (2011). Wiley. ISBN: 978-0-470-74991-3). Preferably, in the method of the invention a “complete linkage algorithm” is used, also called “furthest neighbour”, which defines the distance or dissimilarity between two clusters by means of the maximum distance between a pair of signals.

Said mean values PJD^T0_X,Y(sb) and PJD^T1_X,Y(sb) thus define the dissimilarity D_X,Y^T0(sb) and D_X,Y^T1(sb) between the signal EEG_sb(x) and the signal EEG_sb(y) of each possible pair of signals, respectively at the time T₀and at the time T₁, i.e. between two electrodes of each possible pair of electrodes.

In this way, two dissimilarity matrices D^T0(sb) and D^T1(sb) are created in each frequency sub-band, the (x,y)-th element of which is equal to D_X,Y^T0(sb)=PJD_X,Y^T0(sb) and D_X,Y^T1(sb)=PJD_X,Y^T1(sb), respectively. Substantially, two networks NET_sb(T_i) are constructed, with i=0, 1, the node “x” of which represents the signal EEG_sb(x) at the time T_i, and the weight that connects the nodes “x” and “y” of the network NET_sb(T_i) represents the dissimilarity between the pair of signals EEG_sb(x) and EEG_sb(y) at time T_i. In this way, it is possible to create a dissimilarity matrix D^Ti(sb) for each phase of the follow-up T_iof each patient and for each sub-band.

At this point, the hierarchical Clustering HC is applied to the dissimilarity matrices D^T0(sb) and D^T1(sb), having as output for each frequency sub-band two dendrograms showing the connection among the first signals EEG_sb(1, . . . , n) at time T₀and the connection among the second signals EEG_sb(1, . . . , n) at time T₁, respectively, as a function of a fusion level FL, i.e. two dendrograms showing respectively the connections of the first set of signals EEG_sb(1, . . . , n) at the time T₀and the connections of the second set of signals EEG_sb(1, . . . , n) at time T₁as a function of a fusion level FL.

The dendrogram therefore provides a view of the connection between the electrodes as a function of the fusion level (stage d). Given a fusion level FL, the electrodes connected at a level below FL will belong to the same cluster and will be considered connected. Therefore, for each dissimilarity matrix and each level of fusion FL, a set of clusters is determined (see for example FIG. 2 or 3).

Thereafter (stage e), the number of active connections AC_FL^T0(sb) and AC_FL^T1(sb) is calculated for each fusion level FL, respectively by summing the number of possible pairs of first signals EEG_sb(1, . . . , n) at time T₀, and summing the number of possible pairs of second signals EEG_sb(1, . . . , n) at time T₁, present within each cluster of the respective set of clusters. That is, the number of active connections AC_FL^T0(sb) and AC_FL^T1(sb) is calculated for each fusion level FL, by summing the number of signal pairs belonging to the first set of signals EEG_sb(1, . . . , n) at time T₀, which are connected for that given fusion level FL (that is to say, they are part of the same cluster in the dendrogram), and by summing the number of signal pairs belonging to the second set of signals EEG_sb(1, . . . , n) at time T₁, which are connected for that given fusion level, respectively.

Noting that the total number of possible connections between n nodes of a network is equal to [n*(n−1)/2], the network densities ND_FL^T0(sb) and ND_FL^T1(sb) are defined for each fusion level FL by normalizing the number of active connections AC_FL^T0(sb) and AC_FL^T1(sb) with respect to the total number of possible connections, where n is the number of signals (electrodes).

In general, we have

${ND}_{FL}^{Ti} (sb) = \frac{{AC}_{FL}^{Ti} (sb)}{n * (n - 1) / 2} .$

Therefore, ND represents the ratio between the number of active connections and the number of potential connections of a network, whereby ND=0 represents a totally disconnected network while ND=1 represents a completely interconnected network.

The dendrograms of two patients are illustrated by way of example in FIGS. 2 and 3. The dendrogram in FIG. 2 refers to a first patient Pt30 with stable MCI at time T₁, while the dendrogram in FIG. 3 refers to a patient Pt51 in which there was the conversion from MCI to Alzheimer's disease (AD) at time T₁.

The axis of the ordinates of the dendrogram represents the distance or dissimilarity between the clusters (fusion level FL). The axis of the abscissas of the dendrogram represents the electrodes. Each branch of the diagram (vertical line) corresponds to a cluster. The (horizontal) conjunction line of two or more branches identifies the distance (fusion level) at which the clusters merge.

In order to provide a view at-a-glance of how connectivity changed from T₀to T₁, an arbitrary fusion level was selected (0,3), both for T₀and for T₁, and the corresponding clusters are highlighted in FIGS. 2 and 3 with different colors. It is worth noting that while the clusters have barely changed for the patient Pt30 (stable), they have instead changed significantly for the patient Pt51 (converted). For example, in the delta band, at the time T₀, 3 single elements were observed and a large cluster with 16 elements was obtained, while, at the time T₁, 6 clusters were obtained, with a size ranging from 2 to 6 elements.

In order to quantify such a visual evaluation, the network densities ND_FL^T0(sb) and ND_FL^T1(sb) were estimated as described above. ND indicates how many connections are active as a function of the threshold of the selected fusion level. The network densities ND_FL^T0(sb) and ND_FL^T1(sb) were calculated for different fusion level thresholds ranging from 0 to 1, with a step of 0.01. The fusion level is between 0 and 1 because the PJD is by definition included in that range.

FIGS. 4 and 5 show explanatory representations of the evolution of the network density from T₀to T₁as a function of the fusion level threshold, in each sub-band, for the patient Pt30 (stable MCI) and for the patient Pt51 (converted AD), respectively.

Regarding the patient Pt30 (FIG. 4), it is observed that the two patterns ND^T0and ND^T1essentially overlap for FL<0.55 in each sub-band, while they differ significantly for the patient Pt51 also for FL<0.55. This result indicates that the network density changes significantly in the patient Pt51 in the transition from T₀to T₁while remaining stable in the patient Pt30.

In order to quantify the results shown in FIGS. 4 and 5, the percentage variation ΔND(sb) % of the area below the two curves ND^T1(sb) and ND^T0(sb) for each frequency sub-band is calculated with the formula

ΔND(sb) %=(ND^T1(sb)−ND^T0(sb))*100/ND^T0(sb).

It has been found experimentally that the patients converted into AD have undergone a negative percentage variation ΔND(sb) % for each frequency sub-band in the transition from T₀to T₁. No false positive was found.

Advantageously, the method of the present invention for the indirect estimation of the density of brain connectivity is extremely sensitive and specific for the conversion from MCI to AD. The use of the PJD as a symbolic descriptor of the coupling strength gave better results than the linear descriptors of the coupling strength, such as coherence (Wavelet Coherence), which instead led to the detection of false positives.

Claims

1. A method for detecting a conversion from mild cognitive impairment (MCI) to Alzheimer disease (AD), the method comprising the following stages: a) providing as input data a plurality of first signals EEG (1, . . . , n) recorded at a first time T0 and defining a first tracing EEG-T0 of a patient with mild cognitive impairment, and a plurality of second signals EEG (1, . . . , n) recorded at a second time T1 and defining a second tracing EEG-T1 of the same patient, each first signal and each second signal corresponding to a respective electrode V (with V=1, . . . , n), and the first tracing EEG-T0 and the second tracing EEG-T1 being divided into epochs (w) of equal duration;b) for each epoch (w) of the first tracing EEG-T0 extracting at least two first sub-tracings EEGsbT0 corresponding to a respective frequency sub-band (sb=delta, theta), and for each epoch (w) of the second tracing EEG-T1 extracting at least two second sub-tracings EEGsbT1 corresponding to a respective frequency sub-band (sb=delta, theta);c) for each epoch (w) and for each of the first sub-tracings EEGsbT0 and second sub-tracings EEGsbT1, for each possible pair of signals x and y (with x=1, . . . , n; y=1, . . . , n and x≠y) calculating a Permutation Jaccard Distance PJDX,Yw(sb) between signal EEGsb(x) and signal EEGsb(y) at both time T0 and time T1;d) for each first sub-tracing EEGsbT0 and each second sub-tracing EEGsbT1, performing a hierarchical Clustering to divide into clusters the signals of the respective sub-tracing according to their mutual Permutation Jaccard Distances;e) estimating a network density, when a fusion level FL varies, from the clusters obtained by the hierarchical Clustering, defining two curves NDT0(sb) and NDT1(sb) for each frequency sub-band (sb=delta, theta);f) calculating a percentage variation ΔND(sb) % of an area subtended by the two curves NDT1(sb) and NDT0(sb) for each frequency sub-band with the formula ΔND(sb) %=(NDT1(sb)−NDT0(sb))*100/NDT0(sb);g) verifying that said percentage variation ΔND(sb) % is negative for each frequency sub-band in a transition from T0 to T1 to confirm the conversion from mild cognitive impairment (MCI) to Alzheimer disease (AD).
2. The method according to claim 1, wherein between stage c) and stage d) for each first sub-tracing EEGsbT0 and each second sub-tracing EEGsbT1 the following is provided: calculating respective average values PJDT0X,Y(sb), PJDT1X,Y(sb) on all the epochs (w) for each possible pair of signals EEGsb(x) and EEGsb(y), said average values PJDT0X,Y(sb) and PJDT1X,Y(sb) defining dissimilarities DX,YT0(sb) and DX,YT1(sb) between signal EEGsb(x) and signal EEGsb(y) of each possible pair, at time T0 and at time T1, respectively;building two networks NETsb(Ti), with i=0, 1, a node “x” of which represents signal EEGsb(x) at time Ti, and a weight connecting the nodes “x” and “y” of the network NETsb(Ti) represents the dissimilarity between the pair of signals EEGsb(x) and EEGsb(y) at time Ti, thus creating two dissimilarity matrices DT0(sb) and DT1(sb), the (x, y)-th element of which is equal to DX,YT0(sb)=PJDX,YT0(sb) and DX,YT1(sb)=PJDX,YT1(sb) respectively.
3. The method according to claim 2, wherein in stage d) the hierarchical Clustering is performed, from the dissimilarity matrices DT0(sb) and DT1(sb), outputting two dendrograms showing a connection between the first signals EEGsb(1, . . . , n) at time T0 and a connection between the second signals EEGsb(1, . . . , n) at time T1, respectively, as a function of a fusion level FL, whereby for each dissimilarity matrix and for each fusion level FL, a set of clusters is determined.
4. The method according to claim 3, wherein in stage e), the following is provided for each fusion level FL, calculating the number of active connections ACFLT0(sb) and ACFLT1(sb) by summing the number of possible pairs of first signals EEGsb(1, . . . , n) at time T0, which result to be connected for a predetermined fusion level FL, and summing the number of possible pairs of second signals EEGsb(1, . . . , n) at time T1, which result to be connected for said predetermined fusion level, respectively;for each fusion level FL, estimating the network densities NDFLT0(sb) and NDFLT1(sb) by normalizing ACFLT0(sb) and ACFLT1(sb) with respect to the total number of possible connections equal to [n*(n−1)/2], where n is the number of signals.
5. The method according to claim 4, wherein the network densities NDFLT0(sb) and NDFLT1(sb) are estimated for different fusion levels from 0 to 1, preferably with steps of 0.01.
6. The method according to claim 1, wherein in stage c), before calculating the Permutation Jaccard Distance PJDX,Yw(sb) for each possible pair of signals EEGsb(x) and EEGsb(y), the following is provided: c1) for each possible pair of signals EEGsb(x) and EEGsb(y) which is mappable in a m-dimensional space, each signal EEGsb(x), EEGsb(y) having N time samples (t, t+1, t+N−1) in said epoch (w), detecting a plurality of symbols πi, πj, with i, j=1, . . . , m!, occurring in said epoch (w) for each sample (t, t+1, t+N−1);c2) for each sample (t, t+1, t+N−1), detecting the number of occurrences ηX(πi) of each symbol πi along the signal EEGsb(x), the number of occurrences ηY(πj) of each symbol πj along the signal EEGsb(y), and the number of joint occurrences ηX,Y(πi,πj) of the two symbols πi, πj along said signal EEGsb(x) and said signal EEGsb(y);c3) once the signals EEGsb(x), EEGsb(y) have been fully processed, estimating the occurrence probability pX(πi) of the symbol πi along the signal EEGsb(x), the occurrence probability pY(πi) of the symbol πi along the signal EEGsb(y) and the joint occurrence probability pX,Y(πi, πj) of the two symbols πi, πj along said signal EEGsb(x) and said signal EEGsb(y).
7. The method according to claim 6, wherein pX(πi)=ηX(πi)/[N−(m−1)L]pY(πi)=ηY(πi)/[N−(m−1)L] andpX,Y(πi,πj)=ηX,Y(πi,πj)/[N−(m−1)L]
8. The method according to claim 7, wherein for each epoch (w) the Permutation Jaccard Distance PJDX,Y(sb) between signal EEGsb(x) and signal EEGsb(y) is defined by the following relation PJDX,Y(sb)=1−PMI(X,Y)/PJE(X,Y),where PMI(X,Y) is the Permutation Mutual Information defined as PMI(X,Y)=PE(X)+PE(Y)−PJE(X,Y)
9. The method according to claim 1, wherein the hierarchical Clustering is performed by an agglomerative hierarchical Clustering algorithm, preferably a “complete linkage algorithm”, also referred to as “furthest neighbour”.
10. A The method according to claim 1, wherein in stage b) for each epoch (w) of the first tracing EEG-T0 four first sub-tracings EEGsbT0 are extracted, corresponding to a respective frequency sub-band (sb=delta, theta, alpha, beta), and for each epoch (w) of the second tracing EEG-T1, four second sub-tracings EEGsbT1 are extracted, corresponding to a respective frequency sub-band (sb=delta, theta, alpha, beta).
11. A The method according to claim 1, wherein for each epoch (w) the Permutation Jaccard Distance PJDX,Y(sb) between signal EEGsb(x) and signal EEGsb(y) is defined by the following relation PJDX,Y(sb)=1−PMI(X,Y)/PJE(X,Y),

Priority Claims (1)

Number	Date	Country	Kind
102018000002183	Jan 2018	IT	national

PCT Information

Filing Document	Filing Date	Country	Kind
PCT/IB2019/050742	1/30/2019	WO	00

METHOD FOR DETECTING A CONVERSION FROM MILD COGNITIVE IMPAIRMENT TO ALZHEIMER DISEASE

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)

PCT Information