This application claims priority under 35 USC § 119 to European Patent Application No. 22383301.3 filed on Dec. 28, 2022, the contents of which are herein incorporated by reference in their entireties.
The disclosure relates to a computer implemented method for selecting functional biomarkers to identify a target condition in a subject, such as attention-deficit/hyperactivity disorder (ADHD). The disclosure also relates to a classifier for identifying the target condition using the selected biomarkers and to a computer implemented method for identifying the target condition using the trained classifier.
Attention-deficit/hyperactivity disorder (ADHD) is recognized as a highly prevalent neurodevelopmental disorder in school-age children worldwide, often persisting into adolescence and adulthood, and frequently overlapping with other psychiatric comorbidities. It is currently accepted that ADHD is a complex, heterogeneous disorder, in which different expressions of impairment along with variable trajectories must be recognized in order to adopt personalized approaches that best target an individual. This is important because, despite serious distress/impairments, many patients lead rewarding and productive lives when properly managed.
Diagnosis of ADHD is even today based mainly on clinical signs and symptoms that require a detailed evaluation by an expert clinician through interviews with parents/caregivers and/or the patient, if applicable. Notably, diagnosis cannot be based solely on rating scales, neuropsychological tests or brain imaging. Despite criticisms that argue a risk of subjectivity, the current consensus supports the validity of the diagnostic criteria applied by well-trained professionals. However, even for a specialist, clinical evaluation is quite time-consuming and requires several visits to be thoroughly performed. Besides, the significant shortage of trained professionals also contributes to frequent delays in diagnosis or even to some cases being overlooked. From a developmental perspective, an early diagnosis is very likely to be of value for more effective pharmacological and psychosocial interventions. In this view, there is a need for objective biomarkers as useful adjunctive indicators to alleviate the workload of diagnosis and treatment follow-up.
Numerous studies have tried to assess ADHD through different objective diagnostic tools, most using functional or structural MRI and EEG, with other modalities (MEG, EKG, etc.) being deployed less frequently, and with an increasing use of artificial intelligence (AI) techniques.
Noticeable efforts in MRI and fMRI were made under the initiatives of the “ADHD-200 Consortium”. Despite significant advances in understanding abnormalities related to brain maturation and function, neuroimaging findings in ADHD research cannot yet be used to support clinical practice due to a variety of concerns.
An alternative tool to assess ADHD that is worth exploring is functional near-infrared spectroscopy (fNIRS), which is characterized by being noninvasive, wearable, cost-effective, and deployable in more friendly/ecological settings. fNIRS has shown its usefulness in monitoring functional hemodynamic changes associated with cortical brain activation. Compared to other neuroimaging modalities, few fNIRS studies have been conducted to differentiate children with ADHD from healthy controls, some of them trying to improve classification by combining different modalities (e.g. EEG+fNIRS). Even fewer studies have focused on unimodal approaches using “exclusively” NIRS data. For example, Monden et al. (2015, “Individual classification of ADHD children by right prefrontal hemodynamic responses during a go/no-go task as assessed by fNIRS”) reported a classification accuracy of 85% with a sensitivity of 90% by analyzing ROC curves obtained from right prefrontal oxy-Hb activation data during a go/no-go task.
Using prefrontal cortex activation measures during an N-back task, Crippa et al. (2017, “The utility of a computerized algorithm based on a multi-domain profile of measures for the diagnosis of attention deficit/hyperactivity disorder”) achieved mean accuracies of 78% with 72% sensitivity and 82% specificity when a support vector machine (SVM) classifier was trained on data from deoxy-Hb. Also employing an N-back task and an SVM, Gu et al. (2018, “Identifying ADHD children using hemodynamic responses during a working memory task measured by functional near-infrared spectroscopy”) reached 86% accuracy with oxy-Hb data measured in the prefrontal and temporal cortex. It is worth noting that no correction for components of non-cerebral origin was applied to the fNIRS signals in the aforementioned studies. Such a correction is especially important when scanning the prefrontal cortex through the forehead, since functional extracerebral and cerebral responses are interrelated processes that overlap in fNIRS recordings, with a greater confounding effect for oxy-Hb.
Notwithstanding this known drawback of fNIRS, classification algorithms may achieve appreciable performance by learning some type of feature representation from the uncorrected NIRS data, but uncertainty about the nature and origin of the features hampers the interpretability of predictive models. In these studies, the features were based on some kind of measurement from the averaged fNIRS data across trials/epochs, a classic approach that, while often providing robust results, fails to uncover finer distinctive patterns embedded in the data.
It is also known that a rhythmic mental arithmetic task successfully induced cyclical hemodynamic fluctuations coupled to the task frequency (33 mHz), and that the oscillatory patterns were consistent across individuals both in superficial and deep fNIRS signals recorded in the frontopolar region. Spectral analysis also showed oscillatory activity at lower frequencies (<33 mHz) seen at rest and during the mental task, with a prominent peak around 5-10 mHz. Resting-state fMRI studies have reported that ADHD patients show significant differences in the low-frequency oscillations (LFO; 10-80 mHz) band across multiple brain regions, with separable contributions of specific frequency sub-bands including extra-low frequencies (0-10 mHz). These differences have been related to abnormalities in the functioning of the salience, attentional and default-mode networks, but the inconsistencies observed across many studies point to a large heterogeneity in spontaneous brain activity in ADHD. Despite this, evidence suggests that some characteristics of ADHD brain activity are sensitive to specific frequency bands.
Computer-aided diagnosis of attention-deficit/hyperactivity disorder (ADHD) aims to provide useful adjunctive indicators to support more accurate and cost-effective clinical decisions. Deep machine-learning (ML) techniques are increasingly used to identify neuroimaging-based features for objective assessment of ADHD. Despite promising results in diagnostic prediction, substantial barriers still hamper the translation of the research into daily clinical practice. Few studies have focused on functional near-infrared spectroscopy (fNIRS) data to differentiate the ADHD condition at the individual level.
It is therefore an objective of the present disclosure to provide a computer-implemented method for selecting functional biomarkers to identify a target condition in a subject, wherein said target condition may be, for example, attention-deficit/hyperactivity disorder (ADHD).
It is also an objective of the present disclosure to provide a classifier trained using the selected functional biomarkers, so that it may be used for classification, contributing to the objective assessment of target conditions such as ADHD through affordable computer-aided tools deployable in many clinical settings.
According to an embodiment of the disclosure, there is provided a computer implemented method for selecting functional biomarkers to identify a target condition, such as attention-deficit/hyperactivity disorder, in a subject. The method includes obtaining groups of equivalent physiological signals generated during a stimulation, including a first group of physiological signals from subjects presenting the target condition, and a second group of physiological signals from subjects of control condition, not presenting the target condition; identifying at least one frequency sub-band for each group of physiological signals, where the frequency sub-band can be a single frequency sub-band, for example a frequency sub-band between 0.0025 Hz and 0.145 Hz or one encompassing the whole frequency range of a scalogram of the first and second group of physiological signals, or a plurality of frequency sub-bands corresponding to frequencies that show higher group synchronization; obtaining per each identified frequency sub-band a corresponding pattern, where the pattern can be a time series pattern, in the domain of time, or a frequency series pattern, that is, a spectral pattern, in the domain of frequency; and selecting one or more patterns having similarity values to the physiological signals of each group that differentiate the groups of physiological signals as functional biomarkers to identify the target condition, for example for training a classifier of the target condition for identifying the target condition from a physiological signal obtained from a subject during the same stimulation.
According to an advantageous aspect, the computer implemented method allows selecting patterns, either time series patterns or frequency series patterns, as functional biomarkers to develop a methodological approach for effective identification of a target condition in a subject, such as training a classifier, for example a machine learning classifier of those known in the state of the art. This classifier may then be used in a computer implemented method for identifying whether a subject from which an equivalent physiological signal is obtained during the same stimulation is more likely to be a subject presenting the target condition or a subject not presenting the target condition, that is, a subject presenting the control condition.
According to an embodiment of the disclosure, identifying at least a frequency sub-band comprises identifying a plurality of frequency sub-bands for each of the first group and the second group, the plurality of frequency sub bands corresponding to frequencies that show a higher group synchronization than other frequencies; and the obtaining, for each of the at least one frequency sub band, at least a corresponding pattern comprises obtaining, for each of the plurality of frequency sub bands, a corresponding time series pattern; and selecting, as functional biomarkers to identify the target condition, comprises selecting, as functional biomarkers to identify the target condition, one or more of the time series patterns having similarity values corresponding to the first physiological signals of the first group or the second physiological signals of the second group that differentiate the first physiological signals of the first group and the second physiological signals of the second group.
According to an embodiment of the disclosure, the stimulation is a periodical task having a task frequency, the frequency sub-bands including the task frequency.
According to an embodiment of the disclosure, the physiological signal is one of a NIRS signal, a functional magnetic resonance imaging-blood-oxygen-level-dependent (fMRI-BOLD) signal, or an electroencephalography (EEG) signal, or a hemodynamic signal such as a blood pressure or flow signal. The method may also be performed simultaneously for different physiological signals, therefore obtaining functional biomarkers corresponding to physiological signals of different kind. Moreover, the selected functional biomarkers may be combined for training the classifier.
According to an embodiment of the disclosure, the physiological signal is a cerebral signal, obtained in a same region of the brain, in response to relative or absolute changes in the concentration of a hemoglobin chromophore, so the cerebral signal may be externally obtained with devices known by the skilled person.
According to an embodiment of the disclosure, the identifying the frequency sub-band includes, per each group: obtaining a time-frequency transform of each physiological signal; averaging the time-frequency transforms; extracting the local maxima of the averaged time-frequency transforms; and identifying the frequency sub-bands for the group as frequency ranges around the local maxima using a suitable predefined criterion, such as the range between the local minima around each local maximum. Other suitable predefined criteria could also be used, such as setting a percentage of height descent on both sides of the local maximum, for example, 50% of the local-maximum height. The physiological signals are expected to be adapted as necessary, and signals obtained from the physiological signals, such as transforms, may also be used. The number of frequency sub-bands may also be different per each group. For example, frequency sub-bands could be limited to those presenting a bandwidth within a certain threshold, or to those with a local maximum within a certain threshold, or a predefined number of frequency sub-bands could be selected based on the best bandwidth and local-maximum values. It is also envisaged, when using multiple physiological signals, to combine frequency sub-bands obtained from multiple physiological signals in case several physiological signals of similar frequency response are used, such as similar cerebral signals obtained from different regions of interest, so that the common most relevant frequency sub-bands are identified, for example by averaging frequency sub-bands or selecting the combined maxima of the frequency sub-bands.
According to an embodiment of the disclosure, the time-frequency transform is a continuous wavelet transform. Other transforms known by the skilled person, such as the short-time Fourier transform or the Wigner-Ville transform, could also be used.
According to an embodiment of the disclosure, the identifying the frequency sub-bands includes calculating an inter-subject synchronization measure, and identifying the frequency sub-bands for the group as frequency ranges around the local maxima. The inter-subject synchronization measure is one of a phase-synchronization measure, a coherence measure, or a correlation measure, among the measures known by the skilled person.
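By way of illustration only, the local-maxima criterion described above may be sketched as follows. This is a minimal sketch under stated assumptions, not the implementation of the disclosure: the function name find_sub_bands, the frequency grid and the profile values are all hypothetical, and the profile stands for any per-frequency group measure (for example an averaged transform magnitude).

```python
def find_sub_bands(freqs, profile):
    """Return one (f_low, f_peak, f_high) tuple per interior local maximum.

    Hypothetical helper: each band is bounded by the nearest local minima
    on either side of the peak (or by the ends of the profile).
    """
    n = len(profile)
    bands = []
    for i in range(1, n - 1):
        # Strict local maximum: higher than both neighbours.
        if profile[i] > profile[i - 1] and profile[i] > profile[i + 1]:
            lo = i
            while lo > 0 and profile[lo - 1] < profile[lo]:
                lo -= 1  # walk downhill to the left trough
            hi = i
            while hi < n - 1 and profile[hi + 1] < profile[hi]:
                hi += 1  # walk downhill to the right trough
            bands.append((freqs[lo], freqs[i], freqs[hi]))
    return bands


# Illustrative (made-up) profile sampled at frequencies given in mHz.
freqs_mhz = [10, 20, 30, 40, 50, 60, 70]
profile = [0.2, 0.5, 0.3, 0.1, 0.4, 0.9, 0.6]
bands = find_sub_bands(freqs_mhz, profile)
```

With these made-up values, two sub-bands are found that share the trough at 40 mHz. A percentage-of-height criterion, as also mentioned above, would replace the two downhill walks with a threshold test.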
According to an embodiment of the disclosure, the inter-subject synchronization measure includes a phase synchronization measure.
According to an embodiment of the disclosure, the inter-subject synchronization measure includes a combination of a phase and magnitude synchronization measure.
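As a hedged illustration of a combined phase and magnitude measure, the sketch below assumes a linear-coherence style normalization: the magnitude of the summed complex coefficients divided by the square root of the number of subjects times the summed squared magnitudes. The function name inter_subject_sync and the example coefficients are hypothetical; the input is assumed to be the complex time-frequency coefficients of all subjects in a group at a single time-frequency point.

```python
import cmath
import math


def inter_subject_sync(coeffs):
    """Synchronization in [0, 1] across subjects at one time-frequency point.

    Assumed form: |sum(W_k)| / sqrt(n * sum(|W_k|^2)), so coefficients with
    larger magnitude weigh more than low-amplitude ones.
    """
    n = len(coeffs)
    energy = sum(abs(c) ** 2 for c in coeffs)
    if energy == 0.0:
        return 0.0  # no signal at all: define synchronization as zero
    return abs(sum(coeffs)) / math.sqrt(n * energy)


# Five subjects with identical phase and magnitude: full synchronization.
aligned = [cmath.rect(1.0, 0.3)] * 5
# Two subjects in exact anti-phase: synchronization cancels out.
opposed = [cmath.rect(1.0, 0.0), cmath.rect(1.0, math.pi)]

iss_aligned = inter_subject_sync(aligned)
iss_opposed = inter_subject_sync(opposed)
```

Applying the function to every time point and every scale of a group's coefficient matrices would yield a synchronization map in the time-frequency plane.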
According to an embodiment of the disclosure, the obtaining per each identified frequency sub-band a corresponding time-series pattern per each group includes: computing the inverse time-frequency transform from the calculated time-frequency transform of the physiological signals per each frequency sub-band to obtain the corresponding, band-limited, time-series for each member of the group; and generating the time-series pattern of the group by averaging the obtained time-series for each member of the group.
According to an embodiment of the disclosure, the step of selecting the one or more time-series patterns that better differentiate the groups of physiological signals includes calculating time-series similarity measures, such as Euclidean, Mahalanobis or Cityblock distances, elastic measures such as Dynamic Time Warping, or correlation methods. Similarity is calculated between each candidate time-series pattern and the time-series of each member of the group, and the time-series patterns with similarity measures that better separate the groups are selected as functional biomarkers.
According to an embodiment of the disclosure, the best time-series patterns are selected based on statistical contrast at a predefined significance level, such as the F-statistic for the analysis of variance, so only the most relevant time-series patterns are selected, thus improving the computational efficiency of the classifier. Other selection criteria for the best time-series patterns are also envisaged; performing no selection and using all available time-series patterns is also contemplated.
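The pattern generation, similarity and statistical contrast steps may be sketched together as below. This is an illustrative sketch only: the helper names (mean_pattern, euclidean, f_statistic) and the toy series are hypothetical, the Euclidean distance is just one of the similarity measures listed above, and the comparison against a significance level is left out.

```python
import math


def mean_pattern(series):
    """Point-by-point average of equal-length time-series (group pattern)."""
    n = len(series)
    return [sum(col) / n for col in zip(*series)]


def euclidean(a, b):
    """Euclidean distance between two equal-length time-series."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def f_statistic(groups):
    """One-way ANOVA F: between-group over within-group mean squares."""
    values = [v for g in groups for v in g]
    grand = sum(values) / len(values)
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand) ** 2 for g, m in zip(groups, means))
    ss_within = sum((v - m) ** 2 for g, m in zip(groups, means) for v in g)
    if ss_within == 0.0:
        return math.inf  # groups perfectly separated with no spread
    df_between = len(groups) - 1
    df_within = len(values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


# Made-up band-limited series for two groups of three subjects each.
group_a = [[0, 1.0, 0, -1.0], [0, 1.2, 0, -0.8], [0, 0.8, 0, -1.2]]
group_b = [[0, -1.0, 0, 1.0], [0, -0.9, 0, 1.1], [0, -1.1, 0, 0.9]]

pattern_a = mean_pattern(group_a)                     # candidate pattern
dist_a = [euclidean(pattern_a, s) for s in group_a]   # own-group distances
dist_b = [euclidean(pattern_a, s) for s in group_b]   # other-group distances
f_value = f_statistic([dist_a, dist_b])               # discriminative power
```

In this toy case the pattern of the first group lies much closer to its own members than to the members of the other group, so the F value is large and the pattern would be retained as a candidate biomarker.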
According to an embodiment of the disclosure, the target condition may be one of: a target mental condition, target brain disorder condition, target body physiological condition, or a combination of them. For example, the target condition may be one of: attention-deficit/hyperactivity disorder, mild-cognitive impairment, neurodegenerative disorders as Alzheimer's disease, dysautonomia disorders, depression, or anxiety. However, other target conditions may also be envisaged.
According to another embodiment of the disclosure, there is provided a classifier for identifying the target condition in a subject trained using the similarity values to the functional biomarkers. The classifier using the selected time-series patterns as functional biomarkers allows classifying if a physiological signal of a subject is likely to be a physiological signal from a subject presenting the target condition or from subjects of control condition, not presenting the target condition.
According to another embodiment of the disclosure, there is provided a computer implemented method for identifying the target condition in a subject using the trained classifier. The method includes obtaining the equivalent physiological signal from a subject during the same stimulation previously used for selecting the functional biomarkers; and processing the physiological signal in the classifier for identifying the target condition in the subject. Also, according to another embodiment of the disclosure, there are provided computer programs including instructions that when executed cause a machine to perform the previous computer implemented methods.
According to another embodiment of the disclosure, there is provided a method of diagnosis of the target condition in a subject using the computer implemented method described above. For example, once the classifier is trained for identifying the target condition, the target condition can be detected in any subject. The target condition may be a target mental condition, target brain disorder condition, target body physiological condition, or a combination of them, such as attention-deficit/hyperactivity disorder, mild-cognitive impairment, neurodegenerative disorders such as Alzheimer's disease, dysautonomia disorders, depression, or anxiety.
As a complement to the description provided herein and for the purpose of helping to make the characteristics of the disclosure more readily understandable, this specification is accompanied by a set of drawings, which by the way of illustration and not limitation, represent the following:
As shown in
Once a time-series pattern P is obtained per each frequency sub-band A, B, C, D of each of the first group 3a and the second group 3b, the method further includes selecting the one or more representative time-series patterns P having similarity values d to the physiological signals 1 of each group that differentiate the groups of physiological signals 1 as functional biomarkers F to identify a target condition ADHD, that may be used for training a classifier CL of the target condition ADHD.
A detailed embodiment related to a computer implemented method for selecting functional biomarkers to identify a target condition of ADHD in a subject will be explained hereafter. Naturally, other target mental or brain disorder conditions, target body physiological conditions or combinations of them could be used as a target condition for selecting the corresponding functional biomarkers for that target condition. Therefore, the target condition may be a target mental condition, target brain disorder condition, target body physiological condition, or a combination of them, such as attention-deficit/hyperactivity disorder, depression, or anxiety.
In this embodiment, groups of equivalent physiological signals generated during a stimulation S will be obtained. For example, a first group 3a of physiological signals 1 from one or more subjects presenting the target condition ADHD, and a second group 3b of physiological signals from one or more subjects of control condition TD, not presenting the target condition, may be obtained. Although a single equivalent physiological signal could be used, in the described embodiment several equivalent physiological signals will be used, so functional biomarkers could be extracted from each of the equivalent physiological signals. According to an example embodiment, equivalent physiological signals may mean signals of the same type or kind. For example, in the embodiment, two kinds of cerebral signals (shallow-signal SS and clean-signal CS), from three different regions of interest (left, middle, right) and corresponding to the concentration of two chromophores, are considered. Therefore, twelve equivalent physiological signals will be used for obtaining the functional biomarkers F for each equivalent signal that separate the group of subjects presenting the target condition (ADHD) and the group of subjects of control condition (TD), not presenting the target condition.
According to an example embodiment, for recording and processing the physiological signal, such as an fNIRS signal, analysis may be performed on superficial and regression-corrected deep fNIRS signals recorded from the forehead of a subject 2 through a multi-distance, multi-channel device 4. According to an example embodiment, a device 4 as illustrated in
Using such device 4, relative concentration changes in oxy- (HbO) and deoxy-hemoglobin (HbR) may be computed for different regions of interest. For example, relative concentration changes in oxy- (HbO) and deoxy-hemoglobin (HbR) may be computed for a left region, a middle region and a right region. Therefore, physiological signals may be external cerebral signals from three regions of interest (ROI), right, medial and left, for two chromophores (HbO and HbR) and two signal types (shallow-signal SS and clean-signal CS), that is, twelve physiological signals may be considered. However, the disclosure is not limited to the example regions illustrated in
According to an example embodiment, identification of the frequency components with potential capacity to differentiate between the two groups of participants (i.e., having the control condition TD and having the target condition ADHD) may be one of the features for identifying ADHD in a subject, and will be discussed in detail below.
To this end, a suitable method for locating stimulation or task-related oscillations may be used on different time scales (i.e., frequency bands), appropriate for non-stationary signal analysis, and capable of providing some measure of similarity to define class membership. A data-driven approach based on a time-frequency transform, such as the complex continuous wavelet transform (CWT), and time-scale synchronization detection may be used.
The CWT is a time-frequency transform signal processing method that provides a time-frequency (or time-scale) representation of the characteristics of a signal based on the dilation and translation of a mother wavelet function. The CWT may be viewed as a bandpass filter with varying bandwidths automatically defined by the wavelet scale, which avoids the drawbacks of using custom filters.
According to an example embodiment, generalized Morse wavelets may be used to compute the complex continuous wavelet transform CWT. Generalized Morse wavelets are a flexible superfamily of exactly analytic wavelets that is particularly useful for analyzing signals with time-varying amplitude and frequency, i.e. modulated signals. Since Morse wavelets may be tuned to encompass many other commonly used analytic wavelets, they provide a unified framework as a reference point.
Depending on the kind of physiological signal, the range of frequencies of interest may be adapted, for example 0 to 100 mHz for NIRS or fMRI; 0 to 40 Hz for EEG; and 0 to 600 Hz for electromyogram. Also, the range of frequencies of interest may be determined by the Nyquist frequency of the sampling rate, as known by the skilled person. In the example embodiment as illustrated in
Since group-wise synchronization may appear as transient peaks rather than constantly, a time-point-by-time-point analysis may be performed, which allows capturing common oscillatory patterns that evolve dynamically over time. For example, an inter-subject correlation (ISC) analysis may be performed, which is a data-driven approach for assessing consistent neural responses to stimuli across subjects.
According to an example embodiment, instantaneous inter-subject synchronization (ISS) may be measured using the magnitude and phase information provided by the complex-valued CWT coefficients. In fMRI studies, measures such as inter-subject phase synchronization and pairwise phase consistency have been validated for the assessment of voxel-wise instantaneous phase synchronization across subjects. However, these measures rely only on the uniformity of phase angles, ignoring the magnitude. Thus, when applied to fNIRS, low-amplitude signals may affect the measurement as much as high-amplitude signals. Therefore, this approach may not be entirely appropriate for fNIRS data, where amplitude changes are related to the magnitude of the hemodynamic response. Since the signal-to-noise ratio improves as amplitude increases, it is reasonable to argue that observations with higher amplitudes may contribute to a more realistic estimate of phase synchronization. As such, according to an example embodiment, an ‘inter-trial linear coherence’ measure may be used, which combines magnitude and phase in the normalization operation. Since the measurement here is across subject observations and not across trials, this measurement may be referred to as the ‘inter-subject synchronization’ (ISS) measure throughout the disclosure, omitting ‘linear’ for simplicity. However, the disclosure is not limited thereto. According to an embodiment, the ISS measure may include a combination of a phase and magnitude synchronization measure, formulated as:
The ISS may be computed moment-to-moment for each scale from the CWT coefficient matrices of each group (i.e. 15 participant observations per scale). The analysis may be limited to the time-interval between −30 seconds and +30 seconds around the task. However, the disclosure is not limited thereto, and as such, the parameters for computation may be varied according to other example embodiments. According to an embodiment, the ISS procedure may be applied independently to the shallow-signal SS and clean-signal CS data of each group of subjects for each chromophore and ROI. However, the disclosure is not limited thereto, and as such, the ISS procedure may be applied in various manners. An ISS representation in the time-frequency plane that may be visualized as an ISS map is illustrated in
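The expression is not reproduced in this text. As a sketch only, a linear-coherence measure taken across n subject observations is commonly written in the following form, where the symbols are assumptions introduced here for illustration: W_k(s, t) denotes the complex CWT coefficient of subject k at scale s and time t.

$$
\mathrm{ISS}(s,t) = \frac{\left|\sum_{k=1}^{n} W_k(s,t)\right|}{\sqrt{\,n \sum_{k=1}^{n} \left|W_k(s,t)\right|^{2}}}
$$

Under this assumed form the measure ranges from 0 to 1, and coefficients with larger magnitude contribute more to the result than low-amplitude coefficients, in line with the rationale given above.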
The maximum ISS observed along each scale may be chosen, which represents the highest group synchronization achieved at each specific frequency, as can be also seen in
Therefore, in
As shown in
Once frequency sub-bands A, B, C, D are identified, the computer implemented method may further include obtaining per each identified frequency sub-band A, B, C, D a corresponding time-series pattern P and selecting the one or more representative time-series patterns P having similarity values d to the physiological signals 1 of each group that differentiate the groups of physiological signals 1 as functional biomarkers F to identify the target condition ADHD, and that may be used during the training of a classifier CL of the target condition ADHD. For example, the method may include computing an inverse time-frequency transform IT from the calculated time-frequency transform T of the physiological signals 1 per each frequency sub-band A, B, C, D to obtain a corresponding time-series t for each member of the group, that is, for each subject of the first group 3a and the second group 3b, and generating a time-series pattern P of the group by averaging the time-series t obtained from the inverse time-frequency transform IT for each member of the group.
Once one or more time-series patterns P are obtained, selecting the one or more time-series patterns P that better differentiate the groups of physiological signals 1 may include calculating time-series similarity values d between each candidate time-series pattern P and the time-series t of each member of the group, and selecting the time-series patterns P with similarity values d that better differentiate the groups as functional biomarkers F. The time-series patterns P may be selected as functional biomarkers F based on statistical contrast at a predefined significance level.
Like the other aforementioned synchronization measures, ISS is a compound measure that does not exist on its own at a single-subject level but represents a summary statistic of group synchronization. Therefore, to disentangle the contribution to ISS of each individual is not a straightforward issue.
However, ISS peaks suggest that some frequency components show similar time courses across individuals, at least within certain time intervals. In other words, there are sequential patterns common to the group that may provide distinctive information to define class membership. This concept may be referred to as time-series classification, which encompasses a variety of techniques for identifying those properties (features) that have sufficient differentiating power to distinguish between different classes of time series. In the context of the example embodiment of the disclosure, a well-suited technique could be the one based on the shapelet framework, which addresses the classification problem by discovering primitive time-series sequences (shapelets) that are used to quantify the similarity between classes of time series. Shapelets provide directly interpretable information about patterns (shapes) that are important for understanding how data classes differ, a desirable property for clinical decision support systems.
According to an example embodiment, a basic shapelets technique may be applied. However, instead of looking for phase-independent subsequences similar in shape (i.e. subsequences that may be located anywhere in the series), the analysis may be performed within a fixed time interval, with all subsequences having the same length. According to an example embodiment, subsequence translation over time may not be applied. This may mean that time-series similarity also depends on the phase (i.e. on a consistent time-alignment). Therefore, instead of local patterns, global patterns present over a whole time interval may be captured. Under this approach, the term “time-series pattern” may be used instead of “shapelet”. However, the disclosure is not limited to the use of this term.
According to an example embodiment, the method may further include extracting the time-series to be used for identifying representative time-series patterns P. The average ISS patterns suggest the frequencies that are likely to contain synchronized oscillations. By computing the inverse time-frequency transform IT as an inverse CWT within the specific frequency sub-band A, B, C, D defined by the bounds of an ISS peak, band-limited components in the time domain may be reconstructed. To reduce edge effects, the inverse CWT may be computed from the extended coefficient matrix that was reserved in a previous operation. Then, the resulting time-series were truncated to the interval between −30 seconds and +30 seconds around the task. After applying this procedure to the CWT of all the individuals belonging to a group, a set of time-series (n=15) may be obtained to find a reference time-series pattern for that group in a particular sub-band.
Since all the time-series have the same length and are within the same timeframe, a suitable reference time-series pattern may be obtained simply by averaging. If the time-series share a common pattern, their average should represent the group well enough. To quantify similarity with the reference time-series pattern, among other possibilities, a simple measure such as the Euclidean distance may be computed:
In order to assess the capability of the time-series pattern to differentiate between groups, the distances obtained from one group were contrasted with those of the other group. For example, a time-series pattern that is representative of the control condition TD group should have smaller distances to members of this group than to members of the target condition ADHD group, and vice versa.
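By way of a non-limiting illustration, the averaging and distance computation may be sketched as follows. The sketch assumes the standard point-wise mean and the standard Euclidean distance over equal-length series; the function names `reference_pattern` and `euclidean_distance` and the toy values are hypothetical and are not taken from the disclosure.

```python
import math

def reference_pattern(series_group):
    """Point-wise average of equal-length time-series (hypothetical helper)."""
    length = len(series_group[0])
    if any(len(s) != length for s in series_group):
        raise ValueError("all time-series must have the same length")
    n = len(series_group)
    return [sum(s[t] for s in series_group) / n for t in range(length)]

def euclidean_distance(series, pattern):
    """Euclidean distance between one time-series and a reference pattern."""
    return math.sqrt(sum((x - p) ** 2 for x, p in zip(series, pattern)))

# Toy group of two band-limited time-series (illustrative values only).
group = [[0.0, 1.0, 0.0], [0.0, 3.0, 0.0]]
pattern = reference_pattern(group)
distances = [euclidean_distance(s, pattern) for s in group]
```

Because no translation over time is applied, each sample is compared only with the sample of the pattern at the same time index, so the distance is sensitive to phase as well as to shape.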
Among other quality measures, the F-statistic for analysis of variance may be used to assess the discriminative power of a time-series pattern. This statistic indicates the ratio of the between-group variability to the within-group variability as:
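By way of a non-limiting illustration, the ratio may be sketched as the usual one-way analysis-of-variance F-statistic, i.e. the between-group mean square divided by the within-group mean square. The sketch assumes this standard definition; the function name `f_statistic` and the toy distance values are hypothetical.

```python
def f_statistic(*groups):
    """One-way ANOVA F: between-group variability over within-group variability."""
    k = len(groups)                          # number of groups
    n_total = sum(len(g) for g in groups)    # total number of observations
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))

# Toy distances to one reference pattern (illustrative values only):
distances_td = [1.0, 2.0, 3.0]     # members of the group the pattern represents
distances_adhd = [4.0, 5.0, 6.0]   # members of the other group
f_value = f_statistic(distances_td, distances_adhd)
```

A larger value indicates that the distances of the two groups to the pattern are well separated relative to their spread within each group.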
According to an example embodiment, based on the average ISS patterns, four candidate frequency sub-bands may be identified in each of them and labeled A, B, C and D in decreasing order of frequency. However, the disclosure is not limited thereto, and as such, the number of candidate frequency sub-bands may be different than four. Each sub-band may contain a peak (local maximum) flanked by two troughs (local minima) that delimit the frequency boundaries. For each ISS pattern, each belonging to a target group (TD or ADHD), the following procedure may be performed for each sub-band, as indicated in
According to an example embodiment, the time-series pattern approach is illustrated to transform data observations at different time-scales into a simple feature space of Euclidean distances, but the disclosure is not limited thereto, and as such, other feature type approaches in the state of the art could be used to transform data observations without deviating from the example embodiment of the disclosure.
To assess the feasibility of the procedure to differentiate between control condition TD and target condition ADHD, four well-suited classifiers CL, machine learning algorithms for supervised binary classification, namely linear support vector machine (SVM), logistic regression (LR), linear discriminant analysis (LDA) and Gaussian naïve Bayes (NB) were tested.
These algorithms were selected because they are well known, inherently interpretable, computationally efficient, and may work with relatively small sample sizes. Under a variety of flavors (different kernel, regularization, etc.), SVM is very frequently present in neuroimaging-based studies of brain disorders, with LDA and LR being the other most popular choices. According to an embodiment, NB may be included due to its ease of application and good performance in a variety of applications despite the assumption of feature independence.
SVM differentiates classes by finding the hyperplane that maximizes the separation (margin) between their points. LR does so through a logistic function (sigmoid) that models the dependent variable and maps predicted values into probabilities of belonging to one class or another. LDA assumes that the predictors come from a Gaussian mixture distribution and uses discriminant functions to estimate the probability that they are from each class. NB estimates the probability density of predictors given a class by independently mapping them onto separate normal distributions fitted to each class. Although based on different models, discriminative (LR & SVM) vs generative (LDA & NB), all four are within the linear classifier category, i.e. to make predictions, the classifiers try to learn the line that best separates the points of the two classes.
In addition to the putative functional response, fNIRS signals also contain components originating from common systemic forces and unpredictable local activity. Therefore, it is very likely that the feature matrix also contains redundant and/or irrelevant data that may degrade classifier performance because of overfitting and noise issues. Model regularization may be applied to some algorithms to account for statistical overfitting; however, that raises the problem of choosing a suitable technique (e.g. lasso) and finding appropriate regularization parameters. To avoid increasing the complexity of the models, the problem may be addressed by reducing the feature space. Feature selection is a commonly used tool to obtain a smaller subset of the most relevant features, reducing complexity while improving classification accuracy and generalization capacity. A reasonable hybrid approach is to first apply a filter method, before modeling, to select some features based only on their intrinsic properties. Then more sophisticated methods such as wrappers may be employed to find the best subset of features, using the classifier itself as evaluator. Among others, a benefit of such selection is an easier explanation of the prediction because the models are simpler.
A wide variety of filter methods have been developed, each based on specific criteria (information, similarity, etc.) to evaluate features. According to an example embodiment, the filter may be selected based on the F-statistic. For example, the features that showed p-values<0.01 were selected for testing purposes, assuming that their generating time-series patterns were very unlikely to separate the groups by chance. In this way, the features may be significantly reduced from 48 to 5 for SS and from 48 to 10 for CS. However, the disclosure is not limited thereto, and as such, according to another embodiment, filters may be selected based on relief, minimum redundancy-maximum relevance, chi-square, etc.
According to an example embodiment, the classification methods may be first applied to filter-selected features separately for SS and CS, and then for all of them together (SS+CS). To assess the predictive performance, two cross-validation (CV) techniques were applied for comparison purposes, namely leave-one-out (LOO) and stratified 5-fold. In the former, data was partitioned into 30 folds where each observation was used once as a test set and the remaining ones formed the training set. In the latter, five partitions were randomly chosen, each with 24 observations as the training set and 6 as the test set; folds were repartitioned over 20 Monte-Carlo repetitions (5×20=100 models) to reduce CV variance, while stratification ensured that sets had the same proportion of classes (50% in an example case). Also, 5-fold instead of 10-fold was used because with the latter the test set size=3 would be too close to that of LOO=1. Moreover, since only two classes and well-balanced datasets (i.e. equal proportion of both classes) are used, accuracy, specificity and sensitivity may be used as metrics to assess performance, as obtained from the corresponding confusion matrices and then averaged across folds. At this point, accuracy (a commonly used metric in practice) was the metric considered to test whether classification performance was statistically significant. Thus, the theoretical above-chance accuracy threshold based on the binomial cumulative distribution at p<10^-3 for 2 classes (probability=0.5) and a sample size=30 was computed.
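By way of a non-limiting illustration, the above-chance threshold and one stratified 5-fold partition may be sketched as follows. The sketch assumes that the threshold is the smallest number of correct predictions whose upper binomial tail probability falls below the significance level; the function names `above_chance_threshold` and `stratified_folds` and the seed are hypothetical.

```python
import math
import random

def above_chance_threshold(n, p_chance=0.5, alpha=1e-3):
    """Smallest k such that P(X >= k) < alpha for X ~ Binomial(n, p_chance)."""
    tail = 0.0
    for k in range(n, -1, -1):
        tail += math.comb(n, k) * p_chance ** k * (1 - p_chance) ** (n - k)
        if tail >= alpha:
            return k + 1   # k is no longer significant, so k + 1 is the threshold
    return 0

def stratified_folds(labels, n_folds, rng):
    """Split sample indices into folds that keep the class proportions."""
    folds = [[] for _ in range(n_folds)]
    for cls in sorted(set(labels)):
        indices = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(indices)
        for position, index in enumerate(indices):
            folds[position % n_folds].append(index)
    return folds

threshold = above_chance_threshold(30)   # 24 correct out of 30, i.e. 80% accuracy
labels = [0] * 15 + [1] * 15             # 15 TD and 15 ADHD observations
folds = stratified_folds(labels, 5, random.Random(0))
train_sizes = [len(labels) - len(test_fold) for test_fold in folds]
```

Repeating `stratified_folds` with 20 different random states would give the 5×20=100 train/test partitions mentioned above.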
Afterwards, a wrapper method, such as a Sequential Forward Floating Selection (SFFS) algorithm, may be applied to fine-tune the feature selection. SFFS starts with an empty set and sequentially adds one feature at a time to create candidate subsets that are evaluated by cross-validation. After that, the best feature is added to the set. When the size of the selected set is >2, a backward step tries to optimize it by removing one or more features. This procedure is repeated until there is no performance improvement. The two aforementioned cross-validation techniques were also applied independently to each classifier. Notably, the order in which the finally selected features were added indicates their relative importance. According to an embodiment, the wrapper can rank features by multiple metrics at the same time, and as such, two criteria may be used to select/remove features: specificity and then accuracy. Thus, if two (or more) features equally improve the specificity, the one with the best accuracy is selected.
Once the best subset of features was selected for each classifier, the statistical significance of the observed performance may be estimated through a non-parametric label permutation procedure that does not assume any particular statistical property of the data. For example, for the testing, 5000 resamples were generated, each of which randomly permuted the labels of the two classes, thereby realizing the null hypothesis that features do not define class membership. For each resampled dataset, the classification performance was evaluated using the same cross-validation scheme as for the real data. The observed performance metrics were ranked against the corresponding null distribution to estimate a p-value. In addition, a 95% bias-corrected percentile interval was estimated as confidence interval for each metric by bootstrapping (with replacement) over 2000 resamples, with each realization keeping the same proportion of classes (50%) and at least three distinct observations in each class.
In addition, according to an embodiment, the testing may include checking whether the wrapper performance might have been biased due to the use of a pre-filtered feature set that included all data in the selection process, i.e. the “peeking” effect. To this end, during the testing, the wrapper procedure was repeated using the full feature set (i.e. 96 features), and the performance outcomes were then compared.
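By way of a non-limiting illustration, a simplified floating selection loop may be sketched as follows. The sketch assumes that the score of a subset is returned as a (specificity, accuracy) tuple, so that ordinary tuple comparison ranks by specificity first and accuracy second; in practice the score would come from cross-validating the classifier. The function names `sffs` and `toy_score`, the feature names and the gain values are hypothetical, and the backward step is a simplification of the full SFFS algorithm.

```python
def sffs(features, score):
    """Simplified sequential forward floating selection.

    features: list of feature names
    score: callable(list of names) -> comparable value, higher is better
    """
    selected, current = [], None
    while len(selected) < len(features):
        # Forward step: add the single feature that gives the best score.
        candidates = [f for f in features if f not in selected]
        best = max(candidates, key=lambda f: score(selected + [f]))
        new_score = score(selected + [best])
        if current is not None and new_score <= current:
            break                       # no performance improvement: stop
        selected.append(best)
        current = new_score
        # Floating backward step: only attempted when more than 2 features are held.
        while len(selected) > 2:
            drop = max(selected, key=lambda f: score([g for g in selected if g != f]))
            reduced = [g for g in selected if g != drop]
            if score(reduced) > current:
                selected, current = reduced, score(reduced)
            else:
                break
    return selected, current

def toy_score(subset):
    # Illustrative stand-in for cross-validated (specificity, accuracy).
    gain = {"a": 0.80, "b": 0.10, "c": 0.05, "d": -0.05}
    value = round(sum(gain[f] for f in subset), 10)
    return (value, value)

chosen, final_score = sffs(["a", "b", "c", "d"], toy_score)
```

The order of `chosen` reflects the order in which features entered the set, which is what allows their relative importance to be read from the wrapper history.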
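By way of a non-limiting illustration, the label permutation procedure may be sketched as follows. The sketch assumes that the p-value is the fraction of permuted-label scores that reach the observed score, with the common "+1" correction so that the estimate is never exactly zero; that correction, the function name `permutation_p_value`, the fixed seed and the toy evaluation function are assumptions, and the toy evaluation replaces the full cross-validation of the classifier.

```python
import random

def permutation_p_value(observed, labels, evaluate, n_resamples=5000, seed=0):
    """Rank the observed score against scores obtained with permuted labels."""
    rng = random.Random(seed)
    permuted = list(labels)
    reached = 0
    for _ in range(n_resamples):
        rng.shuffle(permuted)               # realizes the null hypothesis
        if evaluate(permuted) >= observed:
            reached += 1
    return (reached + 1) / (n_resamples + 1)

# Toy data: one feature that separates the two classes (illustrative values only).
values = [0.1] * 15 + [0.9] * 15
labels = [0] * 15 + [1] * 15
predictions = [1 if v > 0.5 else 0 for v in values]

def accuracy(candidate_labels):
    hits = sum(p == y for p, y in zip(predictions, candidate_labels))
    return hits / len(candidate_labels)

observed = accuracy(labels)
p_value = permutation_p_value(observed, labels, accuracy)
```

With 5000 resamples the smallest p-value that this estimate can report is 1/5001, which is consistent with reporting significance at p<0.001.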
Based on the observed local-maxima Lmax, on each ISS map of
As illustrated in
Table 1 shows the common synchronization sub-bands estimated from the average ISS maxima across ROIs, labeled A, B, C and D by decreasing frequency, with A corresponding to the task frequency.
Table 2 shows the performance achieved by classifiers CL trained with the features selected by filtering, i.e. the functional biomarkers (selected time-series patterns) with an F p-value<0.01.
When evaluating SS+CS, the wrapper selected the same features for LR and LDA in both cross-validation cases. Notably, the first-in feature was “AD-B-HbO-R”, which was also the first for CS. The second one was “TD-A-HbO-L”, the first for SS. Finally, “TD-D-HbR-M” from CS completed the set. Looking at the wrapper history, “AD-B-HbO-R” alone achieves about 80% in accuracy, sensitivity and specificity, which is not surprising since it has the highest F (19.6, p-value<0.0001). The addition of “TD-A-HbO-L” improves the scores up to 90% and with “TD-D-HbR-M” they approach 100%. Therefore, the most powerful feature comes from the CS-HbO data of the ADHD group, specifically from band B in which a prominent ISS peak may be seen (
Table 3 summarizes the results obtained by wrappers with SS+CS features. In all cases (except for SVM in sensitivity with 5-fold) permutation testing indicated significance at p<0.001. LR and LDA showed the highest scores in all metrics and also the narrowest CIs. Although highly significant, NB showed lower performance and wider CIs, whereas SVM performed the worst.
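[Table 3: wrapper results with SS+CS features for each classifier and cross-validation scheme. The table data was missing or illegible when filed; only fragments of p-values (“.001”) and of confidence-interval upper bounds (mostly “100”, one “73.3”) are recoverable.]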
According to an embodiment, when the full feature set (48 SS+48 CS) was used to feed the wrappers for LR and LDA, the same feature subset for both cross-validation schemes may be obtained, and as such, the same scores and statistics may be obtained. Therefore, feature pre-selection may provide good performance with reduced computational cost (i.e., lower computer resources such as memory and processing resources).
Surprisingly, SVM performed the worst in the SS+CS case, particularly with the 5-fold cross-validation where it achieved the lowest scores and also the widest CIs. In contrast, the other classifiers were not affected by the cross-validation, reaching similar scores and CIs with both schemes (Table 3). Most likely, SVM performance degraded due to lack of proper regularization, leading to insufficient penalty for misclassification.
As seen from
LOO and k-fold are the most commonly adopted cross-validation methods in ADHD studies, together with hold-out, which seems more appropriate for large datasets. Some researchers have suggested that LOO may be more useful in a diagnostic scenario, whereas others recommend k-fold or repeated random splits for more stable estimates. In either case, it is known that cross-validation is compromised by small sample sizes, particularly if there are many predictors, and then tends to overestimate predictive accuracy to a variable degree depending on the particularities of the study. In light of this, LOO and 5-fold were tested expecting differences in performance, with LOO showing more optimistic scores and larger confidence bounds. However, both LOO and 5-fold produced very similar results with LR and LDA (Table 3), suggesting that the three chosen features are good predictors that yield stable cross-validation measures regardless of method.
According to an embodiment illustrated in
Once a frequency series pattern P, a spectral pattern, is obtained per the frequency sub-band A for each of the first group 3a and the second group 3b, the method further includes selecting the one or more representative frequency series patterns P having similarity values d to the physiological signals 1 of each group that differentiate the groups of physiological signals 1 as functional biomarker F to identify a target condition ADHD, that may be used for training a classifier CL of the target condition ADHD. As previously explained, the similarity value d can be the Euclidean distance.
Similarly, as previously explained in
Once the classifier CL is trained, a computer implemented method for identifying a target condition ADHD in a subject can be executed, in a similar way as previously explained in
As previously explained, it is envisaged that the physiological signals 1 may be external cerebral signals from three regions of interest (ROI), namely right, medial and left, corresponding to different chromophores (HbO, HbR, HbT) and different signal types (shallow-signal, SS, and clean-signal, CS), from which a scalogram can be obtained as previously explained. Combinations of physiological signals 1 from the same user 2 can also be used; for example, a scalogram of the crossed spectrum between HbO and Hb of the same region of interest can be used, and the crossed spectrum between the same chromophore (HbO, HbR or HbT) of different regions of interest can be used.
According to an example embodiment, methods and operations illustrated above may be implemented in an electronic device or a computer. For instance, the electronic device may include a memory storing one or more instructions, and a processor configured to execute the one or more instructions to: obtain, based on a stimulation procedure, a first group of first physiological signals from one or more first subjects representing a target condition, and a second group of second physiological signals from one or more second subjects representing a control condition not presenting the target condition, identify at least one frequency sub-band for each of the first group and the second group, the at least one frequency sub-band preferably corresponding to one or more frequencies that show a higher group synchronization than other frequencies, obtain, for each of the plurality of frequency sub-bands, at least a corresponding pattern, preferably a time-series pattern, and select, as functional biomarkers to identify the target condition, the one or more patterns, preferably one or more time-series patterns or one or more frequency series patterns, having similarity values corresponding to the first physiological signals of the first group or the second physiological signals of the second group that differentiate the first physiological signals of the first group and the second physiological signals of the second group.
According to an example embodiment, there is provided a non-transitory computer readable medium having stored thereon a computer program including instructions that when executed cause a machine to perform a method including obtaining, based on a stimulation procedure, a first group of first physiological signals from one or more first subjects representing a target condition, and a second group of second physiological signals from one or more second subjects representing a control condition not presenting the target condition, identifying one or more frequency sub-bands for each of the first group and the second group, wherein a plurality of frequency sub-bands may correspond to frequencies that show a higher group synchronization than other frequencies, obtaining, for each frequency sub-band, at least a corresponding pattern, that may be a time-series pattern or a frequency series pattern, and selecting, as functional biomarkers to identify the target condition, one or more patterns having similarity values corresponding to the first physiological signals of the first group or the second physiological signals of the second group that differentiate the first physiological signals of the first group and the second physiological signals of the second group. According to an embodiment, the machine may be a computer or a hardware processor.
While the present disclosure has been described with reference to example embodiments thereof, it will be apparent to those of ordinary skill in the art that various changes and modifications may be made thereto without departing from the spirit and scope of the present disclosure as set forth in the following claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
22383301.3 | Dec 2022 | EP | regional |