The present invention relates to a method and a system for indicating the likelihood of a gastrointestinal condition, and more particularly, although not exclusively, to a method and system for indicating the likelihood of a functional gastrointestinal disorder such as irritable bowel syndrome, and/or the likelihood of a gastrointestinal organic disease, such as inflammatory bowel disease and/or differentiation between the two.
Functional gastrointestinal (GI) disorders such as irritable bowel syndrome (IBS), and GI organic diseases such as inflammatory bowel disease (IBD) including Crohn's disease and ulcerative colitis are debilitating GI conditions. They can also be common, IBS for example is estimated to affect around 11% of the world's population.
The current gold standard for IBS diagnosis is through the Rome IV symptom based diagnostic criteria. While offering positive diagnosis, these criteria do not have high reliability (low sensitivity). Physicians typically diagnose IBS through a process of exclusion, i.e., ruling out a number of organic diseases that share symptoms with IBS. Initial screening would usually include baseline blood tests and stool tests for exclusion of infections, coeliac disease and IBD. Typically, primary care physicians also refer patients for colonoscopy and biopsy, even though colonoscopy has been found to reveal a GI organic disease, such as IBD in only a small percentage of patients with IBS symptoms.
These invasive tests are a burden to health systems, contributing to lengthening waiting lists for gastroenterological review as well as adding to the financial costs associated with IBS. Colonoscopies are not only unpleasant for patients but carry significant risks. In addition to these risks, the burden on patients is multifaceted including physical discomfort, psychological distress, and financial costs due to time off-work. Further, since IBS is unrelated to any obvious structural or biochemical changes in the gut, these invasive procedures cannot provide a positive diagnosis for IBS. A diagnosis of exclusion often leaves patients confused and reluctant to engage in treatment. A cost-effective test that could provide a positive diagnosis for patients with a family or symptom history of IBS would be hugely beneficial in diagnosis and overall management of the condition.
In addition, for patients who have a GI organic disease, such as IBD or coeliac disease, a non-invasive test would be an extremely useful and cost-effective screening tool, prior to confirmation with biopsy.
There is a need for a new cost-effective, accurate and non-invasive diagnostic test for gastrointestinal conditions.
It would be advantageous if a non-invasive test could allow determining a likelihood of an individual having a GI condition versus having healthy bowels. It would further be advantageous if a single non-invasive test could allow (i) differentiating between healthy individuals and individuals suffering from a functional GI disorder such as IBS, (ii) differentiating between healthy individuals and individuals suffering from a GI organic disease such as IBD, and (iii) differentiating between individuals suffering from a functional GI disorder and individuals suffering from a GI organic disease. Thus, it would be advantageous if a single non-invasive test could allow indicating a likelihood of an individual having a functional GI disorder versus being healthy, a likelihood of an individual having a GI organic disease versus being healthy, and a likelihood of the individual having a functional GI disorder versus having a GI organic disease.
In broad terms, embodiments of the present invention seek to provide an indication of a likelihood that a patient may have a GI condition or may have healthy bowels based on the patient's bowel sounds. This may provide a cost-effective and non-invasive diagnostic test for a GI condition, including a functional GI disorder such as IBS and a GI organic disease such as IBD.
According to a first aspect of the invention, there is provided a system for indicating a likelihood of a gastrointestinal (GI) condition by analysing bowel sounds, the system comprising:
The at least one statistical distribution property may comprise skewness and/or kurtosis.
The system may be arranged to generate an index value based on the association of the at least one statistical distribution property with the corresponding reference parameter, and compare the index value to a threshold value, in order to determine the likelihood of the GI condition.
The system may be arranged to generate the index value using the formula,
where ‘f’ is the index value, ‘xi’ represents each one of the at least one features, ‘i’ is an integer from 1 to n, where n is the number of features, and ‘ci’ represents the reference parameter associated with the feature ‘xi’.
The reference parameter may be a weight value applicable to the associated statistical distribution property of the at least one identified feature.
In one embodiment, the GI condition is a functional GI disorder such as irritable bowel syndrome (IBS). The system may be arranged to determine a likelihood of IBS versus healthy bowels based on the association.
In another embodiment, the GI condition is a GI organic disease such as inflammatory bowel disease (IBD). The system may be arranged to determine a likelihood of IBD versus healthy bowels based on the association.
In a further embodiment, the GI condition includes a functional GI disorder and a GI organic disease, wherein the at least one statistical distribution property is capable of at least assisting in providing an indication of the existence or non-existence of the functional GI disorder and the GI organic disease. The system may be arranged to determine a likelihood of IBS versus IBD based on the association of the at least one statistical distribution property with a corresponding reference parameter.
The system may also be arranged to simultaneously determine at least one statistical distribution property capable of at least assisting in providing an indication of the existence or non-existence of IBS, and at least one statistical distribution property capable of at least assisting in providing an indication of the existence or non-existence of IBD, whereby the system is arranged to simultaneously determine a likelihood of IBS versus healthy bowels and a likelihood of IBD versus healthy bowels based on respective associations of the at least one statistical distribution property with corresponding reference parameters.
The system may further be arranged to determine a likelihood of IBS versus IBD when the respective association of the at least one statistical distribution property with the corresponding reference parameter indicates that IBS is more likely than healthy bowels.
Alternatively, or additionally, the system may also be arranged to determine a likelihood of IBS versus IBD when the respective association of the at least one statistical distribution property with the corresponding reference parameter indicates that IBD is more likely than healthy bowels.
The at least one feature may comprise, or be based on, one or more of the following: amplitude; burst amount; burst ratio; contraction interval time; higher order zero crossing; band energy ratio; spectral bandwidth double frequency; flatness; spectral centroid; energy; dynamic range; mel width; envelope crest factor; and roll off.
In one embodiment, the system is arranged to identify a plurality of different features from each of the plurality of bowel sounds signals and determine the likelihood of the GI condition based on a combination of the different features.
In one embodiment, the system is arranged to determine the likelihood of IBS versus healthy bowels based on a first combination of the different features comprising at least one feature based on: burst; spectral bandwidth double frequency; contraction interval time; or higher order zero crossing.
In another embodiment, the system is arranged to determine the likelihood of IBD versus healthy bowels based on a second combination of the different features comprising at least one feature based on: flatness 3000; or spectral centroid.
The system may also be arranged to determine the likelihood of IBS vs IBD based on a third combination of the different features comprising at least one feature based on: envelope crest factor; or roll off.
The system may be arranged to determine a plurality of different statistical distribution properties of the collection of values for the at least one feature and determine the likelihood of the GI condition based on a combination of the different statistical distribution properties.
The sound detector may comprise at least two acoustic sensors locatable in proximity to an abdominal region of a subject and spaced-apart from each other for detecting bowel sounds from the abdominal region.
The system may be further arranged such that for each bowel sound signal identified by the system, the system identifies one of the at least two acoustic sensors to be associated with the bowel sound signal based on which sensor produced a highest amplitude reading corresponding to the bowel sound signal.
In order to identify individual bowel sound signals, the signal processor may be arranged to divide the corresponding signal into a plurality of segments and, for each segment, determine whether there is a signal portion within any one of the following ranges: 200 Hz to 800 Hz; 600 Hz to 1000 Hz; 800 Hz to 1200 Hz; 1000 Hz to 1600 Hz; and 1600 Hz to 2000 Hz.
According to a second aspect of the invention, there is provided a method of indicating a likelihood of a GI condition by analysing bowel sounds, the method comprising:
The at least one statistical distribution property may comprise skewness and/or kurtosis.
The method may comprise generating an index value based on the association of the at least one statistical distribution property with the corresponding reference parameter, and comparing the index value to a threshold value, in order to determine the likelihood of the GI condition.
The method may comprise generating the index value using the formula,
where ‘f’ is the index value, ‘xi’ represents each one of the at least one features, ‘i’ is an integer from 1 to n, where n is the number features, and ‘ci’ represents the reference parameter associated the feature ‘xi’.
The reference parameter may be a weight value applicable to the associated statistical distribution property of the at least one identified feature.
In one embodiment, the GI condition is a functional GI disorder such as irritable bowel syndrome (IBS). The method may comprise determining a likelihood of IBS versus healthy bowels based on the association.
In another embodiment, the GI condition is a GI organic disease such as inflammatory bowel disease (IBD). The method may comprise determining a likelihood of IBD versus healthy bowels based on the association.
In a further embodiment, the GI condition includes IBS and IBD, wherein the at least one statistical distribution property is capable of at least assisting in providing an indication of the existence or non-existence of IBS and IBD.
The method may comprise determining a likelihood of IBS versus IBD based on the association of the at least one statistical distribution property with a corresponding reference parameter.
In one embodiment, the method further comprises simultaneously determining at least one statistical distribution property capable of at least assisting in providing an indication of the existence or non-existence of IBS, and at least one statistical distribution property capable of at least assisting in providing an indication of the existence or non-existence of IBD, whereby the method comprises simultaneously determining a likelihood of IBS versus healthy bowels and a likelihood of IBD versus healthy bowels based on respective associations of the at least one statistical distribution property with corresponding reference parameters.
In one embodiment, the method further comprises determining a likelihood of IBS versus IBD when the respective association of the at least one statistical distribution property with the corresponding reference parameter indicates that IBS is more likely than healthy bowels.
Alternatively, or additionally, the method may further comprise determining a likelihood of IBS versus IBD when the respective association of the at least one statistical distribution property with the corresponding reference parameter indicates that IBD is more likely than healthy bowels.
The at least one feature may comprise, or be based on, one or more of the following: amplitude; burst amount; burst ratio; contraction interval time; higher order zero crossing; band energy ratio; spectral bandwidth; spectral bandwidth double frequency; flatness, spectral centroid; frequency centroid; energy; dynamic range; mel width; envelope crest factor; and roll off.
The method may comprise obtaining the signal representative of the sound including the plurality of bowel sounds using a sound detector. The sound detector may comprise at least two acoustic sensors locatable in proximity to an abdominal region of a subject and spaced-apart from each other for detecting bowel sounds from the abdominal region.
The method may comprise identifying for each bowel sound signal one of the at least two acoustic sensors to be associated with the bowel sound signal based on which sensor produced a highest amplitude reading corresponding to the bowel sound signal.
In order to identify individual bowel sound signals, the method may comprise dividing the signal representative of the bowel sounds into a plurality of segments and, for each segment, determine whether there is a signal portion within any one of the following ranges: 200 Hz to 800 Hz; 600 Hz to 1000 Hz; 800 Hz to 1200 Hz; 1000 Hz to 1600 Hz; and 1600 Hz to 2000 Hz.
The method may comprise identifying a plurality of different features from each of the plurality of bowel sounds signals and determining the likelihood of the GI condition based on a combination of the different features.
In one embodiment, the method comprises determining the likelihood of IBS versus healthy bowels based on a first combination of the different features comprising at least one feature based on: burst; spectral bandwidth double frequency; contraction interval time; or higher order zero crossing.
In another embodiment, the method comprises determining the likelihood of IBD versus healthy bowels based on a second combination of the different features comprising at least one feature based on: flatness; or spectral centroid.
The method may also comprise determining the likelihood of IBS vs IBD based on a third combination of the different features comprising at least one feature based on: envelope crest factor; or roll off.
The method may comprise determining a plurality of different statistical distribution properties of the collection of values for the at least one feature and determine the likelihood of the GI condition based on a combination of the different statistical distribution properties.
According to a third aspect of the invention, there is provided computer readable medium for storing instructions that, when executed by a computing device, causes the computer to perform the method according to the second aspect.
According to a fourth aspect of the invention, there is provided a system for diagnosing a GI condition by analysing bowel sounds, the system comprising:
According to a fifth aspect of the invention, there is provided a method of diagnosing a GI condition by analysing bowel sounds, the method comprising:
Notwithstanding any other forms which may fall within the scope of the disclosure as set forth in the Summary, specific embodiments will now be described, by way of example only, with reference to the accompanying drawings in which:
Embodiments of the present invention relate to a method and a system that allow providing a single non-invasive and cost-effective test for indicating a likelihood that a patient may have a gastrointestinal (GI) condition or may have healthy bowels based on the patient's bowel sounds. The GI condition includes a functional GI disorder such as irritable bowel syndrome (IBS), and a GI organic disease such as inflammatory bowel disease (IBD). IBD includes Crohn's disease and ulcerative colitis. It will be understood that embodiments of the invention however may include the determination of a likelihood of functional GI disorder conditions other than IBS, such as cyclic vomiting syndrome functional constipation or functional diarrhea, and may also include the determination of a likelihood of other GI organic diseases, such as coeliac disease, neoplasm, infectious enteritis, obstruction or cancer.
For the diagnosis of IBS, the physician may choose to use the method and system in accordance with embodiments of the present invention after ruling out other diseases, such as IBD, through screening tests or colonoscopy and biopsy. A positive determination or diagnosis of a likelihood of IBS or healthy bowels using the method and system in accordance with embodiments of the present invention would for example allow providing a patient with additional confirmation of a positive IBS diagnosis and that IBD can be ruled out.
The single test using the method and system in accordance with embodiments of the present invention could further allow differentiating, for example, between three groups of patients, namely patients with IBS, patients with IBD and healthy individuals. A physician may then choose to order other tests, such as a colonoscopy with biopsy, to confirm any diagnosis of organic diseases, such as IBD.
Referring to
The system comprises a sound detector 12 for detecting bowel sounds and generating a corresponding signal representative of the bowel sounds. The sound detector 12 can for example be a microphone or piezoelectric sensor. The system 10 also comprises a signal processor arranged to identify a plurality of bowel sound signals within the corresponding signal, wherein each bowel sound signal is representative of an individual bowel sound. In this example, the signal processor comprises a bowel sound identifier 14 for identifying the individual bowel sounds.
The system 10 is arranged to then identify at least one feature from each of the plurality of bowel sound signals so as to produce a collection of values for the same at least one feature. In this example, the signal processor also comprises a feature extractor 16 arranged to extract or identify the at least one feature. The feature(s) may for example be amplitude and/or duration of the bowel sound signals. Preferably, multiple different features are identified from the bowel sound signals, and for each feature a collection of values are obtained. Features that are preferable and/or advantageous will be described in more detail below.
Since a plurality of values for each feature is collected from the plurality of bowel sound signals, a statistical distribution of any one feature can be obtained. The system 10 is then arranged to determine at least one statistical distribution property of the at least one feature, the at least one statistical distribution property being capable of at least assisting in providing an indication of the existence or non-existence of the GI condition. The statistical distribution feature may for example be skewness or kurtosis.
The system 10 is further arranged to associate the at least one statistical distribution property with a reference parameter of a corresponding feature derived using reference data. The system 10 can then determine the likelihood of the GI condition based on the association.
In the example, the system 10 comprises storage for storing the corresponding reference parameters and a GI condition determiner 18 for determining the likelihood based on the association.
In accordance with a first specific embodiment of the invention, the system 10 is configured to determine, based on the association, the likelihood that the subject producing the bowel sounds has IBS as compared to having healthy bowels and the GI condition determiner 18 is an IBS determiner. The system 10 is thus arranged to determine at least one statistical distribution property of the at least one feature, the at least one statistical distribution property being capable of at least assisting in providing an indication of the existence or non-existence of IBS.
In accordance with a second embodiment of the invention, the system 10 is configured to determine, based on the association, the likelihood that the subject producing the bowel sounds has IBD as compared to having healthy bowels and the GI condition determiner 18 is an IBD determiner. The system 10 is thus arranged to determine at least one statistical distribution property of the at least one feature, the at least one statistical distribution property being capable of at least assisting in providing an indication of the existence or non-existence of IBD.
It is noted that patients having Crohn's disease and ulcerative colitis have both been grouped as IBD patients for the purpose of the present invention.
In accordance with a third embodiment of the invention, the system 10 is configured to determine, based on the association, the likelihood that the subject producing the bowel sounds has IBS as compared to having IBD and the GI condition determiner 18 is an IBS/IBD determiner. The system 10 is thus arranged to determine at least one statistical distribution property of the at least one feature, the at least one statistical distribution property being capable of at least assisting in providing an indication of the existence or non-existence of IBS and IBD.
Thus, the system 10 could allow differentiating between healthy individuals and individuals suffering from a functional GI disorder such as IBS, differentiating between healthy individuals and individuals suffering from a GI organic disease such as IBD, and differentiating between individuals suffering from a functional GI disorder such as IBS and individuals suffering from a GI organic disease such as IBD.
In a further embodiment of the invention, the GI condition determiner 18 comprises each of an IBS determiner, an IBD determiner and an IBS/IBD determiner, and the system 10 is configured to determine simultaneously a likelihood of IBS versus healthy bowels and a likelihood of IBD versus healthy bowels based on respective associations of the at least one statistical distribution property with corresponding reference parameters. The system 10 is then further configured to determine a likelihood of IBS versus IBD when the respective association of the at least one statistical distribution property with the corresponding reference parameter indicates that IBS is more likely than healthy bowels. The system 10 is also arranged to determine a likelihood of IBS versus IBD when the respective association of the at least one statistical distribution property with the corresponding reference parameter indicates that IBD is more likely than healthy bowels.
Further, it will be understood that the GI condition determiner 18 may alternatively comprise either one or more of the IBS determiner, IBD determiner and IBS/IBD determiner, and/or other determiners associated with GI conditions other than IBS and IBD.
Components of the system 10 will now be described in more detail.
According to a specific example, the sound detector 12 comprises an array of vibration sensors attachable to or held in place by a belt. With reference to
Each vibration sensor V1 to V4 in this example comprises a piezoelectric sensor component and a transducer for converting detected sounds into electrical signals. The sensors V1 to V4 are connected to a recording device for recording the signals, which may also form part of the sound detector 12 component of the system 10. Each vibration sensor could further incorporate double transducers to allow for active noise cancellation if used in a noisy environment. In this example, bowel sounds were recorded in a relatively quiet environment and detected using four single piezoelectric sensors and respective transducers. Piezoelectric sensors are predominantly contact microphones and relatively insensitive to background noise. The recording device may also comprise an analogue-to-digital converter for the purpose of digital signal processing. In this example, hand-held recorder 36 is used, as shown in
Preferably, the corresponding signal acquired by the sound detector 12 corresponds to an approximately or over 2-hour recording of bowel sounds from the abdominal region of the subject. In particular, it is proposed that a 2-hour recording of bowel sounds after the subject has fasted for approximately 12 hours, and a further recording of approximately 40 minutes after the subject has had a simple meal (e.g. toast, butter, water, or meal drink such as Sustagen®), may be particularly useful for determining IBS or for determining IBD. Accordingly, the acquired signal may have a signal portion corresponding to a “fasting condition” and another portion corresponding to “food condition”.
Once the corresponding signal is acquired, the sound detector 12 transmits the signal to the bowel sound identifier 14. The signal transmission may be via wireless or wired data transmission means. The bowel sound identifier 14 then processes the signal to automatically identify individual bowel sound signals. With reference to
The segmentation module 20 divides the corresponding signal into signal segments, XBS_1, XBS_2, . . . , XBS_N, where XBS is bowel sound time series data. The segments may for example be 20-40 ms in length, such as approximately 30 ms in length. In this specific example, the segmentation module 20 utilises a window function, which may have a window size of approximately 30 ms, with 20 ms overlap between adjacent windows. As bowel sounds are usually short bursts, where the energy versus time distribution is extremely uneven, a rectangular window function was selected.
The signal modifier 22 then applies a Fourier transformation 26 to the signal segments to obtain frequency spectrum data, SBS_1, SBS_2, . . . , SBS_N, as follows:
S
BS
=FFT(XBS) (Eq. 1)
The frequency response (SN) of the sound detector 12 is also evaluated for the purpose of removing background noise from the spectrum, as follows:
S
N
=FFT(XNoise) (Eq. 2)
The modifier 22 then performs noise reduction 28 to obtain a series of modified spectrum, SMBS_1, SMBS_2, . . . , SMBS_N, corresponding to each signal segment, as follows:
S
MBS
=S
BS
/S
Noise (Eq. 3)
The series of modified spectrum data SMBS_1, SMBS_2, . . . , SMBS_N, is then inputted into a frequency band detector 24 of the bowel sound identifier 14 in order to identify or designate individual bowel sound signals. Alternatively or additionally, active noise cancellation could be performed prior to identification of bowel sounds.
In this regard, it was recognised that the main frequency component of all of the bowel sounds was between 200 and 2000 Hz with a relatively narrow bandwidth. In contrast, other contaminating noise factors may be present in the spectrum data, such as friction against the sensors, lung sounds, and the heart beating, etc. However, it was determined that the frequency spectra of these noises did not overlap with the bowel sounds. Thus, in order to automatically identify bowel sounds, a plurality of specific frequency band subsets are selected. In this example, the following five frequency bands were used:
The term band energy ratio (BER) is used herein to mean the ratio of energy that a particular signal or signal portion has within a specific frequency band over the full range of frequencies present in the recording. For each signal segment, a frequency band detector 24 calculates the BER that a signal segment has within specific frequency bands. If the detector 24 identifies that a signal segment has a BER higher than a threshold, such as 90%, within one of the frequency bands, the signal segment is recognised as a bowel sound section. Further, if the detector 24 recognises no other bowel sound section within a range of 100 ms either side of the recognised bowel sound section, the detector 24 defines the bowel sound section as an individual bowel sound signal. Alternatively, if more than one bowel sound section was recognised within the time frame of the signal segment, the detector 24 groups the bowel sound sections and defines the grouping as a single bowel sound signal with multiple components.
In an over 2-hour long recording of bowel sounds, it is estimated that there may be hundreds of thousands of individual bowel sound signals identified.
The system 10 then inputs the identified bowel sound signals into the feature extractor 16.
In this example, the system 10 comprises a feature extractor 16 arranged to identify a plurality of different features from each of the plurality of bowel sounds signals. The different features may for example be features based on burst (such as burst amount or burst ratio), contraction interval time, spectral bandwidth double frequency, band energy ratio, higher order zero crossing, flatness, spectral centroid, energy, dynamic range, mel width, envelope crest factor, or roll off, which will be described in more detail below. In this example, based on a combination of the different features, the GI condition determiner 18 is arranged to determine the likelihood of the GI condition.
In the first embodiment, the IBS determiner of the GI condition determiner 18 is arranged to determine a likelihood of IBS versus healthy bowels based on a first combination of the different features comprising at least one feature based on burst, spectral bandwidth double frequency, contraction interval time, or higher order zero crossing.
In the second embodiment, the IBD determiner of the GI condition determiner 18 is arranged to determine a likelihood of IBD versus healthy bowels based on a second combination of the different features comprising at least one feature based on flatness or spectral centroid.
In a third embodiment, the IBS/IBD determiner of the GI condition determiner 18 is arranged to determine a likelihood of IBS versus IBD based on a third combination of the different features comprising at least one feature based on envelope crest factor or roll off.
Before describing the feature extractor 16 in detail, a specific process that was conducted for selecting preferable features to extract from the bowel sound signals will be described. However, it will be understood that embodiments of the invention are not limited thereto, and variations on the process described herein may be utilised to selected features.
Using the sound detector 12 described above, an experiment was conducted whereby 2-hour long bowel sound recordings were obtained from participants after fasting, and a further 40 minutes recording was obtained after the participants had a standard meal. Hence, 160 minute recordings were obtained from each participant using the four sensors V1 to V4. The positioning of the sensors as shown in
The recordings were sampled at a sampling rate of 44.1 kHz, equating to approximately 1.6 billion samples. Signal processing was performed in order to reduce the number of samples and extract features from the samples. In particular, the bowel sound identification process described above as being performed by the bowel sound identifier 14 was conducted, resulting in a collection of individual bowel sound signals and respective frequency spectrum data for each participant.
Initially, several time domain features and frequency domain features were identified for each individual bowel sound signal. The time domain features include the following.
BR=Duration/Burst (Eq. 4)
DR=20*log10(max(XBS)−min(XBS))) (Eq. 6)
A=max(XBS) (Eq. 7)
En=20*log10(mean(XBS2)) (Eq. 8)
The mean of all HOCn values identified from the plurality of bowel sound signals for a participant can be determined by the Equation 7, as follows.
The frequency domain features include the following.
MelMax=max(SMel) (Eq. 15)
MelSum=ΣSMel (Eq. 16)
MelWidth=(S_Mel>max(S_Mel)) (Eq. 17)
The features above were extracted from each individual bowel sound signal or respective spectrum to construct a bowel sound library.
Additionally, to characterise the bowel activity further, the individual bowel sounds signals were assigned to one of the four sensors V1 to V4 positioned at different abdominal quadrants P1 to P4. The assigning of individual bowel sound signals to a particular abdominal quadrant was done according to the amplitude of the bowel sound signal and on the assumption that the sensitivity of each the sensors were identical. In particular, bowel sound signals that were detected by multiple sensors V1 to V4 were assigned to the quadrant/sensor that was most strongly associated with the bowel sound signal. In this example, each bowel sound signal was associated with the sensor V1 to V4 that produced the highest amplitude reading for that bowel sound. For instance, if the same bowel sound was detected by two sensors, the sensor that registered the corresponding bowel sound signal with the highest amplitude reading would be selected. Accordingly, each bowel sound signal would only be associated with one sensor/quadrant. Additionally, a minimum threshold of 60% of maximum energy was applied. Therefore, if for example a bowel sound originated from a relatively central region, it would only be assigned to a quadrant if that quadrant obtained a reading of the bowel sound that exceeded the threshold.
Since hundreds of thousands of bowel sound signals were identified from each participant, a corresponding amount of values for each feature can be extracted, and the statistical distribution of each feature can be statistically analysed. It was found that the statistical distribution of the features was different in participants with IBS compared to healthy participants, and also that the statistical distribution of the features was different in participants with IBD compared to healthy participants. Further, the statistical distribution of the features was different in participants with IBD compared to participants with IBS. In all three cases, for some features the difference was to a greater degree than others. This was evident by examining the skewness and kurtosis of the statistical distribution of the features. In other words, the skewness and kurtosis of the distribution of features contributed significantly to classification of participants. A reason for this could be that there is greater variability in the distribution of sounds from IBS participants given their altered motility pattern, and from IBD participants given the underlying motility and structural changes
As a result, a collection of “hybrid” features was obtained for the bowel sound signals, the hybrid features having several components including: (a) feature, e.g. amplitude, burst; (b) statistical distribution feature, e.g. skew, kurtosis; (c) assigned sensor; and (d) a condition, e.g. fasting or food.
Logistic regression analysis was then used to identify the optimal array or subset of all the hybrid features (taking into account associated quadrants P1 to P4) that was most strongly associated with participants having a GI condition, i.e. in the present examples IBS or IBD, and healthy participants. The logistic regression analysis firstly uses a linear regression model and then a sigmoid function to predict the probability of a sample being positive. No assumption about the data distribution are made when using logistic regression, but the correlation coefficients among each of the features should be smaller than 0.7 to obtain a stable and reasonable result. The particular linear regression model and sigmoid functions used are provided in Equations 20 and 21, respectively.
In Equations 20 and 21 above, ‘xi’ represents one of the features, where ‘i’ is an integer from 1 to n, with n being the total number of features, and ‘c’ is a weighting coefficient associated with each one of the features ‘xi’.
In the first embodiment wherein, the system 10 is arranged to determine a likelihood of IBS versus healthy bowels, the weighting coefficients were then adjusted using a cost function so that f=1 if a bowel sound belongs to a participant having IBS, and f=0 to indicate that the participant does not have IBS. Use of the cost function allows for determination of respective weighting coefficients for the various features ‘xi’, which would minimise the “cost” that a particular coefficient would have on the result, thus optimising the result, i.e. towards f=1 for a bowel sound associated with IBS and towards f=0 for a bowel sound that is not indicative of IBS. The weighting coefficients were first assigned a random number, and then adjusted to conform to the participants true condition, whether IBS or healthy. This was repeated multiple times until the accuracy of the coefficients stopped improving.
In the second embodiment, the same iterative process and background logistic regression model, including the linear regression model (Eq. 20), sigmoid function (Eq. 21) and assumptions, were used to identify the optimal array or subset of all the hybrid features (taking into account associated quadrants P1 to P4) that was most strongly associated with IBD participants and healthy participants. In this case, the weighting coefficients were then adjusted using a cost function so that f=1 if a bowel sound belongs to a participant with IBD, and f=0 to indicate that the participant does not have IBD. The weighting coefficients were first assigned a random number, and then adjusted to conform to the participants true condition, whether IBD or healthy. This was repeated multiple times until the accuracy of the coefficients stopped improving.
The same method was also used in the third embodiment for differentiation between IBD and IBS individuals. The same iterative process and background logistic regression model, including the linear regression model (Eq. 20), sigmoid function (Eq. 21) and assumptions, were used to identify the optimal array or subset of all the hybrid features (taking into account associated quadrants P1 to P4) that was most strongly associated with IBD participants and IBS participants. In this case the weighting coefficients were then adjusted using a cost function so that f=1 if a bowel sound belongs to a participant with IBD, and f=0 to indicate that the participant does not have IBS.
Along with the logistic regression, regularisation was used for preventing over-fitting. There are two common regularisation methods: L1 and L2. The latter was selected for its potential to increase the generalisation of the model by reducing the absolute value of the weights and thus prevent perfect fitting.
In order to corroborate the accuracy of the model, cross-validation was also performed. In particular, the leave-one-out cross validation (LOOCV) method was used for tuning the parameters and selecting features. The LOOCV procedure involves removing one sample and training the model using the rest of the samples, before calculating the error on the sample which was removed. Alternatively or additionally, bootstrapping is another method for cross-validating the selected features and model.
As there are thousands of hybrid features that can be used in the model, all skewness-related features were initially included as a starting point in the process of determining an optimal subset of features. Each feature was then removed one by one and the LOOCV performance of the remaining features was analysed. The features subset with highest LOOCV accuracy was retained before the remaining features was removed one by one. This process was repeated until accuracy plateaued. Subsequently, additional features from the whole features set were added to the model one by one until the maximum accuracy was achieved.
In addition, in the third embodiment in particular wherein the system 10 is arranged for determining a likelihood of IBS versus IBD, the logistic regression analysis can be affected by a problem of imbalanced data set, i.e. a difference between the number of sample participants having IBS and the number of sample participants having IBD, which may create a bias towards IBD. Indeed, the sample size of IBD is typically much greater than the sample size of IBS. In order to correct for this bias and corroborate the accuracy of the model for differentiating between IBS and IBD in particular, a method of oversampling was used to increase the sample size of IBS and healthy recordings to match the number of IBD recordings. The oversampling method involved generating new samples using the following equation:
x
(new)
=x
i+μ*(x{zi}−xi) (Eq. 22)
where μ is a random number in the range [0,1], and where this interpolation creates a sample on the line between xi and x{zi}, x{zi} being nearest-neighbours to xi.
In the first embodiment of differentiation between IBS and healthy bowels, a total of 26 optimal or “ultimate” features were identified from among the hybrid features to form part of the optimum model. These features are provided in Table 1 below together with an example of respective weighting coefficients for those features. As shown below,
In the embodiment of differentiation between IBD and healthy bowels, a total of 44 optimal or “ultimate” features were identified from among the hybrid features to form part of the optimum model. These features are provided in Table 2 below together with an example of respective weighting coefficients for those features. As shown below,
In the embodiment of differentiation between IBD and IBS, a total of 26 optimal or “ultimate” features were identified from among the hybrid features to form part of the optimum model. These features are provided in Table 3 below together with an example of respective weighting coefficients for those features. As shown below,
It will be appreciated that the features and respective weighting coefficients above are examples only, and in other embodiments different features and values for weighting coefficients may be used. As mentioned above, in the listing of ultimate features, four components are represented in each feature, where a first component corresponds to a “feature”, a second to a“statistical measure”, a third to a“sensor” and a fourth to a “condition”. Each component is separated by an underscore and is selected among the following listed in Table 4 below.
Although a specific experiment has been described above to obtain the respective 26 or 44 ultimate features to be used in the system 10, those skilled in the art will understand that other methods of obtaining desirable features, and other combinations of features, may be selected according to other embodiments.
Continuing with the embodiment shown in
For each bowel sound signal, the feature extractor 16 identifies selected features from each of the plurality of bowel sound signals so as to produce a collection of values for each of the selected features. In the present examples, the selected features are the 26 ultimate features listed above in Table 1 when the system 10 is arranged for determining a likelihood of IBS versus healthy bowels, the 44 ultimate features identified in Table 2 when the system 10 is arranged for determining a likelihood of IBD versus healthy bowels, or the 26 features identified in Table 3 when the system 10 is arranged for determining a likelihood of IBS versus IBD. The feature extractor 16 then determines at least one statistical distribution property of the collection of values.
The feature extractor 16 comprises a feature identifier 30, a signal localiser 32 and a statistical measure identifier 34.
In the example of differentiation between IBS and healthy individuals, the feature identifier 30 is configured to extract the features listed in Table 1 above (column 1) from the bowel sound signals received from the bowel sound identifier 14. For example, the feature identifier 30 may extract the CIT feature from each bowel sound signal by utilising Equation 5 above.
In the example of differentiation between IBD and healthy individuals, the feature identifier 30 is configured to extract the features listed in Table 2 above (column 1) from the bowel sound signals received from the bowel sound identifier 14. For example, the feature identifier 30 may extract the flatness 3000 feature from each bowel sound signal by utilising Equation 19 above.
In the example of differentiation between IBS and IBD individuals, the feature identifier 30 is configured to extract the features listed in Table 3 above (column 1) from the bowel sound signals received from the bowel sound identifier 14. For example, the feature identifier 30 may extract from each bowel sound signal the envelope crest factor feature and/or the roll off feature by utilising Equation 18 above.
Since a plurality of bowel sound signals are identified for each subject, the feature identifier 30 will then output a collection or series of values for each feature. As an example only, for each bowel sound recording the following collection of features may be obtained for amplitude and burst:
The signal localiser 32 is configured to then assign each bowel sound signal to one of the sensors V1 to V4. As described above in relation to the “Feature Selection”, the assigning of bowel sound signals was done by assigning the signal to the sensor V1 to V4 that detected the highest amplitude, while applying a minimum threshold of 60% of the maximum energy. As an example only, signal localiser 32 may obtain the following:
The statistical measure identifier 34 is arranged to then determine a plurality of different statistical distribution properties of the collection of values for the features. In particular, with reference to Tables 1, 2 and 3 above, the statistical distribution properties calculated include kurtosis and skewness of the collection of values for specific features and specific sensors. For example, with reference to the 26 features in Table 1, the identifier 34 would calculate values for the kurtosis of the collection of amplitude values of signals assigned to V3 (feature no. 1), and the skew of the collection of burst values of signals assigned to V3 (feature no. 4). Additionally, the statistical measure identifier 34 also calculates the median of the sum of Mel-frequencies of signals assigned to V2 (feature no. 15).
The statistical measure identifier 34 in this example uses the following equations to identify skewness and kurtosis:
In Equations 23 and 24 above, the variable “F” is a value of the feature being examined such that the sum of all the values of the feature is evaluated in the equations above, and the variable “NBS” represents the number of bowel sounds. Values for the 26 selected features in Table 1 above are thus obtained from the recorded bowel sound.
Similarly, in the example with reference to the 44 features in Table 2, the identifier 34 would calculate values for the kurtosis of the collection of flatness 3000 values of signals assigned to V2 (features no. 13 and 15), and the skew of the collection of spectral centroid values of signals assigned to V1 (features no. 11 and 12). The statistical measure identifier 34 in this example uses the equations 23 and 24 to identify skew and kurtosis and values for the 44 selected features in Table 2 above are thus obtained from the recorded bowel sound.
In the third example to differentiate between IBD and IBS individuals, the same method could be employed. In the example with reference to the 26 features in Table 3, the identifier 34 would calculate values for the kurtosis of the collection of envelope crest factor values of signals assigned to V2 (feature no. 7) and the kurtosis of the collection of roll off values of signals assigned to V4 (feature no. 19). The statistical measure identifier 34 in this example uses the equations 23 and 24 to identify skew and kurtosis and values for the 26 selected features in Table 3 above are thus obtained from the recorded bowel sound.
In this example, the system 10 comprises the GI condition determiner 18 for determining the likelihood that the subject from which the bowel sounds are obtained has the respective GI condition versus having healthy bowels, and preferably outputs an index value indicative of that likelihood. The determiner 18 communicates with reference storage 38 and the feature extractor 16. The reference storage 38 stores reference parameters associated with each of the ultimate features. The reference parameters may for example be a coefficient, constant value, variable or property.
In the example when determining a likelihood of IBS versus healthy bowels, the reference parameters are the weighting coefficients listed in Table 1 above, which were derived from the process of selecting the optimum hybrid features. The IBS determiner 18 then applies Equation 21 to the values of the 26 ultimate features. Equation 21 is copied below for convenience:
In doing so, the IBS determiner 18 associates each feature obtained from the feature extractor 16 with the weighting coefficient associated with that feature (see Table 1) using Equation 20 (copied below for convenience), where ‘xi’ represents one of the features, ‘i’ is an integer from 1 to 26, and ‘ci’ is a weighting coefficient associated with each one of the features ‘xi’:
The IBS determiner 18 also comprises threshold storage 40 for storing a threshold against which the IBS determiner compares the calculated value of ‘f’. In this example, the threshold storage 40 stores a threshold of 0.5, such that if the IBS determiner 18 determines that f>0.5 the subject is likely to have IBS, and conversely if the IBS determiner 18 determines that f<0.5 the subject is not likely to have IBS. It will be appreciated that the higher the value of ‘f’ the more likely the subject has IBS, and the lower the value of ‘f’ the less likely the subject has IBS. The IBS determiner 18 thus generates an index value that indicates the likelihood of IBS.
Similarly, in the example when determining a likelihood of IBD versus healthy bowels, the reference parameters are the weighting coefficients listed in Table 2 above, which were derived from the process of selecting the optimum hybrid features. The IBD determiner 18 applies Equation 21 to the values of the 44 ultimate features and in doing so, the IBD determiner 18 associates each feature obtained from the feature extractor 16 with the weighting coefficient associated with that feature (see Table 2) using Equation 20, where ‘i’ is an integer from 1 to 44. The IBD determiner 18 also comprises threshold storage 40 for storing a threshold against which the IBD determiner compares the calculated value of ‘f’. In this example, similarly to the IBS example, the threshold storage 40 stores a threshold of 0.5, such that if the IBD determiner 18 determines that f>0.5 the subject is likely to have IBD, and conversely if the IBD determiner 18 determines that f<0.5 the subject is not likely to have IBD. It will be appreciated that the higher the value of ‘f’ the more likely the subject has IBD, and the lower the value of ‘f’ the less likely the subject has IBD. The IBD determiner 18 thus generates an index value that indicates the likelihood of IBD.
In a third example, the GI condition determiner 18 can also be used to determine the likelihood that the subject from which the bowel sounds are obtained has IBD rather than IBS. The IBS/IBD determiner 18 then outputs an index value indicative of that likelihood of IBs versus IBD. The reference parameters are the weighting coefficients listed in Table 3 above, which were derived from the process of selecting the optimum hybrid features. The IBS/IBD determiner applies Equation 21 to the values of the 26 ultimate features and in doing so, the IBS/IBD determiner 18 associates each feature obtained from the feature extractor 16 with the weighting coefficient associated with that feature (see Table 3) using Equation 20, where ‘i’ is an integer from 1 to 26. The IBS/IBD determiner 18 also comprises threshold storage 40 for storing a threshold against which the IBS/IBD determiner compares the calculated value of ‘f’. In this example, similarly to the IBS versus healthy and IBD versus healthy examples, the threshold storage 40 stores a threshold of 0.5, and if the IBS/IBD determiner 18 determines that f>0.5 the subject is more likely to have IBD and less likely to have IBS, and conversely if the IBS/IBD determiner 18 determines that f<0.5 the subject is more likely to have IBS and less likely to have IBD. It will be appreciated that the higher the value of ‘f’ the more likely the subject has IBD, and the lower the value of ‘f’ the less likely the subject has IBD and the more likely the subject has IBS. The IBD determiner 18 thus generates an index value that indicates the likelihood of IBS versus IBD.
A physician may choose to reach a diagnostic decision for a GI condition such as IBS for example based on the prediction derived from a single determiner, i.e. the IBS determiner, and rule out other organic diseases by concurrently carrying out stool, blood or biopsy tests.
Alternatively, it would be advantageous if a physician could in practice use a single test to indicate a likelihood of a patient having either IBS, or IBD or having healthy bowels, and to differentiate between IBS and IBD. In a further embodiment, with reference to
Such a system 10 with model aggregator 19 would provide the means to differentiate between three groups, i.e. patients with IBS, patients with IBD and healthy individuals using one single test. It would constitute a non-invasive single test wherein a combination of analyses of a recording of bowel sounds allows differentiating between GI conditions with similar symptoms, such as between IBS and IBD, and healthy bowels, and would present additional clinical value.
Alternatively, the physician may choose to avoid use of a colonoscopy in the first instance and make use of the IBS versus healthy test in combination with a range of simple laboratory tests using stool and blood samples that screen for IBD (faecal calprotectin test), coeliac disease (serology) and colon cancer (feacal occult blood test) prior to making a diagnosis.
Also, if a patient has a family history or “red-flags” for inflammatory bowel disease, a physician may choose to proceed with the non-invasive test for ‘IBD versus healthy bowels’ only, which test would be an extremely useful and cost-effective screening tool, prior to confirmation of a diagnosis of IBD or other organic diseases with other tests or biopsy.
Further, if IBD has not been diagnosed following a biopsy or colonoscopy or a screening test such as the faecal calprotectin test, the physician may choose to proceed with the non-invasive test for ‘IBS versus healthy bowels’ only, or for ‘IBS versus IBD’, which would allow providing the patient with additional clinical information to confirm an IBS diagnosis and/or confirming the results of the colonoscopy/biopsy such that IBD can be ruled out as a diagnosis.
It is contemplated that the system 10 may be implemented on a single device including a belt, a plurality of sensors such as sensors V1 to V4 attached to the belt, and processing device in communication with the sensors, comprising the bowel sound identifier 14, feature extractor 16, and GI condition determiner 18. The processing device may comprise a microcontroller to control and coordinate functions of the system 10. The processing device may additionally comprise the model aggregator 19.
Alternatively, a portion of the system 10 comprising the bowel sound identifier 14, feature extractor 16 and GI condition determiner 18 may be remote from the sensors. For example, that portion of the system 10 may comprise a software program supplying instructions executable on a computing device to operate the system 10. The computing device may for example be a smartphone or other portable electronic device, or a PC. The software program may be provided in the form of a computer-readable medium.
Referring to
The method 1000 comprises obtaining and recording 1002 a signal representative of a plurality of bowel sounds originating from an abdominal region. As described above, the signal may be obtained by recording bowel sounds using a plurality of acoustic sensors, such as sensors V1 to V4. Each vibration sensor V1 to V4 may incorporate double transducers to allow for active noise cancellation if used in a noisy environment. Then, the recorded signal is segmented 1004 into a plurality of segments. Again, as described above, each of the segments may be 20-40 ms in length.
The segments are then modified 1006 by performing a Fourier transformation on the signal segments to obtain a frequency spectrum of the signal. Preferably, the resulting spectrums of corresponding signal segments are also modified to remove background noise. This may comprise detecting the frequency response of the sensor(s) based on the background noise and removing it from the signal spectrum.
A plurality of individual bowel sound signals is then identified 1008 by considering band energy ratios of the spectrum of each signal segment. As described above, this may comprise evaluating the BER that a signal segment has within the frequency bands: 200 Hz to 800 Hz; 600 Hz to 1000 Hz; 800 Hz to 1200 Hz; 1000 Hz to 1600 Hz; and 1600 Hz to 2000 Hz.
Features are then extracted 1010 from the identified individual bowel sound signals, such as one or more of the features listed in Table 2 above. Since this step is performed for a plurality of individual bowel sound signals, a collection of values for each feature is obtained. Accordingly, statistical distribution properties of the collection of values for each feature can be obtained. Each bowel sound signal is also localised 1014 by assigning the signal to a particular sensor V1 to V4 that produces the highest amplitude reading corresponding to the signal, as described above in relation to the signal localiser 32.
The statistical distribution properties of the collection of values for each feature are then extracted 1014. According to a specific embodiment, the statistical distribution properties include skewness and kurtosis. Further, with reference to the “Ultimate Features” column in Tables 1 to 3 above, specific distribution properties are only obtained for specific values that have been selected and prior determination has been made as to which features are most strongly associated with an indication of IBS versus healthy bowels in a first embodiment, IBD versus healthy in a second embodiment, and an indication of IBS versus IBD in a third embodiment. For example, referring to Table 1 for a determination of a likelihood of IBS versus healthy bowels, the kurtosis of the collection of amplitude values of signals assigned to V3 (feature no. 1), and the skewness of the collection of burst values of signals assigned to V3 (feature no. 4), would be extracted. Referring to Table 2 for a determination of a likelihood of IBD versus healthy bowels, the kurtosis of the collection of flatness 3000 values of signals assigned to V2 (features no. 13 and 15), and the skewness of the collection of spectral centroid values of signals assigned to V1 (features no. 11 and 12) would be extracted, for example. And referring to Table 3 for a determination of a likelihood of IBS versus IBD, the kurtosis of the collection of envelope crest factor values of signals assigned to V2 (feature no. 7) and the kurtosis of the collection of roll off values of signals assigned to V4 (feature no. 19), would be extracted, for example. Equations 23 and 24 above can be used to determine the skewness and kurtosis values. As a result, a plurality of respective values for individual or selected features is obtained at step 1016, such as values corresponding to the selected features listed in Tables 1, 2 and 3 above, respectively.
The model shown in Equation 20 can then be applied at step 1018 to the respective values for the respective selected features obtained in step 1016, which provides an output 1020 indicating the likelihood of the GI condition, i.e. IBS or IBD, versus healthy bowels. In doing so, the respective features are associated with a respective corresponding reference parameter, such as a respective weighting coefficient, stored in a library, as discussed above in relation to the GI condition determiner 18. The result is then compared to a threshold value of 0.5 to output 1020 a binary value, whereby if the result is greater than 0.5 the subject is likely to have the GI condition (IBS or IBD), and if the result is less than 0.5 the subject is not likely to have the GI condition. Alternatively, a value between ‘0’ and ‘1’ can be outputted whereby the closer the value is to ‘1’ the greater the likelihood of the GI condition. The reference parameters, as discussed in relation to the GI condition determiner 18 above, vary depending on the differentiation being made, either between IBS and healthy, IBD and healthy or between IBS and IBD.
It will be understood to persons skilled in the art of the invention that many modifications may be made without departing from the spirit and scope of the invention. For example, a different number of features may be identified by the feature identifier 30, such as only one or two features. Such features may include the ultimate features that have relatively larger weighting coefficients, such as: the kurtosis of burst ratio, the skew of the burst amount, and the skew of contraction time interval for determination of a likelihood of IBS versus healthy bowels. Moreover, it may not be necessary to take into account all of the components of the hybrid features.
Alternatively or additionally, instead of the 26 ultimate features listed in Table 1, the 44 features listed in Table 2, or the 26 features listed in Table 3, respective different combinations of features and statistical distribution properties may be used. As another example, instead of weighting coefficients, other reference parameters or properties may be used, such as a reference skew and/or kurtosis value. Furthermore, the association of features to reference parameters may comprise a direction comparison of those features to their respective reference parameters.
Further, while a single non-invasive test for IBS can be performed by a physician on a patient for whom other pathology tests, including colonoscopy and biopsy, may have been undertaken simultaneously to rule out gastrointestinal organic diseases, an embodiment of the present invention provides a method that employs a decision tree algorithm to aggregate the determinations of all three embodiments described above using the model aggregator 19 described in
It will be understood that other algorithms may alternatively be used to combine analyses of bowel sounds of patients and provide an overall determination indicative of a likelihood of a patient having either IBS, or IBD or having healthy bowels, and to differentiate between IBS and IBD. For example, other tree-based algorithms (random forest, etc.) may be used. Further, a kernel method with vector output, or a neural network method with softmax function output may also be used.
In the claims which follow and in the preceding description of the invention, except where the context requires otherwise due to express language or necessary implication, the word “comprise” or variations such as “comprises” or “comprising” is used in an inclusive sense, i.e. to specify the presence of the stated features but not to preclude the presence or addition of further features in various embodiments of the invention.
It is to be understood that, if any prior art publication is referred to herein, such reference does not constitute an admission that the publication forms a part of the common general knowledge in the art, in Australia or any other country.
Number | Date | Country | Kind |
---|---|---|---|
2018900459 | Feb 2018 | AU | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/AU2018/051332 | 12/13/2018 | WO | 00 |