The present disclosure generally relates to methods and systems for engineering features or parameters from biophysical signals for use in diagnostic applications; in particular, the engineering and use of wavelet-based features for use in characterizing one or more physiological systems and their associated functions, activities, and abnormalities. The features or parameters may also be used for monitoring or tracking, controls of medical equipment, or to guide the treatment of a disease, medical condition, or an indication of either.
There are numerous methods and systems for assisting a healthcare professional in diagnosing disease. Some of these involve the use of invasive or minimally invasive techniques, radiation, exercise or stress, or pharmacological agents, sometimes in combination, with their attendant risks and other disadvantages.
Diastolic heart failure, a major cause of morbidity and mortality, is defined as symptoms of heart failure in a patient with preserved left ventricular function. It is characterized by a stiff left ventricle with decreased compliance and impaired relaxation leading to increased end-diastolic pressure in the left ventricle, which is measured through left heart catheterization. Current clinical standard of care for diagnosing pulmonary hypertension (PH), and for pulmonary arterial hypertension (PAH), in particular, involves a cardiac catheterization of the right side of the heart that directly measures the pressure in the pulmonary arteries. Coronary angiography is the current standard of care used to assess coronary arterial disease (CAD) as determined through the coronary lesions described by a treating physician. Non-invasive imaging systems such as magnetic resonance imaging and computed tomography require specialized facilities to acquire images of blood flow and arterial blockages of a patient that are reviewed by radiologists.
It is desirable to have a system that can assist healthcare professionals in the diagnosis of cardiac disease and various other diseases and conditions without the aforementioned disadvantages.
A clinical evaluation system and method are disclosed that facilitate the use of one or more wavelet-based features or parameters determined from biophysical signals such as cardiac/biopotential signals and/or photoplethysmography signals that are acquired, in preferred embodiments, non-invasively from surface sensors placed on a patient while the patient is at rest. The wavelet-based features or parameters can be used in a model or classifier (e.g., a machine-learned classifier) to estimate metrics associated with the physiological state of a patient, including for the presence or non-presence of a disease, medical condition, or an indication of either. The estimated metric may be used to assist a physician or other healthcare provider in diagnosing the presence or non-presence and/or severity and/or localization of diseases or conditions or in the treatment of said diseases or conditions.
The estimation or determined likelihood of the presence or non-presence of a disease, condition, or indication of either can supplant, augment, or replace other evaluation or measurement modalities for the assessment of a disease or medical condition. In some cases, a determination can take the form of a numerical score and related information.
Examples of wavelet-based features or parameters include measures that are derived from extractable properties or geometric characteristics of a spectral image or data of high-power spectral contents or high-coherence in waveform signals of interest in an acquired biophysical signal. Additional examples of wavelet-based features or parameters include measures that are derived from a statistical quantification of the distribution of the power of the high-power spectral contents in the waveform signals of interest.
As used herein, the term “feature” (in the context of machine learning and pattern recognition and as used herein) generally refers to an individual measurable property or characteristic of a phenomenon being observed. A feature is defined by analysis and may be determined in groups in combination with other features from a common model or analytical framework.
As used herein, “metric” refers to an estimation or likelihood of the presence, non-presence, severity, and/or localization (where applicable) of one or more diseases, conditions, or indication(s) of either, in a physiological system or systems. Notably, the exemplified methods and systems can be used in certain embodiments described herein to acquire biophysical signals and/or to otherwise collect data from a patient and to evaluate those signals and/or data in signal processing and classifier operations to evaluate for a disease, condition, or indicator of one that can supplant, augment, or replace other evaluation modalities via one or more metrics. In some cases, a metric can take the form of a numerical score and related information.
In the context of cardiovascular and respiratory systems, examples of diseases and conditions to which such metrics can relate include, for example: (i) heart failure (e.g., left-side or right-side heart failure; heart failure with preserved ejection fraction (HFpEF)), (ii) coronary artery disease (CAD), (iii) various forms of pulmonary hypertension (PH) including without limitation pulmonary arterial hypertension (PAH), (iv) abnormal left ventricular ejection fraction (LVEF), and various other diseases or conditions. An example indicator of certain forms of heart failure is the presence or non-presence of elevated or abnormal left-ventricular end-diastolic pressure (LVEDP). An example indicator of certain forms of pulmonary hypertension is the presence or non-presence of elevated or abnormal mean pulmonary arterial pressure (mPAP).
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments and, together with the description, serve to explain the principles of the methods and systems.
Embodiments of the present invention may be better understood from the following detailed description when read in conjunction with the accompanying drawings. Such embodiments, which are for illustrative purposes only, depict novel and non-obvious aspects of the invention. The drawings include the following figures:
Each and every feature described herein, and each and every combination of two or more of such features, is included within the scope of the present invention provided that the features included in such a combination are not mutually inconsistent.
While the present disclosure is directed to the practical assessment of biophysical signals, e.g., raw or pre-processed photoplethysmographic signals, biopotential/cardiac signals, etc., in the diagnosis, tracking, and treatment of cardiac-related pathologies and conditions, such assessment can be applied to the diagnosis, tracking, and treatment (including without limitation surgical, minimally invasive, lifestyle, nutritional, and/or pharmacologic treatment, etc.) of any pathologies or conditions in which a biophysical signal is involved in any relevant system of a living body. The assessment may be used in the controls of medical equipment or wearable devices or in monitoring applications (e.g., to report the wavelet-based features, parameters, or an intermediate output discussed herein).
The terms “subject” and “patient” as used herein are generally used interchangeably to refer to those who had undergone analysis performed by the exemplary systems and methods.
The term “cardiac signal” as used herein refers to one or more signals directly or indirectly associated with the structure, function, and/or activity of the cardiovascular system—including aspects of that signal's electrical/electrochemical conduction—that, e.g., cause contraction of the myocardium. A cardiac signal may include, in some embodiments, biopotential signals or electrocardiographic signals, e.g., those acquired via an electrocardiogram (ECG), the cardiac and photoplethysmographic waveform or signal capture or recording instrument later described herein, or other modalities.
The term “biophysical signal” as used herein includes but is not limited to one or more cardiac signal(s), neurological signal(s), ballistocardiographic signal(s), and/or photoplethysmographic signal(s), but it also encompasses more broadly any physiological signal from which information may be obtained. Not intending to be limited by example, one may classify biophysical signals into types or categories that can include, for example, electrical (e.g., certain cardiac and neurological system-related signals that can be observed, identified, and/or quantified by techniques such as the measurement of voltage/potential (e.g., biopotential), impedance, resistivity, conductivity, current, etc. in various domains such as time and/or frequency), magnetic, electromagnetic, optical (e.g., signals that can be observed, identified and/or quantified by techniques such as reflectance, interferometry, spectroscopy, absorbance, transmissivity, visual observation, photoplethysmography, and the like), acoustic, chemical, mechanical (e.g., signals related to fluid flow, pressure, motion, vibration, displacement, strain), thermal, and electrochemical (e.g., signals that can be correlated to the presence of certain analytes, such as glucose). Biophysical signals may in some cases be described in the context of a physiological system (e.g., respiratory, circulatory (cardiovascular, pulmonary), nervous, lymphatic, endocrine, digestive, excretory, muscular, skeletal, renal/urinary/excretory, immune, integumentary/exocrine and reproductive systems), one or more organ system(s) (e.g., signals that may be unique to the heart and lungs as they work together), or in the context of tissue (e.g., muscle, fat, nerves, connective tissue, bone), cells, organelles, molecules (e.g., water, proteins, fats, carbohydrates, gases, free radicals, inorganic ions, minerals, acids, and other compounds, elements, and their subatomic components. Unless stated otherwise, the term “biophysical signal acquisition” generally refers to any passive or active means of acquiring a biophysical signal from a physiological system, such as a mammalian or non-mammalian organism. Passive and active biophysical signal acquisition generally refers to the observation of natural or induced electrical, magnetic, optical, and/or acoustics emittance of the body tissue. Non-limiting examples of passive and active biophysical signal acquisition means include, e.g., voltage/potential, current, magnetic, optical, acoustic, and other non-active ways of observing the natural emittance of the body tissue, and in some instances, inducing such emittance. Non-limiting examples of passive and active biophysical signal acquisition means include, e.g., ultrasound, radio waves, microwaves, infrared and/or visible light (e.g., for use in pulse oximetry or photoplethysmography), visible light, ultraviolet light, and other ways of actively interrogating the body tissue that does not involve ionizing energy or radiation (e.g., X-ray). An active biophysical signal acquisition may involve excitation-emission spectroscopy (including, for example, excitation-emission fluorescence). The active biophysical signal acquisition may also involve transmitting ionizing energy or radiation (e.g., X-ray) (also referred to as “ionizing biophysical signal”) to the body tissue. Passive and active biophysical signal acquisition means can be performed in conjunction with invasive procedures (e.g., via surgery or invasive radiologic intervention protocols) or non-invasively (e.g., via imaging, ablation, heart contraction regulation (e.g., via pacemakers), catheterization, etc.).
The term “photoplethysmographic signal” as used herein refers to one or more signals or waveforms acquired from optical sensors that correspond to measured changes in light absorption by oxygenated and deoxygenated hemoglobin, such as light having wavelengths in the red and infrared spectra. Photoplethysmographic signal(s), in some embodiments, include a raw signal(s) acquired via a pulse oximeter or a photoplethysmogram (PPG). In some embodiments, photoplethysmographic signal(s) are acquired from off-the-shelf, custom, and/or dedicated equipment or circuitries that are configured to acquire such signal waveforms for the purpose of monitoring health and/or diagnosing disease or abnormal conditions. The photoplethysmographic signal(s) typically include a red photoplethysmographic signal (e.g., an electromagnetic signal in the visible light spectrum most dominantly having a wavelength of approximately 625 to 740 nanometers) and an infrared photoplethysmographic signal (e.g., an electromagnetic signal extending from the nominal red edge of the visible spectrum up to about 1 mm), though other spectra such as near-infrared, blue and green may be used in different combinations, depending on the type and/or mode of PPG being employed.
The term “ballistocardiographic signal,” as used herein, refers to a signal or group of signals that generally reflect the flow of blood through the entire body that may be observed through vibration, acoustic, movement, or orientation. In some embodiments, ballistocardiographic signals are acquired by wearable devices, such as vibration, acoustic, movement, or orientation-based seismocardiogram (SCG) sensors, which can measure the body's vibrations or orientation as recorded by sensors mounted close to the heart. Seismocardiogram sensors are generally used to acquire “seismocardiogram,” which is used interchangeably with the term “ballistocardiogram” herein. In other embodiments, ballistocardiographic signals may be acquired by external equipment, e.g., bed or surface-based equipment that measures phenomena such as a change in body weight as blood moves back and forth in the longitudinal direction between the head and feet. In such embodiments, the volume of blood in each location may change dynamically and be reflected in the weight measured at each location on the bed as well as the rate of change of that weight.
In addition, the methods and systems described in the various embodiments herein are not so limited and may be utilized in any context of another physiological system or systems, organs, tissue, cells, etc., of a living body. By way of example only, two biophysical signal types that may be useful in the cardiovascular context include cardiac/biopotential signals that may be acquired via conventional electrocardiogram (ECG/EKG) equipment, bipolar wide-band biopotential (cardiac) signals that may be acquired from other equipment such as those described herein, and signals that may be acquired by various plethysmographic techniques, such as, e.g., photoplethysmography. In another example, the two biophysical signal types can be further augmented by ballistocardiographic techniques.
The example analysis and classifiers described herein may be used to assist a healthcare provider in the diagnosis and/or treatment of cardiac- and cardiopulmonary-related pathologies and medical conditions, or an indicator of one. Examples include significant coronary artery disease (CAD), one or more forms of heart failure such as, e.g., heart failure with preserved ejection fraction (HFpEF), congestive heart failure, various forms of arrhythmia, valve failure, various forms of pulmonary hypertension, among various other disease and conditions disclosed herein.
In addition, there exist possible indicators of a disease or condition, such as an elevated or abnormal left ventricular end-diastolic pressure (LVEDP) value as it relates to some forms of heart failure, abnormal left ventricular ejection fraction (LVEF) values as they relate to some forms of heart failure or an elevated mean pulmonary arterial pressure (mPAP) value as it relates to pulmonary hypertension and/or pulmonary arterial hypertension. Indicators of the likelihood that such indicators are abnormal/elevated or normal, such as those provided by the example analysis and classifiers described herein, can help a healthcare provider assess or diagnose that the patient has or does not have a given disease or condition. In addition to these metrics associated with a disease state of condition, other measurements and factors may be employed by a healthcare professional in making a diagnosis, such as the results of a physical examination and/or other tests, the patient's medical history, current medications, etc. The determination of the presence or non-presence of a disease state or medical condition can include the indication (or a metric of measure that is used in the diagnosis) for such disease.
In
In the
Plots 110a′ and 110b′ show examples of the first biophysical-signal data set 110a and the second biophysical-signal data set 110a, respectively. Specifically, Plot 110a′ shows an example of an acquired photoplethysmographic or hemodynamic signal. In Plot 110a′, the photoplethysmographic signal is a time series signal having a signal voltage potential as a function of time as acquired from two light sources (e.g., infrared and red-light source). Plot 110b′ shows an example cardiac signal comprising a 3-channel potential time series plot. In some embodiments, the biophysical signal capture system 102 preferably acquires biophysical signals via non-invasive means or component(s). In alternative embodiments, invasive or minimally-invasively means or component(s) may be used to supplement or as substitutes for the non-invasive means (e.g., implanted pressure sensors, chemical sensors, accelerometers, and the like). In still further alternative embodiments, non-invasive and non-contact probes or sensors capable of collecting biophysical signals may be used to supplement or as substitutes for the non-invasive and/or invasive/minimally invasive means, in any combination (e.g., passive thermometers, scanners, cameras, x-ray, magnetic, or other means of non-contact or contact energy data collection system as discussed herein). Subsequent to signal acquisitions and recording, the biophysical signal capture system 102 then provides, e.g., sending over a wireless or wired communication system and/or a network, the acquired biophysical-signal data set 110 (or a data set derived or processed therefrom, e.g., filtered or pre-processed data) to a data repository 112 (e.g., a cloud-based storage area network) of the assessment system 103. In some embodiments, the acquired biophysical-signal data set 110 is sent directly to the assessment system 103 for analysis or is uploaded to a data repository 112 through a secure clinician's portal.
Biophysical signal capture system 102 is configured with circuitries and computing hardware, software, firmware, middleware, etc., in some embodiments, to acquire, store, transmit, and optionally process both the captured biophysical signals to generate the biophysical-signal data set 110. An example biophysical signal capture system 102 and the acquired biophysical-signal set data 110 are described in U.S. Pat. No. 10,542,898, entitled “Method and Apparatus for Wide-Band Phase Gradient Signal Acquisition,” or U.S. Patent Publication No. 2018/0249960, entitled “Method and Apparatus for Wide-Band Phase Gradient Signal Acquisition,” each of which is hereby incorporated by reference herein in its entirety.
In some embodiments, biophysical signal capture system 102 includes two or more signal acquisition components, including a first signal acquisition component (not shown) to acquire the first biophysical signals (e.g., photoplethysmographic signals) and includes a second signal acquisition component (not shown) to acquire the second biophysical signals (e.g., cardiac signals). In some embodiments, the electrical signals are acquired at a multi-kilohertz rate for a few minutes, e.g., between 1 kHz and 10 kHz. In other embodiments, the electrical signals are acquired between 10 kHz and 100 kHz. The hemodynamic signals may be acquired, e.g., between 100 Hz and 1 kHz.
Biophysical signal capture system 102 may include one or more other signal acquisition components (e.g., sensors such as mechano-acoustic, ballistographic, ballistocardiographic, etc.) for acquiring signals. In other embodiments of the signal capture system 102, a signal acquisition component comprises conventional electrocardiogram (ECG/EKG) equipment (e.g., Holter device, 12 lead ECG, etc.).
Assessment system 103 comprises, in some embodiments, the data repository 112 and an analytical engine or analyzer (not shown—see
Classifier module 116 (e.g., ML classifier module) may include transfer functions, look-up tables, models, or operators developed based on algorithms such as but not limited to decision trees, random forests, neural networks, linear models, Gaussian processes, nearest neighbor, SVMs, Naïve Bayes, etc. In some embodiments, classifier module 116 may include models that are developed based on ML techniques described in U.S. Provisional Patent Application No. 63/235,960, filed Aug. 23, 2021, entitled “Method and System to Non-Invasively Assess Elevated Left Ventricular End-Diastolic Pressure”; U.S. Patent Publication No. 20190026430, entitled “Discovering Novel Features to Use in Machine Learning Techniques, such as Machine Learning Techniques for Diagnosing Medical Conditions”; or U.S. Patent Publication No. 20190026431, entitled “Discovering Genomes to Use in Machine Learning Techniques,” each of which is hereby incorporated by reference herein in its entirety.
Example Biophysical Signal Acquisition.
As shown in
The electrical signals (e.g., corresponding to the second biophysical signal data set 110b) are collected using three orthogonally paired surface electrodes arranged across the patient's chest and back along with a reference lead. The electrical signals are acquired, in some embodiments, using a low-pass anti-aliasing filter (e.g., ˜2 kHz) at a multi-kilohertz rate (e.g., 8 thousand samples per second for each of the six channels) for a few minutes (e.g., 215 seconds). In alternative embodiments, the biophysical signals may be continuously/intermittently acquired for monitoring, and portions of the acquired signals are used for analysis. The hemodynamic signals (e.g., corresponding to the first biophysical signal data set 110a) are collected using a photoplethysmographic sensor placed on a finger. The photo-absorption of red light (e.g., any wavelengths between 600-750 nm) and infrared light (e.g., any wavelengths between 850-950 nm) are recorded, in some embodiments, at a rate of 500 samples per second over the same period. The biophysical signal capture system 102a may include a common mode drive that reduces common-mode environmental noise in the signal. The photoplethysmographic and cardiac signals were simultaneously acquired for each patient. Jitter (inter-modality jitter) in the data may be less than about 10 microseconds (μs). Jitter among the cardiac signal channels may be less than 10 microseconds, e.g., around ten femtoseconds (fs).
A signal data package containing the patient metadata and signal data may be compiled at the completion of the signal acquisition procedure. This data package may be encrypted before the biophysical signal capture system 102a transferring the package to the data repository 112. In some embodiments, the data package is transferred to the assessment system (e.g., 103). The transfer is initiated, in some embodiments, following the completion of the signal acquisition procedure without any user intervention. The data repository 112 is hosted, in some embodiments, on a cloud storage service that can provide secure, redundant, cloud-based storage for the patient's data packages, e.g., Amazon Simple Storage Service (i.e., “Amazon S3”). The biophysical signal capture system 102a also provides an interface for the practitioner to receive notification of an improper signal acquisition to alert the practitioner to immediately acquire additional data from the patient.
Example Method of Operation
Estimation of Presence of Disease State or Indicating Condition.
As stated above, one example in the cardiac context is the estimation of the presence of abnormal left-ventricular end-diastolic pressure (LVEDP) or mean pulmonary artery pressure (mPAP), significant coronary artery disease (CAD), abnormal left ventricular ejection fraction (LVEF), and one or more forms of pulmonary hypertension (PH), such as pulmonary arterial hypertension (PAH). Other pathologies or indicating conditions that may be estimated include, e.g., one or more forms of heart failure such as, e.g., heart failure with preserved ejection fraction (HFpEF), arrhythmia, congestive heart failure, valve failure, among various other disease and medical conditions disclosed herein.
Method 300a further includes the step of retrieving (304) the data set and determining values of wavelet-based features or parameters that characterize properties or geometric shapes of a binarized data object generated from a wavelet transform performed on the biophysical-signal data set. Example operations to determine the values of wavelet-based features or parameters are provided in relation to
Method 300a further includes the step of outputting (308) estimated value(s) for the presence of disease state or abnormal condition in a report (e.g., to be used diagnosis or treatment of the disease state, medical condition, or indication of either), e.g., as described in relation to
Diagnostics or Condition Monitoring or Tracking using Wavelet-based Features or Parameters.
Method 300b further includes determining (310) wavelet-based features or parameters from the acquired biophysical data set, e.g., as described in relation to
Method 300b further includes outputting (312) wavelet-based features or parameters (e.g., in a report for use in diagnostics or as signals for controls). As discussed herein, the wavelet-based features or parameters can provide a characterization or indication of the high-energy content of the spectral power in a given biophysical signal (e.g., biopotential/cardiac, photoplethysmographic signals, ballistocardiographic signals). For monitoring and tracking, the output may be via a wearable device, a handheld device, or medical diagnostic equipment (e.g., pulse oximeter system, wearable health monitoring systems) to provide augmented data associated with health. In some embodiments, the outputs may be used in resuscitation systems, cardiac or pulmonary stress test equipment, pacemakers, etc., in which frequency spectral information is desired.
Wavelet-Based Features or Parameters
A wavelet operator can generate wavelet series as a decomposition of the time series in an orthonormal space created by orthonormal wavelet basis functions constructed using a mother wavelet ψ. Mathematically, the continuous wavelet transform (CWT) of a real signal x(t), denoted by W here, can be calculated using Equation 1.
In Equation 1, a is the wavelet scale (similar to convolution kernel window size) and is the translation. While Fast Fourier Transform (FFT) is applicable for the decomposition of a stationary signal in the frequency domain, wavelet transform provides a flexible time-frequency analysis for non-stationary signals whose spectral content can change over time.
Discrete wavelet transform (DWT) is a class of wavelets that restricts the values of the scale and translation to computationally enhance the performance of the transformation with preserved accuracy. DWT commonly discretizes the scale into the power of 2 (a=1, 2, 4, 8, . . . ) and the translation to integer values as of the case in the discrete time-series (b=1, 2, 3, . . . ).
Examples of wavelet decomposition that may be used include, but are not limited to, continuous wavelet transforms (CWT), and discrete wavelet transforms (DWT).
In implementations in which Module 400a generates spectral images, the images may be represented in two-dimensional axes of time and frequency, and the power is shown as color. To this end, a threshold operator may be used to produce a binarized image with all power above a specific level are assigned the same color and the remaining power assigned a different color. The analysis of spectral energy as images allows image processing operations to be used that can readily extract visually discernable characteristics from the spectral image.
Table 1 shows an example set of 11 extractable high spectral energy characteristics of a waveform region of interest from a generated binarized spectral image or data.
For a circle, the circularity has a value of l.
In
The method 600a then includes isolating or detecting (604) (also referred to as delineating), e.g., using wavelet transform, one or more regions of spectral interest in the biophysical signals. The waveform region detection 604 facilitates the isolation of specific waveform regions for analysis that can minimize the influences of other beats and temporal noises on the power spectrum analysis. Wavelet operators facilitate a more accurate analysis of the waveform frequency components as they can account for frequency drift over time. For cardiac signals, the detection operation (604) may include isolating one or more regions of the cardiac waveform, e.g., associated with atrial depolarization (also referred to as “P wave”), ventricular depolarization (also referred to as QRS wave or peaks), and/or ventricular repolarization (also referred to as “T wave”). For photoplethysmographic signals, the detection operation (604) may include isolating waveforms regions associated with the base, peak, and minimum landmarks in the photoplethysmographic signal(s) or velocityplethysmogram signal(s) derived from the photoplethysmographic signal(s). Further description of the delineation operation 604 is provided in the description provided in relation to
Spectral Image or Data Generation. The method 600a then includes generating (606), using wavelet decomposition/transform operation, a wavelet-power spectrum model of the isolated regions as a spectral image or spectral data. The power spectrum may be calculated for each or a substantial portion (e.g., >50%) of the isolated waveforms (i.e., P-wave. T-wave, and QRS complex of a cardiac signal, a coherence waveform, or a peak or landmark within a photoplethysmographic or velocityplethysmogram signal) for the plurality of cycles in a given biophysical signal. For continuous real-time implementations (e.g., for
The cardiac, photoplethysmographic, and/or coherence power spectrum may be calculated for each of the detected waveforms (e.g., P-wave. T-wave, QRS complex of a cardiac signal or peak or landmark within a photoplethysmographic or velocityplethysmogram signal, etc.). In some embodiments, a 1-D continuous wavelet transform is applied, such as a Morlet (also referred to as a Gabor) wavelet as the mother wavelet. Other wavelets, e.g., having equal variance in time and frequency, may be used, e.g., Gaussian, Mexican Hat, Spline, and Mayer wavelet. The wavelet may have a resolution of 48 voices per octave. The Morlet wavelet is a wavelet composed of a complex exponential (carrier) multiplied by a Gaussian window (envelope), as shown in Equation 2.
ψ(t)=exp(iωt)exp(−t2/2σ2) (Equation 2)
In Equation 2, ω is the wavelet central frequency, and σ=n/2πf is the width of the Gaussian window with n (the number of cycles) controlling the time-frequency resolution trade-off.
Coherence waveforms may be determined as a measure of the correlation between time series signals, such as between two channels in a cardiac signal data set or between two channels in a photoplethysmographic signal data set or between two channels of different modalities. Wavelet coherence may be determined, for example, by Equation 3.
In Equation 3, the cross-spectrum Cxy is a measure of the distribution of power of two signals x and y in the time-frequency domain given by Equation 4.
C
xy(a,τ)=|S(C*xy(a,τ))|2 (Equation 4)
In Equation 4, superscript denotes a complex conjugate, and S is a smoothing operator in time and scale. In some embodiments, a coherence spectrum operator is performed, e.g., with 32 voices per octave to find the coherence spectrum of the paired channels (e.g., between channels X and Y, channels x and X, and channels Y and Z). Coherence spectrum may be generated between a cardiac signal and a photoplethysmographic signal.
High-Power Spectral Image or Data Generation. Referring to
The method 606a includes, in some embodiments, trimming (904) the generated spectral image or data to remove regions in the image or data where confidence in the wavelet transformation is low, e.g., due to wavelet cone of influence. These regions are typically at very low frequencies and at the temporal boundaries.
The operation 606a then includes (i) cropping (906) the spectral image temporally to limit to the central time period of the image and (ii) cropping (906) the image in the frequency domain to limit to a maximum frequency. For cardiac signals, the image may be cropped temporarily (e.g., in the temporal/time axis, e.g., x-axis) to limit the image to the central 0.6 seconds. The image may be cropped in the frequency domain to limit to a maximum frequency. The maximum frequency may be applied differently for the different waveform regions. For example, cropped maximum frequency for the P-wave waveforms may be 60 Hz; for the QRS waveforms, 80 Hz; and for the T-wave waveforms, 60 Hz. The QRS waveforms may be permitted higher frequencies to facilitate analysis of the higher frequency content as compared to P-waves and T-waves.
The operation 606a includes thresholding (908) the image to generate a binarized logical representation of the spectral characteristics of the acquired signal. In some embodiments, the image is thresholded using percentiles where all values (corresponding to spectral power in the image) above a certain percentile is maintained, and all power below that threshold is removed. Table 2 shows example threshold levels for various cardiac waveform types.
As shown in Table 2, thresholding may be dependent on the waveform, e.g., with the top 1% (i.e., a threshold of 0.01) used for the P-wave and QRS-wave, and 2.5% (i.e., a threshold of 0.025) for the T-wave. The lower threshold facilitates analysis of the power in more of the lower frequency components associated with the ventricular repolarization (T-wave). The thresholded regions in the binarized spectral image or data thus include the high-power regions of the spectral image in reference to its frequency location and time location within the acquired biophysical signal.
In
Features Computation. Referring to
Cardiac signal Spectral-Image based features. Table 3A shows a summarized set of 198 features directed to 11 geometric or properties (shown as “TimeRange,” “TimeCentroid,” FrequencyCentroid,” “SurfaceArea,” “Eccentricity,” “Circularity,” and “Extent” in Table 3A) that can be extracted from high-power spectral content information in a binarized spectral image or data for a respective waveform signal region of interests (e.g., atrial depolarization (P-wave) regions, ventricular depolarization (QRS) wave regions, and ventricular repolarization (T-wave) regions in a given signal channel of a cardiac signal (referred signal channels X, Y, and/or Z in Table 3A) to provide up to 99 features. The wavelet-based feature or parameter may be computed from spectral images generated across many cardiac cycles. To compress the computed feature values to a summary feature (e.g., suitable for ML), Module 400a may employ (i) the interquartile range (“IQR”) that shows how the feature varies across the cycles or (ii) the middle value in the distribution of feature values (“median”). When quantified for the IQR or the median, Module 400a can compute, e.g., for a total of 198 potential wavelet-based features or parameters.
It has been observed through experimentation that time range, frequency range, time centroid, frequency centroid, surface area, eccentricity, circularity, extent, orientation, and power centroid among the various waveform signal regions and signal channels have significant utility in the assessment of the presence or non-presence of at least one cardiac disease or condition—specifically, the determination of presence or non-presence of elevated LVEDP. It has also been observed through experimentation that time range, frequency range, time centroid, frequency centroid, surface area, eccentricity, circularity, extent, and orientation among the various waveform signal regions and signal channels have significant utility in the assessment of the presence or non-presence of coronary artery disease. The list of the specific wavelet-based features or parameters determined to have significant utility in the assessment of the presence or non-presence of abnormal or elevated LVEDP and the presence or non-presence of significant CAD is provided in Table 8A and 9A, respectively.
PPG signal Spectral-Image-based features. Table 3B shows an example set of 264 extractable wavelet-based features or parameters of the same 11 parameters for three example regions of two photoplethysmographic signals and three example regions of a corresponding two set of velocityplethysmogram signals, to provide up to 132 features. The parameters may be extracted across multiple photoplethysmographic signal cycles or velocityplethysmogram signal cycles and can be quantified as the interquartile range (“IQR”) or the median to provide a total of 264 features. It also has been observed through experimentation that time range, frequency range, time centroid, frequency centroid, surface area, eccentricity, circularity, extent, orientation, power centroid, and number of regions, among PPG and VPG channels have significant utility in the assessment of the presence or non-presence of coronary artery disease. The list of the specific wavelet-based features or parameters determined to have significant utility in the assessment of presence or non-presence of significant CAD is provided in Table 9A.
Coherence analysis Spectral-Image features. Table 3C shows an example set of 33 extractable wavelet-based features or parameters for the 11 parameters of Table 1 as computed for three coherence waveforms, namely, among cardiac signals, XY, XZ, and YZ. It has been observed through experimentation that the frequency centroid parameter and the eccentricity parameter have significant utility in the assessment of the presence or non-presence of at least one cardiac disease or condition—specifically, the determination of presence or non-presence of elevated LVEDP. It also has been observed through experimentation that frequency range, time centroid, frequency centroid, circularity, and extent parameters have significant utility in the assessment of the presence or non-presence of coronary artery disease. The list of the specific wavelet-based features or parameters determined to have significant utility in the assessment of the presence or non-presence of abnormal or elevated LVEDP and the presence or non-presence of significant CAD is provided in Tables 8A and 9A, respectively.
Due to the potential sensitivity of wavelet features or parameters of Module 400a to the power threshold value, Module 400a may be complemented or supplemented with additional wavelet-based features or parameters of Modules 400b to mitigate such dependence. Module 400b may generate a spectral image or spectral data from a wavelet transformation of a given waveform region of interest (or used intermediate outputs from Module 400a) to which multiple threshold operations at different threshold values are performed to generate a corresponding number of binarized spectral images or data. Each binarized spectral image or data is evaluated for a given parameter (e.g., time range, frequency range, time centroid, surface area, eccentricity, circularity, extent, orientation, and/or power centroid) of the high spectral energy characteristics of a waveform region to which a trend line may be determined. This trend and sensitivity evaluation is also referred to as “decay” analysis.
Tables 4A and 4B each shows a summarized set of 99 wavelet-based features (“parameters”) computed from a threshold sensitivity analysis of the high-power content spectral content of the waveform signal regions of interests for a given signal channel of a biopotential signal (referred to as “Signal Channel”). In Table 4A, the wavelet-based features are shown generated from three potential waveform regions (shown as “Signal Region”), namely the atrial depolarization (P-wave) regions, ventricular depolarization (QRS) wave regions, and ventricular repolarization (T-wave) regions, and for each of the acquired channels (e.g., X, Y, and Z channels). The parameter may be computed across many cycles and quantified as the interquartile range (“IQR”) or as the median to provide a total of 198 features.
In Table 4A, the geometric or properties associated with “Decay_TimeRange,” “Decay_FrequencyRange,” “Decay_TimeCentroid,” FrequencyCentroid,” “Decay_Eccentricity,” “Decay_Circularity,” “Decay_Extent,” “Decay_Orientation,” and “Decay_PowerCentroid” for each of the waveform signal regions and signal channels have been experimentally determined to have significant utility in the assessment of the presence or non-presence of at least one cardiac disease or condition—specifically, the determination of presence or non-presence of elevated LVEDP. It also has been observed through experimentation that Decay_TimeRange,” “Decay_TimeCentroid,” “Decay_SurfaceArea,” “Decay_Eccentricity,” “Decay_Circularity,” “Decay_Extent,” and “Decay_Orientation” parameters have significant utility in the assessment of the presence or non-presence of coronary artery disease. The list of the specific features determined to have significant utility in the assessment of the presence or non-presence of abnormal or elevated LVEDP and the presence or non-presence of significant CAD is provided in Table 8B and Table 9B, respectively.
Table 4B shows the 11 features of Table 1 computed for three waveform signal regions (e.g., maximum peak regions, minimum peak regions, and base pulse regions) of photoplethysmographic signal and/or a corresponding velocityplethysmogram signal (e.g., PPG IR channel, PPG red channel, VPG IR channel, and VPG red channel) to provide a total of 132 features. The features may be generated across multiple cycles, which can be expressed in interquartile range or median distribution to provide a total number of 264 features.
In an example,
Wavelet Distribution Features—Example #3
Module 500 is configured to calculate the power distribution for the high-power region of the power spectrum for the P-wave, QRS complex, and T-wave separately filtered by using the power percentile threshold. The distribution analysis is performed over the frequencies in the range of 1-60 Hz to focus on the analysis of the majority of the frequency component existing in the ventricular depolarization and repolarization waves.
The distribution may include the quantile-quantile probability assessed between (i) a distribution function determined within the wavelet-based models and (ii) a base distribution function (e.g., uniform or normal distribution). The distribution may be based on the power density function or cumulative density function.
Table 5 shows an example set of 25 extractable wavelet-based features or parameters of the power distribution for the high-power region of the power spectrum of waveform regions of interest. The first 13 features are calculated from the power spectral density distribution.
In
The method 600c then includes determining (612) the high-power region of a generated wavelet spectral image. For cardiac signals, for example, the high-power region of the P-wave, QRS complex, and T-wave may be assessed, e.g., using filters having a power percentile threshold of 0.01, 0.01, and 0.025, respectively. The distribution analysis is limited to the frequencies in the range of 1-60 Hz, which contains the majority of the frequency component existing in the ventricular depolarization and repolarization waves. For photoplethysmographic signal, the wavelet power spectrum of the PPG and VPG Red are filtered by using the 4% and 0.3% highest power percentile (corresponding to the threshold of 0.04, 0.003), respectively. The distribution analysis for PPG and VPG is limited to frequencies below 10 Hz and 15 Hz, respectively.
The method 600c then includes determining (614) distribution of the high-power region, such as the power spectrum density or the cumulative distribution function. In some embodiments, the power spectrum density (PSD) is obtained by integrating the power (e.g., the z-axis of the image) over the spectrum time (e.g., the x-axis of the image) at each specified frequency.
The method 600c then includes determining (616) features and associated values from aspects of the generated distribution per Tables 6A and 6B provided below.
Cardiac Wavelet Distribution Features. Table 6A shows a summarized set of 213 features directed to the 25 distribution parameters of Table 5. In Table 6A, 21 features may be determined for any one of three example waveform regions (e.g., atrial depolarization (P-wave) regions, ventricular depolarization (QRS) wave regions, and ventricular repolarization (T-wave) regions) of a given cardiac signal (e.g., channels X, Y, and Z) while 4 of the features may be determined for two of the regions. The wavelet-based feature or parameter may be computed from spectral images generated across many cardiac cycles. To compress the computed feature values down to a summary feature (e.g., suitable for ML), Module 500 may employ the median in the distribution of feature values.
It has been observed that 14 of the features (“Dist_medianPSD,” “Dist_stdPSD,” “Dist_skewPSD,” “Dist_entropy,” “Dist_peakWidthPSD,” “Dist_peakDispPSD,” “Dist_qqslop uniform,” “Dist_qqr2 uniform”, “Dist_qqsse normal,” “Dist_qqr2 normal”, “Dist_cdfNormal_L1norm”, “Dist_cdfNormal_std,” “Dist_pdftkernel_L1norm”, and “Dist_pdfkernal_std”) among various waveform signal regions and signal channels have been experimentally determined to have significant utility in the assessment of the presence or non-presence of at least one cardiac disease or condition—specifically, the determination of presence or non-presence of elevated LVEDP. It also has been observed through experimentation that “medianPSD,” “stdPSD,” “kurtPSD,” “entropy,” “peakWidthPSD,” “qqslop_uniform,” and “qqr2_uniform,” “qqslop_normal,” “qqsse_normal,” “qqr2_normal,” “dfNormal_L1norm,” and “pdfKernal_std” parameters have significant utility in the assessment of the presence or non-presence of coronary artery disease. The list of the specific wavelet-based features or parameters determined to have significant utility in the assessment of the presence or non-presence of abnormal or elevated LVEDP and the presence or non-presence of significant CAD is provided in Table 8C and Table 9C, respectively.
PPG/VPG Wavelet Distribution Features. Table 6B shows a summarized set of 50 features directed to the 25 distribution parameters of Table 5. In Table 6B, the 25 features may be determined for each cycle per signal (e.g., VPG and PPG) and compressed by their median across the cycles. It also has been observed through experimentation that “sumPSD,” “stdPSD,” “skewPSD,” “kurtPSD,” “entropy,” “peakWidthPSD,” “peakRatioPSD,” “numpeakPSD,” peakRatioPSD,” “fpeakRatioPSD,” “peakDispRate,” “qqslop_uniform,” “qqsse_normal,” “qqslop_normal,” “Pdftkernal_L1_norm,” “Pdftkernal_std,” “powerpeak,” “peakDispPSD,” and “Pdftkernal_numpeak” parameters have significant utility in the assessment of the presence or non-presence of coronary artery disease. The list of the specific wavelet-based features or parameters determined to have significant utility in the assessment of the presence or non-presence of significant CAD is provided in Table 9C.
PSD Distribution Features.
Power Spectral Density (PSD) Quantile-Quantile Probability Features.
Cumulative Density Distribution (CDD) Features.
Waveform Region Isolation Operation
Cardiac signals.
The delineation operation 604a, in the example of
The delineation operation 604a then includes segmenting/isolating (808) waveforms associated with ventricular repolarization and atrial depolarization in the transformed signal (806). In some embodiments, the ventricular repolarization and atrial depolarization segment (also referred to as T-P segment) is isolated by segmenting the signal between the QRSoff,i and the next consecutive cycle QRSonset,i+1. Having T-P segment isolated, ⅔ of the TP segment may be assigned to the waveform region associated with ventricular repolarization (T-wave) and the remaining ⅓ to atrial depolarization (P-wave). Other segmentation may be used, for example, that described in D. B. Dubin, Rapid Interpretation of EKG's: An interactive course, 6th ed. tampa: Cover Pub, 2000.
Photoplethysmographic Signals.
The Hemodynamic Delineator may first determine the PPG pulse base, which can then be used for the detection of other landmarks. To determine the PPG pulse base, in
Following the PPG pulse base determination, systolic peak 838 may be determined. The peak may be defined as having a minimum pulse of 125 ms and filtered using the criteria: (i) the minimum peak width (at the half-prominence) being less than the minimum pulse width of 125 ms and (ii) systolic peak values being less than ten-scaled median absolute deviations (MAD) away from the median of the detected pulse bases. A maximum filter may be applied to detect the maximum values of the detected peaks within two consecutive pulse bases as the systolic peak for a corresponding cycle.
Following the PPG pulse base determination, a diastolic peak may be determined. The diastolic peak (operation 842) may be determined by (i) segmenting (858) the PPG signal using indexes of VPG min (discussed below) at cycle n and the consecutive PPG pulse base at cycle n+1, smoothing (860) the segmented PPG using a smoothing operator, determining (862) a VPG signal from the smoothed PPG signal, and detecting (862) peaks in the determined VPG signal. The smoothing operation (860) may employ a 20-datapoint Gaussian-weighted moving average filter as one example. The peak detection (864) may be configured to detect VPG peaks by application of a time constraint that constrains the local maxima to the first 50% of indexes of the segmented VPG. In some embodiments, a maximum filter is applied to the original PPG (non-smoothed) to search for local maxima around the detected diastolic peak in the smoothed PPG to determine the diastolic peak.
To determine the VPG min, the PPG signal may be segmented using the indexes of PPG systolic peak at cycle n and the consecutive PPG pulse base at cycle n+1. The segmented PPG may be smoothened, e.g., with a 20-datapoint Gaussian-weighted moving average filter. A VPG signal may be derived from the smoothed PPG, and the peaks for the inverted VPG are detected using a peak finder operator. The detected peaks may be filtered by applying a time constrain that the VPG min should occur at the first 30% of indexes of the segmented VPG. A maximum filter is applied to the original VPG (non-smoothed) that searches for local minima around the detected VPG min in the smoothed VPG.
To find a VPG peak, method 842 includes (i) segmenting (866) indexes from a photoplethysmographic signal corresponding to a monotonically increasing segment in the photoplethysmographic signal, also referred to as PPG raise-segments, (ii) detecting (868) VPG peaks using a peak finder operator, and (iii) filtering (870) the detected peaks and applying a maximum filter to identify the maximum value of the detected peaks within the PPG raise-segment as the VPG peak for the corresponding cycle. The PPG raise-segments (per operation 866) may be identified as the data points between the PPG pulse base at cycle n and a consecutive PPG systolic peak at cycle n+1. To detect (868) the VPG peaks, a peak finder operator may be used, e.g., configured with a minimum pulse width of 25% of the median of the PPG raise-duration (the time in ms associated with the PPG raise-segment) across the VPG signal. To filter the detected peaks, the filter may be applied with the criteria: (i) the minimum peak width (at the half-prominence) be less than the VPG minimum pulse width and (ii) VPG peak values be less than ten scaled MAD away from the median of the detected peaks.
To determine the VPG min, method 844 includes (i) segmenting (872) the PPG signal using the indexes of PPG systolic peak at cycle n and the consecutive PPG pulse base at cycle n+1, (ii) smoothing (874) the segmented PPG, (iii) determining (876) an inverted VPG signal from the smoothed PPG and detect peaks of the inverted VPG, and (iv) applying (878) a maximum filter to the original VPG (non-smoothed) to search for a local minima around the detected VPG min in the smoothed VPG. To smooth the segmented PPG, a 20-datapoint Gaussian-weighted moving average filter may be used as one example. To detect the peaks (876), a filter may be used that is configured to filter for detected peaks having a time constraint that the VPG min occurs at the first 30% of indexes of the segmented VPG. The method 844 may.
To determine the VPG base, method 846 include (i) correcting (880) a baseline from the VPG signal that may occur during the numerical derivatization of PPG and (ii) determining (882) the VPG bases through zero-crossing the VPG where VPG bases less than four-scaled median absolute deviations (MAD) away from the median of the detected bases are removed. The baseline correction (880) may be performed by percentile filtering the VPG to identify the data points with the values between the 20% and 50% percentile of the VPG signal and then subtracting the VPG signal from the mean of the percentile filtered VPG. The baseline correction 880 may segment the baseline-corrected VPG using the indexes of VPG min at cycle n and a consecutive VPG peak at cycle n+1.
Several development studies have been conducted to develop feature sets, and in turn, algorithms that can be used to estimate the presence or non-presence, severity, or localization of diseases, medical conditions, or an indication of either. In one study, algorithms were developed for the non-invasive assessment of abnormal or elevated LVEDP. As noted above, abnormal or elevated LVEDP is an indicator of heart failure in its various forms. In another development study, algorithms and features were developed for the non-invasive assessment of coronary artery disease.
As part of these two development studies, clinical data were collected from adult human patients using a biophysical signal capture system and according to protocols described in relation to
Within the feature development phases, features were developed, including the wavelet-based features, to extract characteristics in an analytical framework from biopotential signals (as an example of the cardiac signals discussed herein) and photo-absorption signals (as examples of the hemodynamic or photoplethysmographic discussed herein) that are intended to represent properties of the cardiovascular system. Corresponding classifiers were also developed using classifier models, linear models (e.g., Elastic Net), decision tree models (XGB Classifier, random forest models, etc.), support vector machine models, and neural network models to non-invasively estimate the presence of an elevated or abnormal LVEDP. Univariate feature selection assessments and cross-validation operations were performed to identify features for use in machine learning models (e.g., classifiers) for the specific disease indication of interest. Further description of the machine learning training and assessment are described in U.S. Provisional Patent Application No. 63/235,960, filed Aug. 23, 2021, entitled “Method and System to Non-Invasively Assess Elevated Left Ventricular End-Diastolic Pressure,” which is hereby incorporated by reference herein in its entirety.
The univariate feature selection assessments evaluated many scenarios, each defined by a negative and a positive dataset pair using t-test, mutual information, and AUC-ROC evaluation. The t-test is a statistical test that can determine if there is a difference between two sample means from two populations with unknown variances. Here, the t-tests were conducted against a null hypothesis that there is no difference between the means of the feature in these groups, e.g., normal LVEDP vs. elevated (for LVEDP algorithm development); CAD− vs. CAD+(for CAD algorithm development). A small p-value (e.g., ≤0.05) indicates strong evidence against the null hypothesis.
Mutual information (MI) operations were conducted to assess the dependence of elevated or abnormal LVEDP or significant coronary artery disease on certain features. An MI score greater than one indicates a higher dependency between the variables being evaluated. MI scores less than one indicate a lower dependency of such variables, and an MI score of zero indicates no such dependency.
A receiver operating characteristic curve, or ROC curve, illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied. The ROC curve may be created by plotting the true positive rate (TPR) against the false positive rate (FPR) at various threshold settings. AUC-ROC quantifies the area under a receiver operating characteristic (ROC) curve—the larger this area, the more diagnostically useful the model is. The ROC, and AUC-ROC, value is considered statistically significant when the bottom end of the 95% confidence interval is greater than 0.50.
Table 7 shows an example list of the negative and a positive dataset pair used in the univariate feature selection assessments. Specifically, Table 7 shows positive datasets being defined as having an LVEDP measurement greater than 20 mmHg or 25 mmHg, and negative datasets were defined as having an LVEDP measurement less than 12 mmHg or belonging to a subject group determined to have normal LVEDP readings.
Tables 8A, 8B, and 8C each shows a list of wavelet-based features having been determined to have utility in estimating the presence and non-presence of elevated LVEDP in an algorithm executing in a clinical evaluation system. The features of Tables 8A, 8B, and 8C and corresponding classifiers have been validated to have clinical performance comparable to the gold standard invasive method to measure elevated LVEDP.
1= LVEDP <= 12 (N = 246) vs >=25 (N = 78)
2= LVEDP <= 20 (N = 95) vs CADHealth G1 (N = 122)
3= LVEDP >= 25 (N = 95) vs CADHealth G2 (N = 37)
Tables 9A, 9B, and 9C each shows a list of power spectral-based features having been determined to have utility in estimating the presence and non-presence of significant CAD in an algorithm executing in a clinical evaluation system. The features of Tables 9A, 9B, and 9C and corresponding classifiers have been validated to have clinical performance comparable to the gold standard invasive method to measure CAD.
The determination that certain wavelet-based features have clinical utility in estimating the presence and non-presence of elevated LVEDP or the presence and non-presence of significant CAD provides a basis for the use of these wavelet-based features or parameters, as well as other features described herein, in estimating for the presence or non-presence and/or severity and/or localization of other diseases, medical conditions, or indications of either particularly, though not limited to, heart disease or conditions described herein.
The experimental results further indicate that intermediary data or parameters of wavelet-based features also have clinical utility in diagnostics as well as treatment, controls, monitoring, and tracking applications.
Example Clinical Evaluation System
In various embodiments, different versions of the clinical evaluation system 1500 may implement the assessment system 103 (
In
The base system 1504 can provide a foundation of functions and instructions upon which each add-on module 1502 (which includes the disease-specific algorithm) then interface to assess for the pathology or indicating condition. The base system 1504, as shown in the example of
Data repository 112a, which can be cloud-based, stores data from the signal capture system 102 (shown as 102b). Biophysical signal capture system 102b, in some embodiments, is a reusable device designed as a single unit with a seven-channel lead set and photoplethysmogram (PPG) sensor securely attached (i.e., not removable). Signal capture system 102b, together with its hardware, firmware, and software, provides a user interface to collect patient-specific metadata entered therein (e.g., name, gender, date of birth, medical record number, height, and weight, etc.) to synchronously acquire the patient's electrical and hemodynamic signals. The signal capture system 102b may securely transmit the metadata and signal data as a single data package directly to the cloud-based data repository. The data repository 112a, in some embodiments, is a secure cloud-based database configured to accept and store the patient-specific data package and allow for its retrieval by the analytical engines or analyzer 1506 or 1514.
Base analytical engine or analyzer 1506 is a secure cloud-based processing tool that may perform quality assessments of the acquired signals (performed via “SQA” module 1516), the results of which can be communicated to the user at the point of care. The base analytical engine or analyzer 1506 may also perform pre-processing (shown via pre-processing module 1518) of the acquired biophysical signals (e.g., 110—see
Add-on module 1502 includes a second part 1514 (also referred to herein as the analytical engine (AE) or analyzer 1514 and shown as “AE add-on module” 1514) that operates with the base analytical engine (AE) or analyzer 1506. Analytical engine (AE) or analyzer 1514 can include the main function loop of a given disease-specific algorithm, e.g., the feature computation module 1520, the classifier model 1524 (shown as “Ensemble” module 1524), and the outlier assessment and rejection module 1524 (shown as “Outlier Detection” module 1524). In certain modular configurations, the analytical engines or analyzers (e.g., 1506 and 1514) may be implemented in a single analytical engine module.
The main function loop can include instructions to (i) validate the executing environment to ensure all required environment variables values are present and (ii) execute an analysis pipeline that analyzes a new signal capture data file comprising the acquired biophysical signals to calculate the patient's score using the disease-specific algorithm. To execute the analysis pipeline, AE add-on module 1514 can include and execute instructions for the various feature modules 114 and classifier module 116 as described in relation to
The clinical evaluation system 1500 can manage the data within and across components using the web-service DTAPIs 1508 (also may be referred to as HCPP web services in some embodiments). DTAPIs 1508 may be used to retrieve acquired biophysical data sets from, and to store signal quality analysis results to, the data repository 112a. DTAPIs 1508 may also be invoked to retrieve and provide the stored biophysical data files to the analytical engines or analyzers (e.g., 1506, 1514), and the results of the analytical engine's analysis of the patient signals may be transferred using DTAPI 1508 to the report database 1510. DTAPIs 1508 may also be used, upon a request by a healthcare professional, to retrieve a given patient data set to the web portal module 1513, which may present a report to the healthcare practitioner for review and interpretation in a secure web-accessible interface.
Clinical evaluation system 1500 includes one or more feature libraries 1526 that store the wavelet-based features 120 and various other features of the feature modules 122. The feature libraries 1526 may be a part of the add-on modules 1502 (as shown in
Further details of the modularity of modules and various configurations are provided in U.S. Provisional Patent Application No. 63/235,960, filed Aug. 19, 2021, entitled “Modular Disease Assessment System,” which is hereby incorporated by reference herein in its entirety.
Example Operation of the Modular Clinical Evaluation System
Signal quality assessment/rejection (1530). Referring to
The base analytical engine or analyzer 1506 performs two sets of assessments for signal quality, one for the electrical signals and one for the hemodynamic signals. The electrical signal assessment (1530) confirms that the electrical signals are of sufficient length, that there is a lack of high-frequency noise (e.g., above 170 Hz), and that there is no power line noise from the environment. The hemodynamic signal assessment (1530) confirms that the percentage of outliers in the hemodynamic data set is below a pre-defined threshold and that the percentage and maximum duration that the signals of the hemodynamic data set are railed or saturated is below a pre-defined threshold.
Feature Value Computation (1532). The AE add-on module 1514 performs feature extraction and computation to calculate feature output values. In the example of the LVEDP algorithm, the AE add-on module 1514 determines, in some embodiments, a total of 446 feature outputs belonging to 18 different feature families (e.g., generated in modules 120 and 122), including the wavelet-based features (e.g., generated in module 120). For the CAD algorithm, an example implementation of the AE add-on module 1214 determines a set of features, including 456 features corresponding to the same 18 feature families.
Additional descriptions of the various features, including those used in the LVEDP algorithm and other features and their feature families, are described in U.S. Provisional Patent Application No. 63/235,960, filed Aug. 23, 2021, entitled “Method and System to Non-Invasively Assess Elevated Left Ventricular End-Diastolic Pressure”; U.S. Provisional Patent Application No. 63/236,072, filed Aug. 23, 2021, entitled “Methods and Systems for Engineering Visual Features From Biophysical Signals for Use in Characterizing Physiological Systems”; U.S. Provisional Patent Application No. 63/235,963, filed Aug. 23, 2021, entitled “Methods and Systems for Engineering Power Spectral Features From Biophysical Signals for Use in Characterizing Physiological Systems”; U.S. Provisional Patent Application No. 63/235,966, filed Aug. 23, 2021, entitled “Method and System for Engineering Rate-Related Features From Biophysical Signals for Use in Characterizing Physiological Systems”; a U.S. Provisional Patent Application No. 63/130,324, titled “Method and System to Assess Disease Using Cycle Variability Analysis of Cardiac and Photoplethysmographic Signals”; U.S. Provisional Patent Application no. 63/235,971, filed Aug. 23, 2021, entitled “Methods and Systems for Engineering photoplethysmographic Waveform Features for Use in Characterizing Physiological Systems”; U.S. Provisional Patent Application No. 63/236,193, filed Aug. 23, 2021, entitled “Methods and Systems for Engineering Cardiac Waveform Features From Biophysical Signals for Use in Characterizing Physiological Systems”; U.S. Provisional Patent Application No. 63/235,974, filed Aug. 23, 2021, entitled “Methods and Systems for Engineering Conduction Deviation Features From Biophysical Signals for Use in Characterizing Physiological Systems”, each of which is hereby incorporated by reference herein in its entirety.
Classifier Output Computation (1534). The AE add-on module 1514 then uses the calculated feature outputs in classifier models (e.g., machine-learned classifier models) to generate a set of model scores. The AE add-on module 1514 joins the set of model scores in an ensemble of the constituent models, which, in some embodiments, averages the output of the classifier models as shown in Equation 5 in the example of the LVEDP algorithm.
In some embodiments, classifier models may include models that are developed based on ML techniques described in U.S. Patent Publication No. 20190026430, entitled “Discovering Novel Features to Use in Machine Learning Techniques, such as Machine Learning Techniques for Diagnosing Medical Conditions”; or U.S. Patent Publication No. 20190026431, entitled “Discovering Genomes to Use in Machine Learning Techniques,” each of which is hereby incorporated by reference herein in its entirety.
In the example of the LVEDP algorithm, thirteen (13) machine-learned classifier models are each calculated using the calculated feature outputs. The 13 classifier models include four ElasticNet machine-learned classifier models, four RandomForestClassifier machine-learned classifier models, and five extreme gradient boosting (XGB) classifier models. In some embodiments, the patient's metadata information, such as age, gender, BMI value, may be used. The output of the ensemble estimation may be a continuous score. The score may be shifted to a threshold value of zero by subtracting the threshold value for presentation within the web portal. The threshold value may be selected as a trade-off between sensitivity and specificity. The threshold may be defined within the algorithm and used as the determination point for test positive (e.g., “Likely Elevated LVEDP”) and test negative (e.g., “Not Likely Elevated LVEDP”) condition.
In some embodiments, the analytical engine or analyzer can fuse the set of model scores with a body mass index-based adjustment or an adjustment based on age or gender. For example, the analytical engine or analyzer can average the model estimation with a sigmoid function of the patient BMI having the form sigmoid(x)=1/1+e−x.
Physician Portal Visualization (1536). The patient's report may include a visualization 1536 of the acquired patient data and signals and the results of the disease analyses. The analyses are presented, in some embodiments, in multiple views in the report. In the example shown in
The healthcare portal may list a report for a patient if a given patient's acquired signal data set meets the signal quality standard. The report may indicate a disease-specific result (e.g., elevated LVEDP) being available if the signal analysis could be performed. The patient's estimated score (shown via visual element 118a, 118b, 118c) for the disease-specific analysis may be interpreted relative to an established threshold.
In the score summary section 1540 shown in the example of
The threshold section 1542 shows reported statistics of the threshold as provided to a validation population that defines the sensitivity and specificity for the estimation of the patient score (e.g., 118). The threshold is the same for every test regardless of the individual patient's score (e.g., 118), meaning that every score, positive or negative, may be interpreted for accuracy in view of the provided sensitivity and specificity information. The score may change for a given disease-specific analysis as well with the updating of the clinical evaluation.
The frequency distribution section 1544 illustrates the distribution of all patients in two validation populations (e.g., (i) a non-elevated population to indicate the likelihood of a false positive estimation and (ii) an elevated population to indicate a likelihood of a false negative estimation). The graphs (1546, 1548) are presented as smooth histograms to provide context for interpreting the patient's score 118 (e.g., 118b, 118c) relative to the test performance validation population patients.
The frequency distribution section 1540 includes a first graph 1546 (shown as “Non-Elevated LVEDP Population” 1546) that shows the score (118b), indicating the likelihood of the non-presence of the disease, condition, or indication, within a distribution of a validation population having non-presence of that disease, condition, or indication and a second graph 1548 (shown as “Elevated LVEDP Population” 1548) that shows the score (118c), indicates the likelihood of the presence of the disease, condition, or indication, within a distribution of validation population having the presence of that disease, condition, or indication. In the example of the assessment of elevated LVDEP, the first graph 1546 shows a non-elevated LVEDP distribution of the validation population that identifies the true negative (TN) and false positive (FP) areas. The second graph 1548 shows an elevated LVEDP distribution of the validation population that identifies the false negative (TN) and true positive (FP) areas.
The frequency distribution section 1540 also includes interpretative text of the patient's score relative to other patients in a validation population group (as a percentage). In this example, the patient has an LVEDP score of −0.08, which is located to the left side of the LVEDP threshold, indicating that the patient has “Not Likely Elevated LVEDP.”
The report may be presented in the healthcare portal, e.g., to be used by a physician or healthcare provider in their diagnosis for indications of left-heart failure. The indications include, in some embodiments, a probability or a severity score for the presence of a disease, medical condition, or an indication of either.
Outlier Assessment and Rejection Detection (1538). Following the AE add-on module 1514 computing the feature value outputs (in process 1532) and prior to their application to the classifier models (in process 1534), the AE add-on module 1514 is configured in some embodiments to perform outlier analysis (shown in process 1538) of the feature value outputs. Outlier analysis evaluation process 1538 executes a machine-learned outlier detection module (ODM), in some embodiments, to identify and exclude anomalous acquired biophysical signals by identifying and excluding anomalous feature output values in reference to the feature values generated from the validation and training data. The outlier detection module assesses for outliers that present themselves within sparse clusters at isolated regions that are out of distribution from the rest of the observations. Process 1538 can reduce the risk that outlier signals are inappropriately applied to the classifier models and produce inaccurate evaluations to be viewed by the patient or healthcare provider. The accuracy of the outlier module has been verified using hold-out validation sets in which the ODM is able to identify all the labeled outliers in a test set with the acceptable outlier detection rate (ODR) generalization.
While the methods and systems have been described in connection with certain embodiments and specific examples, it is not intended that the scope be limited to the particular embodiments set forth, as the embodiments herein are intended in all respects to be illustrative rather than restrictive. The wavelet-based features discussed herein may ultimately be employed to make, or to assist a physician or other healthcare provider in making, noninvasive diagnoses or determinations of the presence or non-presence and/or severity of other diseases, medical conditions, or indication of either, such as, e.g., coronary artery disease, pulmonary hypertension and other pathologies as described herein using similar or other development approaches. In addition, the example analysis, including the wavelet-based features, can be used in the diagnosis and treatment of other cardiac-related pathologies and indicating conditions as well as neurological-related pathologies and indicating conditions, such assessment can be applied to the diagnosis and treatment (including surgical, minimally invasive, and/or pharmacologic treatment) of any pathologies or indicating conditions in which a biophysical signal is involved in any relevant system of a living body. One example in the cardiac context is the diagnosis of CAD, and other diseases, medical condition, or indicating conditions disclosed herein and its treatment by any number of therapies, alone or in combination, such as the placement of a stent in a coronary artery, the performance of an atherectomy, angioplasty, prescription of drug therapy, and/or the prescription of exercise, nutritional and other lifestyle changes, etc. Other cardiac-related pathologies or indicating conditions that may be diagnosed include, e.g., arrhythmia, congestive heart failure, valve failure, pulmonary hypertension (e.g., pulmonary arterial hypertension, pulmonary hypertension due to left heart disease, pulmonary hypertension due to lung disease, pulmonary hypertension due to chronic blood clots, and pulmonary hypertension due to other diseases such as blood or other disorders), as well as other cardiac-related pathologies, indicating conditions and/or diseases. Non-limiting examples of neurological-related diseases, pathologies or indicating conditions that may be diagnosed include, e.g., epilepsy, schizophrenia, Parkinson's Disease, Alzheimer's Disease (and all other forms of dementia), autism spectrum (including Asperger syndrome), attention deficit hyperactivity disorder, Huntington's Disease, muscular dystrophy, depression, bipolar disorder, brain/spinal cord tumors (malignant and benign), movement disorders, cognitive impairment, speech impairment, various psychoses, brain/spinal cord/nerve injury, chronic traumatic encephalopathy, cluster headaches, migraine headaches, neuropathy (in its various forms, including peripheral neuropathy), phantom limb/pain, chronic fatigue syndrome, acute and/or chronic pain (including back pain, failed back surgery syndrome, etc.), dyskinesia, anxiety disorders, indicating conditions caused by infections or foreign agents (e.g., Lyme disease, encephalitis, rabies), narcolepsy and other sleep disorders, post-traumatic stress disorder, neurological conditions/effects related to stroke, aneurysms, hemorrhagic injury, etc., tinnitus and other hearing-related diseases/indicating conditions and vision-related diseases/indicating conditions.
In addition, the clinical evaluation system described herein may be configured to analyze biophysical signals such as an electrocardiogram (ECG), electroencephalogram (EEG), gamma synchrony, respiratory function signals, pulse oximetry signals, perfusion data signals; quasi-periodic biological signals, fetal ECG signals, blood pressure signals; cardiac magnetic field signals, heart rate signals, among others.
Further examples of processing that may be used with the exemplified method and system disclosed herein are described in: U.S. Pat. Nos. 9,289,150; 9,655,536; 9,968,275; 8,923,958; 9,408,543; 9,955,883; 9,737,229; 10,039,468; 9,597,021; 9,968,265; 9,910,964; 10,672,518; 10,566,091; 10,566,092; 10,542,897; 10,362,950; 10,292,596; 10,806,349; U.S. Patent Publication nos. 2020/0335217; 2020/0229724; 2019/0214137; 2018/0249960; 2019/0200893; 2019/0384757; 2020/0211713; 2019/0365265; 2020/0205739; 2020/0205745; 2019/0026430; 2019/0026431; PCT Publication nos. WO2017/033164; WO2017/221221; WO2019/130272; WO2018/158749; WO2019/077414; WO2019/130273; WO2019/244043; WO2020/136569; WO2019/234587; WO2020/136570; WO2020/136571; U.S. patent application Ser. Nos. 16/831,264; 16/831,380; 17/132,869; PCT Application nos. PCT/IB2020/052889; PCT/IB2020/052890, each of which is hereby incorporated by reference herein in its entirety.
This US application claims priority to, and the benefit of, U.S. Provisional Patent Application No. 63/235,968, filed Aug. 23, 2021, entitled “Methods and Systems for Engineering Wavelet-Based Features From Biophysical Signals for Use in Characterizing Physiological Systems,” which is incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
63235968 | Aug 2021 | US |