The present disclosure generally relates to disease screening mechanisms; and includes a system and associated method for rapidly screening for signs and symptoms of disorders including COVID-19.
The year 2020 ushered in the Coronavirus disease 2019 (COVID-19) pandemic, the likes of which the modern world had not recently experienced. Scientists around the globe have been working on developing various forms of a vaccine and treatment. In the meantime, there is an urgent need for technology and decision systems to quickly detect whether an individual is exhibiting signs or symptoms similar to those of COVID-19 infection in order to prevent the further spread of this disease and changes to signs and symptoms in response to a drug or other intervention.
It is with these observations in mind, among others, that various aspects of the present disclosure were conceived and developed.
In some examples, the present disclosure takes the form of a non-invasive method predicting a disorder diagnosis that does not require presence of a clinician during operation, comprising the steps of: accessing by a processor of a plurality of processing elements screening data generated from a sensor system positioned along an individual of a plurality of individuals for screening the individual for a disorder as the individual performs a predetermined sequence of activities, the predetermined sequence of activities including at least one exertion configured for predicting a presence of the disorder; conducting signal processing by the processor from raw sensor information of the screening data to derive a plurality of signals from each activity of the predetermined sequence of activities, the plurality of signals from an activity collectively predictive for detecting the presence of the disorder; and computing by the processor an output defining a probability measure of risk of a positive diagnosis of the disorder attributable to the individual by applying the plurality of signals defined by the screening data to a machine learning model, parameters of the machine learning model configured, based on the plurality of signals, to maximize a probability of detecting the disorder.
The method may further include the step of configuring the machine learning model, by: accessing by at least one of the plurality of processing elements one or more training datasets, each of the one or more training datasets generated from an implementation of the sensor system positioned along a sample individual of the plurality of individuals as the sample individual performs the predetermined sequence of activities; and conducting signal processing by the processor for each of the one or more training datasets to derive a plurality of sample signals from one or more activities of the predetermined sequence of activities, wherein the machine learning model is trained and configured based on the plurality of sample signals.
The method may further include the step of training the machine learning model by conducting feature extraction by the processor to extract feature values for each of the plurality of sample signals that quantify statistical properties for one or more activities of the predetermined sequence of activities.
In some examples, the present disclosure takes the form of a system (or device) including at least a sensor system positioned along an individual of a plurality of individuals, and a processor in operable communication with the sensor system. By the present example, the processor of the system accesses screening data from the sensor system for screening the individual for a disorder as the individual performs a predetermined sequence of activities, the predetermined sequence of activities including at least one exertion configured for predicting a presence of the disorder. In addition, the processor conducts signal processing from raw sensor information of the screening data to derive a plurality of signals from each activity of the predetermined sequence of activities, the plurality of signals from an activity collectively predictive for detecting the presence of the disorder. The processor further computes an output defining a probability measure of risk of a positive diagnosis of the disorder attributable to the individual by applying the plurality of signals defined by the screening data to a machine learning model. The parameters of the machine learning model are configured, based on the plurality of signals, to maximize a probability of detecting the disorder.
In some examples, the sensor system of the subject system example includes a first sensor positioned along a chest of the individual to monitor movement and gait patterns, respiratory dynamics, and heart dynamics of the individual, and a second sensor positioned along a finger of the individual including a PPG sensing device. The first sensor measures acceleration, ECG, and a first temperature, and the second sensor measures blood-oxygen and a second temperature. The sensor system may include a motion sensor defining an accelerometer and a photopletysmography (PPG) sensor, such that the plurality of signals includes mechano-acoustic signals recorded by the accelerometer and blood oxygen levels recorded by the PPG sensor. The disorder being screened may be a COVID-19 infection. In this example, the plurality of signals includes a heart signal, and a change in heart signal between activities in the predetermined sequence of activities is extracted by the processor as a feature for the machine learning model, and the plurality of signals further includes an acceleration signal indicative of a respiration rate of the individual.
In some examples, the present disclosure takes the form of tangible, non-transitory, computer-readable media having instructions encoded thereon, such that a processor executing the instructions is operable to access screening data from a sensor system for screening an individual for a disorder as the individual performs a predetermined sequence of activities, the predetermined sequence of activities including at least one exertion configured for predicting a presence of the disorder. In addition, the processor executing the instructions is operable to conduct signal processing from raw sensor information of the screening data to derive a plurality of signals from each activity of the predetermined sequence of activities, the plurality of signals from an activity collectively predictive for detecting the presence of the disorder. The processor executing the instructions is further operable to compute an output defining a probability measure of risk of a positive diagnosis of the disorder attributable to the individual by applying the plurality of signals defined by the screening data to a machine learning model. The parameters of the machine learning model are configured, based on the plurality of signals, to maximize a probability of detecting the disorder.
In the subject example, the machine learning model may be preconfigured or trained in the manner described herein, such that the model as applied is suitable for predicting a diagnosis of a disorder.
Corresponding reference characters indicate corresponding elements among the view of the drawings. The headings used in the figures do not limit the scope of the claims.
Aspects of the present disclosure relate to examples of a system and variations of a customized method to screen individuals for suspect respiratory, cough, oxygen saturation, cardiac, and movement symptoms indicative of various disorders and in some examples to predict a diagnosis of a disorder. The examples provided in the present application are directed toward detection of symptoms related to COVID-19 infection, but those skilled in the art will recognize the same method/s could be modified to specifically detect symptoms of other disorders including but not limited to infections by other viral pathogens, bacterial pathogens, asthma, and COPD. In contrast to recent attempts of detecting COVID-19 based on continuous monitoring through wearable sensors in which the aim is to uncover general trends in the signals, the novel approach of the present disclosure uses a “snapshot” of sensor data generated during a predetermined sequence of activities to detect physiological and movement changes associated with the disease, among other features described herein.
The method may be used as a diagnostic biomarker, and is based on data including one or more of physiological, motion and mechano-acoustic (a.k.a. sensor modalities) datasets recorded with one or more wireless wearable sensors applied on the user, while the user performs a scripted sequence of activities. The sensors can be placed on the user's chest, throat, wrist or finger, and may be connected wirelessly to an application running on a mobile device (phone or tablet), which is used to manage the devices and transfer the recorded data. The mobile application guides the individual through the scripted sequence of activities and records the sensor data associated to each performed activity. An artificial intelligence and/or machine learning (AI/ML) model is trained with the data derived from the sensors to detect the physiological changes associated with these activities, and outputs a probability measure of risk of infection compatible with COVID-19 symptoms. One feature of the method is that the scripted procedure can be completed in a short period of time by the user, and does not require the presence of trained personnel to be executed, thus adhering to the social distancing guidelines recommended during the pandemic. The ML algorithm may run on the user's mobile device; and anonymized data may also be uploaded to a HIPAA compliant cloud server where the data can further be processed and/or can be stored for further analysis.
The system 100 may further include at least one data source device 108 shown by non-limiting example as data source device 108A and data source device 108B for optionally providing additional inputs to the model 101. In some examples, aspects of the system 100 and methods described herein may be provided to or intergrated with a client device 110 (e.g., mobile device) and/or a display device 112 in operable communication with the processor 102 for providing feedback. Further, examples of the system 100 may include one or more devices of a cloud 114 which may define cloud computing, storage, processing, infrastructure, and the like. The processor 102 may be a processing element of the cloud 114, and/or leverage processing of computing elements of the cloud 114 in various examples described herein. The components of the system 100 are shown as an example, are non-limiting with respect to form, type, and number and the system 100 may include other related components and variations. While the present inventive concept is described primarily as an implementation of the system, it should be appreciated that the inventive concept may also take the form of a device, and/or a tangible, non-transitory, computer-readable media having instructions encoded thereon and executable by a processor, and any number of methods related to embodiments of the system described herein.
As further shown in
Referring to
In some specific examples, the sensor data 106 is generated by one or more of the sensor systems 104 in view of at least one exertion activity of the predetermined sequence of activities performed by the individual 152. In some embodiments, data collection involves high resolution (varying sampling rates between 100 Hz to 2 kHz) sensor modalities capable of measuring body motions and physiological parameters, including EKG and temperature, as well as mechano-acoustic signals, such as cough, breathing, and heart sounds; which may be implemented along or proximate to the user (sensor system 104 may include wearable sensors, as shown in
After placing the sensors on the body, the user performs the predetermined sequence of activities 154 (also referred to herein as the protocol). Various embodiments or variations of the predetermined sequence of activities 154 can be administered depending on the need of expert clinicians. As one example, the predetermined sequence of activities includes the following activities (sensor data 106 generated during each activity): 30 seconds of breathing normally (e.g., at rest), walking for 30 seconds, 30 seconds of resting, five deep breaths, 30 seconds of walking, five deep breaths, and five forced coughs.
In some examples, as indicated in
In some examples, the data (the raw signal from the accelerometer and other sensor modalities) recorded by the sensor system 104 during the execution of each activity of the predetermined sequence of activities 154 is then uploaded wirelessly to a paired mobile device (156 in
Referring to
By non-limiting example with respect to configuration of the model 101 to diagnose COVID-19, the present disclosure considers that COVID-19 leads to changes in normal heart rate dynamics such that heart rate dynamics and changes thereof are considered to be ideal physiological signals for configuring the model 101 to diagnose COVID-19. Therefore feature values (410 in
In a similar fashion, respiration rate and lung function is known to be affected by COVID, such that respiration rate and lung functions signals of the sensor data 106 are considered to be ideal physiological signals for configuring the model 101 to diagnose COVID-19. Therefore, features that capture the respiration signal are extracted from the acceleration signal during different activities using digital filtering techniques to remove noise and motion artifacts, and isolate the breathing or cough signal. In one example, a combination of features in time and frequency domain is used to capture cough, respiration rate and their dynamics. These features include, but are not limited to, the ones mentioned above, and the following: crest factor, spectral skewness, spectral centroid, spectral spread and spectral entropy. Signal differences and quotients relative to their pre-activity baseline are also included.
In addition, gait patterns and movement features relating to physical effort may be indicative of COVID and correlated with the above. In one embodiment for the gait activity, the raw accelerometer data is converted into the frequency domain. As above, statistical features including, but not limited to, mean, standard deviation, range, IQR, variance, kurtosis, skewness, root mean square or sample entropy are extracted and used as features.
In some examples, additional data may optionally be leveraged to further tailor or configure the model 101. For example, demographic features such as age, gender, height, weight, BMI, smoking status and medical history (presence of specific symptoms and onset, medications, existing conditions/co-morbidities, vaccination status) may also be used as inputs to the model 101.
All feature values 410 are aggregated across signals and activities of the predetermined sequence of activities 154, and used as the input to a ML/AI model 101, which has been trained to predict either the presence/absence of respiratory infection, or the risk of having contracted the infection. In some examples, the model 101 combines aspects of the sensor data 106 with available medical history to act as an expert decision system to provide a risk prediction for the existence or absence of COVID or other cardio-respiratory illness. The model 101 may be developed on the basis of a large pool of data collected from individuals with either 1) no known current illness (healthy) or 2) acute-onset symptoms that could be attributed to COVID-19 (COVID-potential). One of many potential machine learning algorithms, such as logistic regression, support vector machines, decision trees, ensemble methods, or neural networks (deep learning), may be used as the base predictor function of the model 101 or to model generally the statistical associations between the extracted signal features and participant type (healthy vs. COVID-potential). A statistical model may be developed based on data from individuals with known illness status, with the objective of maximizing accurate assessment of illness status (yi) based only on a series of input features taken from wearable signals and medical history (xi) (
Once the model is fit (trained) to the data using a known dataset in this way, the model may be used to generate estimates of risk of active illness, given the same set of extracted features and medical history, from an individual of unknown illness risk. The trained model can be deployed to provide a user with a probability measure of the following equation format:
ŷ
i
=P(xi=Covid|θ),
where yi defines an infection risk for COVID-19.
Referring to
Referring to blocks 504 and 506, once the training datasets are acquired by some processing element, the processing element conducts signal processing from the raw sensor information of the training data for each of the one or more training datasets to derive a plurality of sample signals from each of the one or more activities of the predetermined sequence of activities. The plurality of signals are then used to train or otherwise configure the model 101. More specifically, in some examples, a processing element of the plurality of processing elements conducts feature extraction to extract feature values for each of the plurality of sample signals that quantify statistical properties for one or more activities of the predetermined sequence of activities. Alternatively, the sample signals may simply be fed to the model 101 directly without feature extraction (e.g., where model 101 includes a neural network or deep learning). In some examples, the processing element aggregates multiple features across a portion of the plurality of sample signals for a portion of the predetermined sequence of activities, and applies the feature values as inputs to the model 101. Feature values may relate to averages, standard deviations, ranges, minimums, maximums, root-mean squared, quantiles, moments, entropy metrics, skewness, kurtosis, and linear and non-linear metrics. Feature values may further relate to frequency domain features including power spectral density features, peak frequency, power skewness, kurtosis, entropy, center, and spread.
As indicated in block 508, the model 101 may optionally be further configured or trained using additional information including medical history specific to the individual being sampled or like individuals, sensor device parameters or characteristics, and disorder-specific information. Variations to the model 101 may be effectuated to, e.g., account for different environments, types of individuals, and the like.
Referring to block 510, the model 101 may then be tested, tuned and validated over time to optimize the ability of the model 101 to accurately predict a presence of the disorder in other individuals. For example, the model 101 may be trained using additional training data in a similar fashion until the model 101 accurately predicts the presence of the disorder in subsequent sensor data according to a predefined accuracy threshold. In either case, the model 101 as trained and optimized is then suitable for deployment for individuals that need to be screened. In some examples, the model 101 as trained outputs a probability measure P(x=Disorder( )) of infection or affliction risk of the disorder. In some examples, the parameters of the model 101 as trained maximize the probability of detecting the disorder. The trained model 101 in the present example can be deployed to provide a user with a probability measure of the following equation format:
ŷ
i
P(xi=DISORDER|θ)
where yi defines an infection risk for the disorder.
Referring to block 552 of process 550, the model 101 as trained is then applicable to individuals that need to be screened or monitored for the disorder. As indicated in block 552, some processor or processing element, such as the same processing element used to train the model 101, a different processing element, or a combination of the same, accesses screening data generated by an implementation of the sensor system 104 as an individual that needs to be screened performs the same predetermined sequence of activities 154 utilized during training. As with the training, the predetermined sequence of activities includes at least one exertion by the individual; i.e., some form of movement, motion, or change in state of the individual that induces a physiological change. By non-limiting examples, an exertion includes jumping, walking, coughing, breathing, and the like. In some embodiments the processing element applying the model 101 is a mobile device or client device.
Referring to block 554, block 556, and block 558, the processor applying the model 101 conducts signal processing from raw sensor data generated from the sensor system 104 to derive a plurality of signals from each of the predetermined sequence of activities. The plurality of signals collectively are predictive for detecting the presence of the disorder. In some examples, the processor further conducts feature extraction to extract feature values for each of the plurality of signals that quantify statistical properties for one or more activities of the predetermined sequence of activities. Optionally, the model 101 may be tuned or configured for the individual being screened prior to application by supplementing the model with additional information such as medical history specific to the individual being sampled or like individuals, sensor device parameters or characteristics, and disorder-specific information.
Referring to block 560, aspects of the plurality of signals derived from the sensor data generated are applied to the model 101 to compute an output. In some examples, the output includes a probability measure of a risk of a positive diagnosis of the disorder attributable to the individual. Where features are extracted from the plurality of signals, values associated with the features as extracted are fed to the model 101 to compute the output (
Referring to
Using leave-one-subject-out cross validation, an algorithm associated with a machine learning classifier (Random Forest Classifier) was selected as a basis for the model 101 to differentiate between participants who had COVID+ and COVID− PCR tests. The confusion matrix is shown in
In some examples, the signal processing conducted during the training and configuration process 500 and the application process 550 is conducted post-processing; meaning after the recordation of the sensor data 106. However, other examples and variations encompass real-time signal processing of at least some of the signals at each stage.
Referring to
The computing device 1200 may include various hardware components, such as a processor 1202, a main memory 1204 (e.g., a system memory), and a system bus 1201 that couples various components of the computing device 1200 to the processor 1202. The system bus 1201 may be any of several types of bus structures including a memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus architectures. For example, such architectures may include Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus also known as Mezzanine bus.
The computing device 1200 may further include a variety of memory devices and computer-readable media 1207 that includes removable/non-removable media and volatile/nonvolatile media and/or tangible media, but excludes transitory propagated signals. Computer-readable media 1207 may also include computer storage media and communication media. Computer storage media includes removable/non-removable media and volatile/nonvolatile media implemented in any method or technology for storage of information, such as computer-readable instructions, data structures, program modules or other data, such as RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium that may be used to store the desired information/data and which may be accessed by the computing device 1200. Communication media includes computer-readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism and includes any information delivery media. The term “modulated data signal” means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. For example, communication media may include wired media such as a wired network or direct-wired connection and wireless media such as acoustic, RF, infrared, and/or other wireless media, or some combination thereof. Computer-readable media may be embodied as a computer program product, such as software stored on computer storage media.
The main memory 1204 includes computer storage media in the form of volatile/nonvolatile memory such as read only memory (ROM) and random access memory (RAM). A basic input/output system (BIOS), containing the basic routines that help to transfer information between elements within the computing device 1200 (e.g., during start-up) is typically stored in ROM. RAM typically contains data and/or program modules that are immediately accessible to and/or presently being operated on by processor 1202. Further, data storage 1206 in the form of Read-Only Memory (ROM) or otherwise may store an operating system, application programs, and other program modules and program data.
The data storage 1206 may also include other removable/non-removable, volatile/nonvolatile computer storage media. For example, the data storage 1206 may be: a hard disk drive that reads from or writes to non-removable, nonvolatile magnetic media; a magnetic disk drive that reads from or writes to a removable, nonvolatile magnetic disk; a solid state drive; and/or an optical disk drive that reads from or writes to a removable, nonvolatile optical disk such as a CD-ROM or other optical media. Other removable/non-removable, volatile/nonvolatile computer storage media may include magnetic tape cassettes, flash memory cards, digital versatile disks, digital video tape, solid state RAM, solid state ROM, and the like. The drives and their associated computer storage media provide storage of computer-readable instructions, data structures, program modules, and other data for the computing device 1200.
A user may enter commands and information through a user interface 1240 (displayed via a monitor 1260) by engaging input devices 1245 such as a tablet, electronic digitizer, a microphone, keyboard, and/or pointing device, commonly referred to as mouse, trackball or touch pad. Other input devices 1245 may include a joystick, game pad, satellite dish, scanner, or the like. Additionally, voice inputs, gesture inputs (e.g., via hands or fingers), or other natural user input methods may also be used with the appropriate input devices, such as a microphone, camera, tablet, touch pad, glove, or other sensor. These and other input devices 1245 are in operative connection to the processor 1202 and may be coupled to the system bus 1201, but may be connected by other interface and bus structures, such as a parallel port, game port or a universal serial bus (USB). The monitor 1260 or other type of display device may also be connected to the system bus 1201. The monitor 1260 may also be integrated with a touch-screen panel or the like.
The computing device 1200 may be implemented in a networked or cloud-computing environment using logical connections of a network interface 1203 to one or more remote devices, such as a remote computer. The remote computer may be a personal computer, a server, a router, a network PC, a peer device or other common network node, and typically includes many or all of the elements described above relative to the computing device 1200. The logical connection may include one or more local area networks (LAN) and one or more wide area networks (WAN), but may also include other networks. Such networking environments are commonplace in offices, enterprise-wide computer networks, intranets and the Internet.
When used in a networked or cloud-computing environment, the computing device 1200 may be connected to a public and/or private network through the network interface 1203. In such embodiments, a modem or other means for establishing communications over the network is connected to the system bus 1201 via the network interface 1203 or other appropriate mechanism. A wireless networking component including an interface and antenna may be coupled through a suitable device such as an access point or peer computer to a network. In a networked environment, program modules depicted relative to the computing device 1200, or portions thereof, may be stored in the remote memory storage device.
Certain embodiments may be described herein as including one or more modules. Such modules are hardware-implemented, and thus include at least one tangible unit capable of performing certain operations and may be configured or arranged in a certain manner. For example, a hardware-implemented module may comprise dedicated circuitry that is permanently configured (e.g., as a special-purpose processor, such as a field-programmable gate array (FPGA) or an application-specific integrated circuit (ASIC)) to perform certain operations. A hardware-implemented module may also comprise programmable circuitry (e.g., as encompassed within a general-purpose processor or other programmable processor) that is temporarily configured by software or firmware to perform certain operations. In some example embodiments, one or more computer systems (e.g., a standalone system, a client and/or server computer system, or a peer-to-peer computer system) or one or more processors may be configured by software (e.g., an application or application portion) as a hardware-implemented module that operates to perform certain operations as described herein.
Accordingly, the term “hardware-implemented module” encompasses a tangible entity, be that an entity that is physically constructed, permanently configured (e.g., hardwired), or temporarily configured (e.g., programmed) to operate in a certain manner and/or to perform certain operations described herein. Considering embodiments in which hardware-implemented modules are temporarily configured (e.g., programmed), each of the hardware-implemented modules need not be configured or instantiated at any one instance in time. For example, where the hardware-implemented modules comprise a general-purpose processor configured using software, the general-purpose processor may be configured as respective different hardware-implemented modules at different times. Software may accordingly configure the processor 1202, for example, to constitute a particular hardware-implemented module at one instance of time and to constitute a different hardware-implemented module at a different instance of time.
Hardware-implemented modules may provide information to, and/or receive information from, other hardware-implemented modules. Accordingly, the described hardware-implemented modules may be regarded as being communicatively coupled. Where multiple of such hardware-implemented modules exist contemporaneously, communications may be achieved through signal transmission (e.g., over appropriate circuits and buses) that connect the hardware-implemented modules. In embodiments in which multiple hardware-implemented modules are configured or instantiated at different times, communications between such hardware-implemented modules may be achieved, for example, through the storage and retrieval of information in memory structures to which the multiple hardware-implemented modules have access. For example, one hardware-implemented module may perform an operation, and may store the output of that operation in a memory device to which it is communicatively coupled. A further hardware-implemented module may then, at a later time, access the memory device to retrieve and process the stored output. Hardware-implemented modules may also initiate communications with input or output devices.
Computing systems or devices referenced herein may include desktop computers, laptops, tablets e-readers, personal digital assistants, smartphones, gaming devices, servers, and the like. The computing devices may access computer-readable media that include computer-readable storage media and data transmission media. In some embodiments, the computer-readable storage media are tangible storage devices that do not include a transitory propagating signal. Examples include memory such as primary memory, cache memory, and secondary memory (e.g., DVD) and other storage devices. The computer-readable storage media may have instructions recorded on them or may be encoded with computer-executable instructions or logic that implements aspects of the functionality described herein. The data transmission media may be used for transmitting data via transitory, propagating signals or carrier waves (e.g., electromagnetism) via a wired or wireless connection.
It should be understood from the foregoing that, while particular embodiments have been illustrated and described, various modifications can be made thereto without departing from the spirit and scope of the invention as will be apparent to those skilled in the art. Such changes and modifications are within the scope and teachings of this invention as defined in the claims appended hereto.
The present PCT patent application claims the benefit of provisional patent application No. 63/054,866 filed on Jul. 22, 2020, which is hereby incorporated by reference to its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/042846 | 7/22/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63054866 | Jul 2020 | US |