The disclosure relates to blood pressure (BP) monitoring, and more particularly to end-to-end cuffless BP monitoring using multimodal physiological signals.
Hypertension is a significant health concern and is a major contributor to cardiovascular diseases. As a result, blood pressure readings, including systolic blood pressure (SBP) values and diastolic blood pressure (DBP) values, are important cardiovascular health indicators.
Many approaches to blood pressure (BP) monitoring, such as oscillometric techniques and auscultatory techniques, rely on cuff-based devices to measure BP values. Although reliable, they are not suited for continuous monitoring and are not convenient to set up and use. These limitations have fueled research into more convenient, noninvasive methods, such as cuffless BP monitoring techniques.
Many cuffless BP monitoring techniques leverage electrocardiogram (ECG) and photoplethysmogram (PPG) signals, but are constrained by the need for handcrafted feature extraction. This may limit the generalizability of such models to diverse populations.
One or more embodiments of the disclosure are directed to end-to-end cuffless BP monitoring using multimodal physiological signals.
In accordance with an aspect of the disclosure, a method for performing cuffless blood pressure (BP) measurement includes: obtaining a first physiological signal and a second physiological signal associated with a user; providing the first physiological signal as an input to a first transformer model; providing the second physiological signal as an input to a second transformer model; providing an output of the first transformer model and an output of the second transformer model as inputs to a third transformer model; providing an output of the third transformer model to at least one BP estimation model; and generating an estimated BP value corresponding to the first physiological signal and the second physiological signal based on an output of the at least one BP estimation model.
The first physiological signal may include an electrocardiogram (ECG) signal associated with the user, and the second physiological signal may include a photoplethysmogram (PPG) signal associated with the user.
The first transformer model, the second transformer model, the third transformer model, and the at least one BP estimation model may be trained using a pre-training process and a user-specific training process, the pre-training process may be performed using a first training dataset corresponding to a plurality of users, and the user-specific training process may be performed based on a second training dataset corresponding to the user.
At least one of the pre-training process and the user-specific training process may be performed using a weighted contrastive loss function including a similarity metric.
The similarity metric may indicate a similarity between a first ground truth BP value corresponding to a first training sample and a second ground truth BP value corresponding to a second training sample.
The similarity metric may be used to cluster training samples included in at least one of the first training dataset and the second training dataset in an embedding space.
The at least one BP estimation model may include a systolic BP (SBP) estimation model and a diastolic BP (DBP) estimation model.
The method may further include providing the output of the third transformer model to the SBP estimation model; generating an estimated SBP value based on an output of the SBP estimation model; providing the output of the third transformer model to the DBP estimation model; and generating an estimated DBP value based on an output of the DBP estimation model, and the estimated BP value may include the estimated SBP value and the estimated DBP value.
In accordance with an aspect of the disclosure, an electronic device for performing cuffless blood pressure (BP) measurement, including: a first sensor configured to obtain a first physiological signal from a user; a second sensor configured to obtain a second physiological signal from the user; and at least one processor configured to: provide the first physiological signal as an input to a first transformer model; provide the second physiological signal as an input to a second transformer model; provide an output of the first transformer model and an output of the second transformer model as inputs to a third transformer model; provide an output of the third transformer model to at least one BP estimation model; and generate an estimated BP value corresponding to the first physiological signal and the second physiological signal based on an output of the at least one BP estimation model.
The first physiological signal may include an electrocardiogram (ECG) signal associated with the user, and the second physiological signal may include a photoplethysmogram (PPG) signal associated with the user.
The first transformer model, the second transformer model, the third transformer model, and the at least one BP estimation model may be trained using a pre-training process and a user-specific training process, the pre-training process may be performed using a first training dataset corresponding to a plurality of users, and the user-specific training process may be performed based on a second training dataset corresponding to the user.
At least one of the pre-training process and the user-specific training process may be performed using a weighted contrastive loss function including a similarity metric.
The similarity metric may indicate a similarity between a first ground truth BP value corresponding to a first training sample and a second ground truth BP value corresponding to a second training sample.
The similarity metric may be used to cluster training samples included in at least one of the first training dataset and the second training dataset in an embedding space.
The at least one BP estimation model may include a systolic BP (SBP) estimation model and a diastolic BP (DBP) estimation model.
The at least one processor may be further configured to: provide the output of the third transformer model to the SBP estimation model; generate an estimated SBP value based on an output of the SBP estimation model; provide the output of the third transformer model to the DBP estimation model; and generate an estimated DBP value based on an output of the DBP estimation model, and the estimated BP value may include the estimated SBP value and the estimated DBP value.
In accordance with an aspect of the disclosure, a non-transitory computer-readable medium stores instructions which, when executed by at least one processor of a device for performing cuffless blood pressure (BP) measurement, cause the device to: obtain a first physiological signal and a second physiological signal associated with a user; provide the first physiological signal as an input to a first transformer model; provide the second physiological signal as an input to a second transformer model; provide an output of the first transformer model and an output of the second transformer model as inputs to a third transformer model; provide an output of the third transformer model to at least one BP estimation model; and generate an estimated BP value corresponding to the first physiological signal and the second physiological signal based on an output of the at least one BP estimation model.
The first physiological signal may include an electrocardiogram (ECG) signal associated with the user, and the second physiological signal may include a photoplethysmogram (PPG) signal associated with the user.
The first transformer model, the second transformer model, the third transformer model, and the at least one BP estimation model may be trained using a pre-training process and a user-specific training process, the pre-training process may be performed using a first training dataset corresponding to a plurality of users, and the user-specific training process may be performed based on a second training dataset corresponding to the user.
At least one of the pre-training process and the user-specific training process may be performed using a weighted contrastive loss function including a similarity metric.
The above and other aspects, features, and advantages of certain embodiments of the disclosure will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:
The terms as used in the disclosure are provided to merely describe specific embodiments, not intended to limit the scope of other embodiments. Singular forms include plural referents unless the context clearly dictates otherwise. The terms and words as used herein, including technical or scientific terms, may have the same meanings as generally understood by those skilled in the art. The terms as generally defined in dictionaries may be interpreted as having the same or similar meanings as or to contextual meanings of the relevant art. Unless otherwise defined, the terms should not be interpreted as ideally or excessively formal meanings. Even though a term is defined in the disclosure, the term should not be interpreted as excluding embodiments of the disclosure under circumstances.
The electronic device according to one or more embodiments may be one of various types of electronic devices. In some embodiments of the disclosure, the electronic devices may include a portable communication device (e.g., a smartphone), a computer device, a portable multimedia device, a portable medical device, a camera, or a wearable device. The electronic devices are not limited to those described above, in accordance with some other embodiments of the disclosure.
The disclosure and the terms used therein are not intended to limit the technological features set forth herein to particular embodiments and include various changes, equivalents, or replacements for a corresponding embodiment. With regard to the description of the drawings, similar reference numerals may be used to refer to similar or related elements. It is to be understood that a singular form of a noun corresponding to an item may include one or more of the things, unless the relevant context clearly indicates otherwise. As used herein, each of such phrases as “A or B”, “at least one of A and B”, “at least one of A or B”, “A, B, or C”, “at least one of A, B, and C”, and “at least one of A, B, or C”, may include any one of, or all possible combinations of the items enumerated together in a corresponding one of the phrases. As used herein, such terms as “1st” and “2nd”, or “first” and “second” may be used to simply distinguish a corresponding component from another, and does not limit the components in other aspects (e.g., importance or order). It is to be understood that if an element (e.g., a first element) is referred to, with or without the term “operatively” or “communicatively”, as “coupled with”, “coupled to”, “connected with”, or “connected to” another element (e.g., a second element), it means that the element may be coupled with the other element directly (e.g., wiredly), wirelessly, or via a third element.
As discussed above, hypertension is a significant health concern and a major contributor to cardiovascular diseases. Accordingly, blood pressure (BP) readings, including systolic BP (SBP) values and diastolic BP (DBP) values, are important cardiovascular health indicators. Monitoring these can be vital in diagnosis and management of high blood pressure. However, many approaches to BP monitoring rely on cuff-based devices such as a sphygmomanometer, which works by restricting the blood flow on the arm and gradually releasing the pressure while monitoring the flow. These cuff-based devices may be used to accurately measure BP across a diverse population, but they may not be convenient for continuous monitoring, and therefore may not be widely adapted and used for continuous monitoring.
In order to perform cuffless BP monitoring, some approaches use statistical models based on other physiological signals such as electrocardiogram (ECG) and photoplethysmogram (PPG). These physiological signals may be easily measured by personal electronic devices (e.g., the one or more of the first electronic device 101, the earbuds 101A, the watch 101B, and the ring 101C discussed below with reference to
Other approaches to cuffless BP monitoring use personalized deep leaning models, such as end-to-end deep neural network (DNN)-based BP prediction models. Unlike the statistical model approaches discussed above, it may not be necessary to choose what features to extract, because personalized DNN-based model may learn the correct features to extract in a data driven manner during training. However, unlike the statistical model approaches, this approach may use a large amount of training data. For example, some personalized models may require as much as twenty-seven hours of data from a subject in order to ensure accuracy with respect to that subject. One way to overcome this is to use a diverse training dataset to create a generalized DNN-based BP prediction model. This may reduce or eliminate the amount of subject-specific training data required. However, these generalized models often suffer from poor prediction accuracy because physiological measurements related to BP monitoring are very diverse, so collecting a training dataset that is diverse enough to ensure accuracy may be difficult.
Still further approaches to cuffless BP monitoring use transfer learning. An initial model may be pre-trained on a large and diverse dataset, and then may be fine-tuned using subject-specific data. This approach may greatly reduce the amount of subject-specific data needed to build highly accurate BP prediction models. Some approaches to transfer learning rely on prediction error (e.g., mean absolute error (MAE) and/or mean squared error (MSE)) as the metric for pre-training.
Accordingly, embodiments may relate to a novel end-to-end deep learning approach to cuffless blood pressure (BP) monitoring that employs a transformer architecture to analyze raw multimodal physiological signals, for example electrocardiogram (ECG) and photoplethysmogram (PPG) signals. The strong sequence modeling capabilities provided by transformers may make them useful for handling the intricacies of physiological time-series data. Embodiments may provide convenient and accurate BP monitoring without the need for manual feature engineering, thus enhancing model flexibility and adaptability.
Embodiments may relate to performing pre-training based on a relatively large and diverse dataset, followed by fine-tuning based on a smaller subject-specific dataset. For example, in some embodiments, the pre-training may be performed using a dataset corresponding to approximately one hundred subjects, and the fine-tuning may be performed using a dataset corresponding to up to ten subjects, but embodiments are not limited thereto. According to embodiments, the pre-training may be performed using one or more of a weighted contrastive learning (WCL) loss and a mean absolute error (MAE) loss. This may allow for fine-tuning using individual data to improve personalization while retaining high accuracy.
Accordingly, embodiments may apply end-to-end transformer architectures for personalized, cuffless BP monitoring using multimodal physiological signals. By leveraging end-to-end learning and transfer learning, embodiments may provide a scalable and personalized approach to cuffless BP monitoring with significant implications for personalized healthcare.
Embodiments of the disclosure may be implemented in or by the following example electronic devices shown in
In
In one embodiment, the electronic device 101 may include a processor 120, memory 130, an input device 150, a sound output circuit 155, a display 160, an audio circuit 170, a sensor 176, an interface 177, a haptic circuit 179, a camera 180, a power management circuit 188, a battery 189, a communication circuit 190, a subscriber identification module (SIM) 196, or an antenna 197.
In some embodiments, at least one (e.g., the display 160 or the camera 180) of the components may be omitted from the electronic device 101, or one or more other components may be added in the electronic device 101. In some embodiments, some of the components may be implemented as single integrated circuitry. For example, the sensor 176 (e.g., a fingerprint sensor, an iris sensor, or an illuminance sensor) may be implemented as embedded in the display 160 (e.g., a display).
The processor 120 may execute, for example, software (e.g., a program 140) to control at least one other component (e.g., a hardware or software component) of the electronic device 101 coupled with the processor 120, and may perform various data processing or computation. In one embodiment, as at least part of the data processing or computation, the processor 120 may load a command or data received from another component (e.g., the sensor 176 or the communication circuit 190) in volatile memory 132, process the command or the data stored in the volatile memory 132, and store resulting data in non-volatile memory 134. In one embodiment, the processor 120 may include a main processor 121 (e.g., a central processing unit (CPU) or an application processor (AP)), and an auxiliary processor 123 (e.g., a graphics processing unit (GPU), an image signal processor (ISP), a sensor hub processor, or a communication processor (CP)) that is operable independently from, or in conjunction with, the main processor 121. Additionally or alternatively, the auxiliary processor 123 may be adapted to consume less power than the main processor 121, or to be specific to a specified function. The processor 120 may refer to or correspond to one or more processors. For example, the electronic device 101 may include two or more processors like the processor 120.
The auxiliary processor 123 may be implemented as separate from, or as part of the main processor 121. The auxiliary processor 123 may control at least some of functions or states related to at least one component (e.g., the display 160, the sensor 176, or the communication circuit 190) among the components of the electronic device 101, instead of the main processor 121 while the main processor 121 is in an inactive (e.g., sleep) state, or together with the main processor 121 while the main processor 121 is in an active state (e.g., executing an application). In one embodiment, the auxiliary processor 123 (e.g., an image signal processor or a communication processor) may be implemented as part of another component (e.g., the camera 180 or the communication circuit 190) functionally related to the auxiliary processor 123.
The memory 130 may store various data used by at least one component (e.g., the processor 120 or the sensor 176) of the electronic device 101. The various data may include, for example, software (e.g., the program 140) and input data or output data for a command related thereto. The memory 130 may include the volatile memory 132 or the non-volatile memory 134. The program 140 may be stored in the memory 130 as software, and may include, for example, an operating system (OS) 142, middleware 144, or an application 146.
One or more embodiments of the disclosure may be implemented as software (e.g., the application 146, the middleware 144, the operating system) including one or more instructions that are stored in the memory 130 (a storage medium) that is readable by the electronic device 101. For example, the processor 120 of the electronic device 101 may invoke at least one of the one or more instructions stored in the memory 130, and execute the at least one of the one or more instructions, with or without using one or more other components under the control of the processor 120. This allows the electronic device 101 to be operated to perform at least one function according to the at least one instruction invoked. The one or more instructions may include a code generated by a compiler or a code executable by an interpreter. The memory 130, which may be a machine-readable storage medium, may be provided in the form of a non-transitory storage medium. Wherein, the term “non-transitory” simply means that the storage medium is a tangible device, and does not include a signal (e.g., an electromagnetic wave), but this term does not differentiate between where data is semi-permanently stored in the memory 130 (the storage medium) and where the data is temporarily stored in the memory 130.
In some embodiments, functions related to artificial intelligence (AI) are operated by the processor 120 (or the main processor 121 or the auxiliary processor 123) and the memory 130. The processor 120 (or the main processor 121 or the auxiliary processor 123) may include or may correspond to a general-purpose processor, such as a CPU, an application processor, or a digital signal processor (DSP), a graphics-dedicated processor, such as a graphics processing unit (GPU) or a vision processing unit (VPU), or an artificial intelligence-dedicated processor, such as a neural processing unit (NPU). The processor 120 (or the main processor 121 or the auxiliary processor 123) may control input data to be processed according to predefined operation rules or artificial intelligence models, which are stored in the memory 130. Alternatively, the processor 120 (or the main processor 121 or the auxiliary processor 123) may be an artificial intelligence-dedicated processor including a hardware structure specialized for processing of a particular artificial intelligence model.
The predefined operation rules or the artificial intelligence models may be made through training. Here, the statement of being made through training means that a basic artificial intelligence model may be trained by a learning algorithm by using a large number of training data, thereby making a predefined operation rule or an artificial intelligence model, which is configured to perform a desired characteristic (or purpose). Such training may be performed in a device itself, in which artificial intelligence according to the disclosure is performed, or may be performed via a separate server or a separate system. Examples of the learning algorithm may include, but are not limited to, supervised learning, unsupervised learning, semi-supervised learning, and reinforcement learning.
The artificial intelligence model may include a plurality of neural network layers. Each of the plurality of neural network layers has a plurality of weight values and performs neural network calculations through calculations between a calculation result of a previous layer and the plurality of weight values. The plurality of weight values of the plurality of neural network layers may be optimized by a training result of the artificial intelligence model. For example, the plurality of weight values may be updated to minimize a loss value or a cost value, which is obtained from the artificial intelligence model during the process of training. An artificial neural network may include a DNN, and examples of the artificial neural network may include, but are not limited to, a random forest model, a convolutional neural network (CNN), a DNN, a recurrent neural network (RNN), a restricted Boltzmann machine (RBM), a deep belief network (DBN), a bidirectional recurrent deep neural network (BRDNN), and deep Q-Networks.
The input device 150 may receive a command or data to be used by other components (e.g., the processor 120) of the electronic device 101, from the outside (e.g., a user) of the electronic device 101. The input device 150 may include, for example, a microphone, a mouse, or a keyboard.
The sound output circuit 155 may output sound signals to the outside of the electronic device 101. The sound output circuit 155 may include, for example, a speaker or a receiver. The speaker may be used for general purposes, such as playing multimedia or playing record, and the receiver may be used for incoming calls. In one embodiment, the receiver may be implemented as separate from, or as part of the speaker.
The display 160 may visually provide information to the outside (e.g., a user) of the electronic device 101. The display 160 may include, for example, a display, a hologram device, or a projector and control circuitry to control a corresponding one of the display, hologram device, and projector. In one embodiment, the display 160 may include touch circuitry adapted to detect a touch, or sensor circuitry (e.g., a pressure sensor) adapted to measure the intensity of force incurred by the touch.
The audio circuit 170 may convert a sound into an electrical signal and vice versa. In one embodiment, the audio circuit 170 may obtain the sound via the input device 150, or output the sound via the sound output circuit 155 or a headphone of an external electronic device (e.g., an electronic device 102) directly (e.g., wiredly) or wirelessly coupled with the electronic device 101.
The sensor 176 may detect an operational state (e.g., power or temperature) of the electronic device 101 or an environmental state (e.g., a state of a user) external to the electronic device 101, and then generate an electrical signal or data value corresponding to the detected state. In one embodiment, the sensor 176 may include, for example, a gesture sensor, a gyro sensor, an atmospheric pressure sensor, a magnetic sensor, an acceleration sensor, a grip sensor, a proximity sensor, a color sensor, an infrared (IR) sensor, a biometric sensor, a temperature sensor, a humidity sensor, or an illuminance sensor.
The interface 177 may support one or more specified protocols to be used for the electronic device 101 to be coupled with the external electronic device (e.g., the electronic device 102) directly (e.g., wiredly) or wirelessly. In one embodiment, the interface 177 may include, for example, a high definition multimedia interface (HDMI), a universal serial bus (USB) interface, a secure digital (SD) card interface, or an audio interface.
A connection terminal 178 may include a connector via which the electronic device 101 may be physically connected with the external electronic device (e.g., the electronic device 102). In one embodiment, the connection terminal 178 may include, for example, a HDMI connector, a USB connector, a SD card connector, or an audio connector (e.g., a headphone connector),
The haptic circuit 179 may convert an electrical signal into a mechanical stimulus (e.g., a vibration or a movement) or electrical stimulus which may be recognized by a user via his tactile sensation or kinesthetic sensation. In one embodiment, the haptic circuit 179 may include, for example, a motor, a piezoelectric element, or an electric stimulator.
The camera 180 may capture a still image or moving images. In one embodiment, the camera 180 may include one or more lenses, image sensors, image signal processors, or flashes.
The power management circuit 188 may manage power supplied to the electronic device 101. In one embodiment, the power management circuit 188 may be implemented as at least part of, for example, a power management integrated circuit (PMIC).
The battery 189 may supply power to at least one component of the electronic device 101. In one embodiment, the battery 189 may include, for example, a primary cell which is not rechargeable, a secondary cell which is rechargeable, or a fuel cell.
The communication circuit 190 may support establishing a direct (e.g., wired) communication channel or a wireless communication channel between the electronic device 101 and the external electronic device (e.g., the electronic device 102, the electronic device 104, or the server 108) and performing communication via the established communication channel. The communication circuit 190 may include one or more communication processors that are operable independently from the processor 120 (e.g., the application processor (AP)) and supports a direct (e.g., wired) communication or a wireless communication. In one embodiment, the communication circuit 190 may include a wireless communication circuit 192 (e.g., a cellular communication circuit, a short-range wireless communication circuit, or a global navigation satellite system (GNSS) communication circuit) or a wired communication circuit 194 (e.g., a local area network (LAN) communication circuit or a power line communication (PLC) module). A corresponding one of these communication circuits may communicate with the external electronic device via the first network 198 (e.g., a short-range communication network, such as Bluetooth™, wireless-fidelity (Wi-Fi) direct, or infrared data association (IrDA)) or the second network 199 (e.g., a long-range communication network, such as a cellular network, the Internet, or a computer network (e.g., LAN or wide area network (WAN)). These various types of communication circuits may be implemented as a single component (e.g., a single chip), or may be implemented as multi components (e.g., multi chips) separate from each other. The wireless communication circuit 192 may identify and authenticate the electronic device 101 in a communication network, such as the first network 198 or the second network 199, using subscriber information (e.g., international mobile subscriber identity (IMSI)) stored in the subscriber identification module 196.
The antenna 197 may transmit or receive a signal or power to or from the outside (e.g., the external electronic device) of the electronic device 101. In one embodiment, the antenna 197 may include one or more antennas, and, therefrom, at least one antenna appropriate for a communication scheme used in the communication network, such as the first network 198 or the second network 199, may be selected, for example, by the communication circuit 190 (e.g., the wireless communication circuit 192). The signal or the power may then be transmitted or received between the communication circuit 190 and the external electronic device via the selected at least one antenna.
At least some of the above-described components may be coupled mutually and communicate signals (e.g., commands or data) therebetween via an inter-peripheral communication scheme (e.g., a bus, general purpose input and output (GPIO), serial peripheral interface (SPI), or mobile industry processor interface (MIPI)).
In one embodiment, commands or data may be transmitted or received between the electronic device 101 and the external electronic device 104 via the server 108 coupled with the second network 199. Each of the electronic devices 102 and 104 may be a device of a same type as, or a different type, from the electronic device 101. In one embodiment, all or some of operations to be executed at the electronic device 101 may be executed at one or more of the external electronic devices 102, 104, or 108. For example, if the electronic device 101 should perform a function or a service automatically, or in response to a request from a user or another device, the electronic device 101, instead of, or in addition to, executing the function or the service, may request the one or more external electronic devices to perform at least part of the function or the service. The one or more external electronic devices receiving the request may perform the at least part of the function or the service requested, or an additional function or an additional service related to the request, and transfer an outcome of the performing to the electronic device 101. The electronic device 101 may provide the outcome, with or without further processing of the outcome, as at least part of a reply to the request. To that end, a cloud computing, distributed computing, or client-server computing technology may be used, for example.
The electronic device according to various embodiments may be one of various types of electronic devices. The electronic devices may include, for example, a portable communication device (e.g., a smart phone), a computer device, a portable multimedia device, a portable medical device, a camera, a wearable device, or a home appliance. In one embodiment of the disclosure, the electronic devices are not limited to those described above.
In some embodiments, the earbuds 101A may be paired with a computer, an electronic device, or a mobile device that corresponds to the first electronic device 101 shown in
In
In one embodiment, the earbuds 101A may output an audio signal (e.g., via the sound output circuit 155) when worn and in close proximity to a part of the user's body. In one embodiment, at least one of a right earbud and a left earbud may output an audio signal using wireless data transmission and reception with an electronic device (such as a mobile phone), for example, configured to play a song to be transmitted to the earbuds 101A.
For example, the wireless data transmission and reception path may be based on or may correspond to a Bluetooth communication scheme, a route for a Bluetooth low energy (BLE) communication scheme, and a route for a ultra-wide band (UWA) communication scheme. The wireless data transmission and reception path may include at least one of a path for a wireless fidelity (Wi-Fi) direct communication technique and a path for a mobile communication technique (e.g., long term evolution (LTE) sidelink, etc.).
In one embodiment, only one of the earbuds 101A may create the communication path with the electronic device. For example, a first earbud may output an audio signal based on the audio data of the electronic device received through the communication path (the wireless data transmission and reception path). When the electronic device is connected to the right earbud, the electronic device or the right earbud provides information about the communication path to the left earbud so that the left earbud can output the audio signal. The left earbud can receive data transmitted to the right earbud based on the information about the communication path and output the audio signal.
For example, the left earbud can receive data transmitted to the right earbud by monitoring information about the communication path. In one embodiment, the right earbud connected to the electronic device may be referred to as a master device, and the left earbud not connected to the electronic device may be referred to as a slave device. In one embodiment, the master device and the slave device among the pair of the earbuds 101A may be changed. In one embodiment, at least one of the earbuds 101A may transmit data to the electronic device. For example, the data may include information for controlling the audio signal output through the earbuds 101A (e.g., information for playing a sound source, information for pausing the sound source, information for stopping the sound source, information for controlling the volume of the sound source, and information for selecting the sound source.
In
In one embodiment, the earbuds 101A may include at least one conductive pin pad (e.g., the connection terminal 178) on the outside. In one embodiment, the power supply device 200 may include at least one externally conductive pin (e.g., a conductive terminal). The conductive pin pad of the earbuds 101A and the conductive pin of the power supply 200 may be arranged to physically contact each other while the earbuds 101A is connected to the power supply 200. In one embodiment, when the earbuds 101A is connected to the power supply device 200, the conductive pin of the earbuds 101A and the conductive pin of the power supply device 200 may be in contact and electrically connected. In one embodiment, by identifying a conductive pin contact, the earbuds 101A or the power supply 200 can determine whether the earbuds 101A is connected to the power supply 200. In one embodiment, the earbuds 101A may detect the amount of light reflected from the power supply device 200 through a proximity sensor (e.g., included in the sensor 176).
In some embodiments, the watch 101B may be paired with a computer, an electronic device, or a mobile device that corresponds to the first electronic device 101 shown in
In
The watch 101B may display data or information through the display 160. The watch 101B may display data stored in the memory 130 through the display 160. For example, the watch 101B may display data or information provided by at least one application stored in the memory 130 through the display 160. Further, the watch 101B may display an execution screen or an execution state of at least one application through the display 160.
In one embodiment, the display 160 may display a user interface for performing a function of the watch 101B. For example, the display 160 may display an execution screen of at least one application stored in the watch 101B. Further, the display 160 may simultaneously display execution screens of a plurality of applications stored in the watch 101B.
In one embodiment, through the display 160, the watch 101B may identify at least one of a touch input, drag input (or touch-drag input), swipe input, and pinch input on the display 160.
The first input device 150A may generate an input signal in response to an input to the watch 101B. For example, the first input device 150A may be implemented in the form of a bezel of a watch. The user may rotate the first input device 150A in a first direction (for example, a clockwise direction) or a second direction (for example, a counterclockwise direction), and the watch 101B may identify the input corresponding to the rotation.
The watch 101B may further include the second input device 150B. For example, the second input device 150B may be implemented in the form of a watch stem. The user may rotate the second input device 150B and press the second input device 150B, and the watch 101B may identify the input corresponding to the rotation or pressing.
The watch 101B may further include the sound output circuit 155 (e.g., a speaker). For example, the watch 101B may provide information in the form of a voice or a sound through the sound output circuit 155.
In some embodiments, the execution screen may be a screen on which a corresponding application is executed and that indicates an execution state. That is, the corresponding application has already been executed or is being executed.
In
In some embodiments, the ring 101C may be paired with a computer, an electronic device, or a mobile device that corresponds to the first electronic device 101 shown in
For example, the body part of the user (on which the ring 101C is worn) may be the user's finger. For example, the frame 410 of the ring 101C may have a ring shape, such that the ring 101C may be worn on the user's finger. However, the disclosure is not limited thereto. The ring 101C, which may be referred to as a wearable device, may have a shape corresponding to the body part of the user.
According to an embodiment, the frame 410 may include a first surface 410A (e.g., an inner surface) facing the body part of the user while the ring 101C is worn on the body part of the user, and a second surface 410B (e.g., an outer surface) opposite to the first surface 410A. The body part may be one of the user's fingers. For example, when the ring 101C is worn by the user, at least a portion of the first surface 410A may contact the body part of the user. For example, the first surface 410A may surround the body part of the user wearing the ring 101C. For example, the first surface 410A may cover the body part of the user wearing the ring 101C. For example, the first surface 410A may be configured such that the ring 101C is fastened to the body part by pressurizing the body part of the user when the ring 101C is worn by the user.
For example, the second surface 410B may be an external appearance of ring 101C together with the first surface 410A. For example, the second surface 410B may be a ring-shaped frame 410 together with the first surface 410A. The second surface 410B (opposite to the first surface 410A) may be farthest from the body part. For example, the first surface 410A may be an inner circumferential surface of the frame 410. The second surface 410B (opposite to the first surface 410A) may be an outer circumference surface of the frame 410.
Here, the ring 101C has been described as being worn on a body part of the user, but the disclosure is not limited thereto. The term “body part” is used to describe the body part of the user on which the ring 101C is worn and is not intended to limit the body part of the user on which the ring 101C is worn or a positional relationship between the body part and the ring 101C thereto. For example, the body part may be one of the user's fingers, but is not limited thereto.
According to an embodiment, the frame 410 may include a first frame 411 defining the first surface 410A, and a second frame 412 defining the second surface 410B and coupled to the first frame 411.
For example, the first frame 411 may be a portion of the frame 410 including the first surface 410A. For example, when the ring 101C is worn by the user, the first frame 411 may be in contact with the body part of the user. The first frame 411 may include at least one of silicon, epoxy, and acryl, but is not limited thereto.
For example, the second frame 412 may surround the first frame 411. For example, the second frame 412 may support the first frame 411. For example, the second frame 412 may form the outer appearance of the frame 410 together with the first frame 411. For example, the second frame 412 may be a portion of the frame 410 including a second surface 410B opposite to the first surface 410A. The second frame 412 may include at least one of metal and titanium, but is not limited thereto. The frame 410 of the ring 101C may include the first frame 411 and the second frame 412 including different materials, thereby providing a user with various user experiences.
In
In some embodiments, the sensor 176 may include at least one of a temperature sensor, a proximity sensor, a motion sensor, or a pressure sensor. Further, the sensor 176 may include a light emitter facing the first surface 410A of the frame 410 and a light receiver spaced apart from the light emitter. In some embodiments, the sensor 176 is configured to detect biometric information about the user.
In
In
In
Also, according to another exemplary embodiment, the examples shown in
Each of the light-receiving devices 530 shown in
The structure of the light-receiving device 530 may be modified to have the structure of a light-receiving device 530B as illustrated in
In
In one case where a user is located at or near the second electronic device 102, the sensor 176B of the second electronic device 102 may be used to detect signals (e.g., biometric signals or physiological signals) from the user. Then, the second electronic device 102 may transmit the detected signals to the electronic device (computer) 101D over the first network 198. Next, the transmitted signals may be (temporarily or permanently) stored in the memory 130 and the processor 120 may perform various functions regarding the signals originally generated from the sensor 176B of the second electronic device 102.
Similarly, in another case where a user is located at or near the third electronic device 104, the sensor 176C of the third electronic device 104 may be used to detect signals (e.g., biometric signals) from the user. Then, the third electronic device 104 may transmit the detected signals to the electronic device (computer) 101D over the second network 199. Next, the transmitted signals may be (temporarily or permanently) stored in the memory 130 and the processor 120 may perform various functions regarding the signals originally generated from the sensor 176C of the third electronic device 104.
According to embodiments, the ECG transformer model 711A, the PPG transformer model 711B, and the merged transformer model 712 may be time-series transformer (TST) models, which may be a type of network used to process sequential data and assist in capturing temporal dependencies and patterns. In some embodiments, the ECG transformer model 711A, the PPG transformer model 711B, and the merged transformer model 712 may be PatchTST models, but embodiments are not limited thereto, and any transformer architecture may be used.
According to embodiments, the SBP estimation model 713A and the DBP estimation model 713B may be separate multi-layer perceptrons (MLP), but embodiments are not limited thereto. For example, in some embodiments the SBP estimation model 713A and the DBP estimation model 713B may be any other type of network, for example CNNs or RNNs.
According to embodiments, the SBP projection head 714A and the DBP projection head 714B may be projection networks or projection layers, for example fully-connected (FC) layers, but embodiments are not limited thereto.
As shown in
As further shown in
As further shown in
For example, at operation 723, the ECG transformer model 711A and the PPG transformer model 711B may be used to independently process the ECG signals xECG and the PPG signals XPPG to extract features into a similar embedding space so that they can be combined. In embodiments, the ECG transformer model 711A may generate ECG embeddings zECG based on the ECG signals xECG, and the PPG transformer model 711B may generate PPG embeddings zPPG based on the PPG signals xPPG. The ECG embeddings zECG and the PPG embeddings zPPG may be combined, for example using concatenation or summing, and the combined embeddings may be provided as input to the merged transformer model 712, which may extract synergistic features that can only be extracted leveraging the information of both of the input modalities. The output of the merged transformer model 712 may be provided to separate branches for systolic blood pressure (SBP) and diastolic blood pressure (DBP). For example, the output of the merged transformer model 712 may be provided to the SBP estimation model 713A to generate SBP embeddings zSBP, and may be provided to the DBP estimation model 713B to generate DBP embeddings zDBP. Then, the projection heads may be used to project the embeddings to the predicted BP values. For example, in embodiments the SBP projection head 714A may generate a predicted SBP value {circumflex over (γ)}SBP based on the SBP embeddings zSBP, and the DBP projection head 714B may generate a predicted DBP value {circumflex over (γ)}DBP based on the DBP embeddings zDBP. In embodiments, the predicted BP values may then be passed along to any downstream applications that might be depending on them
Although examples are provided herein in which the multimodal physiological signals include the ECG signals xECG and PPG signals xPPG, embodiments are not limited thereto. For example, according to embodiments the model 710 may be signal-agnostic, and may be applied to any other signals or modalities. For example, in some embodiments the model 710 may receive as inputs one or more of physiological signals such as heart rate variability (HRV), respiration rate, electroencephalogram (EEG), skin conductance, etc., behavioral measurements such as movement, gait, voice, stress, cognition, etc., and chemical measurements such as glucose level, sweat composition, etc.
As discussed above, a large subject-specific dataset may be used to train a model for performing cuffless BP monitoring, for example a training dataset including triplets of ECG readings, PPG readings and ground-truth BP readings of a particular subject, taken at multiple time points. Some approaches that use such datasets use a straightforward learning mechanism that attempts to directly predict the BP values, given the ECG and PPG signals as input. For example, these approaches may train the model using only an MAE loss function, as shown for example in Equation 1 below:
In Equation 1 above, ( ) may denote one of SBP and DBP for which the loss is being computed, N may denote a number of samples in the training dataset, γi may denote a ground-truth SBP value or DBP value for a given sample i, and {circumflex over (γ)}i may denote a corresponding predicted SBP value or DBP value for the sample i. While MAE loss may be effective, it does not offer any regularization.
According to embodiments, in order to provide improved SBP and DBP prediction, a different type of loss, which may be referred to as weighted contrastive learning (WCL) loss, may be used during training in order to bring together representations of ECG and PPG signals that have similar SBP values and DBP values. According to embodiments, such representations then lead to a better SBP and DBP regression. An example of a WCL loss function is shown below in Equation 2.
In Equation 2, Sij may denote a similarity metric between two samples indexed by i and j, zi may denote the SBP embedding or DBP embedding of the sample i, τw may govern how similarity decays with distance in the embedding space, and T may denote transposition. The WCL loss may be general in that it may be applied to the SBP embeddings and DBP embeddings separately.
In some embodiments, the similarity metric Sij may be computed according to Equation 3 below.
In Equation 3 above, zj may denote the SBP embedding or DBP embedding of the sample j, τs may denote a temperature term that governs how similarity decays with distance in the label space, Ts may denote a threshold below which two samples are deemed dis-similar, and ρ may denote a power term that governs a tail decay of similarity in the label space. In embodiments, at least one of τs, Ts, ρ, and τw may be hyperparameters.
According to embodiments, WCL loss may differ from other loss functions based on how it handles the relationship between samples during training. In approaches based solely on loss functions such as MAE loss and MSE loss, the difference between a predicted result and a corresponding ground truth result are minimized, without considering the relationship between different samples in the dataset. In contrast, using WCL loss may allow samples which are similar to each other to be clustered together in the embedding space. For example, training based on WCL may force the model to group similar data points (e.g., ECG signals and PPG signals with similar ground-truth SBP values and DBP values), in a high-dimensional space (e.g., the latent space or embedding space). This clustering may indicate that the model recognizes these data points as being related, which may assist in providing more accurate predictions. For example, this may allow the model to shape the embedding space in a way that resembles the ground-truth space (e.g., the space including the ground-truth SBP values and DBP values).
As an example in which the sample i is not equal to the sample j, the SBP embedding zSBPi of the sample i and the SBP embedding zSBPj of the sample j which have similar ground truth SBP values γSBP may be drawn closer to each other, and the higher the similarity SSBPij is, the higher the attraction may be. However, if SSBPij=0, then the SBP embeddings zSBPi and zSBPj may be considered dissimilar, and therefore may be repelled from each other.
Although examples are discussed above in which the similarity metric Sij is computed using Equation 3, embodiments are not limited thereto. For example, according to embodiments the similarity metric Sij may be computed using any other type of similarity, for example cosine similarity, L1 norm similarity, L2 norm similarity, dynamic time warping (DTW), or Jaccard similarity.
As can be seen in MAESBP may be determined based on the ground truth SBP values γSBP and the predicted SBP value {circumflex over (γ)}SBP, and an SBP WCL loss
WCLSBP may be determined based on the ground truth SBP values γSBP and the SBP embeddings ZSBP. Similarly, a DBP MAE loss
MAEDBP may be determined based on the ground truth DBP values γDBP and the predicted DBP value {circumflex over (γ)}DBP, and a DBP WCL loss
WCLDBP may be determined based on the ground truth DBP values γDBP and the DBP embeddings ZDBP. In embodiments, at least one of the pre-training process and the subject-specific training process may be performed based on any combination of one or more of the SBP MAE loss
MAESBP, the DBP MAE loss
MAEDBP, the SBP WCL loss
WCLSBP, and the DBP WCL loss
WCLDBP.
As shown in
As further shown in
As further shown in
According to embodiments, a training process performed according to the process 920 may include two stages, a pre-training stage and a fine-tuning stage. In embodiments, operations 921 through 923 may be performed in each of the pre-training stage and the fine-tuning stage, but embodiments are not limited thereto. For example, in some embodiments all of the training data may be collected and prepared at operation 921 and operation 922, and then operation 923 may be performed for the pre-training stage, and then performed again for the fine-tuning stage.
The pre-training stage may correspond to the pre-training process discussed above with reference to MAESBP, the DBP MAE loss
MAEDBP, the SBP WCL loss
WCLSBP, and the DBP WCL loss
WCLDBP may be combined, for example using addition or weighted addition, and the resulting composite loss may be used to perform the pre-training stage. In addition, in some embodiments, some or all of the subject-specific dataset may be combined with the large and diverse dataset to perform the pre-training.
The fine-tuning stage may correspond to the fine-tuning training process discussed above with reference to
As shown in
As further shown in
As further shown in
As further shown in
As further shown in
As further shown in
In embodiments, the first transformer model, the second transformer model, the third transformer model, and the at least one BP estimation model may be trained using a pre-training process and a user-specific training process, wherein the pre-training process is performed using a first training dataset corresponding to a plurality of users, and wherein the user-specific training process is performed based on a second training dataset corresponding to the user. In embodiments, the pre-training process may correspond to the pre-training process discussed above with reference to
In embodiments, at least one of the pre-training process and the user-specific training process is performed using a weighted contrastive loss function comprising a similarity metric. In embodiments, the weighted contrastive loss function and the similarity metric may correspond to the WCL loss WCL and the similarity metric Sij discussed above with reference to Equation 2 and Equation 3.
In embodiments, the similarity metric may indicate a similarity between a first ground truth BP value corresponding to a first training sample and a second ground truth BP value corresponding to a second training sample.
In embodiments, the similarity metric may be used to cluster training samples included in at least one of the first training dataset and the second training dataset in an embedding space.
In embodiments, the process 1000 may further include providing the output of the third transformer model to the SBP estimation model; generating an estimated SBP value based on an output of the SBP estimation model; providing the output of the third transformer model to the DBP estimation model; and generating an estimated DBP value based on an output of the DBP estimation model, and the estimated BP value may include the estimated SBP value and the estimated DBP value.
According to one or more embodiments of the disclosure, a method may be included and provided in a computer program product. The computer program product may be traded as a product between a seller and a buyer. The computer program product may be distributed in the form of a machine-readable storage medium (e.g., compact disc read only memory (CD-ROM)), or be distributed (e.g., downloaded or uploaded) online via an application store (e.g., PlayStore™), or between two user devices (e.g., smart phones) directly. If distributed online, at least part of the computer program product may be temporarily generated or at least temporarily stored in a machine-readable storage medium, such as memory of the manufacturer's server, a server of the application store, or a relay server.
According to one or more embodiments, each component (e.g., a module or a program) of the above-described components may include a single entity or multiple entities. According to one or more embodiments, one or more of the above-described components may be omitted, or one or more other components may be added. Alternatively or additionally, a plurality of components (e.g., modules or programs) may be integrated into a single component. In such a case, according to one or more embodiments, the integrated component may still perform one or more functions of each of the plurality of components in the same or similar manner as they are performed by a corresponding one of the plurality of components before the integration. According to one or more embodiments, operations performed by the module, the program, or another component may be carried out sequentially, in parallel, repeatedly, or heuristically, or one or more of the operations may be executed in a different order or omitted, or one or more other operations may be added.
According to one or more embodiments, in a non-volatile storage medium storing instructions, the instructions may be configured to, when executed by at least one processor, cause the at least one processor to perform at least one operation. The at least one operation may include displaying an application screen of a running application (e.g., the application 146) on a display (e.g., the display 160), identifying a data input field included in the application screen, identifying a data type corresponding to the data input field, displaying at least one external electronic device, around the electronic device, capable of providing data corresponding to the identified data type, receiving data corresponding to the identified data type from an external electronic device selected from among the at least one external electronic device through a communication circuit, and entering the received data into the data input field.
The embodiments of the disclosure described in the specification and the drawings are only presented as specific examples to easily explain the technical content according to the embodiments of the disclosure and help understanding of the embodiments of the disclosure, not intended to limit the scope of the embodiments of the disclosure. Therefore, the scope of one or more embodiments of the disclosure should be construed as encompassing all changes or modifications derived from the technical spirit of one or more embodiments of the disclosure in addition to the embodiments disclosed herein.
This application is based on and claims priority under 35 U.S.C. § 119 to U.S. Provisional Patent Application No. 63/539,546, filed on Sep. 20, 2023, in the U.S. Patent and Trademark Office, the disclosure of which is incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
63539546 | Sep 2023 | US |