The voice is a fundamental tool in people's lives as communication of a person is mainly conditioned by the voice. Therefore, the appearance of conditions or voice disorders affecting or canceling the ability to speak involves a significant decrease in quality of life and a serious occupational health problem. Vocal folds nodules, muscle dysphonia, tension, spasmodic dysphonia, paralysis of the folds, or temporary loss of voice, among others, annually affect millions of people worldwide.
Currently, treatments for voice pathologies or disorders are only carried out through visits to medical consultations and in limited times, which does not allow obtaining constant and sufficient information to adequately describe its daily use. In this sense, clinical monitoring is essential for the effectiveness of the diagnosis and treatment of these pathologies, but its implementation is limited to medical consultations, which excludes the possibility of obtaining information that adequately describe the actual use of the voice. Without this information, specialists can only design broad-spectrum therapies, which are often inefficient and result in a high number of recurrences.
Recently, to provide a proper monitoring of patients with voice conditions, wearable devices have been developed, commonly in the form of a necklaces, that allows monitoring the use of the voice using electrodes, microphones, and other equipment.
An exemplary technology of the prior art is described in document US 2014/235977 A1, which discloses a neck-worn sensor that is a single, body-worn system that measures several parameters from an ambulatory patient. From stroke volume, a first algorithm employing a linear model can estimate the patient's pulse pressure. And from pulse pressure and pulse transit time, a second algorithm, also employing a linear algorithm, can estimate systolic blood pressure and diastolic blood pressure. Thus, the necklace can measure all five vital signs along with hemodynamic parameters. It also includes a motion-detecting accelerometer, from which it can determine motion-related parameters such as posture, degree of motion, activity level, respiratory-induced heaving of the chest, and falls.
The prior art device described above addresses the problems related to the portability of the equipment, allowing a semi-continuous monitoring of the user by means of a wearable device. However, the configuration of this device is intended for general data logging, but there is no need for extra care with the signal integrity. This is clear considering the purpose of the transducer used, an accelerometer, which aims to record the user movement, but there are no further treatments to thoroughly analyze the characteristics of the voice signals.
Another approach to register and analyze voice signals is based in the use of neck-surface accelerometers, which allow for non-invasive and non-obstructive speech and swallow measures. One document that represents this kind of technologies of the prior art is document US 2014/0066724 A1, which describes a system and method to assess vocal function of a subject. This system includes an accelerometer configured to acquire surface acceleration data associated with vocal functionality of the subject and a computer system configured to analyze the surface acceleration data and to estimate glottal airflow wave-forms produced by the subject based on the surface acceleration data. The computer system performs the analysis and estimation by applying an inverse filter to the surface acceleration data based on a calibrated transmission line model and generates an indication of vocal functionality of the subject based on the estimated glottal airflow waveforms.
However, the system described above requires the connection of a sensor to a computer system to analyze the data obtained by the sensor. Accordingly, the system described do not address the portability issues and do not comply with the requirements for an autonomous operation. Additionally, the computer system requires the use of audio codecs to pre-process the data and store it digitally. This pre-process may vary considering the computer system and may include gain, band-pass filtering and noise reduction that will cause distortion of the signal, thereby affecting the integrity of the signal and the subsequent analysis of the data. Also, being a wired device it's prone to have unintended connection problems.
Accordingly, there is a need of providing an autonomous device to record the complete use of the voice during the day in everyday conditions, and able to maintain at the same time the accuracy and integrity of the signal, so the data can be used to obtain and estimate certain parameters and indicators that are useful for the assessment of the vocal function.
The invention refers to a wearable voice detection system, in the form of a necklace, that allows monitoring the use of the voice of a user, the system comprising:
The system described comprises compact, small-sized elements that allow portability and usability as a wearable object, thereby allowing a continuous monitoring of the voice in everyday conditions of use. By the combined use of the sound detection means and the accelerometer, the system is capable of accurately registering and analyzing the use of the vocal folds in order to estimate a series of physiological parameters that are not only of clinical interest to researchers and voice professionals, but also to any professional who uses their voice as a work tool and requires precise monitoring of the health of their voice, such as announcers, singers, teachers, journalists, among others. Thus, the specific configuration of the system described above is capable of substantially improving the evaluation, diagnosis, and monitoring capacity of vocal pathologies that affect millions of people a year in the world, which, in their most serious cases, even lead to permanent loss of voice.
Therefore, the operation of the device of the present invention is based in the use of an accelerometer and a microphone simultaneously, allowing to process signals in real time to deliver instant feedback to users based on their vocal use, which constitutes a disruptive methodology for vocal therapies.
One of the key features of the system described is that the processing means are separate from the sensing device, thus providing separate, small, and compact elements which facilitates the wearability and comfort for the user, by providing a minimally invasive wearable device in the form of a necklace.
Preferably, the processed data obtained from the captured signals can be stored in a data storage means in the control device, and can be transmitted periodically or in real time to an external location, for example, an external computer or in the cloud, where it can be later analyzed by a specialist, in a post-processing or medical analysis. Alternatively, the control device can be configured to transmit information in real time to a user interface, such as an app in a smartphone via Bluetooth.
One of the key aspects that are addressed by the invention is the integrity of the signal. This is accomplished on one hand by selecting the correct transducers, an accelerometer with the specific bandwidth for vocal applications combined with a microphone, and on the other hand, by the optimal data acquisition process, which includes filtering by hardware to precondition de signal and then the use of an audio codec to further process the signals. The combination of this characteristics of the invention provides full control over the behavior of the input signal, minimizing phase and harmonic distortions and, finally, encoding de data for transfer and storage.
More particularly, the use of an accelerometer strategically positioned on the trachea allows the estimation of glottic flow, subglottic pressure, and other determining variables for the identification of vocal hyperfunction. Additionally, the data obtained by the accelerometer is complemented by an environmental capture of sound by the sound detecting means, which can be selectively controlled to be turned on and off, thereby allowing the patient to decide when he do not want certain information to be recorded. The combination of both mechanisms allows the instant detection of vocal abuse.
The combined and simultaneous use of two kind of signals, sound, and acceleration, allows the delivery of clinically relevant information for the evaluation of vocal function and has been shown to better identify patterns of vocal abuse, generating a more useful device and, therefore, having greater appreciation by health professionals and patients. Its features allow feedback using advanced parameters and indicators for vocal use, which is a revolutionary therapeutic methodology, performing a pre-processing of the signals and transmitting data to provide feedback to the user. In addition, its wireless, ergonomic, and discreet design conceals its medical nature and makes it easy to use, providing an object that does not look like a medical device, allowing the user to use the device as a wearable item without affecting the quality of the captured signals.
The following is a detailed description of exemplary embodiments to illustrate the principles of the invention. The embodiments are provided to illustrate aspects of the invention, but the invention is not limited to any embodiment. The scope of the invention encompasses numerous alternatives, modifications and equivalent, only limited by the embodiments of the claims.
According to
As disclosed in
Referring to
Referring to
The energy storage means (124) allows an autonomous operation of the system. To accomplish this operation, the energy storage means (124) preferably consist of a battery that allows the system to operate without a physical connection to an external source. In this embodiment, a charging port can be included in the control device to allow the charge of the battery with an external source.
The processing means (123) is configured to implement voice processing algorithms and to command all the elements of the system (100). Preferably, the processing means (123) consist of a printed circuit board, configured with state-of-the-art electronic technology, and it is capable of processing signals to deliver instant feedback (biofeedback) to users based on their vocal use, which is a new methodology for vocal therapies. The data obtained can be stored and processed safely in the cloud (HIPPA compliant) or an external location, thanks to unique algorithms specially designed for the interpretation of this data, which allows the generation of new useful information for health professionals (such as flow glottic airway, subglottic pressure, and vocal efficiency).
In preferred embodiments of the invention, the processing means (123) include data storage means configured to store all the data that is being processed by the system, thus allowing to process data while the device is be used.
The control device preferable include data transmission means configured to allow the transmission of the processed data to an external location. Preferably, the transmission means is configured to transmit the processed data periodically or in real time to the external location, for example, an external computer or in the cloud, where it can be later analyzed by a specialist, in a post-processing or medical analysis. Alternatively, the control device can be configured to transmit information in real time to a user interface, such as a computer or an app in a smartphone via Bluetooth.
In preferred embodiments, the processed data is preferably transmitted to a user interface, which is configured to visualize and analyze the data in a corresponding software, aimed at researchers and voice professionals. The processed data can be visualized in all kinds of mobile platforms and computers, both for health professionals and patients, to visualize and analyze information processed periodically or in in real time, and to track unprecedented vocal function.
The control means (121) can consist in a keypad including one or more buttons, or can be configured as a touchpad, and is configured to provide basic commands for the operation of the system, such as turning the system on and off, or other alternative functions. Additionally, the control means can include displaying means, such as a screen or lights to provide basic information about the status of the operation, like the battery level, or other operation features.
In preferred embodiments, the processing means is configured to implement several treatments or algorithms to the input signal, including filtering by hardware to precondition de signal and then the use of an audio codec to further process the signals. This procedure in combination with the use of the correct transducers, an accelerometer with the specific bandwidth for vocal applications combined with a microphone, allow to maintain the integrity of the signals, providing full control over the behavior of the input signal, minimizing phase and harmonic distortions and, finally, encoding de data for transfer and storage.
The processing means is configured to implement a vocal analysis engine, which is the core analysis obtained using the system (100). The vocal analysis engine comprises several algorithms designed for the assessment of the vocal function, with two analysis modules that operates with a neck surface acceleration signal (ACC) and a sound signal obtained by the sound detecting means, preferably a microphone (MIC).
The first analysis module is the “standard vocal health analysis”, which considers speech signal processing approaches that are not available in any prior ambulatory voice monitor. The following features are included in this module:
Additionally, the following advanced features can be obtained by the processing means, which have been shown to better identify vocal hyperfunctional behaviors and are key for a more comprehensive assessment of vocal function in an ambulatory scenario:
Therefore, by means of the features described above the invention allows to obtain and estimate parameters and indicators that are useful for the assessment of the vocal function, such as SPL, VAD, f0, H1-H2, CPP, which nowadays can only be obtained in clinical facilities, and some of them, such as the aerodynamics features, requires highly invasive procedures to be obtained. The voice detection system described herein allows to obtain these parameters and indicators in a continuous operation by means of a portable device.
Additionally, the processing means is configured to provide daily reports. Once the vocal health indicators are calculated, the Vocal Analysis Engine generates a summary of the results. These results are saved and sent to both the users, for example via a mobile application or a web browser, and the Health Specialist. The specific content includes raw features, daily/weekly statistics, and daily biofeedback summary. As complement for the daily reports, the Vocal Analysis Engine is also capable of generating graphic information based on the daily reports and user-requested analyses. Features in this module can be selected as desired and include:
While the present invention has been described in terms of particular embodiments and applications, in both summarized and detailed forms, it is not intended that these descriptions in any way limit its scope to any such embodiments and applications, and it will be understood that many substitutions, changes and variations in the described embodiments, applications and details of the methods and system illustrated herein can be made by those skilled in the art without departing from the spirit of this invention.
This application claims the benefit of Applicants' prior provisional application, number U.S. 63/061,348, filed on Aug. 5, 2020.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2021/057224 | 8/5/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63061348 | Aug 2020 | US |