his application relates generally to ear-level electronic systems and devices, including hearing aids, personal amplification devices, and hearables. In one embodiment, an ear-wearable device includes an input sensor that provides an input signal. The input signal is digitized via circuitry of the ear-wearable device. The device includes an adaptive feedback canceller with an adaptive foreground filter that inserts a feedback cancellation signal into the digitized input signal to produce an error signal. An instability detector of the device is configured to extract two or more features from the error signal. The instability detector includes a machine learning module that determines an instability in the error signal based on the two or more features. The instability module changes a step size of the adaptive foreground filter in response to determining the instability. The changed step size causes the adaptive foreground filter to have a faster adaptation to perturbations in the error signal compared to a previously used step size.
In another embodiment, a method involves extracting two or more features from an error signal of a feedback cancellation loop of an ear-wearable device. The two or more features include a power-level-dependent feature and a power-level-independent feature. The two or more features are input to a machine learning module to determine an instability in the error signal. A step size of an adaptive foreground filter used to cancel feedback in the ear-wearable device is changed in response to determining the instability. The changed step size causes the adaptive foreground filter to have a faster adaptation to perturbations in the error signal compared to a previously used step size
In another embodiment, a method involves extracting two or more features from an error signal of a feedback cancellation loop of an ear-wearable device. The two or more features include a power-level-dependent feature and a power-level-independent feature. The two or more features are input to a machine learning module to determine an instability in the error signal. An optimization algorithm of an adaptive foreground filter used to cancel feedback in the ear-wearable device is changed in response to determining the instability. The changed optimization algorithm causes the adaptive foreground filter to have a faster adaptation to perturbations in the error signal compared to a previously used optimization algorithm.
The above summary is not intended to describe each disclosed embodiment or every implementation of the present disclosure. The figures and the detailed description below more particularly exemplify illustrative embodiments.
The discussion below makes reference to the following figures.
The figures are not necessarily to scale. Like numbers used in the figures refer to like components. However, it will be understood that the use of a number to refer to a component in a given figure is not intended to limit the component in another figure labeled with the same number.
Embodiments disclosed herein are directed to feedback detection in an ear-worn or ear-level electronic device. Such a device may include cochlear implants and bone conduction devices, without departing from the scope of this disclosure. The devices depicted in the figures are intended to demonstrate the subject matter, but not in a limited, exhaustive, or exclusive sense. Ear-worn electronic devices (also referred to herein as “hearing devices” or “ear-wearable devices”), such as hearables (e.g., wearable earphones, ear monitors, and earbuds), hearing aids, hearing instruments, and hearing assistance devices, typically include an enclosure, such as a housing or shell, within which internal components are disposed.
Typical components of a hearing device can include a processor (e.g., a digital signal processor or DSP), memory circuitry, power management and charging circuitry, one or more communication devices (e.g., one or more radios, a near-field magnetic induction (NFMI) device), one or more antennas, one or more microphones, buttons and/or switches, and a receiver/speaker, for example. Hearing devices can incorporate a long-range communication device, such as a Bluetooth® transceiver or other type of radio frequency (RF) transceiver.
The term hearing device of the present disclosure refers to a wide variety of ear-level electronic devices that can aid a person with impaired hearing. The term hearing device also refers to a wide variety of devices that can produce processed sound for persons with normal hearing. Hearing devices include, but are not limited to, behind-the-ear (BTE), in-the-ear (ITE), in-the-canal (ITC), invisible-in-canal (IIC), receiver-in-canal (RIC), receiver-in-the-ear (RITE) or completely-in-the-canal (CIC) type hearing devices or some combination of the above. Throughout this disclosure, reference is made to a “hearing device” or “ear-wearable device,” which are understood to refer to a system comprising a single left ear device, a single right ear device, or a combination of a left ear device and a right ear device.
Embodiments described below utilize adaptive feedback cancellation (FBC), which involves detecting feedback affecting a hearing device. Various FBC systems are widely used and provide benefit for many patients with many devices, such as those popular large vents and open fittings. Perturbations pose challenges to traditional adaptive feedback cancellation algorithms that are based on least squared error (LSE) or mean squared error (MSE). Such perturbations include strong disturbances caused by significant feedback path changes due to user movements/changes of enclosure or environment around hearing devices.
In embodiments described below, the active FBC can be made robust against perturbations due to variations to the incoming signal statistics and feedback path changes. One way to address this problem is to precisely and promptly detect the perturbations due to feedback path change and make the adjustment to a different adaptation rate, e.g., when the feedback path change is detected, the system adapts at a higher rate for a fast re-convergence otherwise the adaptation rate reduces for a steady state with higher added stable gain.
Embodiments described below can improve the overall performance of the adaptive feedback canceler in normal, routine operation. In some embodiments, the adaptive feedback canceller is made robust against perturbations that include strong feedback path changes caused by user movements/changes of enclosure or environment around hearing devices and also variations to its statistics.
In
The feedback canceller 108 uses an adaptive filter to generate cancellation signal ŷ(n), which is an estimate of the feedback signal and is combined with the microphone signal m(n) at summation block 112. The output of the summation block 112 is error signal m(n), which is ideally close to or the same as the input signal x(n), depending on how well the cancellation signal ŷ(n) matches feedback y(n).
When there is a sudden change to the feedback path due to user's physical actions 110 or the statistics of incoming signal x(n), the steady state of the adaptive filtering is undermined, and it needs some time to re-converge. The step size in the adaptive filter update is a trade-off between fast convergence rate/good tracking ability on the one hand and low mis-adjustment on the other hand. In embodiments described herein, the adaptive filter step size can be time-varying to adapt to changing conditions (e.g., instabilities) affecting the feedback cancellation system.
The convergence of typical adaptive filtering algorithms assumes a stable feedback path and stationary input signal. However, this assumption can be difficult to achieve in practice. Many practical scenarios or actions made by users, e.g. user sneezing, standing up and sitting down, phone moving close to the ears, would leads to noticeable changes to the acoustic feedback paths. Additionally, the transition between stationarity periods of the incoming signal leads to outliers in the error signal, resulting in local divergence of the adaptive filter. These perturbations are addressed by the disclosed embodiments.
In one embodiment, a feedback path stability detector controls the step-size of the adaptation as a factor of the update term. The adaptation behavior with variable step size would be: 1) steady state: the error signal e(n) is equal to the source signal x(n) and this means that the step size is 0 as desired. 2) unsteady state: σe(n) is much larger than σx(n). Thus, the step size increases to 1 for fast adaptation which is the desired behavior. A fast reset scheme for the time-variant step-size factor is also devised. The algorithm can switch between a normalized least mean square algorithm (NLMS) and a normalized sign algorithm (NSA). The selection of which algorithm to use depends on an improved stability detector using machine learning.
In
Robustness of the adaptive filter 204 signifies insensitivity to a certain amount of deviations from statistical modeling assumptions due to some outliers. The sensitivity to outliers increases with the convergence speed of the adaptation algorithm and limits the performance of signal processing algorithms, especially when fast convergence is required such as in feedback cancellation. The update filter 202 will allow the adaptive filter 204 to switch between these regimes based on an estimate of instability. Additional features of the adaptive filter will be described further below.
Also seen in
In
After changing the step size of the adaptive filter and/or changing the optimization algorithm, the stability detector determines the error signal has returned to stability, causing signal to revert to a value of ‘0.’ In response thereto, the update filter 208 reverts to a previously used step size of the adaptive filter. The previously used step size results in the adaptive filter having a lower sensitivity to the perturbations than the optimization algorithm. In addition to the change in step size, the signal's reversion to ‘0’ results in the optimization algorithm being changed from the a normalized least square algorithm 302 to the normalized sign algorithm 304. A reset signal 308 may be used to initialize the system and/or reset the system to a default value, e.g., using NLMS 304 as a default and/or using an initial step size.
In
A feature extraction process 404 extracts features from the signal 402, which may be obtained from a frequency-domain or time-domain representation of the signal data stream 402. These may be a combination of power-dependent and power-independent features, as will be described in greater detail elsewhere below. These features are normalized 405 into a vector suitable for the machine learning model. A training process 406 generally involves inputting the feature vectors into an initialized machine-learning model. In some embodiments (e.g., Gaussian mixture model, support vector machine) the training process involves classifying the signals (or parts thereof) into groups/classes. In other embodiments, e.g., neural networks, the training process calculates differences between the output of the model compared to the predefined classification data, and adjusts the model parameters to minimize the differences, e.g., using stochastic gradient descent. The training process 406 may continue iteratively until a condition is met, e.g., classification errors are sufficiently low, classification errors do not further decrease, etc.
One a preliminary trained model is formed, a validation 407 is performed, typically with training data that is not part of the original set 402. This can be used to verify performance of the model and/or additionally refine the model. After validation 407, a trained model 408 is established. The trained model 408 contains data (e.g., parameters, weights) that can be deployed 409 as a detection model 414 that is stored and operated on a hearing device. In the hearing device, an operational error signal data stream 410 is fed into a feature extractor 412 and feature vector normalization 413, which is then processed by the detection module 414, which outputs the control signal 300, which is used as shown in
The stability detector 200 shown in
The feature extractor 412 can extract at least two features from the error signal data 410. In an embodiment shown in
In
The shadow-filter (dual-filter) structure shown in
Err_ƒorek(n)=Err_ƒorek(n−1)+α_1*((E_ƒore(k)·*conj(E_ƒore(k))−Err_ƒorek(n−1)) (1)
α_1*((E_back(k)·*conj(E_back(k))−Err_backk(n−1)) (2)
Shadow_error(n)=20 log 10(∥Err_back[n1:n2](n)∥/∥Err-ƒore[n1:n2](n)∥) (3)
In reference again to
For the auto PSD, Skis calculated by recursive smoothing of the short-term power of error signals. This is shown in Equation (5), where alpha1 is the smoothing factor based on the smoothing time constant and errSignalk is the complex error signal from the FFT-analysis at the kth band. Once again, this calculation may use the bands that correspond to around 2000 Hz˜4500 Hz where the feedback paths are generally significant.
sk(n)=sk(n−1)+alpha1*[{errSignalk*conj(errSignalk)}−sk(n−1)] (5)
Feature 502 includes log mel-band energies from effective bands. Log mel-band energies have been used for single channel sound event detection and have proven to be good features for this purpose. In the proposed system log mel-band energies can be extracted from the foreground filter error signal. Note that the features 500 and 501 being ratios are power-level-independent features (e.g., independent of the average PSD of the error signal), while feature 502 will generally increase or decrease with the average PSD of the signal.
In reference again to
However, this data collection scheme may have drawbacks. For example, the dynamic feedback paths are cumbersome and time-consuming to measure in practice, which makes it difficult to achieve a sufficiently large and comprehensive data base. Also, the ground truth labelling of the feedback path change can only be roughly labelled. Further, the subjects' movement behaviors differ from one other and are specific to individuals, therefore the obtained data may not be general to represent the feedback path change given the size of the data samples being relatively small. Given this, a mixture of both real-world measurement of dynamic feedback paths and artificially manipulated feedback path changing data set may be used.
For the manipulated feedback path changing data set, one option is to use existing feedback path databases. Within these databases, it is possible to find a number of individual feedback path responses that are stationary. The feedback path can be manipulated by switching from one response to another non-repetitive one over a short time period (every 2 seconds or so) in a cross-validation arrangement. This enables creating a large number feedback path change paths compared to the number of individual feedback path change events. The artificially manipulated feedback path changing data set can be mixed with real-world measurement data. Let β be the percentage ratio of how much the real-world measurement of dynamic feedback path response samples being weighted in the overall training set, e.g., β can be 25%, 50%, 75%, up to 100%.
In reference again to
In
In
The user interface 601 is also shown with radio buttons 604 that allows selecting which features (e.g., features 500-502 in
A simulation was performed using a feedback cancellation arrangement as shown above. The simulation used real-world measured feedback paths and confirmed clear improvements. There was an added stable gain improvement of 4.5 dB in steady state (no path change). There was 33% less instance of negative gain margin than existing baseline, indicating that the risks of having chirps are significantly lower. The simulation observed 8% less severe chirpings (gain margin streaks <−10 dB) than baseline method during path change.
In
The hearing device 700 includes a processor 720 operatively coupled to a main memory 722 and a non-volatile memory 723. The processor 720 can be implemented as one or more of a multi-core processor, a digital signal processor (DSP), a microprocessor, a programmable controller, a general-purpose computer, a special-purpose computer, a hardware controller, a software controller, a combined hardware and software device, such as a programmable logic controller, and a programmable logic device (e.g., FPGA, ASIC). The processor 720 can include or be operatively coupled to main memory 722, such as RAM (e.g., DRAM, SRAM). The processor 720 can include or be operatively coupled to non-volatile memory 723, such as ROM, EPROM, EEPROM or flash memory. As will be described in detail hereinbelow, the non-volatile memory 723 is configured to store instructions that facilitate using a feedback path stability detector in order to change the operation of a feedback cancellation filter.
The hearing device 700 includes an audio processing facility operably coupled to, or incorporating, the processor 720. The audio processing facility includes audio signal processing circuitry (e.g., analog front-end, analog-to-digital converter, digital-to-analog converter, DSP, and various analog and digital filters), a microphone arrangement 730, and a speaker or receiver 732. The microphone arrangement 730 can include one or more discrete microphones or a microphone array(s) (e.g., configured for microphone array beamforming). Each of the microphones of the microphone arrangement 730 can be situated at different locations of the housing 702. It is understood that the term microphone used herein can refer to a single microphone or multiple microphones unless specified otherwise.
The hearing device 700 may also include a user interface with a user control interface 727 operatively coupled to the processor 720. The user control interface 727 is configured to receive an input from the wearer of the hearing device 700. The input from the wearer can be any type of user input, such as a touch input, a gesture input, or a voice input. The user control interface 727 may be configured to receive an input from the wearer of the hearing device 700 to change feedback cancellation parameters of the hearing device 700, such as shown in
The hearing device 700 also includes a feedback cancellation module 738 operably coupled to the processor 720. The feedback cancellation module 738 can be implemented in software, hardware, or a combination of hardware and software. The feedback cancellation module 738 can be a component of, or integral to, the processor 720 or another processor (e.g., a DSP) coupled to the processor 720. The feedback cancellation module 738 is configured to detect and cancel feedback in different types of acoustic environments.
According to various embodiments, the feedback cancellation module 738 includes an adaptive filter that inserts a feedback cancellation signal into a digitized input signal to produce an error signal. The feedback cancellation module 738 includes or is coupled to an instability detector configured to extract two or more features from the error signal. The instability detector includes a machine learning module that determines instability in the error signal based on the two or more features. The instability module changes a step size of the adaptive filter in response to determining the instability. The changed step size causes the adaptive filter to have a faster adaptation to perturbations in the error signal.
The hearing device 700 can include one or more communication devices 736 coupled to one or more antenna arrangements. For example, the one or more communication devices 736 can include one or more radios that conform to an IEEE 802.11 (e.g., WiFi®) or Bluetooth® (e.g., BLE, Bluetooth® 4. 2, 5.0, 5.1, 5.2 or later) specification, for example. In addition, or alternatively, the hearing device 700 can include a near-field magnetic induction (NFMI) sensor (e.g., an NFMI transceiver coupled to a magnetic antenna) for effecting short-range communications (e.g., ear-to-ear communications, ear-to-kiosk communications).
The hearing device 700 also includes a power source, which can be a conventional battery, a rechargeable battery (e.g., a lithium-ion battery), or a power source comprising a supercapacitor. In the embodiment shown in
In
Generally, the method can be implemented within an infinite loop in a hearing device. The method involves extracting 800 two or more features from an error signal of a feedback cancellation loop of an ear-wearable device. For example, the two or more features may include at least one power-level-dependent feature and at least one power-level-independent feature. The two or more features are input 801 to a machine learning module to determine an instability in the error signal. At block 802, it is determined whether the machine learning module detects a change from a stable to unstable state of the feedback path or other factors that can affect the ability to perform feedback cancellation.
If a change 802 from stable to unstable is detected, then one or both of blocks 803, 804 may be performed. Block 803 involves changing a step size of an adaptive filter used to cancel feedback in the ear-wearable device in response to determining the instability. The change in step sized cause to the adaptive filter to have a faster adaptation to perturbations in the error signal compared to a previously used step size. Block 804 involves changing an optimization algorithm of the adaptive filter, the changed optimization algorithm having a faster adaptation to perturbations in the error signal.
If block 802 returns ‘no’ then another decision block 805 determines if whether the machine learning module detects a change from an unstable to stable state of the feedback path/environment. If a change 805 from unstable to stable is detected, then one or both of blocks 806, 807 may be performed. Block 803 involves reverting the step size of the adaptive filter to a previously used value, the previously used step size resulting in the adaptive filter having a lower sensitivity to the perturbations than the changed step size. Block 804 involves reverting to a previously used optimization algorithm of the adaptive filter, the previously used optimization algorithm having a lower sensitivity to the perturbations than the algorithm selected at block 804. Note that while some of blocks 803, 804, 806, and 807 are described as being optional, if the operation described in block 803 is used, then the operation described in block 806 will also generally be performed as appropriate. A similar dependency may exist between blocks 804 and 807.
Although reference is made herein to the accompanying set of drawings that form part of this disclosure, one of at least ordinary skill in the art will appreciate that various adaptations and modifications of the embodiments described herein are within, or do not depart from, the scope of this disclosure. For example, aspects of the embodiments described herein may be combined in a variety of ways with each other. Therefore, it is to be understood that, within the scope of the appended claims, the claimed invention may be practiced other than as explicitly described herein.
All references and publications cited herein are expressly incorporated herein by reference in their entirety into this disclosure, except to the extent they may directly contradict this disclosure. Unless otherwise indicated, all numbers expressing feature sizes, amounts, and physical properties used in the specification and claims may be understood as being modified either by the term “exactly” or “about.” Accordingly, unless indicated to the contrary, the numerical parameters set forth in the foregoing specification and attached claims are approximations that can vary depending upon the desired properties sought to be obtained by those skilled in the art utilizing the teachings disclosed herein or, for example, within typical ranges of experimental error.
The recitation of numerical ranges by endpoints includes all numbers subsumed within that range (e.g. 1 to 5 includes 1, 1.5, 2, 2.75, 3, 3.80, 4, and 5) and any range within that range. Herein, the terms “up to” or “no greater than” a number (e.g., up to 50) includes the number (e.g., 50), and the term “no less than” a number (e.g., no less than 5) includes the number (e.g., 5).
The terms “coupled” or “connected” refer to elements being attached to each other either directly (in direct contact with each other) or indirectly (having one or more elements between and attaching the two elements). Either term may be modified by “operatively” and “operably,” which may be used interchangeably, to describe that the coupling or connection is configured to allow the components to interact to carry out at least some functionality (for example, a radio chip may be operably coupled to an antenna element to provide a radio frequency electric signal for wireless communication).
Terms related to orientation, such as “top,” “bottom,” “side,” and “end,” are used to describe relative positions of components and are not meant to limit the orientation of the embodiments contemplated. For example, an embodiment described as having a “top” and “bottom” also encompasses embodiments thereof rotated in various directions unless the content clearly dictates otherwise.
Reference to “one embodiment,” “an embodiment,” “certain embodiments,” or “some embodiments,” etc., means that a particular feature, configuration, composition, or characteristic described in connection with the embodiment is included in at least one embodiment of the disclosure. Thus, the appearances of such phrases in various places throughout are not necessarily referring to the same embodiment of the disclosure. Furthermore, the particular features, configurations, compositions, or characteristics may be combined in any suitable manner in one or more embodiments.
The words “preferred” and “preferably” refer to embodiments of the disclosure that may afford certain benefits, under certain circumstances. However, other embodiments may also be preferred, under the same or other circumstances. Furthermore, the recitation of one or more preferred embodiments does not imply that other embodiments are not useful and is not intended to exclude other embodiments from the scope of the disclosure.
As used in this specification and the appended claims, the singular forms “a,” “an,” and “the” encompass embodiments having plural referents, unless the content clearly dictates otherwise. As used in this specification and the appended claims, the term “or” is generally employed in its sense including “and/or” unless the content clearly dictates otherwise.
As used herein, “have,” “having,” “include,” “including,” “comprise,” “comprising” or the like are used in their open-ended sense, and generally mean “including, but not limited to.” It will be understood that “consisting essentially of,” “consisting of,” and the like are subsumed in “comprising,” and the like. The term “and/or” means one or all of the listed elements or a combination of at least two of the listed elements.
The phrases “at least one of,” “comprises at least one of,” and “one or more of” followed by a list refers to any one of the items in the list and any combination of two or more items in the list.
This application is a U.S. National Stage application under 35 U.S.C. 371 of PCT Application No. PCT/US2021/025886 filed Apr. 6, 2021, which claims the benefit of U.S. Provisional Application No. 63/007,617, filed Apr. 9, 2020, the entire contents of which are is hereby incorporated by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/025886 | 4/6/2021 | WO |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2021/207134 | 10/14/2021 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6377682 | Benesty et al. | Apr 2002 | B1 |
6947549 | Yiu et al. | Sep 2005 | B2 |
7650004 | Durant | Jan 2010 | B2 |
8355794 | Lineaweaver et al. | Jan 2013 | B2 |
10492008 | Xu et al. | Nov 2019 | B2 |
20120087509 | Elmedyb | Apr 2012 | A1 |
20150189450 | Van Der Werf | Jul 2015 | A1 |
20160302014 | Fitz et al. | Oct 2016 | A1 |
20170289705 | Hillbratt | Oct 2017 | A1 |
20170311095 | Fitz | Oct 2017 | A1 |
20190297433 | Kuriger et al. | Sep 2019 | A1 |
20200053486 | Jensen | Feb 2020 | A1 |
Number | Date | Country |
---|---|---|
2071875 | Jun 2011 | EP |
2439958 | Jun 2013 | EP |
3236675 | Jan 2020 | EP |
WO 0019605 | Apr 2000 | WO |
2008051569 | May 2008 | WO |
Entry |
---|
PCT Search Report and Written Opinion for PCT/US2021/025886 dated Jul. 14, 2021 (16 pages). |
Cakir et al., “Polyphonic Sound Event Detection Using Multi Label Deep Neural Networks”, 7 pages. |
Huo et al., “A Robust Transform Domain Echo Canceller Employing a Parallel Filter Structure”, ScienceDirect, vol. 86, Issue 12, Dec. 2006, pp. 3752-3760. |
Nordholm et al, “Stability-Controlled Hybrid Adaptive Feedback Cancellation Scheme for Hearing Aids”, J. Acoust. Soc. Am. 143(1), Jan. 2018, pp. 150-156. |
Parascandolo et al., “Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life Recordings”, ICASSP2016, Mar. 20-25, 2016, Shanghai China, 6 pages. |
Vega et al., “A New Robust Variable Step-Size NLMS Algorithm”, IEEE Transactions on Signal Processing, vol. 56, No. 5, May 2008, 16 pages. |
Number | Date | Country | |
---|---|---|---|
20230143347 A1 | May 2023 | US |
Number | Date | Country | |
---|---|---|---|
63007617 | Apr 2020 | US |