The application relates to an audio system, such as a hearing aid, a communication system (including but not limited to, a teleconference system, an intercom system, etc.), etc., with feedback cancellation. The feedback cancellation may include echo cancellation, cancellation of acoustic feedback signals, cancellation of mechanically coupled feedback signals, cancellation of electromagnetically coupled feedback signals, etc.
Feedback is a well-known problem in audio systems, and several systems for suppression or cancellation of feedback exist within the art. With the development of very small digital signal processing (DSP) units, it has become possible to perform advanced algorithms for feedback suppression in a tiny device such as a hearing instrument, cf., e.g., U.S. Pat. No. 5,619,580; U.S. Pat. No. 5,680,467; and U.S. Pat. No. 6,498,858.
The above mentioned prior art systems for feedback cancellation in hearing aids are all primarily concerned with the problem of external feedback, i.e. transmission of sound between the loudspeaker (often denoted receiver) and the microphone of the hearing aid along a path outside the hearing aid device. This problem, which is also known as acoustical feedback, occurs e.g. when a hearing aid ear mould does not completely fit the wearer's ear, or in the case of an ear mould comprising a canal or opening for e.g. ventilation purposes. In both examples, sound may “leak” from the receiver to the microphone and thereby cause feedback.
However, feedback in a hearing aid may also occur internally as sound can be transmitted from the receiver to the microphone via a path inside the hearing aid housing. Such transmission may be airborne or caused by mechanical vibrations in the hearing aid housing or some of the components within the hearing instrument. In the latter case, vibrations in the receiver are transmitted to other parts of the hearing aid, e.g. via the receiver mounting(s). For this reason, the receiver is not fixed but flexibly mounted within some state-of-the-art hearing aids of the ITE-type (In-The-Ear), whereby transmission of vibrations from the receiver to other parts of the device is reduced.
Typically, feedback suppression or cancellation circuits utilise one or more adaptive filters. The adaptive filter performance is a trade-off between low steady-state error and sufficient ability to track changes. Thus, under steady-state conditions the performance is sub-optimal since the adaptive filter should be capable of adapting to a sudden change, while in dynamic situations the performance is sub-optimal because the tracking is slow.
It is an object to provide an audio system with feedback cancellation with an improved trade-off between low steady-state error and fast tracking.
According to some embodiments, the above-mentioned and other objects are fulfilled by an audio system comprising a signal processor for processing an audio signal, and a feedback suppressor circuit configured for modelling a feedback signal path of the audio system by provision of a feedback compensation signal based on sets of feedback model parameters for the feedback signal path that are stored in a repository for storage of the sets of feedback model parameters.
In one embodiment, the audio system comprises a hearing aid with a microphone for converting sound into an audio signal, the signal processor for processing the audio signal, and a receiver that is connected to an output of the signal processor for converting the processed audio signal into a sound signal. The hearing aid further includes the feedback suppressor circuit configured for modelling a feedback signal path of the hearing aid by provision of the feedback compensation signal based on sets of feedback model parameters for the feedback signal path that are stored in the repository for storage of the sets of feedback model parameters.
In a conventional feedback cancellation circuit with one or more adaptive filters, the filter coefficients of the adaptive filter(s) are adjusted in accordance with an algorithm that strives to minimize an error function. Thus, when a feedback signal path of the audio system has been stable for some time, the filter coefficients will reach substantially constant values that correspond to the current feedback signal path. However, when the feedback signal path changes, the algorithm changes the filter coefficients in order to adapt the filter coefficients to the new feedback path and thus, the set of filter coefficients corresponding to the previous stable feedback signal path is lost. Hence, if this feedback signal path occurs again, the corresponding filter coefficients have to be re-calculated by repeated adaptation.
In an embodiment, previous sets of filter coefficients corresponding to respective feedback signal paths are stored in the repository. When one of the feedback signal paths recurs, the corresponding set of filter coefficients is loaded into a digital filter or another digital signal processing circuit that provides the feedback compensation signal.
As further explained below, a detector may be provided for detecting whether a previous feedback signal path is recurring, for example including an environment detector and an environment classifier indicating whether or not the set of feedback model parameters currently used by the feedback suppressor circuit for provision of the feedback compensation signal should be replaced by another set from the repository.
In general, according to some embodiments, previous sets of feedback model parameters corresponding to respective feedback signal paths are stored in the repository. When one of the feedback signal paths recurs, the corresponding set of feedback model parameters is used by the feedback suppressor circuit that provides the feedback compensation signal.
In this way, the feedback suppressor circuit exhibits low steady-state error in combination with a fast transient response to a change of the feedback signal path.
Some or all sets of feedback model parameters stored in the repository may be updated during normal use of the audio system.
Some or all sets of feedback model parameters, e.g. sets of filter coefficients of a digital filter, e.g. an adaptive digital filter, stored in the repository, may correspond to frequently occurring feedback signal paths for which feedback model parameters may be obtained and updated during normal use of the audio system.
Some or all sets of feedback model parameters may be obtained during a learning period of the audio system.
Some or all sets of feedback model parameters may be obtained by other equipment and subsequently entered into the repository, for example during manufacture of the audio system.
For example, in an embodiment, the audio system comprises a hearing aid with a repository for storing a plurality of sets of feedback model parameters. The repository holds a plurality of sets of feedback model parameters and is operatively connected to the feedback suppressor circuit for transfer of a selected set of feedback model parameters from the repository to the feedback suppressor circuit. In one embodiment, the feedback suppressor circuit also has a fast adaptive filter for modelling the current acoustic feedback path of the hearing aid, and its filter coefficients constitute the feedback model parameters. Sets of filter coefficients corresponding to respective stable feedback signal paths are stored in the repository. When a sudden change of the feedback signal path occurs, e.g. when the user brings a phone handset close to the hearing aid, a suitable set of filter coefficients corresponding to the feedback path of that situation is selected from the repository. The selected set of feedback model parameters is then entered into the feedback suppressor circuit for provision of the feedback compensation signal. The feedback compensation signal may for example be provided by a digital filter with filter coefficients constituted by the selected set of feedback model parameters. The digital filter may be an adaptive filter with low steady-state error, wherein the selected set of feedback model parameters is loaded into the adaptive filter and forms a new starting point for the further adaptation, whereby the transient properties of the adaptive filter become of minor importance to the performance of the feedback suppressor circuit.
As already mentioned, the repository may include sets of feedback model parameters that remain unchanged during normal use of the audio system. In a hearing aid, such feedback model parameters may be entered into the repository when the hearing aid is fitted to the user by a hearing aid dispenser. Some or all of the stored sets of feedback model parameters may be standard sets of feedback model parameters, which have been found to work well for the type of hearing aid in question.
Some of the stored sets of feedback model parameters may be determined during fitting of the hearing aid. For example during fitting, a number of sets of feedback model parameters may be available for modelling the physical feedback path of one or more different situations, such as a situation where the user makes use of a mobile phone, which is placed close to the ear. During fitting, the most suitable sets of feedback model parameters are selected from the available sets for the actual hearing aid and user and the selected sets are stored in the repository.
The repository may include a plurality of sets of feedback model parameters, which are updated during operation of the audio system. The updating and storing of sets of feedback model parameters during use of the audio system may for example be performed using cluster based learning techniques as described in the following.
Further, the system may comprise a user interface allowing the user to command the system to store a current set of feedback model parameters in the repository, e.g. when an object, such as a mobile phone, a neck rest of a chair, a child, a side window of a car, etc., is placed close to the ear of a user of a hearing aid. When the user perceives that the system has attained optimum performance in such a situation, the user may command the system, e.g. by pressing a push button, to store the present set of feedback model parameters, or a set of feedback model parameters derived therefrom, in the repository. The audio system may further be configured for evaluation of the set of feedback model parameters to be stored in the repository and for storing the set of feedback model parameters only when certain criteria are fulfilled, for example that the variation of the values of the set of feedback model parameters remains below a certain threshold or that the set fulfils other quality measures.
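The storage criterion mentioned above, that the variation of the parameter values stays below a threshold, can be sketched as follows. This is a minimal illustration only: the names `is_stable` and `store_if_stable`, the use of the population standard deviation as the variation measure, the threshold value, and the choice to store the average of recent snapshots (a set "derived" from the present set) are all assumptions made for the example.

```python
import statistics


def is_stable(snapshots, threshold):
    """Return True when every coefficient varied less than `threshold`
    (population standard deviation) across the recent snapshots."""
    if len(snapshots) < 2:
        return False  # not enough history to judge variation
    return all(statistics.pstdev(column) < threshold for column in zip(*snapshots))


def store_if_stable(repository, label, snapshots, threshold=0.01):
    """Store the averaged coefficient set under `label` only if it is stable.

    `repository` is a plain dict here (hypothetical stand-in for the
    repository of feedback model parameters). Returns True if stored.
    """
    if not is_stable(snapshots, threshold):
        return False
    repository[label] = [statistics.mean(column) for column in zip(*snapshots)]
    return True
```

A push-button handler could call `store_if_stable(repo, "phone", recent_snapshots)` and ignore the request when the coefficients are still moving.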
In addition to the sets of feedback model parameters, the system may also store other information identifying the current feedback path. Subsequently, the system can use this information to determine when a similar feedback path occurs and locate and retrieve the set of feedback model parameters to be used for provision of the feedback compensation signal, for example as a starting point for further adaptation.
A detector may be provided for detecting whether or not the set of feedback model parameters currently used by the feedback suppressor circuit for provision of the feedback compensation signal should be replaced by another set from the repository, and if so, the detector may further be configured for selecting the set of feedback model parameters to be used from the sets of feedback model parameters stored in the repository.
The detector may for example be a phone detector, such as a magnetic phone detector configured for detecting the presence of a phone in the proximity of the user's ear. A permanent magnet may be positioned on the mobile phone, and the detector may be configured to detect the presence of the magnet, or, the detector may be adapted for detecting the presence of a magnetic field generated by the speaker of a mobile phone.
The detector may comprise one or more proximity sensors configured for detecting whether or not an object which may influence the feedback path of the audio system is present. When such an object is detected, a suitable set of feedback model parameters is selected from the repository for use by the feedback suppressor circuit for provision of the feedback compensation signal.
The detector may be configured for detecting changes in the feedback path of the audio system thereby detecting situations in which the set of feedback model parameters currently used by the feedback suppressor circuit may be substituted by another set of feedback model parameters from the repository.
The detector may comprise an environment detector configured for detecting the environment of the audio system, for example the acoustic environment of a hearing aid. The detector may further comprise an environment classifier, for example classifying an acoustical environment of a hearing aid as speech, noise, speech in quiet surroundings, speech in noisy surroundings, babble noise, traffic noise and/or other types of acoustic situations. In a hearing aid, the environment classification may cause a program shift in the signal processor whereby the signal processing may change abruptly. For example, a hearing aid may be able to shift between various programs where different signal processing, such as directionality, noise reduction, etc., is employed and different components may be used, e.g. the hearing aid may or may not make use of a telecoil. Such an abrupt change of the signal processing in a hearing aid may also change the feedback path abruptly due to the change of the transfer function of the hearing aid. For example, when executing one signal processing program, the hearing aid may be closer to an unstable situation than when executing another signal processing program. The feedback suppressor circuit may further be configured for determining a set of feedback model parameters based on the detected environment and the sets of feedback model parameters stored in the repository for modelling the feedback signal path corresponding to the detected environment.
In a preferred embodiment, the hearing aid further comprises a first subtractor for subtracting the feedback compensation signal from the audio signal to form a compensated audio signal supplied to the signal processor.
In some embodiments, an audio system includes a signal processor for processing an audio signal, and a feedback suppressor circuit configured for modelling a feedback signal path of the audio system by provision of a feedback compensation signal based on sets of feedback model parameters for the feedback signal path that are stored in a repository.
The above and other features and advantages will become readily apparent to those skilled in the art by the following detailed description of exemplary embodiments thereof with reference to the attached drawings, in which:
The figures are schematic and simplified for clarity, and are for showing some of the features of the embodiments.
It should be noted that the embodiments shown in the accompanying drawings should not be limited to the configuration shown, and may have different configurations (e.g., different forms) in different embodiments.
Various embodiments are described hereinafter with reference to the figures. It should be noted that the figures are not drawn to scale and that elements of similar structures or functions are represented by like reference numerals throughout the figures. It should also be noted that the figures are only intended to facilitate the description of the embodiments. They are not intended as an exhaustive description of the invention or as a limitation on the scope of the invention. In addition, an illustrated embodiment need not have all the aspects or advantages shown. An aspect or an advantage described in conjunction with a particular embodiment is not necessarily limited to that embodiment and can be practiced in any other embodiments even if not so illustrated.
In the illustrated embodiments, the device is used in connection with adaptive feedback cancellation in hearing instruments, but the device may be used in audio systems with one or more adaptive filters switching between near-stationary states.
Throughout the present disclosure, the expressions feedback cancellation and feedback suppression are used interchangeably. With a feedback cancellation or feedback suppression circuit, the influence of a feedback signal is attenuated and only in rare cases completely eliminated.
A hearing aid with a prior art feedback cancellation circuit is schematically illustrated in
An external signal of interest x is amplified by a signal processor G that provides a processed output signal y. A receiver (not shown) converts the processed output signal into a sound signal after digital to analogue conversion (not shown). Some of the output signal y leaks back to the input and is added to the external signal x in the form of an unknown feedback signal f, e.g. acoustical feedback signals, mechanically coupled feedback signals, electromagnetically coupled feedback signals, etc. In order to compensate for distortions and potential instability caused by this feedback loop, a feedback cancellation or suppression signal c, which attempts to model the signal f, is then subtracted from the input signal to form the error signal e. In the ideal case, c cancels f, e equals x, and the hearing aid is able to provide sufficient amplification without audible distortion or artefacts.
Adaptive filtering techniques are used to form a feedback model W based on an analysis of the signal e. In this case, the filter coefficients constitute the feedback model parameters. A well-known conceptually straightforward technique often denoted “the direct approach” is to minimize the expected signal strength of e. The direct approach is known to provide biased results when the input signal exhibits a long-tailed auto-correlation function. In the case of tonal signals, for example, this typically leads to sub-optimal solutions because the adaptive feedback model will attempt to suppress the external tones instead of modelling the actual feedback. For many naturally occurring signals however this so-called bias problem is not so important because the typical hearing aid processing introduces sufficient delay to de-correlate the output from the input. Modern feedback cancellation systems nevertheless employ a number of additional tricks, such as constrained adaptation and (adaptive) de-correlation, to ensure stability in the presence of tonal input.
The incoming acoustic signal s to the hearing aid
s(n)=x(n)+f(n) (1)
is a sum of the signal of interest x and the distortions caused by feedback signal f. The so called error signal e(n) is obtained by subtracting the cancellation signal c:
e(n)=s(n)−c(n) (2)
which is an approximation of the signal of interest x.
A standard N-taps FIR filter for modelling the feedback path is described by an input vector
$$\vec{d}(n) = [d(n), d(n-1), \ldots, d(n-N+1)]^{T} \tag{3}$$
a weight vector
$$\vec{w}(n) = [w_1(n), w_2(n), \ldots, w_N(n)]^{T} \tag{4}$$
and an inner product
$$c(n) = \vec{w}(n)^{T}\, \vec{d}(n) \tag{5}$$
to obtain the cancellation signal c at each sample n.
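The inner product of equation (5) and the subtraction of equation (2) can be sketched in a few lines. The function names `cancellation_sample` and `error_sample` are hypothetical and chosen only for this illustration.

```python
def cancellation_sample(w, d):
    """Equation (5): c(n) = w(n)^T d(n) for an N-tap FIR feedback model."""
    if len(w) != len(d):
        raise ValueError("weight and input vectors must have the same length")
    return sum(wk * dk for wk, dk in zip(w, d))


def error_sample(s, w, d):
    """Equation (2): e(n) = s(n) - c(n)."""
    return s - cancellation_sample(w, d)
```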
An efficient technique to optimize the FIR filter defined above is the Block Normalized Least Mean Squares (BNLMS) update. BNLMS minimizes the square error criterion over a block of M samples
by calculating the gradient
and the signal power
and combining them with an adaptation rate μ in the update
which is performed once for every M samples.
In a direct approach feedback canceller, the trade-off between a low steady-state error and a sufficient ability to track changes is determined by the adaptation rate μ. Small values of μ favour a low steady-state error while larger values favour good tracking. In practice values of μ are chosen between zero and one (values above one are normally of no use and values above two may even lead to divergence).
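The block update can be sketched as follows. This is one common form of a block-normalized LMS step and is not necessarily the exact formulation intended here: the accumulation of the gradient as the sum of e(n)·d(n) over the block, the normalization by the summed input power, the small regularizer `eps`, and the function name `bnlms_block_update` are assumptions made for the illustration.

```python
def bnlms_block_update(w, d_block, s_block, mu, eps=1e-12):
    """Perform one block-normalized LMS step over M samples.

    w       : list of N filter weights, held fixed during the block
    d_block : list of M input vectors, each of length N
    s_block : list of M input samples s(n)
    mu      : adaptation rate
    eps     : small regularizer (assumption) avoiding division by zero
    Returns (new_weights, errors).
    """
    n_taps = len(w)
    gradient = [0.0] * n_taps
    power = eps
    errors = []
    for d, s in zip(d_block, s_block):
        c = sum(wk * dk for wk, dk in zip(w, d))   # cancellation signal c(n)
        e = s - c                                   # error signal e(n)
        errors.append(e)
        for k in range(n_taps):
            gradient[k] += e * d[k]                 # accumulate block gradient
        power += sum(dk * dk for dk in d)           # accumulate signal power
    new_w = [wk + mu * gk / power for wk, gk in zip(w, gradient)]
    return new_w, errors
```

In this sketch a small `mu` averages the gradient over many blocks, which lowers the steady-state error, while a `mu` near one follows each block closely, which improves tracking.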
Noticeable changes of the sound environment of the hearing aid, and thereby of the feedback path, are typically caused by activities such as chewing, yawning, placing a phone to the ear, putting on a hat or scarf, or moving into a different environment such as a car. Some of the dynamics involved are of a slowly varying nature while others exhibit more sudden transients.
In order to illustrate the operation of feedback cancellation circuits, sudden changes in the sound environment and thereby the feedback path of the hearing aid are modelled with a switching linear system with multiple (approximately stationary) states as schematically illustrated in
In its simplest form the feedback model is switching between two states. As an example, the performance is shown of a direct-approach feedback canceller with a feedback path that is switching between a feedback path where a phone is placed to the ear and a feedback path where the phone is removed. In the simulation the switching is performed instantaneously every 4 seconds. The external signal x is stationary white noise and the adaptive FIR filter of the feedback model uses 32 coefficients and a constant bulk delay. A linear gain, a dc-filter, and a hard clipper constitute the hearing aid processing. The gain is set at the maximum stable gain level without feedback cancellation for the worst of the two feedback paths. The NLMS block update is performed on blocks of 24 samples. In the simulation, shadow filtering is used to calculate the ideal response (the so-called shadow filtering runs in a separate branch where the feedback signal f and the cancellation signal c are both removed) and compare that to the actual signal e.
When the feedback path switches (at 4, 8, and 12 seconds), the fast update is able to respond rapidly. It reaches a stationary SNR level in about one tenth of a second, at about 17 dB, after which there is no further improvement. In contrast, the slow update requires significantly more time to react to the change. It takes roughly one second to reach the same SNR level as the fast update, but eventually reaches a much higher SNR level.
According to some embodiments, good tracking properties of the fast update are combined with excellent convergence properties of the slow update in stationary conditions. This is obtained by provision of a repository for storing feedback model parameters of the feedback path for various sound environments, for example filter coefficients of an adaptive filter. When a sound environment occurs for which corresponding feedback model parameters have been stored previously in the repository, modelling of the feedback path may again be performed based on these previously stored parameters whereby fast tracking is maintained without sacrificing the steady-state error. In the prior art, previous feedback model parameters are lost when a new situation occurs with a different feedback signal path. This is further explained below.
In the exemplary embodiment, schematically illustrated in
In case that none of the clusters in the repository adequately matches the actual feedback path, the illustrated embodiment is equipped with a fallback switch to use the fast adaptive filter directly in the signal path as in a conventional feedback canceller.
During update of the clusters, the new set of filter coefficients may be incorporated into an existing cluster, a new cluster may be formed, two existing clusters may be merged, an existing cluster may be divided into two clusters, and/or an existing cluster may be deleted. This is further described below.
Clustering is a process of organizing objects into groups whose members are similar in some way. Thus, a cluster is a collection of objects any of which fulfils a certain criterion for that cluster. For example, the objects may be data that are grouped into clusters in accordance with a distance criterion, i.e. data residing close to each other are grouped into the same cluster. This is called distance based clustering.
It is well known in the art to use the Minkowski metric as a similarity measure, in this case a distance measure. If each data point $x_i$ consists of a set of parameters $(x_{i,1}, x_{i,2}, \ldots, x_{i,d})$, then the Minkowski metric is defined by:
$$d_p(x_i, x_j) = \left( \sum_{k=1}^{d} \left| x_{i,k} - x_{j,k} \right|^{p} \right)^{1/p} \tag{10}$$
wherein d is the dimensionality of the data. The often used Euclidean distance is a special case of the Minkowski metric with p=2. The Manhattan metric is a special case of the Minkowski metric with p=1.
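The Minkowski metric and its two special cases can be illustrated with a short sketch. The function name `minkowski_distance` is hypothetical.

```python
def minkowski_distance(x, y, p=2.0):
    """Minkowski distance of order p between two equal-length vectors.

    p = 2 gives the Euclidean distance, p = 1 the Manhattan distance.
    """
    if len(x) != len(y):
        raise ValueError("vectors must have the same dimensionality")
    if p < 1:
        raise ValueError("p must be at least 1 for a valid metric")
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1.0 / p)
```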
In the following, the similarity measure is called similarity distance to indicate that a small value indicates similarity and that a large value indicates dissimilarity.
Another kind of clustering is conceptual clustering in which a cluster is a collection of objects with a common concept.
Clustering algorithms may be classified into exclusive clustering, overlapping clustering, hierarchical clustering, and probabilistic clustering. In exclusive clustering, a member of a cluster cannot be a member of another cluster. In overlapping clustering, fuzzy logic is used to cluster the members so that members may belong to two or more clusters with different degrees of membership. Hierarchical clustering is based on the union of two nearest (most similar) clusters. At the start of the clustering process, each member defines a cluster and after a few iterations, the desired number of clusters is reached.
One of the best-known traditional clustering algorithms is the k-means algorithm introduced by MacQueen (J. MacQueen: “Some methods for classification and analysis of multivariate observations” in Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability, volume 1, pages 281-297. Berkeley, University of California Press, 1967). The k-means algorithm is an exclusive clustering algorithm and it assigns a data point to the cluster whose centre (also called centroid) is nearest. The centre is the average of all the data points in the cluster, i.e. its coordinates are the arithmetic mean for each separate dimension of all the points in the cluster. It maintains k cluster centres
$$C = [\vec{C}_1, \ldots, \vec{C}_k] \tag{11}$$
each representing the mean of all vectors assigned to that cluster, and the membership counts
$$\vec{M} = [M_1, \ldots, M_k] \tag{12}$$
for the number of vectors assigned to each cluster.
In the illustrated embodiment, the filter coefficients w1 constitute the data points processed by the k-means clustering algorithm. When a new weight vector $\vec{w}$ arrives, the k-means algorithm assigns it to the nearest cluster centre $\vec{C}_n$, determined using a similarity or distance criterion d (for which the Euclidean distance function is typically used), increments the membership count $M_n$ by one, and updates the cluster centre by
$$\vec{C}_n \leftarrow \vec{C}_n + \frac{1}{M_n} \left( \vec{w} - \vec{C}_n \right) \tag{13}$$
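The online assignment and running-mean update can be sketched as follows, assuming the Euclidean distance and the standard MacQueen running-mean form, in which the centre moves toward the new vector by a fraction 1/M of the difference. The function name `macqueen_update` and the in-place list representation are choices made for the illustration.

```python
def macqueen_update(centres, counts, w):
    """Assign w to its nearest centre and update that centre's running mean.

    centres : list of k centre vectors (lists of floats), modified in place
    counts  : list of k membership counts, modified in place
    w       : incoming weight vector
    Returns the index n of the cluster that received w.
    """
    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    n = min(range(len(centres)), key=lambda i: squared_distance(centres[i], w))
    counts[n] += 1
    # Running mean: the centre stays the average of all vectors assigned to it.
    centres[n] = [c + (x - c) / counts[n] for c, x in zip(centres[n], w)]
    return n
```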
In the illustrated embodiment, the MacQueen update of the k-means algorithm is used in connection with a Gaussian mixture model with a shared spherical covariance structure, cf. A. Samé, C. Ambroise, and G. Govaert: “A mixture model approach for on-line clustering” in Compstat 2004, 23-27 Aug. 2004, Prague, Czech Republic. http://eprints.pascal-network.org/archive/00000582/, 2004. The primary advantages of the k-means algorithm, compared to well-known alternatives such as the batch Expectation-Maximization (EM) algorithm, are its simplicity, speed, and low complexity through the use of only first order statistics (e.g., inverse covariance matrices are not needed).
In the Gaussian mixture model, each cluster is a Gaussian with a mixing proportion, mean, and covariance matrix. The Gaussian mixture model makes it possible to find potential solutions (maxima) in between the peaks of each individual cluster.
Further, the covariance information of individual clusters characterizes the clusters in more detail than, e.g., a single characteristic length (which essentially corresponds to a scaled unity covariance matrix).
The feedback suppressor circuit may be configured to share statistical information between clusters, e.g., use one covariance matrix for several or all clusters. This makes the model more efficient because similar clusters can collect statistics at a higher rate. E.g., if the covariance matrix is formed individually for each cluster, it takes significantly more time than if the information is shared. Further, because such a matrix may have to be inverted, sharing the information reduces the risk of singularity problems (where the matrix inversion is unreliable).
In an embodiment, a forgetting factor γ is introduced for the membership counts by performing the update
$$\vec{M} \leftarrow \gamma \vec{M} \tag{14}$$
at each iteration (typically 0<<γ<1). The effect of the forgetting factor is twofold. First it introduces a soft upper bound on the membership counts, which ensures that the update always maintains some minimal amount of adaptivity. In a useful algorithm this is necessary because otherwise the update would eventually freeze. The second effect is that it facilitates the detection of outliers by having a low membership count. Outliers typically get sampled a few times when something radical happens, e.g. the hearing aid is removed from the ear canal by the user, the hearing aid is dropped, the hearing aid is turned on, etc. Feedback model parameters corresponding to such rare events may not be required to be stored indefinitely. Consequently when the cluster membership count falls below some predefined threshold, it can simply be removed from the repository.
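The decay of the membership counts and the removal of rarely visited clusters can be sketched as follows. The function name `apply_forgetting` and the default values of `gamma` and `min_count` are assumptions made for the illustration. In a scheme of this kind, a cluster that receives one vector per iteration has a count that settles on the order of 1/(1-γ), which is the soft upper bound referred to above.

```python
def apply_forgetting(centres, counts, gamma=0.999, min_count=0.5):
    """Decay all membership counts by gamma and drop clusters that fall
    below min_count. Returns new (centres, counts) lists."""
    new_centres = []
    new_counts = []
    for centre, count in zip(centres, counts):
        decayed = gamma * count
        if decayed >= min_count:       # keep clusters that are still supported
            new_centres.append(centre)
            new_counts.append(decayed)
    return new_centres, new_counts
```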
In an embodiment, the clustering includes formation of new clusters, deletion of existing clusters, and merging of clusters. The feedback suppressor circuit may keep track of the distances between cluster centres, specifically tracking the minimum distance dm between the two nearest clusters {right arrow over (Cm
Using this information, updating the cluster centres proceeds to one of the following three cases.
(1) if (Ml<Mmin) & (dn>ασ)
If the minimal membership count $M_l$ is smaller than some minimal value $M_{min}$ (e.g. $M_{min}=1$) and the distance to the nearest cluster $d_n$ is greater than ασ, where α is a tuning parameter (typically between 1 and 3 when σ is an estimate of the standard deviation), then cluster $\vec{C}_l$ is replaced by the incoming vector $\vec{w}$ and its membership count is set to one.
(2) else if (dm<dn)
If the distance between the two nearest cluster centres {right arrow over (Cm
Mmerged=Mm
(3) default
In the case that no clusters are merged or replaced, $\vec{w}$ is assigned to its nearest cluster centre using the original MacQueen update.
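The three cases can be sketched as a single update routine. Several details are assumptions made only for this illustration and are not taken from the description: the merged centre is formed as the membership-weighted mean of the two merged centres, the merged count is the sum of the two counts, the slot freed by the merge is given to the incoming vector with a count of one, and at least two clusters with positive counts are assumed to exist. The function names are hypothetical.

```python
import math


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def update_clusters(centres, counts, w, sigma, alpha=2.0, m_min=1.0):
    """Apply one of the three cases in place and return the case number."""
    k = len(centres)
    n = min(range(k), key=lambda i: euclidean(centres[i], w))   # nearest to w
    d_n = euclidean(centres[n], w)
    l = min(range(k), key=lambda i: counts[i])                  # least populated
    # Closest pair of centres and their distance d_m.
    d_m, m1, m2 = min(
        (euclidean(centres[i], centres[j]), i, j)
        for i in range(k) for j in range(i + 1, k)
    )

    if counts[l] < m_min and d_n > alpha * sigma:
        # Case (1): replace the least populated cluster by the new vector.
        centres[l] = list(w)
        counts[l] = 1.0
        return 1
    if d_m < d_n:
        # Case (2): merge the two nearest clusters (weighted mean, assumption)
        # and start a new cluster at w in the freed slot (assumption).
        total = counts[m1] + counts[m2]
        centres[m1] = [(counts[m1] * a + counts[m2] * b) / total
                       for a, b in zip(centres[m1], centres[m2])]
        counts[m1] = total
        centres[m2] = list(w)
        counts[m2] = 1.0
        return 2
    # Case (3): ordinary MacQueen running-mean update of the nearest cluster.
    counts[n] += 1
    centres[n] = [c + (x - c) / counts[n] for c, x in zip(centres[n], w)]
    return 3
```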
In the following, one way of selecting a set of feedback model parameters from the set of cluster centres stored in the repository is explained. The nearest cluster centre as already identified by the cluster algorithm update may be selected, although it is preferred to take the membership counts into account to avoid that the selected model becomes a newly created cluster too often in which case little or no advantage over the fast adaptive feedback model is obtained.
To overcome this problem, a mixture-of-Gaussians algorithm is utilized, i.e. it is assumed that the probability density function of each cluster is Gaussian. The Gaussian probability density at point $\vec{w}$ in an N-dimensional space around the cluster with mean $\vec{C}_i$ and covariance matrix $R_i$
is given by
$$p(\vec{w} \mid \vec{C}_i, R_i) = \frac{1}{(2\pi)^{N/2}\, |R_i|^{1/2}} \exp\left( -\tfrac{1}{2} (\vec{w} - \vec{C}_i)^{T} R_i^{-1} (\vec{w} - \vec{C}_i) \right) \tag{16}$$
Assuming spherical clusters, with a shared identical diagonal structure of the covariance matrix, $R_i = \sigma^2 I$, equation (16) can be simplified:
$$p(\vec{w} \mid \vec{C}_i, \sigma) = \frac{1}{(2\pi\sigma^2)^{N/2}} \exp\left( -\frac{d(\vec{w}, \vec{C}_i)^2}{2\sigma^2} \right) \tag{17}$$
where d is the Euclidean distance.
As mentioned before, in this exemplified embodiment, σ is estimated to be proportional to the length of vector $\vec{w}$ (i.e., $d(\vec{w}, \vec{0})$). Alternatively, σ can be set as a constant based on prior information about an appropriate cluster scale, or, an individual $\sigma_i$ may be estimated for each cluster.
Under the assumption that the prior probability of a cluster i is characterized by its relative membership count, the likelihood of a cluster i generating the observed vector {right arrow over (w)} is estimated by
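The estimate itself is not reproduced here. If the prior of cluster i is taken as its relative membership count, Bayes' rule would give a posterior of the following form; this is a sketch consistent with the stated assumption, with the index k running over all clusters in the repository, and is not necessarily the exact notation of equation (18).

```latex
P(i) = \frac{M_i}{\sum_k M_k},
\qquad
P(i \mid \vec{w})
  = \frac{P(i)\, p(\vec{w} \mid \vec{C}_i)}{\sum_k P(k)\, p(\vec{w} \mid \vec{C}_k)}
  = \frac{M_i\, p(\vec{w} \mid \vec{C}_i)}{\sum_k M_k\, p(\vec{w} \mid \vec{C}_k)}
```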
In practice, exact knowledge of each probability is not needed. It is only required to identify the cluster with the highest probability. For this purpose, equation (18) is simplified by taking the logarithm and removing all additive constants (everything that came from the denominators and constants of the Gaussian probability density function), leading to an expression having a maximum value for the most likely cluster, which is used as feedback model W1.
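Under the spherical Gaussian model with shared σ and priors proportional to the membership counts, the simplified score presumably takes the form L_i = ln(M_i) − d²(w, C_i)/(2σ²). The Python sketch below selects the cluster that maximizes this score; the function name `select_cluster` and the handling of empty clusters are assumptions made for illustration.

```python
import math

def select_cluster(centres, counts, w, sigma):
    """Return the index of the cluster with the highest log-score
    ln(M_i) - d^2(w, C_i) / (2 sigma^2), or None if all counts are zero."""
    best, best_score = None, -math.inf
    for i, (centre, count) in enumerate(zip(centres, counts)):
        if count <= 0:
            continue  # an empty cluster has zero prior probability
        d2 = sum((x - c) ** 2 for x, c in zip(w, centre))
        score = math.log(count) - d2 / (2.0 * sigma ** 2)
        if score > best_score:
            best, best_score = i, score
    return best
```

With a large σ the membership count dominates, so a well-established cluster is preferred over a slightly nearer but newly created one; with a small σ the distance term dominates and the nearest cluster is selected.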
During use, a new situation may arise for which none of the clusters in the repository provides adequate performance. In this case, the fast adaptive filter is available as a fallback option. The fallback switch operates independently of the assumptions made in the clustering model: it directly compares the feedback cancellation error e1(n) of the signal generated by the most likely model in the repository with the error e2(n) of the signal generated by the fast adaptive model (for a direct approach feedback canceller, the error is simply the power over one block). If e1(n) exceeds e2(n) by some predefined margin, the fallback switch connects the fast adaptive filter for conventional feedback cancellation, and during update of the clusters, the new set may be incorporated into an existing cluster, a new cluster may be formed, two existing clusters may be merged, an existing cluster may be divided into two clusters, and/or an existing cluster may be deleted. Otherwise, the fallback switch connects the digital filter W1 for feedback cancellation.
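The fallback decision can be sketched as a comparison of block powers. In the Python sketch below, the function names and the 3 dB default margin are illustrative assumptions; the text only specifies "some predefined margin".

```python
def block_power(block):
    """Mean power of one block of error samples."""
    return sum(x * x for x in block) / len(block)

def use_fallback(e1_block, e2_block, margin_db=3.0):
    """Return True when the repository model's error power exceeds the
    fast adaptive model's error power by more than margin_db (assumed value)."""
    p1 = block_power(e1_block)   # error of the most likely repository model
    p2 = block_power(e2_block)   # error of the fast adaptive model
    return p1 > p2 * 10.0 ** (margin_db / 10.0)
```

For example, `use_fallback([1.0] * 4, [0.5] * 4)` returns True, since the repository model's error power is about 6 dB higher than that of the fast adaptive model.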
As an example, the experiment explained in connection with
In this example cluster 1 remains small (and unlikely) because there are only two stationary feedback paths. Occasionally it may grow a bit, but since it cannot become sufficiently different from the two big clusters its members are eventually absorbed by one of the big clusters (through the merging operation).
It is an important advantage that the trade-off of prior art feedback cancellation circuits with adaptive filters between static and dynamic performance has been significantly improved.
In some embodiments, the amount of improvement gained depends on (1) the signal to noise ratio, (2) the extent of variation of the sound environment experienced during use of the device, and (3) the ability to represent meaningful clusters.
When applied in feedback suppression, point 1 is influenced by the gain (which sets the balance between the strength of the feedback signal and the external signal). If gain is very high (e.g., 10-20 dB above the Maximum Stable Gain without feedback suppression MSGoff), then the standard adaptive filters have an excellent signal to work with and may already provide adequate performance without a repository. In some embodiments, when the gain is lower (e.g., at or below MSGoff, such as in the example) then the advantage becomes more pronounced. The reason for this is that, especially in poor SNR conditions, standard adaptive filters must average over a longer time frame (or equivalently use a smaller adaptation rate) to obtain a high-quality model estimate. Obviously, when it takes a long time to find a good model, it will be more worthwhile to preserve it in a repository.
Regarding point 2, the extent of variation of the sound environment: if the environment is too stationary, i.e., there is only one signal path, there will not be much benefit in trying to segment the parameter space. If on the other hand the environment is highly non-stationary, with frequent transitions between a variety of feedback paths, then the clustering model may not be appropriate either. Embodiments described herein are well suited to an environment that is stationary most of the time, but occasionally switches between different feedback paths. Typically, a hearing aid with feedback suppression is used in this way. Sudden changes in the feedback path occur when the user of the hearing aid, e.g., picks up a phone, or lays his or her head on a pillow.
Regarding point 3, the ability to represent meaningful clusters: this primarily depends on the distance/dissimilarity criterion and the associated geometry and compactness of the solution space. Thus, it matters whether an FIR representation, an FFT mapping, a transformation to reflection coefficients, or some pre-processing to reduce the dimensionality, e.g., a PCA or LDA mapping, is used. In general the ideal representation must have compact, separable clusters, meaning that the within-scatter (the distances within one cluster) is low and the between-scatter (the distances between clusters) is high. In this respect a raw FIR representation may not be optimal (for example because phase shifts may violate compactness), but nevertheless, the illustrated embodiment has shown that the approach works reasonably well in practice.
A number of additional embodiments are disclosed below.
Further, adaptive non-linear de-correlation may be applied in the signal path. Non-linear de-correlation in the signal path decreases the correlation of the external signal with the hearing aid output. The contribution to the input signal caused by feedback remains equally correlated (because the applied non-linearity is known) so it becomes easier to distinguish feedback from tonal input and consequently the feedback models will improve.
The adaptive non-linear de-correlation may be applied depending on the selected cluster. Non-linear de-correlation in the signal path may lead to perception of distortion, and therefore it may be desirable to apply the non-linearity only for the most problematic feedback paths, which can be identified by the specific parameters and statistics of the cluster.
In the embodiment of
The feedback suppressor circuit may further be configured for maintaining a clustering model of the external signal whereby sensitivity to non-stationary tonal input is reduced. A block diagram of such an embodiment is shown in
In some sound environments, the external signal and background noise have relatively constant characteristics most of the time, but occasionally switch rapidly to different levels. It should be noted that, compared to
For efficiency reasons, a k-means clustering algorithm was used in the illustrated embodiments that only requires calculation of the first order statistics of the clusters. In general however, the performance may be further improved provided that sufficient computational resources are available by incorporating higher order statistics, e.g., co-variances, in the cluster models. For updating the clusters, instead of using the MacQueen update, utilization of one or more iterations of the EM (Expectation Maximization) algorithm may be considered. Further, it is contemplated to utilize a more refined, possibly non-Gaussian, underlying probability density function for the clusters.
In the illustrated embodiments, the most likely model based on a comparison with the fast adaptive filter coefficients is used. An alternative would be to calculate the full least-squares error, either by actually running all models in parallel or by deriving it from the auto- and cross-correlation statistics, and simply select the model with the lowest error. Yet another alternative is to include the fast adaptive filter in the statistical model and, e.g., include a confidence in the observed vector {right arrow over (w)} to avoid switching models when the fast adaptive filter itself is considered unreliable or in a transition state.
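The alternative of deriving the least-squares error from correlation statistics can be sketched as follows, assuming the error signal is e(n) = s(n) − Σ_k w_k d(n−k), so that its mean power is r_ss(0) − 2 wᵀr_sd + wᵀR_dd w. The function names and argument layout are hypothetical.

```python
def ls_error(w, r_ss0, r_sd, R_dd):
    """Mean squared error of model w, computed from statistics only.

    r_ss0: power of the input signal s
    r_sd:  cross-correlations E[s(n) d(n-k)], one per coefficient
    R_dd:  autocorrelation matrix of the delayed output d (list of rows)
    """
    cross = sum(wk * pk for wk, pk in zip(w, r_sd))
    quad = sum(w[i] * R_dd[i][j] * w[j]
               for i in range(len(w)) for j in range(len(w)))
    return r_ss0 - 2.0 * cross + quad

def select_by_ls_error(models, r_ss0, r_sd, R_dd):
    """Index of the candidate model with the lowest least-squares error."""
    return min(range(len(models)),
               key=lambda i: ls_error(models[i], r_ss0, r_sd, R_dd))
```

This avoids running all models in parallel, at the cost of maintaining the autocorrelation matrix and cross-correlation vector.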
Another alternative for selecting the model is not to do a hard selection at all. Instead, the most likely model may be formed by a weighted sum of all the models in the repository.
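A soft selection of this kind might be sketched as below, assuming the weights are the normalized posterior probabilities of a spherical Gaussian model with shared σ and priors proportional to the membership counts. The function name `blended_model` and this choice of weights are illustrative assumptions.

```python
import math

def blended_model(centres, counts, w, sigma):
    """Weighted sum of all cluster centres; weights are proportional to
    M_i * exp(-d^2(w, C_i) / (2 sigma^2)). All counts must be positive."""
    logs = [math.log(m) - sum((x - c) ** 2 for x, c in zip(w, centre))
            / (2.0 * sigma ** 2)
            for centre, m in zip(centres, counts)]
    top = max(logs)                       # subtract the maximum for stability
    weights = [math.exp(v - top) for v in logs]
    total = sum(weights)
    weights = [v / total for v in weights]
    dim = len(centres[0])
    model = [sum(wt * centre[k] for wt, centre in zip(weights, centres))
             for k in range(dim)]
    return model, weights
```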
Further, a history of models selected in previous iterations may be stored, e.g. in the repository for improving the performance. In particular, frequent switching may be prevented in this way, e.g. by smoothing the likelihoods over time.
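One simple way to smooth the likelihoods over time is exponential averaging of the per-cluster log-scores before taking the maximum. The following Python sketch assumes this particular smoother and a smoothing factor of 0.9; neither is specified in the text.

```python
def smooth_scores(previous, current, beta=0.9):
    """Exponentially smooth per-cluster log-scores (beta is an assumed value)."""
    return [beta * p + (1.0 - beta) * c for p, c in zip(previous, current)]

def pick(scores):
    """Index of the cluster with the highest (smoothed) score."""
    return max(range(len(scores)), key=lambda i: scores[i])
```

With `previous = [1.0, 0.0]` and `current = [0.0, 2.0]`, the instantaneous scores favour cluster 1, but the smoothed scores still favour cluster 0, so a single outlying observation does not cause a switch.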
In addition to forming clusters during use, fixed models may also be provided that can be selected in the same way that clusters formed during operation are selected. Of course, such an approach is only feasible when prior information is available, for example by means of an initialization procedure as is typically performed in modern hearing aids.
Further, fixed clusters may be provided, e.g. by storing a limited number of models that once have been dominant for a very long time without the forgetting factor.
Moreover, models used by one user may be combined with models used by other users and stored as models in a repository of a new user.
In some cases, embodiments described herein may also be utilised in a multi-channel hearing aid in which the incoming audio signal is divided into a number of bandpass filtered signals (frequency channels) that are individually processed in the signal processor, e.g. in accordance with the audiogram recorded for the user, i.e. based on the hearing threshold as a function of frequency. The processed bandpass filtered signals are combined, e.g. in a summing circuit, for digital to analogue conversion and conversion to an acoustic signal in the receiver. Likewise, the feedback cancellation circuit may be divided into a number of frequency channels that are individually processed in the feedback suppressor circuit as disclosed above for a single channel. Additionally, the feedback suppressor circuit may be configured for sharing statistics across channels. Feedback path changes in the various frequency channels probably correlate strongly. Consequently, an improved performance may be obtained if, e.g., each cluster represents the combination of all feedback paths, which may for example be achieved by concatenating the filter coefficients.
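Concatenation of per-channel filter coefficients into one cluster vector, and the reverse split, can be sketched as follows. The helper names are hypothetical, and the sketch assumes the per-channel filter lengths are known.

```python
def combine_channels(per_channel):
    """Concatenate per-channel coefficient vectors into one cluster vector."""
    return [c for channel in per_channel for c in channel]

def split_channels(combined, lengths):
    """Split a combined cluster vector back into per-channel filters."""
    out, start = [], 0
    for n in lengths:
        out.append(combined[start:start + n])
        start += n
    return out
```

A cluster centre over the combined vector then switches all channels together, which reflects the expected correlation of feedback path changes across channels.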
In the illustrated embodiment, the fast adaptive feedback filter for determining the vector {right arrow over (w)} of filter coefficients is outside the clustering model. This reduces the complexity of the system. It is also possible to perform inference directly on the observed incoming signal s and outgoing signal y (or d) to directly update all feedback models available in the repository, as well as possibly some signal models for de-correlation (which may be stored in a similar way as the feedback models).
Given an observed input signal s and a (delayed) output signal d, the observations of s and d are characterized by the statistics S. For a linear system S should at least contain information about the autocorrelation of d and the cross-correlations between s and d, but may also contain higher order statistics, e.g., for dealing with non-linear feedback paths, as well as any statistics needed for maintaining a signal model, e.g., for adaptive de-correlation.
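For a linear system, the minimal content of S can be illustrated by block estimates of the autocorrelation of d and the cross-correlation between s and d. The Python sketch below uses simple biased block averages; the function name and the estimator are assumptions, and a practical implementation might instead use recursive averaging.

```python
def correlation_stats(s, d, order):
    """Block estimates of r_dd(k) = E[d(n) d(n-k)] and r_sd(k) = E[s(n) d(n-k)]
    for lags k = 0 .. order-1, from equal-length sample blocks s and d."""
    n = len(s)
    r_dd = [sum(d[t] * d[t - k] for t in range(k, n)) / n for k in range(order)]
    r_sd = [sum(s[t] * d[t - k] for t in range(k, n)) / n for k in range(order)]
    return r_dd, r_sd
```

As a check, if s is a copy of d delayed by one sample and scaled by 0.5, then r_sd(1)/r_dd(0) recovers the factor 0.5.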
A possible design for obtaining the statistics S is shown in
In one embodiment of the feedback cancellation system, a plurality of candidate feedback models Wi is provided. Each candidate feedback model Wi typically contains a set of filter coefficients like the cluster centres, but may also contain a specific design structure, e.g., some models may use longer filters than others. In addition, a plurality of signal models Xj may be provided, which are used internally to distinguish correlations caused by the actual feedback path from correlations inherently present in the external signal (unrelated to the feedback).
Given the observed statistics of the environment, p(S|Wi,Xj) may be calculated, which represents the likelihood that a candidate feedback model i with an external signal model j is responsible for generating the observed statistics. From this, using Bayes' rule, the likelihood of the candidate models is inferred given the observed statistics
Using the fact that the feedback models should be independent of the external signal models (p(Wi,Xj)=p(Wi)p(Xj)), the joint likelihood of feedback model i with signal model j given S is
Since the signal model is only used internally, in order to explain the observed statistics, only the likelihood of the feedback models given S is relevant. It is obtained by summing over all signal models:
which of course becomes simpler for one signal model, e.g. the embodiment of
The most likely feedback model to be used in the signal loop may be selected in various ways. Firstly, a hard selection of the maximum a posteriori (MAP) estimate may be made simply by enumerating over all candidate models and selecting the one maximizing equation (23). It should be noted that p(S) need not be calculated since its function as a scaling factor does not influence determination of the maximum.
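The equations referred to in the preceding paragraphs are not reproduced in this text. Applying Bayes' rule and the stated independence assumption p(Wi,Xj)=p(Wi)p(Xj), the chain would read as sketched below; this is a reconstruction from the surrounding definitions, and the last line presumably corresponds to what the text calls equation (23).

```latex
\begin{aligned}
p(W_i, X_j \mid S) &= \frac{p(S \mid W_i, X_j)\, p(W_i, X_j)}{p(S)}
                    = \frac{p(S \mid W_i, X_j)\, p(W_i)\, p(X_j)}{p(S)} \\[4pt]
p(W_i \mid S) &= \sum_j p(W_i, X_j \mid S)
               = \frac{p(W_i)}{p(S)} \sum_j p(S \mid W_i, X_j)\, p(X_j)
\end{aligned}
```

Since p(S) is the same for every candidate i, the MAP selection only requires comparing p(W_i) Σ_j p(S|W_i,X_j) p(X_j) across the candidates.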
Alternatively, a relative degree of ‘ownership’ may be determined, e.g., proportional to the model likelihood, and the feedback model may be selected as a weighted combination of the models in the repository. A third possibility is to use all clusters in the repositories as components of a (Gaussian) mixture model, and search for a new model W* in a continuous parameter space of feedback models w, to maximize the posterior likelihood
With the last two possibilities the tracking of the feedback path becomes continuous, with the cluster models only being active in the background.
The advantage of this, in contrast to the discrete switching associated with a hard selection, may be that certain repetitively occurring dynamics may be modelled more accurately.
By enumerating all candidate models, the expectations regarding the likelihood of observing the statistics S can be calculated in accordance with:
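The expression is missing from this text; by the law of total probability and the independence assumption it would presumably be p(S) = Σ_i Σ_j p(S|Wi,Xj) p(Wi) p(Xj). A minimal Python sketch of this enumeration, which also returns the posterior of each feedback model, is given below. The function name and the table layout `lik[i][j]` are assumptions.

```python
def model_posteriors(lik, prior_w, prior_x):
    """Enumerate all (feedback model i, signal model j) pairs.

    lik[i][j] holds p(S | W_i, X_j). Returns (posteriors p(W_i | S), p(S)).
    """
    joint = [[lik[i][j] * prior_w[i] * prior_x[j]
              for j in range(len(prior_x))]
             for i in range(len(prior_w))]
    p_s = sum(sum(row) for row in joint)          # marginal likelihood p(S)
    posteriors = [sum(row) / p_s for row in joint]
    return posteriors, p_s
```

The returned p(S) can be compared before and after a candidate model update to judge whether the update improved the models.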
To improve the models, adjustments are desired in such a way that this marginal likelihood is maximized. To this end the candidate models can be updated, incrementally, using one or more of the following operations:
1. Hard assignment: Observed statistics may be classified as belonging to one particular 2-tuple (i, j) of feedback and signal model, in which case only the corresponding feedback and signal models are updated.
2. Soft assignment: Observed statistics may be characterized by some fractional ownership of several feedback and signal models, representing the degrees of certainty when multiple models may have been responsible. In this case all the models are updated relative to their degree of ownership.
3. Merge: Two models may be merged into one. This is typically done when two existing models have become rather similar and a combined model is sufficiently well suited to describe the current situation.
4. Split: A model may be split into two. This could, e.g., be done when a model becomes too general and does not describe the current situation in sufficient detail.
5. Delete: When a model becomes unlikely it may be deleted. This is typically done to get rid of outliers and obsolete knowledge.
6. Create: When a new situation appears a new model may be created.
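Operations 1 and 2 above can be illustrated with a single update routine in which each model receives a fractional ownership (responsibility) of the observation. The Python sketch below extends the running-mean update to fractional counts; this particular update rule and the function name are assumptions, and hard assignment is obtained as the special case where one responsibility equals one and the others are zero.

```python
def soft_update(centres, counts, w, responsibilities):
    """Update every cluster in proportion to its ownership of w (in place).

    responsibilities: non-negative fractions, typically summing to one.
    """
    for i, r in enumerate(responsibilities):
        if r <= 0.0:
            continue                      # model not responsible: unchanged
        counts[i] += r                    # fractional membership count
        step = r / counts[i]
        centres[i] = [c + step * (x - c) for c, x in zip(centres[i], w)]
```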
The effect of any of the operations described above can be assessed by comparing the marginal likelihood p(S) before and after the operation, which enables a search procedure, or the formulation of a set of rules, to perform the operations needed to optimize the models.
It should be noted, though, that it is not necessary to restrict the update to use only the above categorization of operations. Standard optimization techniques, such as the EM algorithm, or any other search procedure that is able to incrementally increase the marginal likelihood, may be considered. In the illustrated embodiments, the total number of clusters has been kept fixed, which implies that the merge, split, delete and create operators are always applied in pairs, e.g., if one cluster is deleted, another cluster is created. In general however, a variable number of clusters is allowed. This can be done by making the assumptions about the model complexity explicit in the above formula, i.e. p(S) becomes p(S|H(imax, jmax)). It is even possible to take this one step further and allow the number of clusters to become infinite. Although practical implementations will only maintain a finite number of clusters, the underlying inference process in a Bayesian mixture model can be done as if there were an infinite number of mixture components, cf. C. Rasmussen: “The Infinite Gaussian Mixture Model” in Advances in Neural Information Processing Systems, MIT Press, 12: 554-560, 2000. An especially appealing property of this is that it elegantly sidesteps the problem of finding the right number of clusters.
In one embodiment, the hearing aid may further comprise an environment detector for detection of the sound environment of the hearing aid, wherein the feedback suppressor circuit is further configured for determining a set of feedback model parameters based on the sound environment detection and the sets of feedback model parameters stored in the repository for modelling the feedback signal path corresponding to the detected sound environment.
The hearing aid processor may further be configured to reduce gain in the signal path depending on the selected feedback path model. Gain reduction is a well-known remedy for oscillation reduction or elimination. Based on the selected cluster, the feedback suppressor circuit may provide an estimate of the strength of the feedback signal for determining whether a gain reduction is appropriate.
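A gain limiter driven by the selected cluster might look like the Python sketch below. It assumes that the energy of the cluster's filter coefficients serves as a crude broadband proxy for the feedback strength, and that the loop gain (forward gain plus feedback strength, in dB) should stay a safety margin below 0 dB. The function names, the proxy, and the 6 dB margin are all illustrative assumptions; a frequency-dependent estimate would be more accurate.

```python
import math

def feedback_strength_db(centre):
    """Broadband feedback strength estimate from coefficient energy (assumed proxy)."""
    energy = sum(c * c for c in centre)
    return 10.0 * math.log10(energy) if energy > 0.0 else -math.inf

def limited_gain_db(requested_gain_db, centre, margin_db=6.0):
    """Reduce the requested gain if the estimated loop gain would come
    within margin_db of 0 dB; otherwise return it unchanged."""
    allowed = -feedback_strength_db(centre) - margin_db
    return min(requested_gain_db, allowed)
```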
The feedback suppressor circuit may further be configured for maintaining a statistical model of the external signal for distinguishing correlations between the hearing aid output and input caused by feedback from correlations already present in the external signal (tonal input) whereby sensitivity to tonal input is reduced.
The feedback suppressor circuit may further be configured to individually process multiple input signals, e.g. provided by two or more microphones, e.g. in order to obtain improved directionality.
The feedback suppressor circuit may further be configured to share information between the multiple input signals for improved directionality. Feedback models become more efficient because changes in the feedback path are likely to be correlated when the microphones are close to each other. By improving the feedback models the algorithms providing the directionality have a better input signal.
The feedback suppressor circuit may further be configured to use a shared signal model, e.g., for adaptive de-correlation, for several or all of the input signals.
The observed external signal from each microphone may be assumed to be nearly identical, except of course with respect to the time of arrival. Utilization of one signal model improves the statistics and hence a better and more reliable estimate of the feedback paths is obtained compared to the situation in which each channel has its own signal model.
The feedback suppressor circuit may further be configured for clustering models that combine the feedback paths of all input signals whereby switching between feedback paths becomes more reliable because changes to one channel should be highly correlated with changes to the other channel(s) assuming the microphones are positioned close to each other.
The feedback suppressor circuit may further take higher order statistics into account to characterize receiver, amplifier, and/or microphone non-linearities in the feedback path whereby performance is improved in, e.g., power devices where the extreme gains may drive the analogue components into saturation, which may be best modeled by a non-linear time-varying feedback path.
The clustering and selected feedback model statistics may be stored in a log. Further, the encountered signal model statistics may be stored in a log.
Hereby, if the user experiences a problem with the device, the user can go back to the dispenser who can then get more detailed information regarding the sound environments and situations that may have been responsible for the problem. This enables a dispenser to provide better service. For example, it may be observed that problems occur when listening to a specific class of signals.
The performance of the feedback suppressor circuit may also be stored in a log.
Statistics on the history of selecting clusters may be stored and these data may be provided to the dispenser for counseling. For each particular cluster, the number of times it was selected may be recorded and optionally its time duration of use, the sound environment in which it was used, such as speech, music, noise, etc., the average modeling errors, etc. Moreover, sets of often used feedback path models can be collected by the dispenser or manufacturer. Useful models of one user may be combined with useful models from other users and used as starting models for a new user.
Presence of a nearby reflection, such as from a phone, may be determined based on the selected cluster whereby certain actions may be triggered for user assistance, e.g., automatically switching to a phone mode, making automatic adjustments in the signal path, such as reducing the gain, etc.
The use of a phone may further be detected based on the current signal model, e.g., as used for adaptive de-correlation whereby detection of presence of a phone may be improved because (1) phones typically use a narrower frequency range than the normal incoming signal, and (2) the predominant signal model during phone listening will have a form characteristic of speech.
Phone detection is useful because it enables the hearing aid to take appropriate measures such as maximizing speech intelligibility when using the phone. It has already been described that the device is able to rapidly track changes caused by picking up a phone in accordance with some embodiments. Further, the presence of a phone is typically associated with an increase in feedback signal strength by roughly 3 to 6 dB, see for example the weights in
Further, it is well known that some speech characteristics can be modeled quite well using Auto-Regressive techniques. The de-correlation filter in
Positioning of the hearing aid, i.e. whether the hearing aid is inserted in the ear canal, removed from the ear canal, or positioned incorrectly in the ear canal, may be detected based on the selected cluster, whereby the operation of the hearing aid may be automatically controlled, e.g. the gains may be temporarily reduced during repositioning of the hearing aid, the hearing aid may be automatically turned off when it is removed from the ear canal, etc.
It is noted that in the illustrated embodiments, the feedback suppression circuit is configured for modelling the external feedback path in an internal feedback loop and to subtract an estimated feedback signal from the input signal in order to compensate for external feedback, such as acoustic feedback. As an alternative, the feedback suppression circuit may be connected in an internal feed-forward path and may, for example, contain adaptive notch filters for gain reduction. Embodiments described herein may be utilized in such types of feedback suppression circuits, which are often called feedback cancellation or feedback suppression systems.
Although particular embodiments have been shown and described, it will be understood that it is not intended to limit the claimed inventions to the embodiments, and it will be obvious to those skilled in the art that various changes and modifications may be made. The specification and drawings are, accordingly, to be regarded in an illustrative rather than restrictive sense. The claimed inventions are intended to cover alternatives, modifications, and equivalents.
Number | Date | Country | Kind |
---|---|---|---|
2008 00525 | Apr 2008 | DK | national |
This application is the national stage of International Application No. PCT/DK2009/000089, filed on Apr. 8, 2009, now pending, which claims priority to and the benefit of U.S. Provisional Patent Application No. 61/043,991, filed on Apr. 10, 2008, and Danish Patent Application No. PA 2008 00525, filed on Apr. 10, 2008, now abandoned, the entireties of all of which are expressly incorporated by reference herein.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/DK2009/000089 | 4/8/2009 | WO | 00 | 12/27/2010 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2009/124550 | 10/15/2009 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5619580 | Hansen | Apr 1997 | A |
5680467 | Hansen | Oct 1997 | A |
6498858 | Kates | Dec 2002 | B2 |
6990193 | Beaucoup et al. | Jan 2006 | B2 |
20010055985 | Matt et al. | Dec 2001 | A1 |
20040125966 | Weidner | Jul 2004 | A1 |
20050047609 | Buchner | Mar 2005 | A1 |
20070206824 | Hellgren et al. | Sep 2007 | A1 |
20070217620 | Zhang | Sep 2007 | A1 |
20070217639 | Stirnemann | Sep 2007 | A1 |
20080212816 | Pedersen et al. | Sep 2008 | A1 |
Number | Date | Country |
---|---|---|
1708543 | Apr 2006 | EP |
1898670 | Mar 2008 | EP |
11205890 | Jul 1999 | JP |
2007037029 | Apr 2007 | WO |
2007053896 | May 2007 | WO |
Entry |
---|
International Search Report and Written Opinion mailed Sep. 19, 2009 for PCT/DK2009/000089. |
Jain et al.: “Data Clustering: A review”. ACM Computing Surveys, ACM, New York, NY. vol. 31, No. 3, Sep. 1, 1999, pp. 264-323, XP002165131. |
Danish Office Action dated Oct. 22, 2008 for Application No. PA 2008 00525. |
International Type Search Report dated Jan. 30, 2009 for DK 200800525. |
English Abstract of JP 11205890. |
English Abstract of JP 01024696. |
J. MacQueen, “Some Methods for Classification and Analysis of Multivariate Observations”, 5-th Berkeley Symposium on Mathematical Statistics and Probability, 1967, pp. 281-297, vol. 1, University of California Press, Berkeley. |
Allou Same et al., “A Mixture Model Approach for On-Line Clustering”, Symposium in Compstat, 2004, pp. 1-7, Prague, Czech Republic. |
Carl Edward Rasmussen, “The Infinite Gaussian Mixture Model”, Advances in Neural Information Processing Systems 12, 2000, pp. 554-560, MIT Press. |
Notification of Reexamination dated Apr. 27, 2015, for corresponding Chinese Patent Application No. 200980120548.7, 15 pages. |
Number | Date | Country | |
---|---|---|---|
20110103613 A1 | May 2011 | US |
Number | Date | Country | |
---|---|---|---|
61043991 | Apr 2008 | US |