Active noise cancellation decisions using a degraded reference

Information

  • Patent Grant
  • 9099077
  • Patent Number
    9,099,077
  • Date Filed
    Wednesday, February 8, 2012
    12 years ago
  • Date Issued
    Tuesday, August 4, 2015
    9 years ago
Abstract
Active noise cancellation (ANC) circuitry is coupled to the input of an earpiece speaker in a portable audio device, to control the ambient acoustic noise outside of the device and that may be heard by a user of the device. A microphone is to pickup sound emitted from the earpiece speaker, as well as the ambient acoustic noise. Control circuitry deactivates the ANC in response to determining that an estimate of how much sound emitted from the earpiece speaker has been corrupted by noise indicates insufficient corruption by noise. In another embodiment, the ANC decision is in response to determining that an estimate of the ambient acoustic noise level is greater than an estimate of the anti-noise produced by the ANC. Other embodiments are also described and claimed.
Description
BACKGROUND

Mobile phones enable their users to conduct conversations in many different acoustic environments, some of which are relatively quiet while others are quite noisy. The user may be in a particularly hostile acoustic environment, that is, with high background or ambient noise levels, such as on a busy street or near an airport or train station. To improve intelligibility of the far-end user's speech to the near-end user who is in a hostile acoustic environment (i.e., an environment in which the ambient acoustic noise or unwanted sound surrounding the mobile phone is particularly high), an audio signal processing technique known as active noise cancellation (ANC) can be implemented in the mobile phone. With ANC, the background sound that is heard by the near-end user through the ear that is pressed against or that is carrying an earpiece speaker, is reduced by producing an anti-noise signal designed to cancel the background sound, and driving the earpiece speaker with this anti-noise signal. Such ambient noise reduction systems may be based on either one of two different principles, namely the “feedback” method, and the “feed-forward” method.


In the feedback method, a small microphone is placed inside a cavity that is formed between the user's ear and the inside of an earphone shell. This microphone is used to pickup the background sound that has leaked into that cavity. An output signal from the microphone is coupled back to the earpiece speaker via a negative feedback loop that may include analog amplifiers and digital filters. This forms a servo system in which the earpiece speaker is driven so as to attempt to create a null sound pressure level at the pickup microphone. In contrast, with the feed-forward method, the pickup microphone is placed on the exterior of the earpiece shell in order to directly detect the ambient noise. The detected signal is again amplified and may be inverted and otherwise filtered using analog and digital signal processing components, and then fed to the earpiece speaker. This is designed to create a combined acoustic output that contains not just the primary audio content signal (in this case the downlink speech of the far-end user) but also a noise reduction signal component. The latter is designed to essentially cancel the incoming ambient acoustic noise, at the outlet of the earpiece speaker. Both of these ANC techniques are intended to create an easy listening experience for the user of a portable audio device who is in a hostile acoustic noise environment.


SUMMARY

In one embodiment of the invention, a portable audio device has an earpiece speaker with an input to receive an audio signal, and a first microphone to pickup sound emitted from the earpiece signal, and any ambient or background acoustic noise that is outside of the device but that may be heard by a user of the device. The device also includes ANC circuitry that is coupled to the input of the earpiece speaker, to control the ambient acoustic noise. An estimate of how much sound emitted from the earpiece speaker has been corrupted by ambient acoustic noise is computed, by computing a degraded audio reference signal, which is an estimate of the audio signal as it has been corrupted by the ambient acoustic noise. Control circuitry then determines whether this estimate indicates insufficient corruption by noise, in which case it will deactivate the ANC circuitry. This will help preserve battery life in the portable device, since in many instances the acoustic environment surrounding the user of a portable audio device is not hostile, i.e. it is relatively quiet such that running ANC provides no user benefits.


If, however, the estimate indicates sufficient corruption by noise (e.g., when the user is in a hostile acoustic environment), then a decision is made to not deactivate the ANC circuitry. In other words, the ANC circuitry is allowed to continue to operate if the estimate indicates that there is sufficient corruption by ambient acoustic noise.


In one embodiment, estimates of the ambient acoustic noise and the degraded audio reference signal are smoothed in accordance with subjective loudness weighting and then averaged, before computing a signal to noise ratio and then making the threshold decision as to whether to deactivate or activate the ANC. The subjective loudness weighting may be filtered so that only the frequencies where ANC is expected to be effective are taken into account (when determining the SNR). For example, in some cases, effective noise reduction by the ANC may be limited to the range 500-1500 Hz. Also, the decision whether to activate or deactivate the ANC may be taken only after having introduced hysteresis into the threshold SNR values, to prevent rapid switching of the decision near the threshold.


In another embodiment, if the estimated ambient acoustic noise is deemed to be louder than a threshold, then ANC is activated (or is not deactivated), thereby allowing the ANC to continue reducing unwanted ambient sound. The threshold may be based on an actual (measured) or expected (computed) strength of an audio artifact that is induced by the ANC in sound emitted from the earpiece speaker. This artifact is caused by operation of the ANC circuitry, and is some times referred to as a “hiss” that can be heard by the user. If more hiss is being heard by the user than noise that needs to be canceled, then the ANC circuitry is deactivated. This reflects the situation where the ANC circuitry is not providing sufficient user benefit and thus may be shutdown to save power.


In accordance with another embodiment of the invention, a method for performing a call or playing an audio file or an audio stream using a portable audio device, may proceed as follows. ANC circuitry in the device is activated, to control ambient acoustic noise during the call or playback. An estimate of how much sound emitted from an earpiece speaker of the device has been corrupted by the ambient acoustic noise is computed. A determination is then made whether the estimate indicates insufficient corruption by noise, in which case the ANC circuitry is deactivated. On the other hand, if the estimate indicates sufficient corruption by noise, then the ANC circuitry is allowed to continue operation in an attempt to reduce the unwanted ambient sound. The estimate may be computed as signal to noise ratio (SNR), which may refer to a downlink speech signal or an audio signal produced when playing an audio file or an audio stream. As an alternative to computing the estimate of how much of the desired sound has been corrupted by noise, an estimate of the ambient acoustic noise may be computed and then compared to a threshold, to determine whether the noise would be deemed louder than a threshold (e.g., a hiss threshold); if so, then ANC is activated (or is not deactivated), thereby allowing the ANC to continue reducing unwanted ambient sound.


In one embodiment, the ANC circuitry may be deactivated by setting the tap coefficients of a digital anti-noise filter (whose output feeds the earpiece speaker) to zero, so that essentially no signal is output by the filter. In addition, the deactivation of the ANC circuitry may also include at the same time disabling an adaptive filter controller that normally updates those tap coefficients, so that the tap coefficients are no longer being updated.


In an alternative embodiment, the ANC circuitry may be deactivated by disabling the adaptive filter controller so that the tap coefficients of the anti-noise filter are no longer being updated (e.g., freezing the adaptive filter, so that although some signal is output by the anti-noise filter, the latter is not changing and the controller is not computing any updates to it).


In yet another embodiment of the method for performing a call or playing an audio file or audio stream using the portable audio device, the ANC circuitry is not activated during the call or playback, until a determination has been made that there is sufficient corruption, due to the presence of ambient acoustic noise, of the sound being emitted from the earpiece speaker. Thereafter, an estimate of how much sound emitted from the earpiece speaker (during the call or playback) is being corrupted is again computed, and if there is insufficient corruption by the ambient acoustic noise then the ANC circuitry is deactivated.


The above summary does not include an exhaustive list of all aspects of the present invention. It is contemplated that the invention includes all systems and methods that can be practiced from all suitable combinations of the various aspects summarized above, as well as those disclosed in the Detailed Description below and particularly pointed out in the claims filed with the application. Such combinations have particular advantages not specifically recited in the above summary.





BRIEF DESCRIPTION OF THE DRAWINGS

The embodiments of the invention are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” embodiment of the invention in this disclosure are not necessarily to the same embodiment, and they mean at least one.



FIG. 1 depicts a mobile communications device in use by a user in a hostile acoustic environment.



FIG. 2 is a block diagram of system for making ANC decisions in an audio device based on estimates of signal and noise.



FIG. 3A is a block diagram of an algorithm for the control process or circuitry that makes the decision whether to activate or deactivate ANC, based on signal and noise estimates.



FIG. 3B is a block diagram of an algorithm used in the control process (or implemented in the decision control circuitry) that makes the decision whether to activate or deactivate ANC, based on only an estimate of the ambient acoustic noise.



FIG. 4 is a plot of intelligibility versus SNR for sentences and single-syllable words.



FIG. 5 is a block diagram of feed forward ANC and ANC decision control based on signal and noise estimates.



FIG. 6 is a block diagram of feedback ANC and ANC decision control based on signal and noise estimates.



FIG. 7 depicts an algorithm or process for ANC decision making.



FIG. 8 depicts another algorithm for ANC decision making, based on computing the strength of ambient noise and comparing it to a hiss threshold.





DETAILED DESCRIPTION

Several embodiments of the invention with reference to the appended drawings are now explained. While numerous details are set forth, it is understood that some embodiments of the invention may be practiced without these details. In other instances, well-known circuits, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.



FIG. 1 depicts a portable audio device 2, here a mobile communications device, in use by a near-end user in a hostile acoustic environment. The near-end user is holding the portable audio device 2, and in particular, an earpiece speaker 6, against his ear, while conducting a conversation with a far-end user. The conversation occurs generally in what is referred to as a “call” between the near-end user's portable audio device 2 and the far-end user's audio device 4. The call or communications connection or channel, in this case, includes a wireless segment in which a base station 5 communicates using, for instance, a cellular telephone protocol, with the near-end user's device 2. In general, however, the ANC decision making mechanisms described here are applicable to other types of handheld, battery-powered audio devices including portable audio communication devices that use any known types of networks 3 including wireless/cellular and wireless/local area network, in conjunction with plain old telephone system (POTS), public switched telephone network (PSTN), and perhaps one or more segments over high speed Internet connections (e.g., using voice over Internet protocol).


During the call, the near-end user would hear some of the ambient acoustic noise that surrounds him, where the ambient acoustic noise may leak into the cavity that has been created between the user's ear and the shell or housing behind which the earpiece speaker 6 is located. In this monaural arrangement, the near-end user can hear the speech of the far-end user in his left ear, but in addition may also hear some of the ambient acoustic noise that has leaked into the cavity next to his left ear. The near-end user's right ear is completely exposed to the ambient noise.


As explained above, an active noise cancellation (ANC) mechanism operating within the audio device 2 can reduce the unwanted sound that travels into the left ear of the user and that would otherwise corrupt the primary audio content, which in this case is the speech of the far-end user. In some cases, however, ANC imparts little apparent improvement on speech intelligibility, particularly where the signal-to-noise ratio (SNR) at the user's ear is greater than a certain threshold (as discussed below). Moreover, ANC induces audible artifacts that can be heard by the user in relatively quiet environments. The various embodiments of the invention make decisions on activation and deactivation of ANC in a way that helps reduce the presence of such audible artifacts and conserves power, when it has been determined that the ANC would not be of substantial benefit to the user.


Turning now to FIG. 2, a block diagram of a system for making ANC decisions in an audio device based on estimates of signal and noise is shown. An ANC block 10 (also referred to as ANC circuitry 10) generates an anti-noise signal, an(k), that is combined with the desired audio signal by a mixer 12, before being fed to the input of the earpiece speaker 6. This may be an entirely conventional feedback or feed forward ANC mechanism. In accordance with an embodiment of the invention, an ANC decision control block 11 determines whether to activate or deactivate the ANC block 10, based on computed or estimated values for a so-called degraded reference, s′(k)+n′(k), and noise, n′(k). The computed degraded reference s′(k)+n′(k) represents the noisy audio signal (or the primary audio degraded by the ambient acoustic noise) as it might be heard by a user without the effect of the ANC block 10. Note that the estimated clean or primary audio signal by itself, namely s′(k), may or may not be used, by the decision control block 11, when performing calculations used to make the decision.


The references to s′(k), s′(k)+n′(k), and n′(k) are used here to represent a time sequence of discrete values, as the signal processing operations performed on any audio signals by the blocks depicted in this disclosure are in the discrete time domain. More generally, it is possible to implement some or all of the functional unit blocks in analog form (continuous time domain). However, it is believed that the digital domain is more flexible and more suitable for implementation in modern, consumer electronic audio devices, such as smart phones, digital media players, and desktop and notebook personal computers.


The signal and noise estimates are computed by noise measurement circuitry 23, which includes an error microphone 8 that is located and oriented in such a manner as to pickup both (a) sound emitted from the earpiece speaker 6 and (b) the ambient acoustic noise that has leaked into the cavity or region between the handset housing or shell (not shown) that is in front of the earpiece speaker 6 and the user's ear. The error microphone 8 may be embedded in the housing of a cellular handset in which the earpiece speaker 6 is also integrated, directed at the cavity formed by the user's ear and the front face earpiece region of the handset, i.e. located close to the earpiece speaker and far from the primary or talker microphone (not shown) that is used to pickup the near-end user's speech. This combination of the earpiece speaker 6 and the error microphone 8, along with the acoustic cavity formed against the user's ear, is referred to as the system or plant that is being controlled by the ANC circuitry 10; the frequency response of this system or plant is labeled F. A digital filter models the system or plant F, and is described as having a frequency response F′, an instance of which appears in the noise measurement circuitry 23 as a filter 13, is shown. A signal picked up by the microphone is fed to a differencing unit 18 whose other input receives a signal from the output of the filter 13. This allows the output of the differencing unit 18 to provide an estimate of the ambient acoustic noise, n′(k), while the output of another filter 17 (being a second instance of F′) provides an estimate of the primary or desired audio signal, s′(k) (here, the downlink speech signal).


The noise measurement circuitry 23 also includes an instance of F′, namely a filter 9, whose input receives the anti-noise signal (and not the audio signal). After passing through the filter 9, and becoming in effect an estimate of the anti-noise, the estimated anti-noise is fed to an input of a differencing unit 22; another input of the differencing unit 22 is coupled to the output of the error microphone (8). This arrangement produces the degraded reference s′(k)+n′(k), at the output of the differencing unit 22.


While FIG. 2 shows that all three estimated signals and noise s′(k), n′(k) and s′(k)+n′(k) could be computed by the noise measurement circuitry 23, in one embodiment of the invention only the signals s′(k) and n′(k) are needed by the ANC decision control circuitry 11 to determine an estimate of how much sound emitted from the earpiece speaker 6 has been corrupted by the ambient acoustic noise (e.g., SNR)—see prior application Ser. No. 12/794,588. In another embodiment, only the degraded reference s′(k)+n′(k) and the estimated noise n′(k) are needed to compute a measure of how much sound that might be heard by the user has been corrupted by the ambient acoustic noise.


The SNR may be calculated in the primarily audible frequency range in which ANC is effective, e.g. at the low end between 300-500 Hz, up to at the high end 1.5-2 kHz. The signal and noise levels may be computed as signal energy within the ANC's effective frequency range and in a finite time interval or frame of the sequences s′(k) and n′(k). If the indication is that there is insufficient corruption by noise (or the SNR is greater than a predetermined threshold), then the ANC circuitry 10 is deactivated, consistent with the belief that ANC in this situation may not be of benefit to the near-end user.


The ANC decision control 11 may alternately determine that its computed estimate (of how much the desired sound has been corrupted by noise) indicates that there is sufficient corruption by noise (e.g., the SNR is smaller than the predetermined threshold). In that case, the ANC circuitry 10 should not be deactivated (consistent with the belief here that the ANC is expected to benefit the near-end user by increasing intelligibility of the far-end user's speech). In a further embodiment of the invention, this decision by the ANC decision control 11 means that the ANC circuitry 10 should be activated (assuming in that case the ANC circuitry 10 was not active at the time of the decision made by the decision control 11.


In yet another embodiment, the ANC decision control 11 makes its decision (regarding whether or not to deactivate the ANC circuitry 10) based only on a comparison between the estimated noise n′(k) and a threshold. In other words, if this estimate indicates that the ambient environment surrounding the user is currently sufficiently noisy (e.g., a computed quantity containing n′(k) is larger than a predetermined threshold value), then the ANC circuitry 10 should be signaled to be deactivated (or, in another instance, remain inactive).


Still referring to FIG. 2, in the embodiment where the earpiece speaker 6 is an integrated “receiver” of a mobile or wireless telephony handset (e.g., a cellular phone, a smart phone with wireless local area network-based Internet telephony capability, and a satellite-based mobile phone), the plant F varies substantially e.g., by as much as 40 decibels, depending on how and whether or not the user is holding the handset earpiece region against their ear. In that case, a fixed model for the transfer function F′ (which appears in the filters 9, 13, 17) may not work to properly determine the signal and noise estimates s′(k) and n′(k). Accordingly, the transfer function F′ should be updated continuously during operation of the handset (e.g., during a call).


The filters 9, 13, 17 (having transfer function F′) may be implemented as digital adaptive filters whose tap coefficients are adapted by an adaptive filter controller 7 according to any suitable conventional algorithm, e.g. least mean squares algorithm. The adaptive filter controller 7 takes as input the audio signal (which is also input to a mixer 12) and the estimate for noise, n′(k), and using, for example, the least mean squares algorithm, conducts an iterative process that attempts to converge the tap coefficients so that very little or no content from the audio signal appears in the output of a differencing unit 21. In other words, the adaptive filter controller 16 adapts the tap coefficients (reflected in filters 9, 13, 17) so that its transfer function F′ will in essence match that of the system or plant F. In practice, there may be a short convergence time needed to obtain such a match (e.g., on the order of one or two seconds, for example), as the plant F changes when the user moves the handset on and off their ear. Therefore, any decision by the ANC decision control block 11 may be conditioned upon a signal from the adaptive filter controller 7 that the modeling of the plant F is up to date or that there is sufficient convergence in the adaptive filter algorithm.


The arrangement depicted in FIG. 2 may be implemented within an audio coder/decoder integrated circuit die (also referred to as a codec chip) that may perform several other audio related functions such as analog-to-digital conversion, digital-to-analog conversion, and analog pre-amplification of microphone signals. In other embodiments, the arrangement of FIG. 2 may be implemented in a digital signal processing codec suitable for mobile wireless communications, where the codec may include functions such as downlink and uplink speech enhancement processing, e.g. one ore more of the following: mixing, acoustic echo cancellation, noise suppression, speech channel automatic gain control, companding and expansion, and equalization. The entire functionality depicted in FIG. 2 may be performed in discrete-time domain, in which analog signals such as the output of an analog microphone have been converted to digital form, and the output signal of the mixer 12 has been converted to analog form prior to being input to the earpiece speaker 6; these well known aspects need not be explicitly described or shown indicated in the figures.


Turning now to FIG. 3A, an algorithm for the ANC decision control 11 (see FIG. 2) is shown, where signal to noise ratio (SNR) is computed and compared to a threshold. The blocks depicted in FIG. 3A may be digital time-domain processing elements, or they may be frequency domain processing elements. Both the signal and noise estimates, s′(k) and n′(k), pass through a smoothing conditioner, which in this case includes a subjective loudness weighting block 12 and an averaging block 14. The loudness weighting block 12 may be a typical filtering operation used when measuring noise in audio systems (e.g., A-weighting, ITU-R 468). The averaging block 14 may implement a typical root mean square or other suitable signal averaging algorithm, e.g. ITU-T G.160, exemplified by the following formula.








y
r



(
k
)


=


1
n






i
=

k
-
n
+
1


k



x
i
2







The output sequences following the loudness weighting and averaging blocks 12, 14 are then used by the threshold decision block 15 to compute the signal to noise ratio by essentially comparing the smoothed noise estimate n″(k) to the smoothed signal estimate s″(k) based on a configurable threshold parameter x as shown in FIG. 3. This block essentially determines whether the sound being emitted from the earpiece speaker 6 has been sufficiently corrupted by the ambient acoustic noise (see FIG. 2) as follows. If the SNR is below a configurable parameter or threshold, then the decision is made to not deactivate the ANC circuitry, or to activate it. That is because in this case, it is expected that ANC is likely to achieve some substantial reduction in the unwanted sound that the user may be hearing. On the other hand, if the SNR is above the threshold, then this suggests that the ambient acoustic environment may be sufficiently quiet such that ANC is likely to provide no benefit to the user and hence should be deactivated or disabled, or not activated or enabled, to save power and avoid unwanted audio artifacts.


The threshold for the SNR comparison may be determined using known information that has been published about the intelligibility of various types of speech being carried by typical communications systems. FIG. 4 depicts the results of such findings. In accordance with an embodiment of the invention, a particular threshold that may be suitable for the ANC decision control 11 is approximately 12 dBA. At 12 dBA, it is expected that single-syllable words are intelligible 80% of the time or more, whereas sentences are intelligible more than 90% of the time. More generally, however, the threshold may be set above 12 dBA or below 12 dBA, with the understanding that by setting the threshold higher, the ambient acoustic noise level needs to be even lower in order to make the decision to deactivate the ANC.


Still referring to FIG. 3A, while this figure shows a comparison between the estimated noise n′(k) and the estimated clean or primary audio signal s′(k), an alternative is to compare the clean signal s′(k) to the estimated noisy signal, s′(k)+n′(k). For instance, if the comparison indicates that the noisy signal isn't “much worse” than the clean signal, then the decision should be to deactivate (or not activate) the ANC circuitry 10. The subjective loudness weighting 12 and averaging 14 could also be applied in this case to the input signals s′(k) and s′(k)+n′(k), before performing the threshold decision 15.


In yet another embodiment depicted in FIG. 3B, the ANC decision control 11 makes its decision (regarding whether or not to deactivate the ANC circuitry 10) based only on a comparison between the estimated noise n′(k) and a threshold. Once again, the subject loudness weighting 12 and averaging 14 may applied here to computed a smoothed version of the estimated noise, namely n″(k), before comparison to a configurable threshold y.


Turning now to FIG. 5, a block diagram of feed forward ANC is shown, together with the noise measurement circuitry 23 and ANC decision control 11 of FIG. 2. In this embodiment of the invention, the ANC circuitry 10 includes a reference microphone 25 that in one embodiment may also be integrated in the handset housing of the portable audio device 2, and is located and oriented so as to pickup the ambient acoustic noise. In other words, the reference microphone 25 is oriented and thus intended to primarily detect the ambient acoustic noise, rather than speech of the near-end user or any sounds being emitted from the earpiece speaker 6. In some cases, the reference microphone 25 will be located farther away from the earpiece speaker 6 than the error microphone 8, or it may be oriented in a different direction than the primary or talker microphone (not shown), which is typically used to pickup the speech of the near-end user. For instance, referring now to FIG. 1, the reference microphone 25 may be directed out of the back face of the handset housing of the portable audio device, in contrast to the earpiece speaker 6, which is directed out of the front face or a bottom side.


The feed forward arrangement of FIG. 5 would also include an anti-noise filter 16 whose input may be coupled to the output of the reference microphone 25, while its output produces the anti-noise signal that feeds the mixer 12. In this embodiment of the invention, the anti-noise filter 16 is adaptive. As such, the ANC circuitry 10 includes an adaptive filter controller 19, which continuously adjusts the tap coefficients of the anti-noise filter 16 in order to achieve the lowest level of total noise in the earpiece cavity. To do so, the adaptive filter controller 19 receives as input a filtered version of the output of the reference microphone 9, using a filter 20 whose transfer function is also F′ which is a model of the actual system or plant F. This is in effect another estimate of the ambient acoustic noise that may be heard by the user. The adaptive filter controller 19, based on these two noise estimates as input, adjusts the anti-noise filter 16 continuously, so as to reduce or minimize the amount of noise in the earpiece cavity (that is, sound picked up by the error microphone 8 with the filtered speech signal, s′(k), subtracted). In one embodiment, a least means square algorithm may also be used for the adaptive filter controller 19 in order to converge on a solution for the tap coefficients of the anti-noise filter 16 that minimizes the estimated noise in the earpiece cavity, n′(k) +an'(k).


It should be noted that although not explicitly depicted in FIG. 5, the modeling of the plant F by the transfer function F′ that appears in filters 9, 13, 17, 20 should be “online”, that is continuously adjusted during operation of the portable audio device 2. Thus, the transfer function F′ is not fixed, but rather varies in order to match the changes that occur in the actual plant F due to the user moving the handset earpiece region on and off their ear.


In contrast to the feed forward mechanism for ANC depicted in FIG. 5, FIG. 6 shows a block diagram of feedback ANC. In this case, the noise measurement circuitry 23 and the mixer 12 are arranged in the same manner as in FIG. 5, except that now the anti-noise signal input to the mixer 12 is generated by an anti-noise digital filter 16 that need not be adaptive and whose input is coupled to receive the noise estimate, n′(k). In one embodiment, the anti-noise filter 16 performs a simple inversion of its input sequence, so as to cancel the unwanted sound (ambient acoustic noise) at the output of the earpiece speaker 6, by generating an inverse of the estimate n′(k). The ANC decision control 11 may operate in the same manner as in FIG. 5, having as possible inputs the estimated noise n′(k), the degraded reference s′(k)+n′(k), and the clean signal s′(k) (not shown). A suitable combination of one or more of those estimated signal and noise estimates may be used to compute a metric that is then compared to a threshold, to make the decision on whether or not to deactivate the anti-noise filter 16. As described above in connection with FIG. 2, this may be done by computing how much sound emitted from the earpiece speaker 6 has been corrupted by the ambient acoustic noise (e.g., an SNR type calculation involving s′(k) and n′(k), or a metric involving s′(k) and s′(k)+n′(k), where the computed SNR value or other metric is then compared to a threshold), or by simply comparing the estimated noise n′(k) to a threshold.


Until now, this disclosure has been referring to the activation and deactivation of the ANC circuitry 10, or the anti-noise filter 16, in a general sense. There may be several different implementations to achieve such activation and deactivation. In one embodiment, the ANC may be deactivated by setting the tap coefficients of the anti-noise filter 16 to zero, so that no signal is output by these filters. This is essentially similar to opening a hard switch that may be inserted between the output of the filter 16 and the input to the mixer 12. In the feedforward embodiment of FIG. 5, this deactivation of the filter 16 may be accompanied by simultaneous disabling of the adaptive filter controller 19 (in the feed forward embodiment depicted in FIG. 5), so that the tap coefficients of the anti-noise filter 16 are no longer being updated. As an example, in the case of an LMS controller, this could be achieved by setting the LMS gain to zero, thereby forcing the controller to stop updating.


In another embodiment, the ANC may be deactivated by only disabling the adaptive filter controller 19 (FIG. 5), so that the tap coefficients of the anti-noise filter 16 are no longer being updated. In that case, some anti-noise signal is output by the anti-noise filter 16, however, the filter transfer function is not changing and the controller 19 is not computing any updates to the filter 16. This may also be referred to as freezing the adaptive filter controller 19.


To activate or reactivate the ANC the deactivation operations described above may be essentially reversed, by, e.g. unfreezing the adaptive filter controller 19 and allowing the tap coefficients of the anti-noise filter 16 to be set by the controller 19, or reverting to a predetermined default in the case of a non-adaptive anti-noise filter 22 (e.g., as may be used in the feedback version depicted in FIG. 6).


Turning now to FIG. 7, an algorithm or process flow for ANC decision making is depicted. Operation begins in a portable audio communications device when a call or playback of an audio file or audio stream begins (block 24). At this point, the ANC circuitry may or may not be activate. Operation continues with block 26 in which an estimate of how much the monaural sound being emitted from the earpiece speaker has been corrupted by ambient acoustic noise (that may be heard by the user) is computed. This is also referred to as computing the SNR.


In some cases, the speech of the near-end user may cause a relatively low SNR to be computed in block 26 possibly due to a side tone signal which may also be input to the mixer 12+43see FIG. 2. Therefore, in one embodiment, block 26 is performed only if the portable audio communications device 2 is in receive (RX) status, that is, no uplink speech is being transmitted. In other words, the decision to deactivate ANC should only be made when the near-end user is not talking (but the far-end user may be talking). This may require obtaining transmit or receive (TX/RX) status of the call, in block 27.


Assuming that the portable audio device is not sending uplink speech (or is in RX status as determined in block 27), then a decision may be made regarding whether there is sufficient corruption (block 28) or there is insufficient corruption (block 30) of the downlink speech signal (by the ambient noise). If there is sufficient corruption (block 28), then the ANC circuitry is activated (block 31). This leads to a reduction in the ambient noise that is being heard by the user, due to an anti-noise signal being driven through the earpiece speaker. The algorithm may then loop back to block 26 after some predetermined time interval, e.g., the next audio frame in s′(k) and n′(k), until the call or playback ends (block 34). At that point, the ANC circuitry can be deactivated (block 35).


In another scenario, after the initial activation of the ANC circuitry in block 31, during the call, the algorithm loops back to block 26 and computes a new estimate of the SNR (or of another suitable metric), during the call. This time, it may be that the ambient acoustic noise level has dropped sufficiently such that there is insufficient corruption of the downlink speech signal (block 30). In response, the ANC circuitry is deactivated (block 33). Accordingly, during a call, the ANC circuitry may be activated and then deactivated several times, depending upon the level of ambient acoustic noise, and how much the downlink speech signal is corrupted as a result.


In another embodiment, still referring to the algorithm of FIG. 7, once the call or playback begins (block 24), the ANC circuitry may be automatically activated to control the ambient noise being heard by the user during the call. The algorithm would then proceed once again with block 26 where it estimates how much the downlink speech is corrupted by the ambient noise, and if there is insufficient corruption (block 30), then the ANC circuitry is deactivated during the call. Thereafter, the algorithm loops back to block 26 to re-compute the signal-to-noise ratio and this time if it encounters sufficient corruption by noise, the ANC circuitry may be reactivated (block 31) during the call.


In FIG. 7, the ANC activation/deactivation decisions may be based on estimates of signal (the clean signal and/or the degraded reference) and in some cases the ambient acoustic noise. In accordance with another embodiment of the invention, the ANC decision control 11 is based on the actual or expected presence of an audio artifact induced by operation of the ANC. This is also referred to as the “hiss threshold” embodiment. This embodiment may use the same noise measurement circuitry 23 and the ANC circuitry 10 of the feed forward or feedback embodiments, except that the ANC decision control block 11 makes a comparison between the estimated ambient acoustic noise (e.g., by itself), and a hiss threshold, to determine if the ambient acoustic noise is louder than any hiss that might be heard by the user. If not, then the ANC should be deactivated.


In one embodiment, the ANC decision control 11 computes the strength of an audio artifact that has been caused or induced by operation of the ANC circuitry 10, and that may be heard by the user in the sound emitted from the earpiece speaker 6. This artifact is some times referred to as a hiss. A threshold level or loudness is used to represent the strength of the audio artifact, and this threshold level may be stored in the device 2 to be accessed by the ANC decision control 11 when comparing to the estimated ambient noise n′(k).


In another embodiment, the ANC decision control 11 determines whether the audio artifact's strength is greater than the estimated level of the ambient acoustic noise n′(k). If the audio artifact is louder than the ambient noise, then the ANC circuitry 10 is deactivated.


In one embodiment, the artifact may present itself above the frequency range in which the ANC is expected to be effective. For instance, the ANC may be effective to reduce noise at a low end between 300-500 Hz, up to a high end of 1.5-2 kHz. The hiss in that case would likely appear above 2 kHz. Thus, if the magnitude of an′(k), which may be computed based on the output of filter 9 in FIG. 5, in a certain band, such as above 2 kHz, is greater than the magnitude of n′(k), which is available at the output of differencing unit 18, then the user is likely hearing more ANC-generated hiss than ambient noise. The decision control 11 would in that case signal the deactivation of the ANC.


An algorithm for ANC decision making based on a comparison of the ambient noise to an expected or actual audio artifact is depicted in FIG. 8. Once a call or playback of an audio file or stream begins (block 40), the ANC circuitry may or may not be automatically activated. At that point, the ambient acoustic noise heard by the user is estimated (block 42). If the estimated ambient noise is “louder” than a hiss threshold (which may be a predetermined threshold that is loaded from memory —block 44), or if it is louder than an estimate of the anti-noise, an′(k), over the same time interval, then the ANC circuitry is in response activated (block 46). On the other hand, if the ambient noise is not loud enough, then the ANC circuitry remains deactivated or is deactivated (block 48).


It should be noted that while the algorithms in FIG. 7 (based on SNR or other suitable metric) and in FIG. 8 (based on a hiss threshold comparison) have been described separately, it is possible to combine both aspects in the ANC decision control. For instance, the decision on whether to deactivate the ANC circuitry as taken in block 33 of FIG. 7 may be verified by making a determination as to whether the estimated ambient noise is louder than the hiss threshold as per FIG. 8.


In accordance with another embodiment of the invention, the decision to deactivate ANC may be made in part or entirely based on having detected that a mobile phone handset is not being held firmly against the user's ear. For example, in a conventional iPhone™ device, there is a proximity detector circuit or mechanism that can indicate when the device is being held against a user's ear (and when it is not). Such a proximity sensor or detector may use infrared transmission and detection incorporated in the mobile phone handset, to provide the indication that the handset is close to an object such as the user's ear. The ANC decision control circuitry in such an embodiment would be coupled to the proximity detector, as well as the ANC circuitry, and would deactivate the latter when the proximity detector indicates that the handset is not being held sufficiently close to the user's ear. The decision to deactivate ANC in this case may be based entirely on the output of the proximity detector, or it may be based on considering both the output of the proximity detector and one or more of the audio signal processing-based techniques described above in connection with, for instance, FIG. 7 or FIG. 8.


As explained above, an embodiment of the invention may be a machine-readable medium (such as microelectronic memory) having stored thereon instructions, which program one or more data processing components (generically referred to here as a “processor”) to perform the digital audio processing operations described above including noise and signal strength measurement, filtering, mixing, adding, inversion, comparisons, and decision making. In other embodiments, some of these operations might be performed by specific hardware components that contain hardwired logic (e.g., dedicated digital filter blocks). Those operations might alternatively be performed by any combination of programmed data processing components and fixed hardwired circuit components.


While certain embodiments have been described and shown in the accompanying drawings, it is to be understood that such embodiments are merely illustrative of and not restrictive on the broad invention, and that the invention is not limited to the specific constructions and arrangements shown and described, since various other modifications may occur to those of ordinary skill in the art. For instance, the error microphone 8 may instead be located within the housing of a wired or wireless headset, which is connected to a smart phone handset. The description is thus to be regarded as illustrative instead of limiting.

Claims
  • 1. A portable audio device comprising: an earpiece speaker having an input to receive an audio signal;active noise cancellation (ANC) circuitry to provide an anti-noise signal at the input of the earpiece speaker to control ambient acoustic noise outside of the device that is heard by a user of the device; andnoise measurement circuitry having a first input coupled to an output of a first microphone and a second input coupled to receive the anti-noise signal and not the audio signal, the first microphone to pick up (a) sound emitted from the earpiece speaker and (b) the ambient acoustic noise, the noise measurement circuitry including a first filter that models the earpiece speaker and the first microphone, wherein the anti-noise signal and not the audio signal is to pass through the first filter, and a first differencing unit having a first input coupled to the output of the first microphone and a second input coupled to an output of the first filter, and an output that provides a degraded audio reference signal; andcontrol circuitry coupled to receive the degraded audio reference signal and to deactivate the ANC circuitry in response to using the degraded audio reference to determine that sound emitted from the earpiece speaker has not been sufficiently corrupted by noise.
  • 2. The portable audio device of claim 1 wherein the noise measurement circuitry generates an estimate of the ambient acoustic noise using a) a signal from the output of the first microphone, b) the anti-noise signal, and c) the audio signal, and wherein the ANC circuitry comprises an anti-noise filter that inverts a signal at its input, the input of the anti-noise filter being coupled to receive the estimate of the ambient acoustic noise.
  • 3. The portable audio device of claim 1 wherein the ANC circuitry comprises a second microphone to pick up the ambient acoustic noise, wherein the first microphone is positioned closer to the earpiece speaker than the second microphone, and an adaptive filter that generates the anti-noise signal using a representation of the ambient acoustic noise as picked up by the second microphone.
  • 4. The portable audio device of claim 1 wherein the control circuitry is to calculate a ratio of the degraded reference signal to an estimate of the ambient acoustic noise (SNR), and wherein the control circuitry is to deactivate the ANC circuitry when the calculated SNR is above a predetermined threshold.
  • 5. The portable audio device of claim 1 wherein the noise measurement circuitry comprises: a third filter that models the earpiece speaker and the first microphone, wherein the audio signal and not the anti-noise signal pass through the third filter; andwherein the control circuitry has inputs coupled to the outputs of the third filter and the first differencing unit.
  • 6. The portable audio device of claim 5 wherein the control circuitry comprises: a smoothing conditioner to smooth the signal from output of the first differencing unit; anda decision circuit to receive the smoothed signal, the decision circuit having an output that indicates whether or not the ANC circuitry is to be deactivated.
  • 7. The portable audio device of claim 6 wherein the control circuitry is to deactivate the ANC circuitry when the smoothed signal is above a predetermined threshold.
  • 8. The portable audio device of claim 1 wherein the noise measurement circuitry comprises: a second filter that models the earpiece speaker and the first microphone, wherein the audio signal combined with the anti-noise signal pass through the second filter; anda second differencing unit having a first input coupled to the output of the first microphone and a second input coupled to an output of the second filter.
  • 9. The portable audio device of claim 8 wherein the noise measurement circuitry comprises: a third filter that models the earpiece speaker and the first microphone, wherein the audio signal and not the anti-noise signal pass through the third filter,wherein the control circuitry has inputs coupled to the outputs of the third filter and the first and second differencing units.
  • 10. A method for performing a call using a portable audio communications device comprising: activating active noise cancellation (ANC) circuitry to control ambient acoustic noise during the call using an anti-noise signal;computing a degraded reference, being an estimate of an audio signal containing downlink speech of the call that has been corrupted by the ambient acoustic noise, by removing a filtered version of the anti-noise signal from a microphone signal;determining that an estimate of how much sound emitted from an earpiece speaker of the device has been corrupted by said ambient acoustic noise indicates insufficient corruption by noise, using the degraded reference; anddeactivating the ANC circuitry in response to the determination.
  • 11. The method of claim 10 wherein the determining comprises computing a signal to noise ratio using the degraded reference and an estimate of the ambient acoustic noise.
  • 12. The method of claim 11 wherein the determining comprises computing the estimate of the ambient acoustic noise by removing a filtered version of the anti-noise signal and a filtered version of the audio signal from the microphone signal.
  • 13. The method of claim 10 wherein the determining comprises computing a metric using the degraded reference and an estimate of the audio signal containing downlink speech.
  • 14. The method of claim 10 wherein the deactivating the ANC circuitry comprises: setting a plurality of tap coefficients of a digital anti-noise filter whose output feeds the earpiece speaker, to zero.
  • 15. The method of claim 14 wherein the deactivating the ANC circuitry further comprises: disabling an adaptive filter controller that updates the tap coefficients, so that the tap coefficients are no longer being updated.
  • 16. The method of claim 10 wherein the deactivating the ANC circuitry comprises: disabling an adaptive filter controller that updates a plurality of tap coefficients of a digital anti-noise filter, so that the tap coefficients are no longer being updated.
  • 17. The method of claim 10, wherein computing the degraded reference comprises computing the filtered version of the anti-noise signal by passing the anti-noise signal through a filter that models the earpiece speaker and an error microphone.
RELATED MATTERS

This application is a continuation-in-part of U.S. patent application Ser. No. 12/794,588, filed Jun. 4, 2010, entitled “Active Noise Cancellation Decisions in a Portable Audio Device, which is currently pending. An embodiment of the invention is related to activation and deactivation of an active noise cancellation (ANC) process or circuit in a portable audio device such as a mobile phone. Other embodiments are also described.

US Referenced Citations (28)
Number Name Date Kind
4811404 Vilmur et al. Mar 1989 A
5251263 Andrea et al. Oct 1993 A
6201866 Ariyama et al. Mar 2001 B1
6278786 McIntosh Aug 2001 B1
6704428 Wurtz Mar 2004 B1
7035796 Zhang et al. Apr 2006 B1
7203308 Kubota Apr 2007 B2
7885417 Christoph Feb 2011 B2
8416959 Lott et al. Apr 2013 B2
8515089 Nicholson Aug 2013 B2
20010036281 Astorino et al. Nov 2001 A1
20040258253 Wurtz Dec 2004 A1
20070076897 Philipp Apr 2007 A1
20070298845 Knoedgen Dec 2007 A1
20080162072 Copley et al. Jul 2008 A1
20090034748 Sibbald Feb 2009 A1
20090046867 Clemow Feb 2009 A1
20090086988 Ou et al. Apr 2009 A1
20090147969 Kinouchi et al. Jun 2009 A1
20100022283 Terlizzi Jan 2010 A1
20100061564 Clemow et al. Mar 2010 A1
20100226505 Kimura Sep 2010 A1
20100260345 Shridhar et al. Oct 2010 A1
20100296668 Lee et al. Nov 2010 A1
20100322430 Isberg Dec 2010 A1
20120140943 Hendrix et al. Jun 2012 A1
20120316872 Stoltz et al. Dec 2012 A1
20140146976 Rundle May 2014 A1
Foreign Referenced Citations (14)
Number Date Country
4200811 Jul 1993 DE
2234881 Feb 1991 GB
2441835 Mar 2008 GB
2455827 Jun 2009 GB
H05011772 Jan 1922 JP
05-011772 Jan 1993 JP
H06318085 Nov 1994 JP
H07240989 Sep 1995 JP
H09037380 Feb 1997 JP
2009141698 Jun 2009 JP
2010019876 Jan 2010 JP
595238 Jun 2004 TW
I279775 Apr 2007 TW
WO-2010022456 Mar 2010 WO
Non-Patent Literature Citations (14)
Entry
PCT International Search Report and Written Opinion (dated Jun. 13, 2012), International Application No. PCT/US2011/038617, International Filing Date—May 31, 2011, (17 pages).
Non-Final Office Action (dated Aug. 6, 2012), U.S. Appl. No. 12/794,588, filed Jun. 4, 2010, First Named Inventor: Guy C. Nicholson, 14 pages.
CN First Office Action (dated Oct. 23, 2013), Application No. 201180021665.5, Date Filed—Oct. 30, 2012, (13 pages).
JP Office Action (dated Jan. 8, 2014), Application No. 2013-508096, Date Filed—May 31, 2011, (7 pages).
“Series G: Transmission Systems and Media, Digital Systems and Networks”, Voice enhancement devices, Amendment 1: Revised Appendix II—Objective measures for the characterization of the basic functioning of noise reduction algorithms, Nov. 2009, ITU-T Telecommunication Standardization Sector of ITU, G.160, (18 pages).
O'Shaughnessy, Douglas, “Speech Communications Human and Machine”, Second Edition, The Institute of Electrical and Electronics Engineers, Inc., New York, USA, copyright 2000, ISBN 0-7803-3449-3, (pp. vii-xv, and 323-336).
Non-Final Office Action (dated Dec. 4, 2012), U.S. Appl. No. 12/794,588, filed Jun. 4, 2010, First Named Inventor: Guy C. Nicholson, (10 pages).
PCT International Preliminary Report on Patentability and Written Opinion (dated Dec. 13, 2012), International Application No. PCT/US2011/038617, International Filing Date—May 31, 2011, (11 pages).
European Search Report (dated Aug. 5, 2013), European Application No. 11724113.3, International Filing Date—May 31, 2011, 5 pages.
PCT Invitation to Pay Additional Fees, (dated Feb. 24, 2012), International Application No. PCT/US2011/038617, International Filing Date—May 31, 2011, (7 pages).
Apple Inc., CN Application No. 201180021665.5 Office Action and Search Report dated Sep. 30, 2014.
Final Office Action (dated Nov. 10, 2014), U.S. Appl. No. 13/895,659, filed May 16, 2013, First Named Inventor: Guy C. Nicholson, 15 pages.
Apple Inc., TW Application No. 100119665, Office Action and Search Reported dated Sep. 19, 2014.
Non-Final Office Action (dated Jul. 1, 2014), U.S. Appl. No. 13/895,659, filed May 16, 2013, First Named Inventor: Guy C. Nicholson, 14 pages.
Related Publications (1)
Number Date Country
20120140917 A1 Jun 2012 US
Continuation in Parts (1)
Number Date Country
Parent 12794588 Jun 2010 US
Child 13369011 US