Aspects of the disclosure here relate to digital audio signal processing techniques for protecting a speaker of a headphone and for improving user experience of the program audio in certain usage cases while it is being used by an acoustic noise cancellation system. Other aspects are also described.
Headphones come in various fit types, such as an over-ear type that partially rests directly against the head and surrounds the ear, an on-ear type that rests against the ear, and an in-ear type that at least partially fits into the ear canal. In the case where the headphone is physically designed to acoustically and passively isolate ambient noise, especially in the case of a sealed in-ear headphone, there is a pocket of air that becomes essentially trapped either entirely in a blocked ear canal or between the ear and the main sound output port of the headphone. This trapped pocket of air induces the so-called occlusion effect, where the wearer perceives a louder and unnatural version of their own voice when talking. It is possible to mitigate this aspect of the occlusion effect when the user is talking, by configuring an acoustic noise cancellation (ANC) system (also referred to as an active noise reduction system) to actively reproduce the user's own voice through the speaker of the headphone.
An aspect of the disclosure here relates to an audio system in a headphone, in which a speaker of the headphone is fed at least the following signals: a first audio signal, from an internal microphone, that has been processed in a feedback path; and a second audio signal, from an external microphone, that has been processed in a feedforward path. The feedforward and feedback paths may be part of an ANC system and can be configured to process the respective audio signals to result in anti-noise produced by the speaker, intended to acoustically cancel any external or ambient noise (undesired sound) that has made its way past the headphone and into the wearer's ear. But the ANC system may overdrive the speaker under certain circumstances, such as loud ambient sound levels typically present, for example, in a pop or rock concert, or high sound pressure levels created within the wearer's ear due to their walking while their ear canal is blocked by the headphone. A speaker is being overdriven when its input audio signal is so strong as to cause its diaphragm or other vibrating, sound radiating element to reach an excursion or displacement limit. This risks damaging the speaker. To mitigate this, the audio system has a digital processor programmed to perform a method for signal processing of the microphone signals of the headphone, as described next.
The method includes the processor filtering the audio signals that are from a first microphone and from a second microphone, thereby producing first and second filtered signals, respectively. These filtered signals may be produced by the adaptive filters of a feedforward and feedback acoustic noise cancellation system (implemented as the programmed processor.) Such filters are configured during a noise cancellation mode of operation, to produce anti-noise signals. The processor then performs a dynamic range control process upon those filtered signals, to produce first and second dynamic range adjusted signals, which it then combines into a single audio signal that drives a speaker (having one or more drivers or sound output transducers) of the headphone. To maintain the headphone listening experience of the user (wearer), the dynamic range control process may be designed to operate with reduced latency and to modify one or both of the filtered signals only when needed to prevent the speaker from reaching its excursion limit. This method is particularly effective in protecting the speaker of a sealing, in-ear type headphone against overdriving that may be caused when the user is walking and when the user is in a loud ambient environment such as a pop concert or a loud restaurant or social club.
In another aspect, referred to as a cascade approach, the dynamic range control includes producing the first DRC adjusted signal by a first low shelf cut filter, and then applying a second low shelf cut filter to the first DRC adjusted signal. A transition frequency of the second low shelf filter is varied based on side chain processing of the first DRC adjusted signal, wherein the transition frequency of the second low shelf filter is higher than the transition frequency of the first low shelf filter. Applying the second low shelf filter (that has a higher transition frequency than the first low shelf filter) further reduces energy of the first filtered audio signal, and is applied only if the first low shelf filter was unable to sufficiently reduce energy of the filtered audio signal.
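The cascade decision described above can be illustrated with a minimal sketch, which is not the disclosed implementation. It works on the level of a single low-frequency tone rather than on audio samples, and it assumes a first-order shelf response. The names `shelf_cut_gain`, `cascade`, `f2_options_hz`, and `floor_gain` are hypothetical and chosen only for illustration.

```python
def shelf_cut_gain(f_hz, transition_hz, floor_gain):
    """Gain at f_hz of a first-order low shelf cut (analog prototype, assumed).

    The gain tends to floor_gain well below transition_hz and to 1 well above.
    """
    s = complex(0.0, f_hz)  # the 2*pi factor cancels in the ratio below
    return abs((s + floor_gain * transition_hz) / (s + transition_hz))


def cascade(tone_hz, level, threshold, f1_hz, f2_options_hz, floor_gain=0.25):
    """Return (level after the cascade, second-stage transition or None)."""
    after_first = level * shelf_cut_gain(tone_hz, f1_hz, floor_gain)
    # Only transition frequencies above that of the first stage are allowed.
    options = sorted(f for f in f2_options_hz if f > f1_hz)
    if after_first <= threshold or not options:
        return after_first, None  # second stage stays bypassed
    for f2_hz in options:
        after_second = after_first * shelf_cut_gain(tone_hz, f2_hz, floor_gain)
        if after_second <= threshold:
            break  # lowest candidate transition frequency that is sufficient
    return after_second, f2_hz
```

For instance, `cascade(30.0, 2.0, 0.7, 40.0, [80.0, 160.0])` engages the second stage because the first stage alone leaves the 30 Hz tone above the threshold, whereas a level of 1.0 leaves the second stage bypassed.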
In yet another aspect, dynamic user experience processing is performed upon one or both of the filtered signals that are produced by the adaptive filters of the feedforward and feedback acoustic noise cancellation systems. This is done so as to improve the user experience in certain usage cases, such as when the user is riding a bus (while wearing the headphone and either listening to program audio or otherwise while the acoustic noise cancellation is active.) Whenever the bus hits a bump, a clicking sound artifact might be heard by the user. In other instances, while riding the bus, the user can hear the program audio as if it is modulated by some low frequency carrier. A system is described that detects infrasound (e.g., frequency content between 1 Hz and 20 Hz) via side chain processing in a feedback or feedforward path of an ANC system, and in response performs dynamic range control upon the signal in that path that attenuates the signal in that path in a dynamic or time-varying manner, for example using a low shelf cut filter.
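As a rough illustration of the infrasound detection described above, the following sketch estimates low-frequency power per frame in the time domain and reduces the gain of the frame when that power exceeds a threshold. It is a sketch under stated assumptions: the one-pole low-pass detector, the fixed `reduced_gain`, and all function names are hypothetical stand-ins, and a simple broadband gain is used here in place of the low shelf cut filter mentioned in the text.

```python
import math


def infrasound_power(frame, sample_rate_hz, state=0.0, cutoff_hz=20.0):
    """Mean-square level of the frame after a one-pole low-pass at cutoff_hz.

    Returns (power, state) so the filter state carries over between frames.
    """
    if not frame:
        return 0.0, state
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / sample_rate_hz)
    acc = 0.0
    for x in frame:
        state += alpha * (x - state)  # one-pole low-pass update
        acc += state * state
    return acc / len(frame), state


def frame_gain(power, threshold, reduced_gain=0.5):
    """Gain for a frame: reduced only when the detected power is too high."""
    return reduced_gain if power > threshold else 1.0


def process(frames, sample_rate_hz, threshold):
    """Apply frame-by-frame gain reduction driven by the side chain detector."""
    state = 0.0
    out = []
    for frame in frames:
        power, state = infrasound_power(frame, sample_rate_hz, state)
        gain = frame_gain(power, threshold)
        out.append([gain * x for x in frame])
    return out
```

A frame dominated by a 5 Hz component would be attenuated, while a frame containing only a 200 Hz tone would pass unchanged. A practical design would likely smooth the gain between frames to avoid audible steps.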
The above summary does not include an exhaustive list of all aspects of the present disclosure. It is contemplated that the disclosure includes all systems and methods that can be practiced from all suitable combinations of the various aspects summarized above, as well as those disclosed in the Detailed Description below and particularly pointed out in the Claims section. Such combinations may have particular advantages not specifically recited in the above summary.
Several aspects of the disclosure here are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” aspect in this disclosure are not necessarily to the same aspect, and they mean at least one. Also, in the interest of conciseness and reducing the total number of figures, a given figure may be used to illustrate the features of more than one aspect of the disclosure, and not all elements in the figure may be required for a given aspect.
Several aspects of the disclosure with reference to the appended drawings are now explained. Whenever the shapes, relative positions and other aspects of the parts described are not explicitly defined, the scope of the invention is not limited only to the parts shown, which are meant merely for the purpose of illustration. Also, while numerous details are set forth, it is understood that some aspects of the disclosure may be practiced without these details. In other instances, well-known circuits, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
The headphone 1 has an against-the-ear acoustic transducer or speaker 7 arranged and configured to reproduce sound that is represented in an audio signal directly into the ear of a user, an external microphone 5 (arranged and configured to receive ambient sound directly), and an internal microphone 3 (arranged and configured to directly receive the sound reproduced by the speaker 7). The headphone is configured to acoustically couple the external microphone to an ambient environment of the headphone, in contrast to the internal microphone being acoustically coupled to a trapped volume of air within the ear that is being blocked by the headphone. In one variation, as integrated in the headphone and worn by its user, the external microphone 5 may be more sensitive than the internal microphone 3 to a far field sound source outside of the headphone. Viewed another way, as integrated in the headphone and worn by its user, the external microphone 5 may be less sensitive than the internal microphone 3 to sound within the user's ear. Here it should be noted that while the figures show a single microphone symbol in each instance (external microphone 5 and internal microphone 3) as producing a sound pickup channel, this does not mean that the sound pickup channel must be produced by only one microphone. In some instances, the sound pickup channel may be the result of combining multiple microphone signals, e.g., by a beamforming process performed on a multi-channel output from a microphone array. This variation or option is depicted in dotted lines in the figures, as additional external microphones and a beamforming process.
In one aspect, the transducers, along with the electronics that process and produce the transducer signals (output microphone signals and an input audio signal to drive the speaker), are integrated in the headphone housing. Such electronics may include an audio amplifier to drive the speaker with an audio signal (that may include program audio), a microphone sensing circuit or amplifier that receives the microphone signals and converts them into a desired format for digital signal processing, and a digital processor 2 and associated memory (not shown), where the memory stores instructions for configuring or programming the processor (e.g., instructions to be executed by the processor) to perform digital signal processing methods as described below in detail. A playback signal (program audio) that may contain user content such as music, a podcast, or the voice of a far end user during a voice communication session can also be provided to drive the speaker in some modes of operation, e.g., during noise cancellation mode. The playback signal may be provided to the processor from an external, companion audio source device (not shown in the example of
Turning now
Note that in some cases, the noise cancellation mode of operation is performed during user content media playback, where a program audio signal containing for example music or a podcast or the voice of a far end user in a phone call is also combined into the single audio signal that is driving the speaker 7. In other cases, the program audio signal is silent during noise cancellation mode.
As explained above, there are instances where the output signals from one or both of the feedforward and feedback paths of the ANC system can overdrive the speaker 7, such as when the user is walking (footfall events) and/or when the ambient environment is loud (e.g., rock or pop concert.) This problem is more likely when the headphone 1 is a sealing, in-ear type. To mitigate this, dynamic range control is performed upon the first filtered signal from Gfb, to produce a first dynamic range adjusted signal, and upon the second filtered signal from Gff, to produce a second dynamic range adjusted signal, before driving the speaker 7. In the example of
In one instance, the dynamic range control includes downward compressing the first filtered signal, and/or downward compressing the second filtered signal. This reduces the magnitude of a component of the speaker input signal (e.g. the first filtered signal produced by the feedback path) which helps reduce sound pressure in the trapped volume of air. That sound pressure would otherwise increase beyond normal loud sounds, due to footfall events (e.g. the user is walking, hopping, rolling over a bump).
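Downward compression of a filtered signal can be sketched as follows. This is a minimal static gain computer, assuming a hard knee and a per-sample level estimate. The threshold and ratio values, and the name `downward_compress`, are hypothetical and not taken from the disclosure.

```python
import math


def downward_compress(samples, threshold_db=-12.0, ratio=4.0):
    """Attenuate samples whose level exceeds threshold_db (hard knee).

    Levels are in dB relative to full scale (1.0). Samples at or below the
    threshold pass unchanged; above it, the overshoot is divided by ratio.
    """
    out = []
    for x in samples:
        level = abs(x)
        gain = 1.0
        if level > 0.0:
            over_db = 20.0 * math.log10(level) - threshold_db
            if over_db > 0.0:
                # Keep only 1/ratio of the overshoot above the threshold.
                gain = 10.0 ** (-over_db * (1.0 - 1.0 / ratio) / 20.0)
        out.append(gain * x)
    return out
```

With these assumed settings, a full-scale sample is reduced by 9 dB while a sample at -20 dB is left untouched. Applying the gain per sample, as done here for brevity, distorts the waveform; a practical design would more likely derive the gain from a smoothed envelope.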
Still referring to
Note that while a low shelf cut filter attenuates frequencies below its transition frequency, its response flattens out at some level that still passes the input signal through at a meaningful level. This is in contrast to a high pass filter: while it too attenuates frequencies below a cutoff frequency, its response generally continues to roll off as the frequency drops, until the input signal is essentially no longer passed through.
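The difference between a response that flattens out and one that keeps rolling off toward 0 Hz can be checked numerically. The sketch below assumes first-order analog prototypes: a shelf with a hypothetical floor gain, and a first-order high-pass section standing in for the filter whose response continues to roll off. The function names are illustrative only.

```python
import math


def low_shelf_cut_mag(f_hz, transition_hz, floor_gain):
    """Magnitude of a first-order low shelf cut: floor_gain at DC, 1 at HF."""
    s = complex(0.0, 2.0 * math.pi * f_hz)
    w0 = 2.0 * math.pi * transition_hz
    return abs((s + floor_gain * w0) / (s + w0))


def high_pass_mag(f_hz, cutoff_hz):
    """Magnitude of a first-order high-pass section: 0 at DC, 1 at HF."""
    s = complex(0.0, 2.0 * math.pi * f_hz)
    w0 = 2.0 * math.pi * cutoff_hz
    return abs(s / (s + w0))


if __name__ == "__main__":
    for f in (0.01, 1.0, 10.0, 50.0, 500.0, 5000.0):
        print(f, low_shelf_cut_mag(f, 50.0, 0.25), high_pass_mag(f, 50.0))
```

Far below 50 Hz the shelf settles at the assumed floor gain of 0.25 (about -12 dB), so the signal is still passed, while the roll-off filter's gain keeps shrinking toward zero.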
Also, it should be noted that detecting signal level (for example when evaluating the speaker displacement function) is used here as a generic term covering different techniques of determining the wide-band strength of a signal. This is in contrast to computing narrow band strengths, in given frequency bins for instance. Detecting the signal level may include, for example, envelope detection. Time domain techniques for envelope detection may be more suitable here, to ensure low latency in the response by the controller 11.
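One common time domain technique for envelope detection is a peak follower with separate attack and release smoothing, sketched below. The attack and release times and the name `envelope` are assumptions made for illustration. The disclosure does not specify which envelope detector is used.

```python
import math


def envelope(samples, sample_rate_hz, attack_ms=1.0, release_ms=50.0):
    """Wide-band envelope via one-pole smoothing of the rectified signal.

    A short attack lets the envelope rise quickly when the level jumps;
    a longer release lets it fall slowly once the level drops.
    """
    a_att = math.exp(-1.0 / (sample_rate_hz * attack_ms / 1000.0))
    a_rel = math.exp(-1.0 / (sample_rate_hz * release_ms / 1000.0))
    env = 0.0
    out = []
    for x in samples:
        level = abs(x)
        coeff = a_att if level > env else a_rel
        env = coeff * env + (1.0 - coeff) * level
        out.append(env)
    return out
```

Because each output sample depends only on the current input and the previous envelope value, no block buffering or frequency transform is needed, which keeps the detection delay to roughly the attack time.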
Similar to the dynamic range control of the feedback path (at the output of the Gfb block), dynamic range control is also performed in the feedforward path, and particularly upon the second filtered signal that is produced at the output of the Gff block. This produces a second dynamic range adjusted signal which is then combined with the first dynamic range adjusted signal (at the summing junction shown) into an audio signal that drives the speaker 7 of the headphone. In this particular case, an approach for dynamic range control that is similar to the one applied to the feedback path is taken, namely using a controller 9 that, similar to the controller 11, performs side chain processing of at least the output of the Gff block in the same manner as described above (applying the filtered signal to the input of a speaker displacement model and comparing the resulting speaker displacement function to a threshold, based on which a transition frequency of a low shelf cut filter 8 is computed). An option here is to also consider the first filtered signal when performing the side chain processing, as indicated by the dotted line connecting the output of the Gfb block to the controller 9. For example, the two filtered signals may be combined, as represented by the summing junction, into a single audio signal that is then input to the speaker displacement model.
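The side chain processing performed by a controller such as the controller 9 or the controller 11 might be sketched as follows. Everything specific in this sketch is an assumption: the displacement model is replaced by a one-pole low-pass stand-in, and the mapping from displacement overshoot to transition frequency (linear in dB over `db_span`, between `f_min_hz` and `f_max_hz`) is hypothetical. The disclosure does not give the actual model or mapping.

```python
import math


def predicted_displacement(samples, sample_rate_hz, corner_hz=100.0,
                           mm_per_unit=1.0):
    """Hypothetical displacement model: one-pole low-pass scaled to mm."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * corner_hz / sample_rate_hz)
    state = 0.0
    out = []
    for x in samples:
        state += alpha * (x - state)
        out.append(mm_per_unit * state)
    return out


def transition_frequency(displacement, limit_mm, f_min_hz=20.0,
                         f_max_hz=50.0, db_span=12.0):
    """Map the peak predicted displacement to a shelf transition frequency.

    At or below the limit the lowest frequency is returned; above it the
    frequency rises with the overshoot in dB and saturates at f_max_hz.
    """
    peak = max((abs(d) for d in displacement), default=0.0)
    if peak <= limit_mm:
        return f_min_hz
    excess_db = 20.0 * math.log10(peak / limit_mm)
    fraction = min(excess_db / db_span, 1.0)
    return f_min_hz + fraction * (f_max_hz - f_min_hz)
```

For the optional variant in which both filtered signals are considered, the two signals could be summed sample by sample before being passed to `predicted_displacement`.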
The dynamic range control applied to the output of the Gff block may serve to reduce the magnitude of the output of the feedforward path, so that the speaker 7 is less likely to be overdriven when the user is in a loud ambient environment. As to the dynamic range control applied to the output of the Gfb block, that may serve to reduce the magnitude of the output of the feedback path, so that the speaker 7 is less likely to be overdriven during footfall events (e.g., the user is walking or riding over bumps). As most of the energy in footfall events is below 50 Hz, the transition frequency of the low shelf cut filter 10, and perhaps also that of the low shelf cut filter 8, may vary between 20 Hz and 50 Hz. Thus, the speaker 7 is protected against the disturbances caused by footfall in both quiet and loud ambient environments, while both the feedforward and feedback paths of the ANC system are active.
One of the problems encountered when seeking to protect the speaker 7 against being overdriven is how to keep the delay in responding to a detected overdriving condition (in the feedback and/or feedforward paths) as short as possible. A solution here is to perform the filtering by the Gfb block, the filtering by the low shelf filter, and the side chain processing (to determine the transition frequency of the low shelf filter) in the time domain. For example, the filtering and side chain processing may all be performed without converting any of their input signals into frequency domain or sub-band signals, so as to avoid introducing too much latency into the feedback path. Using a low shelf filter in the dynamic range control also helps keep the delay as short as possible, because of such a filter's desirable phase response characteristics. These observations apply in a similar manner to reducing latency when responding to a detected overdriving condition in the feedforward path.
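A time domain low shelf filter whose transition frequency can be changed between samples might look like the sketch below. It assumes a second-order (biquad) section using the widely published Audio EQ Cookbook low shelf formulas with shelf slope S = 1. The class name, the fixed cut depth, and the direct form I structure are illustrative choices, not details taken from the disclosure.

```python
import math


class LowShelfCut:
    """Biquad low shelf cut processed one sample at a time (direct form I)."""

    def __init__(self, sample_rate_hz, transition_hz, cut_db):
        self.fs = sample_rate_hz
        self.cut_db = cut_db  # depth of the cut, given as a positive dB value
        self.x1 = self.x2 = self.y1 = self.y2 = 0.0
        self.set_transition(transition_hz)

    def set_transition(self, transition_hz):
        """Recompute coefficients; the sample history is kept as is."""
        a = 10.0 ** (-abs(self.cut_db) / 40.0)
        w0 = 2.0 * math.pi * transition_hz / self.fs
        cos_w0 = math.cos(w0)
        alpha = math.sin(w0) / math.sqrt(2.0)  # cookbook alpha for S = 1
        k = 2.0 * math.sqrt(a) * alpha
        a0 = (a + 1.0) + (a - 1.0) * cos_w0 + k
        self.b0 = a * ((a + 1.0) - (a - 1.0) * cos_w0 + k) / a0
        self.b1 = 2.0 * a * ((a - 1.0) - (a + 1.0) * cos_w0) / a0
        self.b2 = a * ((a + 1.0) - (a - 1.0) * cos_w0 - k) / a0
        self.a1 = -2.0 * ((a - 1.0) + (a + 1.0) * cos_w0) / a0
        self.a2 = ((a + 1.0) + (a - 1.0) * cos_w0 - k) / a0

    def process(self, x):
        """Filter one input sample and return one output sample."""
        y = (self.b0 * x + self.b1 * self.x1 + self.b2 * self.x2
             - self.a1 * self.y1 - self.a2 * self.y2)
        self.x2, self.x1 = self.x1, x
        self.y2, self.y1 = self.y1, y
        return y
```

Each call to `process` consumes one sample and produces one sample, so the structure itself adds no block delay, and `set_transition` can be called by a side chain controller whenever a new transition frequency is computed. Abrupt coefficient changes can cause transients, which a practical design would need to manage.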
Referring now to
It should be noted that if there is no footfall event and the ambient environment is not loud, then the side chain processing performed by each of the controller 9, the controller 11 (
Turning now to
In
The effect of the version shown in
Referring now to
In accordance with an aspect of the disclosure here, dynamic range control is performed upon the filtered signal that is produced at the output of the Gff block or Gfb block (or both), by side chain processing of the filtered signal to detect energy or power below 20 Hz, for example on a frame by frame basis (digital audio frames.) Then, gain reduction is performed upon the filtered signal (of the Gff block, the Gfb block, or both as shown), in response to detecting that a signal level of the detected energy or power exceeds a threshold. Thus, in the example of a hybrid ANC system in
In this disclosure, microphone signals are processed by an ANC system and are translated into speaker displacement functions, for purposes of speaker protection, or for detecting infrasound energy or power. Thus, the use of personally identifiable information is not likely to be needed in this disclosure. However, it should be understood that any such use should follow privacy policies and practices that are generally recognized as meeting or exceeding industry or governmental requirements for maintaining the privacy of users. In particular, personally identifiable information should be managed and handled so as to minimize risks of unintentional or unauthorized access or use, and the nature of authorized use should be clearly indicated to users.
To aid the Patent Office and any readers of any patent issued on this application in interpreting the claims appended hereto, applicant wishes to note that they do not intend any of the appended claims or claim elements to invoke 35 U.S.C. 112(f) unless the words “means for” or “step for” are explicitly used in the particular claim.
While certain aspects have been described and shown in the accompanying drawings, it is to be understood that such are merely illustrative of and not restrictive on the broad invention, and that the invention is not limited to the specific constructions and arrangements shown and described, since various other modifications may occur to those of ordinary skill in the art. For example, although not shown in
This nonprovisional patent application claims the benefit of the earlier filing dates of U.S. provisional applications 62/907,315 filed Sep. 27, 2019 and 62/923,391 filed Oct. 18, 2019.
Number | Date | Country
---|---|---
62923391 | Oct 2019 | US
62907315 | Sep 2019 | US