Methods and systems for providing consistency in noise reduction during speech and non-speech periods

Information

  • Patent Grant
  • 9812149
  • Patent Number
    9,812,149
  • Date Filed
    Thursday, January 28, 2016
    8 years ago
  • Date Issued
    Tuesday, November 7, 2017
    7 years ago
Abstract
Methods and systems for providing consistency in noise reduction during speech and non-speech periods are provided. First and second signals are received. The first signal includes at least a voice component. The second signal includes at least the voice component modified by human tissue of a user. First and second weights may be assigned per subband to the first and second signals, respectively. The first and second signals are processed to obtain respective first and second full-band power estimates. During periods when the user's speech is not present, the first weight and the second weight are adjusted based at least partially on the first full-band power estimate and the second full-band power estimate. The first and second signals are blended based on the adjusted weights to generate an enhanced voice signal. The second signal may be aligned with the first signal prior to the blending.
Description
FIELD

The present application relates generally to audio processing and, more specifically, to systems and methods for providing noise reduction that has consistency between speech-present periods and speech-absent periods (speech gaps).


BACKGROUND

The proliferation of smart phones, tablets, and other mobile devices has fundamentally changed the way people access information and communicate. People now make phone calls in diverse places such as crowded bars, busy city streets, and windy outdoors, where adverse acoustic conditions pose severe challenges to the quality of voice communication. Additionally, voice commands have become an important method for interaction with electronic devices in applications where users have to keep their eyes and hands on the primary task, such as, for example, driving. As electronic devices become increasingly compact, voice command may become the preferred method of interaction with electronic devices. However, despite recent advances in speech technology, recognizing voice in noisy conditions remains difficult. Therefore, mitigating the impact of noise is important to both the quality of voice communication and performance of voice recognition.


Headsets have been a natural extension of telephony terminals and music players as they provide hands-free convenience and privacy when used. Compared to other hands-free options, a headset represents an option in which microphones can be placed at locations near the user's mouth, with constrained geometry among user's mouth and microphones. This results in microphone signals that have better signal-to-noise ratios (SNRs) and are simpler to control when applying multi-microphone based noise reduction. However, when compared to traditional handset usage, headset microphones are relatively remote from the user's mouth. As a result, the headset does not provide the noise shielding effect provided by the user's hand and the bulk of the handset. As headsets have become smaller and lighter in recent years due to the demand for headsets to be subtle and out-of-way, this problem becomes even more challenging.


When a user wears a headset, the user's ear canals are naturally shielded from outside acoustic environment. If a headset provides tight acoustic sealing to the ear canal, a microphone placed inside the ear canal (the internal microphone) would be acoustically isolated from the outside environment such that environmental noise would be significantly attenuated. Additionally, a microphone inside a sealed ear canal is free of wind-buffeting effect. A user's voice can be conducted through various tissues in a user's head to reach the ear canal, because the sound is trapped inside of the ear canal. A signal picked up by the internal microphone should thus have much higher SNR compared to the microphone outside of the user's ear canal (the external microphone).


Internal microphone signals are not free of issues, however. First of all, the body-conducted voice tends to have its high-frequency content severely attenuated and thus has much narrower effective bandwidth compared to voice conducted through air. Furthermore, when the body-conducted voice is sealed inside an ear canal, it forms standing waves inside the ear canal. As a result, the voice picked up by the internal microphone often sounds muffled and reverberant while lacking the natural timbre of the voice picked up by the external microphones. Moreover, effective bandwidth and standing-wave patterns vary significantly across different users and headset fitting conditions. Finally, if a loudspeaker is also located in the same ear canal, sounds made by the loudspeaker would also be picked by the internal microphone. Even with acoustic echo cancellation (AEC), the close coupling between the loudspeaker and internal microphone often leads to severe voice distortion even after AEC.


Other efforts have been attempted in the past to take advantage of the unique characteristics of the internal microphone signal for superior noise reduction performance. However, attaining consistent performance across different users and different usage conditions has remained challenging. It can be particularly challenging to provide robustness and consistency for noise reduction both when the user is speaking and in gaps when the user is not speaking (speech gaps). Some known methods attempt to address this problem; however, those methods may be more effective when the user's speech is present but less so when the user's speech is absent. What is needed is a method that overcomes the drawbacks of the known methods. More specifically, what is needed is a method that improves noise reduction performance during speech gaps such that it is not inconsistent with the noise reduction performance during speech periods.


SUMMARY

This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.


Methods and systems for providing consistency in noise reduction during speech and non-speech periods are provided. An example method includes receiving a first audio signal and a second audio signal. The first audio signal includes at least a voice component. The second audio signal includes at least the voice component modified by at least a human tissue of a user. The voice component may be the speech of the user. The first and second audio signals including periods where the speech of the user is not present. The method can also include assigning a first weight to the first audio signal and a second weight to the second audio signal. The method also includes processing the first audio signal to obtain a first full-band power estimate. The method also includes processing the second audio signal to obtain a second full-band power estimate. For the periods when the user's speech is not present, the method includes adjusting, based at least partially on the first full-band power estimate and the second full-band power estimate, the first weight and the second weight. The method also includes blending, based on the first weight and the second weight, the first signal and the second signal to generate an enhanced voice signal.


In some embodiments, the first signal and the second signal are transformed into subband signals. In other embodiments, assigning the first weight and the second weight is performed per subband and based on SNR estimates for the subband. The first signal is processed to obtain a first SNR for the subband and the second signal is processed to obtain a second SNR for the subband. If the first SNR is larger than the second SNR, the first weight for the subband receives a larger value than the second weight for the subband. Otherwise, if the second SNR is larger than the first SNR, the second weight for the subband receives a larger value than the first weight for the subband. In some embodiments, the difference between the first weight and the second weight corresponds to the difference between the first SNR and the second SNR for the subband. However, this SNR-based method is more effective when the user's speech is present but less effective when the user's speech is absent. More specifically, when the user's speech is present, according to this example, selecting the signal with a higher SNR leads to the selection of the signal with lower noise. Because the noise in the ear canal tends to be 20-30 dB lower than the noise outside, there is typically a 20-30 dB noise reduction relative to the external microphone signal. However, when the user's speech is absent, in this example, the SNR is 0 at both the internal and external microphone signals. Deciding the weights based only on the SNRs, as in the SNR-based method, would lead to evenly split weights when the user's speech is absent in this example. As a result, only 3-6 dB of noise reduction is typically achieved relative to the external microphone signal when only the SNR-based method is used.


To mitigate this deficiency of SNR-based mixing methods during speech-absent periods (speech gaps), the full-band noise power is used, in various embodiments, to decide the mixing weights during the speech gaps. Because there is no speech, lower full-band power means there is lower noise power. The method, according to various embodiments, selects the signals with lower full-band power in order to maintain the 20-30 dB noise reduction in speech gaps. In some embodiments, during the speech gaps, adjusting the first weight and the second weight includes determining a minimum value between the first full-band power estimate and the second full-band power estimate. When the minimum value corresponds to the first full-band power estimate, the first weight is increased and the second weight is decreased. When the minimum value corresponds to the second full-band power estimate, the second weight is increased and the first weight is decreased. In some embodiments, the weights are increased and decreased by applying a shift. In various embodiments, the shift is calculated based on a difference between the first full-band power estimate and the second full-band power estimate. The shift receives a larger value for a larger difference value. In certain embodiments, the shift is applied only after determining that the difference exceeds a pre-determined threshold. In other embodiments, a ratio of the first full-band power estimate to the second full-band power estimate is calculated. The shift is calculated based on the ratio. The shift receives a larger value the further the value of ratio is from 1.


In some embodiments, the second audio signal represents at least one sound captured by an internal microphone located inside an ear canal. In certain embodiments, the internal microphone is at least partially sealed for isolation from acoustic signals external to the ear canal.


In some embodiments, the first signal represents at least one sound captured by an external microphone located outside an ear canal. In some embodiments, prior to associating the first weight and the second weight, the second signal is aligned with the first signal. In some embodiments, the assigning of the first weight and the second weight includes determining, based on the first signal, a first noise estimate and determining, based on the second signal, a second noise estimate. The first weight and the second weight can be calculated based on the first noise estimate and the second noise estimate.


In some embodiments, blending includes mixing the first signal and the second signal according to the first weight and the second weight. According to another example embodiment of the present disclosure, the steps of the method for providing consistency in noise reduction during speech and non-speech periods are stored on a non-transitory machine-readable medium comprising instructions, which, when implemented by one or more processors, perform the recited steps.


Other example embodiments of the disclosure and aspects will become apparent from the following description taken in conjunction with the following drawings.





BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments are illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements.



FIG. 1 is a block diagram of a system and an environment in which methods and systems described herein can be practiced, according to an example embodiment.



FIG. 2 is a block diagram of a headset suitable for implementing the present technology, according to an example embodiment.



FIG. 3 is a block diagram illustrating a system for providing consistency in noise reduction during speech and non-speech periods, according to an example embodiment.



FIG. 4 is a flow chart showing steps of a method for providing consistency in noise reduction during speech and non-speech periods, according to an example embodiment.



FIG. 5 illustrates an example of a computer system that can be used to implement embodiments of the disclosed technology.





DETAILED DESCRIPTION

The present technology provides systems and methods for audio processing which can overcome or substantially alleviate problems associated with ineffective noise reduction during speech-absent periods. Embodiments of the present technology can be practiced on any earpiece-based audio device that is configured to receive and/or provide audio such as, but not limited to, cellular phones, MP3 players, phone handsets and headsets. While some embodiments of the present technology are described in reference to operation of a cellular phone, the present technology can be practiced with any audio device.


According to an example embodiment, the method for audio processing includes receiving a first audio signal and a second audio signal. The first audio signal includes at least a voice component. The second audio signal includes the voice component modified by at least a human tissue of a user, the voice component being speech of the user. The first and second audio signals may include periods when the speech of the user is not present. The first and second audio signals may be transformed into subband signals. The example method includes assigning, per subband, a first weight to the first audio signal and a second weight to the second audio signal. The example method includes processing the first audio signal to obtain a first full-band power estimate. The example method includes processing the second audio signal to obtain a second full-band power estimate. For the periods when the user's speech is not present (speech gaps), the example method includes adjusting, based at least partially on the first full-band power estimate and the second full-band power estimate, the first weight and the second weight. The example method also includes blending, based on the adjusted first weight and the adjusted second weight, the first audio signal and the second audio signal to generate an enhanced voice signal.


Referring now to FIG. 1, a block diagram of an example system 100 suitable for providing consistency in noise reduction during speech and non-speech periods and environment thereof are shown. The example system 100 includes at least an internal microphone 106, an external microphone 108, a digital signal processor (DSP) 112, and a radio or wired interface 114. The internal microphone 106 is located inside a user's ear canal 104 and is relatively shielded from the outside acoustic environment 102. The external microphone 108 is located outside of the user's ear canal 104 and is exposed to the outside acoustic environment 102.


In various embodiments, the microphones 106 and 108 are either analog or digital. In either case, the outputs from the microphones are converted into synchronized pulse coded modulation (PCM) format at a suitable sampling frequency and connected to the input port of the digital signal processor (DSP) 112. The signals xin and xex denote signals representing sounds captured by internal microphone 106 and external microphone 108, respectively.


The DSP 112 performs appropriate signal processing tasks to improve the quality of microphone signals xin and xex. The output of DSP 112, referred to as the send-out signal (sout), is transmitted to the desired destination, for example, to a network or host device 116 (see signal identified as sout uplink), through a radio or wired interface 114.


If a two-way voice communication is needed, a signal is received by the network or host device 116 from a suitable source (e.g., via the wireless or wired interface 114). This is referred to as the receive-in signal (rin) (identified as rin downlink at the network or host device 116). The receive-in signal can be coupled via the radio or wired interface 114 to the DSP 112 for processing. The resulting signal, referred to as the receive-out signal (rout), is converted into an analog signal through a digital-to-analog convertor (DAC) 110 and then connected to a loudspeaker 118 in order to be presented to the user. In some embodiments, the loudspeaker 118 is located in the same ear canal 104 as the internal microphone 106. In other embodiments, the loudspeaker 118 is located in the ear canal opposite the ear canal 104. In example of FIG. 1, the loudspeaker 118 is found in the same ear canal as the internal microphone 106; therefore, an acoustic echo canceller (AEC) may be needed to prevent the feedback of the received signal to the other end. Optionally, in some embodiments, if no further processing of the received signal is necessary, the receive-in signal (rin) can be coupled to the loudspeaker without going through the DSP 112. In some embodiments, the receive-in signal rin includes an audio content (for example, music) presented to user. In certain embodiments, receive-in signal rin includes a far end signal, for example a speech during a phone call.



FIG. 2 shows an example headset 200 suitable for implementing methods of the present disclosure. The headset 200 includes example inside-the-ear (ITE) module(s) 202 and behind-the-ear (BTE) modules 204 and 206 for each ear of a user. The ITE module(s) 202 are configured to be inserted into the user's ear canals. The BTE modules 204 and 206 are configured to be placed behind (or otherwise near) the user's ears. In some embodiments, the headset 200 communicates with host devices through a wireless radio link. The wireless radio link may conform to a Bluetooth Low Energy (BLE), other Bluetooth, 802.11, or other suitable wireless standard and may be variously encrypted for privacy.


In various embodiments, each ITE module 202 includes an internal microphone 106 and the loudspeaker 118 (shown in FIG. 1), both facing inward with respect to the ear canals. The ITE module(s) 202 can provide acoustic isolation between the ear canal(s) 104 and the outside acoustic environment 102.


In some embodiments, each of the BTE modules 204 and 206 includes at least one external microphone 108 (also shown in FIG. 1). In some embodiments, the BTE module 204 includes a DSP 112, control button(s), and wireless radio link to host devices. In certain embodiments, the BTE module 206 includes a suitable battery with charging circuitry.


In some embodiments, the seal of the ITE module(s) 202 is good enough to isolate acoustics waves coming from outside acoustic environment 102. However, when speaking or singing, a user can hear user's own voice reflected by ITE module(s) 202 back into the corresponding ear canal. The sound of voice of the user can be distorted because, while traveling through skull of the user, high frequencies of the sound are substantially attenuated. Thus, the user can hear mostly the low frequencies of the voice. The user's voice cannot be heard by the user outside of the earpieces since the ITE module(s) 202 isolate external sound waves.



FIG. 3 illustrates a block diagram 300 of DSP 112 suitable for fusion (blending) of microphone signals, according to various embodiments of the present disclosure. The signals xin and xex are signals representing sounds captured from, respectively, the internal microphone 106 and external microphone 108. The signals xin and xex need not be the signals coming directly from the respective microphones; they may represent the signals that are coming directly from the respective microphones. For example, the direct signal outputs from the microphones may be preprocessed in some way, for example, by conversion into a synchronized pulse coded modulation (PCM) format at a suitable sampling frequency, where the method disclosed herein can be used to convert the signal.


In the example in FIG. 3, the signals xin and xex are first processed by noise tracking/noise reduction (NT/NR) modules 302 and 304 to obtain running estimates of the noise level picked up by each microphone. Optionally, the noise reduction (NR) can be performed by NT/NR modules 302 and 304 by utilizing an estimated noise level.


By way of example and not limitation, suitable noise reduction methods are described by Ephraim and Malah, “Speech Enhancement Using a Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator,” IEEE Transactions on Acoustics, Speech, and Signal Processing, December 1984, and U.S. patent application Ser. No. 12/832,901 (now U.S. Pat. No. 8,473,287), entitled “Method for Jointly Optimizing Noise Reduction and Voice Quality in a Mono or Multi-Microphone System,” filed on Jul. 8, 2010, the disclosures of which are incorporated herein by reference for all purposes.


In various embodiments, the microphone signals xin and xex, with or without NR, and noise estimates (e.g., “external noise and SNR estimates” output from NT/NR module 302 and/or “internal noise and SNR estimates” output from NT/NR module 304) from the NT/NR modules 302 and 304 are sent to a microphone spectral alignment (MSA) module 306, where a spectral alignment filter is adaptively estimated and applied to the internal microphone signal xin. A primary purpose of MSA module 306, in the example in FIG. 3; is to spectrally align the voice picked up by the internal microphone 106 to the voice picked up by the external microphone 108 within the effective bandwidth of the in-canal voice signal.


The external microphone signal xex, the spectrally-aligned internal microphone signal xin,align, and the estimated noise levels at both microphones 106 and 108 are then sent to a microphone signal blending (MSB) module 308, where the two microphone signals are intelligently combined based on the current signal and noise conditions to form a single output with optimal voice quality. The functionalities of various embodiments of the NT/NR modules 302 and 304, MSA module, and MSB module 308 are discussed in more detail in U.S. patent application Ser. No. 14/853,947, entitled “Microphone Signal Fusion”, filed Sep. 14, 2015.


In some embodiments, external microphone signal xex and the spectrally-aligned internal microphone signal xin,align are blended using blending weights. In certain embodiments, the blending weights are determined in MSB module 308 based on the “external noise and SNR estimates” and the “internal noise and SNR estimates”.


For example, MSB module 308 operates in the frequency-domain and determines the blending weights of the external microphone signal and spectral-aligned internal microphone signal in each frequency bin based on the SNR differential between the two signals in the bin. When a user's speech is present (for example, the user of headset 200 is speaking during a phone call) and the outside acoustic environment 102 becomes noisy, the SNR of the external microphone signal xex becomes lower as compared to the SNR of the internal microphone signal xin. Therefore, the blending weights are shifted toward the internal microphone signals xin. Because acoustic sealing tends to reduce the noise in the ear canal by 20-30 dB relative to the external environment, the shift can potentially provide 20-30 dB noise reduction relative to the external microphone signal. When the user's speech is absent, the SNRs of both internal and external microphone signals are effectively zero, so the blending weights become evenly distributed between the internal and external microphone signals. Therefore, if the outside acoustic environment is noisy, the resulting blended signal sout includes the part of the noise. The blending of internal microphone signal xin and noisy external microphone signal xex may result in 3-6 dB noise reduction, which is generally insufficient for extraneous noise conditions.


In various embodiments, the method includes utilizing differences between the power estimates for the external and the internal microphone signals for locating gaps in the speech of the user of headset 200. In certain embodiments, for the gap intervals, blending weight for the external microphone signal is decreased or set to zero and blending weight for the internal microphone signal is increased or set to one before blending of the internal microphone and external microphone signals. Thus, during the gaps in the user's speech, the blending weights are biased to the internal microphone signal, according to various embodiments. As a result, the resulting blended signal contains a lesser amount of the external microphone signal and, therefore, a lesser amount of noise from the outside external environment. When the user is speaking, the blended weights are determined based on “noise and SNR estimates” of internal and external microphone signals. Blending the signals during user's speech improves the quality of the signal. For example, the blending of the signals can improve a quality of signals delivered to the far-end talker during a phone call or to an automatic speech recognition system by the radio or wired interface 114.


In various embodiments, DSP 112 includes a microphone power spread (MPS) module 310 as shown in FIG. 3. In certain embodiments, MPS module 310 is operable to track full-band power for both external microphone signal xex and internal microphone signal xin. In some embodiments, MPS module 310 tracks full-band power of the spectrally-aligned internal microphone signal xin,align instead of the raw internal microphone signal xin. In some embodiments, power spreads for the internal microphone signal and external microphone signal are estimated. In clean speech conditions, the powers of both the internal microphone and external microphone signals tend to follow each other. A wide power spread indicates the presence of an excessive noise in the microphone signal with much higher power.


In various embodiments, the MPS module 310 generates microphone power spread (MPS) estimates for the internal microphone signal and external microphone signal. The MPS estimates are provided to MSB module 308. In certain embodiments, the MPS estimates are used for a supplemental control of microphone signal blending. In some embodiments, MSB module 308 applies a global bias toward the microphone signal with significantly lower full-band power, for example, by increasing the weights for that microphone signal and decreasing the weights for the other microphone signal (i.e., shifting the weights toward the microphone signal with significantly lower full-band power) before the two microphone signals are blended.



FIG. 4 is a flow chart showing steps of method 400 for providing consistency in noise reduction during speech and non-speech periods, according to various example embodiments. The example method 400 can commence with receiving a first audio signal and a second audio signal in block 402. The first audio signal includes at least a voice component and a second audio signal includes the voice component modified by at least a human tissue.


In block 404, method 400 can proceed with assigning a first weight to the first audio signal and a second weight to the second audio signal. In some embodiments, prior to assigning the first weight and the second weight, the first audio signal and the second audio signal are transformed into subband signals and, therefore, assigning of the weights may be performed per each subband. In some embodiments, the first weight and the second weight are determined based on noise estimates in the first audio signal and the second audio signal. In certain embodiments, when the user's speech is present, the first weight and the second weight are assigned based on subband SNR estimates in the first audio signal and the second audio signal.


In block 406, method 400 can proceed with processing the first audio signal to obtain a first full-band power estimate. In block 408, method 400 can proceed with processing the second audio signal to obtain a second full-band power estimate. In block 410, during speech gaps when the user's speech is not present, the first weight and the second weight may be adjusted based, at least partially, on the first full-band power estimate and the second full-band power estimate. In some embodiments, if the first full-band power estimate is less than the second full-band estimate, the first weight and the second weight are shifted towards the first weight. If the second full-band power estimate is less than the first full-band estimate, the first weight and the second weight are shifted towards the second weight.


In block 412, the first signal and the second signal can be used to generate an enhanced voice signal by being blended together based on the adjusted first weight and the adjusted second weight.



FIG. 5 illustrates an exemplary computer system 500 that may be used to implement some embodiments of the present invention. The computer system 500 of FIG. 5 may be implemented in the contexts of the likes of computing systems, networks, servers, or combinations thereof. The computer system 500 of FIG. 5 includes one or more processor unit(s) 510 and main memory 520. Main memory 520 stores, in part, instructions and data for execution by processor units 510. Main memory 520 stores the executable code when in operation, in this example. The computer system 500 of FIG. 5 further includes a mass data storage 530, portable storage device 540, output devices 550, user input devices 560, a graphics display system 570, and peripheral devices 580.


The components shown in FIG. 5 are depicted as being connected via a single bus 590. The components may be connected through one or more data transport means. Processor unit(s) 510 and main memory 520 is connected via a local microprocessor bus, and the mass data storage 530, peripheral devices 580, portable storage device 540, and graphics display system 570 are connected via one or more input/output (I/O) buses.


Mass data storage 530, which can be implemented with a magnetic disk drive, solid state drive, or an optical disk drive, is a non-volatile storage device for storing data and instructions for use by processor unit(s) 510. Mass data storage 530 stores the system software for implementing embodiments of the present disclosure for purposes of loading that software into main memory 520.


Portable storage device 540 operates in conjunction with a portable non-volatile storage medium, such as a flash drive, floppy disk, compact disk, digital video disc, or Universal Serial Bus (USB) storage device, to input and output data and code to and from the computer system 500 of FIG. 5. The system software for implementing embodiments of the present disclosure is stored on such a portable medium and input to the computer system 500 via the portable storage device 540.


User input devices 560 can provide a portion of a user interface. User input devices 560 may include one or more microphones, an alphanumeric keypad, such as a keyboard, for inputting alphanumeric and other information, or a pointing device, such as a mouse, a trackball, stylus, or cursor direction keys. User input devices 560 can also include a touchscreen. Additionally, the computer system 500 as shown in FIG. 5 includes output devices 550. Suitable output devices 550 include speakers, printers, network interfaces, and monitors.


Graphics display system 570 include a liquid crystal display (LCD) or other suitable display device. Graphics display system 570 is configurable to receive textual and graphical information and processes the information for output to the display device.


Peripheral devices 580 may include any type of computer support device to add additional functionality to the computer system.


The components provided in the computer system 500 of FIG. 5 are those typically found in computer systems that may be suitable for use with embodiments of the present disclosure and are intended to represent a broad category of such computer components that are well known in the art. Thus, the computer system 500 of FIG. 5 can be a personal computer (PC), hand held computer system, telephone, mobile computer system, workstation, tablet, phablet, mobile phone, server, minicomputer, mainframe computer, wearable, or any other computer system. The computer may also include different bus configurations, networked platforms, multi-processor platforms, and the like. Various operating systems may be used including UNIX, LINUX, WINDOWS, MAC OS, PALM OS, QNX ANDROID, IOS, CHROME, TIZEN, and other suitable operating systems.


The processing for various embodiments may be implemented in software that is cloud-based. In some embodiments, the computer system 500 is implemented as a cloud-based computing environment, such as a virtual machine operating within a computing cloud. In other embodiments, the computer system 500 may itself include a cloud-based computing environment, where the functionalities of the computer system 500 are executed in a distributed fashion. Thus, the computer system 500, when configured as a computing cloud, may include pluralities of computing devices in various forms, as will be described in greater detail below.


In general, a cloud-based computing environment is a resource that typically combines the computational power of a large grouping of processors (such as within web servers) and/or that combines the storage capacity of a large grouping of computer memories or storage devices. Systems that provide cloud-based resources may be utilized exclusively by their owners or such systems may be accessible to outside users who deploy applications within the computing infrastructure to obtain the benefit of large computational or storage resources.


The cloud may be formed, for example, by a network of web servers that comprise a plurality of computing devices, such as the computer system 500, with each server (or at least a plurality thereof) providing processor and/or storage resources. These servers may manage workloads provided by multiple users (e.g., cloud resource customers or other users). Typically, each user places workload demands upon the cloud that vary in real-time, sometimes dramatically. The nature and extent of these variations typically depends on the type of business associated with the user.


The present technology is described above with reference to example embodiments. Therefore, other variations upon the example embodiments are intended to be covered by the present disclosure.

Claims
  • 1. A method for audio processing, the method comprising: receiving a first signal including at least a voice component and a second signal including at least the voice component modified by at least a human tissue of a user, the voice component being speech of the user, the first and second signals including periods when the speech of the user is not present;assigning a first weight to the first signal and a second weight to the second signal;processing the first signal to obtain a first power estimate;processing the second signal to obtain a second power estimate;utilizing the first and second power estimates to identify the periods when the speech of the user is not present;for the periods that have been identified to be when the speech of the user is not present, performing one or both of decreasing the first weight and increasing the second weight so as to enhance the level of the second signal relative to the first signal;blending, based on the first weight and the second weight, the first signal and the second signal to generate an enhanced voice signal; andprior to the assigning, aligning the second signal with the first signal, the aligning including applying a spectral alignment filter to the second signal.
  • 2. The method of claim 1, further comprising: further processing the first signal to obtain a first full-band power estimate;further processing the second signal to obtain a second full-band power estimate;determining a minimum value between the first full-band power estimate and the second full-band power estimate; andbased on the determination: increasing the first weight and decreasing the second weight when the minimum value corresponds to the first full-band power estimate; andincreasing the second weight and decreasing the first weight when the minimum value corresponds to the second full-band power estimate.
  • 3. The method of claim 2, wherein the increasing and decreasing is carried out by applying a shift.
  • 4. The method of claim 3, wherein the shift is calculated based on a difference between the first full-band power estimate and the second full-band power estimate, the shift receiving a larger value for a larger difference value.
  • 5. The method of claim 4, further comprising: prior to the increasing and decreasing, determining that the difference exceeds a pre-determined threshold; andbased on the determination, applying the shift if the difference exceeds the pre-determined threshold.
  • 6. The method of claim 1, wherein the first signal and the second signal are transformed into subband signals.
  • 7. The method of claim 6, wherein, for the periods when the speech of the user is present, the assigning the first weight and the second weight is carried out per subband by performing the following: processing the first signal to obtain a first signal-to-noise ratio (SNR) for the subband;processing the second signal to obtain a second SNR for the subband;comparing the first SNR and the second SNR; andbased on the comparison, assigning a first value to the first weight for the subband and a second value to the second weight for the subband, and wherein: the first value is larger than the second value if the first SNR is larger than the second SNR;the second value is larger than the first value if the second SNR is larger than the first SNR; anda difference between the first value and the second value depends on a difference between the first SNR and the second SNR.
  • 8. The method of claim 1, wherein the second signal represents at least one sound captured by an internal microphone located inside an ear canal.
  • 9. The method of claim 8, wherein the internal microphone is at least partially sealed for isolation from acoustic signals external to the ear canal.
  • 10. The method of claim 1, wherein the first signal represents at least one sound captured by an external microphone located outside an ear canal.
  • 11. The method of claim 1, wherein the assigning of the first weight and the second weight includes: determining, based on the first signal, a first noise estimate;determining, based on the second signal, a second noise estimate; andcalculating, based on the first noise estimate and the second noise estimate, the first weight and the second weight.
  • 12. The method of claim 1, wherein the blending includes mixing the first signal and the second signal according to the first weight and the second weight.
  • 13. A system for audio processing, the system comprising: a processor; anda memory communicatively coupled with the processor, the memory storing instructions, which, when executed by the processor, perform a method comprising: receiving a first signal including at least a voice component and a second signal including at least the voice component modified by at least a human tissue of a user, the voice component being speech of the user, the first and second signals including periods when the speech of the user is not present;assigning a first weight to the first signal and a second weight to the second signal;processing the first signal to obtain a first power estimate;processing the second signal to obtain a second power estimate;utilizing the first and second power estimates to identify the periods when the speech of the user is not present;for the periods that have been identified to be when the speech of the user is not present, performing one or both of decreasing the first weight and increasing the second weight so as to enhance the level of the second signal relative to the first signal;blending, based on the first weight and the second weight, the first signal and the second signal to generate an enhanced voice signal; andprior to the assigning, aligning the second signal with the first signal, the aligning including applying a spectral alignment filter to the second signal.
  • 14. The system of claim 13, wherein the method further comprises: further processing the first signal to obtain a first full-band power estimate;further processing the second signal to obtain a second full-band power estimate;determining a minimum value between the first full-band power estimate and the second full-band power estimate; andbased on the determination: increasing the first weight and decreasing the second weight when the minimum value corresponds to the first full-band power estimate; andincreasing the second weight and decreasing the first weight when the minimum value corresponds to the second full-band power estimate.
  • 15. The system of claim 14, wherein the increasing and decreasing is carried out by applying a shift.
  • 16. The system of claim 15, wherein the shift is calculated based on a difference of the first full-band power estimate and the second full-band power estimate, the shift receiving a larger value for a larger value difference.
  • 17. The system of claim 16, further comprising: prior to the increasing and decreasing, determining that the difference exceeds a pre-determined threshold; andbased on the determination, applying the shift if the difference exceeds the pre-determined threshold.
  • 18. The system of claim 13, wherein the first signal and the second signal are transformed into subband signals.
  • 19. The system of claim 18, wherein, for the periods when the speech of the user is present, the assigning the first weight and the second weight is carried out per subband by performing the following: processing the first signal to obtain a first signal-to-noise ratio (SNR) for the subband;processing the second signal to obtain a second SNR for the subband;comparing the first SNR and the second SNR; andbased on the comparison, assigning a first value to the first weight for the subband and a second value to the second weight for the subband, and wherein: the first value is larger than the second value if the first SNR is larger than the second SNR;the second value is larger than the first value if the second SNR is larger than the first SNR; anda difference between the first value and the second value depends on a difference between the first SNR and the second SNR.
  • 20. The system of claim 13, wherein the second signal represents at least one sound captured by an internal microphone located inside an ear canal.
  • 21. The system of claim 20, wherein the internal microphone is at least partially sealed for isolation from acoustic signals external to the ear canal.
  • 22. The system of claim 13, wherein the first signal represents at least one sound captured by an external microphone located outside an ear canal.
  • 23. The system of claim 13, wherein the assigning the first weight and the second weight includes: determining, based on the first signal, a first noise estimate;determining, based on the second signal, a second noise estimate; andcalculating, based on the first noise estimate and the second noise estimate, the first weight and the second weight.
  • 24. A non-transitory computer-readable storage medium having embodied thereon instructions, which, when executed by at least one processor, perform steps of a method, the method comprising: receiving a first signal including at least a voice component and a second signal including at least the voice component modified by at least a human tissue of a user, the voice component being speech of the user, the first and second signals including periods when the speech of the user is not present;determining, based on the first signal, a first noise estimate;determining, based on the second signal, a second noise estimate;assigning, based on the first noise estimate and second noise estimate, a first weight to the first signal and a second weight to the second signal;processing the first signal to obtain a first power estimate;processing the second signal to obtain a second power estimate;utilizing the first and second power estimates to identify the periods when the speech of the user is not present;for the periods that have been identified to be when the speech of the user is not present, performing one or both of decreasing the first weight and increasing the second weight so as to enhance the level of the second signal relative to the first signal;blending, based on the first weight and the second weight, the first signal and the second signal to generate an enhanced voice signal; andprior to the assigning, aligning the second signal with the first signal, the aligning including applying a spectral alignment filter to the second signal.
US Referenced Citations (295)
Number Name Date Kind
2535063 Halstead Dec 1950 A
3995113 Tani Nov 1976 A
4150262 Ono Apr 1979 A
4455675 Bose et al. Jun 1984 A
4516428 Konomi May 1985 A
4520238 Ikeda May 1985 A
4588867 Konomi May 1986 A
4596903 Yoshizawa Jun 1986 A
4644581 Sapiejewski Feb 1987 A
4652702 Yoshii Mar 1987 A
4696045 Rosenthal Sep 1987 A
4761825 Ma Aug 1988 A
4975967 Rasmussen Dec 1990 A
5208867 Stites, III May 1993 A
5222050 Marren et al. Jun 1993 A
5251263 Andrea et al. Oct 1993 A
5282253 Konomi Jan 1994 A
5289273 Lang Feb 1994 A
5295193 Ono Mar 1994 A
5305387 Sapiejewski Apr 1994 A
5319717 Holesha Jun 1994 A
5327506 Stites, III Jul 1994 A
D360691 Mostardo Jul 1995 S
D360948 Mostardo Aug 1995 S
D360949 Mostardo Aug 1995 S
5490220 Loeppert Feb 1996 A
5734621 Ito Mar 1998 A
5870482 Loeppert et al. Feb 1999 A
D414493 Jiann-Yeong Sep 1999 S
5960093 Miller Sep 1999 A
5983073 Ditzik Nov 1999 A
6044279 Hokao et al. Mar 2000 A
6061456 Andrea et al. May 2000 A
6094492 Boesen Jul 2000 A
6118878 Jones Sep 2000 A
6122388 Feldman Sep 2000 A
6130953 Wilton et al. Oct 2000 A
6184652 Yang Feb 2001 B1
6211649 Matsuda Apr 2001 B1
6219408 Kurth Apr 2001 B1
6255800 Bork Jul 2001 B1
D451089 Hohl et al. Nov 2001 S
6362610 Yang Mar 2002 B1
6373942 Braund Apr 2002 B1
6408081 Boesen Jun 2002 B1
6453289 Ertem Sep 2002 B1
6462668 Foseide Oct 2002 B1
6535460 Loeppert et al. Mar 2003 B2
6567524 Svean et al. May 2003 B1
6661901 Svean et al. Dec 2003 B1
6683965 Sapiejewski Jan 2004 B1
6694180 Boesen Feb 2004 B1
6717537 Fang et al. Apr 2004 B1
6738485 Boesen May 2004 B1
6748095 Goss Jun 2004 B1
6751326 Nepomuceno Jun 2004 B2
6754358 Boesen et al. Jun 2004 B1
6754359 Svean et al. Jun 2004 B1
6757395 Fang et al. Jun 2004 B1
6801632 Olson Oct 2004 B2
6847090 Loeppert Jan 2005 B2
6879698 Boesen Apr 2005 B2
6920229 Boesen Jul 2005 B2
6931292 Brumitt et al. Aug 2005 B1
6937738 Armstrong et al. Aug 2005 B2
6987859 Loeppert et al. Jan 2006 B2
7023066 Lee et al. Apr 2006 B2
7024010 Saunders et al. Apr 2006 B2
7039195 Svean et al. May 2006 B1
7103188 Jones Sep 2006 B1
7127389 Chazan Oct 2006 B2
7132307 Wang et al. Nov 2006 B2
7136500 Collins Nov 2006 B2
7203331 Boesen Apr 2007 B2
7209569 Boesen Apr 2007 B2
7215790 Boesen et al. May 2007 B2
7289636 Saunders et al. Oct 2007 B2
7302074 Wagner et al. Nov 2007 B2
D573588 Warren et al. Jul 2008 S
7406179 Ryan Jul 2008 B2
7433481 Armstrong et al. Oct 2008 B2
7477754 Rasmussen et al. Jan 2009 B2
7477756 Wickstrom et al. Jan 2009 B2
7502484 Ngia Mar 2009 B2
7590254 Olsen Sep 2009 B2
7680292 Warren et al. Mar 2010 B2
7747032 Zei et al. Jun 2010 B2
7773759 Alves et al. Aug 2010 B2
7869610 Jayanth et al. Jan 2011 B2
7889881 Ostrowski Feb 2011 B2
7899194 Boesen Mar 2011 B2
7965834 Alves Jun 2011 B2
7983433 Nemirovski Jul 2011 B2
8005249 Wirola et al. Aug 2011 B2
8019107 Ngia Sep 2011 B2
8027481 Beard Sep 2011 B2
8045724 Sibbald Oct 2011 B2
8072010 Lutz Dec 2011 B2
8077873 Shridhar et al. Dec 2011 B2
8081780 Goldstein et al. Dec 2011 B2
8103029 Ngia et al. Jan 2012 B2
8111853 Isvan Feb 2012 B2
8116489 Mejia et al. Feb 2012 B2
8116502 Saggio, Jr. et al. Feb 2012 B2
8135140 Shridhar et al. Mar 2012 B2
8180067 Soulodre May 2012 B2
8189799 Shridhar et al. May 2012 B2
8194880 Avendano Jun 2012 B2
8199924 Wertz et al. Jun 2012 B2
8213643 Hemer Jul 2012 B2
8213645 Rye et al. Jul 2012 B2
8229125 Short Jul 2012 B2
8229740 Nordholm et al. Jul 2012 B2
8238567 Burge et al. Aug 2012 B2
8249287 Silvestri et al. Aug 2012 B2
8254591 Goldstein et al. Aug 2012 B2
8270626 Shridhar et al. Sep 2012 B2
8285344 Kahn et al. Oct 2012 B2
8295503 Sung et al. Oct 2012 B2
8311253 Silvestri et al. Nov 2012 B2
8315404 Shridhar et al. Nov 2012 B2
8325963 Kimura Dec 2012 B2
8331604 Saito et al. Dec 2012 B2
8363823 Santos Jan 2013 B1
8376967 Mersky Feb 2013 B2
8385560 Solbeck et al. Feb 2013 B2
8401200 Tiscareno et al. Mar 2013 B2
8401215 Warren et al. Mar 2013 B2
8416979 Takai Apr 2013 B2
8462956 Goldstein et al. Jun 2013 B2
8473287 Every et al. Jun 2013 B2
8483418 Platz et al. Jul 2013 B2
8488831 Saggio, Jr. et al. Jul 2013 B2
8494201 Anderson Jul 2013 B2
8498428 Schreuder et al. Jul 2013 B2
8503689 Schreuder et al. Aug 2013 B2
8503704 Francart et al. Aug 2013 B2
8509465 Theverapperuma Aug 2013 B2
8526646 Boesen Sep 2013 B2
8532323 Wickstrom et al. Sep 2013 B2
8553899 Salvetti et al. Oct 2013 B2
8553923 Tiscareno et al. Oct 2013 B2
8571227 Donaldson et al. Oct 2013 B2
8594353 Anderson Nov 2013 B2
8620650 Walters et al. Dec 2013 B2
8634576 Salvetti et al. Jan 2014 B2
8655003 Duisters Feb 2014 B2
8666102 Bruckhoff et al. Mar 2014 B2
8681999 Theverapperuma et al. Mar 2014 B2
8682001 Annunziato et al. Mar 2014 B2
8705787 Larsen Apr 2014 B2
8837746 Burnett Sep 2014 B2
8942976 Li et al. Jan 2015 B2
8983083 Tiscareno et al. Mar 2015 B2
9014382 Van De Par et al. Apr 2015 B2
9025415 Derkx May 2015 B2
9042588 Aase May 2015 B2
9047855 Bakalos Jun 2015 B2
9078064 Wickstrom et al. Jul 2015 B2
9100756 Dusan et al. Aug 2015 B2
9107008 Leitner Aug 2015 B2
9123320 Carreras et al. Sep 2015 B2
9154868 Narayan et al. Oct 2015 B2
9167337 Shin Oct 2015 B2
9185487 Solbach et al. Nov 2015 B2
9208769 Azmi Dec 2015 B2
9226068 Hendrix et al. Dec 2015 B2
9264823 Bajic et al. Feb 2016 B2
9401158 Yen Jul 2016 B1
20010011026 Nishijima Aug 2001 A1
20010021659 Okamura Sep 2001 A1
20010049262 Lehtonen Dec 2001 A1
20020016188 Kashiwamura Feb 2002 A1
20020021800 Bodley et al. Feb 2002 A1
20020038394 Liang et al. Mar 2002 A1
20020054684 Menzl May 2002 A1
20020056114 Fillebrown et al. May 2002 A1
20020067825 Baranowski et al. Jun 2002 A1
20020098877 Glezerman Jul 2002 A1
20020136420 Topholm Sep 2002 A1
20020159023 Swab Oct 2002 A1
20020176330 Ramonowski et al. Nov 2002 A1
20020183089 Heller et al. Dec 2002 A1
20030002704 Pronk Jan 2003 A1
20030013411 Uchiyama Jan 2003 A1
20030017805 Yeung et al. Jan 2003 A1
20030058808 Eaton et al. Mar 2003 A1
20030085070 Wickstrom May 2003 A1
20030198357 Schneider Oct 2003 A1
20030207703 Liou et al. Nov 2003 A1
20030223592 Deruginsky et al. Dec 2003 A1
20050027522 Yamamoto et al. Feb 2005 A1
20050222842 Zakarauskas Oct 2005 A1
20060029234 Sargaison Feb 2006 A1
20060034472 Bazarjani et al. Feb 2006 A1
20060153155 Jacobsen et al. Jul 2006 A1
20060227990 Kirchhoefer Oct 2006 A1
20060239472 Oda Oct 2006 A1
20070104340 Miller et al. May 2007 A1
20070147635 Dijkstra et al. Jun 2007 A1
20080019548 Avendano Jan 2008 A1
20080037801 Alves Feb 2008 A1
20080063228 Mejia et al. Mar 2008 A1
20080101640 Ballad et al. May 2008 A1
20080181419 Goldstein Jul 2008 A1
20080232621 Burns Sep 2008 A1
20080260180 Goldstein Oct 2008 A1
20090010456 Goldstein Jan 2009 A1
20090034765 Boillot Feb 2009 A1
20090041269 Hemer Feb 2009 A1
20090067661 Keady Mar 2009 A1
20090080670 Solbeck et al. Mar 2009 A1
20090147966 McIntosh Jun 2009 A1
20090182913 Rosenblatt et al. Jul 2009 A1
20090207703 Matsumoto et al. Aug 2009 A1
20090214068 Wickstrom Aug 2009 A1
20090264161 Usher Oct 2009 A1
20090323982 Solbach et al. Dec 2009 A1
20100022280 Schrage Jan 2010 A1
20100074451 Usher Mar 2010 A1
20100081487 Chen et al. Apr 2010 A1
20100183167 Phelps et al. Jul 2010 A1
20100233996 Herz et al. Sep 2010 A1
20100270631 Renner Oct 2010 A1
20110125063 Shalon May 2011 A1
20110125491 Alves May 2011 A1
20110257967 Every et al. Oct 2011 A1
20110293103 Park Dec 2011 A1
20120008808 Saltykov Jan 2012 A1
20120020505 Yamada Jan 2012 A1
20120056282 Van Lippen et al. Mar 2012 A1
20120099753 van der Avoort et al. Apr 2012 A1
20120197638 Li et al. Aug 2012 A1
20120321103 Smailagic et al. Dec 2012 A1
20130024194 Zhao et al. Jan 2013 A1
20130051580 Miller Feb 2013 A1
20130058495 Furst et al. Mar 2013 A1
20130070935 Hui et al. Mar 2013 A1
20130142358 Schultz et al. Jun 2013 A1
20130272564 Miller Oct 2013 A1
20130287219 Hendrix et al. Oct 2013 A1
20130315415 Shin Nov 2013 A1
20130322642 Streitenberger et al. Dec 2013 A1
20130343580 Lautenschlager et al. Dec 2013 A1
20130345842 Karakaya et al. Dec 2013 A1
20140010378 Voix et al. Jan 2014 A1
20140044275 Goldstein et al. Feb 2014 A1
20140086425 Jensen et al. Mar 2014 A1
20140169579 Azmi Jun 2014 A1
20140177869 Percy Jun 2014 A1
20140233741 Gustavsson Aug 2014 A1
20140254825 Tahernezhadi Sep 2014 A1
20140270231 Dusan et al. Sep 2014 A1
20140273851 Donaldson et al. Sep 2014 A1
20140314238 Usher Oct 2014 A1
20140348346 Fukuda Nov 2014 A1
20140355787 Jiles et al. Dec 2014 A1
20140369517 Zhou Dec 2014 A1
20150025881 Carlos et al. Jan 2015 A1
20150043741 Shin Feb 2015 A1
20150055810 Shin Feb 2015 A1
20150078574 Shin Mar 2015 A1
20150110280 Wardle Apr 2015 A1
20150131814 Usher May 2015 A1
20150161981 Kwatra Jun 2015 A1
20150172814 Usher Jun 2015 A1
20150215701 Usher Jul 2015 A1
20150237448 Loeppert Aug 2015 A1
20150243271 Goldstein Aug 2015 A1
20150245129 Dusan et al. Aug 2015 A1
20150264472 Aase Sep 2015 A1
20150296305 Shao et al. Oct 2015 A1
20150296306 Shao et al. Oct 2015 A1
20150304770 Watson et al. Oct 2015 A1
20150310846 Andersen et al. Oct 2015 A1
20150325229 Carreras et al. Nov 2015 A1
20150325251 Dusan et al. Nov 2015 A1
20150365770 Lautenschlager Dec 2015 A1
20150382094 Grinker et al. Dec 2015 A1
20160007119 Harrington Jan 2016 A1
20160021480 Johnson et al. Jan 2016 A1
20160029345 Sebeni et al. Jan 2016 A1
20160037261 Harrington Feb 2016 A1
20160037263 Pal et al. Feb 2016 A1
20160042666 Hughes Feb 2016 A1
20160044151 Shoemaker et al. Feb 2016 A1
20160044398 Siahaan et al. Feb 2016 A1
20160044424 Dave et al. Feb 2016 A1
20160060101 Loeppert Mar 2016 A1
20160105748 Pal et al. Apr 2016 A1
20160112811 Jensen Apr 2016 A1
20160150335 Qutub et al. May 2016 A1
20160155453 Harvey Jun 2016 A1
20160165334 Grossman Jun 2016 A1
20160165361 Miller et al. Jun 2016 A1
Foreign Referenced Citations (54)
Number Date Country
204119490 Jan 2015 CN
204145685 Feb 2015 CN
204168483 Feb 2015 CN
204669605 Sep 2015 CN
204681587 Sep 2015 CN
204681593 Sep 2015 CN
ZL2015203769650 Sep 2015 CN
ZL2015204747042 Sep 2015 CN
ZL2015204903074 Sep 2015 CN
915826 Jul 1954 DE
3723275 Mar 1988 DE
102009051713 May 2011 DE
102011003470 Aug 2012 DE
0124870 Nov 1984 EP
0500985 Sep 1992 EP
0684750 Nov 1995 EP
0806909 Nov 1997 EP
1299988 Apr 2003 EP
1509065 Feb 2005 EP
1310136 Mar 2006 EP
1469701 Apr 2008 EP
2434780 Mar 2012 EP
S5888996 May 1983 JP
S60103798 Jun 1985 JP
2007150743 Jun 2007 JP
2012169828 Sep 2012 JP
5049312 Oct 2012 JP
20110058769 Jun 2011 KR
101194904 Oct 2012 KR
1020140026722 Mar 2014 KR
WO8303733 Oct 1983 WO
WO9407342 Mar 1994 WO
WO9623443 Aug 1996 WO
WO0025551 May 2000 WO
WO0217835 Mar 2002 WO
WO0217836 Mar 2002 WO
WO0217837 Mar 2002 WO
WO0217838 Mar 2002 WO
WO0217839 Mar 2002 WO
WO03073790 Sep 2003 WO
WO2006114767 Nov 2006 WO
WO2007073818 Jul 2007 WO
WO2007082579 Jul 2007 WO
WO2007147416 Dec 2007 WO
WO2008128173 Oct 2008 WO
WO2009012491 Jan 2009 WO
WO2009023784 Feb 2009 WO
WO2011051469 May 2011 WO
WO2011061483 May 2011 WO
WO2013033001 Mar 2013 WO
WO-2014022359 Feb 2014 WO
WO2016085814 Jun 2016 WO
WO2016089671 Jun 2016 WO
WO2016089745 Jun 2016 WO
Non-Patent Literature Citations (31)
Entry
Westerlund et al., “In-ear Microphone Equalization Exploiting an Active Noise Control.” Proceedings of Internoise 2001, Aug. 2001, pp. 1-6. 17.
Yen, Kuan-Chieh et al., “Microphone Signal Fusion”, U.S. Appl. No. 14/853,947, filed Sep. 14, 2015.
Yen, Kuan-Chieh et al., “Audio Monitoring and Adaptation Using Headset Microphones Inside User's Ear Canal”, U.S. Appl. No. 14/985,187, filed Dec. 30, 2015.
Gadonniex, Sharon et al., “Occlusion Reduction and Active Noise Reduction Based on Seal Quality”, U.S. Appl. No. 14/985,057, filed Dec. 30, 2015.
Miller, Thomas E. et al., “Voice-Enhanced Awareness Mode”, U.S. Appl. No. 14/985,112, filed Dec. 30, 2015.
Verma, Tony, “Context Aware False Acceptance Rate Reduction”, U.S. Appl. No. 14/749,425, filed Jun. 24, 2015.
Notice of Allowance, dated Mar. 21, 2016, U.S. Appl. No. 14/853,947, filed Sep. 14, 2015.
Final Office Action, dated May 12, 2016, U.S. Appl. No. 13/224,068, filed Sep. 1, 2011.
Ephraim, Y. et al., “Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-32, No. 6, Dec. 1984, pp. 1109-1121.
Sun et al., “Robust Noise Estimation Using Minimum Correction with Harmonicity Control.” Conference: Interspeech 2010, 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, Sep. 26-30, 2010. p. 1085-1088.
Lomas, “Apple Patents Earbuds With Noise-Canceling Sensor Smarts,” Aug. 27, 2015. [retrieved on Sep. 16, 2015]. TechCrunch. Retrieved from the Internet: <URL: http://techcrunch.com/2015/08/27/apple-wireless-earbuds-at-last/>. 2 pages.
Smith, Gina, “New Apple Patent Applications: The Sound of Hearables to Come,” aNewDomain, Feb. 12, 2016, accessed Mar. 2, 2016 at URL: <http://anewdomain.net/2016/02/12/new-apple-patent-applications-glimpse-hearables-come/>.
Qutub, Sarmad et al., “Acoustic Apparatus with Dual MEMS Devices,” U.S. Appl. No. 14/872,887, filed Oct. 1, 2015.
Office Action dated Feb. 4, 2016 in U.S. Appl. No. 14/318,436, filed Jun. 27, 2014.
Office Action dated Jan. 22, 2016 in U.S. Appl. No. 14/774,666, filed Sep. 10, 2015.
International Search Report and Written Opinion for Patent Cooperation Treaty Application No. PCT/US2015/062940 dated Mar. 28, 2016 (10 pages).
International Search Report and Written Opinion for Patent Cooperation Treaty Application No. PCT/US2015/062393 dated Apr. 8, 2016 (9 pages).
International Search Report and Written Opinion for Patent Cooperation Treaty Application No. PCT/US2015/061871 dated Mar. 29, 2016 (9 pages).
Hegde, Nagaraj, “Seamlessly Interfacing MEMS Microphones with BlackfinTM Processors”, EE350 Analog Devices, Rev. 1, Aug. 2010, pp. 1-10.
Korean Office Action regarding Application No. 10-2014-7008553, dated May 21, 2015.
Written Opinion of the International Searching Authority and International Search Report mailed Jan. 21, 2013 in Patent Cooperation Treaty Application No. PCT/US2012/052478, filed Aug. 27, 2012.
Langberg, Mike, “Bluelooth Sharpens Its Connections,” Chicago Tribune, Apr. 29, 2002, Business Section, p. 3, accessed Mar. 11, 2016 at URL: <http://articles.chicagotribune.com/2002-04-29/business/0204290116—1—bluetooth-enabled-bluetooth-headset-bluetooth-devices>.
Duplan Corporaton vs. Deering Milliken decision, 197 USPQ 342.
Combined Bluetooth Headset and USB Dongle, Advance Information, RTX Telecom A/S, vol. 1, Apr. 6, 2002.
Notice of Allownace, dated Sep. 27, 2012, U.S. Appl. No. 13/568,989, filed Aug. 7, 2012.
Non-Final Office Action, dated Sep. 23, 2015, U.S. Appl. No. 13/224,068, filed Sep. 1, 2011.
Non-Final Office Action, dated Mar. 10, 2004, U.S. Appl. No. 10/138,929, filed May 3, 2002.
Final Office Action, dated Jan. 12, 2005, U.S. Appl. No. 10/138,929, filed May 3, 2002.
Non-Final Office Action, dated Jan. 12, 2006, U.S. Appl. No. 10/138,929, filed May 3, 2002.
Non-Final Office Action, dated Nov. 4, 2015, U.S. Appl. No. 14/853,947, filed Sep. 14, 2015.
International Search Report and Written Opinion, PCT/US2016/069094, Knowles Electronics, LLC, 11 pages (dated May 23, 2017).
Related Publications (1)
Number Date Country
20170221501 A1 Aug 2017 US