Speech signal separation and synthesis based on auditory scene analysis and speech modeling

Information

  • Patent Grant
  • Patent Number
    9,536,540
  • Date Filed
    Friday, July 18, 2014
  • Date Issued
    Tuesday, January 3, 2017
  • CPC
  • Field of Search
    • US
      • 704/9
      • 704/200
      • 704/247
      • 704/251
      • 704/275
    • CPC
      • G10L15/005
      • G10L15/22
      • G10L17/22
  • International Classifications
    • G10L21/0272
  • Term Extension
    26
Abstract
Provided are systems and methods for generating clean speech from a speech signal representing a mixture of noise and speech. The clean speech may be generated from synthetic speech parameters. The synthetic speech parameters are derived from the speech signal components and a model of speech using auditory and speech production principles. The modeling may utilize a source-filter structure of the speech signal. One or more spectral analyses are performed on the speech signal to generate spectral representations. Feature data are derived from the spectral representations. The features corresponding to the target speech are grouped according to a model of speech and separated from the feature data. The synthetic speech parameters, including a spectral envelope, pitch data, and voice classification data, are generated based on the features corresponding to the target speech.
Description
TECHNICAL FIELD

The present disclosure relates generally to audio processing, and, more particularly, to generating clean speech from a mixture of noise and speech.


BACKGROUND

Current noise suppression techniques, such as Wiener filtering, attempt to improve the global signal-to-noise ratio (SNR) and attenuate low-SNR regions, thus introducing distortion into the speech signal. It is common practice to perform such filtering as a magnitude modification in a transform domain. Typically, the corrupted signal is used to reconstruct the signal with the modified magnitude. This approach may miss signal components dominated by noise, thereby resulting in undesirable and unnatural spectro-temporal modulations.
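
As a minimal sketch of this conventional approach (assuming an STFT-domain signal and an externally supplied noise power estimate, both hypothetical inputs), the modified magnitude is obtained by applying a Wiener-style gain per bin while the corrupted signal's phase is reused:

```python
import numpy as np

def wiener_gain_enhance(noisy_stft, noise_psd, floor=0.1):
    """Conventional transform-domain enhancement: scale each noisy STFT bin
    by a Wiener-style gain and keep the noisy phase for reconstruction.
    noisy_stft: complex array (freq_bins, frames)
    noise_psd:  estimated noise power per bin (freq_bins, 1) -- assumed given
    """
    noisy_psd = np.abs(noisy_stft) ** 2
    # A-priori SNR approximated by the (clipped) a-posteriori SNR.
    snr = np.maximum(noisy_psd / (noise_psd + 1e-12) - 1.0, 0.0)
    gain = snr / (snr + 1.0)        # Wiener gain in [0, 1]
    gain = np.maximum(gain, floor)  # limit attenuation (distortion trade-off)
    # Magnitude is modified; the corrupted phase is reused for reconstruction.
    return gain * noisy_stft
```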


When the target signal is dominated by noise, a system that synthesizes a clean speech signal instead of enhancing the corrupted audio via modifications is advantageous for achieving high signal-to-noise ratio improvement (SNRI) values and low signal distortion.


SUMMARY

This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.


According to an aspect of the present disclosure, a method is provided for generating clean speech from a mixture of noise and speech. The method may include deriving synthetic speech parameters based on the mixture of noise and speech and a model of speech, and synthesizing clean speech based at least partially on the speech parameters.


In some embodiments, deriving the speech parameters commences with performing one or more spectral analyses on the mixture of noise and speech to generate one or more spectral representations. The one or more spectral representations can then be used for deriving feature data. The features corresponding to the target speech may then be grouped according to the model of speech and separated from the feature data. Analysis of feature representations may allow segmentation and grouping of speech component candidates. In certain embodiments, candidates for the features corresponding to target speech are evaluated by a multi-hypothesis tracking system aided by the model of speech. The synthetic speech parameters can be generated based partially on features corresponding to the target speech.


In some embodiments, the generated synthetic speech parameters include spectral envelope and voicing information. The voicing information may include pitch data and voice classification data. In some embodiments, the spectral envelope is estimated from a sparse spectral envelope.


In various embodiments, the method includes determining, based on a noise model, non-speech components in the feature data. The non-speech components as determined may be used in part to discriminate between speech components and noise components.


In various embodiments, the speech components may be used to determine pitch data. In some embodiments, the non-speech components may also be used in the pitch determination. (For instance, knowledge about where noise components occlude speech components may be used.) The pitch data may be interpolated to fill missing frames before synthesizing clean speech, where a missing frame is a frame for which a reliable pitch estimate could not be determined.


In some embodiments, the method includes generating, based on the pitch data, a harmonic map representing voiced speech. The method may further include estimating a map for unvoiced speech based on the non-speech components from feature data and the harmonic map. The harmonic map and map for unvoiced speech may be used to generate a mask for extracting the sparse spectral envelope from the spectral representation of the mixture of noise and speech.


In further example embodiments of the present disclosure, the method steps are stored on a machine-readable medium comprising instructions, which, when implemented by one or more processors, perform the recited steps. In yet further example embodiments, hardware systems, or devices can be adapted to perform the recited steps. Other features, examples, and embodiments are described below.





BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments are illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements and in which:



FIG. 1 shows an example system suitable for implementing various embodiments of the methods for generating clean speech from a mixture of noise and speech.



FIG. 2 illustrates a system for speech processing, according to an example embodiment.



FIG. 3 illustrates a system for separation and synthesis of a speech signal, according to an example embodiment.



FIG. 4 shows an example of a voiced frame.



FIG. 5 is a time-frequency plot of sparse envelope estimation for voiced frames, according to an example embodiment.



FIG. 6 shows an example of envelope estimation.



FIG. 7 is a diagram illustrating a speech synthesizer, according to an example embodiment.



FIG. 8A shows example synthesis parameters for a clean female speech sample.



FIG. 8B is a close-up of FIG. 8A showing example synthesis parameters for a clean female speech sample.



FIG. 9 illustrates an input and an output of a system for separation and synthesis of speech signals, according to an example embodiment.



FIG. 10 illustrates an example method for generating clean speech from a mixture of noise and speech.



FIG. 11 illustrates an example computer system that may be used to implement embodiments of the present technology.





DETAILED DESCRIPTION

The following detailed description includes references to the accompanying drawings, which form a part of the detailed description. The drawings show illustrations in accordance with exemplary embodiments. These exemplary embodiments, which are also referred to herein as “examples,” are described in enough detail to enable those skilled in the art to practice the present subject matter. The embodiments can be combined, other embodiments can be utilized, or structural, logical, and electrical changes can be made without departing from the scope of what is claimed. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope is defined by the appended claims and their equivalents.


Provided are systems and methods that allow generating clean speech from a mixture of noise and speech. Embodiments described herein can be practiced on any device that is configured to receive and/or provide a speech signal, including, but not limited to, personal computers (PCs), tablet computers, mobile devices, cellular phones, phone handsets, headsets, media devices, internet-connected (internet-of-things) devices, and systems for teleconferencing applications. The technologies of the current disclosure may also be used in personal hearing devices, non-medical hearing aids, hearing aids, and cochlear implants.


According to various embodiments, the method for generating a clean speech signal from a mixture of noise and speech includes estimating speech parameters from a noisy mixture using auditory (e.g., perceptual) and speech production principles (e.g., separation of source and filter components). The estimated parameters are then used for synthesizing clean speech or can potentially be used in other applications where the speech signal may not necessarily be synthesized but where certain parameters or features corresponding to the clean speech signal are needed (e.g., automatic speech recognition and speaker identification).



FIG. 1 shows an example system 100 suitable for implementing methods for the various embodiments described herein. In some embodiments, the system 100 comprises a receiver 110, a processor 120, a microphone 130, an audio processing system 140, and an output device 150. The system 100 may comprise more or other components to provide a particular operation or functionality. Similarly, the system 100 may comprise fewer components that perform similar or equivalent functions to those depicted in FIG. 1. In addition, elements of system 100 may be cloud-based, including but not limited to, the processor 120.


The receiver 110 can be configured to communicate with a network such as the Internet, Wide Area Network (WAN), Local Area Network (LAN), cellular network, and so forth, to receive an audio data stream, which may comprise one or more channels of audio data. The received audio data stream may then be forwarded to the audio processing system 140 and the output device 150.


The processor 120 may include hardware and software that implement the processing of audio data and various other operations depending on a type of the system 100 (e.g., communication device or computer). A memory (e.g., non-transitory computer readable storage medium) may store, at least in part, instructions and data for execution by processor 120.


The audio processing system 140 includes hardware and software that implement the methods according to various embodiments disclosed herein. The audio processing system 140 is further configured to receive acoustic signals from an acoustic source via microphone 130 (which may be one or more microphones or acoustic sensors) and process the acoustic signals. After reception by the microphone 130, the acoustic signals may be converted into electric signals by an analog-to-digital converter.


The output device 150 includes any device that provides an audio output to a listener (e.g., the acoustic source). For example, the output device 150 may comprise a speaker, a class-D output, an earpiece of a headset, or a handset on the system 100.



FIG. 2 shows a system 200 for speech processing, according to an example embodiment. The example system 200 includes at least an analysis module 210, a feature estimation module 220, a grouping module 230, and a speech information extraction and modeling module 240. In certain embodiments, the system 200 includes a speech synthesis module 250. In other embodiments, the system 200 includes a speaker recognition module 260. In yet further embodiments, the system 200 includes an automatic speech recognition module 270.


In some embodiments, the analysis module 210 is operable to receive one or more time-domain speech input signals. The speech input can be analyzed with a multi-resolution front end that yields spectral representations at various predetermined time-frequency resolutions.
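
The patent does not specify window lengths; as an illustration only, a multi-resolution front end could compute several STFTs of the same input with different (assumed) window sizes, where longer windows resolve harmonic structure and shorter windows resolve transients:

```python
import numpy as np
from scipy.signal import stft

def multi_resolution_analysis(x, fs, win_lengths_ms=(64, 32, 8)):
    """Compute STFTs of the same signal at several time-frequency
    resolutions. The window lengths are illustrative, not from the patent."""
    representations = {}
    for win_ms in win_lengths_ms:
        nperseg = int(fs * win_ms / 1000)
        # Long windows favor narrowband (tonal) detail, short windows favor transients.
        f, t, spec = stft(x, fs=fs, nperseg=nperseg, noverlap=nperseg // 2)
        representations[win_ms] = (f, t, spec)
    return representations
```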


In some embodiments, the feature estimation module 220 receives various analysis data from the analysis module 210. Signal features can be derived from the various analyses according to the type of feature (for example, a narrowband spectral analysis for tone detection and a wideband spectral analysis for transient detection) to generate a multi-dimensional feature space.


In various embodiments, the grouping module 230 receives the feature data from the feature estimation module 220. The features corresponding to target speech may then be grouped according to auditory scene analysis principles (e.g., common fate) and separated from the features of the interference or noise. In certain embodiments, in the case of multi-talker input or other speech-like distractors, a multi-hypothesis grouper can be used for scene organization.


In some embodiments, the order of the grouping module 230 and feature estimation module 220 may be reversed, such that grouping module 230 groups the spectral representation (e.g., from analysis module 210) before the feature data is derived in feature estimation module 220.


A resultant sparse multi-dimensional feature set may be passed from the grouping module 230 to the speech information extraction and modeling module 240. The speech information extraction and modeling module 240 can be operable to generate output parameters representing the target speech in the noisy speech input.


In some embodiments, the output of the speech information extraction and modeling module 240 includes synthesis parameters and acoustic features. In certain embodiments, the synthesis parameters are passed to the speech synthesis module 250 for synthesizing clean speech output. In other embodiments, the acoustic features generated by speech information extraction and modeling module 240 are passed to the automatic speech recognition module 270 or the speaker recognition module 260.



FIG. 3 shows a system 300 for speech processing, specifically, speech separation and synthesis for noise suppression, according to another example embodiment. The system 300 may include a multi-resolution analysis (MRA) module 310, a noise model module 320, a pitch estimation module 330, a grouping module 340, a harmonic map unit 350, a sparse envelope unit 360, a speech envelope model module 370, and a synthesis module 380.


In some embodiments, the MRA module 310 receives the speech input signal. The speech input signal can be contaminated by additive noise and room reverberation. The MRA module 310 can be operable to generate one or more short-time spectral representations.


This short-time analysis from the MRA module 310 can be initially used for deriving an estimate of the background noise via the noise model module 320. The noise estimate can then be used for grouping in grouping module 340 and to improve the robustness of pitch estimation in pitch estimation module 330. The pitch track generated by the pitch estimation module 330, including a voicing decision, may be used for generating a harmonic map (at the harmonic map unit 350) and as an input to the synthesis module 380.


In some embodiments, the harmonic map (which represents the voiced speech), from the harmonic map unit 350, and the noise model, from the noise model module 320, are used for estimating a map of unvoiced speech (i.e., the difference between the input and the noise model in a non-voiced frame). The voiced and unvoiced maps may then be grouped (at the grouping module 340) and used to generate a mask for extracting a sparse envelope (at the sparse envelope unit 360) from the input signal representation. Finally, the speech envelope model module 370 may estimate the spectral envelope (ENV) from the sparse envelope and may feed the ENV to the speech synthesizer (e.g., synthesis module 380), which together with the voicing information (pitch F0 and voicing classification such as voiced/unvoiced (V/U)) from the pitch estimation module 330) can generate the final speech output.
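
As a hedged sketch of how the voiced and unvoiced maps might be combined into an extraction mask, with boolean time-frequency maps and a simple logical OR standing in for the grouping step (both assumptions, not the patent's procedure):

```python
import numpy as np

def extraction_mask(harmonic_map, unvoiced_map):
    """Combine a voiced (harmonic) map and an unvoiced-speech map, both
    boolean arrays of shape (freq_bins, frames), into a single speech mask."""
    return np.logical_or(harmonic_map, unvoiced_map)

def sparse_envelope_from_mask(spec, mask):
    """Keep only the time-frequency points selected by the mask; everything
    else is zeroed, yielding a sparse representation of the speech envelope."""
    return np.where(mask, np.abs(spec), 0.0)
```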


In some embodiments, the system of FIG. 3 is based on both human auditory perception and speech production principles. In certain embodiments, the analysis and processing are performed for envelope and excitation separately (but not necessarily independently). According to various embodiments, speech parameters (i.e., envelope and voicing in this instance) are extracted from the noisy observation and the estimates are used to generate clean speech via the synthesizer.


Noise Modeling

The noise model module 320 may identify and extract non-speech components from the audio input. This may be achieved by generating a multi-dimensional representation, such as a cortical representation, for example, where discrimination between speech and non-speech is possible. Some background on cortical representations is provided in M. Elhilali and S. A. Shamma, “A cocktail party with a cortical twist: How cortical mechanisms contribute to sound segregation,” J. Acoust. Soc. Am. 124(6): 3751-3771 (December 2008), the disclosure of which is incorporated herein by reference in its entirety.


In the example system 300, the multi-resolution analysis may be used for estimating the noise by noise model module 320. Voicing information such as pitch may be used in the estimation to discriminate between speech and noise components. For broadband stationary noise, a modulation-domain filter may be implemented for estimating and extracting the slowly-varying (low modulation) components characteristic of the noise but not of the target speech. In some embodiments, alternate noise modeling approaches such as minimum statistics may be used.
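
The modulation-domain filter itself is not detailed here; as an illustrative sketch of the alternate minimum-statistics idea mentioned above, a noise floor can be tracked as the minimum of the recursively smoothed sub-band power over a sliding window (the smoothing constant and window length are assumptions):

```python
import numpy as np

def minimum_statistics_noise(power_spec, alpha=0.9, win_frames=100):
    """Rough minimum-statistics noise estimate (illustrative only).
    power_spec: (freq_bins, frames) magnitude-squared spectrogram."""
    bins, frames = power_spec.shape
    smoothed = np.empty_like(power_spec)
    smoothed[:, 0] = power_spec[:, 0]
    for m in range(1, frames):
        # Recursive smoothing of the noisy power in each sub-band.
        smoothed[:, m] = alpha * smoothed[:, m - 1] + (1 - alpha) * power_spec[:, m]
    noise_est = np.empty_like(power_spec)
    for m in range(frames):
        lo = max(0, m - win_frames + 1)
        # Track the minimum of the smoothed power over a sliding window;
        # slowly varying noise dominates these minima, speech does not.
        noise_est[:, m] = smoothed[:, lo:m + 1].min(axis=1)
    return noise_est
```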


Pitch Analysis and Tracking

The pitch estimation module 330 can be implemented based on autocorrelogram features. Some background on autocorrelogram features is provided in Z. Jin and D. Wang, “HMM-Based Multipitch Tracking for Noisy and Reverberant Speech,” IEEE Transactions on Audio, Speech, and Language Processing, 19(5):1091-1102 (July 2011), the disclosure of which is incorporated herein by reference in its entirety. Multi-resolution analysis may be used to extract pitch information from both resolved harmonics (narrowband analysis) and unresolved harmonics (wideband analysis). The noise estimate can be incorporated to refine pitch cues by discarding unreliable sub-bands where the signal is dominated by noise. In some embodiments, a Bayesian filter or Bayesian tracker (for example, a hidden Markov model (HMM)) is then used to integrate per-frame pitch cues with temporal constraints in order to generate a continuous pitch track. The resulting pitch track may then be used for estimating a harmonic map that highlights time-frequency regions where harmonic energy is present. In some embodiments, suitable alternate pitch estimation and tracking methods, other than methods based on autocorrelogram features, are used.
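
As a simplified, hypothetical stand-in for the autocorrelogram analysis (not the HMM tracker itself), a per-frame pitch cue can be taken from the peak of the normalized frame autocorrelation within a plausible lag range, with the peak height serving as a saliency measure:

```python
import numpy as np

def frame_pitch_cue(frame, fs, f0_min=60.0, f0_max=400.0):
    """Return (f0_estimate, saliency) for one analysis frame.
    Simplified autocorrelation peak picking; the search range is illustrative."""
    frame = frame - np.mean(frame)
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    if ac[0] <= 0:
        return 0.0, 0.0
    ac = ac / ac[0]                      # normalize so saliency is in [0, 1]
    lag_min = int(fs / f0_max)
    lag_max = min(int(fs / f0_min), len(ac) - 1)
    if lag_max <= lag_min:
        return 0.0, 0.0
    lag = lag_min + int(np.argmax(ac[lag_min:lag_max]))
    saliency = float(ac[lag])            # how "pitched" the frame looks
    return fs / lag, saliency
```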


For synthesis, the pitch track may be interpolated for missing frames and smoothed to create a more natural speech contour. In some embodiments, a statistical pitch contour model is used for interpolation/extrapolation and smoothing. Voicing information may be derived from the saliency and confidence of the pitch estimates.
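
One way the interpolation and smoothing could be realized (a sketch only, not the statistical contour model) is to fill frames lacking a reliable estimate by interpolating between neighboring voiced frames and then applying a short moving average:

```python
import numpy as np

def interpolate_pitch_track(f0, voiced, smooth_frames=5):
    """f0: per-frame pitch values (Hz); voiced: boolean mask of frames with a
    reliable estimate. Missing frames are filled by interpolation and the
    result is lightly smoothed toward a more natural, continuous contour."""
    f0 = np.asarray(f0, dtype=float)
    voiced = np.asarray(voiced, dtype=bool)
    if not voiced.any():
        return np.zeros_like(f0)
    frames = np.arange(len(f0))
    # Interpolate over unvoiced gaps; edges are held at the nearest voiced value.
    filled = np.interp(frames, frames[voiced], f0[voiced])
    kernel = np.ones(smooth_frames) / smooth_frames
    return np.convolve(filled, kernel, mode="same")
```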


Sparse Envelope Extraction

Once the voiced speech and background noise regions are identified, an estimate of the unvoiced speech regions may be derived. In some embodiments, a feature region is declared unvoiced if the frame is not voiced (a determination that may be based, e.g., on pitch saliency, a measure of how pitched the frame is) and the signal does not conform to the noise model, e.g., the signal level (or energy) exceeds a noise threshold or the signal representation falls outside the noise model region in the feature space.
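
A hedged sketch of such a decision rule for a single frame, with the saliency threshold and SNR margin as illustrative parameters rather than values from the patent:

```python
import numpy as np

def is_unvoiced_speech(frame_energy, noise_energy, pitch_saliency,
                       saliency_thresh=0.4, snr_margin_db=6.0):
    """Declare a frame 'unvoiced speech' when it is not pitched enough to be
    voiced, yet its energy clearly exceeds the noise model. The thresholds
    are illustrative assumptions."""
    not_voiced = pitch_saliency < saliency_thresh
    snr_db = 10.0 * np.log10(frame_energy / (noise_energy + 1e-12) + 1e-12)
    above_noise = snr_db > snr_margin_db
    return bool(not_voiced and above_noise)
```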


The voicing information may be used to identify and select the harmonic spectral peaks corresponding to the pitch estimate. The spectral peaks found in this process may be stored for creating the sparse envelope.


For unvoiced frames, all spectral peaks may be identified and added to the sparse envelope signal. An example for a voiced frame is shown in FIG. 4. FIG. 5 is an exemplary time-frequency plot of the sparse envelope estimation for a voiced frame.
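
For a voiced frame, one simple way to collect the sparse envelope samples (a sketch assuming the frame's magnitude spectrum and pitch estimate are already available) is to keep the largest spectral peak in a small neighborhood of each harmonic of F0:

```python
import numpy as np

def sparse_envelope_voiced(mag_spec, fs, f0, search_semitones=1.0):
    """Pick one spectral peak per harmonic of f0 from a frame's magnitude
    spectrum (length n_fft // 2 + 1). Returns (bin_indices, magnitudes)."""
    if f0 <= 0:
        return np.array([], dtype=int), np.array([])
    n_bins = len(mag_spec)
    bin_hz = fs / (2.0 * (n_bins - 1))
    bins, mags = [], []
    k = 1
    while k * f0 < fs / 2:
        center = int(round(k * f0 / bin_hz))
        # Search a small neighborhood around the nominal harmonic bin.
        half = max(1, int(center * (2 ** (search_semitones / 12.0) - 1)))
        lo, hi = max(0, center - half), min(n_bins, center + half + 1)
        peak = lo + int(np.argmax(mag_spec[lo:hi]))
        bins.append(peak)
        mags.append(mag_spec[peak])
        k += 1
    return np.array(bins), np.array(mags)
```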


Spectral Envelope Modeling

The spectral envelope may be derived from the sparse envelope by interpolation. Many methods can be applied here, from simple two-dimensional mesh interpolation (e.g., image processing techniques) to more sophisticated data-driven methods that may yield more natural and undistorted speech.


In the example shown in FIG. 6, cubic interpolation in the logarithmic domain is applied on a per-frame basis to the sparse spectrum to obtain a smooth spectral envelope. Using this approach, the fine structure due to the excitation may be removed or minimized. Where noise exceeds the speech harmonics, the envelope may be assigned a weighted value based on some suppression law (e.g., Wiener filter) or based on a speech envelope model.
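
An illustrative per-frame realization with SciPy is shown below; the sparse bin indices and magnitudes would come from the sparse-envelope step, and the anchoring of the band edges and the floor value are assumptions:

```python
import numpy as np
from scipy.interpolate import interp1d

def smooth_envelope_from_sparse(sparse_bins, sparse_mags, n_bins, floor_db=-80.0):
    """Interpolate a smooth log-magnitude spectral envelope through one frame's
    sparse peak samples (cubic interpolation in the log domain).
    Assumes at least four sparse samples are available."""
    log_mags = 20.0 * np.log10(np.maximum(sparse_mags, 1e-9))
    # Anchor the band edges so the interpolant covers the full frequency range.
    x = np.concatenate(([0], sparse_bins, [n_bins - 1]))
    y = np.concatenate(([log_mags[0]], log_mags, [log_mags[-1]]))
    x, idx = np.unique(x, return_index=True)  # interp1d needs strictly increasing x
    f = interp1d(x, y[idx], kind="cubic", fill_value="extrapolate")
    env_db = np.maximum(f(np.arange(n_bins)), floor_db)
    return 10.0 ** (env_db / 20.0)            # back to linear magnitude
```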


Speech Synthesis


FIG. 7 is a block diagram of a speech synthesizer 700, according to an example embodiment. The example speech synthesizer 700 can include a Linear Predictive Coding (LPC) Modeling block 710, a Pulse block 720, a White Gaussian Noise (WGN) block 730, a Perturbation Modeling block 760, Perturbation filters 740 and 750, and a Synthesis filter 780.


Once the pitch track and the spectral envelope are computed, a clean speech utterance may be synthesized. With these parameters, a mixed-excitation synthesizer may be implemented as follows. The spectral envelope (ENV) may be modeled by a high-order Linear Predictive Coding (LPC) filter (e.g., 64th order) to preserve vocal tract detail but exclude other excitation-related artifacts (LPC Modeling block 710, FIG. 7). The excitation, driven by the voicing information (pitch F0 and voicing classification such as voiced/unvoiced (V/U) in the example in FIG. 7), may be modeled by the sum of a filtered pulse train (Pulse block 720, FIG. 7) driven by the pitch value in each frame and a filtered White Gaussian Noise source (WGN block 730, FIG. 7). As can be seen in the example embodiment in FIG. 7, the pitch F0 and the voicing classification may be input to the Pulse block 720, the WGN block 730, and the Perturbation Modeling block 760. Perturbation filters P(z) 750 and Q(z) 740 may be derived from the spectro-temporal energy profile of the envelope.
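
A hedged sketch of a mixed-excitation frame synthesizer along these lines, using the standard autocorrelation method to fit the LPC coefficients to the envelope; the fixed noise-mix constant stands in for the perturbation filters P(z) and Q(z), so this is an illustration rather than the patent's synthesizer:

```python
import numpy as np
from scipy.signal import lfilter
from scipy.linalg import solve_toeplitz

def lpc_from_envelope(env_mag, order=64):
    """Fit an all-pole (LPC) filter to one frame's magnitude envelope.
    env_mag: magnitude envelope over n_fft // 2 + 1 bins."""
    power = np.concatenate([env_mag, env_mag[-2:0:-1]]) ** 2   # full symmetric spectrum
    r = np.real(np.fft.ifft(power))[:order + 1]                # autocorrelation sequence
    r[0] *= 1.0 + 1e-9                                         # tiny regularization
    a = solve_toeplitz((r[:order], r[:order]), -r[1:order + 1])
    a = np.concatenate(([1.0], a))                             # A(z) coefficients
    gain = np.sqrt(max(r[0] + np.dot(a[1:], r[1:order + 1]), 1e-12))
    return a, gain

def synthesize_frame(a, gain, f0, voiced, frame_len, fs, noise_mix=0.1):
    """Mixed excitation: pulse train at f0 plus white Gaussian noise, shaped
    by the all-pole envelope filter. noise_mix is an illustrative constant."""
    excitation = np.random.randn(frame_len)
    if voiced and f0 > 0:
        pulses = np.zeros(frame_len)
        period = int(fs / f0)
        pulses[::period] = 1.0
        excitation = (1.0 - noise_mix) * pulses + noise_mix * excitation
    return lfilter([gain], a, excitation)
```

In a full system the per-frame outputs would be overlap-added, and the fixed noise-mix constant would be replaced by the frequency- and energy-dependent perturbation described below.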


In contrast to other known methods, the perturbation of the periodic pulse train can be controlled based only on the relative local and global energy of the spectral envelope, and not on an excitation analysis, according to various embodiments. The filter P(z) 750 may add spectral shaping to the noise component in the excitation, and the filter Q(z) 740 may be used to modify the phase of the pulse train to increase dispersion and naturalness.


To derive the perturbation filters P(z) 750 and Q(z) 740, the dynamic range within each frame may be computed, and a frequency-dependent weight may be applied based on the level of each spectral value relative to the minimum and maximum energy in the frame. Then, a global weight may be applied based on the level of the frame relative to the maximum and minimum global energies tracked over time. The rationale behind this approach is that during onsets and offsets (low relative global energy) the glottis area is reduced, giving rise to higher Reynolds numbers (increased probability of turbulence). During the steady state, local frequency perturbations can be observed at lower energies where turbulent energy dominates.
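
A minimal sketch of such an energy-based perturbation weight, assuming a per-frame log-magnitude envelope and externally tracked global energy extremes (the linear mapping is an assumption):

```python
import numpy as np

def perturbation_weights(env_db, frame_energy_db, global_min_db, global_max_db):
    """Frequency-dependent perturbation weight in [0, 1] per spectral bin.
    Bins (and frames) at low relative energy get more aperiodic excitation."""
    # Local weight: position of each bin between the frame's min and max level.
    local = (env_db - env_db.min()) / max(env_db.max() - env_db.min(), 1e-6)
    # Global weight: position of the frame between tracked global extremes.
    g = (frame_energy_db - global_min_db) / max(global_max_db - global_min_db, 1e-6)
    g = np.clip(g, 0.0, 1.0)
    # Low local or global energy -> larger perturbation (more noise-like).
    return np.clip(1.0 - local * g, 0.0, 1.0)
```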


It should be noted that the perturbation may be computed from the spectral envelope in voiced frames, but, in practice, for some embodiments, the perturbation is assigned a maximum value during unvoiced regions. An example of the synthesis parameters for a clean female speech sample is shown in FIG. 8A (also shown in more detail in FIG. 8B). The perturbation function is shown in the dB domain as an aperiodicity function.


An example of the performance of the system 300 is illustrated in FIG. 9, where a noisy speech input is processed by the system 300, thereby producing a synthetic noise-free output.



FIG. 10 is a flow chart of an example method 1000 for generating clean speech from a mixture of noise and speech. The method 1000 may be performed by processing logic that may include hardware (e.g., dedicated logic, programmable logic, and microcode), software (such as software run on a general-purpose computer system or a dedicated machine), or a combination of both. In one example embodiment, the processing logic resides at the audio processing system 140.


At operation 1010, the example method 1000 can include deriving, based on the mixture of noise and speech and a model of speech, speech parameters. The speech parameters may include the spectral envelope and voicing information. The voicing information may include pitch data and voice classification. At operation 1020, the method 1000 can proceed with synthesizing clean speech from the speech parameters.



FIG. 11 illustrates an exemplary computer system 1100 that may be used to implement some embodiments of the present invention. The computer system 1100 of FIG. 11 may be implemented in the contexts of the likes of computing systems, networks, servers, or combinations thereof. The computer system 1100 of FIG. 11 includes one or more processor units 1110 and main memory 1120. Main memory 1120 stores, in part, instructions and data for execution by processor units 1110. Main memory 1120 stores the executable code when in operation, in this example. The computer system 1100 of FIG. 11 further includes a mass data storage 1130, portable storage device 1140, output devices 1150, user input devices 1160, a graphics display system 1170, and peripheral devices 1180.


The components shown in FIG. 11 are depicted as being connected via a single bus 1190. The components may be connected through one or more data transport means. Processor unit 1110 and main memory 1120 are connected via a local microprocessor bus, and the mass data storage 1130, peripheral device(s) 1180, portable storage device 1140, and graphics display system 1170 are connected via one or more input/output (I/O) buses.


Mass data storage 1130, which can be implemented with a magnetic disk drive, solid state drive, or an optical disk drive, is a non-volatile storage device for storing data and instructions for use by processor unit 1110. Mass data storage 1130 stores the system software for implementing embodiments of the present disclosure for purposes of loading that software into main memory 1120.


Portable storage device 1140 operates in conjunction with a portable non-volatile storage medium, such as a flash drive, floppy disk, compact disk, digital video disc, or Universal Serial Bus (USB) storage device, to input and output data and code to and from the computer system 1100 of FIG. 11. The system software for implementing embodiments of the present disclosure is stored on such a portable medium and input to the computer system 1100 via the portable storage device 1140.


User input devices 1160 can provide a portion of a user interface. User input devices 1160 may include one or more microphones, an alphanumeric keypad, such as a keyboard, for inputting alphanumeric and other information, or a pointing device, such as a mouse, a trackball, stylus, or cursor direction keys. User input devices 1160 can also include a touchscreen. Additionally, the computer system 1100 as shown in FIG. 11 includes output devices 1150. Suitable output devices 1150 include speakers, printers, network interfaces, and monitors.


Graphics display system 1170 includes a liquid crystal display (LCD) or other suitable display device. Graphics display system 1170 is configurable to receive textual and graphical information and to process the information for output to the display device.


Peripheral devices 1180 may include any type of computer support device to add additional functionality to the computer system.


The components provided in the computer system 1100 of FIG. 11 are those typically found in computer systems that may be suitable for use with embodiments of the present disclosure and are intended to represent a broad category of such computer components that are well known in the art. Thus, the computer system 1100 of FIG. 11 can be a personal computer (PC), handheld computer system, telephone, mobile computer system, workstation, tablet, phablet, mobile phone, server, minicomputer, mainframe computer, wearable, internet-connected device, or any other computer system. The computer may also include different bus configurations, networked platforms, multi-processor platforms, and the like. Various operating systems may be used, including UNIX, LINUX, WINDOWS, MAC OS, PALM OS, QNX, ANDROID, IOS, CHROME, TIZEN, and other suitable operating systems.


The processing for various embodiments may be implemented in software that is cloud-based. In some embodiments, the computer system 1100 is implemented as a cloud-based computing environment, such as a virtual machine operating within a computing cloud. In other embodiments, the computer system 1100 may itself include a cloud-based computing environment, where the functionalities of the computer system 1100 are executed in a distributed fashion. Thus, the computer system 1100, when configured as a computing cloud, may include pluralities of computing devices in various forms, as will be described in greater detail below.


In general, a cloud-based computing environment is a resource that typically combines the computational power of a large grouping of processors (such as within web servers) and/or that combines the storage capacity of a large grouping of computer memories or storage devices. Systems that provide cloud-based resources may be utilized exclusively by their owners, or such systems may be accessible to outside users who deploy applications within the computing infrastructure to obtain the benefit of large computational or storage resources.


The cloud may be formed, for example, by a network of web servers that comprise a plurality of computing devices, such as the computer system 1100, with each server (or at least a plurality thereof) providing processor and/or storage resources. These servers may manage workloads provided by multiple users (e.g., cloud resource customers or other users). Typically, each user places workload demands upon the cloud that vary in real-time, sometimes dramatically. The nature and extent of these variations typically depends on the type of business associated with the user.


The present technology is described above with reference to example embodiments. Therefore, other variations upon the example embodiments are intended to be covered by the present disclosure.

Claims
  • 1. A method for generating clean speech from a mixture of noise and speech, the method comprising: deriving speech parameters, based on the mixture of noise and speech and a model of speech, the deriving using at least one hardware processor, wherein the deriving speech parameters comprises: performing one or more spectral analyses on the mixture of noise and speech to generate one or more spectral representations; deriving, based on the one or more spectral representations, feature data; grouping target speech features in the feature data according to the model of speech; separating the target speech features from the feature data; and generating, based at least partially on the target speech features, the speech parameters; and synthesizing, based at least partially on the speech parameters, clean speech.
  • 2. The method of claim 1, wherein candidates for the target speech features are evaluated by a multi-hypothesis tracking system aided by the model of speech.
  • 3. The method of claim 1, wherein the speech parameters include spectral envelope and voicing information, the voicing information including pitch data and voice classification data.
  • 4. The method of claim 3, further comprising, prior to grouping the feature data, determining, based on a noise model, non-speech components in the feature data.
  • 5. The method of claim 4, wherein the pitch data are determined based, at least partially, on the non-speech components.
  • 6. The method of claim 4, wherein the pitch data are determined based, at least on, knowledge about where noise components occlude speech components.
  • 7. The method of claim 5, further comprising, while generating the speech parameters: generating, based on the pitch data, a harmonic map, the harmonic map representing voiced speech; and estimating, based on the non-speech components and the harmonic map, an unvoiced speech map.
  • 8. The method of claim 7, further comprising extracting a sparse spectral envelope from the one or more spectral representations using a mask, the mask being generated based on a harmonic map and an unvoiced speech map.
  • 9. The method of claim 8, further comprising estimating the spectral envelope based on a sparse spectral envelope.
  • 10. The method of claim 3, wherein the pitch data are interpolated to fill missing frames before synthesizing clean speech.
  • 11. A system for generating clean speech from a mixture of noise and speech, the system comprising: one or more processors; and a memory communicatively coupled with the processor, the memory storing instructions which if executed by the one or more processors perform a method comprising: deriving speech parameters, based on the mixture of noise and speech and a model of speech, wherein the deriving speech parameters comprises: performing one or more spectral analyses on the mixture of noise and speech to generate one or more spectral representations; deriving, based on the one or more spectral representations, feature data; grouping target speech features in the feature data according to the model of speech; separating the target speech features from the feature data; and generating, based at least partially on the target speech features, the speech parameters; and synthesizing, based at least partially on the speech parameters, clean speech.
  • 12. The system of claim 11, wherein candidates for the target speech features are evaluated by a multi-hypothesis tracking system aided by the model of speech.
  • 13. The system of claim 11, wherein the speech parameters include a spectral envelope and voicing information, the voicing information including pitch data and voice classification data.
  • 14. The system of claim 13, further comprising, prior to grouping the feature data, determining, based on a noise model, non-speech components in the feature data.
  • 15. The system of claim 14, wherein the pitch data are determined based partially on the non-speech components.
  • 16. The system of claim 14, wherein the pitch data are determined based, at least on, knowledge about where noise components occlude speech components.
  • 17. The system of claim 15, further comprising, while generating the speech parameters: generating, based on the pitch data, a harmonic map, the harmonic map representing voiced speech; and estimating, based on the non-speech components and the harmonic map, an unvoiced speech map.
  • 18. The system of claim 15, further comprising extracting a sparse spectral envelope from the one or more spectral representations using a mask, the mask being generated based on a harmonic map and an unvoiced speech map.
  • 19. The system of claim 18, further comprising estimating the spectral envelope based on the sparse spectral envelope.
  • 20. A non-transitory computer-readable storage medium having embodied thereon a program, the program being executable by a processor to perform a method for generating clean speech from a mixture of noise and speech, the method comprising: deriving speech parameters, based on the mixture of noise and speech and a model of speech, via instructions stored in the memory and executed by the one or more processors, wherein the deriving speech parameters comprises: performing one or more spectral analyses on the mixture of noise and speech to generate one or more spectral representations; deriving, based on the one or more spectral representations, feature data; grouping target speech features in the feature data according to the model of speech; separating the target speech features from the feature data; and generating, based at least partially on the target speech features, the speech parameters; and synthesizing, based at least partially on the speech parameters, via instructions stored in the memory and executed by the one or more processors, clean speech.
CROSS-REFERENCE TO RELATED APPLICATIONS

The present application claims the benefit of U.S. Provisional Application No. 61/856,577, filed on Jul. 19, 2013 and entitled “System and Method for Speech Signal Separation and Synthesis Based on Auditory Scene Analysis and Speech Modeling”, and U.S. Provisional Application No. 61/972,112, filed Mar. 28, 2014 and entitled “Tracking Multiple Attributes of Simultaneous Objects”. The subject matter of the aforementioned applications is incorporated herein by reference for all purposes.

US Referenced Citations (544)
Number Name Date Kind
3976863 Engel Aug 1976 A
3978287 Fletcher et al. Aug 1976 A
4137510 Iwahara Jan 1979 A
4433604 Ott Feb 1984 A
4516259 Yato et al. May 1985 A
4535473 Sakata Aug 1985 A
4536844 Lyon Aug 1985 A
4581758 Coker et al. Apr 1986 A
4628529 Borth et al. Dec 1986 A
4630304 Borth et al. Dec 1986 A
4649505 Zinser, Jr. et al. Mar 1987 A
4658426 Chabries et al. Apr 1987 A
4674125 Carlson et al. Jun 1987 A
4718104 Anderson Jan 1988 A
4811404 Vilmur et al. Mar 1989 A
4812996 Stubbs Mar 1989 A
4864620 Bialick Sep 1989 A
4920508 Yassaie et al. Apr 1990 A
4969203 Herman Nov 1990 A
4991166 Julstrom Feb 1991 A
5027410 Williamson et al. Jun 1991 A
5054085 Meisel et al. Oct 1991 A
5058419 Nordstrom et al. Oct 1991 A
5099738 Hotz Mar 1992 A
5119711 Bell et al. Jun 1992 A
5142961 Paroutaud Sep 1992 A
5150413 Nakatani et al. Sep 1992 A
5175769 Hejna, Jr. et al. Dec 1992 A
5177482 Cideciyan et al. Jan 1993 A
5187776 Yanker Feb 1993 A
5204906 Nohara et al. Apr 1993 A
5208864 Kaneda May 1993 A
5210366 Sykes, Jr. May 1993 A
5216423 Mukherjee Jun 1993 A
5222251 Roney, IV et al. Jun 1993 A
5224170 Waite, Jr. Jun 1993 A
5230022 Sakata Jul 1993 A
5319736 Hunt Jun 1994 A
5323459 Hirano Jun 1994 A
5341432 Suzuki et al. Aug 1994 A
5381473 Andrea et al. Jan 1995 A
5381512 Holton et al. Jan 1995 A
5400409 Linhard Mar 1995 A
5402493 Goldstein Mar 1995 A
5402496 Soli et al. Mar 1995 A
5406635 Jarvinen Apr 1995 A
5416847 Boze May 1995 A
5440751 Santeler et al. Aug 1995 A
5471195 Rickman Nov 1995 A
5473759 Slaney et al. Dec 1995 A
5479564 Vogten et al. Dec 1995 A
5502663 Lyon Mar 1996 A
5544250 Urbanski Aug 1996 A
5544346 Amini et al. Aug 1996 A
5550924 Helf et al. Aug 1996 A
5555306 Gerzon Sep 1996 A
5574824 Slyh et al. Nov 1996 A
5583784 Kapust et al. Dec 1996 A
5590241 Park et al. Dec 1996 A
5598505 Austin et al. Jan 1997 A
5602962 Kellermann Feb 1997 A
5633631 Teckman May 1997 A
5675778 Jones Oct 1997 A
5682463 Allen et al. Oct 1997 A
5694474 Ngo et al. Dec 1997 A
5706395 Arslan et al. Jan 1998 A
5717829 Takagi Feb 1998 A
5729612 Abel et al. Mar 1998 A
5732189 Johnston et al. Mar 1998 A
5749064 Pawate et al. May 1998 A
5757937 Itoh et al. May 1998 A
5777658 Kerr et al. Jul 1998 A
5792971 Timis et al. Aug 1998 A
5796819 Romesburg Aug 1998 A
5796850 Shiono et al. Aug 1998 A
5806025 Vis et al. Sep 1998 A
5809463 Gupta et al. Sep 1998 A
5839101 Vahatalo et al. Nov 1998 A
5845243 Smart et al. Dec 1998 A
5887032 Cioffi Mar 1999 A
5920840 Satyamurti et al. Jul 1999 A
5933495 Oh Aug 1999 A
5937070 Todter et al. Aug 1999 A
5943429 Handel Aug 1999 A
5956674 Smyth et al. Sep 1999 A
5974379 Hatanaka et al. Oct 1999 A
5974380 Smyth et al. Oct 1999 A
5978567 Rebane et al. Nov 1999 A
5978824 Ikeda Nov 1999 A
5983139 Zierhofer Nov 1999 A
5990405 Auten et al. Nov 1999 A
6002776 Bhadkamkar et al. Dec 1999 A
6061456 Andrea et al. May 2000 A
6072881 Linder Jun 2000 A
6092126 Rossum Jul 2000 A
6097820 Turner Aug 2000 A
6098038 Hermansky et al. Aug 2000 A
6104993 Ashley Aug 2000 A
6108626 Cellario et al. Aug 2000 A
6122384 Mauro Sep 2000 A
6122610 Isabelle Sep 2000 A
6125175 Goldberg et al. Sep 2000 A
6134524 Peters et al. Oct 2000 A
6137349 Menkhoff et al. Oct 2000 A
6140809 Doi Oct 2000 A
6173255 Wilson et al. Jan 2001 B1
6188769 Jot et al. Feb 2001 B1
6188797 Moledina et al. Feb 2001 B1
6202047 Ephraim et al. Mar 2001 B1
6205421 Morii Mar 2001 B1
6205422 Gu et al. Mar 2001 B1
6208671 Paulos et al. Mar 2001 B1
6216103 Wu et al. Apr 2001 B1
6222927 Feng et al. Apr 2001 B1
6223090 Brungart Apr 2001 B1
6226616 You et al. May 2001 B1
6240386 Thyssen et al. May 2001 B1
6263307 Arslan et al. Jul 2001 B1
6266633 Higgins et al. Jul 2001 B1
6317501 Matsuo Nov 2001 B1
6321193 Nystrom et al. Nov 2001 B1
6324235 Savell et al. Nov 2001 B1
6339706 Tillgren et al. Jan 2002 B1
6339758 Kanazawa et al. Jan 2002 B1
6355869 Mitton Mar 2002 B1
6363345 Marash et al. Mar 2002 B1
6377637 Berdugo Apr 2002 B1
6381570 Li et al. Apr 2002 B2
6421388 Parizhsky et al. Jul 2002 B1
6424938 Johansson et al. Jul 2002 B1
6430295 Handel et al. Aug 2002 B1
6434417 Lovett Aug 2002 B1
6449586 Hoshuyama Sep 2002 B1
6453289 Ertem et al. Sep 2002 B1
6456209 Savari Sep 2002 B1
6469732 Chang et al. Oct 2002 B1
6477489 Lockwood Nov 2002 B1
6487257 Gustafsson et al. Nov 2002 B1
6490556 Graumann et al. Dec 2002 B1
6496795 Malvar Dec 2002 B1
6513004 Rigazio et al. Jan 2003 B1
6516066 Hayashi Feb 2003 B2
6516136 Lee Feb 2003 B1
6526140 Marchok et al. Feb 2003 B1
6529606 Jackson, Jr. II et al. Mar 2003 B1
6531970 McLaughlin et al. Mar 2003 B2
6549630 Bobisuthi Apr 2003 B1
6584203 Elko et al. Jun 2003 B2
6584438 Manjunath et al. Jun 2003 B1
6647067 Hjelm et al. Nov 2003 B1
6683938 Henderson Jan 2004 B1
6717991 Gustafsson et al. Apr 2004 B1
6718309 Selly Apr 2004 B1
6738482 Jaber May 2004 B1
6745155 Andringa et al. Jun 2004 B1
6760450 Matsuo Jul 2004 B2
6772117 Laurila et al. Aug 2004 B1
6785381 Gartner et al. Aug 2004 B2
6792118 Watts Sep 2004 B2
6795558 Matsuo Sep 2004 B2
6798886 Smith et al. Sep 2004 B1
6804203 Benyassine et al. Oct 2004 B1
6804651 Juric et al. Oct 2004 B2
6810273 Mattila et al. Oct 2004 B1
6859508 Koyama et al. Feb 2005 B1
6862567 Gao Mar 2005 B1
6882736 Dickel et al. Apr 2005 B2
6907045 Robinson et al. Jun 2005 B1
6915257 Heikkinen et al. Jul 2005 B2
6915264 Baumgarte Jul 2005 B2
6917688 Yu et al. Jul 2005 B2
6934387 Kim Aug 2005 B1
6978159 Feng et al. Dec 2005 B2
6982377 Sakurai et al. Jan 2006 B2
6990196 Zeng et al. Jan 2006 B2
7016507 Brennan Mar 2006 B1
7020605 Gao Mar 2006 B2
7031478 Belt et al. Apr 2006 B2
7042934 Zamir May 2006 B2
7050388 Kim et al. May 2006 B2
7054452 Ukita May 2006 B2
7054809 Gao May 2006 B1
7058574 Taniguchi et al. Jun 2006 B2
7065485 Chong-White et al. Jun 2006 B1
7076315 Watts Jul 2006 B1
7092529 Yu et al. Aug 2006 B2
7092882 Arrowood et al. Aug 2006 B2
7099821 Visser et al. Aug 2006 B2
7127072 Rademacher et al. Oct 2006 B2
7142677 Gonopolskiy et al. Nov 2006 B2
7146013 Saito et al. Dec 2006 B1
7146316 Alves Dec 2006 B2
7155019 Hou Dec 2006 B2
7165026 Acero et al. Jan 2007 B2
7171008 Elko Jan 2007 B2
7171246 Mattila et al. Jan 2007 B2
7174022 Zhang et al. Feb 2007 B1
7190665 Warke et al. Mar 2007 B2
7206418 Yang et al. Apr 2007 B2
7209567 Kozel et al. Apr 2007 B1
7225001 Eriksson et al. May 2007 B1
7242762 He et al. Jul 2007 B2
7246058 Burnett Jul 2007 B2
7254242 Ise et al. Aug 2007 B2
7283956 Ashley et al. Oct 2007 B2
7289554 Alloin Oct 2007 B2
7289955 Deng et al. Oct 2007 B2
7327985 Morfitt, III et al. Feb 2008 B2
7330138 Mallinson et al. Feb 2008 B2
7339503 Elenes Mar 2008 B1
7359520 Brennan et al. Apr 2008 B2
7366658 Moogi et al. Apr 2008 B2
7376558 Gemello et al. May 2008 B2
7383179 Alves et al. Jun 2008 B2
7395298 Debes et al. Jul 2008 B2
7412379 Taori et al. Aug 2008 B2
7433907 Nagai et al. Oct 2008 B2
7436333 Forman et al. Oct 2008 B2
7472059 Huang Dec 2008 B2
7548791 Johnston Jun 2009 B1
7555434 Nomura et al. Jun 2009 B2
7561627 Chow et al. Jul 2009 B2
7577084 Tang et al. Aug 2009 B2
7590250 Ellis et al. Sep 2009 B2
7617099 Yang et al. Nov 2009 B2
7657038 Doclo et al. Feb 2010 B2
7657427 Jelinek Feb 2010 B2
7725314 Wu et al. May 2010 B2
7764752 Langberg et al. Jul 2010 B2
7777658 Nguyen et al. Aug 2010 B2
7783032 Abutalebi et al. Aug 2010 B2
7783481 Endo et al. Aug 2010 B2
7895036 Hetherington et al. Feb 2011 B2
7899565 Johnston Mar 2011 B1
7912567 Chhatwal et al. Mar 2011 B2
7949522 Hetherington et al. May 2011 B2
7953596 Pinto May 2011 B2
8010355 Rahbar Aug 2011 B2
8032364 Watts Oct 2011 B1
8032369 Manjunath et al. Oct 2011 B2
8036767 Soulodre Oct 2011 B2
8046219 Zurek et al. Oct 2011 B2
8060363 Ramo et al. Nov 2011 B2
8081878 Zhang et al. Dec 2011 B1
8098812 Fadili et al. Jan 2012 B2
8098844 Elko Jan 2012 B2
8103011 Mohammad et al. Jan 2012 B2
8126159 Goose et al. Feb 2012 B2
8143620 Malinowski et al. Mar 2012 B1
8150065 Solbach et al. Apr 2012 B2
8180064 Avendano et al. May 2012 B1
8184818 Ishiguro May 2012 B2
8194880 Avendano Jun 2012 B2
8194882 Every et al. Jun 2012 B2
8195454 Muesch Jun 2012 B2
8204252 Avendano Jun 2012 B1
8204253 Solbach Jun 2012 B1
8233352 Beaucoup Jul 2012 B2
8280731 Yu Oct 2012 B2
8311817 Murgia et al. Nov 2012 B2
8345890 Avendano et al. Jan 2013 B2
8378871 Bapat Feb 2013 B1
8473287 Every et al. Jun 2013 B2
8488805 Santos et al. Jul 2013 B1
8494193 Zhang et al. Jul 2013 B2
8521530 Every et al. Aug 2013 B1
8615394 Avendano et al. Dec 2013 B1
8737188 Murgia et al. May 2014 B1
8737532 Green et al. May 2014 B2
8744844 Klein Jun 2014 B2
8774423 Solbach Jul 2014 B1
8804865 Elenes et al. Aug 2014 B2
8831937 Murgia et al. Sep 2014 B2
8867759 Avendano et al. Oct 2014 B2
8880396 Laroche et al. Nov 2014 B1
8886525 Klein Nov 2014 B2
8908882 Goodwin et al. Dec 2014 B2
8934641 Avendano et al. Jan 2015 B2
8949120 Every et al. Feb 2015 B1
8965942 Rossum et al. Feb 2015 B1
8989401 Ojanpera Mar 2015 B2
9049282 Murgia et al. Jun 2015 B1
9076456 Avendano et al. Jul 2015 B1
9094496 Teutsch Jul 2015 B2
9185487 Solbach et al. Nov 2015 B2
9197974 Clark et al. Nov 2015 B1
9210503 Avendano et al. Dec 2015 B2
9236874 Rossum Jan 2016 B1
9247192 Lee et al. Jan 2016 B2
20010016020 Gustafsson et al. Aug 2001 A1
20010031053 Feng et al. Oct 2001 A1
20010041976 Taniguchi et al. Nov 2001 A1
20010053228 Jones Dec 2001 A1
20020002455 Accardi et al. Jan 2002 A1
20020009203 Erten Jan 2002 A1
20020041693 Matsuo Apr 2002 A1
20020080980 Matsuo Jun 2002 A1
20020097884 Cairns Jul 2002 A1
20020106092 Matsuo Aug 2002 A1
20020116187 Erten Aug 2002 A1
20020133334 Coorman et al. Sep 2002 A1
20020147595 Baumgarte Oct 2002 A1
20020156624 Gigi Oct 2002 A1
20020176589 Buck et al. Nov 2002 A1
20030014248 Vetter Jan 2003 A1
20030023430 Wang et al. Jan 2003 A1
20030026437 Janse et al. Feb 2003 A1
20030033140 Taori et al. Feb 2003 A1
20030038736 Becker et al. Feb 2003 A1
20030039369 Bullen Feb 2003 A1
20030040908 Yang et al. Feb 2003 A1
20030061032 Gonopolskiy Mar 2003 A1
20030063759 Brennan et al. Apr 2003 A1
20030072382 Raleigh et al. Apr 2003 A1
20030072460 Gonopolskiy et al. Apr 2003 A1
20030095667 Watts May 2003 A1
20030099345 Gartner et al. May 2003 A1
20030101048 Liu May 2003 A1
20030103632 Goubran et al. Jun 2003 A1
20030128851 Furuta Jul 2003 A1
20030138116 Jones et al. Jul 2003 A1
20030147538 Elko Aug 2003 A1
20030169891 Ryan et al. Sep 2003 A1
20030191641 Acero et al. Oct 2003 A1
20030228019 Eichler et al. Dec 2003 A1
20030228023 Burnett et al. Dec 2003 A1
20040001450 He et al. Jan 2004 A1
20040013276 Ellis et al. Jan 2004 A1
20040015348 McArthur et al. Jan 2004 A1
20040042616 Matsuo Mar 2004 A1
20040047464 Yu et al. Mar 2004 A1
20040066940 Amir Apr 2004 A1
20040078199 Kremer et al. Apr 2004 A1
20040083110 Wang Apr 2004 A1
20040125965 Alberth, Jr. et al. Jul 2004 A1
20040131178 Shahaf et al. Jul 2004 A1
20040133421 Burnett et al. Jul 2004 A1
20040165736 Hetherington et al. Aug 2004 A1
20040185804 Kanamori et al. Sep 2004 A1
20040196989 Friedman et al. Oct 2004 A1
20040263636 Cutler et al. Dec 2004 A1
20050008169 Muren et al. Jan 2005 A1
20050008179 Quinn Jan 2005 A1
20050025263 Wu Feb 2005 A1
20050027520 Mattila et al. Feb 2005 A1
20050043959 Stemerdink et al. Feb 2005 A1
20050049864 Kaltenmeier et al. Mar 2005 A1
20050060142 Visser et al. Mar 2005 A1
20050066279 LeBarton et al. Mar 2005 A1
20050080616 Leung et al. Apr 2005 A1
20050096904 Taniguchi et al. May 2005 A1
20050114128 Hetherington et al. May 2005 A1
20050143989 Jelinek Jun 2005 A1
20050152559 Gierl et al. Jul 2005 A1
20050152563 Amada et al. Jul 2005 A1
20050185813 Sinclair et al. Aug 2005 A1
20050203735 Ichikawa Sep 2005 A1
20050213778 Buck et al. Sep 2005 A1
20050216259 Watts Sep 2005 A1
20050228518 Watts Oct 2005 A1
20050249292 Zhu Nov 2005 A1
20050261894 Balan et al. Nov 2005 A1
20050261896 Schuijers et al. Nov 2005 A1
20050276363 Joublin et al. Dec 2005 A1
20050276423 Aubauer et al. Dec 2005 A1
20050281410 Grosvenor et al. Dec 2005 A1
20050283544 Yee Dec 2005 A1
20050288923 Kok Dec 2005 A1
20060072768 Schwartz et al. Apr 2006 A1
20060074646 Alves et al. Apr 2006 A1
20060098809 Nongpiur et al. May 2006 A1
20060100868 Hetherington et al. May 2006 A1
20060120537 Burnett et al. Jun 2006 A1
20060133621 Chen et al. Jun 2006 A1
20060136203 Ichikawa Jun 2006 A1
20060149535 Choi et al. Jul 2006 A1
20060153391 Hooley et al. Jul 2006 A1
20060160581 Beaugeant et al. Jul 2006 A1
20060184363 McCree et al. Aug 2006 A1
20060198542 Benjelloun Touimi et al. Sep 2006 A1
20060222184 Buck et al. Oct 2006 A1
20060242071 Stebbings Oct 2006 A1
20060270468 Hui et al. Nov 2006 A1
20060293882 Giesbrecht et al. Dec 2006 A1
20070021958 Visser et al. Jan 2007 A1
20070025562 Zalewski et al. Feb 2007 A1
20070027685 Arakawa et al. Feb 2007 A1
20070033020 (Kelleher) Francois et al. Feb 2007 A1
20070033494 Wenger et al. Feb 2007 A1
20070038440 Sung et al. Feb 2007 A1
20070058822 Ozawa Mar 2007 A1
20070067166 Pan et al. Mar 2007 A1
20070071206 Gainsboro et al. Mar 2007 A1
20070078649 Hetherington et al. Apr 2007 A1
20070088544 Acero et al. Apr 2007 A1
20070094031 Chen Apr 2007 A1
20070100612 Ekstrand et al. May 2007 A1
20070110263 Brox May 2007 A1
20070116300 Chen May 2007 A1
20070136056 Moogi et al. Jun 2007 A1
20070136059 Gadbois Jun 2007 A1
20070150268 Acero et al. Jun 2007 A1
20070154031 Avendano et al. Jul 2007 A1
20070165879 Deng et al. Jul 2007 A1
20070195968 Jaber Aug 2007 A1
20070198254 Goto et al. Aug 2007 A1
20070230712 Belt et al. Oct 2007 A1
20070230913 Ichimura Oct 2007 A1
20070237271 Pessoa et al. Oct 2007 A1
20070244695 Manjunath et al. Oct 2007 A1
20070253574 Soulodre Nov 2007 A1
20070276656 Solbach et al. Nov 2007 A1
20070282604 Gartner et al. Dec 2007 A1
20070287490 Green et al. Dec 2007 A1
20070294263 Punj et al. Dec 2007 A1
20080019548 Avendano Jan 2008 A1
20080033723 Jang et al. Feb 2008 A1
20080059163 Ding et al. Mar 2008 A1
20080069366 Soulodre Mar 2008 A1
20080071540 Nakano et al. Mar 2008 A1
20080111734 Fam et al. May 2008 A1
20080117901 Klammer May 2008 A1
20080118082 Seltzer et al. May 2008 A1
20080140391 Yen et al. Jun 2008 A1
20080140396 Grosse-Schulte et al. Jun 2008 A1
20080152157 Lin et al. Jun 2008 A1
20080170703 Zivney Jul 2008 A1
20080192956 Kazama Aug 2008 A1
20080195384 Jabri et al. Aug 2008 A1
20080201138 Visser et al. Aug 2008 A1
20080208575 Laaksonen et al. Aug 2008 A1
20080212795 Goodwin et al. Sep 2008 A1
20080228478 Hetherington et al. Sep 2008 A1
20080247567 Kjolerbakken et al. Oct 2008 A1
20080260175 Elko Oct 2008 A1
20080273476 Cohen et al. Nov 2008 A1
20080310646 Amada Dec 2008 A1
20080317261 Yoshida et al. Dec 2008 A1
20090012783 Klein Jan 2009 A1
20090012784 Murgia et al. Jan 2009 A1
20090012786 Zhang et al. Jan 2009 A1
20090018828 Nakadai et al. Jan 2009 A1
20090048824 Amada Feb 2009 A1
20090060222 Jeong et al. Mar 2009 A1
20090063142 Sukkar Mar 2009 A1
20090070118 Den Brinker et al. Mar 2009 A1
20090086986 Schmidt et al. Apr 2009 A1
20090106021 Zurek et al. Apr 2009 A1
20090112579 Li et al. Apr 2009 A1
20090116652 Kirkeby et al. May 2009 A1
20090119096 Gerl et al. May 2009 A1
20090119099 Lee et al. May 2009 A1
20090129610 Kim et al. May 2009 A1
20090144053 Tamura Jun 2009 A1
20090144058 Sorin Jun 2009 A1
20090154717 Hoshuyama Jun 2009 A1
20090177464 Gao et al. Jul 2009 A1
20090192790 El-Maleh et al. Jul 2009 A1
20090204413 Sintes et al. Aug 2009 A1
20090216526 Schmidt et al. Aug 2009 A1
20090220107 Every et al. Sep 2009 A1
20090226005 Acero et al. Sep 2009 A1
20090226010 Schnell et al. Sep 2009 A1
20090228272 Herbig Sep 2009 A1
20090245335 Fang Oct 2009 A1
20090245444 Fang Oct 2009 A1
20090253418 Makinen Oct 2009 A1
20090257609 Gerkmann et al. Oct 2009 A1
20090262969 Short et al. Oct 2009 A1
20090271187 Yen et al. Oct 2009 A1
20090287481 Paranjpe et al. Nov 2009 A1
20090292536 Hetherington et al. Nov 2009 A1
20090303350 Terada Dec 2009 A1
20090323982 Solbach et al. Dec 2009 A1
20100004929 Baik Jan 2010 A1
20100027799 Romesburg et al. Feb 2010 A1
20100033427 Marks et al. Feb 2010 A1
20100094643 Avendano et al. Apr 2010 A1
20100138220 Matsumoto et al. Jun 2010 A1
20100166199 Seydoux Jul 2010 A1
20100177916 Gerkmann et al. Jul 2010 A1
20100211385 Sehlstedt Aug 2010 A1
20100228545 Ito et al. Sep 2010 A1
20100245624 Beaucoup Sep 2010 A1
20100278352 Petit et al. Nov 2010 A1
20100280824 Petit et al. Nov 2010 A1
20100290615 Takahashi Nov 2010 A1
20100296668 Lee et al. Nov 2010 A1
20100309774 Astrom Dec 2010 A1
20110019833 Kuech et al. Jan 2011 A1
20110035213 Malenovsky et al. Feb 2011 A1
20110038486 Beaucoup Feb 2011 A1
20110038557 Closset et al. Feb 2011 A1
20110044324 Li et al. Feb 2011 A1
20110075857 Aoyagi Mar 2011 A1
20110081024 Soulodre Apr 2011 A1
20110107367 Georgis et al. May 2011 A1
20110123019 Gowreesunker et al. May 2011 A1
20110129095 Avendano et al. Jun 2011 A1
20110137646 Ahgren et al. Jun 2011 A1
20110142257 Goodwin et al. Jun 2011 A1
20110178800 Watts Jul 2011 A1
20110184732 Godavarti Jul 2011 A1
20110184734 Wang et al. Jul 2011 A1
20110191101 Uhle et al. Aug 2011 A1
20110208520 Lee Aug 2011 A1
20110257965 Hardwick Oct 2011 A1
20110257967 Every et al. Oct 2011 A1
20110261150 Goyal et al. Oct 2011 A1
20110264449 Sehlstedt Oct 2011 A1
20120063609 Triki et al. Mar 2012 A1
20120087514 Williams et al. Apr 2012 A1
20120116758 Murgia et al. May 2012 A1
20120121096 Chen et al. May 2012 A1
20120123775 Murgia et al. May 2012 A1
20120140917 Nicholson et al. Jun 2012 A1
20120179462 Klein Jul 2012 A1
20120197898 Pandey et al. Aug 2012 A1
20120209611 Furuta et al. Aug 2012 A1
20120220347 Davidson Aug 2012 A1
20120237037 Ninan et al. Sep 2012 A1
20120250871 Lu et al. Oct 2012 A1
20120257778 Hall et al. Oct 2012 A1
20130011111 Abraham et al. Jan 2013 A1
20130024190 Fairey Jan 2013 A1
20130096914 Avendano et al. Apr 2013 A1
20130289988 Fry Oct 2013 A1
20130289996 Fry Oct 2013 A1
20130322461 Poulsen Dec 2013 A1
20130343549 Vemireddy et al. Dec 2013 A1
20140003622 Ikizyan et al. Jan 2014 A1
20140098964 Rosca et al. Apr 2014 A1
20140241702 Solbach et al. Aug 2014 A1
20140350926 Schuster et al. Nov 2014 A1
20150078555 Zhang et al. Mar 2015 A1
20150078606 Zhang et al. Mar 2015 A1
20150208165 Volk et al. Jul 2015 A1
20160027451 Solbach et al. Jan 2016 A1
20160037245 Harrington Feb 2016 A1
20160061934 Woodruff et al. Mar 2016 A1
20160078880 Avendano et al. Mar 2016 A1
20160093307 Warren et al. Mar 2016 A1
20160094910 Vallabhan et al. Mar 2016 A1
20160162469 Santos Jun 2016 A1
Foreign Referenced Citations (119)
Number Date Country
105474311 Apr 2016 CN
112014003337 Mar 2016 DE
0756437 Jan 1997 EP
1081685 Mar 2001 EP
1232496 Aug 2002 EP
1474755 Nov 2004 EP
20080428 Jul 2008 FI
20080623 Nov 2008 FI
20100431 Dec 2010 FI
20110428 Dec 2011 FI
20125600 Jun 2012 FI
123080 Oct 2012 FI
124716 Dec 2014 FI
62110349 May 1987 JP
4184400 Jul 1992 JP
5053587 Mar 1993 JP
H05172865 Jul 1993 JP
H05300419 Nov 1993 JP
6269083 Sep 1994 JP
H07248793 Sep 1995 JP
H07336793 Dec 1995 JP
H10313497 Nov 1998 JP
H11249693 Sep 1999 JP
2001159899 Jun 2001 JP
2002366200 Dec 2002 JP
2002542689 Dec 2002 JP
2003514473 Apr 2003 JP
2003271191 Sep 2003 JP
2004053895 Feb 2004 JP
2004187283 Jul 2004 JP
2004531767 Oct 2004 JP
2004533155 Oct 2004 JP
2005110127 Apr 2005 JP
2005148274 Jun 2005 JP
2005518118 Jun 2005 JP
2005195955 Jul 2005 JP
2005309096 Nov 2005 JP
2006094522 Apr 2006 JP
2006515490 May 2006 JP
2006337415 Dec 2006 JP
2007006525 Jan 2007 JP
2007201818 Aug 2007 JP
2008015443 Jan 2008 JP
2008518257 May 2008 JP
2008135933 Jun 2008 JP
2008542798 Nov 2008 JP
2009037042 Feb 2009 JP
2009522942 Jun 2009 JP
2009538450 Nov 2009 JP
2010532879 Oct 2010 JP
2011527025 Oct 2011 JP
5007442 Jun 2012 JP
2012514233 Jun 2012 JP
5081903 Sep 2012 JP
2013513306 Apr 2013 JP
2013527479 Jun 2013 JP
5718251 Mar 2015 JP
5762956 Jun 2015 JP
5855571 Dec 2015 JP
1020060024498 Mar 2006 KR
1020070068270 Jun 2007 KR
1020080092404 Oct 2008 KR
101050379 Dec 2008 KR
1020080109048 Dec 2008 KR
1020090013221 Feb 2009 KR
1020100041741 Apr 2010 KR
1020110038024 Apr 2011 KR
1020110111409 Oct 2011 KR
1020120094892 Aug 2012 KR
1020120101457 Sep 2012 KR
101210313 Dec 2012 KR
101294634 Aug 2013 KR
101461141 Nov 2014 KR
101610662 Apr 2016 KR
519615 Feb 2003 TW
526468 Apr 2003 TW
200305854 Nov 2003 TW
200629240 Aug 2006 TW
I279776 Apr 2007 TW
200847133 Dec 2008 TW
200910793 Mar 2009 TW
201009817 Mar 2010 TW
201113873 Apr 2011 TW
201143475 Dec 2011 TW
I421858 Jan 2014 TW
I463817 Dec 2014 TW
I465121 Dec 2014 TW
201513099 Apr 2015 TW
I488179 Jun 2015 TW
WO0137265 May 2001 WO
WO0141504 Jun 2001 WO
WO0156328 Aug 2001 WO
WO0174118 Oct 2001 WO
WO0207061 Jan 2002 WO
WO02080362 Oct 2002 WO
WO02103676 Dec 2002 WO
WO03043374 May 2003 WO
WO03069499 Aug 2003 WO
WO2004010415 Jan 2004 WO
WO2005086138 Sep 2005 WO
WO2006027707 Mar 2006 WO
WO2007001068 Jan 2007 WO
WO2007049644 May 2007 WO
WO2007081916 Jul 2007 WO
WO2007140003 Dec 2007 WO
WO2008034221 Mar 2008 WO
WO2008045476 Apr 2008 WO
WO2009008998 Jan 2009 WO
WO2010005493 Jan 2010 WO
WO2010077361 Jul 2010 WO
WO2011002489 Jan 2011 WO
WO2011068901 Jun 2011 WO
WO2011091068 Jul 2011 WO
WO2012094422 Jul 2012 WO
WO2012097016 Jul 2012 WO
WO2014131054 Aug 2014 WO
WO2015010129 Jan 2015 WO
WO2016040885 Mar 2016 WO
WO2016049566 Mar 2016 WO
Non-Patent Literature Citations (301)
Entry
Non-Final Office Action, Dec. 6, 2011, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Final Office Action, Apr. 16, 2012, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Advisory Action, Jun. 28, 2012, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Non-Final Office Action, Jan. 3, 2014, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Notice of Allowance, Aug. 25, 2014, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Non-Final Office Action, Dec. 10, 2012, U.S. Appl. No. 12/493,927, filed Jun. 29, 2009.
Final Office Action, May 14, 2013, U.S. Appl. No. 12/493,927, filed Jun. 29, 2009.
Non-Final Office Action, Jan. 9, 2014, U.S. Appl. No. 12/493,927, filed Jun. 29, 2009.
Notice of Allowance, Aug. 20, 2014, U.S. Appl. No. 12/493,927, filed Jun. 29, 2009.
Non-Final Office Action, Aug. 28, 2012, U.S. Appl. No. 12/860,515, filed Aug. 20, 2010.
Final Office Action, Mar. 11, 2013, U.S. Appl. No. 12/860,515, filed Aug. 20, 2010.
Non-Final Office Action, Aug. 28, 2013, U.S. Appl. No. 12/860,515, filed Aug. 20, 2010.
Notice of Allowance, Jun. 18, 2014, U.S. Appl. No. 12/860,515, filed Aug. 20, 2010.
Non-Final Office Action, Oct. 11, 2012, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Final Office Action, May 22, 2013, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Non-Final Office Action, Jan. 30, 2014, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Non-Final Office Action, Nov. 19, 2014, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Notice of Allowance, Jul. 30, 2015, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Non-Final Office Action, Oct. 2, 2012, U.S. Appl. No. 12/906,009, filed Oct. 15, 2010.
Non-Final Office Action, Jul. 2, 2013, U.S. Appl. No. 12/906,009, filed Oct. 15, 2010.
Final Office Action, May 7, 2014, U.S. Appl. No. 12/906,009, filed Oct. 15, 2010.
Non-Final Office Action, Apr. 21, 2015, U.S. Appl. No. 12/906,009, filed Oct. 15, 2010.
Non-Final Office Action, Jul. 31, 2013, U.S. Appl. No. 13/009,732, filed Jan. 19, 2011.
Final Office Action, Dec. 16, 2014, U.S. Appl. No. 13/009,732, filed Jan. 19, 2011.
Non-Final Office Action, Apr. 24, 2013, U.S. Appl. No. 13/012,517, filed Jan. 24, 2011.
Final Office Action, Dec. 3, 2013, U.S. Appl. No. 13/012,517, filed Jan. 24, 2011.
Non-Final Office Action, Nov. 19, 2014, U.S. Appl. No. 13/012,517, filed Jan. 24, 2011.
Final Office Action, Jun. 17, 2015, U.S. Appl. No. 13/012,517, filed Jan. 24, 2011.
Non-Final Office Action, Feb. 21, 2012, U.S. Appl. No. 13/288,858, filed Nov. 3, 2011.
Notice of Allowance, Sep. 10, 2012, U.S. Appl. No. 13/288,858, filed Nov. 3, 2011.
Non-Final Office Action, Feb. 14, 2012, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Final Office Action, Jul. 9, 2012, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Final Office Action, Jul. 17, 2012, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Advisory Action, Sep. 24, 2012, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Notice of Allowance, May 9, 2014, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Non-Final Office Action, May 10, 2013, U.S. Appl. No. 13/751,907, filed Jan. 28, 2013.
Notice of Allowance, Sep. 17, 2013, U.S. Appl. No. 13/751,907, filed Jan. 28, 2013.
Non-Final Office Action, Dec. 28, 2015, U.S. Appl. No. 14/081,723, filed Nov. 15, 2013.
International Search Report dated Jun. 8, 2001 in Patent Cooperation Treaty Application No. PCT/US2001/008372.
International Search Report dated Apr. 3, 2003 in Patent Cooperation Treaty Application No. PCT/US2002/036946.
International Search Report dated May 29, 2003 in Patent Cooperation Treaty Application No. PCT/US2003/004124.
International Search Report and Written Opinion dated Oct. 19, 2007 in Patent Cooperation Treaty Application No. PCT/US2007/000463.
International Search Report and Written Opinion dated Apr. 9, 2008 in Patent Cooperation Treaty Application No. PCT/US2007/021654.
International Search Report and Written Opinion dated Sep. 16, 2008 in Patent Cooperation Treaty Application No. PCT/US2007/012628.
International Search Report and Written Opinion dated Oct. 1, 2008 in Patent Cooperation Treaty Application No. PCT/US2008/008249.
International Search Report and Written Opinion dated Aug. 27, 2009 in Patent Cooperation Treaty Application No. PCT/US2009/003813.
Dahl, Mattias et al., “Acoustic Echo and Noise Cancelling Using Microphone Arrays”, International Symposium on Signal Processing and its Applications, ISSPA, Gold Coast, Australia, Aug. 25-30, 1996, pp. 379-382.
Demol, M. et al., “Efficient Non-Uniform Time-Scaling of Speech With WSOLA for CALL Applications”, Proceedings of InSTIL/ICALL2004—NLP and Speech Technologies in Advanced Language Learning Systems—Venice Jun. 17-19, 2004.
Laroche, Jean. “Time and Pitch Scale Modification of Audio Signals”, in “Applications of Digital Signal Processing to Audio and Acoustics”, The Kluwer International Series in Engineering and Computer Science, vol. 437, pp. 279-309, 2002.
Moulines, Eric et al., “Non-Parametric Techniques for Pitch-Scale and Time-Scale Modification of Speech”, Speech Communication, vol. 16, pp. 175-205, 1995.
Verhelst, Werner, “Overlap-Add Methods for Time-Scaling of Speech”, Speech Communication vol. 30, pp. 207-221, 2000.
Bach et al., “Learning Spectral Clustering, with Application to Speech Separation”, Journal of Machine Learning Research, 2006.
Mokbel et al., 1995, IEEE Transactions on Speech and Audio Processing, vol. 3, No. 5, Sep. 1995, pp. 346-356.
Office Action mailed Oct. 14, 2013 in Taiwanese Patent Application 097125481, filed Jul. 4, 2008.
Office Action mailed Oct. 29, 2013 in Japanese Patent Application 2011-516313, filed Jun. 26, 2009.
Office Action mailed Dec. 20, 2013 in Taiwanese Patent Application 096146144, filed Dec. 4, 2007.
Office Action mailed Dec. 9, 2013 in Finnish Patent Application 20100431, filed Jun. 26, 2009.
Office Action mailed Jan. 20, 2014 in Finnish Patent Application 20100001, filed Jul. 3, 2008.
Office Action mailed Mar. 10, 2014 in Taiwanese Patent Application 097125481, filed Jul. 4, 2008.
Bai et al., “Upmixing and Downmixing Two-channel Stereo Audio for Consumer Electronics”. IEEE Transactions on Consumer Electronics [Online] 2007, vol. 53, Issue 3, pp. 1011-1019.
Jo et al., “Crosstalk cancellation for spatial sound reproduction in portable devices with stereo loudspeakers”. Communications in Computer and Information Science [Online] 2011, vol. 266, pp. 114-123.
Nongpuir et al., “NEXT cancellation system with improved convergence rate and tracking performance”. IEEE Proceedings—Communications [Online] 2005, vol. 152, Issue 3, pp. 378-384.
Ahmed et al., “Blind Crosstalk Cancellation for DMT Systems” IEEE—Emergent Technologies Technical Committee. Sep. 2002. pp. 1-5.
Allowance mailed May 21, 2014 in Finnish Patent Application 20100001, filed Jan. 4, 2010.
Office Action mailed May 2, 2014 in Taiwanese Patent Application 098121933, filed Jun. 29, 2009.
Office Action mailed Apr. 15, 2014 in Japanese Patent Application 2010-514871, filed Jul. 3, 2008.
Office Action mailed Jun. 27, 2014 in Korean Patent Application No. 10-2010-7000194, filed Jan. 6, 2010.
Office Action mailed Jun. 18, 2014 in Finnish Patent Application No. 20080428, filed Jul. 4, 2008.
International Search Report & Written Opinion dated Jul. 15, 2014 in Patent Cooperation Treaty Application No. PCT/US2014/018443, filed Feb. 25, 2014.
Notice of Allowance dated Aug. 26, 2014 in Taiwanese Application No. 096146144, filed Dec. 4, 2007.
Notice of Allowance dated Sep. 16, 2014 in Korean Application No. 10-2010-7000194, filed Jul. 3, 2008.
Notice of Allowance dated Sep. 29, 2014 in Taiwanese Application No. 097125481, filed Jul. 4, 2008.
Notice of Allowance dated Oct. 10, 2014 in Finnish Application No. 20100001, filed Jul. 3, 2008.
International Search Report & Written Opinion dated Nov. 12, 2014 in Patent Cooperation Treaty Application No. PCT/US2014/047458, filed Jul. 21, 2014.
Office Action mailed Oct. 28, 2014 in Japanese Patent Application No. 2011-516313, filed Dec. 27, 2012.
Heiko Purnhagen, “Low Complexity Parametric Stereo Coding in MPEG-4,” Proc. of the 7th Int. Conference on Digital Audio Effects (DAFx'04), Naples, Italy, Oct. 5-8, 2004.
Chun-Ming Chang et al., “Voltage-Mode Multifunction Filter with Single Input and Three Outputs Using Two Compound Current Conveyors” IEEE Transactions on Circuits and Systems-I: Fundamental Theory and Applications, vol. 46, No. 11, Nov. 1999.
Notice of Allowance mailed Feb. 10, 2015 in Taiwanese Patent Application No. 098121933, filed Jun. 29, 2009.
Office Action mailed Jan. 30, 2015 in Finnish Patent Application No. 20080623, filed May 24, 2007.
Office Action mailed Mar. 24, 2015 in Japanese Patent Application No. 2011-516313, filed Jun. 26, 2009.
Office Action mailed Apr. 16, 2015 in Korean Patent Application No. 10-2011-7000440, filed Jun. 26, 2009.
Notice of Allowance mailed Jun. 2, 2015 in Japanese Patent Application 2011-516313, filed Jun. 26, 2009.
Office Action mailed Jun. 4, 2015 in Finnish Patent Application 20080428, filed Jan. 5, 2007.
Office Action mailed Jun. 9, 2015 in Japanese Patent Application 2014-165477 filed Jul. 3, 2008.
Notice of Allowance mailed Aug. 13, 2015 in Finnish Patent Application 20080623, filed May 24, 2007.
International Search Report & Written Opinion dated Nov. 27, 2015 in Patent Cooperation Treaty Application No. PCT/US2015/047263, filed Aug. 27, 2015.
Non-Final Office Action, Oct. 27, 2003, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Non-Final Office Action, Feb. 10, 2004, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Final Office Action, Dec. 17, 2004, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Non-Final Office Action, Apr. 20, 2005, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Notice of Allowance, Oct. 26, 2005, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Non-Final Office Action, May 3, 2005, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Final Office Action, Oct. 19, 2005, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Advisory Action, Jan. 20, 2006, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Non-Final Office Action, May 17, 2006, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Non-Final Office Action, Nov. 16, 2006, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Final Office Action, Jun. 15, 2007, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Non-Final Office Action, Oct. 8, 2003, U.S. Appl. No. 10/004,141, filed Nov. 14, 2001.
Notice of Allowance, Feb. 24, 2004, U.S. Appl. No. 10/004,141, filed Nov. 14, 2001.
Non-Final Office Action, May 9, 2003, U.S. Appl. No. 10/074,991, filed Feb. 13, 2002.
Notice of Allowance, Jun. 4, 2003, U.S. Appl. No. 10/074,991, filed Feb. 13, 2002.
Non-Final Office Action, Jun. 26, 2006, U.S. Appl. No. 10/074,991, filed Feb. 13, 2002.
Final Office Action, Feb. 23, 2007, U.S. Appl. No. 10/074,991, filed Feb. 13, 2002.
Non-Final Office Action, Oct. 6, 2005, U.S. Appl. No. 10/177,049, filed Jun. 21, 2002.
Final Office Action, Mar. 28, 2006, U.S. Appl. No. 10/177,049, filed Jun. 21, 2002.
Advisory Action, Jun. 19, 2006, U.S. Appl. No. 10/177,049, filed Jun. 21, 2002.
Non-Final Office Action, Dec. 13, 2006, U.S. Appl. No. 10/613,224, filed Jul. 3, 2003.
Non-Final Office Action, Jun. 13, 2007, U.S. Appl. No. 10/613,224, filed Jul. 3, 2003.
Non-Final Office Action, Jun. 13, 2006, U.S. Appl. No. 10/840,201, filed May 5, 2004.
Non-Final Office Action, Mar. 30, 2010, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Non-Final Office Action, Sep. 13, 2010, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Final Office Action, Mar. 30, 2011, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Final Office Action, May 21, 2012, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Notice of Allowance, Oct. 9, 2012, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Non-Final Office Action, Aug. 5, 2008, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Non-Final Office Action, Jan. 21, 2009, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Final Office Action, Sep. 3, 2009, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Non-Final Office Action, May 10, 2011, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Final Office Action, Oct. 24, 2011, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Notice of Allowance, Feb. 13, 2012, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Non-Final Office Action, Apr. 7, 2011, U.S. Appl. No. 11/699,732, filed Jan. 29, 2007.
Final Office Action, Dec. 6, 2011, U.S. Appl. No. 11/699,732, filed Jan. 29, 2007.
Advisory Action, Feb. 2012, U.S. Appl. No. 11/699,732, filed Jan. 29, 2007.
Notice of Allowance, Mar. 15, 2012, U.S. Appl. No. 11/699,732, filed Jan. 29, 2007.
Non-Final Office Action, Aug. 18, 2010 U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Final Office Action, Apr. 28, 2011, U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Non-Final Office Action, Apr. 24, 2013, U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Final Office Action, Dec. 30, 2013, U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Notice of Allowance, Mar. 25, 2014, U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Non-Final Office Action, Oct. 3, 2011, U.S. Appl. No. 12/004,788, filed Dec. 21, 2007.
Notice of Allowance, Feb. 23, 2012, U.S. Appl. No. 12/004,788, filed Dec. 21, 2007.
Non-Final Office Action, Sep. 14, 2011, U.S. Appl. No. 12/004,897, filed Dec. 21, 2007.
Notice of Allowance, Jan. 27, 2012, U.S. Appl. No. 12/004,897, filed Dec. 21, 2007.
Non-Final Office Action, Jul. 28, 2011, U.S. Appl. No. 12/072,931, filed Feb. 29, 2008.
Notice of Allowance, Mar. 1, 2012, U.S. Appl. No. 12/072,931, filed Feb. 29, 2008.
Notice of Allowance, Mar. 1, 2012, U.S. Appl. No. 12/080,115, filed Mar. 31, 2008.
Non-Final Office Action, Nov. 14, 2011, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Final Office Action, Apr. 24, 2012, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Advisory Action, Jul. 3, 2012, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Non-Final Office Action, Mar. 11, 2014, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Final Office Action, Jul. 11, 2014, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Non-Final Office Action, Dec. 8, 2014, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Notice of Allowance, Jul. 7, 2015, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Non-Final Office Action, Jul. 13, 2011, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Final Office Action, Nov. 16, 2011, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Non-Final Office Action, Mar. 14, 2012, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Final Office Action, Sep. 19, 2012, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Notice of Allowance, Apr. 15, 2013, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Non-Final Office Action, Sep. 1, 2011, U.S. Appl. No. 12/286,909, filed Oct. 2, 2008.
Notice of Allowance, Feb. 28, 2012, U.S. Appl. No. 12/286,909, filed Oct. 2, 2008.
Non-Final Office Action, Nov. 15, 2011, U.S. Appl. No. 12/286,995, filed Oct. 2, 2008.
Final Office Action, Apr. 10, 2012, U.S. Appl. No. 12/286,995, filed Oct. 2, 2008.
Notice of Allowance, Mar. 13, 2014, U.S. Appl. No. 12/286,995, filed Oct. 2, 2008.
Non-Final Office Action, Dec. 28, 2011, U.S. Appl. No. 12/288,228, filed Oct. 16, 2008.
Non-Final Office Action, Dec. 30, 2011, U.S. Appl. No. 12/422,917, filed Apr. 13, 2009.
Final Office Action, May 14, 2012, U.S. Appl. No. 12/422,917, filed Apr. 13, 2009.
Advisory Action, Jul. 27, 2012, U.S. Appl. No. 12/422,917, filed Apr. 13, 2009.
Notice of Allowance, Sep. 11, 2014, U.S. Appl. No. 12/422,917, filed Apr. 13, 2009.
Non-Final Office Action, Jun. 20, 2012, U.S. Appl. No. 12/649,121, filed Dec. 29, 2009.
Final Office Action, Nov. 28, 2012, U.S. Appl. No. 12/649,121, filed Dec. 29, 2009.
Advisory Action, Feb. 19, 2013, U.S. Appl. No. 12/649,121, filed Dec. 29, 2009.
Notice of Allowance, Mar. 19, 2013, U.S. Appl. No. 12/649,121, filed Dec. 29, 2009.
Non-Final Office Action, Feb. 19, 2013, U.S. Appl. No. 12/944,659, filed Nov. 11, 2010.
Notice of Allowance, May 25, 2011, U.S. Appl. No. 13/016,916, filed Jan. 28, 2011.
Notice of Allowance, Aug. 4, 2011, U.S. Appl. No. 13/016,916, filed Jan. 28, 2011.
Non-Final Office Action, Nov. 2013, U.S. Appl. No. 13/363,362, filed Jan. 31, 2012.
Final Office Action, Sep. 12, 2014, U.S. Appl. No. 13/363,362, filed Jan. 31, 2012.
Non-Final Office Action, Oct. 28, 2015, U.S. Appl. No. 13/363,362, filed Jan. 31, 2012.
Non-Final Office Action, Dec. 4, 2013, U.S. Appl. No. 13/396,568, filed Feb. 14, 2012.
Final Office Action, Sep. 23, 2014, U.S. Appl. No. 13/396,568, filed Feb. 14, 2012.
Non-Final Office Action, Nov. 5, 2015, U.S. Appl. No. 13/396,568, filed Feb. 14, 2012.
Non-Final Office Action, Sep. 17, 2013, U.S. Appl. No. 13/397,597, filed Feb. 15, 2012.
Final Office Action, Apr. 1, 2014, U.S. Appl. No. 13/397,597, filed Feb. 15, 2012.
Non-Final Office Action, Nov. 21, 2014, U.S. Appl. No. 13/397,597, filed Feb. 15, 2012.
Non-Final Office Action, Jun. 7, 2012, U.S. Appl. No. 13/426,436, filed Mar. 21, 2012.
Final Office Action, Dec. 31, 2012, U.S. Appl. No. 13/426,436, filed Mar. 21, 2012.
Non-Final Office Action, Sep. 12, 2013, U.S. Appl. No. 13/426,436, filed Mar. 21, 2012.
Notice of Allowance, Jul. 16, 2014, U.S. Appl. No. 13/426,436, filed Mar. 21, 2012.
Non-Final Office Action, Jul. 15, 2014, U.S. Appl. No. 13/432,490, filed Mar. 28, 2012.
Notice of Allowance, Apr. 3, 2015, U.S. Appl. No. 13/432,490, filed Mar. 28, 2012.
Notice of Allowance, Oct. 17, 2012, U.S. Appl. No. 13/565,751, filed Aug. 2, 2012.
Non-Final Office Action, Jan. 9, 2012, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Non-Final Office Action, Dec. 28, 2012, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Non-Final Office Action, Mar. 7, 2013, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Final Office Action, Apr. 29, 2013, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Non-Final Office Action, Nov. 27, 2013, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Notice of Allowance, Jan. 30, 2014, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Non-Final Office Action, Jun. 4, 2013, U.S. Appl. No. 13/705,132, filed Dec. 4, 2012.
Final Office Action, Dec. 19, 2013, U.S. Appl. No. 13/705,132, filed Dec. 4, 2012.
Notice of Allowance, Jun. 19, 2014, U.S. Appl. No. 13/705,132, filed Dec. 4, 2012.
Non-Final Office Action, Jul. 14, 2015, U.S. Appl. No. 14/046,551, filed Oct. 4, 2013.
Non-Final Office Action, May 21, 2015, U.S. Appl. No. 14/189,817, filed Feb. 25, 2014.
Final Office Action, Dec. 15, 2015, U.S. Appl. No. 14/189,817, filed Feb. 25, 2014.
Notice of Allowance, Oct. 7, 2014, U.S. Appl. No. 14/207,096, filed Mar. 12, 2014.
Non-Final Office Action, Oct. 28, 2015, U.S. Appl. No. 14/216,567, filed Mar. 17, 2014.
Non-Final Office Action, Jul. 10, 2014, U.S. Appl. No. 14/279,092, filed May 15, 2014.
Notice of Allowance, Jan. 29, 2015, U.S. Appl. No. 14/279,092, filed May 15, 2014.
Non-Final Office Action, Feb. 27, 2015, U.S. Appl. No. 14/336,934, filed Jul. 21, 2014.
Notice of Allowance, Aug. 28, 2015, U.S. Appl. No. 14/336,934, filed Jul. 21, 2014.
Allen, Jont B. “Short Term Spectral Analysis, Synthesis, and Modification by Discrete Fourier Transform”, IEEE Transactions on Acoustics, Speech, and Signal Processing. vol. ASSP-25, No. 3, Jun. 1977. pp. 235-238.
Allen, Jont B. et al., “A Unified Approach to Short-Time Fourier Analysis and Synthesis”, Proceedings of the IEEE. vol. 65, No. 11, Nov. 1977. pp. 1558-1564.
Avendano, Carlos, “Frequency-Domain Source Identification and Manipulation in Stereo Mixes for Enhancement, Suppression and Re-Panning Applications,” 2003 IEEE Workshop on Application of Signal Processing to Audio and Acoustics, Oct. 19-22, pp. 55-58, New Paltz, New York, USA.
Boll, Steven F. “Suppression of Acoustic Noise in Speech using Spectral Subtraction”, IEEE Transactions on Acoustics, Speech and Signal Processing, vol. ASSP-27, No. 2, Apr. 1979, pp. 113-120.
Boll, Steven F. et al., “Suppression of Acoustic Noise in Speech Using Two Microphone Adaptive Noise Cancellation”, IEEE Transactions on Acoustic, Speech, and Signal Processing, vol. ASSP-28, No. 6, Dec. 1980, pp. 752-753.
Boll, Steven F. “Suppression of Acoustic Noise in Speech Using Spectral Subtraction”, Dept. of Computer Science, University of Utah, Salt Lake City, Utah, Apr. 1979, pp. 18-19.
Chen, Jingdong et al., “New Insights into the Noise Reduction Wiener Filter”, IEEE Transactions on Audio, Speech, and Language Processing. vol. 14, No. 4, Jul. 2006, pp. 1218-1234.
Cohen, Israel et al., “Microphone Array Post-Filtering for Non-Stationary Noise Suppression”, IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2002, pp. 1-4.
Cohen, Israel, “Multichannel Post-Filtering in Nonstationary Noise Environments”, IEEE Transactions on Signal Processing, vol. 52, No. 5, May 2004, pp. 1149-1160.
Dahl, Mattias et al., “Simultaneous Echo Cancellation and Car Noise Suppression Employing a Microphone Array”, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr. 21-24, pp. 239-242.
Elko, Gary W., “Chapter 2: Differential Microphone Arrays”, “Audio Signal Processing for Next-Generation Multimedia Communication Systems”, 2004, pp. 12-65, Kluwer Academic Publishers, Norwell, Massachusetts, USA.
“ENT 172,” Instructional Module. Prince George's Community College Department of Engineering Technology. Accessed: Oct. 15, 2011. Subsection: “Polar and Rectangular Notation”. <http://academic.ppgcc.edu/ent/ent172_instr_mod.html>.
Fuchs, Martin et al., “Noise Suppression for Automotive Applications Based on Directional Information”, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, May 17-21, pp. 237-240.
Fulghum, D. P. et al., “LPC Voice Digitizer with Background Noise Suppression”, 1979 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 220-223.
Goubran, R.A. et al., “Acoustic Noise Suppression Using Regressive Adaptive Filtering”, 1990 IEEE 40th Vehicular Technology Conference, May 6-9, pp. 48-53.
Graupe, Daniel et al., “Blind Adaptive Filtering of Speech from Noise of Unknown Spectrum Using a Virtual Feedback Configuration”, IEEE Transactions on Speech and Audio Processing, Mar. 2000, vol. 8, No. 2, pp. 146-158.
Haykin, Simon et al., “Appendix A.2 Complex Numbers.” Signals and Systems. 2nd Ed. 2003. p. 764.
Hermansky, Hynek “Should Recognizers Have Ears?”, In Proc. ESCA Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, pp. 1-10, France 1997.
Hohmann, V. “Frequency Analysis and Synthesis Using a Gammatone Filterbank”, ACTA Acustica United with Acustica, 2002, vol. 88, pp. 433-442.
Jeffress, Lloyd A. et al., “A Place Theory of Sound Localization,” Journal of Comparative and Physiological Psychology, 1948, vol. 41, pp. 35-39.
Jeong, Hyuk et al., “Implementation of a New Algorithm Using the STFT with Variable Frequency Resolution for the Time-Frequency Auditory Model”, J. Audio Eng. Soc., Apr. 1999, vol. 47, No. 4, pp. 240-251.
Kates, James M. “A Time-Domain Digital Cochlear Model”, IEEE Transactions on Signal Processing, Dec. 1991, vol. 39, No. 12, pp. 2573-2592.
Kato et al., “Noise Suppression with High Speech Quality Based on Weighted Noise Estimation and MMSE STSA” Proc. IWAENC [Online] 2001, pp. 183-186.
Lazzaro, John et al., “A Silicon Model of Auditory Localization,” Neural Computation Spring 1989, vol. 1, pp. 47-57, Massachusetts Institute of Technology.
Lippmann, Richard P. “Speech Recognition by Machines and Humans”, Speech Communication, Jul. 1997, vol. 22, No. 1, pp. 1-15.
Liu, Chen et al., “A Two-Microphone Dual Delay-Line Approach for Extraction of a Speech Sound in the Presence of Multiple Interferers”, Journal of the Acoustical Society of America, vol. 110, No. 6, Dec. 2001, pp. 3218-3231.
Martin, Rainer et al., “Combined Acoustic Echo Cancellation, Dereverberation and Noise Reduction: A two Microphone Approach”, Annales des Telecommunications/Annals of Telecommunications. vol. 49, No. 7-8, Jul.-Aug. 1994, pp. 429-438.
Martin, Rainer “Spectral Subtraction Based on Minimum Statistics”, in Proceedings of the European Signal Processing Conference, 1994, pp. 1182-1185.
Mitra, Sanjit K. Digital Signal Processing: a Computer-based Approach. 2nd Ed. 2001. pp. 131-133.
Mizumachi, Mitsunori et al., “Noise Reduction by Paired-Microphones Using Spectral Subtraction”, 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, May 12-15. pp. 1001-1004.
Moonen, Marc et al., “Multi-Microphone Signal Enhancement Techniques for Noise Suppression and Dereverberation,” http://www.esat.kuleuven.ac.be/sista/yearreport97//node37.html, accessed on Apr. 21, 1998.
Watts, Lloyd Narrative of Prior Disclosure of Audio Display on Feb. 15, 2000 and May 31, 2000.
Cosi, Piero et al., (1996), “Lyon's Auditory Model Inversion: a Tool for Sound Separation and Speech Enhancement,” Proceedings of ESCA Workshop on ‘The Auditory Basis of Speech Perception,’ Keele University, Keele (UK), Jul. 15-19, 1996, pp. 194-197.
Parra, Lucas et al., “Convolutive Blind Separation of Non-Stationary Sources”, IEEE Transactions on Speech and Audio Processing, vol. 8, No. 3, May 2000, pp. 320-327.
Rabiner, Lawrence R. et al., “Digital Processing of Speech Signals”, (Prentice-Hall Series in Signal Processing). Upper Saddle River, NJ: Prentice Hall, 1978.
Weiss, Ron et al., “Estimating Single-Channel Source Separation Masks: Relevance Vector Machine Classifiers vs. Pitch-Based Masking”, Workshop on Statistical and Perceptual Audio Processing, 2006.
Schimmel, Steven et al., “Coherent Envelope Detection for Modulation Filtering of Speech,” 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, No. 7, pp. 221-224.
Slaney, Malcolm, “Lyon's Cochlear Model”, Advanced Technology Group, Apple Technical Report #13, Apple Computer, Inc., 1988, pp. 1-79.
Slaney, Malcolm, et al., “Auditory Model Inversion for Sound Separation,” 1994 IEEE International Conference on Acoustics, Speech and Signal Processing, Apr. 19-22, vol. 2, pp. 77-80.
Slaney, Malcolm. “An Introduction to Auditory Model Inversion”, Interval Technical Report IRC 1994-014, http://coweb.ecn.purdue.edu/~maclom/interval/1994-014/, Sep. 1994, accessed on Jul. 6, 2010.
Solbach, Ludger “An Architecture for Robust Partial Tracking and Onset Localization in Single Channel Audio Signal Mixes”, Technical University Hamburg-Harburg, 1998.
Soon et al., “Low Distortion Speech Enhancement” Proc. Inst. Elect. Eng. [Online] 2000, vol. 147, pp. 247-253.
Stahl, V. et al., “Quantile Based Noise Estimation for Spectral Subtraction and Wiener Filtering,” 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, Jun. 5-9, vol. 3, pp. 1875-1878.
Syntrillium Software Corporation, “Cool Edit User's Manual”, 1996, pp. 1-74.
Tashev, Ivan et al., “Microphone Array for Headset with Spatial Noise Suppressor”, http://research.microsoft.com/users/ivantash/Documents/Tashev_MAforHeadset_HSCMA_05.pdf. (4 pages).
Tchorz, Jurgen et al., “SNR Estimation Based on Amplitude Modulation Analysis with Applications to Noise Suppression”, IEEE Transactions on Speech and Audio Processing, vol. 11, No. 3, May 2003, pp. 184-192.
Valin, Jean-Marc et al., “Enhanced Robot Audition Based on Microphone Array Source Separation with Post-Filter”, Proceedings of 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sep. 28-Oct. 2, 2004, Sendai, Japan. pp. 2123-2128.
Watts, Lloyd, “Robust Hearing Systems for Intelligent Machines,” Applied Neurosystems Corporation, 2001, pp. 1-5.
Widrow, B. et al., “Adaptive Antenna Systems,” Proceedings of the IEEE, vol. 55, No. 12, pp. 2143-2159, Dec. 1967.
Yoo, Heejong et al., “Continuous-Time Audio Noise Suppression and Real-Time Implementation”, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, May 13-17, pp. IV3980-IV3983.
Office Action mailed May 17, 2016 in Korean Patent Application 1020127001822 filed Jun. 21, 2010.
Lauber, Pierre et al., “Error Concealment for Compressed Digital Audio,” Audio Engineering Society, 2001.
International Search Report and Written Opinion dated May 20, 2010 in Patent Cooperation Treaty Application No. PCT/US2009/006754.
Fast Cochlea Transform, US Trademark Reg. No. 2,875,755 (Aug. 17, 2004).
3GPP2 “Enhanced Variable Rate Codec, Speech Service Options 3, 68, 70, and 73 for Wideband Spread Spectrum Digital Systems”, May 2009, pp. 1-308.
3GPP2 “Selectable Mode Vocoder (SMV) Service Option for Wideband Spread Spectrum Communication Systems”, Jan. 2004, pp. 1-231.
3GPP2 “Source-Controlled Variable-Rate Multimode Wideband Speech Codec (VMR-WB) Service Option 62 for Spread Spectrum Systems”, Jun. 11, 2004, pp. 1-164.
3GPP “3GPP Specification 26.071 Mandatory Speech Codec Speech Processing Functions; AMR Speech Codec; General Description”, http://www.3gpp.org/ftp/Specs/html-info/26071.htm, accessed on Jan. 25, 2012.
3GPP “3GPP Specification 26.094 Mandatory Speech Codec Speech Processing Functions; Adaptive Multi-Rate (AMR) Speech Codec; Voice Activity Detector (VAD)”, http://www.3gpp.org/ftp/Specs/html-info/26094.htm, accessed on Jan. 25, 2012.
3GPP “3GPP Specification 26.171 Speech Codec Speech Processing Functions; Adaptive Multi-Rate—Wideband (AMR-WB) Speech Codec; General Description”, http://www.3gpp.org/ftp/Specs/html-info26171.htm, accessed on Jan. 25, 2012.
3GPP “3GPP Specification 26.194 Speech Codec Speech Processing Functions; Adaptive Multi-Rate—Wideband (AMR-WB) Speech Codec; Voice Activity Detector (VAD)” http://www.3gpp.org/ftp/Specs/html-info26194.htm, accessed on Jan. 25, 2012.
International Telecommunication Union “Coding of Speech at 8 kbit/s Using Conjugate-Structure Algebraic-code-excited Linear-prediction (CS-ACELP)”, Mar. 19, 1996, pp. 1-39.
International Telecommunication Union “Coding of Speech at 8 kbit/s Using Conjugate Structure Algebraic-code-excited Linear-prediction (CS-ACELP) Annex B: A Silence Compression Scheme for G.729 Optimized for Terminals Conforming to Recommendation V.70”, Nov. 8, 1996, pp. 1-23.
International Search Report and Written Opinion dated Aug. 19, 2010 in Patent Cooperation Treaty Application No. PCT/US2010/001786.
International Search Report and Written Opinion dated Feb. 7, 2011 in Patent Cooperation Treaty Application No. PCT/US2010/058600, filed Dec. 1, 2010.
Cisco, “Understanding How Digital T1 CAS (Robbed Bit Signaling) Works in IOS Gateways”, Jan. 17, 2007, http://www.cisco.com/image/gif/paws/22444/t1-cas-ios.pdf, accessed on Apr. 3, 2012.
Jelinek et al., “Noise Reduction Method for Wideband Speech Coding” Proc. Eusipco, Vienna, Austria, Sep. 2004, pp. 1959-1962.
Widjaja et al., “Application of Differential Microphone Array for IS-127 EVRC Rate Determination Algorithm”, Interspeech 2009, 10th Annual Conference of the International Speech Communication Association, Brighton, United Kingdom, Sep. 6-10, 2009, pp. 1123-1126.
Sugiyama et al., “Single-Microphone Noise Suppression for 3G Handsets Based on Weighted Noise Estimation” in Benesty et al., “Speech Enhancement”, 2005, pp. 115-133, Springer Berlin Heidelberg.
Watts, “Real-Time, High-Resolution Simulation of the Auditory Pathway, with Application to Cell-Phone Noise Reduction” Proceedings of 2010 IEEE International Symposium on Circuits and Systems (ISCAS), May 30-Jun. 2, 2010, pp. 3821-3824.
3GPP Minimum Performance Specification for the Enhanced Variable rate Codec, Speech Service Option 3 and 68 for Wideband Spread Spectrum Digital Systems, Jul. 2007, pp. 1-83.
Ramakrishnan, “Reconstruction of Incomplete Spectrograms for Robust Speech Recognition,” PhD thesis, Carnegie Mellon University, Pittsburgh, Pennsylvania, 2000.
Kim et al., “Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions,” Audio, Speech, and Language Processing, IEEE Transactions on, vol. 18, No. 8 pp. 2111-2120, Nov. 2010.
Cooke et al.,“Robust Automatic Speech Recognition with Missing and Unreliable Acoustic data,” Speech Commun., vol. 34, No. 3, pp. 267-285, 2001.
Liu et al., “Efficient cepstral normalization for robust speech recognition.” Proceedings of the workshop on Human Language Technology. Association for Computational Linguistics, 1993.
Yoshizawa et al., “Cepstral Gain Normalization for Noise Robust Speech Recognition,” Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), vol. 1, IEEE, 2004.
Office Action mailed Apr. 8, 2014 in Japan Patent Application 2011-544416, filed Dec. 30, 2009.
Elhilali et al., “A cocktail party with a cortical twist: How cortical mechanisms contribute to sound segregation,” J. Acoust. Soc. Am., vol. 124, No. 6, Dec. 2008, pp. 3751-3771.
Jin et al., “HMM-Based Multipitch Tracking for Noisy and Reverberant Speech.” Jul. 2011.
Kawahara, W., et al., “Tandem-Straight: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation.” IEEE ICASSP 2008.
Lu et al. “A Robust Audio Classification and Segmentation Method.” Microsoft Research, 2001, pp. 203, 206, and 207.
Office Action dated Aug. 26, 2014 in Japan Application No. 2012-542167, filed Dec. 1, 2010.
Office Action mailed Oct. 31, 2014 in Finland Patent Application No. 20125600, filed Jun. 1, 2012.
Krini, Mohamed et al., “Model-Based Speech Enhancement,” in Speech and Audio Processing in Adverse Environments; Signals and Communication Technology, edited by Hansler et al., 2008, Chapter 4, pp. 89-134.
Office Action mailed Dec. 9, 2014 in Japan Patent Application No. 2012-518521, filed Jun. 21, 2010.
Office Action mailed Dec. 10, 2014 in Taiwan Patent Application No. 099121290, filed Jun. 29, 2010.
Nayebi et al., “Low delay FIR filter banks: design and evaluation” IEEE Transactions on Signal Processing, vol. 42, No. 1, pp. 24-31, Jan. 1994.
Notice of Allowance mailed Feb. 17, 2015 in Japan Patent Application No. 2011-544416, filed Dec. 30, 2009.
Office Action mailed Mar. 27, 2015 in Korean Patent Application No. 10-2011-7016591, filed Dec. 30, 2009.
Office Action mailed Jul. 21, 2015 in Japan Patent Application No. 2012-542167, filed Dec. 1, 2010.
Office Action mailed Sep. 29, 2015 in Finland Patent Application No. 20125600, filed Dec. 1, 2010.
Office Action mailed Oct. 15, 2015 in Korean Patent Application 10-2011-7016591.
Allowance mailed Nov. 17, 2015 in Japan Patent Application No. 2012-542167, filed Dec. 1, 2010.
International Search Report & Written Opinion dated Dec. 14, 2015 in Patent Cooperation Treaty Application No. PCT/US2015/049816, filed Sep. 11, 2015.
International Search Report & Written Opinion dated Dec. 22, 2015 in Patent Cooperation Treaty Application No. PCT/US2015/052433, filed Sep. 25, 2015.
Notice of Allowance dated Jan. 14, 2016 in South Korean Patent Application No. 10-2011-7016591 filed Jul. 15, 2011.
International Search Report & Written Opinion dated Feb. 12, 2016 in Patent Cooperation Treaty Application No. PCT/US2015/064523, filed Dec. 8, 2015.
International Search Report & Written Opinion dated Feb. 11, 2016 in Patent Cooperation Treaty Application No. PCT/US2015/063519, filed Dec. 2, 2015.
Klein, David, “Noise-Robust Multi-Lingual Keyword Spotting with a Deep Neural Network Based Architecture”, U.S. Appl. No. 14/614,348, filed Feb. 4, 2015.
Vitus, Deborah Kathleen et al., “Method for Modeling User Possession of Mobile Device for User Authentication Framework”, U.S. Appl. No. 14/548,207, filed Nov. 19, 2014.
Murgia, Carlo, “Selection of System Parameters Based on Non-Acoustic Sensor Information”, U.S. Appl. No. 14/331,205, filed Jul. 14, 2014.
Goodwin, Michael M. et al., “Key Click Suppression”, U.S. Appl. No. 14/745,176, filed Jun. 19, 2015.
Related Publications (1)
Number Date Country
20150025881 A1 Jan 2015 US
Provisional Applications (2)
Number Date Country
61856577 Jul 2013 US
61972112 Mar 2014 US