Speech signal separation and synthesis based on auditory scene analysis and speech modeling

Information

  • Patent Grant
  • Patent Number
    9,536,540
  • Date Filed
    Friday, July 18, 2014
  • Date Issued
    Tuesday, January 3, 2017
  • CPC
  • Field of Search
    • US
      • 704/9
      • 704/200
      • 704/247
      • 704/251
      • 704/275
    • CPC
      • G10L15/005
      • G10L15/22
      • G10L17/22
  • International Classifications
    • G10L21/0272
  • Term Extension
    26
Abstract
Provided are systems and methods for generating clean speech from a speech signal representing a mixture of noise and speech. The clean speech may be generated from synthetic speech parameters. The synthetic speech parameters are derived from the speech signal components and a model of speech using auditory and speech production principles. The modeling may utilize a source-filter structure of the speech signal. One or more spectral analyses are performed on the speech signal to generate spectral representations. Feature data are derived from the spectral representations. The features corresponding to the target speech are grouped according to a model of speech and separated from the feature data. The synthetic speech parameters, including a spectral envelope, pitch data, and voice classification data, are generated based on the features corresponding to the target speech.
Description
TECHNICAL FIELD

The present disclosure relates generally to audio processing, and, more particularly, to generating clean speech from a mixture of noise and speech.


BACKGROUND

Current noise suppression techniques, such as Wiener filtering, attempt to improve the global signal-to-noise ratio (SNR) and attenuate low-SNR regions, thus introducing distortion into the speech signal. It is common practice to perform such filtering as a magnitude modification in a transform domain. Typically, the corrupted signal is used to reconstruct the signal with the modified magnitude. This approach may miss signal components dominated by noise, thereby resulting in undesirable and unnatural spectro-temporal modulations.
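
As a minimal sketch of this conventional approach (assuming an STFT-domain signal and an externally supplied noise power estimate, both hypothetical inputs), the modified magnitude is obtained by applying a Wiener-style gain per bin while the corrupted signal's phase is reused:

```python
import numpy as np

def wiener_gain_enhance(noisy_stft, noise_psd, floor=0.1):
    """Conventional transform-domain enhancement: scale each noisy STFT bin
    by a Wiener-style gain and keep the noisy phase for reconstruction.
    noisy_stft: complex array (freq_bins, frames)
    noise_psd:  estimated noise power per bin (freq_bins, 1) -- assumed given
    """
    noisy_psd = np.abs(noisy_stft) ** 2
    # A-priori SNR approximated by the (clipped) a-posteriori SNR.
    snr = np.maximum(noisy_psd / (noise_psd + 1e-12) - 1.0, 0.0)
    gain = snr / (snr + 1.0)        # Wiener gain in [0, 1]
    gain = np.maximum(gain, floor)  # limit attenuation (distortion trade-off)
    # Magnitude is modified; the corrupted phase is reused for reconstruction.
    return gain * noisy_stft
```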


When the target signal is dominated by noise, a system that synthesizes a clean speech signal instead of enhancing the corrupted audio via modifications is advantageous for achieving high signal-to-noise ratio improvement (SNRI) values and low signal distortion.


SUMMARY

This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.


According to an aspect of the present disclosure, a method is provided for generating clean speech from a mixture of noise and speech. The method may include deriving synthetic speech parameters based on the mixture of noise and speech and a model of speech, and synthesizing clean speech based at least partially on the speech parameters.


In some embodiments, deriving the speech parameters commences with performing one or more spectral analyses on the mixture of noise and speech to generate one or more spectral representations. The one or more spectral representations can then be used for deriving feature data. The features corresponding to the target speech may then be grouped according to the model of speech and separated from the feature data. Analysis of feature representations may allow segmentation and grouping of speech component candidates. In certain embodiments, candidates for the features corresponding to target speech are evaluated by a multi-hypothesis tracking system aided by the model of speech. The synthetic speech parameters can be generated based partially on features corresponding to the target speech.


In some embodiments, the generated synthetic speech parameters include spectral envelope and voicing information. The voicing information may include pitch data and voice classification data. In some embodiments, the spectral envelope is estimated from a sparse spectral envelope.


In various embodiments, the method includes determining, based on a noise model, non-speech components in the feature data. The non-speech components as determined may be used in part to discriminate between speech components and noise components.


In various embodiments, the speech components may be used to determine pitch data. In some embodiments, the non-speech components may also be used in the pitch determination. (For instance, knowledge about where noise components occlude speech components may be used.) The pitch data may be interpolated to fill missing frames before synthesizing clean speech, where a missing frame is a frame for which a reliable pitch estimate could not be determined.


In some embodiments, the method includes generating, based on the pitch data, a harmonic map representing voiced speech. The method may further include estimating a map for unvoiced speech based on the non-speech components from feature data and the harmonic map. The harmonic map and map for unvoiced speech may be used to generate a mask for extracting the sparse spectral envelope from the spectral representation of the mixture of noise and speech.


In further example embodiments of the present disclosure, the method steps are stored on a machine-readable medium comprising instructions, which, when implemented by one or more processors, perform the recited steps. In yet further example embodiments, hardware systems, or devices can be adapted to perform the recited steps. Other features, examples, and embodiments are described below.





BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments are illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements and in which:



FIG. 1 shows an example system suitable for implementing various embodiments of the methods for generating clean speech from a mixture of noise and speech.



FIG. 2 illustrates a system for speech processing, according to an example embodiment.



FIG. 3 illustrates a system for separation and synthesis of a speech signal, according to an example embodiment.



FIG. 4 shows an example of a voiced frame.



FIG. 5 is a time-frequency plot of sparse envelope estimation for voiced frames, according to an example embodiment.



FIG. 6 shows an example of envelope estimation.



FIG. 7 is a diagram illustrating a speech synthesizer, according to an example embodiment.



FIG. 8A shows example synthesis parameters for a clean female speech sample.



FIG. 8B is a close-up of FIG. 8A showing example synthesis parameters for a clean female speech sample.



FIG. 9 illustrates an input and an output of a system for separation and synthesis of speech signals, according to an example embodiment.



FIG. 10 illustrates an example method for generating clean speech from a mixture of noise and speech.



FIG. 11 illustrates an example computer system that may be used to implement embodiments of the present technology.





DETAILED DESCRIPTION

The following detailed description includes references to the accompanying drawings, which form a part of the detailed description. The drawings show illustrations in accordance with exemplary embodiments. These exemplary embodiments, which are also referred to herein as “examples,” are described in enough detail to enable those skilled in the art to practice the present subject matter. The embodiments can be combined, other embodiments can be utilized, or structural, logical, and electrical changes can be made without departing from the scope of what is claimed. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope is defined by the appended claims and their equivalents.


Provided are systems and methods that allow generating clean speech from a mixture of noise and speech. Embodiments described herein can be practiced on any device that is configured to receive and/or provide a speech signal, including, but not limited to, personal computers (PCs), tablet computers, mobile devices, cellular phones, phone handsets, headsets, media devices, internet-connected (internet-of-things) devices, and systems for teleconferencing applications. The technologies of the current disclosure may also be used in personal hearing devices, non-medical hearing aids, hearing aids, and cochlear implants.


According to various embodiments, the method for generating a clean speech signal from a mixture of noise and speech includes estimating speech parameters from a noisy mixture using auditory (e.g., perceptual) and speech production principles (e.g., separation of source and filter components). The estimated parameters are then used for synthesizing clean speech or can potentially be used in other applications where the speech signal may not necessarily be synthesized but where certain parameters or features corresponding to the clean speech signal are needed (e.g., automatic speech recognition and speaker identification).



FIG. 1 shows an example system 100 suitable for implementing methods for the various embodiments described herein. In some embodiments, the system 100 comprises a receiver 110, a processor 120, a microphone 130, an audio processing system 140, and an output device 150. The system 100 may comprise more or other components to provide a particular operation or functionality. Similarly, the system 100 may comprise fewer components that perform similar or equivalent functions to those depicted in FIG. 1. In addition, elements of system 100 may be cloud-based, including but not limited to, the processor 120.


The receiver 110 can be configured to communicate with a network such as the Internet, Wide Area Network (WAN), Local Area Network (LAN), cellular network, and so forth, to receive an audio data stream, which may comprise one or more channels of audio data. The received audio data stream may then be forwarded to the audio processing system 140 and the output device 150.


The processor 120 may include hardware and software that implement the processing of audio data and various other operations depending on a type of the system 100 (e.g., communication device or computer). A memory (e.g., non-transitory computer readable storage medium) may store, at least in part, instructions and data for execution by processor 120.


The audio processing system 140 includes hardware and software that implement the methods according to various embodiments disclosed herein. The audio processing system 140 is further configured to receive acoustic signals from an acoustic source via microphone 130 (which may be one or more microphones or acoustic sensors) and process the acoustic signals. After reception by the microphone 130, the acoustic signals may be converted into electric signals by an analog-to-digital converter.


The output device 150 includes any device that provides an audio output to a listener (e.g., the acoustic source). For example, the output device 150 may comprise a speaker, a class-D output, an earpiece of a headset, or a handset on the system 100.



FIG. 2 shows a system 200 for speech processing, according to an example embodiment. The example system 200 includes at least an analysis module 210, a feature estimation module 220, a grouping module 230, and a speech information extraction and modeling module 240. In certain embodiments, the system 200 includes a speech synthesis module 250. In other embodiments, the system 200 includes a speaker recognition module 260. In yet further embodiments, the system 200 includes an automatic speech recognition module 270.


In some embodiments, the analysis module 210 is operable to receive one or more time-domain speech input signals. The speech input can be analyzed with a multi-resolution front end that yields spectral representations at various predetermined time-frequency resolutions.
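
The patent does not specify window lengths; as an illustration only, a multi-resolution front end could compute several STFTs of the same input with different (assumed) window sizes, where longer windows resolve harmonic structure and shorter windows resolve transients:

```python
import numpy as np
from scipy.signal import stft

def multi_resolution_analysis(x, fs, win_lengths_ms=(64, 32, 8)):
    """Compute STFTs of the same signal at several time-frequency
    resolutions. The window lengths are illustrative, not from the patent."""
    representations = {}
    for win_ms in win_lengths_ms:
        nperseg = int(fs * win_ms / 1000)
        # Long windows favor narrowband (tonal) detail, short windows favor transients.
        f, t, spec = stft(x, fs=fs, nperseg=nperseg, noverlap=nperseg // 2)
        representations[win_ms] = (f, t, spec)
    return representations
```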


In some embodiments, the feature estimation module 220 receives various analysis data from the analysis module 210. Signal features can be derived from the various analyses according to the type of feature (for example, a narrowband spectral analysis for tone detection and a wideband spectral analysis for transient detection) to generate a multi-dimensional feature space.


In various embodiments, the grouping module 230 receives the feature data from the feature estimation module 220. The features corresponding to target speech may then be grouped according to auditory scene analysis principles (e.g., common fate) and separated from the features of the interference or noise. In certain embodiments, in the case of multi-talker input or other speech-like distractors, a multi-hypothesis grouper can be used for scene organization.


In some embodiments, the order of the grouping module 230 and feature estimation module 220 may be reversed, such that grouping module 230 groups the spectral representation (e.g., from analysis module 210) before the feature data is derived in feature estimation module 220.


A resultant sparse multi-dimensional feature set may be passed from the grouping module 230 to the speech information extraction and modeling module 240. The speech information extraction and modeling module 240 can be operable to generate output parameters representing the target speech in the noisy speech input.


In some embodiments, the output of the speech information extraction and modeling module 240 includes synthesis parameters and acoustic features. In certain embodiments, the synthesis parameters are passed to the speech synthesis module 250 for synthesizing clean speech output. In other embodiments, the acoustic features generated by speech information extraction and modeling module 240 are passed to the automatic speech recognition module 270 or the speaker recognition module 260.



FIG. 3 shows a system 300 for speech processing, specifically, speech separation and synthesis for noise suppression, according to another example embodiment. The system 300 may include a multi-resolution analysis (MRA) module 310, a noise model module 320, a pitch estimation module 330, a grouping module 340, a harmonic map unit 350, a sparse envelope unit 360, a speech envelope model module 370, and a synthesis module 380.


In some embodiments, the MRA module 310 receives the speech input signal. The speech input signal can be contaminated by additive noise and room reverberation. The MRA module 310 can be operable to generate one or more short-time spectral representations.


This short-time analysis from the MRA module 310 can be initially used for deriving an estimate of the background noise via the noise model module 320. The noise estimate can then be used for grouping in grouping module 340 and to improve the robustness of pitch estimation in pitch estimation module 330. The pitch track generated by the pitch estimation module 330, including a voicing decision, may be used for generating a harmonic map (at the harmonic map unit 350) and as an input to the synthesis module 380.


In some embodiments, the harmonic map (which represents the voiced speech), from the harmonic map unit 350, and the noise model, from the noise model module 320, are used for estimating a map of unvoiced speech (i.e., the difference between the input and the noise model in a non-voiced frame). The voiced and unvoiced maps may then be grouped (at the grouping module 340) and used to generate a mask for extracting a sparse envelope (at the sparse envelope unit 360) from the input signal representation. Finally, the speech envelope model module 370 may estimate the spectral envelope (ENV) from the sparse envelope and may feed the ENV to the speech synthesizer (e.g., synthesis module 380), which together with the voicing information (pitch F0 and voicing classification such as voiced/unvoiced (V/U)) from the pitch estimation module 330) can generate the final speech output.
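
As a hedged sketch of how the voiced and unvoiced maps might be combined into an extraction mask, with boolean time-frequency maps and a simple logical OR standing in for the grouping step (both assumptions, not the patent's procedure):

```python
import numpy as np

def extraction_mask(harmonic_map, unvoiced_map):
    """Combine a voiced (harmonic) map and an unvoiced-speech map, both
    boolean arrays of shape (freq_bins, frames), into a single speech mask."""
    return np.logical_or(harmonic_map, unvoiced_map)

def sparse_envelope_from_mask(spec, mask):
    """Keep only the time-frequency points selected by the mask; everything
    else is zeroed, yielding a sparse representation of the speech envelope."""
    return np.where(mask, np.abs(spec), 0.0)
```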


In some embodiments, the system of FIG. 3 is based on both human auditory perception and speech production principles. In certain embodiments, the analysis and processing are performed for envelope and excitation separately (but not necessarily independently). According to various embodiments, speech parameters (i.e., envelope and voicing in this instance) are extracted from the noisy observation and the estimates are used to generate clean speech via the synthesizer.


Noise Modeling

The noise model module 320 may identify and extract non-speech components from the audio input. This may be achieved by generating a multi-dimensional representation, such as a cortical representation, for example, where discrimination between speech and non-speech is possible. Some background on cortical representations is provided in M. Elhilali and S. A. Shamma, “A cocktail party with a cortical twist: How cortical mechanisms contribute to sound segregation,” J. Acoust. Soc. Am. 124(6): 3751-3771 (December 2008), the disclosure of which is incorporated herein by reference in its entirety.


In the example system 300, the multi-resolution analysis may be used for estimating the noise by noise model module 320. Voicing information such as pitch may be used in the estimation to discriminate between speech and noise components. For broadband stationary noise, a modulation-domain filter may be implemented for estimating and extracting the slowly-varying (low modulation) components characteristic of the noise but not of the target speech. In some embodiments, alternate noise modeling approaches such as minimum statistics may be used.
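
The modulation-domain filter itself is not detailed here; as an illustrative sketch of the alternate minimum-statistics idea mentioned above, a noise floor can be tracked as the minimum of the recursively smoothed sub-band power over a sliding window (the smoothing constant and window length are assumptions):

```python
import numpy as np

def minimum_statistics_noise(power_spec, alpha=0.9, win_frames=100):
    """Rough minimum-statistics noise estimate (illustrative only).
    power_spec: (freq_bins, frames) magnitude-squared spectrogram."""
    bins, frames = power_spec.shape
    smoothed = np.empty_like(power_spec)
    smoothed[:, 0] = power_spec[:, 0]
    for m in range(1, frames):
        # Recursive smoothing of the noisy power in each sub-band.
        smoothed[:, m] = alpha * smoothed[:, m - 1] + (1 - alpha) * power_spec[:, m]
    noise_est = np.empty_like(power_spec)
    for m in range(frames):
        lo = max(0, m - win_frames + 1)
        # Track the minimum of the smoothed power over a sliding window;
        # slowly varying noise dominates these minima, speech does not.
        noise_est[:, m] = smoothed[:, lo:m + 1].min(axis=1)
    return noise_est
```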


Pitch Analysis and Tracking

The pitch estimation module 330 can be implemented based on autocorrelogram features. Some background on autocorrelogram features is provided in Z. Jin and D. Wang, “HMM-Based Multipitch Tracking for Noisy and Reverberant Speech,” IEEE Transactions on Audio, Speech, and Language Processing, 19(5):1091-1102 (July 2011), the disclosure of which is incorporated herein by reference in its entirety. Multi-resolution analysis may be used to extract pitch information from both resolved harmonics (narrowband analysis) and unresolved harmonics (wideband analysis). The noise estimate can be incorporated to refine pitch cues by discarding unreliable sub-bands where the signal is dominated by noise. In some embodiments, a Bayesian filter or Bayesian tracker (for example, a hidden Markov model (HMM)) is then used to integrate per-frame pitch cues with temporal constraints in order to generate a continuous pitch track. The resulting pitch track may then be used for estimating a harmonic map that highlights time-frequency regions where harmonic energy is present. In some embodiments, suitable alternate pitch estimation and tracking methods, other than methods based on autocorrelogram features, are used.
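
As a simplified, hypothetical stand-in for the autocorrelogram analysis (not the HMM tracker itself), a per-frame pitch cue can be taken from the peak of the normalized frame autocorrelation within a plausible lag range, with the peak height serving as a saliency measure:

```python
import numpy as np

def frame_pitch_cue(frame, fs, f0_min=60.0, f0_max=400.0):
    """Return (f0_estimate, saliency) for one analysis frame.
    Simplified autocorrelation peak picking; the search range is illustrative."""
    frame = frame - np.mean(frame)
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    if ac[0] <= 0:
        return 0.0, 0.0
    ac = ac / ac[0]                      # normalize so saliency is in [0, 1]
    lag_min = int(fs / f0_max)
    lag_max = min(int(fs / f0_min), len(ac) - 1)
    if lag_max <= lag_min:
        return 0.0, 0.0
    lag = lag_min + int(np.argmax(ac[lag_min:lag_max]))
    saliency = float(ac[lag])            # how "pitched" the frame looks
    return fs / lag, saliency
```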


For synthesis, the pitch track may be interpolated for missing frames and smoothed to create a more natural speech contour. In some embodiments, a statistical pitch contour model is used for interpolation/extrapolation and smoothing. Voicing information may be derived from the saliency and confidence of the pitch estimates.
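
One way the interpolation and smoothing could be realized (a sketch only, not the statistical contour model) is to fill frames lacking a reliable estimate by interpolating between neighboring voiced frames and then applying a short moving average:

```python
import numpy as np

def interpolate_pitch_track(f0, voiced, smooth_frames=5):
    """f0: per-frame pitch values (Hz); voiced: boolean mask of frames with a
    reliable estimate. Missing frames are filled by interpolation and the
    result is lightly smoothed toward a more natural, continuous contour."""
    f0 = np.asarray(f0, dtype=float)
    voiced = np.asarray(voiced, dtype=bool)
    if not voiced.any():
        return np.zeros_like(f0)
    frames = np.arange(len(f0))
    # Interpolate over unvoiced gaps; edges are held at the nearest voiced value.
    filled = np.interp(frames, frames[voiced], f0[voiced])
    kernel = np.ones(smooth_frames) / smooth_frames
    return np.convolve(filled, kernel, mode="same")
```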


Sparse Envelope Extraction

Once the voiced speech and background noise regions are identified, an estimate of the unvoiced speech regions may be derived. In some embodiments, a feature region is declared unvoiced if the frame is not voiced (a determination that may be based, e.g., on pitch saliency, a measure of how pitched the frame is) and the signal does not conform to the noise model, e.g., the signal level (or energy) exceeds a noise threshold or the signal representation falls outside the noise model region in the feature space.
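
A hedged sketch of such a decision rule for a single frame, with the saliency threshold and SNR margin as illustrative parameters rather than values from the patent:

```python
import numpy as np

def is_unvoiced_speech(frame_energy, noise_energy, pitch_saliency,
                       saliency_thresh=0.4, snr_margin_db=6.0):
    """Declare a frame 'unvoiced speech' when it is not pitched enough to be
    voiced, yet its energy clearly exceeds the noise model. The thresholds
    are illustrative assumptions."""
    not_voiced = pitch_saliency < saliency_thresh
    snr_db = 10.0 * np.log10(frame_energy / (noise_energy + 1e-12) + 1e-12)
    above_noise = snr_db > snr_margin_db
    return bool(not_voiced and above_noise)
```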


The voicing information may be used to identify and select the harmonic spectral peaks corresponding to the pitch estimate. The spectral peaks found in this process may be stored for creating the sparse envelope.


For unvoiced frames, all spectral peaks may be identified and added to the sparse envelope signal. An example for a voiced frame is shown in FIG. 4. FIG. 5 is an exemplary time-frequency plot of the sparse envelope estimation for a voiced frame.
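
For a voiced frame, one simple way to collect the sparse envelope samples (a sketch assuming the frame's magnitude spectrum and pitch estimate are already available) is to keep the largest spectral peak in a small neighborhood of each harmonic of F0:

```python
import numpy as np

def sparse_envelope_voiced(mag_spec, fs, f0, search_semitones=1.0):
    """Pick one spectral peak per harmonic of f0 from a frame's magnitude
    spectrum (length n_fft // 2 + 1). Returns (bin_indices, magnitudes)."""
    if f0 <= 0:
        return np.array([], dtype=int), np.array([])
    n_bins = len(mag_spec)
    bin_hz = fs / (2.0 * (n_bins - 1))
    bins, mags = [], []
    k = 1
    while k * f0 < fs / 2:
        center = int(round(k * f0 / bin_hz))
        # Search a small neighborhood around the nominal harmonic bin.
        half = max(1, int(center * (2 ** (search_semitones / 12.0) - 1)))
        lo, hi = max(0, center - half), min(n_bins, center + half + 1)
        peak = lo + int(np.argmax(mag_spec[lo:hi]))
        bins.append(peak)
        mags.append(mag_spec[peak])
        k += 1
    return np.array(bins), np.array(mags)
```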


Spectral Envelope Modeling

The spectral envelope may be derived from the sparse envelope by interpolation. Many methods can be applied here, from simple two-dimensional mesh interpolation (e.g., image processing techniques) to more sophisticated data-driven methods that may yield more natural and undistorted speech.


In the example shown in FIG. 6, cubic interpolation in the logarithmic domain is applied on a per-frame basis to the sparse spectrum to obtain a smooth spectral envelope. Using this approach, the fine structure due to the excitation may be removed or minimized. Where noise exceeds the speech harmonics, the envelope may be assigned a weighted value based on some suppression law (e.g., Wiener filter) or based on a speech envelope model.
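
An illustrative per-frame realization with SciPy is shown below; the sparse bin indices and magnitudes would come from the sparse-envelope step, and the anchoring of the band edges and the floor value are assumptions:

```python
import numpy as np
from scipy.interpolate import interp1d

def smooth_envelope_from_sparse(sparse_bins, sparse_mags, n_bins, floor_db=-80.0):
    """Interpolate a smooth log-magnitude spectral envelope through one frame's
    sparse peak samples (cubic interpolation in the log domain).
    Assumes at least four sparse samples are available."""
    log_mags = 20.0 * np.log10(np.maximum(sparse_mags, 1e-9))
    # Anchor the band edges so the interpolant covers the full frequency range.
    x = np.concatenate(([0], sparse_bins, [n_bins - 1]))
    y = np.concatenate(([log_mags[0]], log_mags, [log_mags[-1]]))
    x, idx = np.unique(x, return_index=True)  # interp1d needs strictly increasing x
    f = interp1d(x, y[idx], kind="cubic", fill_value="extrapolate")
    env_db = np.maximum(f(np.arange(n_bins)), floor_db)
    return 10.0 ** (env_db / 20.0)            # back to linear magnitude
```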


Speech Synthesis


FIG. 7 is a block diagram of a speech synthesizer 700, according to an example embodiment. The example speech synthesizer 700 can include a Linear Predictive Coding (LPC) Modeling block 710, a Pulse block 720, a White Gaussian Noise (WGN) block 730, a Perturbation Modeling block 760, Perturbation filters 740 and 750, and a Synthesis filter 780.


Once the pitch track and the spectral envelope are computed, a clean speech utterance may be synthesized. With these parameters, a mixed-excitation synthesizer may be implemented as follows. The spectral envelope (ENV) may be modeled by a high-order Linear Predictive Coding (LPC) filter (e.g., 64th order) to preserve vocal tract detail but exclude other excitation-related artifacts (LPC Modeling block 710, FIG. 7). The excitation, driven by the voicing information (pitch F0 and voicing classification such as voiced/unvoiced (V/U) in the example in FIG. 7), may be modeled by the sum of a filtered pulse train (Pulse block 720, FIG. 7) driven by the pitch value in each frame and a filtered White Gaussian Noise source (WGN block 730, FIG. 7). As can be seen in the example embodiment in FIG. 7, the pitch F0 and the voicing classification may be input to the Pulse block 720, the WGN block 730, and the Perturbation Modeling block 760. Perturbation filters P(z) 750 and Q(z) 740 may be derived from the spectro-temporal energy profile of the envelope.
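
A hedged sketch of a mixed-excitation frame synthesizer along these lines, using the standard autocorrelation method to fit the LPC coefficients to the envelope; the fixed noise-mix constant stands in for the perturbation filters P(z) and Q(z), so this is an illustration rather than the patent's synthesizer:

```python
import numpy as np
from scipy.signal import lfilter
from scipy.linalg import solve_toeplitz

def lpc_from_envelope(env_mag, order=64):
    """Fit an all-pole (LPC) filter to one frame's magnitude envelope.
    env_mag: magnitude envelope over n_fft // 2 + 1 bins."""
    power = np.concatenate([env_mag, env_mag[-2:0:-1]]) ** 2   # full symmetric spectrum
    r = np.real(np.fft.ifft(power))[:order + 1]                # autocorrelation sequence
    r[0] *= 1.0 + 1e-9                                         # tiny regularization
    a = solve_toeplitz((r[:order], r[:order]), -r[1:order + 1])
    a = np.concatenate(([1.0], a))                             # A(z) coefficients
    gain = np.sqrt(max(r[0] + np.dot(a[1:], r[1:order + 1]), 1e-12))
    return a, gain

def synthesize_frame(a, gain, f0, voiced, frame_len, fs, noise_mix=0.1):
    """Mixed excitation: pulse train at f0 plus white Gaussian noise, shaped
    by the all-pole envelope filter. noise_mix is an illustrative constant."""
    excitation = np.random.randn(frame_len)
    if voiced and f0 > 0:
        pulses = np.zeros(frame_len)
        period = int(fs / f0)
        pulses[::period] = 1.0
        excitation = (1.0 - noise_mix) * pulses + noise_mix * excitation
    return lfilter([gain], a, excitation)
```

In a full system the per-frame outputs would be overlap-added, and the fixed noise-mix constant would be replaced by the frequency- and energy-dependent perturbation described below.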


In contrast to other known methods, the perturbation of the periodic pulse train can be controlled based only on the relative local and global energy of the spectral envelope, and not on an excitation analysis, according to various embodiments. The filter P(z) 750 may add spectral shaping to the noise component in the excitation, and the filter Q(z) 740 may be used to modify the phase of the pulse train to increase dispersion and naturalness.


To derive the perturbation filters P(z) 750 and Q(z) 740, the dynamic range within each frame may be computed, and a frequency-dependent weight may be applied based on the level of each spectral value relative to the minimum and maximum energy in the frame. Then, a global weight may be applied based on the level of the frame relative to the maximum and minimum global energies tracked over time. The rationale behind this approach is that during onsets and offsets (low relative global energy) the glottis area is reduced, giving rise to higher Reynolds numbers (increased probability of turbulence). During the steady state, local frequency perturbations can be observed at lower energies where turbulent energy dominates.
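
A minimal sketch of such an energy-based perturbation weight, assuming a per-frame log-magnitude envelope and externally tracked global energy extremes (the linear mapping is an assumption):

```python
import numpy as np

def perturbation_weights(env_db, frame_energy_db, global_min_db, global_max_db):
    """Frequency-dependent perturbation weight in [0, 1] per spectral bin.
    Bins (and frames) at low relative energy get more aperiodic excitation."""
    # Local weight: position of each bin between the frame's min and max level.
    local = (env_db - env_db.min()) / max(env_db.max() - env_db.min(), 1e-6)
    # Global weight: position of the frame between tracked global extremes.
    g = (frame_energy_db - global_min_db) / max(global_max_db - global_min_db, 1e-6)
    g = np.clip(g, 0.0, 1.0)
    # Low local or global energy -> larger perturbation (more noise-like).
    return np.clip(1.0 - local * g, 0.0, 1.0)
```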


It should be noted that the perturbation may be computed from the spectral envelope in voiced frames, but, in practice, for some embodiments, the perturbation is assigned a maximum value during unvoiced regions. An example of the synthesis parameters for a clean female speech sample is shown in FIG. 8A (also shown in more detail in FIG. 8B). The perturbation function is shown in the dB domain as an aperiodicity function.


An example of the performance of the system 300 is illustrated in FIG. 9, where a noisy speech input is processed by the system 300, thereby producing a synthetic noise-free output.



FIG. 10 is a flow chart of an example method 1000 for generating clean speech from a mixture of noise and speech. The method 1000 may be performed by processing logic that may include hardware (e.g., dedicated logic, programmable logic, and microcode), software (such as software run on a general-purpose computer system or a dedicated machine), or a combination of both. In one example embodiment, the processing logic resides at the audio processing system 140.


At operation 1010, the example method 1000 can include deriving, based on the mixture of noise and speech and a model of speech, speech parameters. The speech parameters may include the spectral envelope and voicing information. The voicing information may include pitch data and voice classification. At operation 1020, the method 1000 can proceed with synthesizing clean speech from the speech parameters.



FIG. 11 illustrates an exemplary computer system 1100 that may be used to implement some embodiments of the present invention. The computer system 1100 of FIG. 11 may be implemented in the contexts of the likes of computing systems, networks, servers, or combinations thereof. The computer system 1100 of FIG. 11 includes one or more processor units 1110 and main memory 1120. Main memory 1120 stores, in part, instructions and data for execution by processor units 1110. Main memory 1120 stores the executable code when in operation, in this example. The computer system 1100 of FIG. 11 further includes a mass data storage 1130, portable storage device 1140, output devices 1150, user input devices 1160, a graphics display system 1170, and peripheral devices 1180.


The components shown in FIG. 11 are depicted as being connected via a single bus 1190. The components may be connected through one or more data transport means. Processor unit 1110 and main memory 1120 are connected via a local microprocessor bus, and the mass data storage 1130, peripheral device(s) 1180, portable storage device 1140, and graphics display system 1170 are connected via one or more input/output (I/O) buses.


Mass data storage 1130, which can be implemented with a magnetic disk drive, solid state drive, or an optical disk drive, is a non-volatile storage device for storing data and instructions for use by processor unit 1110. Mass data storage 1130 stores the system software for implementing embodiments of the present disclosure for purposes of loading that software into main memory 1120.


Portable storage device 1140 operates in conjunction with a portable non-volatile storage medium, such as a flash drive, floppy disk, compact disk, digital video disc, or Universal Serial Bus (USB) storage device, to input and output data and code to and from the computer system 1100 of FIG. 11. The system software for implementing embodiments of the present disclosure is stored on such a portable medium and input to the computer system 1100 via the portable storage device 1140.


User input devices 1160 can provide a portion of a user interface. User input devices 1160 may include one or more microphones, an alphanumeric keypad, such as a keyboard, for inputting alphanumeric and other information, or a pointing device, such as a mouse, a trackball, stylus, or cursor direction keys. User input devices 1160 can also include a touchscreen. Additionally, the computer system 1100 as shown in FIG. 11 includes output devices 1150. Suitable output devices 1150 include speakers, printers, network interfaces, and monitors.


Graphics display system 1170 includes a liquid crystal display (LCD) or other suitable display device. Graphics display system 1170 is configurable to receive textual and graphical information and to process the information for output to the display device.


Peripheral devices 1180 may include any type of computer support device to add additional functionality to the computer system.


The components provided in the computer system 1100 of FIG. 11 are those typically found in computer systems that may be suitable for use with embodiments of the present disclosure and are intended to represent a broad category of such computer components that are well known in the art. Thus, the computer system 1100 of FIG. 11 can be a personal computer (PC), handheld computer system, telephone, mobile computer system, workstation, tablet, phablet, mobile phone, server, minicomputer, mainframe computer, wearable, internet-connected device, or any other computer system. The computer may also include different bus configurations, networked platforms, multi-processor platforms, and the like. Various operating systems may be used, including UNIX, LINUX, WINDOWS, MAC OS, PALM OS, QNX, ANDROID, IOS, CHROME, TIZEN, and other suitable operating systems.


The processing for various embodiments may be implemented in software that is cloud-based. In some embodiments, the computer system 1100 is implemented as a cloud-based computing environment, such as a virtual machine operating within a computing cloud. In other embodiments, the computer system 1100 may itself include a cloud-based computing environment, where the functionalities of the computer system 1100 are executed in a distributed fashion. Thus, the computer system 1100, when configured as a computing cloud, may include pluralities of computing devices in various forms, as will be described in greater detail below.


In general, a cloud-based computing environment is a resource that typically combines the computational power of a large grouping of processors (such as within web servers) and/or that combines the storage capacity of a large grouping of computer memories or storage devices. Systems that provide cloud-based resources may be utilized exclusively by their owners, or such systems may be accessible to outside users who deploy applications within the computing infrastructure to obtain the benefit of large computational or storage resources.


The cloud may be formed, for example, by a network of web servers that comprise a plurality of computing devices, such as the computer system 1100, with each server (or at least a plurality thereof) providing processor and/or storage resources. These servers may manage workloads provided by multiple users (e.g., cloud resource customers or other users). Typically, each user places workload demands upon the cloud that vary in real-time, sometimes dramatically. The nature and extent of these variations typically depends on the type of business associated with the user.


The present technology is described above with reference to example embodiments. Therefore, other variations upon the example embodiments are intended to be covered by the present disclosure.

Claims
  • 1. A method for generating clean speech from a mixture of noise and speech, the method comprising: deriving speech parameters, based on the mixture of noise and speech and a model of speech, the deriving using at least one hardware processor, wherein the deriving speech parameters comprises: performing one or more spectral analyses on the mixture of noise and speech to generate one or more spectral representations; deriving, based on the one or more spectral representations, feature data; grouping target speech features in the feature data according to the model of speech; separating the target speech features from the feature data; and generating, based at least partially on the target speech features, the speech parameters; and synthesizing, based at least partially on the speech parameters, clean speech.
  • 2. The method of claim 1, wherein candidates for the target speech features are evaluated by a multi-hypothesis tracking system aided by the model of speech.
  • 3. The method of claim 1, wherein the speech parameters include spectral envelope and voicing information, the voicing information including pitch data and voice classification data.
  • 4. The method of claim 3, further comprising, prior to grouping the feature data, determining, based on a noise model, non-speech components in the feature data.
  • 5. The method of claim 4, wherein the pitch data are determined based, at least partially, on the non-speech components.
  • 6. The method of claim 4, wherein the pitch data are determined based, at least on, knowledge about where noise components occlude speech components.
  • 7. The method of claim 5, further comprising, while generating the speech parameters: generating, based on the pitch data, a harmonic map, the harmonic map representing voiced speech; and estimating, based on the non-speech components and the harmonic map, an unvoiced speech map.
  • 8. The method of claim 7, further comprising extracting a sparse spectral envelope from the one or more spectral representations using a mask, the mask being generated based on a harmonic map and an unvoiced speech map.
  • 9. The method of claim 8, further comprising estimating the spectral envelope based on a sparse spectral envelope.
  • 10. The method of claim 3, wherein the pitch data are interpolated to fill missing frames before synthesizing clean speech.
  • 11. A system for generating clean speech from a mixture of noise and speech, the system comprising: one or more processors; and a memory communicatively coupled with the processor, the memory storing instructions which if executed by the one or more processors perform a method comprising: deriving speech parameters, based on the mixture of noise and speech and a model of speech, wherein the deriving speech parameters comprises: performing one or more spectral analyses on the mixture of noise and speech to generate one or more spectral representations; deriving, based on the one or more spectral representations, feature data; grouping target speech features in the feature data according to the model of speech; separating the target speech features from the feature data; and generating, based at least partially on the target speech features, the speech parameters; and synthesizing, based at least partially on the speech parameters, clean speech.
  • 12. The system of claim 11, wherein candidates for the target speech features are evaluated by a multi-hypothesis tracking system aided by the model of speech.
  • 13. The system of claim 11, wherein the speech parameters include a spectral envelope and voicing information, the voicing information including pitch data and voice classification data.
  • 14. The system of claim 13, further comprising, prior to grouping the feature data, determining, based on a noise model, non-speech components in the feature data.
  • 15. The system of claim 14, wherein the pitch data are determined based partially on the non-speech components.
  • 16. The system of claim 14, wherein the pitch data are determined based, at least on, knowledge about where noise components occlude speech components.
  • 17. The system of claim 15, further comprising, while generating the speech parameters: generating, based on the pitch data, a harmonic map, the harmonic map representing voiced speech; and estimating, based on the non-speech components and the harmonic map, an unvoiced speech map.
  • 18. The system of claim 15, further comprising extracting a sparse spectral envelope from the one or more spectral representations using a mask, the mask being generated based on a harmonic map and an unvoiced speech map.
  • 19. The system of claim 18, further comprising estimating the spectral envelope based on the sparse spectral envelope.
  • 20. A non-transitory computer-readable storage medium having embodied thereon a program, the program being executable by a processor to perform a method for generating clean speech from a mixture of noise and speech, the method comprising: deriving speech parameters, based on the mixture of noise and speech and a model of speech, via instructions stored in the memory and executed by the one or more processors, wherein the deriving speech parameters comprises: performing one or more spectral analyses on the mixture of noise and speech to generate one or more spectral representations; deriving, based on the one or more spectral representations, feature data; grouping target speech features in the feature data according to the model of speech; separating the target speech features from the feature data; and generating, based at least partially on the target speech features, the speech parameters; and synthesizing, based at least partially on the speech parameters, via instructions stored in the memory and executed by the one or more processors, clean speech.
CROSS-REFERENCE TO RELATED APPLICATIONS

The present application claims the benefit of U.S. Provisional Application No. 61/856,577, filed on Jul. 19, 2013 and entitled “System and Method for Speech Signal Separation and Synthesis Based on Auditory Scene Analysis and Speech Modeling”, and U.S. Provisional Application No. 61/972,112, filed Mar. 28, 2014 and entitled “Tracking Multiple Attributes of Simultaneous Objects”. The subject matter of the aforementioned applications is incorporated herein by reference for all purposes.

US Referenced Citations (544)
Number Name Date Kind
3976863 Engel Aug 1976 A
3978287 Fletcher et al. Aug 1976 A
4137510 Iwahara Jan 1979 A
4433604 Ott Feb 1984 A
4516259 Yato et al. May 1985 A
4535473 Sakata Aug 1985 A
4536844 Lyon Aug 1985 A
4581758 Coker et al. Apr 1986 A
4628529 Borth et al. Dec 1986 A
4630304 Borth et al. Dec 1986 A
4649505 Zinser, Jr. et al. Mar 1987 A
4658426 Chabries et al. Apr 1987 A
4674125 Carlson et al. Jun 1987 A
4718104 Anderson Jan 1988 A
4811404 Vilmur et al. Mar 1989 A
4812996 Stubbs Mar 1989 A
4864620 Bialick Sep 1989 A
4920508 Yassaie et al. Apr 1990 A
4969203 Herman Nov 1990 A
4991166 Julstrom Feb 1991 A
5027410 Williamson et al. Jun 1991 A
5054085 Meisel et al. Oct 1991 A
5058419 Nordstrom et al. Oct 1991 A
5099738 Hotz Mar 1992 A
5119711 Bell et al. Jun 1992 A
5142961 Paroutaud Sep 1992 A
5150413 Nakatani et al. Sep 1992 A
5175769 Hejna, Jr. et al. Dec 1992 A
5177482 Cideciyan et al. Jan 1993 A
5187776 Yanker Feb 1993 A
5204906 Nohara et al. Apr 1993 A
5208864 Kaneda May 1993 A
5210366 Sykes, Jr. May 1993 A
5216423 Mukherjee Jun 1993 A
5222251 Roney, IV et al. Jun 1993 A
5224170 Waite, Jr. Jun 1993 A
5230022 Sakata Jul 1993 A
5319736 Hunt Jun 1994 A
5323459 Hirano Jun 1994 A
5341432 Suzuki et al. Aug 1994 A
5381473 Andrea et al. Jan 1995 A
5381512 Holton et al. Jan 1995 A
5400409 Linhard Mar 1995 A
5402493 Goldstein Mar 1995 A
5402496 Soli et al. Mar 1995 A
5406635 Jarvinen Apr 1995 A
5416847 Boze May 1995 A
5440751 Santeler et al. Aug 1995 A
5471195 Rickman Nov 1995 A
5473759 Slaney et al. Dec 1995 A
5479564 Vogten et al. Dec 1995 A
5502663 Lyon Mar 1996 A
5544250 Urbanski Aug 1996 A
5544346 Amini et al. Aug 1996 A
5550924 Helf et al. Aug 1996 A
5555306 Gerzon Sep 1996 A
5574824 Slyh et al. Nov 1996 A
5583784 Kapust et al. Dec 1996 A
5590241 Park et al. Dec 1996 A
5598505 Austin et al. Jan 1997 A
5602962 Kellermann Feb 1997 A
5633631 Teckman May 1997 A
5675778 Jones Oct 1997 A
5682463 Allen et al. Oct 1997 A
5694474 Ngo et al. Dec 1997 A
5706395 Arslan et al. Jan 1998 A
5717829 Takagi Feb 1998 A
5729612 Abel et al. Mar 1998 A
5732189 Johnston et al. Mar 1998 A
5749064 Pawate et al. May 1998 A
5757937 Itoh et al. May 1998 A
5777658 Kerr et al. Jul 1998 A
5792971 Timis et al. Aug 1998 A
5796819 Romesburg Aug 1998 A
5796850 Shiono et al. Aug 1998 A
5806025 Vis et al. Sep 1998 A
5809463 Gupta et al. Sep 1998 A
5839101 Vahatalo et al. Nov 1998 A
5845243 Smart et al. Dec 1998 A
5887032 Cioffi Mar 1999 A
5920840 Satyamurti et al. Jul 1999 A
5933495 Oh Aug 1999 A
5937070 Todter et al. Aug 1999 A
5943429 Handel Aug 1999 A
5956674 Smyth et al. Sep 1999 A
5974379 Hatanaka et al. Oct 1999 A
5974380 Smyth et al. Oct 1999 A
5978567 Rebane et al. Nov 1999 A
5978824 Ikeda Nov 1999 A
5983139 Zierhofer Nov 1999 A
5990405 Auten et al. Nov 1999 A
6002776 Bhadkamkar et al. Dec 1999 A
6061456 Andrea et al. May 2000 A
6072881 Linder Jun 2000 A
6092126 Rossum Jul 2000 A
6097820 Turner Aug 2000 A
6098038 Hermansky et al. Aug 2000 A
6104993 Ashley Aug 2000 A
6108626 Cellario et al. Aug 2000 A
6122384 Mauro Sep 2000 A
6122610 Isabelle Sep 2000 A
6125175 Goldberg et al. Sep 2000 A
6134524 Peters et al. Oct 2000 A
6137349 Menkhoff et al. Oct 2000 A
6140809 Doi Oct 2000 A
6173255 Wilson et al. Jan 2001 B1
6188769 Jot et al. Feb 2001 B1
6188797 Moledina et al. Feb 2001 B1
6202047 Ephraim et al. Mar 2001 B1
6205421 Morii Mar 2001 B1
6205422 Gu et al. Mar 2001 B1
6208671 Paulos et al. Mar 2001 B1
6216103 Wu et al. Apr 2001 B1
6222927 Feng et al. Apr 2001 B1
6223090 Brungart Apr 2001 B1
6226616 You et al. May 2001 B1
6240386 Thyssen et al. May 2001 B1
6263307 Arslan et al. Jul 2001 B1
6266633 Higgins et al. Jul 2001 B1
6317501 Matsuo Nov 2001 B1
6321193 Nystrom et al. Nov 2001 B1
6324235 Savell et al. Nov 2001 B1
6339706 Tillgren et al. Jan 2002 B1
6339758 Kanazawa et al. Jan 2002 B1
6355869 Mitton Mar 2002 B1
6363345 Marash et al. Mar 2002 B1
6377637 Berdugo Apr 2002 B1
6381570 Li et al. Apr 2002 B2
6421388 Parizhsky et al. Jul 2002 B1
6424938 Johansson et al. Jul 2002 B1
6430295 Handel et al. Aug 2002 B1
6434417 Lovett Aug 2002 B1
6449586 Hoshuyama Sep 2002 B1
6453289 Ertem et al. Sep 2002 B1
6456209 Savari Sep 2002 B1
6469732 Chang et al. Oct 2002 B1
6477489 Lockwood Nov 2002 B1
6487257 Gustafsson et al. Nov 2002 B1
6490556 Graumann et al. Dec 2002 B1
6496795 Malvar Dec 2002 B1
6513004 Rigazio et al. Jan 2003 B1
6516066 Hayashi Feb 2003 B2
6516136 Lee Feb 2003 B1
6526140 Marchok et al. Feb 2003 B1
6529606 Jackson, Jr. II et al. Mar 2003 B1
6531970 McLaughlin et al. Mar 2003 B2
6549630 Bobisuthi Apr 2003 B1
6584203 Elko et al. Jun 2003 B2
6584438 Manjunath et al. Jun 2003 B1
6647067 Hjelm et al. Nov 2003 B1
6683938 Henderson Jan 2004 B1
6717991 Gustafsson et al. Apr 2004 B1
6718309 Selly Apr 2004 B1
6738482 Jaber May 2004 B1
6745155 Andringa et al. Jun 2004 B1
6760450 Matsuo Jul 2004 B2
6772117 Laurila et al. Aug 2004 B1
6785381 Gartner et al. Aug 2004 B2
6792118 Watts Sep 2004 B2
6795558 Matsuo Sep 2004 B2
6798886 Smith et al. Sep 2004 B1
6804203 Benyassine et al. Oct 2004 B1
6804651 Juric et al. Oct 2004 B2
6810273 Mattila et al. Oct 2004 B1
6859508 Koyama et al. Feb 2005 B1
6862567 Gao Mar 2005 B1
6882736 Dickel et al. Apr 2005 B2
6907045 Robinson et al. Jun 2005 B1
6915257 Heikkinen et al. Jul 2005 B2
6915264 Baumgarte Jul 2005 B2
6917688 Yu et al. Jul 2005 B2
6934387 Kim Aug 2005 B1
6978159 Feng et al. Dec 2005 B2
6982377 Sakurai et al. Jan 2006 B2
6990196 Zeng et al. Jan 2006 B2
7016507 Brennan Mar 2006 B1
7020605 Gao Mar 2006 B2
7031478 Belt et al. Apr 2006 B2
7042934 Zamir May 2006 B2
7050388 Kim et al. May 2006 B2
7054452 Ukita May 2006 B2
7054809 Gao May 2006 B1
7058574 Taniguchi et al. Jun 2006 B2
7065485 Chong-White et al. Jun 2006 B1
7076315 Watts Jul 2006 B1
7092529 Yu et al. Aug 2006 B2
7092882 Arrowood et al. Aug 2006 B2
7099821 Visser et al. Aug 2006 B2
7127072 Rademacher et al. Oct 2006 B2
7142677 Gonopolskiy et al. Nov 2006 B2
7146013 Saito et al. Dec 2006 B1
7146316 Alves Dec 2006 B2
7155019 Hou Dec 2006 B2
7165026 Acero et al. Jan 2007 B2
7171008 Elko Jan 2007 B2
7171246 Mattila et al. Jan 2007 B2
7174022 Zhang et al. Feb 2007 B1
7190665 Warke et al. Mar 2007 B2
7206418 Yang et al. Apr 2007 B2
7209567 Kozel et al. Apr 2007 B1
7225001 Eriksson et al. May 2007 B1
7242762 He et al. Jul 2007 B2
7246058 Burnett Jul 2007 B2
7254242 Ise et al. Aug 2007 B2
7283956 Ashley et al. Oct 2007 B2
7289554 Alloin Oct 2007 B2
7289955 Deng et al. Oct 2007 B2
7327985 Morfitt, III et al. Feb 2008 B2
7330138 Mallinson et al. Feb 2008 B2
7339503 Elenes Mar 2008 B1
7359520 Brennan et al. Apr 2008 B2
7366658 Moogi et al. Apr 2008 B2
7376558 Gemello et al. May 2008 B2
7383179 Alves et al. Jun 2008 B2
7395298 Debes et al. Jul 2008 B2
7412379 Taori et al. Aug 2008 B2
7433907 Nagai et al. Oct 2008 B2
7436333 Forman et al. Oct 2008 B2
7472059 Huang Dec 2008 B2
7548791 Johnston Jun 2009 B1
7555434 Nomura et al. Jun 2009 B2
7561627 Chow et al. Jul 2009 B2
7577084 Tang et al. Aug 2009 B2
7590250 Ellis et al. Sep 2009 B2
7617099 Yang et al. Nov 2009 B2
7657038 Doclo et al. Feb 2010 B2
7657427 Jelinek Feb 2010 B2
7725314 Wu et al. May 2010 B2
7764752 Langberg et al. Jul 2010 B2
7777658 Nguyen et al. Aug 2010 B2
7783032 Abutalebi et al. Aug 2010 B2
7783481 Endo et al. Aug 2010 B2
7895036 Hetherington et al. Feb 2011 B2
7899565 Johnston Mar 2011 B1
7912567 Chhatwal et al. Mar 2011 B2
7949522 Hetherington et al. May 2011 B2
7953596 Pinto May 2011 B2
8010355 Rahbar Aug 2011 B2
8032364 Watts Oct 2011 B1
8032369 Manjunath et al. Oct 2011 B2
8036767 Soulodre Oct 2011 B2
8046219 Zurek et al. Oct 2011 B2
8060363 Ramo et al. Nov 2011 B2
8081878 Zhang et al. Dec 2011 B1
8098812 Fadili et al. Jan 2012 B2
8098844 Elko Jan 2012 B2
8103011 Mohammad et al. Jan 2012 B2
8126159 Goose et al. Feb 2012 B2
8143620 Malinowski et al. Mar 2012 B1
8150065 Solbach et al. Apr 2012 B2
8180064 Avendano et al. May 2012 B1
8184818 Ishiguro May 2012 B2
8194880 Avendano Jun 2012 B2
8194882 Every et al. Jun 2012 B2
8195454 Muesch Jun 2012 B2
8204252 Avendano Jun 2012 B1
8204253 Solbach Jun 2012 B1
8233352 Beaucoup Jul 2012 B2
8280731 Yu Oct 2012 B2
8311817 Murgia et al. Nov 2012 B2
8345890 Avendano et al. Jan 2013 B2
8378871 Bapat Feb 2013 B1
8473287 Every et al. Jun 2013 B2
8488805 Santos et al. Jul 2013 B1
8494193 Zhang et al. Jul 2013 B2
8521530 Every et al. Aug 2013 B1
8615394 Avendano et al. Dec 2013 B1
8737188 Murgia et al. May 2014 B1
8737532 Green et al. May 2014 B2
8744844 Klein Jun 2014 B2
8774423 Solbach Jul 2014 B1
8804865 Elenes et al. Aug 2014 B2
8831937 Murgia et al. Sep 2014 B2
8867759 Avendano et al. Oct 2014 B2
8880396 Laroche et al. Nov 2014 B1
8886525 Klein Nov 2014 B2
8908882 Goodwin et al. Dec 2014 B2
8934641 Avendano et al. Jan 2015 B2
8949120 Every et al. Feb 2015 B1
8965942 Rossum et al. Feb 2015 B1
8989401 Ojanpera Mar 2015 B2
9049282 Murgia et al. Jun 2015 B1
9076456 Avendano et al. Jul 2015 B1
9094496 Teutsch Jul 2015 B2
9185487 Solbach et al. Nov 2015 B2
9197974 Clark et al. Nov 2015 B1
9210503 Avendano et al. Dec 2015 B2
9236874 Rossum Jan 2016 B1
9247192 Lee et al. Jan 2016 B2
20010016020 Gustafsson et al. Aug 2001 A1
20010031053 Feng et al. Oct 2001 A1
20010041976 Taniguchi et al. Nov 2001 A1
20010053228 Jones Dec 2001 A1
20020002455 Accardi et al. Jan 2002 A1
20020009203 Erten Jan 2002 A1
20020041693 Matsuo Apr 2002 A1
20020080980 Matsuo Jun 2002 A1
20020097884 Cairns Jul 2002 A1
20020106092 Matsuo Aug 2002 A1
20020116187 Erten Aug 2002 A1
20020133334 Coorman et al. Sep 2002 A1
20020147595 Baumgarte Oct 2002 A1
20020156624 Gigi Oct 2002 A1
20020176589 Buck et al. Nov 2002 A1
20030014248 Vetter Jan 2003 A1
20030023430 Wang et al. Jan 2003 A1
20030026437 Janse et al. Feb 2003 A1
20030033140 Taori et al. Feb 2003 A1
20030038736 Becker et al. Feb 2003 A1
20030039369 Bullen Feb 2003 A1
20030040908 Yang et al. Feb 2003 A1
20030061032 Gonopolskiy Mar 2003 A1
20030063759 Brennan et al. Apr 2003 A1
20030072382 Raleigh et al. Apr 2003 A1
20030072460 Gonopolskiy et al. Apr 2003 A1
20030095667 Watts May 2003 A1
20030099345 Gartner et al. May 2003 A1
20030101048 Liu May 2003 A1
20030103632 Goubran et al. Jun 2003 A1
20030128851 Furuta Jul 2003 A1
20030138116 Jones et al. Jul 2003 A1
20030147538 Elko Aug 2003 A1
20030169891 Ryan et al. Sep 2003 A1
20030191641 Acero et al. Oct 2003 A1
20030228019 Eichler et al. Dec 2003 A1
20030228023 Burnett et al. Dec 2003 A1
20040001450 He et al. Jan 2004 A1
20040013276 Ellis et al. Jan 2004 A1
20040015348 McArthur et al. Jan 2004 A1
20040042616 Matsuo Mar 2004 A1
20040047464 Yu et al. Mar 2004 A1
20040066940 Amir Apr 2004 A1
20040078199 Kremer et al. Apr 2004 A1
20040083110 Wang Apr 2004 A1
20040125965 Alberth, Jr. et al. Jul 2004 A1
20040131178 Shahaf et al. Jul 2004 A1
20040133421 Burnett et al. Jul 2004 A1
20040165736 Hetherington et al. Aug 2004 A1
20040185804 Kanamori et al. Sep 2004 A1
20040196989 Friedman et al. Oct 2004 A1
20040263636 Cutler et al. Dec 2004 A1
20050008169 Muren et al. Jan 2005 A1
20050008179 Quinn Jan 2005 A1
20050025263 Wu Feb 2005 A1
20050027520 Mattila et al. Feb 2005 A1
20050043959 Stemerdink et al. Feb 2005 A1
20050049864 Kaltenmeier et al. Mar 2005 A1
20050060142 Visser et al. Mar 2005 A1
20050066279 LeBarton et al. Mar 2005 A1
20050080616 Leung et al. Apr 2005 A1
20050096904 Taniguchi et al. May 2005 A1
20050114128 Hetherington et al. May 2005 A1
20050143989 Jelinek Jun 2005 A1
20050152559 Gierl et al. Jul 2005 A1
20050152563 Amada et al. Jul 2005 A1
20050185813 Sinclair et al. Aug 2005 A1
20050203735 Ichikawa Sep 2005 A1
20050213778 Buck et al. Sep 2005 A1
20050216259 Watts Sep 2005 A1
20050228518 Watts Oct 2005 A1
20050249292 Zhu Nov 2005 A1
20050261894 Balan et al. Nov 2005 A1
20050261896 Schuijers et al. Nov 2005 A1
20050276363 Joublin et al. Dec 2005 A1
20050276423 Aubauer et al. Dec 2005 A1
20050281410 Grosvenor et al. Dec 2005 A1
20050283544 Yee Dec 2005 A1
20050288923 Kok Dec 2005 A1
20060072768 Schwartz et al. Apr 2006 A1
20060074646 Alves et al. Apr 2006 A1
20060098809 Nongpiur et al. May 2006 A1
20060100868 Hetherington et al. May 2006 A1
20060120537 Burnett et al. Jun 2006 A1
20060133621 Chen et al. Jun 2006 A1
20060136203 Ichikawa Jun 2006 A1
20060149535 Choi et al. Jul 2006 A1
20060153391 Hooley et al. Jul 2006 A1
20060160581 Beaugeant et al. Jul 2006 A1
20060184363 McCree et al. Aug 2006 A1
20060198542 Benjelloun Touimi et al. Sep 2006 A1
20060222184 Buck et al. Oct 2006 A1
20060242071 Stebbings Oct 2006 A1
20060270468 Hui et al. Nov 2006 A1
20060293882 Giesbrecht et al. Dec 2006 A1
20070021958 Visser et al. Jan 2007 A1
20070025562 Zalewski et al. Feb 2007 A1
20070027685 Arakawa et al. Feb 2007 A1
20070033020 (Kelleher) Francois et al. Feb 2007 A1
20070033494 Wenger et al. Feb 2007 A1
20070038440 Sung et al. Feb 2007 A1
20070058822 Ozawa Mar 2007 A1
20070067166 Pan et al. Mar 2007 A1
20070071206 Gainsboro et al. Mar 2007 A1
20070078649 Hetherington et al. Apr 2007 A1
20070088544 Acero et al. Apr 2007 A1
20070094031 Chen Apr 2007 A1
20070100612 Ekstrand et al. May 2007 A1
20070110263 Brox May 2007 A1
20070116300 Chen May 2007 A1
20070136056 Moogi et al. Jun 2007 A1
20070136059 Gadbois Jun 2007 A1
20070150268 Acero et al. Jun 2007 A1
20070154031 Avendano et al. Jul 2007 A1
20070165879 Deng et al. Jul 2007 A1
20070195968 Jaber Aug 2007 A1
20070198254 Goto et al. Aug 2007 A1
20070230712 Belt et al. Oct 2007 A1
20070230913 Ichimura Oct 2007 A1
20070237271 Pessoa et al. Oct 2007 A1
20070244695 Manjunath et al. Oct 2007 A1
20070253574 Soulodre Nov 2007 A1
20070276656 Solbach et al. Nov 2007 A1
20070282604 Gartner et al. Dec 2007 A1
20070287490 Green et al. Dec 2007 A1
20070294263 Punj et al. Dec 2007 A1
20080019548 Avendano Jan 2008 A1
20080033723 Jang et al. Feb 2008 A1
20080059163 Ding et al. Mar 2008 A1
20080069366 Soulodre Mar 2008 A1
20080071540 Nakano et al. Mar 2008 A1
20080111734 Fam et al. May 2008 A1
20080117901 Klammer May 2008 A1
20080118082 Seltzer et al. May 2008 A1
20080140391 Yen et al. Jun 2008 A1
20080140396 Grosse-Schulte et al. Jun 2008 A1
20080152157 Lin et al. Jun 2008 A1
20080170703 Zivney Jul 2008 A1
20080192956 Kazama Aug 2008 A1
20080195384 Jabri et al. Aug 2008 A1
20080201138 Visser et al. Aug 2008 A1
20080208575 Laaksonen et al. Aug 2008 A1
20080212795 Goodwin et al. Sep 2008 A1
20080228478 Hetherington et al. Sep 2008 A1
20080247567 Kjolerbakken et al. Oct 2008 A1
20080260175 Elko Oct 2008 A1
20080273476 Cohen et al. Nov 2008 A1
20080310646 Amada Dec 2008 A1
20080317261 Yoshida et al. Dec 2008 A1
20090012783 Klein Jan 2009 A1
20090012784 Murgia et al. Jan 2009 A1
20090012786 Zhang et al. Jan 2009 A1
20090018828 Nakadai et al. Jan 2009 A1
20090048824 Amada Feb 2009 A1
20090060222 Jeong et al. Mar 2009 A1
20090063142 Sukkar Mar 2009 A1
20090070118 Den Brinker et al. Mar 2009 A1
20090086986 Schmidt et al. Apr 2009 A1
20090106021 Zurek et al. Apr 2009 A1
20090112579 Li et al. Apr 2009 A1
20090116652 Kirkeby et al. May 2009 A1
20090119096 Gerl et al. May 2009 A1
20090119099 Lee et al. May 2009 A1
20090129610 Kim et al. May 2009 A1
20090144053 Tamura Jun 2009 A1
20090144058 Sorin Jun 2009 A1
20090154717 Hoshuyama Jun 2009 A1
20090177464 Gao et al. Jul 2009 A1
20090192790 El-Maleh et al. Jul 2009 A1
20090204413 Sintes et al. Aug 2009 A1
20090216526 Schmidt et al. Aug 2009 A1
20090220107 Every et al. Sep 2009 A1
20090226005 Acero et al. Sep 2009 A1
20090226010 Schnell et al. Sep 2009 A1
20090228272 Herbig Sep 2009 A1
20090245335 Fang Oct 2009 A1
20090245444 Fang Oct 2009 A1
20090253418 Makinen Oct 2009 A1
20090257609 Gerkmann et al. Oct 2009 A1
20090262969 Short et al. Oct 2009 A1
20090271187 Yen et al. Oct 2009 A1
20090287481 Paranjpe et al. Nov 2009 A1
20090292536 Hetherington et al. Nov 2009 A1
20090303350 Terada Dec 2009 A1
20090323982 Solbach et al. Dec 2009 A1
20100004929 Baik Jan 2010 A1
20100027799 Romesburg et al. Feb 2010 A1
20100033427 Marks et al. Feb 2010 A1
20100094643 Avendano et al. Apr 2010 A1
20100138220 Matsumoto et al. Jun 2010 A1
20100166199 Seydoux Jul 2010 A1
20100177916 Gerkmann et al. Jul 2010 A1
20100211385 Sehlstedt Aug 2010 A1
20100228545 Ito et al. Sep 2010 A1
20100245624 Beaucoup Sep 2010 A1
20100278352 Petit et al. Nov 2010 A1
20100280824 Petit et al. Nov 2010 A1
20100290615 Takahashi Nov 2010 A1
20100296668 Lee et al. Nov 2010 A1
20100309774 Astrom Dec 2010 A1
20110019833 Kuech et al. Jan 2011 A1
20110035213 Malenovsky et al. Feb 2011 A1
20110038486 Beaucoup Feb 2011 A1
20110038557 Closset et al. Feb 2011 A1
20110044324 Li et al. Feb 2011 A1
20110075857 Aoyagi Mar 2011 A1
20110081024 Soulodre Apr 2011 A1
20110107367 Georgis et al. May 2011 A1
20110123019 Gowreesunker et al. May 2011 A1
20110129095 Avendano et al. Jun 2011 A1
20110137646 Ahgren et al. Jun 2011 A1
20110142257 Goodwin et al. Jun 2011 A1
20110178800 Watts Jul 2011 A1
20110184732 Godavarti Jul 2011 A1
20110184734 Wang et al. Jul 2011 A1
20110191101 Uhle et al. Aug 2011 A1
20110208520 Lee Aug 2011 A1
20110257965 Hardwick Oct 2011 A1
20110257967 Every et al. Oct 2011 A1
20110261150 Goyal et al. Oct 2011 A1
20110264449 Sehlstedt Oct 2011 A1
20120063609 Triki et al. Mar 2012 A1
20120087514 Williams et al. Apr 2012 A1
20120116758 Murgia et al. May 2012 A1
20120121096 Chen et al. May 2012 A1
20120123775 Murgia et al. May 2012 A1
20120140917 Nicholson et al. Jun 2012 A1
20120179462 Klein Jul 2012 A1
20120197898 Pandey et al. Aug 2012 A1
20120209611 Furuta et al. Aug 2012 A1
20120220347 Davidson Aug 2012 A1
20120237037 Ninan et al. Sep 2012 A1
20120250871 Lu et al. Oct 2012 A1
20120257778 Hall et al. Oct 2012 A1
20130011111 Abraham et al. Jan 2013 A1
20130024190 Fairey Jan 2013 A1
20130096914 Avendano et al. Apr 2013 A1
20130289988 Fry Oct 2013 A1
20130289996 Fry Oct 2013 A1
20130322461 Poulsen Dec 2013 A1
20130343549 Vemireddy et al. Dec 2013 A1
20140003622 Ikizyan et al. Jan 2014 A1
20140098964 Rosca et al. Apr 2014 A1
20140241702 Solbach et al. Aug 2014 A1
20140350926 Schuster et al. Nov 2014 A1
20150078555 Zhang et al. Mar 2015 A1
20150078606 Zhang et al. Mar 2015 A1
20150208165 Volk et al. Jul 2015 A1
20160027451 Solbach et al. Jan 2016 A1
20160037245 Harrington Feb 2016 A1
20160061934 Woodruff et al. Mar 2016 A1
20160078880 Avendano et al. Mar 2016 A1
20160093307 Warren et al. Mar 2016 A1
20160094910 Vallabhan et al. Mar 2016 A1
20160162469 Santos Jun 2016 A1
Foreign Referenced Citations (119)
Number Date Country
105474311 Apr 2016 CN
112014003337 Mar 2016 DE
0756437 Jan 1997 EP
1081685 Mar 2001 EP
1232496 Aug 2002 EP
1474755 Nov 2004 EP
20080428 Jul 2008 FI
20080623 Nov 2008 FI
20100431 Dec 2010 FI
20110428 Dec 2011 FI
20125600 Jun 2012 FI
123080 Oct 2012 FI
124716 Dec 2014 FI
62110349 May 1987 JP
4184400 Jul 1992 JP
5053587 Mar 1993 JP
H05172865 Jul 1993 JP
H05300419 Nov 1993 JP
6269083 Sep 1994 JP
H07248793 Sep 1995 JP
H07336793 Dec 1995 JP
H10313497 Nov 1998 JP
H11249693 Sep 1999 JP
2001159899 Jun 2001 JP
2002366200 Dec 2002 JP
2002542689 Dec 2002 JP
2003514473 Apr 2003 JP
2003271191 Sep 2003 JP
2004053895 Feb 2004 JP
2004187283 Jul 2004 JP
2004531767 Oct 2004 JP
2004533155 Oct 2004 JP
2005110127 Apr 2005 JP
2005148274 Jun 2005 JP
2005518118 Jun 2005 JP
2005195955 Jul 2005 JP
2005309096 Nov 2005 JP
2006094522 Apr 2006 JP
2006515490 May 2006 JP
2006337415 Dec 2006 JP
2007006525 Jan 2007 JP
2007201818 Aug 2007 JP
2008015443 Jan 2008 JP
2008518257 May 2008 JP
2008135933 Jun 2008 JP
2008542798 Nov 2008 JP
2009037042 Feb 2009 JP
2009522942 Jun 2009 JP
2009538450 Nov 2009 JP
2010532879 Oct 2010 JP
2011527025 Oct 2011 JP
5007442 Jun 2012 JP
2012514233 Jun 2012 JP
5081903 Sep 2012 JP
2013513306 Apr 2013 JP
2013527479 Jun 2013 JP
5718251 Mar 2015 JP
5762956 Jun 2015 JP
5855571 Dec 2015 JP
1020060024498 Mar 2006 KR
1020070068270 Jun 2007 KR
1020080092404 Oct 2008 KR
101050379 Dec 2008 KR
1020080109048 Dec 2008 KR
1020090013221 Feb 2009 KR
1020100041741 Apr 2010 KR
1020110038024 Apr 2011 KR
1020110111409 Oct 2011 KR
1020120094892 Aug 2012 KR
1020120101457 Sep 2012 KR
101210313 Dec 2012 KR
101294634 Aug 2013 KR
101461141 Nov 2014 KR
101610662 Apr 2016 KR
519615 Feb 2003 TW
526468 Apr 2003 TW
200305854 Nov 2003 TW
200629240 Aug 2006 TW
I279776 Apr 2007 TW
200847133 Dec 2008 TW
200910793 Mar 2009 TW
201009817 Mar 2010 TW
201113873 Apr 2011 TW
201143475 Dec 2011 TW
I421858 Jan 2014 TW
I463817 Dec 2014 TW
I465121 Dec 2014 TW
201513099 Apr 2015 TW
I488179 Jun 2015 TW
WO0137265 May 2001 WO
WO0141504 Jun 2001 WO
WO0156328 Aug 2001 WO
WO0174118 Oct 2001 WO
WO0207061 Jan 2002 WO
WO02080362 Oct 2002 WO
WO02103676 Dec 2002 WO
WO03043374 May 2003 WO
WO03069499 Aug 2003 WO
WO2004010415 Jan 2004 WO
WO2005086138 Sep 2005 WO
WO2006027707 Mar 2006 WO
WO2007001068 Jan 2007 WO
WO2007049644 May 2007 WO
WO2007081916 Jul 2007 WO
WO2007140003 Dec 2007 WO
WO2008034221 Mar 2008 WO
WO2008045476 Apr 2008 WO
WO2009008998 Jan 2009 WO
WO2010005493 Jan 2010 WO
WO2010077361 Jul 2010 WO
WO2011002489 Jan 2011 WO
WO2011068901 Jun 2011 WO
WO2011091068 Jul 2011 WO
WO2012094422 Jul 2012 WO
WO2012097016 Jul 2012 WO
WO2014131054 Aug 2014 WO
WO2015010129 Jan 2015 WO
WO2016040885 Mar 2016 WO
WO2016049566 Mar 2016 WO
Non-Patent Literature Citations (301)
Entry
Non-Final Office Action, Dec. 6, 2011, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Final Office Action, Apr. 16, 2012, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Advisory Action, Jun. 28, 2012, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Non-Final Office Action, Jan. 3, 2014, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Notice of Allowance, Aug. 25, 2014, U.S. Appl. No. 12/319,107, filed Dec. 31, 2008.
Non-Final Office Action, Dec. 10, 2012, U.S. Appl. No. 12/493,927, filed Jun. 29, 2009.
Final Office Action, May 14, 2013, U.S. Appl. No. 12/493,927, filed Jun. 29, 2009.
Non-Final Office Action, Jan. 9, 2014, U.S. Appl. No. 12/493,927, filed Jun. 29, 2009.
Notice of Allowance, Aug. 20, 2014, U.S. Appl. No. 12/493,927, filed Jun. 29, 2009.
Non-Final Office Action, Aug. 28, 2012, U.S. Appl. No. 12/860,515, filed Aug. 20, 2010.
Final Office Action, Mar. 11, 2013, U.S. Appl. No. 12/860,515, filed Aug. 20, 2010.
Non-Final Office Action, Aug. 28, 2013, U.S. Appl. No. 12/860,515, filed Aug. 20, 2010.
Notice of Allowance, Jun. 18, 2014, U.S. Appl. No. 12/860,515, filed Aug. 20, 2010.
Non-Final Office Action, Oct. 11, 2012, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Final Office Action, May 22, 2013, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Non-Final Office Action, Jan. 30, 2014, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Non-Final Office Action, Nov. 19, 2014, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Notice of Allowance, Jul. 30, 2015, U.S. Appl. No. 12/896,725, filed Oct. 1, 2010.
Non-Final Office Action, Oct. 2, 2012, U.S. Appl. No. 12/906,009, filed Oct. 15, 2010.
Non-Final Office Action, Jul. 2, 2013, U.S. Appl. No. 12/906,009, filed Oct. 15, 2010.
Final Office Action, May 7, 2014, U.S. Appl. No. 12/906,009, filed Oct. 15, 2010.
Non-Final Office Action, Apr. 21, 2015, U.S. Appl. No. 12/906,009, filed Oct. 15, 2010.
Non-Final Office Action, Jul. 31, 2013, U.S. Appl. No. 13/009,732, filed Jan. 19, 2011.
Final Office Action, Dec. 16, 2014, U.S. Appl. No. 13/009,732, filed Jan. 19, 2011.
Non-Final Office Action, Apr. 24, 2013, U.S. Appl. No. 13/012,517, filed Jan. 24, 2011.
Final Office Action, Dec. 3, 2013, U.S. Appl. No. 13/012,517, filed Jan. 24, 2011.
Non-Final Office Action, Nov. 19, 2014, U.S. Appl. No. 13/012,517, filed Jan. 24, 2011.
Final Office Action, Jun. 17, 2015, U.S. Appl. No. 13/012,517, filed Jan. 24, 2011.
Non-Final Office Action, Feb. 21, 2012, U.S. Appl. No. 13/288,858, filed Nov. 3, 2011.
Notice of Allowance, Sep. 10, 2012, U.S. Appl. No. 13/288,858, filed Nov. 3, 2011.
Non-Final Office Action, Feb. 14, 2012, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Final Office Action, Jul. 9, 2012, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Final Office Action, Jul. 17, 2012, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Advisory Action, Sep. 24, 2012, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Notice of Allowance, May 9, 2014, U.S. Appl. No. 13/295,981, filed Nov. 14, 2011.
Non-Final Office Action, May 10, 2013, U.S. Appl. No. 13/751,907, filed Jan. 28, 2013.
Notice of Allowance, Sep. 17, 2013, U.S. Appl. No. 13/751,907, filed Jan. 28, 2013.
Non-Final Office Action, Dec. 28, 2015, U.S. Appl. No. 14/081,723, filed Nov. 15, 2013.
International Search Report dated Jun. 8, 2001 in Patent Cooperation Treaty Application No. PCT/US2001/008372.
International Search Report dated Apr. 3, 2003 in Patent Cooperation Treaty Application No. PCT/US2002/036946.
International Search Report dated May 29, 2003 in Patent Cooperation Treaty Application No. PCT/US2003/004124.
International Search Report and Written Opinion dated Oct. 19, 2007 in Patent Cooperation Treaty Application No. PCT/US2007/000463.
International Search Report and Written Opinion dated Apr. 9, 2008 in Patent Cooperation Treaty Application No. PCT/US2007/021654.
International Search Report and Written Opinion dated Sep. 16, 2008 in Patent Cooperation Treaty Application No. PCT/US2007/012628.
International Search Report and Written Opinion dated Oct. 1, 2008 in Patent Cooperation Treaty Application No. PCT/US2008/008249.
International Search Report and Written Opinion dated Aug. 27, 2009 in Patent Cooperation Treaty Application No. PCT/US2009/003813.
Dahl, Mattias et al., “Acoustic Echo and Noise Cancelling Using Microphone Arrays”, International Symposium on Signal Processing and its Applications, ISSPA, Gold Coast, Australia, Aug. 25-30, 1996, pp. 379-382.
Demol, M. et al., “Efficient Non-Uniform Time-Scaling of Speech With WSOLA for CALL Applications”, Proceedings of InSTIL/ICALL2004—NLP and Speech Technologies in Advanced Language Learning Systems—Venice Jun. 17-19, 2004.
Laroche, Jean. “Time and Pitch Scale Modification of Audio Signals”, in “Applications of Digital Signal Processing to Audio and Acoustics”, The Kluwer International Series in Engineering and Computer Science, vol. 437, pp. 279-309, 2002.
Moulines, Eric et al., “Non-Parametric Techniques for Pitch-Scale and Time-Scale Modification of Speech”, Speech Communication, vol. 16, pp. 175-205, 1995.
Verhelst, Werner, “Overlap-Add Methods for Time-Scaling of Speech”, Speech Communication vol. 30, pp. 207-221, 2000.
Bach et al., “Learning Spectral Clustering, with Application to Speech Separation”, Journal of Machine Learning Research, 2006.
Mokbel et al., 1995, IEEE Transactions on Speech and Audio Processing, vol. 3, No. 5, Sep. 1995, pp. 346-356.
Office Action mailed Oct. 14, 2013 in Taiwanese Patent Application 097125481, filed Jul. 4, 2008.
Office Action mailed Oct. 29, 2013 in Japanese Patent Application 2011-516313, filed Jun. 26, 2009.
Office Action mailed Dec. 20, 2013 in Taiwanese Patent Application 096146144, filed Dec. 4, 2007.
Office Action mailed Dec. 9, 2013 in Finnish Patent Application 20100431, filed Jun. 26, 2009.
Office Action mailed Jan. 20, 2014 in Finnish Patent Application 20100001, filed Jul. 3, 2008.
Office Action mailed Mar. 10, 2014 in Taiwanese Patent Application 097125481, filed Jul. 4, 2008.
Bai et al., “Upmixing and Downmixing Two-channel Stereo Audio for Consumer Electronics”. IEEE Transactions on Consumer Electronics [Online] 2007, vol. 53, Issue 3, pp. 1011-1019.
Jo et al., “Crosstalk cancellation for spatial sound reproduction in portable devices with stereo loudspeakers”. Communications in Computer and Information Science [Online] 2011, vol. 266, pp. 114-123.
Nongpuir et al., “NEXT cancellation system with improved convergence rate and tracking performance”. IEEE Proceedings—Communications [Online] 2005, vol. 152, Issue 3, pp. 378-384.
Ahmed et al., “Blind Crosstalk Cancellation for DMT Systems” IEEE—Emergent Technologies Technical Committee. Sep. 2002. pp. 1-5.
Allowance mailed May 21, 2014 in Finnish Patent Application 20100001, filed Jan. 4, 2010.
Office Action mailed May 2, 2014 in Taiwanese Patent Application 098121933, filed Jun. 29, 2009.
Office Action mailed Apr. 15, 2014 in Japanese Patent Application 2010-514871, filed Jul. 3, 2008.
Office Action mailed Jun. 27, 2014 in Korean Patent Application No. 10-2010-7000194, filed Jan. 6, 2010.
Office Action mailed Jun. 18, 2014 in Finnish Patent Application No. 20080428, filed Jul. 4, 2008.
International Search Report & Written Opinion dated Jul. 15, 2014 in Patent Cooperation Treaty Application No. PCT/US2014/018443, filed Feb. 25, 2014.
Notice of Allowance dated Aug. 26, 2014 in Taiwanese Application No. 096146144, filed Dec. 4, 2007.
Notice of Allowance dated Sep. 16, 2014 in Korean Application No. 10-2010-7000194, filed Jul. 3, 2008.
Notice of Allowance dated Sep. 29, 2014 in Taiwanese Application No. 097125481, filed Jul. 4, 2008.
Notice of Allowance dated Oct. 10, 2014 in Finnish Application No. 20100001, filed Jul. 3, 2008.
International Search Report & Written Opinion dated Nov. 12, 2014 in Patent Cooperation Treaty Application No. PCT/US2014/047458, filed Jul. 21, 2014.
Office Action mailed Oct. 28, 2014 in Japanese Patent Application No. 2011-516313, filed Dec. 27, 2012.
Heiko Purnhagen, “Low Complexity Parametric Stereo Coding in MPEG-4,” Proc. of the 7th Int. Conference on Digital Audio Effects (DAFx'04), Naples, Italy, Oct. 5-8, 2004.
Chun-Ming Chang et al., “Voltage-Mode Multifunction Filter with Single Input and Three Outputs Using Two Compound Current Conveyors” IEEE Transactions on Circuits and Systems-I: Fundamental Theory and Applications, vol. 46, No. 11, Nov. 1999.
Notice of Allowance mailed Feb. 10, 2015 in Taiwanese Patent Application No. 098121933, filed Jun. 29, 2009.
Office Action mailed Jan. 30, 2015 in Finnish Patent Application No. 20080623, filed May 24, 2007.
Office Action mailed Mar. 24, 2015 in Japanese Patent Application No. 2011-516313, filed Jun. 26, 2009.
Office Action mailed Apr. 16, 2015 in Korean Patent Application No. 10-2011-7000440, filed Jun. 26, 2009.
Notice of Allowance mailed Jun. 2, 2015 in Japanese Patent Application 2011-516313, filed Jun. 26, 2009.
Office Action mailed Jun. 4, 2015 in Finnish Patent Application 20080428, filed Jan. 5, 2007.
Office Action mailed Jun. 9, 2015 in Japanese Patent Application 2014-165477 filed Jul. 3, 2008.
Notice of Allowance mailed Aug. 13, 2015 in Finnish Patent Application 20080623, filed May 24, 2007.
International Search Report & Written Opinion dated Nov. 27, 2015 in Patent Cooperation Treaty Application No. PCT/US2015/047263, filed Aug. 27, 2015.
Non-Final Office Action, Oct. 27, 2003, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Non-Final Office Action, Feb. 10, 2004, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Final Office Action, Dec. 17, 2004, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Non-Final Office Action, Apr. 20, 2005, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Notice of Allowance, Oct. 26, 2005, U.S. Appl. No. 09/534,682, filed Mar. 24, 2000.
Non-Final Office Action, May 3, 2005, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Final Office Action, Oct. 19, 2005, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Advisory Action, Jan. 20, 2006, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Non-Final Office Action, May 17, 2006, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Non-Final Office Action, Nov. 16, 2006, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Final Office Action, Jun. 15, 2007, U.S. Appl. No. 09/993,442, filed Nov. 13, 2001.
Non-Final Office Action, Oct. 8, 2003, U.S. Appl. No. 10/004,141, filed Nov. 14, 2001.
Notice of Allowance, Feb. 24, 2004, U.S. Appl. No. 10/004,141, filed Nov. 14, 2001.
Non-Final Office Action, May 9, 2003, U.S. Appl. No. 10/074,991, filed Feb. 13, 2002.
Notice of Allowance, Jun. 4, 2003, U.S. Appl. No. 10/074,991, filed Feb. 13, 2002.
Non-Final Office Action, Jun. 26, 2006, U.S. Appl. No. 10/074,991, filed Feb. 13, 2002.
Final Office Action, Feb. 23, 2007, U.S. Appl. No. 10/074,991, filed Feb. 13, 2002.
Non-Final Office Action, Oct. 6, 2005, U.S. Appl. No. 10/177,049, filed Jun. 21, 2002.
Final Office Action, Mar. 28, 2006, U.S. Appl. No. 10/177,049, filed Jun. 21, 2002.
Advisory Action, Jun. 19, 2006, U.S. Appl. No. 10/177,049, filed Jun. 21, 2002.
Non-Final Office Action, Dec. 13, 2006, U.S. Appl. No. 10/613,224, filed Jul. 3, 2003.
Non-Final Office Action, Jun. 13, 2007, U.S. Appl. No. 10/613,224, filed Jul. 3, 2003.
Non-Final Office Action, Jun. 13, 2006, U.S. Appl. No. 10/840,201, filed May 5, 2004.
Non-Final Office Action, Mar. 30, 2010, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Non-Final Office Action, Sep. 13, 2010, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Final Office Action, Mar. 30, 2011, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Final Office Action, May 21, 2012, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Notice of Allowance, Oct. 9, 2012, U.S. Appl. No. 11/343,524, filed Jan. 30, 2006.
Non-Final Office Action, Aug. 5, 2008, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Non-Final Office Action, Jan. 21, 2009, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Final Office Action, Sep. 3, 2009, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Non-Final Office Action, May 10, 2011, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Final Office Action, Oct. 24, 2011, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Notice of Allowance, Feb. 13, 2012, U.S. Appl. No. 11/441,675, filed May 25, 2006.
Non-Final Office Action, Apr. 7, 2011, U.S. Appl. No. 11/699,732, filed Jan. 29, 2007.
Final Office Action, Dec. 6, 2011, U.S. Appl. No. 11/699,732, filed Jan. 29, 2007.
Advisory Action, Feb. 2012, U.S. Appl. No. 11/699,732, filed Jan. 29, 2007.
Notice of Allowance, Mar. 15, 2012, U.S. Appl. No. 11/699,732, filed Jan. 29, 2007.
Non-Final Office Action, Aug. 18, 2010 U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Final Office Action, Apr. 28, 2011, U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Non-Final Office Action, Apr. 24, 2013, U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Final Office Action, Dec. 30, 2013, U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Notice of Allowance, Mar. 25, 2014, U.S. Appl. No. 11/825,563, filed Jul. 6, 2007.
Non-Final Office Action, Oct. 3, 2011, U.S. Appl. No. 12/004,788, filed Dec. 21, 2007.
Notice of Allowance, Feb. 23, 2012, U.S. Appl. No. 12/004,788, filed Dec. 21, 2007.
Non-Final Office Action, Sep. 14, 2011, U.S. Appl. No. 12/004,897, filed Dec. 21, 2007.
Notice of Allowance, Jan. 27, 2012, U.S. Appl. No. 12/004,897, filed Dec. 21, 2007.
Non-Final Office Action, Jul. 28, 2011, U.S. Appl. No. 12/072,931, filed Feb. 29, 2008.
Notice of Allowance, Mar. 1, 2012, U.S. Appl. No. 12/072,931, filed Feb. 29, 2008.
Notice of Allowance, Mar. 1, 2012, U.S. Appl. No. 12/080,115, filed Mar. 31, 2008.
Non-Final Office Action, Nov. 14, 2011, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Final Office Action, Apr. 24, 2012, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Advisory Action, Jul. 3, 2012, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Non-Final Office Action, Mar. 11, 2014, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Final Office Action, Jul. 11, 2014, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Non-Final Office Action, Dec. 8, 2014, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Notice of Allowance, Jul. 7, 2015, U.S. Appl. No. 12/215,980, filed Jun. 30, 2008.
Non-Final Office Action, Jul. 13, 2011, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Final Office Action, Nov. 16, 2011, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Non-Final Office Action, Mar. 14, 2012, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Final Office Action, Sep. 19, 2012, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Notice of Allowance, Apr. 15, 2013, U.S. Appl. No. 12/217,076, filed Jun. 30, 2008.
Non-Final Office Action, Sep. 1, 2011, U.S. Appl. No. 12/286,909, filed Oct. 2, 2008.
Notice of Allowance, Feb. 28, 2012, U.S. Appl. No. 12/286,909, filed Oct. 2, 2008.
Non-Final Office Action, Nov. 15, 2011, U.S. Appl. No. 12/286,995, filed Oct. 2, 2008.
Final Office Action, Apr. 10, 2012, U.S. Appl. No. 12/286,995, filed Oct. 2, 2008.
Notice of Allowance, Mar. 13, 2014, U.S. Appl. No. 12/286,995, filed Oct. 2, 2008.
Non-Final Office Action, Dec. 28, 2011, U.S. Appl. No. 12/288,228, filed Oct. 16, 2008.
Non-Final Office Action, Dec. 30, 2011, U.S. Appl. No. 12/422,917, filed Apr. 13, 2009.
Final Office Action, May 14, 2012, U.S. Appl. No. 12/422,917, filed Apr. 13, 2009.
Advisory Action, Jul. 27, 2012, U.S. Appl. No. 12/422,917, filed Apr. 13, 2009.
Notice of Allowance, Sep. 11, 2014, U.S. Appl. No. 12/422,917, filed Apr. 13, 2009.
Non-Final Office Action, Jun. 20, 2012, U.S. Appl. No. 12/649,121, filed Dec. 29, 2009.
Final Office Action, Nov. 28, 2012, U.S. Appl. No. 12/649,121, filed Dec. 29, 2009.
Advisory Action, Feb. 19, 2013, U.S. Appl. No. 12/649,121, filed Dec. 29, 2009.
Notice of Allowance, Mar. 19, 2013, U.S. Appl. No. 12/649,121, filed Dec. 29, 2009.
Non-Final Office Action, Feb. 19, 2013, U.S. Appl. No. 12/944,659, filed Nov. 11, 2010.
Notice of Allowance, May 25, 2011, U.S. Appl. No. 13/016,916, filed Jan. 28, 2011.
Notice of Allowance, Aug. 4, 2011, U.S. Appl. No. 13/016,916, filed Jan. 28, 2011.
Non-Final Office Action, Nov. 2013, U.S. Appl. No. 13/363,362, filed Jan. 31, 2012.
Final Office Action, Sep. 12, 2014, U.S. Appl. No. 13/363,362, filed Jan. 31, 2012.
Non-Final Office Action, Oct. 28, 2015, U.S. Appl. No. 13/363,362, filed Jan. 31, 2012.
Non-Final Office Action, Dec. 4, 2013, U.S. Appl. No. 13/396,568, filed Feb. 14, 2012.
Final Office Action, Sep. 23, 2014, U.S. Appl. No. 13/396,568, filed Feb. 14, 2012.
Non-Final Office Action, Nov. 5, 2015, U.S. Appl. No. 13/396,568, filed Feb. 14, 2012.
Non-Final Office Action, Sep. 17, 2013, U.S. Appl. No. 13/397,597, filed Feb. 15, 2012.
Final Office Action, Apr. 1, 2014, U.S. Appl. No. 13/397,597, filed Feb. 15, 2012.
Non-Final Office Action, Nov. 21, 2014, U.S. Appl. No. 13/397,597, filed Feb. 15, 2012.
Non-Final Office Action, Jun. 7, 2012, U.S. Appl. No. 13/426,436, filed Mar. 21, 2012.
Final Office Action, Dec. 31, 2012, U.S. Appl. No. 13/426,436, filed Mar. 21, 2012.
Non-Final Office Action, Sep. 12, 2013, U.S. Appl. No. 13/426,436, filed Mar. 21, 2012.
Notice of Allowance, Jul. 16, 2014, U.S. Appl. No. 13/426,436, filed Mar. 21, 2012.
Non-Final Office Action, Jul. 15, 2014, U.S. Appl. No. 13/432,490, filed Mar. 28, 2012.
Notice of Allowance, Apr. 3, 2015, U.S. Appl. No. 13/432,490, filed Mar. 28, 2012.
Notice of Allowance, Oct. 17, 2012, U.S. Appl. No. 13/565,751, filed Aug. 2, 2012.
Non-Final Office Action, Jan. 9, 2012, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Non-Final Office Action, Dec. 28, 2012, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Non-Final Office Action, Mar. 7, 2013, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Final Office Action, Apr. 29, 2013, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Non-Final Office Action, Nov. 27, 2013, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Notice of Allowance, Jan. 30, 2014, U.S. Appl. No. 13/664,299, filed Oct. 30, 2012.
Non-Final Office Action, Jun. 4, 2013, U.S. Appl. No. 13/705,132, filed Dec. 4, 2012.
Final Office Action, Dec. 19, 2013, U.S. Appl. No. 13/705,132, filed Dec. 4, 2012.
Notice of Allowance, Jun. 19, 2014, U.S. Appl. No. 13/705,132, filed Dec. 4, 2012.
Non-Final Office Action, Jul. 14, 2015, U.S. Appl. No. 14/046,551, filed Oct. 4, 2013.
Non-Final Office Action, May 21, 2015, U.S. Appl. No. 14/189,817, filed Feb. 25, 2014.
Final Office Action, Dec. 15, 2015, U.S. Appl. No. 14/189,817, filed Feb. 25, 2014.
Notice of Allowance, Oct. 7, 2014, U.S. Appl. No. 14/207,096, filed Mar. 12, 2014.
Non-Final Office Action, Oct. 28, 2015, U.S. Appl. No. 14/216,567, filed Mar. 17, 2014.
Non-Final Office Action, Jul. 10, 2014, U.S. Appl. No. 14/279,092, filed May 15, 2014.
Notice of Allowance, Jan. 29, 2015, U.S. Appl. No. 14/279,092, filed May 15, 2014.
Non-Final Office Action, Feb. 27, 2015, U.S. Appl. No. 14/336,934, filed Jul. 21, 2014.
Notice of Allowance, Aug. 28, 2015, U.S. Appl. No. 14/336,934, filed Jul. 21, 2014.
Allen, Jont B. “Short Term Spectral Analysis, Synthesis, and Modification by Discrete Fourier Transform”, IEEE Transactions on Acoustics, Speech, and Signal Processing. vol. ASSP-25, No. 3, Jun. 1977. pp. 235-238.
Allen, Jont B. et al., “A Unified Approach to Short-Time Fourier Analysis and Synthesis”, Proceedings of the IEEE. vol. 65, No. 11, Nov. 1977. pp. 1558-1564.
Avendano, Carlos, “Frequency-Domain Source Identification and Manipulation in Stereo Mixes for Enhancement, Suppression and Re-Panning Applications,” 2003 IEEE Workshop on Application of Signal Processing to Audio and Acoustics, Oct. 19-22, pp. 55-58, New Paltz, New York, USA.
Boll, Steven F. “Suppression of Acoustic Noise in Speech using Spectral Subtraction”, IEEE Transactions on Acoustics, Speech and Signal Processing, vol. ASSP-27, No. 2, Apr. 1979, pp. 113-120.
Boll, Steven F. et al., “Suppression of Acoustic Noise in Speech Using Two Microphone Adaptive Noise Cancellation”, IEEE Transactions on Acoustic, Speech, and Signal Processing, vol. ASSP-28, No. 6, Dec. 1980, pp. 752-753.
Boll, Steven F. “Suppression of Acoustic Noise in Speech Using Spectral Subtraction”, Dept. of Computer Science, University of Utah, Salt Lake City, Utah, Apr. 1979, pp. 18-19.
Chen, Jingdong et al., “New Insights into the Noise Reduction Wiener Filter”, IEEE Transactions on Audio, Speech, and Language Processing. vol. 14, No. 4, Jul. 2006, pp. 1218-1234.
Cohen, Israel et al., “Microphone Array Post-Filtering for Non-Stationary Noise Suppression”, IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2002, pp. 1-4.
Cohen, Israel, “Multichannel Post-Filtering in Nonstationary Noise Environments”, IEEE Transactions on Signal Processing, vol. 52, No. 5, May 2004, pp. 1149-1160.
Dahl, Mattias et al., “Simultaneous Echo Cancellation and Car Noise Suppression Employing a Microphone Array”, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr. 21-24, pp. 239-242.
Elko, Gary W., “Chapter 2: Differential Microphone Arrays”, “Audio Signal Processing for Next-Generation Multimedia Communication Systems”, 2004, pp. 12-65, Kluwer Academic Publishers, Norwell, Massachusetts, USA.
“ENT 172,” Instructional Module. Prince George's Community College Department of Engineering Technology. Accessed: Oct. 15, 2011. Subsection: “Polar and Rectangular Notation”. <http://academic.ppgcc.edu/ent/ent172_instr_mod.html>.
Fuchs, Martin et al., “Noise Suppression for Automotive Applications Based on Directional Information”, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, May 17-21, pp. 237-240.
Fulghum, D. P. et al., “LPC Voice Digitizer with Background Noise Suppression”, 1979 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 220-223.
Goubran, R.A. et al., “Acoustic Noise Suppression Using Regressive Adaptive Filtering”, 1990 IEEE 40th Vehicular Technology Conference, May 6-9, pp. 48-53.
Graupe, Daniel et al., “Blind Adaptive Filtering of Speech from Noise of Unknown Spectrum Using a Virtual Feedback Configuration”, IEEE Transactions on Speech and Audio Processing, Mar. 2000, vol. 8, No. 2, pp. 146-158.
Haykin, Simon et al., “Appendix A.2 Complex Numbers.” Signals and Systems. 2nd Ed. 2003. p. 764.
Hermansky, Hynek “Should Recognizers Have Ears?”, In Proc. ESCA Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, pp. 1-10, France 1997.
Hohmann, V. “Frequency Analysis and Synthesis Using a Gammatone Filterbank”, ACTA Acustica United with Acustica, 2002, vol. 88, pp. 433-442.
Jeffress, Lloyd A. et al., “A Place Theory of Sound Localization,” Journal of Comparative and Physiological Psychology, 1948, vol. 41, pp. 35-39.
Jeong, Hyuk et al., “Implementation of a New Algorithm Using the STFT with Variable Frequency Resolution for the Time-Frequency Auditory Model”, J. Audio Eng. Soc., Apr. 1999, vol. 47, No. 4, pp. 240-251.
Kates, James M. “A Time-Domain Digital Cochlear Model”, IEEE Transactions on Signal Processing, Dec. 1991, vol. 39, No. 12, pp. 2573-2592.
Kato et al., “Noise Suppression with High Speech Quality Based on Weighted Noise Estimation and MMSE STSA” Proc. IWAENC [Online] 2001, pp. 183-186.
Lazzaro, John et al., “A Silicon Model of Auditory Localization,” Neural Computation Spring 1989, vol. 1, pp. 47-57, Massachusetts Institute of Technology.
Lippmann, Richard P. “Speech Recognition by Machines and Humans”, Speech Communication, Jul. 1997, vol. 22, No. 1, pp. 1-15.
Liu, Chen et al., “A Two-Microphone Dual Delay-Line Approach for Extraction of a Speech Sound in the Presence of Multiple Interferers”, Journal of the Acoustical Society of America, vol. 110, No. 6, Dec. 2001, pp. 3218-3231.
Martin, Rainer et al., “Combined Acoustic Echo Cancellation, Dereverberation and Noise Reduction: A two Microphone Approach”, Annales des Telecommunications/Annals of Telecommunications. vol. 49, No. 7-8, Jul.-Aug. 1994, pp. 429-438.
Martin, Rainer “Spectral Subtraction Based on Minimum Statistics”, in Proceedings of the European Signal Processing Conference, 1994, pp. 1182-1185.
Mitra, Sanjit K. Digital Signal Processing: a Computer-based Approach. 2nd Ed. 2001. pp. 131-133.
Mizumachi, Mitsunori et al., “Noise Reduction by Paired-Microphones Using Spectral Subtraction”, 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, May 12-15. pp. 1001-1004.
Moonen, Marc et al., “Multi-Microphone Signal Enhancement Techniques for Noise Suppression and Dereverberation,” http://www.esat.kuleuven.ac.be/sista/yearreport97//node37.html, accessed on Apr. 21, 1998.
Watts, Lloyd Narrative of Prior Disclosure of Audio Display on Feb. 15, 2000 and May 31, 2000.
Cosi, Piero et al., (1996), “Lyon's Auditory Model Inversion: a Tool for Sound Separation and Speech Enhancement,” Proceedings of ESCA Workshop on ‘The Auditory Basis of Speech Perception,’ Keele University, Keele (UK), Jul. 15-19, 1996, pp. 194-197.
Parra, Lucas et al., “Convolutive Blind Separation of Non-Stationary Sources”, IEEE Transactions on Speech and Audio Processing, vol. 8, No. 3, May 2000, pp. 320-327.
Rabiner, Lawrence R. et al., “Digital Processing of Speech Signals”, (Prentice-Hall Series in Signal Processing). Upper Saddle River, NJ: Prentice Hall, 1978.
Weiss, Ron et al., “Estimating Single-Channel Source Separation Masks: Relevance Vector Machine Classifiers vs. Pitch-Based Masking”, Workshop on Statistical and Perceptual Audio Processing, 2006.
Schimmel, Steven et al., “Coherent Envelope Detection for Modulation Filtering of Speech,” 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, No. 7, pp. 221-224.
Slaney, Malcolm, “Lyon's Cochlear Model”, Advanced Technology Group, Apple Technical Report #13, Apple Computer, Inc., 1988, pp. 1-79.
Slaney, Malcolm, et al., “Auditory Model Inversion for Sound Separation,” 1994 IEEE International Conference on Acoustics, Speech and Signal Processing, Apr. 19-22, vol. 2, pp. 77-80.
Slaney, Malcolm. “An Introduction to Auditory Model Inversion”, Interval Technical Report IRC 1994-014, http://coweb.ecn.purdue.edu/~maclom/interval/1994-014/, Sep. 1994, accessed on Jul. 6, 2010.
Solbach, Ludger “An Architecture for Robust Partial Tracking and Onset Localization in Single Channel Audio Signal Mixes”, Technical University Hamburg-Harburg, 1998.
Soon et al., “Low Distortion Speech Enhancement” Proc. Inst. Elect. Eng. [Online] 2000, vol. 147, pp. 247-253.
Stahl, V. et al., “Quantile Based Noise Estimation for Spectral Subtraction and Wiener Filtering,” 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, Jun. 5-9, vol. 3, pp. 1875-1878.
Syntrillium Software Corporation, “Cool Edit User's Manual”, 1996, pp. 1-74.
Tashev, Ivan et al., “Microphone Array for Headset with Spatial Noise Suppressor”, http://research.microsoft.com/users/ivantash/Documents/Tashev_MAforHeadset_HSCMA_05.pdf. (4 pages).
Tchorz, Jurgen et al., “SNR Estimation Based on Amplitude Modulation Analysis with Applications to Noise Suppression”, IEEE Transactions on Speech and Audio Processing, vol. 11, No. 3, May 2003, pp. 184-192.
Valin, Jean-Marc et al., “Enhanced Robot Audition Based on Microphone Array Source Separation with Post-Filter”, Proceedings of 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sep. 28-Oct. 2, 2004, Sendai, Japan. pp. 2123-2128.
Watts, Lloyd, “Robust Hearing Systems for Intelligent Machines,” Applied Neurosystems Corporation, 2001, pp. 1-5.
Widrow, B. et al., “Adaptive Antenna Systems,” Proceedings of the IEEE, vol. 55, No. 12, pp. 2143-2159, Dec. 1967.
Yoo, Heejong et al., “Continuous-Time Audio Noise Suppression and Real-Time Implementation”, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, May 13-17, pp. IV3980-IV3983.
Office Action mailed May 17, 2016 in Korean Patent Application 1020127001822 filed Jun. 21, 2010.
Lauber, Pierre et al., “Error Concealment for Compressed Digital Audio,” Audio Engineering Society, 2001.
International Search Report and Written Opinion dated May 20, 2010 in Patent Cooperation Treaty Application No. PCT/US2009/006754.
Fast Cochlea Transform, US Trademark Reg. No. 2,875,755 (Aug. 17, 2004).
3GPP2 “Enhanced Variable Rate Codec, Speech Service Options 3, 68, 70, and 73 for Wideband Spread Spectrum Digital Systems”, May 2009, pp. 1-308.
3GPP2 “Selectable Mode Vocoder (SMV) Service Option for Wideband Spread Spectrum Communication Systems”, Jan. 2004, pp. 1-231.
3GPP2 “Source-Controlled Variable-Rate Multimode Wideband Speech Codec (VMR-WB) Service Option 62 for Spread Spectrum Systems”, Jun. 11, 2004, pp. 1-164.
3GPP “3GPP Specification 26.071 Mandatory Speech Codec Speech Processing Functions; AMR Speech Codec; General Description”, http://www.3gpp.org/ftp/Specs/html-info/26071.htm, accessed on Jan. 25, 2012.
3GPP “3GPP Specification 26.094 Mandatory Speech Codec Speech Processing Functions; Adaptive Multi-Rate (AMR) Speech Codec; Voice Activity Detector (VAD)”, http://www.3gpp.org/ftp/Specs/html-info/26094.htm, accessed on Jan. 25, 2012.
3GPP “3GPP Specification 26.171 Speech Codec Speech Processing Functions; Adaptive Multi-Rate—Wideband (AMR-WB) Speech Codec; General Description”, http://www.3gpp.org/ftp/Specs/html-info26171.htm, accessed on Jan. 25, 2012.
3GPP “3GPP Specification 26.194 Speech Codec Speech Processing Functions; Adaptive Multi-Rate—Wideband (AMR-WB) Speech Codec; Voice Activity Detector (VAD)” http://www.3gpp.org/ftp/Specs/html-info26194.htm, accessed on Jan. 25, 2012.
International Telecommunication Union “Coding of Speech at 8 kbit/s Using Conjugate-Structure Algebraic-code-excited Linear-prediction (CS-ACELP)”, Mar. 19, 1996, pp. 1-39.
International Telecommunication Union “Coding of Speech at 8 kbit/s Using Conjugate Structure Algebraic-code-excited Linear-prediction (CS-ACELP) Annex B: A Silence Compression Scheme for G.729 Optimized for Terminals Conforming to Recommendation V.70”, Nov. 8, 1996, pp. 1-23.
International Search Report and Written Opinion dated Aug. 19, 2010 in Patent Cooperation Treaty Application No. PCT/US2010/001786.
International Search Report and Written Opinion dated Feb. 7, 2011 in Patent Cooperation Treaty Application No. PCT/US2010/058600, filed Dec. 1, 2010.
Cisco, “Understanding How Digital T1 CAS (Robbed Bit Signaling) Works in IOS Gateways”, Jan. 17, 2007, http://www.cisco.com/image/gif/paws/22444/t1-cas-ios.pdf, accessed on Apr. 3, 2012.
Jelinek et al., “Noise Reduction Method for Wideband Speech Coding” Proc. Eusipco, Vienna, Austria, Sep. 2004, pp. 1959-1962.
Widjaja et al., “Application of Differential Microphone Array for IS-127 EVRC Rate Determination Algorithm”, Interspeech 2009, 10th Annual Conference of the International Speech Communication Association, Brighton, United Kingdom, Sep. 6-10, 2009, pp. 1123-1126.
Sugiyama et al., “Single-Microphone Noise Suppression for 3G Handsets Based on Weighted Noise Estimation” in Benesty et al., “Speech Enhancement”, 2005, pp. 115-133, Springer Berlin Heidelberg.
Watts, “Real-Time, High-Resolution Simulation of the Auditory Pathway, with Application to Cell-Phone Noise Reduction” Proceedings of 2010 IEEE International Symposium on Circuits and Systems (ISCAS), May 30-Jun. 2, 2010, pp. 3821-3824.
3GPP Minimum Performance Specification for the Enhanced Variable rate Codec, Speech Service Option 3 and 68 for Wideband Spread Spectrum Digital Systems, Jul. 2007, pp. 1-83.
Ramakrishnan, “Reconstruction of Incomplete Spectrograms for Robust Speech Recognition,” PhD thesis, Carnegie Mellon University, Pittsburgh, Pennsylvania, 2000.
Kim et al., “Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions,” Audio, Speech, and Language Processing, IEEE Transactions on, vol. 18, No. 8 pp. 2111-2120, Nov. 2010.
Cooke et al.,“Robust Automatic Speech Recognition with Missing and Unreliable Acoustic data,” Speech Commun., vol. 34, No. 3, pp. 267-285, 2001.
Liu et al., “Efficient cepstral normalization for robust speech recognition.” Proceedings of the workshop on Human Language Technology. Association for Computational Linguistics, 1993.
Yoshizawa et al., “Cepstral Gain Normalization for Noise Robust Speech Recognition,” Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), vol. 1, IEEE, 2004.
Office Action mailed Apr. 8, 2014 in Japan Patent Application 2011-544416, filed Dec. 30, 2009.
Elhilali et al., “A cocktail party with a cortical twist: How cortical mechanisms contribute to sound segregation,” J. Acoust. Soc. Am., vol. 124, No. 6, Dec. 2008, pp. 3751-3771.
Jin et al., “HMM-Based Multipitch Tracking for Noisy and Reverberant Speech.” Jul. 2011.
Kawahara, W., et al., “Tandem-Straight: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0, and aperiodicity estimation.” IEEE ICASSP 2008.
Lu et al. “A Robust Audio Classification and Segmentation Method.” Microsoft Research, 2001, pp. 203, 206, and 207.
Office Action dated Aug. 26, 2014 in Japan Application No. 2012-542167, filed Dec. 1, 2010.
Office Action mailed Oct. 31, 2014 in Finland Patent Application No. 20125600, filed Jun. 1, 2012.
Krini, Mohamed et al., “Model-Based Speech Enhancement,” in Speech and Audio Processing in Adverse Environments; Signals and Communication Technology, edited by Hansler et al., 2008, Chapter 4, pp. 89-134.
Office Action mailed Dec. 9, 2014 in Japan Patent Application No. 2012-518521, filed Jun. 21, 2010.
Office Action mailed Dec. 10, 2014 in Taiwan Patent Application No. 099121290, filed Jun. 29, 2010.
Nayebi et al., “Low delay FIR filter banks: design and evaluation” IEEE Transactions on Signal Processing, vol. 42, No. 1, pp. 24-31, Jan. 1994.
Notice of Allowance mailed Feb. 17, 2015 in Japan Patent Application No. 2011-544416, filed Dec. 30, 2009.
Office Action mailed Mar. 27, 2015 in Korean Patent Application No. 10-2011-7016591, filed Dec. 30, 2009.
Office Action mailed Jul. 21, 2015 in Japan Patent Application No. 2012-542167, filed Dec. 1, 2010.
Office Action mailed Sep. 29, 2015 in Finland Patent Application No. 20125600, filed Dec. 1, 2010.
Office Action mailed Oct. 15, 2015 in Korean Patent Application 10-2011-7016591.
Allowance mailed Nov. 17, 2015 in Japan Patent Application No. 2012-542167, filed Dec. 1, 2010.
International Search Report & Written Opinion dated Dec. 14, 2015 in Patent Cooperation Treaty Application No. PCT/US2015/049816, filed Sep. 11, 2015.
International Search Report & Written Opinion dated Dec. 22, 2015 in Patent Cooperation Treaty Application No. PCT/US2015/052433, filed Sep. 25, 2015.
Notice of Allowance dated Jan. 14, 2016 in South Korean Patent Application No. 10-2011-7016591 filed Jul. 15, 2011.
International Search Report & Written Opinion dated Feb. 12, 2016 in Patent Cooperation Treaty Application No. PCT/US2015/064523, filed Dec. 8, 2015.
International Search Report & Written Opinion dated Feb. 11, 2016 in Patent Cooperation Treaty Application No. PCT/US2015/063519, filed Dec. 2, 2015.
Klein, David, “Noise-Robust Multi-Lingual Keyword Spotting with a Deep Neural Network Based Architecture”, U.S. Appl. No. 14/614,348, filed Feb. 4, 2015.
Vitus, Deborah Kathleen et al., “Method for Modeling User Possession of Mobile Device for User Authentication Framework”, U.S. Appl. No. 14/548,207, filed Nov. 19, 2014.
Murgia, Carlo, “Selection of System Parameters Based on Non-Acoustic Sensor Information”, U.S. Appl. No. 14/331,205, filed Jul. 14, 2014.
Goodwin, Michael M. et al., “Key Click Suppression”, U.S. Appl. No. 14/745,176, filed Jun. 19, 2015.
Related Publications (1)
Number Date Country
20150025881 A1 Jan 2015 US
Provisional Applications (2)
Number Date Country
61856577 Jul 2013 US
61972112 Mar 2014 US