System for suppressing passing tire hiss

Information

  • Patent Grant
  • 8521521
  • Patent Number
    8,521,521
  • Date Filed
    Thursday, September 1, 2011
  • Date Issued
    Tuesday, August 27, 2013
Abstract
A voice enhancement logic improves the perceptual quality of a processed voice. The voice enhancement system includes a passing tire hiss noise detector and a passing tire hiss noise attenuator. The passing tire hiss noise detector detects a passing tire hiss noise by modeling the passing tire hiss. The passing tire hiss noise attenuator dampens the passing tire hiss noise to improve the intelligibility of a speech signal.
Description
BACKGROUND OF THE INVENTION

1. Technical Field


This invention relates to acoustics, and more particularly, to a system that enhances the perceptual quality of a processed voice.


2. Related Art


Many communication devices acquire, assimilate, and transfer a voice signal. Voice signals pass from one system to another through a communication medium. In some systems, including some systems used in vehicles, the clarity of the voice signal does not depend only on the quality of the communication system or the quality of the communication medium. The clarity of the voice signal may also depend on the amount of noise which accompanies the voice signal. When noise occurs near a source or a receiver, distortion garbles the voice signal, destroys information, and in some instances, masks the voice signal so that it is not recognized by a listener or a voice recognition system.


Noise, which may be annoying or distracting or may result in a loss of information, may come from many sources. Noise from a vehicle may be created by the engine, the road, the tires, or by the movement of air. When a vehicle is in motion on a paved road, a significant amount of the noise it produces may be generated from the contact between the tire and the road, heard as a whooshing or hissing sound as the car passes by. This sound may be particularly noticeable to others driving on the highway with their windows down. The noise may originate from an air pumping effect emanating from the air compression and expansion between the tires of the passing car and the road. This sound may be amplified by the sideless horn shape formed by the tire and the road. The short-term, or transient, whooshing or hissing sound as a vehicle passes by a communication device may cause the communication device to suffer voice quality and intelligibility loss, and may also cause speech recognition failure.


Noise estimation techniques may have temporal smoothing parameters to ensure that they do not incorporate speech and temporally short events into their estimates. Because passing tire hiss noise may have a duration similar to that of speech sounds, many conventional noise estimation techniques are unsuitable for identifying passing tire hiss as noise. Instead, passing tire hiss noise may be misinterpreted as signal content and augmented in noise reduction algorithms or misclassified as an utterance in speech recognition applications.


Therefore, there is a need for a system that counteracts passing tire hiss noise.


SUMMARY

A voice enhancement logic improves the perceptual quality of a processed voice. The system detects and dampens some noises associated with moving tires. The system includes a passing tire hiss noise detector and a passing tire hiss noise attenuator. The passing tire hiss noise detector may detect a passing tire hiss noise by comparing the input signal to a passing tire hiss model. The passing tire hiss noise attenuator then dampens the passing tire hiss. The system may also detect, dampen and/or attenuate continuous noise or other transient noises.


Alternative voice enhancement logic includes time frequency transform logic, a background noise estimator, a passing tire hiss noise detector, and a passing tire hiss noise attenuator. The time frequency transform logic converts a time varying input signal into a frequency domain output signal. The background noise estimator measures the continuous noise that may accompany the input signal. The passing tire hiss noise detector automatically identifies and models passing tire hiss noise, which may then be dampened by the passing tire hiss noise attenuator.


Other systems, methods, features, and advantages of the invention will be, or will become, apparent to one with skill in the art upon examination of the following figures and detailed description. It is intended that all such additional systems, methods, features, and advantages be included within this description, be within the scope of the invention, and be protected by the following claims.





BRIEF DESCRIPTION OF THE DRAWINGS

The invention can be better understood with reference to the following drawings and description. The components in the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention. Moreover, in the figures, like referenced numerals designate corresponding parts throughout the different views.



FIG. 1 is a partial block diagram of voice enhancement logic.



FIG. 2 is a time-frequency spectrogram illustrating a signal having a sequence of sounds.



FIG. 3 shows a signal comprising passing tire hiss noise plus background noise, in the time-frequency domain.



FIG. 4 shows a signal comprising a vowel sound plus background noise, in the time-frequency domain.



FIG. 5 is a block diagram of the passing tire hiss noise detector of the voice enhancement logic of FIG. 1.



FIG. 6 is a pre-processing system coupled to the voice enhancement logic of FIG. 1.



FIG. 7 is a block diagram of an alternative voice enhancement system.



FIG. 8 is a flow diagram of a voice enhancement.



FIG. 9 shows a signal comprising both a vowel sound and a passing tire hiss noise in the time-frequency domain.



FIG. 10 shows the signal of FIG. 9 with the passing tire hiss removed in the time-frequency domain.



FIG. 11 shows the signal of FIG. 10 with a reconstructed vowel sound in the time-frequency domain.



FIG. 12 is a block diagram of voice enhancement logic within a vehicle.



FIG. 13 is a block diagram of voice enhancement logic interfaced to an audio system and/or a communication system.





DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

A voice enhancement logic improves the perceptual quality of a processed voice. The logic may automatically detect the shape and form of the noise associated with the hiss of tires of vehicles passing the receiver in a real or a delayed time. By tracking selected attributes, the logic may eliminate or dampen passing tire hiss noise using a limited memory that temporarily stores the selected attributes of the noise. The passing tire hiss noise can be detected and attenuated in the presence or absence of speech. The passing tire hiss noise may be detected and attenuated with some time buffering (e.g. 300-500 ms), or alternatively, the presence of passing tire hiss noise may be predicted based on modeled passing tire hiss noise and attenuated in real time. Alternatively or additionally, the logic may also dampen a continuous noise and/or the “musical noise,” squeaks, squawks, chirps, clicks, drips, pops, tones, or other sound artifacts that may be generated by some voice enhancement systems.



FIG. 1 is a partial block diagram of the voice enhancement logic 100. The voice enhancement logic may encompass hardware or software that is capable of running on one or more processors. The one or more processors may also be running zero, one or multiple operating systems. The highly portable logic includes a passing tire hiss noise detector 102 and a noise attenuator 104.


In FIG. 1 the passing tire hiss noise detector 102 may identify and model a noise associated with the hiss of tires of vehicles passing the receiver. While passing tire hiss noise occurs over a broad frequency range, the passing tire hiss noise detector 102 may be configured to detect and model the passing tire hiss noise that is received by the receiver at frequencies of interest. The passing tire hiss noise detector receives incoming sound that, in the short-term spectra, may be classified into three broad categories: (1) Noise, which is the undesired sounds that are not part of the original speech signal; (2) Speech, which is the desired sounds that are part of the original speech signal; (3) Noise plus speech, which is a mixture of (1) and (2).


Noise can be broadly divided into two categories: (1a) non-periodic noises, which include sounds like passing tire hiss, rain, and wind, and which share the traits that they usually occur at non-periodic intervals, do not have a harmonic frequency structure, and have a transient, short time duration; and (1b) periodic noises, which include repetitive sounds like turn indicator clicks, engine or drive train noise, and windshield wiper swooshes, and which may have some harmonic frequency structure due to their periodic nature. Speech can also be broadly divided into two categories: (2a) unvoiced speech, such as consonants, without harmonic or formant structure; and (2b) voiced speech, such as vowel sounds, which exhibits a regular harmonic structure, or harmonic peaks weighted by the spectral envelope that may describe the formant structure. Noise plus speech may comprise any mixture of non-periodic noises, periodic noises, unvoiced speech and/or voiced speech.


The passing tire hiss noise detector 102 may separate the noise-like segments from the remaining signal in a real or in a delayed time no matter how complex or how loud an incoming segment may be. The separated noise-like segments are analyzed to detect the occurrence of passing tire hiss noise, and in some instances, the presence of a continuous underlying noise. When passing tire hiss noise is detected, the spectrum is modeled, and the resulting passing tire hiss model is retained in a memory for use by the passing tire hiss noise attenuator 104. While the passing tire hiss noise detector 102 may store an entire model of a passing tire hiss noise signal, it also may store selected attributes in a memory. The stored passing tire hiss models may be used to create an average passing tire hiss model, or otherwise combined for future use by the passing tire hiss noise detector 102 or the passing tire hiss noise attenuator 104.


To overcome the effects of passing tire hiss noise, the passing tire hiss noise attenuator 104 substantially removes or dampens the passing tire hiss noise from the input signal. The voice enhancement logic 100 encompasses any system that substantially removes or dampens passing tire hiss noise. Examples of systems that may dampen or remove passing tire hiss noise include systems that use a signal and a passing tire hiss noise model such as (1) systems which use a neural network mapping of a noisy signal and a passing tire hiss model to a noise-reduced signal, (2) systems which subtract the passing tire hiss model from a noisy signal, (3) systems that use the noisy signal and the passing tire hiss model to select a noise-reduced signal from a code-book, (4) systems that in any other way use the noisy signal and the passing tire hiss model to create a noise-reduced signal based on a reconstruction or reduction of the masked signal. These systems may attenuate passing tire hiss noise, and in some instances, attenuate the continuous noise that may be part of the short-term spectra. The passing tire hiss noise attenuator 104 may also interface or include an optional residual attenuator that removes or dampens artifacts that may result in the processed signal. The residual attenuator may remove the “musical noise,” squeaks, squawks, chirps, clicks, drips, pops, tones, or other sound artifacts.



FIG. 2 is a time-frequency spectrogram illustrating a signal having a sequence of sounds comprising, from left to right, a simulated passing tire hiss noise 202, a voiced string of the digits “6702177” (indicated by reference characters 204, 206, 208, 210, 212, 214 and 216, respectively), and two real passing tire hiss noises 218 and 220. The simulated passing tire hiss noise 202 was generated using a broadband amplification in the frequency domain and a smoothly-varying function in the time domain that ramps smoothly upwardly then smoothly downwardly. Examples of suitable functions in the time domain include a Lorentzian function, a Gaussian function, a sine wave, and a smoothed triangular wave. As can be seen in FIG. 2, the simulated passing tire hiss noise 202 has a shape which is almost identical to the shapes of the two real passing tire hiss noises 218 and 220.
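
The construction of the simulated passing tire hiss noise 202 may be sketched as follows. This is a minimal illustration only, assuming uniform white noise as the broadband content and a Lorentzian function as the smoothly-varying time-domain envelope. The function names, the sample counts, and the `half_width` and `peak` parameters are hypothetical and do not appear in the description above.

```python
import random


def lorentzian_envelope(num_samples, center, half_width):
    """Smooth gain curve that rises to 1.0 at `center` and falls off again.

    `half_width` is the distance (in samples) at which the gain drops to 0.5.
    """
    return [1.0 / (1.0 + ((n - center) / half_width) ** 2)
            for n in range(num_samples)]


def simulate_tire_hiss(num_samples, center, half_width, peak=1.0, seed=0):
    """Broadband (white) noise shaped by a Lorentzian envelope in time."""
    rng = random.Random(seed)  # seeded so the example is repeatable
    envelope = lorentzian_envelope(num_samples, center, half_width)
    return [peak * gain * rng.uniform(-1.0, 1.0) for gain in envelope]


# One simulated pass-by: the hiss swells, peaks at sample 4000, then fades.
hiss = simulate_tire_hiss(num_samples=8000, center=4000, half_width=800)
```

A Gaussian or smoothed triangular envelope could be substituted for `lorentzian_envelope` without changing the rest of the sketch.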



FIG. 3 shows an example signal comprising passing tire hiss noise plus background noise, in the time-frequency domain. FIG. 4 shows an example signal comprising a vowel sound plus background noise, in the time-frequency domain. It can be seen from FIGS. 3 and 4 that the shape of passing tire hiss noise in the time-frequency domain is distinct from that of voiced signals such as vowel sounds. The passing tire hiss noise detector 102 may use time-frequency modeling to discriminate passing tire hiss noise from speech signals.



FIG. 5 is a block diagram of an example passing tire hiss noise detector 102 that may receive or detect an input signal comprising noise, speech, and/or noise plus speech. A received or detected signal is digitized at a predetermined frequency. To assure a good quality voice, the voice signal is converted to a pulse-code-modulated (PCM) signal by an analog-to-digital converter 502 (ADC) having any common sample rate. A smooth window 504 is applied to a block of data to obtain the windowed signal. The complex spectrum for the windowed signal may be obtained by means of a fast Fourier transform (FFT) 506 that separates the digitized signal into frequency bins, with each bin identifying an amplitude and phase across a small frequency range. The spectral components of the frequency bins may be monitored over time by a modeler 508.
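
The windowing and transform stages described above may be sketched as follows. This is a minimal illustration, assuming a Hann window as the smooth window 504 and a naive discrete Fourier transform standing in for the FFT 506 (a practical system would likely use an optimized FFT routine). The frame size, hop size, and function names are hypothetical.

```python
import cmath
import math


def hann_window(size):
    """Symmetric Hann window, one possible choice of smooth window."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / (size - 1))
            for n in range(size)]


def dft(frame):
    """Naive DFT of a real frame; returns complex bins 0..N/2."""
    size = len(frame)
    return [sum(x * cmath.exp(-2j * math.pi * k * n / size)
                for n, x in enumerate(frame))
            for k in range(size // 2 + 1)]


def frame_spectra(samples, frame_size=64, hop=32):
    """Split PCM samples into overlapping windowed blocks and transform each.

    Returns one list per block; each entry is an (amplitude, phase) pair
    for one frequency bin.
    """
    window = hann_window(frame_size)
    spectra = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        block = samples[start:start + frame_size]
        windowed = [w * x for w, x in zip(window, block)]
        spectra.append([(abs(c), cmath.phase(c)) for c in dft(windowed)])
    return spectra
```

The sequence of per-block spectra returned by `frame_spectra` is the kind of time-frequency data that a modeler such as modeler 508 could monitor over time.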


To detect a passing tire hiss, modeler 508 may fit a smoothly-varying function to a selected portion of the signal in the time-frequency domain. The smoothly-varying function may be a log-Lorentzian function, with a width determined by the speed of the passing vehicle generating the passing tire hiss noise, and a sharpness determined by the lateral distance of the passing vehicle from the receiver. A correlation between a smoothly-varying function and the signal envelope in the time domain over one or several frequency bands may identify a passing tire hiss. The correlation threshold at which a portion of the signal is identified as a passing tire hiss noise may depend on a desired clarity of a processed voice and the variations in width and sharpness of the passing tire hiss noise. Alternatively or additionally, the system may determine a probability that the signal includes passing tire hiss noise, and may identify a passing tire hiss noise when that probability exceeds a probability threshold. The correlation and probability thresholds may depend on various factors, including the presence of other noises or speech in the input signal. When the passing tire hiss noise detector 102 detects a passing tire hiss, the characteristics of the detected passing tire hiss may be provided to the passing tire hiss noise attenuator 104 for removal of the passing tire hiss noise.
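
The correlation test described above may be sketched as follows. This is a minimal illustration, assuming the signal envelope of one frequency band is available in decibels per frame, and using a Pearson correlation against log-Lorentzian templates of several candidate widths and centers. The candidate widths, the 0.9 threshold, and all function names are hypothetical. The sharpness parameter is omitted because a pure scaling of the template does not change a Pearson correlation.

```python
import math


def log_lorentzian(num_frames, center, width):
    """Lorentzian pulse expressed in dB (0 dB at `center`)."""
    return [-10.0 * math.log10(1.0 + ((t - center) / width) ** 2)
            for t in range(num_frames)]


def pearson(a, b):
    """Pearson correlation coefficient; 0.0 if either input is constant."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0.0 or var_b == 0.0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)


def detect_hiss(envelope_db, widths=(5, 10, 20, 40), threshold=0.9):
    """Search template centers and widths for the best-correlated fit.

    Returns (is_hiss, best_correlation, best_center, best_width).
    """
    num_frames = len(envelope_db)
    best_corr, best_center, best_width = 0.0, None, None
    for width in widths:
        for center in range(num_frames):
            template = log_lorentzian(num_frames, center, width)
            corr = pearson(envelope_db, template)
            if corr > best_corr:
                best_corr, best_center, best_width = corr, center, width
    return best_corr >= threshold, best_corr, best_center, best_width
```

In this sketch the best-fitting width and center play the role of the characteristics that may be handed to the passing tire hiss noise attenuator 104.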


As more windows of sound are processed, the passing tire hiss noise detector 102 may derive average noise models for the passing tire hiss. A time-smoothed or weighted average may be used to model the passing tire hiss and continuous noise estimates for each frequency bin. The average model may be updated when a passing tire hiss noise is detected in the absence of speech. Fully bounding a passing tire hiss noise when updating the average model may increase the probability of accurate detection.


To limit a masking of voice, the fitting of the smoothly-varying function to a suspected passing tire hiss noise may be constrained by rules. For example, a spectral flatness measure may be used to differentiate passing tire hiss noise from voiced signals, and may improve the accuracy of passing tire hiss noise detection, since passing tire hiss is broad spectrum noise and has a fairly smooth spectral shape, unlike voiced signals. Alternatively or additionally, in a vehicle equipped with MOST bus or similar technology, the voice enhancement logic 100 may be provided with information about whether or not the windows are open and passing tire hiss noise detection may be disabled or constrained when the windows are closed.
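
A spectral flatness measure of the kind mentioned above may be sketched as follows. This is a minimal illustration using the common definition of flatness as the ratio of the geometric mean to the arithmetic mean of the power spectrum. The function name and the small `floor` value that guards against taking the logarithm of zero are assumptions.

```python
import math


def spectral_flatness(power_spectrum, floor=1e-12):
    """Geometric mean divided by arithmetic mean of the bin powers.

    Values near 1.0 indicate a flat, noise-like spectrum; values near 0.0
    indicate a peaky spectrum such as a voiced sound with harmonics.
    """
    values = [max(p, floor) for p in power_spectrum]
    log_mean = sum(math.log(v) for v in values) / len(values)
    arithmetic_mean = sum(values) / len(values)
    return math.exp(log_mean) / arithmetic_mean


# A flat spectrum scores 1.0; a spectrum with strong harmonics scores low.
noise_like = spectral_flatness([1.0] * 32)
voiced_like = spectral_flatness([100.0 if k % 8 == 0 else 0.01
                                 for k in range(32)])
```

A rule could, for example, accept a candidate passing tire hiss only when its flatness exceeds a chosen value; that value would need to be tuned for the application.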


To overcome the effects of passing tire hiss noise, a passing tire hiss noise attenuator 104 may substantially remove or dampen the passing tire hiss noise from the signal by any method. One method may add the passing tire hiss model to a recorded or estimated continuous noise. In the power spectrum, the passing tire hiss model and continuous noise may then be subtracted from the unmodified signal. If an underlying speech signal is masked by a passing tire hiss or continuous noise, a conventional or modified interpolation method may be used to reconstruct the speech signal. A linear or step-wise interpolator may be used to reconstruct the missing part of the signal. An inverse FFT may then be used to convert the signal power to the time domain, which provides a reconstructed speech signal.
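
The subtraction step described above may be sketched as follows. This is a minimal illustration operating on one frame of per-bin power values. The spectral floor (`floor_ratio`), which keeps the result from going negative, is an assumption added for the sketch and is not stated in the description.

```python
def subtract_noise(signal_power, hiss_model_power, continuous_noise_power,
                   floor_ratio=0.01):
    """Subtract (hiss model + continuous noise) from each power bin.

    Each output bin is kept at or above `floor_ratio` times the input bin
    so that the subtraction never produces a negative power.
    """
    cleaned = []
    for s, h, c in zip(signal_power, hiss_model_power,
                       continuous_noise_power):
        combined_noise = h + c  # hiss model added to the continuous noise
        cleaned.append(max(s - combined_noise, floor_ratio * s))
    return cleaned


# Three bins: speech-dominated, fully masked, and below the noise estimate.
cleaned = subtract_noise([10.0, 4.0, 1.0], [3.0, 3.0, 3.0], [1.0, 1.0, 1.0])
```

Bins that end up at the floor are candidates for reconstruction by interpolation before the inverse transform.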


To minimize the “musical noise,” squeaks, squawks, chirps, clicks, drips, pops, or other sound artifacts, an optional residual attenuator may also condition the voice signal before it is converted to the time domain. The residual attenuator may be combined with a passing tire hiss noise attenuator 104, combined with one or more other elements, or comprise a separate element.


The residual attenuator may track the power spectrum within a mid to high frequency range (e.g., from about 400 Hz up to about the Nyquist frequency, which is about one half the sample rate). When a large increase in signal power is detected an improvement may be obtained by limiting or dampening the transmitted power in the mid to high frequency range to a predetermined or calculated threshold. A calculated threshold may be equal to, or based on, the average spectral power of that same mid to high frequency range at an earlier period in time.
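
The limiting behavior of the residual attenuator may be sketched as follows. This is a minimal illustration, assuming the calculated threshold is a fixed multiple (`max_jump`) of the average band power from an earlier period. The multiple, the parameter names, and the uniform scaling of the band are assumptions.

```python
def limit_band_power(frame_power, bin_hz, history_avg, low_hz=400.0,
                     max_jump=4.0):
    """Dampen a sudden power increase in the mid to high frequency range.

    frame_power : per-bin power for the current frame (bin k is k * bin_hz)
    history_avg : average power of the same band at an earlier period
    Bins at or above `low_hz` are scaled down when their average power
    exceeds `max_jump * history_avg`; otherwise the frame is unchanged.
    """
    first = next((k for k in range(len(frame_power))
                  if k * bin_hz >= low_hz), len(frame_power))
    band = frame_power[first:]
    if not band:
        return list(frame_power)
    band_avg = sum(band) / len(band)
    limit = max_jump * history_avg
    if band_avg <= limit:
        return list(frame_power)
    gain = limit / band_avg
    return list(frame_power[:first]) + [p * gain for p in band]
```

For example, with bins spaced 125 Hz apart, the bins at 500 Hz and above would be scaled while those below 400 Hz pass through untouched.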


Further improvements to voice quality may be achieved by pre-conditioning the input signal before it is processed by the passing tire hiss noise detector 102. One pre-processing system may exploit the lag time caused by a signal arriving at different times at detectors that are positioned apart, as shown in FIG. 6. If multiple detectors or microphones 602 are used that convert sound into an electric signal, the pre-processing system may include a controller 604 that automatically selects the microphone 602 and channel that senses the least amount of noise. When another microphone 602 is selected, the electric signal may be combined with the previously generated signal before being processed by the passing tire hiss noise detector 102.


Alternatively, passing tire hiss noise detection may be performed on each of the channels. A mixing of one or more channels may occur by switching between the outputs of the microphones 602. Alternatively or additionally, the controller 604 may include a comparator, and a direction of the signal may be detected from differences in the amplitude or timing of signals received from the microphones 602. Direction detection may be improved by pointing the microphones 602 in different directions. The passing tire hiss noise detection may be made more sensitive for signals originating outside of the vehicle.
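
The timing comparison between microphones may be sketched as follows. This is a minimal illustration, assuming two equally sampled microphone signals and using a brute-force cross-correlation to find the lag at which they best align. The function name and the `max_lag` search range are hypothetical, and a comparator in controller 604 need not work this way.

```python
def estimate_lag(reference, other, max_lag):
    """Return the lag (in samples) at which `other` best matches `reference`.

    A positive result means the sound reached the `other` microphone later
    than the `reference` microphone.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for n, x in enumerate(reference):
            m = n + lag
            if 0 <= m < len(other):
                score += x * other[m]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


# A short pulse that arrives three samples later at the second microphone.
mic_a = [0.0] * 20
mic_a[5], mic_a[6] = 1.0, 0.5
mic_b = [0.0] * 20
mic_b[8], mic_b[9] = 1.0, 0.5
lag = estimate_lag(mic_a, mic_b, max_lag=6)
```

The sign of the lag indicates which microphone the sound reached first, which, together with the known microphone positions, suggests a direction of arrival.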


The signals may be evaluated at only frequencies above a certain threshold (for example, by using a high-pass filter) which are of interest in certain applications. The threshold frequency may be updated over time as the average passing tire hiss model learns the expected frequencies of passing tire hiss noises. For example, when passing vehicles are traveling at high speeds, the threshold frequency for passing tire hiss noise detection may be set relatively high, since the maximum frequency of passing tire hiss noise increases with vehicle speed. Alternatively, controller 604 may combine the output signals of multiple microphones 602 at a specific frequency or frequency range through a weighting function.



FIG. 7 shows alternative voice enhancement logic 700 that also improves the perceptual quality of a processed voice. The enhancement is accomplished by time-frequency transform logic 702 that digitizes and converts a time varying signal to the frequency domain. A background noise estimator 704 measures the continuous or ambient noise that occurs near a sound source or the receiver. The background noise estimator 704 may comprise a power detector that averages the acoustic power in each frequency bin in the power, magnitude, or logarithmic domain.


To prevent biased background noise estimations at transients, a transient detector 706 may disable or modulate the background noise estimation process during abnormal or unpredictable increases in power. In FIG. 7, the transient detector 706 disables the background noise estimator 704 when an instantaneous background noise B(f, i) exceeds an average background noise B(f)Ave by more than a selected decibel level ‘c.’ This relationship may be expressed as:

$B(f, i) > B(f)_{Ave} + c$  (Equation 1)

Alternatively or additionally, the average background noise may be updated depending on the signal to noise ratio (SNR). An example closed algorithm is one which adapts a leaky integrator depending on the SNR:

$B(f)_{Ave}' = a\,B(f)_{Ave} + (1 - a)\,S$  (Equation 2)

where a is a function of the SNR and S is the instantaneous signal. In this example, the higher the SNR, the slower the average background noise is adapted.
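
Equations 1 and 2 may be combined in a per-bin update, sketched below. This is a minimal illustration, assuming all levels are expressed in decibels. The linear mapping from SNR to the coefficient a in `smoothing_from_snr`, together with its limits `a_min`, `a_max`, and `snr_max`, is hypothetical; the description states only that a is a function of the SNR and that a higher SNR gives slower adaptation.

```python
def smoothing_from_snr(snr_db, a_min=0.9, a_max=0.99, snr_max=20.0):
    """Map SNR to the leaky-integrator coefficient a (assumed linear map).

    A higher SNR gives a larger a, so the average adapts more slowly.
    """
    fraction = min(max(snr_db / snr_max, 0.0), 1.0)
    return a_min + (a_max - a_min) * fraction


def update_background(avg_db, inst_db, c_db, a):
    """Update the per-bin average background noise estimate.

    Equation 1: if the instantaneous level exceeds the average by more
    than c_db, the bin is treated as a transient and its estimate is held.
    Equation 2: otherwise the estimate is updated by a leaky integrator.
    """
    updated = []
    for avg, inst in zip(avg_db, inst_db):
        if inst > avg + c_db:
            updated.append(avg)
        else:
            updated.append(a * avg + (1.0 - a) * inst)
    return updated


# Bin 0 jumps by 20 dB (held); bin 1 rises by 2 dB (averaged in).
new_avg = update_background([10.0, 10.0], [30.0, 12.0], c_db=6.0, a=0.5)
```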


To detect a passing tire hiss, passing tire hiss noise detector 708 may fit a smoothly-varying function to a selected portion of the signal in the time-frequency domain. The smoothly-varying function may be a log-Lorentzian function, with a width determined by the speed of the passing vehicle generating the passing tire hiss noise, and a sharpness determined by the lateral distance of the passing vehicle from the receiver. A correlation between a smoothly-varying function and the signal envelope in the time domain over one or more frequency bands may identify a passing tire hiss. The correlation threshold at which a portion of the signal is identified as a passing tire hiss noise may depend on a desired clarity of a processed voice and the variations in width and sharpness of the passing tire hiss noise. Alternatively or additionally, the system may determine a probability that the signal includes passing tire hiss noise, and may identify a passing tire hiss noise when that probability exceeds a probability threshold. The correlation and probability thresholds may depend on various factors, including the presence of other noises or speech in the input signal. When the noise detector 708 detects a passing tire hiss, the characteristics of the detected passing tire hiss may be provided to the noise attenuator 712 for removal of the passing tire hiss noise.


A signal discriminator 710 may mark the voice and noise of the spectrum in real or delayed time. Any method may be used to distinguish voice from noise. Spoken signals may be identified by (1) the narrow widths of their bands or peaks; (2) the broad resonances, which are also known as formants, which may be created by the vocal tract shape of the person speaking; (3) the rate at which certain characteristics change with time (i.e., a time-frequency model can be developed to identify spoken signals based on how they change with time); and when multiple detectors or microphones are used, (4) the correlation, differences, or similarities of the output signals of the detectors or microphones.



FIG. 8 is a flow diagram of a voice enhancement that removes some passing tire hiss noise and continuous noise to enhance the perceptual quality of a processed voice. At act 802 a received or detected signal is digitized at a predetermined frequency. To assure a good quality voice, the voice signal may be converted to a PCM signal by an ADC. At act 804 a complex spectrum for the windowed signal may be obtained by means of an FFT that separates the digitized signals into frequency bins, with each bin identifying an amplitude and a phase across a small frequency range.


At act 806, a continuous or ambient noise is measured. The background noise estimate may comprise an average of the acoustic power in each frequency bin. To prevent biased noise estimations at transients, the noise estimation process may be disabled during abnormal or unpredictable increases in power at act 808. The transient detection act 808 disables the background noise estimate when an instantaneous background noise exceeds an average background noise by more than a predetermined decibel level.


At act 810, a passing tire hiss noise may be detected when a high correlation exists between a smoothly-varying function and the temporal and/or spectral characteristics of the input signal in the time and/or frequency domains. The detection of a passing tire hiss noise may be constrained by one or more optional acts. For example, if a vowel or another harmonic structure is detected, the passing tire hiss noise detection method may limit the passing tire hiss noise correction to values less than or equal to average values. An additional optional act may allow the average passing tire hiss model or attributes to be updated only during unvoiced segments. If a speech segment or a speech-mixed-with-noise segment is detected, the average passing tire hiss model or attributes are not updated under this act. If no speech is detected, the passing tire hiss model or each attribute may be updated through many means, such as through a weighted average or a leaky integrator. Many other optional acts may also be applied to the model.
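
The two optional acts described above may be sketched as follows. This is a minimal illustration, assuming the correction and the models are stored as per-bin values, and that separate logic supplies the `harmonic_detected` and `speech_present` flags. The function names and the `weight` parameter are hypothetical.

```python
def constrained_correction(correction, average_model, harmonic_detected):
    """Cap the per-bin correction at the average model when a vowel or
    other harmonic structure is present; otherwise pass it through."""
    if not harmonic_detected:
        return list(correction)
    return [min(c, avg) for c, avg in zip(correction, average_model)]


def update_average_model(average_model, detected_model, speech_present,
                         weight=0.1):
    """Blend a newly detected hiss model into the running average.

    The average is left untouched whenever speech (or speech mixed with
    noise) is present; otherwise a weighted average is applied per bin.
    """
    if speech_present:
        return list(average_model)
    return [(1.0 - weight) * avg + weight * new
            for avg, new in zip(average_model, detected_model)]
```

A leaky integrator with a signal-dependent weight could replace the fixed `weight` used here.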


If passing tire hiss noise is detected at act 810, at act 814, a signal analysis may discriminate or mark the spoken signal from the noise-like segments. Spoken signals may be identified by (1) the narrow widths of their bands or peaks; (2) the broad resonances, which are also known as formants, which may be created by the vocal tract shape of the person speaking; (3) the rate at which certain characteristics change with time (i.e., a time-frequency model can be developed to identify spoken signals based on how they change with time); and when multiple detectors or microphones are used, (4) the correlation, differences, or similarities of the output signals of the detectors or microphones.


To overcome the effects of passing tire hiss noise, a passing tire hiss noise is substantially removed or dampened from the noisy spectrum by any act. One exemplary act 816 adds the smoothly varying passing tire hiss model to a recorded or modeled continuous noise. In the power spectrum, the modeled noise may then be substantially removed from the unmodified spectrum by the methods and systems described above. If an underlying speech signal is masked by a passing tire hiss noise, or masked by a continuous noise, a conventional or modified interpolation method may be used to reconstruct the speech signal at act 818. A time series synthesis may then be used to convert the signal power to the time domain at act 820, which provides a reconstructed speech signal. If no passing tire hiss noise is detected at act 810, at act 820 the signal is converted into the time domain to provide the reconstructed speech signal.


Alternatively, a passing tire hiss noise attenuator may substantially remove or dampen the passing tire hiss from the signal by any method. One method may add the passing tire hiss model to a recorded or estimated continuous noise. In the power spectrum, the passing tire hiss model and the continuous noise may then be subtracted from the unmodified signal. If an underlying speech signal is masked by passing tire hiss or continuous noise, a conventional or modified interpolation method may be used to reconstruct the speech signal. FIG. 9 shows an example signal comprising both a vowel sound and a passing tire hiss noise. FIG. 10 shows the signal with the passing tire hiss removed, and FIG. 11 shows the signal with a reconstructed vowel sound. A linear or step-wise interpolator may be used to reconstruct the missing part of the signal. An inverse FFT may then be used to convert the signal power to the time domain, which provides a reconstructed voice signal.
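
The linear interpolator mentioned above may be sketched as follows. This is a minimal illustration, assuming the values are the magnitudes of one frequency bin across successive frames and that a separate stage has flagged which frames were masked. The function name and the fallback of holding the nearest known value at the edges (a step-wise fill) are assumptions.

```python
def interpolate_masked(values, masked):
    """Fill masked entries from their nearest unmasked neighbours.

    Entries with a known value on both sides are linearly interpolated;
    entries at either edge copy the single nearest known value.
    """
    result = list(values)
    known = [i for i, is_masked in enumerate(masked) if not is_masked]
    if not known:
        return result  # nothing to interpolate from
    for i, is_masked in enumerate(masked):
        if not is_masked:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            result[i] = values[right]
        elif right is None:
            result[i] = values[left]
        else:
            fraction = (i - left) / (right - left)
            result[i] = values[left] + fraction * (values[right] - values[left])
    return result


# Frames 1 and 2 were masked by the hiss; they are rebuilt from frames 0 and 3.
rebuilt = interpolate_masked([1.0, 0.0, 0.0, 4.0], [False, True, True, False])
```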


The method shown in FIG. 8 may be encoded in a signal bearing medium, a computer readable medium such as a memory, programmed within a device such as one or more integrated circuits, or processed by a controller or a computer. If the methods are performed by software, the software may reside in a memory resident to or interfaced to the passing tire hiss noise detector 102, a communication interface, or any other type of non-volatile or volatile memory interfaced or resident to the voice enhancement logic 100 or 700. The memory may include an ordered listing of executable instructions for implementing logical functions. A logical function may be implemented through digital circuitry, through source code, through analog circuitry, or through an analog source such as through an analog electrical, audio, or video signal. The software may be embodied in any computer-readable or signal-bearing medium, for use by, or in connection with an instruction executable system, apparatus, or device. Such a system may include a computer-based system, a processor-containing system, or another system that may selectively fetch instructions from an instruction executable system, apparatus, or device that may also execute instructions.


A “computer-readable medium,” “machine-readable medium,” “propagated-signal” medium, and/or “signal-bearing medium” may comprise any means that contains, stores, communicates, propagates, or transports software for use by or in connection with an instruction executable system, apparatus, or device. The machine-readable medium may be, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium. A non-exhaustive list of examples of a machine-readable medium would include: an electrical connection (electronic) having one or more wires, a portable magnetic or optical disk, a volatile memory such as a Random Access Memory “RAM” (electronic), a Read-Only Memory “ROM” (electronic), an Erasable Programmable Read-Only Memory (EPROM or Flash memory) (electronic), or an optical fiber (optical). A machine-readable medium may also include a tangible medium upon which software is printed, as the software may be electronically stored as an image or in another format (e.g., through an optical scan), then compiled, and/or interpreted or otherwise processed. The processed medium may then be stored in a computer and/or machine memory.


The above-described systems may condition signals received from only one or more than one microphone or detector. Many combinations of systems may be used to identify and track passing tire hiss noises. Besides the fitting of a smoothly varying function to a suspected passing tire hiss, a system may detect and isolate any parts of the signal having greater energy than the modeled passing tire hiss. One or more of the systems described above may also be used in alternative voice enhancement logic.


Other alternative voice enhancement systems include combinations of the structure and functions described above. These voice enhancement systems are formed from any combination of structure and function described above or illustrated within the attached figures. The logic may be implemented in software or hardware. The term “logic” is intended to broadly encompass a hardware device or circuit, software, or a combination. The hardware may include a processor or a controller having volatile and/or non-volatile memory and may also include interfaces to peripheral devices through wireless and/or hardwire mediums.


The voice enhancement logic is readily adaptable to any technology or device. Some voice enhancement systems or components interface or couple with vehicles as shown in FIG. 12, with instruments that convert voice and other sounds into a form that may be transmitted to remote locations, such as landline and wireless telephones and audio equipment as shown in FIG. 13, and with other communication systems that may be susceptible to passing tire hiss noise.


The voice enhancement logic improves the perceptual quality of a processed voice. The logic may automatically learn and encode the shape and form of the noise associated with passing tire hiss in real time or in delayed time. By tracking selected attributes, the logic may eliminate, substantially eliminate, or dampen passing tire hiss noise using a limited memory that temporarily or permanently stores selected attributes of the passing tire hiss noise. The voice enhancement logic may also dampen a continuous noise and/or the squeaks, squawks, chirps, clicks, drips, pops, tones, or other sound artifacts that may be generated within some voice enhancement systems and may reconstruct voice when needed.
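
One way to picture dampening from a limited memory of selected attributes is the Python sketch below. It assumes, purely for illustration, that the stored attributes are three Lorentzian parameters per frequency band and that the dampening is a power-domain subtraction gain with a floor. The class `HissModel`, the function `attenuation_gains`, and the floor of 0.1 are hypothetical and are not taken from this description.

```python
from dataclasses import dataclass


@dataclass
class HissModel:
    """Compact stored attributes of one modeled hiss pulse (three numbers)."""
    amplitude: float
    center: float
    width: float

    def energy_at(self, frame):
        # Lorentzian pulse in time; an assumed model shape for this sketch.
        return self.amplitude / (1.0 + ((frame - self.center) / self.width) ** 2)


def attenuation_gains(energy, model, floor=0.1):
    """Per-frame power gains that remove the modeled hiss energy.

    gain = 1 - hiss / observed, limited below by `floor` so that frames
    dominated by hiss are dampened instead of zeroed.
    """
    gains = []
    for frame, observed in enumerate(energy):
        hiss = model.energy_at(frame)
        gain = 1.0 - hiss / observed if observed > 0 else floor
        gains.append(max(floor, gain))
    return gains


# Frames that contain only the modeled hiss are pushed to the floor, while a
# frame carrying additional energy (frame 2 here) keeps most of its power.
model = HissModel(amplitude=4.0, center=2, width=1.0)
energy = [model.energy_at(t) for t in range(5)]
energy[2] += 12.0
gains = attenuation_gains(energy, model)
```

Storing only the model parameters, and not the raw signal, keeps the memory footprint small, which is the point the sketch is meant to convey.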


While various embodiments of the invention have been described, it will be apparent to those of ordinary skill in the art that many more embodiments and implementations are possible within the scope of the invention. Accordingly, the invention is not to be restricted except in light of the attached claims and their equivalents.

Claims
  • 1. A passing tire hiss noise attenuation system, comprising: a noise detector configured to compare an input signal to a passing tire hiss model and identify whether a noise in the input signal is passing tire hiss; and a noise attenuator coupled with the noise detector and configured to attenuate at least a portion of the identified passing tire hiss from the input signal to generate an output signal with reduced passing tire hiss noise.
  • 2. The system of claim 1, where the noise detector is configured to identify whether the input signal includes the passing tire hiss by fitting a function to a portion of the input signal.
  • 3. The system of claim 1, where the noise detector is configured to identify whether the input signal includes the passing tire hiss by fitting a function to a portion of the input signal in a time-frequency domain.
  • 4. The system of claim 1, where the noise detector is configured to identify whether the input signal includes the passing tire hiss by fitting a Lorentzian function to a portion of the input signal in a time-frequency domain.
  • 5. The system of claim 1, where the noise detector is configured to identify whether the input signal includes the passing tire hiss by fitting a smoothly varying function to a portion of the input signal.
  • 6. The system of claim 1 where the noise detector is configured to separate noise-like segments of the input signal from remaining portions of the input signal, and where the noise detector is configured to analyze the noise-like segments to identify whether the noise-like segments include passing tire hiss noise.
  • 7. The system of claim 6 where the noise detector is configured to derive the passing tire hiss model when the noise-like segments include passing tire hiss noise, where the noise detector is configured to store the passing tire hiss model in memory, and where the noise attenuator is configured to use the passing tire hiss model stored in memory to remove passing tire hiss from the input signal.
  • 8. The system of claim 1, where the noise detector is configured to receive information from an automotive bus about whether windows of a vehicle are open or closed, and where the noise detector is configured to disable or constrain passing tire hiss noise detection when the information indicates that the windows are closed.
  • 9. The system of claim 1 where the noise detector comprises a processor configured to run logic to detect the passing tire hiss from the input signal.
  • 10. A method of attenuating passing tire hiss noise, comprising: receiving an input signal; identifying, by a noise detector that comprises a processor configured to run logic to detect passing tire hiss, whether a noise in the input signal is passing tire hiss based on a comparison between the input signal and a passing tire hiss model; and attenuating at least a portion of the identified passing tire hiss from the input signal to generate an output signal with reduced passing tire hiss noise.
  • 11. The method of claim 10, where the step of identifying comprises identifying whether the input signal includes the passing tire hiss by fitting a function to a portion of the input signal.
  • 12. The method of claim 10, where the step of identifying comprises identifying whether the input signal includes the passing tire hiss by fitting a function to a portion of the input signal in a time-frequency domain.
  • 13. The method of claim 10, where the step of identifying comprises identifying whether the input signal includes the passing tire hiss by fitting a Lorentzian function to a portion of the input signal in a time-frequency domain.
  • 14. The method of claim 10, where the step of identifying comprises identifying whether the input signal includes the passing tire hiss by fitting a smoothly varying function to a portion of the input signal.
  • 15. The method of claim 10, where the step of identifying comprises: separating noise-like segments of the input signal from remaining portions of the input signal; and analyzing the noise-like segments to identify whether the noise-like segments include passing tire hiss noise.
  • 16. The method of claim 15, further comprising: deriving the passing tire hiss model when the noise-like segments include passing tire hiss noise; storing the passing tire hiss model in memory; and removing passing tire hiss from the input signal based on the passing tire hiss model stored in memory.
  • 17. The method of claim 10, further comprising: receiving information from an automotive bus about whether windows of a vehicle are open or closed; and disabling or constraining passing tire hiss noise detection when the information indicates that the windows are closed.
  • 18. A non-transitory computer-readable medium with instructions stored thereon, where the instructions are executable by a processor to cause the processor to perform the steps of: comparing an input signal to a passing tire hiss model; identifying whether a noise in the input signal is passing tire hiss based on the comparison between the input signal and the passing tire hiss model; and attenuating at least a portion of the identified passing tire hiss from the input signal to generate an output signal with reduced passing tire hiss noise.
  • 19. The non-transitory computer-readable medium of claim 18, where the step of identifying comprises the step of identifying whether the input signal includes the passing tire hiss by fitting a function to a portion of the input signal in a time-frequency domain.
  • 20. The non-transitory computer-readable medium of claim 18, where the step of identifying comprises identifying whether the input signal includes the passing tire hiss by fitting a smoothly varying function to a portion of the input signal.
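
For illustration only, the window-state condition recited in claims 8 and 17 might be sketched in Python as follows. The sketch assumes the automotive bus information has already been decoded into a boolean, and that "constraining" detection means demanding a stricter detection threshold. The names `GateMode` and `detection_threshold`, and the factor of 2.0, are hypothetical and do not appear in the claims.

```python
from enum import Enum


class GateMode(Enum):
    DISABLE = "disable"      # turn detection off when windows are closed
    CONSTRAIN = "constrain"  # keep detection on, but make it stricter


def detection_threshold(windows_open, base_threshold,
                        mode=GateMode.DISABLE, constrain_factor=2.0):
    """Return the threshold the detector should use, or None if disabled.

    `windows_open` stands in for the decoded automotive bus information.
    """
    if windows_open:
        return base_threshold
    if mode is GateMode.DISABLE:
        return None
    # Assumed meaning of "constrain": require stronger evidence of hiss.
    return base_threshold * constrain_factor


open_case = detection_threshold(True, 0.5)
closed_disabled = detection_threshold(False, 0.5)
closed_constrained = detection_threshold(False, 0.5, GateMode.CONSTRAIN)
```
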
PRIORITY CLAIM

This application is a continuation of prior U.S. patent application Ser. No. 11/125,052, filed May 9, 2005, now U.S. Pat. No. 8,027,833, which is incorporated by reference.

Related Publications (1)
Number Date Country
20110311068 A1 Dec 2011 US
Continuations (1)
Number Date Country
Parent 11125052 May 2005 US
Child 13223863 US