Claims
- 1. A method of preserving perceptually relevant non-speech information in an audio signal during encoding of the audio signal, comprising:making a first determination of whether the audio signal is considered to comprise speech or noise information; making a second determination of whether the audio signal includes non-speech information that is perceptually relevant to a listener; and selectively overriding said first determination in response to said second determination.
- 2. The method of claim 1, wherein said step of making said second determination includes the additional steps of:determining, from the audio signal, correlation values using an open-loop long term prediction correlation analysis; and comparing a predetermined value to the correlation values associated with respective frames into which the audio signal is divided.
- 3. The method of claim 2, wherein said selectively overriding step includes overriding said first determination in response to a correlation value exceeding the predetermined value.
- 4. The method of claim 2, wherein said selectively overriding step includes overriding said first determination in response to a predetermined number of correlation values in a given time period exceeding the predetermined value.
- 5. The method of claim 4, wherein said selectively overriding step includes overriding said first determination in response to a predetermined number of consecutive correlation values exceeding the predetermined value.
- 6. The method of claim 2, including, for each said frame, finding a highest normalized correlation value of a high pass filtered version of the audio signal, said highest normalized correlation values respectively corresponding to said first-mentioned correlation values.
- 7. The method of claim 6, wherein said finding step includes, for each of the frames, finding a largest-magnitude normalized correlation value.
- 8. The method of claim 1, wherein said selectively overriding step includes overriding a first determination of noise in response to a second determination of perceptually relevant non-speech information.
- 9. A method of preserving perceptually relevant information in an audio signal, comprising:for each of a plurality of frames into which the audio signal is divided, finding a highest normalized correlation value of a high pass filter version of the audio signal by using an open-loop long term prediction correlation analysis; producing a first sequence of said normalized correlation values; determining a second sequence of representative values to represent respectively the normalized correlation values of the fist sequence; and comparing the representative values to a threshold value to obtain an indication of whether the audio signal contains perceptually relevant non-speech information.
- 10. The method of claim 9, wherein said finding step includes applying correlation analysis to the audio signal without producing the high pass filtered version of the audio signal.
- 11. The method of claim 9, wherein said finding step includes high pass filtering the audio signal and thereafter applying correlation analysis to the high pass filtered audio signal.
- 12. The method of claim 9, wherein said finding step includes, for each of the frames, finding a largest-magnitude normalized correlation value.
- 13. An apparatus for use in an audio signal encoder to preserve perceptually relative non-speech information contained in an audio signal, comprising:a classifier for receiving the audio signal and making a first determination of whether the audio signal is considered to comprise speech or noise information; a detector for receiving the audio signal and making a second determination of whether the audio signal includes non-speech information that is perceptually relevant to a listener; and logic coupled to said classifier and said detector, said logic having an output for indicating whether the audio signal includes perceptually relevant information, said logic operable to selectively provide at said output information indicative of said first determination, and also responsive to said second determination for selectively overriding at said output said information indicative of said first determination.
- 14. The apparatus of claim 13, wherein said detector is operable for comparing a predetermined value to correlation values associated with respective frames into which the audio signal is divided.
- 15. The apparatus of claim 14, wherein said logic is operable for overriding said information indicative of said first determination in response to a correlation value exceeding the predetermined value.
- 16. The apparatus of claim 14, wherein said logic is operable for overriding said information indicative of said first determination in response to a predetermined number of correlation values in a given time period exceeding the predetermined value.
- 17. The apparatus of claim 16, wherein said logic is operable for overriding said information indicative of said first determination in response to a predetermined number of consecutive correlation values associated with timewise consecutive frames exceeding the predetermined value.
- 18. The apparatus of claim 14, wherein said detector is operable for finding within each of said frames a highest normalized correlation value of a high pass filtered version of the audio signal, said highest normalized correlation values corresponding respectively to said first-mentioned correlation values.
- 19. The apparatus of claim 18, wherein each of said highest normalized correlation values represents a largest-magnitude normalized correlation value within the associated frame.
- 20. The apparatus of claim 13, wherein said logic is operable for overriding information indicative of a noise determination in response to said second determination indicating perceptually relevant non-speech information.
Parent Case Info
This application claims the priority under 35 USC 119(e)(1) of U.S. Provisional Application No. 60/109,556, filed on Nov. 23, 1998.
US Referenced Citations (9)
Provisional Applications (1)
|
Number |
Date |
Country |
|
60/109556 |
Nov 1998 |
US |