Claims
- 1. A system for efficiently determining a masking threshold to encode audio data, comprising:a psycho-acoustic modeler that includes a modeler manager configured to determine said masking threshold by analyzing said audio data using one or more linear parameters that are stored in non-logarithmic form, and a microprocessor configured to control said modeler manager to thereby determine said masking threshold.
- 2. The system of claim 1 wherein a bit allocator in an audio encoder device receives said masking threshold from said psycho-acoustic modeler, and responsively encodes only selected portions of said audio data with energy values in excess of said masking threshold to thereby conserve audio encoding resources.
- 3. The system of claim 1 wherein said psycho-acoustic modeler is implemented in one of a digital versatile disc device, a consumer electronics device, a computer device, and an electronic audio device.
- 4. The system of claim 1 wherein said microprocessor is implemented as a digital signal processor device that executes said modeler manager to thereby determine said masking threshold.
- 5. The system of claim 1 wherein said linear parameters include at least one of a masking component intensity value, a non-logarithmic mask index value, and a non-logarithmic spread function value.
- 6. The system of claim 4 wherein said masking threshold is formed of a series of respective minimum values of a global masking threshold across a series of critical frequency bands of said audio data, said global masking threshold being equal to the sum of an absolute masking threshold and a series of individual piecewise linear spread functions that each correspond to at least one of an associated tonal component and an associated noise component.
- 7. The system of claim 1 wherein said psycho-acoustic modeler includes at least one of a non-logarithmic tonal mask-index lookup table, a non-logarithmic noise mask-index lookup table, an intensity-independent spread-function factor lookup table, and an exponential function lookup table for calculating an intensity-dependent spread-function factor.
- 8. The system of claim 1 wherein said modeler manager identifies a masking component in said audio data, said masking component having an intensity factor X, said masking component being one of a tonal component and a noise component.
- 9. The system of claim 8 wherein said modeler manager performs a Fast Fourier Transform on said masking component before determining said intensity value X corresponding to said masking component.
- 10. The system of claim 8 wherein said modeler manager determines a component type corresponding to said masking component, said component type including at least one of said tonal component and said noise component.
- 11. The system of claim 10 wherein said modeler manager references a non-logarithmic mask-index lookup table to determine a mask index value AV corresponding to said masking component.
- 12. The system of claim 10 wherein said modeler manager references a non-logarithmic tonal mask-index lookup table to determine said mask index value AV when said masking component is said tonal component.
- 13. The system of claim 10 wherein said modeler manager references a non-logarithmic noise mask-index lookup table to determine said mask index value AV when said masking component is said noise component.
- 14. The system of claim 11 wherein said modeler manager calculates a spread function value VF corresponding to said masking component.
- 15. The system of claim 14 wherein said spread function value VF may be expressed by a formula:VF=Factor F*Factor G where said Factor F is a masker-component intensity-independent factor that depends upon a component frequency of said masking component, and said Factor G is a masker-component intensity-dependent factor that depends upon said intensity value X of said masking component.
- 16. The system of claim 15 wherein said modeler manager determines Factor F by referencing a non-logarithmic intensity-independent factor lookup table.
- 17. The system of claim 15 wherein said modeler manager utilizes an exponential-function lookup table during a calculation procedure to determine said Factor G.
- 18. The system of claim 14 wherein said modeler manager determines said masking threshold according to a formula:Masking Threshold=X*AV*VF where said X is said intensity value X, said AV is said mask index value AV, and said VF is said spread function value VF.
- 19. The system of claim 18 wherein said modeler manager sequentially recalculates a different respective value for said masking threshold corresponding to each of said masking components from said audio data to thereby produce a total tonal masking threshold and a total noise masking threshold.
- 20. The system of claim 19 wherein said modeler manager combines said total tonal masking threshold and said total noise masking threshold to thereby produce a total combined masking threshold for use in encoding said audio data.
- 21. A method for efficiently determining a masking threshold to encode audio data, comprising the steps of:determining said masking threshold with a modeler manager from a psycho-acoustic modeler by analyzing said audio data using one or more linear parameters that are stored in non-logarithmic form; and controlling said modeler manager with a microprocessor coupled to said psycho-acoustic modeler to thereby determine said masking threshold.
- 22. The method of claim 21 wherein a bit allocator in an audio encoder device receives said masking threshold from said psycho-acoustic modeler, and responsively encodes only selected portions of said audio data with energy values in excess of said masking threshold to thereby conserve audio encoding resources.
- 23. The method of claim 21 wherein said psycho-acoustic modeler is implemented in one of a digital versatile disc device, a consumer electronics device, a computer device, and an electronic audio device.
- 24. The method of claim 21 wherein said microprocessor is implemented as a digital signal processor device that executes said modeler manager to thereby determine said masking threshold.
- 25. The method of claim 21 wherein said linear parameters include at least one of a masking component intensity value, a non-logarithmic mask index value, and a non-logarithmic spread function value.
- 26. The method of claim 24 wherein said masking threshold is formed of a series of respective minimum values of a global masking threshold across a series of critical frequency bands of said audio data, said global masking threshold being equal to the sum of an absolute masking threshold and a series of individual piecewise linear spread functions that each correspond to at least one of an associated tonal component and an associated noise component.
- 27. The method of claim 21 wherein said psycho-acoustic modeler includes at least one of a non-logarithmic tonal mask-index lookup table, a non-logarithmic noise mask-index lookup table, an intensity-independent spread-function factor lookup table, and an exponential function lookup table for calculating an intensity-dependent spread-function factor.
- 28. The method of claim 21 wherein said modeler manager identifies a masking component in said audio data, said masking component having an intensity factor X, said masking component being one of a tonal component and a noise component.
- 29. The method of claim 28 wherein said modeler manager performs a Fast Fourier Transform on said masking component before determining said intensity value X corresponding to said masking component.
- 30. The method of claim 28 wherein said modeler manager determines a component type corresponding to said masking component, said component type including at least one of said tonal component and said noise component.
- 31. The method of claim 30 wherein said modeler manager references a non-logarithmic mask-index lookup table to determine a mask index value AV corresponding to said masking component.
- 32. The method of claim 30 wherein said modeler manager references a non-logarithmic tonal mask-index lookup table to determine said mask index value AV when said masking component is said tonal component.
- 33. The method of claim 30 wherein said modeler manager references a non-logarithmic noise mask-index lookup table to determine said mask index value AV when said masking component is said noise component.
- 34. The method of claim 31 wherein said modeler manager calculates a spread function value VF corresponding to said masking component.
- 35. The method of claim 34 wherein said spread function value VF may be expressed by a formula:VF=Factor F*Factor G where said Factor F is a masker-component intensity-independent factor that depends upon a component frequency of said masking component, and said Factor G is a masker-component intensity-dependent factor that depends upon said intensity value X of said masking component.
- 36. The method of claim 35 wherein said modeler manager determines Factor F by referencing a non-logarithmic intensity-independent factor lookup table.
- 37. The method of claim 35 wherein said modeler manager utilizes an exponential-function lookup table during a calculation procedure to determine said Factor G.
- 38. The method of claim 34 wherein said modeler manager determines said masking threshold according to a formula:Masking Threshold=X*AV*VF where said X is said intensity value X, said AV is said mask index value AV, and said VF is said spread function value VF.
- 39. The method of claim 38 wherein said modeler manager sequentially recalculates a different respective value for said masking threshold corresponding to each of said masking components from said audio data to thereby produce a total tonal masking threshold and a total noise masking threshold.
- 40. The method of claim 39 wherein said modeler manager combines said total tonal masking threshold and said total noise masking threshold to thereby produce a total combined masking threshold for use in encoding said audio data.
- 41. A computer-readable medium containing program instructions for efficiently determining a masking threshold by performing the steps of:determining said masking threshold with a modeler manager from a psycho-acoustic modeler by analyzing audio data using one or more linear parameters that are stored in non-logarithmic form; and controlling said modeler manager with a microprocessor coupled to said psycho-acoustic modeler to thereby determine said masking threshold.
- 42. A system for efficiently determining a masking threshold to encode audio data, comprising:means for determining said masking threshold by analyzing said audio data using one or more linear parameters; and means for controlling said means for determining said masking threshold.
CROSS-REFERENCE TO RELATED APPLICATIONS
This application is a continuation, and claims priority in, U.S. patent application Ser. No. 09/150,117, entitled “System and Method For Implementing A Masking Function In A Psycho-Acoustic Modeler,” filed on Sep. 9, 1998, now U.S. Pat. No. 6,195,633 issued Feb. 27, 2001.
US Referenced Citations (2)
Number |
Name |
Date |
Kind |
5963596 |
Benbassat et al. |
Oct 1999 |
A |
6041295 |
Hinderks |
Mar 2000 |
A |
Continuations (1)
|
Number |
Date |
Country |
Parent |
09/150117 |
Sep 1998 |
US |
Child |
09/738069 |
|
US |