The instant disclosure relates to audio processing. More specifically, portions of this disclosure relate to audio processing to compensate for speaker distortion.
Speakers are not capable of perfectly replicating sounds encoded in audio files. Trade-offs are made during speaker design and manufacturing to fit particular applications. For example, cost constraints may result in selection of materials for speakers that are not ideal. As another example, space constraints may result in construction of a speaker with a size that is not ideal for reproduction of all frequencies of sounds. Smaller speakers, such as those used in mobile phones, are generally less accurate with reproduction of sounds and can introduce distortion into the reproduced sounds. Furthermore, manufacturing imperfections in smaller speakers can introduce additional distortion into the reproduced sounds.
An example of distortion is mechanical rattle. Laptop/notebook computers, smart phones, and other devices with speakers often have a mechanical rattle when audio content with sufficient energy is present within middle frequencies. Such rattle may be perceivable to a listener when high-frequency audio content is also not present to mask the broadband rattle noise. Existing solutions to limit the volume of audio within middle frequencies that cause rattle may not be desirable as such solutions may cause noticeable loss of sound pressure level.
In accordance with the teachings of the present disclosure, certain disadvantages and problems associated with existing approaches to minimizing distortion, including mechanical rattle, may be reduced or eliminated.
In accordance with embodiments of the present disclosure, a method for reducing perceived audio distortion in a loudspeaker in response to an input audio signal may include attenuating the input audio signal within an attenuation frequency band of the input audio signal to generate a limited audio signal, generating masking audio to psychoacoustically mask absence of audio content attenuated from the attenuation frequency band of the input audio signal, and combining the limited audio signal and the masking audio to generate an output audio signal.
In accordance with these and other embodiments of the present disclosure, an apparatus may include an audio controller configured to perform steps for reducing perceived audio distortion in a loudspeaker in response to an input audio signal comprising: attenuating the input audio signal within an attenuation frequency band of the input audio signal to generate a limited audio signal, generating masking audio to psychoacoustically mask absence of audio content attenuated from the attenuation frequency band of the input audio signal, and combining the limited audio signal and the masking audio to generate an output audio signal.
Technical advantages of the present disclosure may be readily apparent to one having ordinary skill in the art from the figures, description and claims included herein. The objects and advantages of the embodiments will be realized and achieved at least by the elements, features, and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are explanatory examples and are not restrictive of the claims set forth in this disclosure.
A more complete understanding of the present embodiments and advantages thereof may be acquired by referring to the following description taken in conjunction with the accompanying drawings, in which like reference numbers indicate like features, and wherein:
In accordance with embodiments of this disclosure, a spectral tilt of an audio signal may be used to determine whether a speaker will introduce perceptible distortion during playback of the audio signal. The spectral tilt may be indicated by determining a ratio between energy in a distortion-producing frequency band and energy in a distortion-masking frequency band. Based on the determined spectral tilt, the distortion-producing frequency band may be attenuated to minimize the distortion introduced by the speaker.
Additionally, the distortion-masking frequency band may be amplified and/or augmented with generated harmonics to minimize perceptibility of the distortion produced by the speaker, and to partially offset the loss in sound pressure level incurred when attenuating the distortion producing frequency band. Harmonics generation may also mitigate the issue of nonexistent masking content by creating distortion masking content from lower frequency, distortion-producing content. Furthermore, harmonics generation may strengthen the perceived volume of the fundamental tones used to generate the harmonics. Preferably, harmonics generation may only occur when distortion-producing content is strong enough to trigger attenuation.
The spectral tilt value may be used, in conjunction with the input audio signal level, or the level of the input signal in the distortion producing frequency band of the input audio signal, to determine whether to apply attenuation to the input audio signal to minimize audible distortion resulting from reproduction of the input audio signal. At step 104, the spectral tilt value may be compared to a threshold value and the energy level of a distortion-producing frequency band compared to a threshold value to determine whether perceptible audio distortion is not sufficiently masked. If both are larger than the threshold value, then method 100 may continue to step 106. Otherwise, method 100 may proceed to step 110.
At step 106, an attenuation frequency band of the input audio signal may be attenuated to create a modified audio signal for driving a speaker. At step 108, the input audio signal may be augmented with generated harmonics in order to generate the modified audio signal in order to minimize perceptibility of the distortion produced by the speaker, and to partially offset the loss in sound pressure level incurred when attenuating the distortion producing frequency band. After completion of step 108, method 100 may end.
At step 110, the input audio signal may be passed through to drive the speaker with little or no modification. After completion of step 110, method 100 may end.
A block diagram for an example integrated circuit for implementing method 100 is shown in
The attenuation may be applied to all or part of the input audio signal to produce a modified audio signal for output at output node 204. A filter 230 may receive the input audio signal from input node 202 and may produce an audio signal 232A within an attenuation frequency band, an audio signal 232B within a harmonics generation band, and an audio signal 232C within other frequency bands. An attenuation block 220 may apply the determined attenuation from attenuation factor block 218 to audio signal 232A within the attenuation frequency band.
In addition or alternatively, a harmonics generator 234 may create harmonics from audio signal 232B within the harmonics generation band, wherein the harmonics generation band may be at frequencies higher than the attenuation band that may cause little or no rattle or other distortion. A user may psychoacoustically perceive that distortion-producing audio above the threshold is actually present (e.g., by the use of beat frequencies), even though such audio is not present. A dynamic range limiter 236 may apply a gain to the generated harmonics as a function of the attenuation from attenuation factor block 218 (e.g., in some embodiments the gain applied to harmonics generated by harmonics generator 234 may be inversely proportional to the attenuation applied by attenuation block 220). The presence of this additional high-frequency content may also mask rattle and/or other distortion.
Combiner 232 may combines the output of attenuation factor block 218 and the output of dynamic range limiter 236 with the audio signal 232C within the other frequency bands to produce a modified audio signal at output node 204. The modified audio signal may produce less audible distortion when played back through a speaker, mask audio content in the distortion-producing frequency band, and/or psychoacoustically replace attenuated audio with virtual audio via harmonics.
The frequency bands may be selected based on characteristics of the speaker reproducing the audio signal. The distortion-producing frequency band may be selected to align with frequency bands in which rattle or other distortion are naturally at high levels in a particular speaker. For example, for some speakers, rattle may exist between 2500 HZ and 8000 Hz. The distortion-masking frequency band may include sounds that when played have a psychoacoustic effect of covering the distortion from perception by a listener. In some embodiments, the distortion-producing frequency band may be the entire audible spectrum and the distortion-masking frequency band may be a subset of the audible spectrum.
Attenuation may be used to minimize the signal level of sounds in an attenuation frequency band in order to minimize distortion in the distortion-masking frequency band. For example, attenuation may be applied when the measured signal level in the distortion-producing frequency band of the input audio signal is high enough to generate noticeable distortion products in the distortion-masking frequency band, and the measured signal level in said distortion-masking frequency band of the input audio signal is insufficient to substantially mask said distortion products. The attenuation frequency band may be selected such that a distortion introduced by the speaker is minimized proportionally when the signal level in the frequency band is minimized. In some embodiments, the attenuation frequency band may overlap with the distortion-producing frequency band such that some or all of the energy in the distortion-producing frequency band is minimized.
Harmonics may be used to psychoacoustically replace attenuated audio from the attenuation band and/or mask distortion present in the distortion-producing frequency band. For example, harmonics may be applied when the measured signal level in the distortion-producing frequency band of the input audio signal is high enough to generate noticeable distortion products in the distortion-masking frequency band, and the measured signal level in said distortion-masking frequency band of the input audio signal is insufficient to substantially mask said distortion products. The harmonics generation band may be selected such that harmonics generated from audio signal 232B may psychoacoustically replace attenuated audio from the perception of a human listener and/or are at such levels capable of masking distortion that occurs. In some embodiments, the harmonics generation band may overlap with one or more of the distortion-producing frequency band, the distortion generating band, the attenuation band, and the unmodified band of audio signal 232C.
Harmonics generator 234 may be implemented by an “S” curve harmonics generator 238 followed by a lowpass filter 240. An example of an equation for S-curve harmonics generator 238, assuming the input audio signal ranges from a value of −1.0 to +1.0, may be given by:
where x represents the input signal to S-curve harmonics generator 238 and y represents the output.
In some embodiments, lowpass filter 240 may have a cutoff frequency of approximately 4000 Hz. In the embodiments represented by
The newly-limited attenuation band and newly-limited harmonics may then be added to the output of the band-stop filter of 282C. The output of filter 282C may represent all of the signal that is not part of the attenuation band, which may pass through the system unmodified. The output of the adder block 232 may be the modified audio output of the system, which includes the limited attenuation band and limited harmonics. Although certain computations and frequency ranges are illustrated in the embodiment of
Spectral tilt may be evaluated using various metrics. For example, one example of spectral tilt value may be determined in the integrated circuits of
At step 308, a signal level in an attenuation frequency band of the input audio signal may be attenuated. At step 310, the input audio signal may be augmented with generated harmonics in order to generate the modified audio signal in order to minimize perceptibility of the distortion produced by the speaker, and to partially offset the loss in sound pressure level incurred when attenuating the distortion producing frequency band. After completion of step 310, method 300 may end.
At step 312, the input audio signal may be passed through to drive the speaker with little or no modification. After completion of step 312, method 300 may end.
Another example of spectral tilt value may be based on a percentage of audio energy in a high-frequency band as a percentage of all audio energy in the input audio signal. This ratio may be expressed as a ratio of a first signal level of a subset of frequencies of the input audio signal and a second signal level of the entire input audio signal. The subset of frequencies may be higher in frequency than the distortion-producing frequency band. The high frequency content selected for determining the ratio may have some effect in masking distortion introduced by the speaker in the distortion-producing frequency bands of the input audio signal. An example integrated circuit for performing attenuation based on this evaluation of spectral tilt is shown in
The input audio signal may be processed with filters and attenuators to attenuate signal in the attenuation frequency band. The input audio signal may be filtered by bandpass filter 416 to select the attenuation frequency band from the input audio signal. The output of bandpass filter 416 may be modified by attenuation block 220 based on the output from attenuation factor block 218.
Further, the input audio signal may be filtered by bandpass filter 420 to select the harmonics generation band from the input audio signal, wherein the harmonics generation band may be at frequencies higher than the attenuation band that may cause little or no rattle or other distortion. Harmonics generator 434 may create harmonics from the audio signal generated by bandpass filter 420 within the harmonics generation band. A user may psychoacoustically perceive that distortion-producing audio above the threshold is actually present (e.g., by the use of beat frequencies), even though such audio is not present. A limiter 436 may apply a gain to the generated harmonics as a function of the attenuation from attenuation factor block 218 (e.g., in some embodiments the gain applied to harmonics generated by harmonics generator 434 may be inversely proportional to the attenuation applied by attenuation block 220). The presence of this additional high-frequency content may also mask rattle and/or other distortion.
A notch filter 418 may pass un-modified audio content around the attenuation frequency band to the combiner 232. The combiner 232 may recombine the un-modified audio content with an attenuated version of the attenuation frequency band and the generated harmonics. The output of the combiner 232 may be a modified audio signal at output node 404.
The example embodiments of
Referring back to
The example embodiments of
As shown in
As shown in
The phase signal from inverse tangent block 703 may be offset by adding a desired offset PHASEH 1 by summer block 706. The output of summer block 706 may then be scaled in multiplier block 707 by a desired real number HARM1 (e.g., a positive integer) to produce a harmonic phase signal HPHASE1. Harmonic phase signal HPHASE1 may be converted into an audio harmonic signal by taking a cosine of the harmonic phase signal by cosine block 708, to generate cosine signal C-HARM1.
Multiplier block 710 may multiply cosine signal C-HARM1 by a signal magnitude generated by absolute value block 705. Absolute value block 705 may create the signal magnitude from the in-phase and quadrature outputs from block 702, squaring each input, adding the squares, and taking a square root of the resulting sum. Multiplier block 711 may further multiply the output of multiplier block 710 by a desired scaling factor HAMP1, scaling the magnitude for that branch (i.e., harmonic).
A summing block 712 may add the results of the branches 704, 714, and 715 together, and a bandpass filter 713 may further filter the resulting signal to generate a harmonic signal.
A variation on the embodiments represented by
A second variation may be created by adding additional instantiations of bandpass filter 701 and Hilbert transform filter 702. The additional bandpass filter may be set to pass the harmonic band (unlike the rattle band of 701). The additional bandpass filter output may be input to the additional Hilbert transform filter 702. The outputs of the additional Hilbert transform filter may then be input to magnitude block 705 in lieu of to the input of absolute value block 705 inputs. By selecting the pass band of the second bandpass filter, for example, to be the band of the generated harmonics, the envelope of the original harmonics may be placed on the generated harmonic output to create an alternative harmonic generator.
The methods and systems described herein may be used in any suitable system, device, or apparatus. An example of such use may be in a personal media device for playing back music, high-fidelity music, and/or speech from telephone calls.
The schematic flow chart diagrams of methods 100, 300, and 500 are generally set forth as a logical flow chart diagram. Likewise, other operations for the circuitry are described without flow charts herein as sequences of ordered steps. The depicted order, labeled steps, and described operations are indicative of aspects of methods of the invention. Other steps and methods may be conceived that are equivalent in function, logic, or effect to one or more steps, or portions thereof, of the illustrated method. Additionally, the format and symbols employed are provided to explain the logical steps of the method and are understood not to limit the scope of the method. Although various arrow types and line types may be employed in the flow chart diagram, they are understood not to limit the scope of the corresponding method. Indeed, some arrows or other connectors may be used to indicate only the logical flow of the method. For instance, an arrow may indicate a waiting or monitoring period of unspecified duration between enumerated steps of the depicted method. Additionally, the order in which a particular method occurs may or may not strictly adhere to the order of the corresponding steps shown.
The operations described above as performed by a controller may be performed by any circuit configured to perform the described operations. Such a circuit may be an IC constructed on a semiconductor substrate and include logic circuitry, such as transistors configured as logic gates, and memory circuitry, such as transistors and capacitors configured as dynamic random access memory (DRAM), electronically programmable read-only memory (EPROM), or other memory devices. The logic circuitry may be configured through hard-wire connections or through programming by instructions contained in firmware. Further, the logic circuitry may be configured as a general-purpose processor (e.g., CPU or DSP) capable of executing instructions contained in software. The firmware and/or software may include instructions that cause the processing of signals described herein to be performed. The circuitry or software may be organized as blocks that are configured to perform specific functions. Alternatively, some circuitry or software may be organized as shared blocks that can perform several of the described operations. In some embodiments, the IC that is the controller may include other functionality. For example, the controller IC may include an audio coder/decoder (CODEC) along with circuitry for performing the functions described herein. Such an IC is one example of an audio controller. Other audio functionality may be additionally or alternatively integrated with the IC circuitry described herein to form an audio controller.
If implemented in firmware and/or software, functions described above may be stored as one or more instructions or code on a computer-readable medium. Examples include non-transitory computer-readable media encoded with a data structure and computer-readable media encoded with a computer program. Computer-readable media includes physical computer storage media. A storage medium may be any available medium that can be accessed by a computer. By way of example, and not limitation, such computer-readable media can comprise random access memory (RAM), read-only memory (ROM), electrically-erasable programmable read-only memory (EEPROM), compact disc read-only memory (CD-ROM) or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to store desired program code in the form of instructions or data structures and that can be accessed by a computer. Disk and disc includes compact discs (CD), laser discs, optical discs, digital versatile discs (DVD), floppy disks, and Blu-ray discs. Generally, disks reproduce data magnetically, and discs reproduce data optically. Combinations of the above should also be included within the scope of computer-readable media.
In addition to storage on computer readable medium, instructions and/or data may be provided as signals on transmission media included in a communication apparatus. For example, a communication apparatus may include a transceiver having signals indicative of instructions and data. The instructions and data are configured to cause one or more processors to implement the functions outlined in the claims.
Although the present disclosure and certain representative advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the disclosure as defined by the appended claims. Moreover, the scope of the present application is not intended to be limited to the particular embodiments of the process, machine, manufacture, composition of matter, means, methods and steps described in the specification. For example, where general purpose processors are described as implementing certain processing steps, the general purpose processor may be a digital signal processor (DSP), a graphics processing unit (GPU), a central processing unit (CPU), or other configurable logic circuitry. As one of ordinary skill in the art will readily appreciate from the present disclosure, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps.
As used herein, when two or more elements are referred to as “coupled” to one another, such term indicates that such two or more elements are in electronic communication or mechanical communication, as applicable, whether connected indirectly or directly, with or without intervening elements.
This disclosure encompasses all changes, substitutions, variations, alterations, and modifications to the example embodiments herein that a person having ordinary skill in the art would comprehend. Similarly, where appropriate, the appended claims encompass all changes, substitutions, variations, alterations, and modifications to the example embodiments herein that a person having ordinary skill in the art would comprehend. Moreover, reference in the appended claims to an apparatus or system or a component of an apparatus or system being adapted to, arranged to, capable of, configured to, enabled to, operable to, or operative to perform a particular function encompasses that apparatus, system, or component, whether or not it or that particular function is activated, turned on, or unlocked, as long as that apparatus, system, or component is so adapted, arranged, capable, configured, enabled, operable, or operative. Accordingly, modifications, additions, or omissions may be made to the systems, apparatuses, and methods described herein without departing from the scope of the disclosure. For example, the components of the systems and apparatuses may be integrated or separated. Moreover, the operations of the systems and apparatuses disclosed herein may be performed by more, fewer, or other components and the methods described may include more, fewer, or other steps. Additionally, steps may be performed in any suitable order. As used in this document, “each” refers to each member of a set or each member of a subset of a set.
Although exemplary embodiments are illustrated in the figures and described below, the principles of the present disclosure may be implemented using any number of techniques, whether currently known or not. The present disclosure should in no way be limited to the exemplary implementations and techniques illustrated in the drawings and described above.
Unless otherwise specifically noted, articles depicted in the drawings are not necessarily drawn to scale.
All examples and conditional language recited herein are intended for pedagogical objects to aid the reader in understanding the disclosure and the concepts contributed by the inventor to furthering the art, and are construed as being without limitation to such specifically recited examples and conditions. Although embodiments of the present disclosure have been described in detail, it should be understood that various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the disclosure.
Although specific advantages have been enumerated above, various embodiments may include some, none, or all of the enumerated advantages. Additionally, other technical advantages may become readily apparent to one of ordinary skill in the art after review of the foregoing figures and description.
To aid the Patent Office and any readers of any patent issued on this application in interpreting the claims appended hereto, applicants wish to note that they do not intend any of the appended claims or claim elements to invoke 35 U.S.C. § 112(f) unless the words “means for” or “step for” are explicitly used in the particular claim.
The present disclosure is related to U.S. Pat. No. 10,225,654, which is incorporated herein by reference in its entirety. In addition, the present disclosure claims priority to U.S. Provisional Pat. Appl. No. 63/430,407, filed Dec. 6, 2022, which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
63430407 | Dec 2022 | US |