1. Field of the Invention
Various features pertain to crosstalk cancellation of closely spaced speakers to achieve improved stereo sound quality.
2. Description of Related Art
Stereophonic sound, commonly called stereo, reproduces a sound using two or more independent audio channels. Typically, a symmetrical configuration of loudspeakers is used to create a pleasant and natural impression of sound heard from various directions, as in natural hearing. However, the use of multiple speakers or audio channels may create acoustic crosstalk. Acoustic crosstalk refers to the leakage or “bleeding” of sound from one sound wave into another sound wave.
Such acoustic crosstalk is particularly problematic where closely spaced speakers are employed. For instance, while listening to stereo signals using closely spaced speakers, the width of stereo image heard by the user is limited to the distance between the two stereo speakers. Stereo imaging refers to the recreation of sound waves that simulate position differences in the original sound source. When using stereo headphones, the sound is delivered directly into a user's ear, thereby avoiding the possibility of acoustic crosstalk. However, when using closely spaced speakers (such as on a mobile phone), the sound emitted by each speaker propagates through the air and is received by both the left and right ears, resulting in acoustic crosstalk. In order to widen the stereo image, it is desirable to completely eliminate or greatly reduce this acoustic crosstalk.
Consequently, a method is needed that reduces or eliminates the effects of acoustic crosstalk for closely spaced speakers.
A technique for canceling acoustic crosstalk is provided including a pre-processing filter and a crosstalk cancellation device. The pre-processing filter may be configured to obtain first and second channel signals and compensate or adjust the first and/or second channel signals for anticipated subsequent stage distortion by the crosstalk cancellation device. The crosstalk cancellation device maybe configured to receive the compensated first and second channel signals from the pre-processing filter. The crosstalk cancellation device then modifies the first channel signal to cancel anticipated acoustic crosstalk from the second channel signal, and modifies the second channel signal to cancel acoustic crosstalk from the first channel signal. The modified first channel signal is then transmitted over a first speaker and the modified second channel signal is transmitted over a second speaker.
One implementation provides a device comprising a pre-processing filter and a crosstalk cancellation device. The pre-processing filter may be configured to (a) obtain a first and second channel signals that comprise a stereo signal, (b) compensate for anticipated subsequent stage distortion to the first channel signal, and/or (c) compensate for anticipated subsequent stage distortion to the second channel signal. The crosstalk cancellation device may be configured to (a) obtain the compensated first channel signal, (b) modify the first channel signal to cancel anticipated acoustic crosstalk from the second channel signal, (c) obtain the compensated second channel signal, and/or (d) modify the second channel signal to cancel acoustic crosstalk from the first channel signal.
The pre-processing filter may include (a) a plurality of band-pass filters to divide the first channel signal into a plurality of frequency bands; and/or (b) at least one signal attenuators to attenuate a selected frequency band, thereby compensating for anticipated unwanted gain in that frequency band due to the crosstalk cancellation device.
The crosstalk cancellation device may be optimized for acoustic crosstalk cancellation at a particular distance. The crosstalk cancellation device may also be tuned for an approximate one sample delay between a direct path acoustic signal and a crosstalk path acoustic signal.
The combination of the pre-processing device and crosstalk cancellation device may provide a substantially flat frequency response over a frequency range of interest. The substantially flat frequency response may be characterized by the modified first channel signal having a substantially linear magnitude response over the frequency range of interest. The substantially flat frequency response may be characterized by the modified first channel signal having a substantially linear phase delay over the frequency range of interest.
A first speaker may be coupled to the crosstalk cancellation device to transmit the modified first channel signal. Similarly, a second speaker coupled to the crosstalk cancellation device to transmit the modified second channel signal. The first and second speakers may be spaced ten (10) centimeters apart or less.
Similarly, a method for crosstalk cancellation of a stereo signal is also provided comprising: (a) configuring a crosstalk cancellation device to modify a first channel signal of the stereo signal to cancel acoustic crosstalk from a second channel signal of the stereo signal; (b) ascertaining a frequency response characteristic for the crosstalk cancellation device for a range of desired frequencies; (c) providing the first and second channel signals to a pre-processing filter prior to reaching the subsequent stage crosstalk cancellation device; (d) configuring the pre-processing stage filter to compensate for anticipated distortion to the first channel signal caused by the subsequent stage crosstalk cancellation device; and/or (e) providing the compensated first channel signal from the pre-processing filter to the crosstalk cancellation device. The method may also involve (f) configuring the crosstalk cancellation device to modify the second channel signal to cancel acoustic crosstalk from the first channel signal; (g) configuring the pre-processing stage filter to compensate for distortions to the second channel signal caused by the crosstalk cancellation device; (h) providing the compensated second channel signal from the pre-processing filter to the crosstalk cancellation device; (i) transmitting the modified first channel signal from the crosstalk cancellation device via a first speaker; and/or (j) transmitting the modified second channel signal from the crosstalk cancellation device via a second speaker.
The modified first and second channel signals may have a substantially linear magnitude response over a frequency range of interest. The pre-processing filter may add linear phase delay to the left and right channel signals. The crosstalk cancellation device may be pre-optimized for acoustic crosstalk cancellation at an intended listener at a particular distance. In one example, the crosstalk cancellation device is tuned for an approximate one sample delay between a direct path acoustic signal and a crosstalk path acoustic signal.
Consequently, a stereo signal crosstalk canceller is provided comprising: (a) means for modifying a first channel signal at a crosstalk cancellation device to cancel acoustic crosstalk from a second channel signal; (b) means for ascertaining a frequency response characteristic for the crosstalk cancellation device for a range of desired frequencies; (c) means for providing the first and second channel signals to a pre-processing filter prior to reaching the subsequent stage crosstalk cancellation device; (d) means for compensating, at the pre-processing stage filter, for anticipated distortion to the first channel signal caused by the subsequent stage crosstalk cancellation device; (e) means for providing the compensated first channel signal from the pre-processing filter to the crosstalk cancellation device; (f) means for modifying the second channel signal at the crosstalk cancellation device to cancel acoustic crosstalk from the first channel signal; (g) means for compensating, at the pre-processing stage filter, for distortions to the second channel signal caused by the crosstalk cancellation device; (h) means for providing the compensated second channel signal from the pre-processing filter to the crosstalk cancellation device; (i) means for acoustically transmitting the modified first channel signal from the crosstalk cancellation device; and/or (j) means for acoustically transmitting the modified second channel signal from the crosstalk cancellation device.
A computer-readable medium is also provided comprising instructions for performing acoustic crosstalk cancellation of stereo signals, which when executed by a processor causes the processor to: (a) obtain a first and second channel signals, (b) compensate for anticipated subsequent stage distortion to the first channel signal; (c) modify the first channel signal to cancel anticipated acoustic crosstalk from the second channel signal; (d) compensate for anticipated subsequent stage distortion to the second channel signal; (e) modify the second channel signal to cancel acoustic crosstalk from the first channel signal; (f) transmit the modified first channel signal; and/or (g) transmit the modified second channel signal through a separate channel from the modified first channel signal.
Similarly, a processor is provided including a processing circuit configured to (a) obtain a first and second channel signals, (b) compensate for anticipated subsequent stage distortion to the first channel signal, (c) modify the first channel signal to cancel anticipated acoustic crosstalk from the second channel signal, (d) compensate for anticipated subsequent stage distortion to the second channel signal, (e) modify the second channel signal to cancel acoustic crosstalk from the first channel signal, (f) transmit the modified first channel signal, and/or (g) transmit the modified second channel signal through a separate channel from the modified first channel signal.
The features, nature, and advantages of the present aspects may become more apparent from the detailed description set forth below when taken in conjunction with the drawings in which like reference characters identify correspondingly throughout.
In the following description, specific details are given to provide a thorough understanding of the embodiments. However, it will be understood by one of ordinary skill in the art that the embodiments may be practiced without these specific detail. For example, circuits may be shown in block diagrams in order not to obscure the embodiments in unnecessary detail. In other instances, well-known circuits, structures and techniques may be shown in detail in order not to obscure the embodiments.
Also, it is noted that the embodiments may be described as a process that is depicted as a flowchart, a flow diagram, a structure diagram, or a block diagram. Although a flowchart may describe the operations as a sequential process, many of the operations can be performed in parallel or concurrently. In addition, the order of the operations may be re-arranged. A process is terminated when its operations are completed. A process may correspond to a method, a function, a procedure, a subroutine, a subprogram, etc. When a process corresponds to a function, its termination corresponds to a return of the function to the calling function or the main function.
In one or more examples and/or configurations, the functions described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code means in the form of instructions or data structures and that can be accessed by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above are also be included within the scope of computer-readable media.
Moreover, a storage medium may represent one or more devices for storing data, including read-only memory (ROM), random access memory (RAM), magnetic disk storage mediums, optical storage mediums, flash memory devices and/or other machine readable mediums for storing information.
Furthermore, embodiments may be implemented by hardware, software, firmware, middleware, microcode, or any combination thereof. When implemented in software, firmware, middleware or microcode, the program code or code segments to perform the necessary tasks may be stored in a computer-readable medium such as a storage medium or other storage(s). A processor may perform the necessary tasks. A code segment may represent a procedure, a function, a subprogram, a program, a routine, a subroutine, a module, a software package, a class, or any combination of instructions, data structures, or program statements. A code segment may be coupled to another code segment or a hardware circuit by passing and/or receiving information, data, arguments, parameters, or memory contents. Information, arguments, parameters, data, etc. may be passed, forwarded, or transmitted via any suitable means including memory sharing, message passing, token passing, network transmission, etc.
One feature provides crosstalk cancellation by employing a simplified version of the well-known Atal-Schroder crosstalk cancellation technique combined with a frequency compensation linear phase finite impulse response (FIR) filter to achieve relatively flat response at the output of crosstalk cancellation.
In 1966, Atal and Schroeder used physical reasoning to determine how a crosstalk canceller comprising only two loudspeakers placed symmetrically in front of a single listener could work. (See U.S. Pat. No. 3,236,949). The objective of a crosstalk canceller is to reproduce a desired signal at a single target position while canceling out the sound perfectly at all remaining target positions. The Atal-Schroeder crosstalk cancellation technique involves the addition, to the right-hand loudspeaker signal, of an out-of-phase version of the left channel signal anticipated to reach the right ear of the intended listener via crosstalk, and the addition, to the left-hand loudspeaker signal, of an out-of-phase version of the right-hand channel signal expected to reach the left ear of the listener via crosstalk.
The crosstalk canceller proposed by Atal-Schroeder focused on reproducing a phantom sound source anywhere within 180 degrees in front plane of the user. This technique is known to add unnatural coloration to the sound, especially when the speakers are spaced very close to each other. Other crosstalk cancellation techniques, such as head-related transfer function (HRTF), also called anatomical transfer function (ATF), are complex and computationally expensive to implement in real-world applications.
According to one implementation, a wider stereo expansion image may be achieved by simplifying the Atal-Schroeder crosstalk cancellation network and adding a pre-processing FIR filter. Adding the pre-processing FIR filter significantly reduces the tone coloration added by traditional crosstalk cancellation network. The pre-processor filter may be a linear phase filter that does not add additional phase distortion to the signal, thereby preserving relative delays between direct and crosstalk signal.
Many handheld devices such as mobile handsets (e.g., mobile phones, etc.) are equipped with stereo speakers to playback stereo multimedia content (e.g., voice, audio, music, etc.). However, due to a small form factor of many handheld devices, the stereo speakers are typically spaced very close to each other. For example, spacing of two (2) to six (6) centimeters (cm) is quite common in commercially available handheld devices. Since the speakers in such handheld devices are small in size, they often exhibit poor low frequency response and speaker distortion of low frequencies.
Stereo playback using very closely spaced speakers (e.g., two to six centimeters) pose serious limitations in delivering a good stereo effect even for good stereo content. This is mainly due to the high crosstalk signal received by opposite ear.
The pre-processing filter 106 may be a finite impulse response (FIR) filter which is configured to attenuate the lower band and higher band frequencies to compensate for frequency boost added by direct and crosstalk filters of the crosstalk cancellation network 108. The pre-processing filter 106 minimizes coloration and clipping issues. Applying the pre-processing filter in cascade with direct or crosstalk filter ensures that the combined frequency response is relatively flat over large range of frequencies. The pre-processing filter 106 outputs a left channel signal SL 120 and right channel signal SR 122 to the crosstalk cancellation network 108.
The crosstalk cancellation network 108 then modifies each channel signal to compensate for the anticipated or expected crosstalk at the listener's corresponding ear and transmits the audio signal through the corresponding left speaker 110 and right speaker 112. That is, a left output channel signal OUTL 124 propagates from the left speaker 110 and is intended for the listener's left ear 128, but as the left output channel signal OUTL 124 propagates through the air, it also reaches the listener's right ear 130 as crosstalk CLR 132. Similarly, a right output channel signal OUTR 126 propagates from the right speaker 112 and is intended for the listener's right ear 130, but as the right output channel signal OUTR 126 propagates through the air, it also reaches the listener's left ear 128 as crosstalk CRL 134. Consequently, the channel signals OUTL 124 and OUTR 126 from the left and right speakers 110 and 112, respectively, do not directly reach the left and right ears, respectively, but undergo a transformation while the sound is transmitted through the air. The left output channel signal OUTL 124 is transformed according to the left path acoustic transfer function HLL and the right-speaker-to-left-ear crosstalk signal CRL 134. Similarly, the right output channel signal OUTR 126 is transformed according to the right path acoustic transfer function HRR and the left-speaker-to-right-ear crosstalk signal CLR 132.
The resulting output [EL, ER] that listener 114 hears can be described by:
The purpose of the crosstalk cancellation network 108 is to eliminate this acoustic transfer function H in Equation 1, so that user gets the original stereo signals INL 116 and INR 118 at the left ear 128 and the right ear 130, respectively. The output signals OUTL 124 and OUTR 126 may be represented as the stereo input signals INL 116 and INR 118 modified by the crosstalk cancellation network 108 function Y, such that
Consequently, good crosstalk cancellation means that the network canceller function Y cancels the acoustic transfer function H, such that:
A typical Schroeder crosstalk cancellation network employs the knowledge of the angle at which the stereo speakers are located and the perceived angle where the phantom source is to be positioned. However, in implementing stereo sound on handheld devices, expanding the stereo image is of interest, not necessarily positioning a sound to a particular angle. Consequently, the crosstalk cancellation network 108 may implement a simplified version of the Schroeder crosstalk cancellation network where the signal paths related to phantom source locations are removed from the crosstalk network.
To achieve good crosstalk cancellation results, the crosstalk cancellation network may be tuned according to the crosstalk delay and gain parameters. The delay value is derived based on the geometrical setup of the speakers 204 and 206 and the intended listener's head 208 and converting the time delay into delay in samples at a sampling rate (e.g., 44.1 kHz).
For the one sample delay resulting from the configuration of
where crossgain<1.0, is the crosstalk attenuation. For closely spaced speakers, crosstalk attenuation is very close to 1.0. However due to sound absorption by the human head, the crossgain can be tuned to a smaller number.
To minimize these coloration and clipping issues, the current method uses a pre-processing FIR filter 302 (
It should be clearly understood that the signal channel characteristics illustrated in
Note that the configuration of the pre-processing filter 802 depends on the frequency response of the crosstalk cancellation network 804 at various bands of interest. Consequently, the band-pass filters 808, 810, and 812, signal attenuators 814 and 816 and/or signal amplifiers 818 may be designed to compensate for a corresponding frequency response characteristic of the crosstalk cancellation network 804 over a frequency range of interest. In this example for the left stereo input signal 806, the pre-processing filter 802 may compensate for the frequency response of the left direct path (HL direct in
While the pre-processing filter 802 shows a left stereo input signal 806 being processed, a similar filter may be employed for a right stereo input signal. Such filter may compensate for the frequency response of the right direct path (HR direct in
In some implementations, the crosstalk cancellation network 804 may operate like the crosstalk cancellation network 304 illustrated in
The first and second channel signals may be provided to a pre-processing filter prior to reaching the subsequent stage crosstalk cancellation device 906. The pre-processing stage filter may be configured to compensate for anticipated distortions to the first channel signal caused by the subsequent crosstalk cancellation device 908. For example, the pre-processing stage filter may compensate for unwanted amplification and/or attenuation of certain frequency bands by the crosstalk cancellation device. The compensated first channel signal is then provided from the pre-processing filter to the crosstalk cancellation device 910.
Similarly, the pre-processing stage filter may be configured to compensate for distortions to the second channel signal caused by the crosstalk cancellation device 912. The compensated second channel signal is then provided from the pre-processing filter to the crosstalk cancellation device 914.
The modified first and second channel signals are transmitted from the crosstalk cancellation device via a plurality of closely spaced speakers 916.
One or more of the components, steps, and/or functions illustrated in
Those of skill in the art would further appreciate that the various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system.
The various features described herein can be implemented in different systems. For example, the pre-processing filter and/or crosstalk cancellation network may be implemented in a single circuit or module, on separate circuits or modules, executed by one or more processors, executed by computer-readable instructions incorporated in a machine-readable or computer-readable medium, and/or embodied in a handheld device, mobile computer, and/or mobile phone.
It should be noted that the foregoing embodiments are merely examples and are not to be construed as limiting the invention. The description of the embodiments is intended to be illustrative, and not to limit the scope of the claims. As such, the present teachings can be readily applied to other types of apparatuses and many alternatives, modifications, and variations will be apparent to those skilled in the art.