Embodiments of the present disclosure generally relate to the field of audio signal processing and, more particularly, to crosstalk processing of multi-channel audio.
Crosstalk processing refers to processing of audio signals using contralateral and ipsilateral sound components, such as for crosstalk simulation or crosstalk cancellation. Crosstalk compensation refers to processing that adjusts for spectral defects caused by crosstalk processing. It is desirable to optimize the crosstalk processing and crosstalk compensation processing to increase computational speed and reduce computing resource usage.
Embodiments relate to enhancing an audio signal including a left channel and a right channel. A crosstalk processing including at least one filter and a delay, such as crosstalk cancellation or crosstalk simulation, is applied to a side (or spatial) channel of the left and right channels to generate a crosstalk processed signal. The side channel includes a difference between the left channel and the right channel. A mid (or nonspatial) channel of the left and right channels bypasses the crosstalk processing. The mid channel includes a sum of the left and right channels. A left output channel and a right output channel is generated using the crosstalk processed signal and the mid channel that bypasses the crosstalk processing.
In some embodiments, crosstalk compensation processing is applied to the side channel to generate a crosstalk compensated signal to adjust for spectral defects caused by the crosstalk processing applied to the side channel. The mid channel bypasses the crosstalk compensation processing. The left and right output channels are generated using the crosstalk compensated signal, the crosstalk processed signal, and the mid channel that bypasses the crosstalk processing and crosstalk compensation.
Other aspects include components, devices, systems, improvements, methods, processes, applications, computer readable mediums, and other technologies related to any of the above.
The features and advantages described in the specification are not all inclusive and, in particular, many additional features and advantages will be apparent to one of ordinary skill in the art in view of the drawings, specification, and claims. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter.
The Figures (FIG.) and the following description relate to the preferred embodiments by way of illustration only. It should be noted that from the following discussion, alternative embodiments of the structures and methods disclosed herein will be readily recognized as viable alternatives that may be employed without departing from the principles of the present invention.
Reference will now be made in detail to several embodiments of the present invention(s), examples of which are illustrated in the accompanying figures. It is noted that wherever practicable similar or like reference numbers may be used in the figures and may indicate similar or like functionality. The figures depict embodiments for purposes of illustration only. One skilled in the art will readily recognize from the following description that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles described herein.
Example Crosstalk Compensation Processing
Embodiments relate to crosstalk processing, and in some embodiments crosstalk compensation processing, for stereo audio signals including left and right channels. The crosstalk processing may include crosstalk cancellation for loudspeakers, or crosstalk simulation for headphones. The crosstalk compensation processing adjusts for spectral defects resulting from the crosstalk processing. To increase processing efficiency, the crosstalk processing or crosstalk compensation processing is applied to a side channel generated from the left and right channels, while a mid channel generated from the left and right channels is bypassed. This may be achieved by generating the side channel, applying the crosstalk processing or crosstalk compensation to the side channel, and combining the processed side channel with the mid channel. In another example, crosstalk processing may be applied to each of the left and right channels, with the result being further processed such that the crosstalk processing is effectively applied to the side channel and bypasses the mid channel. The resulting output signal exhibits a spectrally transparent mid while retaining spatial crosstalk characteristics (e.g., either simulation for headphones or cancellation for loudspeakers).
In a loudspeaker arrangement such as illustrated in
In a head-mounted speaker arrangement such as illustrated in
Example Audio Processing System
With reference to
With reference to
Example Crosstalk Cancellation Processor
The crosstalk cancellation processor 302 includes an in-out band divider 310, inverters 320 and 322, contralateral estimators 330 and 340, combiners 350 and 352, an in-out band combiner 360, an L/R to M converter 362, an L/R to S converter 364, and an M/S to L/R converter 366. These components operate together to divide the input channels TL, TR into in-band channels and out-of-band components, and perform a crosstalk cancellation on the in-band components to generate the output channels OL, OR.
By dividing the input audio signal T into different frequency band components and by performing crosstalk cancellation on selective components (e.g., in-band components), crosstalk cancellation can be performed for a particular frequency band while obviating degradations in other frequency bands. If crosstalk cancellation is performed without dividing the input audio signal T into different frequency bands, the audio signal after such crosstalk cancellation may exhibit significant attenuation or amplification in the nonspatial and spatial components in low frequency (e.g., below 350 Hz), higher frequency (e.g., above 12000 Hz), or both. By selectively performing crosstalk cancellation for the in-band (e.g., between 250 Hz and 14000 Hz), where the vast majority of impactful spatial cues reside, a balanced overall energy, particularly in the nonspatial component, across the spectrum in the mix can be retained.
The in-out band divider 310 separates the input channels XL, XR into in-band channels TL,In, TR,In and out-of-band channels TL,Out, TR,Out, respectively. Particularly, the in-out band divider 310 divides the left enhanced compensation channel TL into a left in-band channel TL,In and a left out-of-band channel TL,Out. Similarly, the in-out band divider 310 separates the right enhanced compensation channel TR into a right in-band channel TR,In and a right out-of-band channel TR,Out. Each in-band channel may encompass a portion of a respective input channel corresponding to a frequency range including, for example, 250 Hz to 14 kHz. The range of frequency bands may be adjustable, for example according to speaker parameters.
The inverter 320 and the contralateral estimator 330 operate together to generate a left contralateral cancellation channel SL to compensate for a contralateral sound component due to the left in-band channel TL,In. Similarly, the inverter 322 and the contralateral estimator 340 operate together to generate a right contralateral cancellation channel SR to compensate for a contralateral sound component due to the right in-band channel TR,In.
In one approach, the inverter 320 receives the in-band channel TL,In and inverts a polarity of the received in-band channel TL,In to generate an inverted in-band channel TL,In′. The contralateral estimator 330 receives the inverted in-band channel TL,In′, and extracts a portion of the inverted in-band channel TL,In′ corresponding to a contralateral sound component through filtering. Because the filtering is performed on the inverted in-band channel TL,In′, the portion extracted by the contralateral estimator 330 becomes an inverse of a portion of the in-band channel TL,In attributing to the contralateral sound component. Hence, the portion extracted by the contralateral estimator 330 becomes a left contralateral cancellation channel SL, which can be added to a counterpart in-band channel TR,In to reduce the contralateral sound component due to the in-band channel TL,In. In some embodiments, the inverter 320 and the contralateral estimator 330 are implemented in a different sequence.
The inverter 322 and the contralateral estimator 340 perform similar operations with respect to the in-band channel TR,In to generate the right contralateral cancellation channel SR. Therefore, detailed description thereof is omitted herein for the sake of brevity.
In one example implementation, the contralateral estimator 330 includes a filter 332, an amplifier 334, and a delay unit 336. The filter 332 receives the inverted input channel TL,In′ and extracts a portion of the inverted in-band channel TL,In′ corresponding to a contralateral sound component through a filtering function. An example filter implementation is a Notch or Highshelf filter with a center frequency selected between 5000 and 10000 Hz, and Q selected between 0.5 and 1.0. Gain in decibels (GdB) may be derived from Equation 1:
GdB=−3.0−log1.333(D) Eq. (1)
where D is a delay amount by delay unit 336 and 346 in samples, for example, at a sampling rate of 48 KHz. An alternate implementation is a Lowpass filter with a corner frequency selected between 5000 and 10000 Hz, and Q selected between 0.5 and 1.0. Moreover, the amplifier 334 amplifies the extracted portion by a corresponding gain coefficient GL,In, and the delay unit 336 delays the amplified output from the amplifier 334 according to a delay function D to generate the left contralateral cancellation channel SL.
The contralateral estimator 340 includes a filter 342, an amplifier 344, and a delay unit 346 that performs similar operations on the inverted in-band channel TR,In′ to generate the right contralateral cancellation channel SR. In one example, the contralateral estimators 330, 340 generate the left and right contralateral cancellation channels SL, SR, according to equations below:
SL=D[GL,In*F[TL,In′]] Eq. (2)
SR=D[GR,In*F[TR,In′]] Eq. (3)
where F[ ] is a filter function, and D[ ] is the delay function.
In some embodiments, a filter is integrated with an amplifier in a contralateral estimator. For example, the filter 332 may apply the gain of the amplifier 334 as part of a filtering function. In that sense, applying a filter to a signal or channel may include wideband adjustment of gain level in addition to adjustments based on frequency.
The configurations of the crosstalk cancellation can be determined by the speaker parameters. In one example, filter center frequency, delay amount, amplifier gain, and filter gain can be determined, according to an angle formed between two speakers with respect to a listener. In some embodiments, values between the speaker angles are used to interpolate other values.
The combiner 350 combines the right contralateral cancellation channel SR to the left in-band channel TL,In to generate a left in-band crosstalk channel UL, and the combiner 352 combines the left contralateral cancellation channel SL to the right in-band channel TR,In to generate a right in-band crosstalk channel UR.
The L/R to S converter 364 receives the left in-band crosstalk channel UL and the right in-band crosstalk channel UR, and generates a side in-band crosstalk channel US. The side in-band crosstalk channel US may be generated based on a difference between the left in-band crosstalk channel UL and the right in-band crosstalk channel UR.
The L/R to M converter 362 receives the left in-band channel TL,In and the right in-band channel TR,In, and generates a mid in-band channel TM,In. The mid in-band channel TM,In may be generated based on a sum of the left in-band channel TL,In and the right in-band channel TR,In.
The M/S to L/R converter 366 receives the mid in-band channel TM,In and the side in-band crosstalk channel US, and creates a left in-band crosstalk cancelled channel CL and a right in-band crosstalk cancelled channel CR. The left crosstalk cancelled in-band channel CL may be generated based on a sum of the mid in-band channel TM,In and the side in-band crosstalk channel US, and the right in-band crosstalk cancelled channel CR may be generated based on a difference between the mid in-band channel TM,In and the side in-band crosstalk channel US. The side in-band channel US is a side component of the left and right in-band crosstalk channels UL, UR, and is combined with mid in-band channel TM,In, which is a mid component of the in-band channels TL,In and TR,In.
The in-out band combiner 360 combines the left in-band channel CL with the out-of-band channel TL,Out to generate the left output channel OL, and combines the right in-band channel CR with the out-of-band channel TR,Out to generate the right output channel OR. The left output channel OL is a left crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 302, and the right output channel OR is a right crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 302. These crosstalk cancelled channels may be used as output of an audio processing system, or inputs to another component of the audio processing system (e.g., a crosstalk compensation processor 204 that adjusts for spectral defects caused by the crosstalk cancellation).
Accordingly, the left output channel OL includes the side component of the right contralateral cancellation channel SR corresponding to an inverse of a portion of the in-band channel TR,In attributing to the contralateral sound, and the right output channel OR includes the side component of the left contralateral cancellation channel SL corresponding to an inverse of a portion of the in-band channel TL,In attributing to the contralateral sound. In this configuration, a wavefront of an ipsilateral sound component output by the loudspeaker 110R according to the right output channel OR arrived at the right ear can cancel a wavefront of a contralateral sound component output by the loudspeaker 110L according to the left output channel OL. Similarly, a wavefront of an ipsilateral sound component output by the speaker 110L according to the left output channel OL arrived at the left ear can cancel a wavefront of a contralateral sound component output by the loudspeaker 110R according to right output channel OR. The left output channel OL is a left crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 302, and the right output channel OR is a right crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 302. Thus, contralateral sound components can be reduced to enhance spatial detectability.
The L/R to S converter 364 receives the left contralateral cancellation channel SL and the right contralateral cancellation channel SR, and generates a side contralateral cancellation channel SS based on a difference between the left contralateral cancellation channel SL and the right contralateral cancellation channel SR.
The M/S to L/R converter 368 receives the side contralateral cancellation channel SS and a zero mid channel, and generates a left contralateral in-band channel KL and a right contralateral in-band channel KR. The left contralateral in-band channel KL may be generated based on a sum of the side contralateral cancellation channel SS and the zero mid channel, and the right contralateral in-band channel KR may be generated based on a difference between the zero mid channel and the side contralateral cancellation channel SS.
The combiner 370 receives the right contralateral in-band channel KR and the left in-band channel TL,In, and generates the left crosstalk cancelled in-band channel CL by adding the right contralateral in-band channel KR and the left in-band channel TL,In. The combiner 372 receives the left contralateral in-band channel KL and the right in-band channel TR,In, and generates the right crosstalk cancelled in-band channel CR by adding the left contralateral in-band channel KL and the right in-band channel TR,In.
The in-out band combiner 360 combines the left crosstalk cancelled in-band channel CL with the out-of-band channel TL,Out to generate the left output channel OL, and combines the right crosstalk cancelled in-band channel CR with the out-of-band channel TR,Out to generate the right output channel OR. The left output channel OL is a left crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 304, and the right output channel OR is a right crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 304.
The L/R to S converter 364 receives the left contralateral cancellation channel SL and the right contralateral cancellation channel SR, and generates a side contralateral cancellation channel SS based on a difference between the left contralateral cancellation channel SL and the right contralateral cancellation channel SR.
The subtractor 374 receives the left in-band channel TL,In and the side contralateral cancellation channel SS, and generates the left crosstalk cancelled in-band channel CL based on a difference between the side contralateral cancellation channel SS and the left in-band channel TL,In.
The combiner 376 receives the right in-band channel TR,In and the side contralateral cancellation channel SS, and generates the right crosstalk cancelled in-band channel CR based on a sum of the side contralateral cancellation channel SS and the right in-band channel TR,In.
The in-out band combiner 360 combines the left in-band channel CL with the out-of-band channel TL,Out to generate the left output channel OL, and combines the right in-band channel CR with the out-of-band channel TR,Out to generate the right output channel OR. The left output channel OL is a left crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 306, and the right output channel OR is a right crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 306.
A common goal of crosstalk cancellation is that of perceptually removing the crosschannel signal when listening to a symmetric loudspeaker system, where the overall crosschannel signals are transformed identically. That is, the left channel may be delayed, filtered, inverted, and scaled identically to the right channel before summing to the opposite channel. If we assume symmetry in the left/right crosschannel signal transformations,
The in-out band divider 310 separates the input channels XL, XR into the in-band channels TL,In, TR,In and the out-of-band channels TL,Out, TR,Out, respectively. The L/R to M/S converter 378 is coupled to the in-out band divider 310 to receive the in-band channels TL,In, TR,In, and generates the side in-band channel TS,In and the mid in-band channel TM,In. The side in-band channel TS,In may be generated based on a difference between the left in-band channel TL,In and the right in-band channel TR,In. The mid in-band channel TM,In may be generated based on a sum of the left in-band channel TL,In and the right in-band channel TR,In.
The inverter 320 and the contralateral estimator 330 operate together to generate a side contralateral cancellation channel SS from the side in-band channel TS,In to compensate for a contralateral sound component due to the mid in-band channel TM,In. In particular, the inverter 320 receives the side in-band channel TS,In and inverts the polarity to generate an inverted side in-band channel TS,In′. The contralateral estimator 330 receives the inverted side in-band channel TS,In′, and extracts a portion of the inverted side in-band channel TS,In′ corresponding to a contralateral sound component through filtering. Because the filtering is performed on the inverted side in-band channel TS,In′, the portion extracted by the contralateral estimator 330 becomes an inverse of a portion of the side in-band channel TS,In attributing to the contralateral sound component. Hence, the portion extracted by the contralateral estimator 330 becomes the side contralateral cancellation channel SS.
The subtractor 380 receives the side in-band channel TS,In and the side contralateral cancellation channel SS, and generates a side crosstalk canceled in-band channel CS based on a difference between the side in-band channel TS,In and the side contralateral cancellation channel SS. In some embodiments, the inverter 320 and the contralateral estimator 330 are implemented in a different sequence.
The M/S to L/R converter 382 receives the mid in-band channel TM,In and the side crosstalk canceled in-band channel CS, and generates the left crosstalk canceled in-band channel CL and the right crosstalk canceled in-band channel CR. For example, the left crosstalk canceled in-band channel CL may be generated based on a sum of the mid in-band channel TM,In and the side crosstalk canceled in-band channel CS, and the right crosstalk canceled in-band channel CR may be generated based on a difference between the mid in-band channel TM,In and the side crosstalk canceled in-band channel CS.
The in-out band combiner 360 combines the left crosstalk canceled in-band channel CL with the out-of-band channel TL,Out to generate the left output channel OL, and combines the right crosstalk canceled in-band channel CR with the out-of-band channel TR,Out to generate the right output channel OR. The left output channel OL is a left crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 308, and the right output channel OR is a right crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 308.
The crosstalk cancellation processor 312 further includes an L/R to S converter 384 coupled to the in-out band divider 310 and the inverter 320, an M/S to L/R converter 386 coupled to the contralateral estimator 330, and combiners 388 and 390 coupled to the M/S to L/R converter 386, the in-out band divider 310, and the in-out band combiner 360. The L/R to S converter 384 receives the left in-band channel TL,In and the right in-band channel TL,In, and generates a side in-band channel TS,In based on a difference between the left in-band channel TL,In and the right in-band channel TL,In. The side in-band channel TS,In is processed by the inverter 320 and the contralateral estimator 330 to generate the side contralateral cancellation channel SS. The M/S to L/R converter 386 receives the side contralateral cancellation channel SS from the contralateral estimator 330 and a zero mid channel, and generates a left contralateral in-band channel KL and a right contralateral in-band channel KR. The left contralateral in-band channel KL may be generated based on a sum of the side contralateral cancellation channel SS and the zero mid channel, and the right contralateral in-band channel KR may be generated based on a difference between the zero mid channel and the side contralateral cancellation channel SS.
The combiner 388 receives the right contralateral in-band channel KR and the left in-band channel TL,In, and generates the left crosstalk cancelled in-band channel CL by adding the right contralateral in-band channel KR and the left in-band channel TL,In. The combiner 390 receives the left contralateral in-band channel KL and the right in-band channel TR,In, and generates the right crosstalk cancelled in-band channel CR by adding the left contralateral channel KL and the right in-band channel TR,In.
The in-out band combiner 360 combines the left crosstalk cancelled in-band channel CL with the left out-of-band channel TL,Out to generate the left output channel OL, and combines the right crosstalk cancelled in-band channel CR with the out-of-band channel TR,Out to generate the right output channel OR. The left output channel OL is a left crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 312, and the right output channel OR is a right crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 312.
The crosstalk cancellation processor 312 further includes a subtractor 392 and a combiner 394, each coupled to the contralateral estimator 330, the in-out band divider 310, and the in-out band combiner 360. The subtractor 392 receives the left in-band channel TL,In from the in-out band divider 310 and the side contralateral cancellation channel SS from the contralateral estimator 330, and generates the left crosstalk cancelled in-band channel CL based on a difference between the left in-band channel TL,In and the side contralateral cancellation channel SS. The combiner 394 receives the right in-band channel TR,In from the in-out band divider 310 and the side contralateral cancellation channel SS from the contralateral estimator 330, and generates the right crosstalk cancelled in-band channel CR based on a sum of the right in-band channel TR,In and the side contralateral cancellation channel SS.
The in-out band combiner 360 combines the left crosstalk cancelled in-band channel CL with the left out-of-band channel TL,Out to generate the left output channel OL, and combines the right crosstalk cancelled in-band channel CR with the out-of-band channel TR,Out to generate the right output channel OR. The left output channel OL is a left crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 314, and the right output channel OR is a right crosstalk cancelled channel of a crosstalk processed signal generated by the crosstalk cancellation processor 314.
The crosstalk cancellation processors shown in
OL=½(XL+XR)+½((AXR+XL)−(AXL+XR)) Eq. (4)
OR=½(XL+XR)−½((AXR+XL)−(AXL+XR)) Eq. (5)
The output channels OL and OR for the crosstalk cancellation processor 304 shown in
OL=XL+(0−½(AXL−AXR)) Eq. (6)
OR=XR+(0+½(AXL−AXR)) Eq. (7)
The output channels OL and OR for the crosstalk cancellation processor 306 shown in
OL=XL−½(AXL−AXR) Eq. (8)
OR=XR+½(AXL−AXR) Eq. (9)
The output channels OL and OR for the crosstalk cancellation processor 308 shown in
OL=½(XL+XR)+(½(XL−XR)−½(AXL−AXR)) Eq. (10)
OR=½(XL+XR)−(½(XL−XR)+½(AXL−AXR)) Eq. (11)
The output channels OL and OR for the crosstalk cancellation processor 312 shown in
OL=XL−½A(XL−XR) Eq. (12)
OR=XR+½A(XL−XR) Eq. (13)
The output channels OL and OR for the crosstalk cancellation processor 314 shown in
OL=XL−½(AXL−AXR) Eq. (14)
OR=XR+½(AXL−AXR) Eq. (15)
With algebraic manipulation, the Equations 4, 6, 8, 10, 12, and 14 for the left output channel OL are equivalent, and the Equations 5, 7, 9, 11, 13, and 14 for the right output channel OR are equivalent.
Example Crosstalk Simulation Processor
The crosstalk simulation processor 402 includes a left head shadow low-pass filter 422, a left head shadow high-pass filter 424, a left cross-talk delay 426, and a left head shadow gain 428 to process the left input channel XL. The crosstalk simulation processor 402 further includes a right head shadow low-pass filter 432, a right head shadow high-pass filter 434, a right cross-talk delay 436, and a right head shadow gain 438 to process the right input channel XR. The crosstalk simulation processor 402 further includes combiners 440 and 442, an L/R to M converter 444, an L/R to S converter 446, and an M/S to L/R converter 448.
The left head shadow low-pass filter 422 and the left head shadow high-pass filter 424 receive the left input channel XL and apply modulations that model the frequency response of the signal after passing through the listener's head. The use of both low-pass and high-pass filters may result in a more accurate model of the frequency response though the listener's head. In some embodiments, only one of the low-pass filter 422 or high-pass filter 424 are used. The output of the left head shadow high-pass filter 424 is provided to the left cross-talk delay 426, which applies a time delay to the output of the left head shadow high-pass filter 424. The time delay represents trans-aural distance that is traversed by a contralateral sound component relative to an ipsilateral sound component. The frequency response can be generated based on empirical experiments to determine frequency dependent characteristics of sound wave modulation by the listener's head. For example and with reference to
Similarly for the right input channel XR, the right head shadow low-pass filter 432 and right head shadow high-pass filter 434 receives the right input channel XR and applies a modulation that models the frequency response of the listener's head. The output of the right head shadow high-pass filter 434 is provided to the right crosstalk delay 436, which applies a time delay. The right head shadow gain 438 applies a gain to the output of the right crosstalk delay 436 to generate the right crosstalk simulation channel WR.
In some embodiments, the head shadow low-pass filters 422 and 432 have a cutoff frequency of 2,023 Hz. The head shadow high-pass filters 424 and 434 have a cutoff frequency of 150 Hz. The cross-talk delays 426 and 436 apply a 0.792 millisecond delay. The head shadow gains 428 and 438 apply a −14.4 dB gain. The application of the head shadow filters, crosstalk delay, and head shadow gain for each of the left and right channels may be performed in different orders.
In some embodiments, a head shadow filter is integrated with a head shadow gain. For example, the filter head shadow low-pass filters 422 and 432 may apply the gain of the head shadow gain 428 and 438 as part of a filtering function. In that sense, applying a filter to a signal or channel may include wideband adjustment of gain level in addition to adjustments based on frequency.
The combiner 440 is coupled to the right head shadow gain 438 and the L/R to S converter 446. The combiner 440 receives the left input channel XL and the right crosstalk simulation channel WR, and generates a left crosstalk channel VL by adding the left input channel XL and the right crosstalk simulation channel WR. The combiner 442 is coupled to the left head shadow gain 428 and the L/R to S converter 446. The combiner 442 receives the right input channel XR and the left crosstalk simulation channel WL, and generates a right crosstalk channel VR by adding the right input channel XR and the left crosstalk simulation channel WL.
The L/R to S converter 446 receives the left crosstalk channel VL and the right crosstalk channel VR, and generates a side crosstalk channel VS based on a difference between the left crosstalk channel VL and the right crosstalk channel VR.
The L/R to M converter 444 is coupled to the M/S to L/R converter 448. The L/R to M converter 444 receives the left input channel XL and the right input channel XR, and generates a mid channel XM based on a sum of the left input channel XL and the right input channel XR.
The M/S to L/R converter 448 is coupled to the L/R to M converter 444 and the L/R to S converter 446. The M/S to L/R converter 448 receives the side crosstalk channel VS and the mid channel XM, and generates the left output channel OL and the right output channel OR. The left output channel OL may be generated based on a sum of the side crosstalk channel Vs and the mid channel XM, and the right output channel OR may be generated based on a difference between the side crosstalk channel VS and the mid channel XM. The left output channel OL is a left crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 402, and the right output channel OR is a right crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 402.
The L/R to S converter 450 receives the left crosstalk simulation channel WL and the right crosstalk simulation channel WR, and generates a side crosstalk simulation channel WS based on a difference between the left crosstalk simulation channel WL and the right crosstalk simulation channel WR.
The M/S to L/R converter 452 receives the side crosstalk simulation channel WS and a zero mid channel, and generates a left crosstalk channel DL and a right crosstalk channel DR. The left crosstalk channel DL may be generated based on a sum of the side crosstalk simulation channel WS and the zero mid channel, and the right crosstalk channel DR may be generated based on a difference between the zero mid channel and the side crosstalk simulation channel WS.
The combiner 454 receives the right crosstalk channel DR and the left input channel XL, and generates the left output channel OL by adding the right crosstalk channel DR and the left input channel XL. The combiner 456 receives the left crosstalk channel DL and the right input channel XR, and generates the right output channel OR by adding the left crosstalk channel DL and the right input channel XR. The left output channel OL is a left crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 404, and the right output channel OR is a right crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 404.
The crosstalk simulation processor 406 further includes a subtractor 458 and a combiner 460, each coupled to the L/R to S converter 450. The subtractor 458 receives the left input channel XL and the side crosstalk simulation channel WS, and generates the left output channel OL based on a difference between the left input channel XL and the side crosstalk simulation channel WS. The combiner 460 receives the right input channel XR and the side crosstalk simulation channel WS, and generates the right output channel OR based on a sum of the right input channel XR and the side crosstalk simulation channel WS. The left output channel OL is a left crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 406, and the right output channel OR is a right crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 406.
A common goal of crosstalk simulation is that of perceptually simulating the experience of listening to a symmetric loudspeaker system over headphones, where the overall crosschannel signals are transformed identically. That is, the left channel may be delayed, filtered, and scaled identically to the right channel before summing to the opposite channel. If we assume symmetry in the left/right crosschannel signal transformations,
The L/R to M/S converter 462 receives the left input channel XL and the right input channel XR, and generates a mid channel XM and a side channel XS. The side channel XS may be generated based on a difference between the left input channel XL and the right input channel XR. The mid channel XM may be generated based on a sum of the left input channel XL and the right input channel XR.
The side head shadow low-pass filter 464 and the side head shadow high-pass filter 466 receive the side channel XS and apply modulations that model the frequency response of the signal after passing through the listener's head. The use of both low-pass and high-pass filters may result in a more accurate model of the frequency response though the listener's head. In some embodiments, only one of the low-pass filter 464 or high-pass filter 466 are used. The output of the side head shadow high-pass filter 466 is provided to the side cross-talk delay 468, which applies a time delay to the output of the side head shadow high-pass filter 466. The side head shadow gain 470 applies a gain to the output of the side crosstalk delay 426 to generate a side crosstalk simulation channel WS. The application of the head shadow filters, crosstalk delay, and head shadow gain for the side channel XS may be performed in different orders.
The subtractor 472 is coupled to the L/R to M/S converter 462 and side head shadow gain 470. The subtractor 472 receives the side channel XS and the side crosstalk simulation channel WS, and generates a side crosstalk channel Gs based on a difference between the side channel XS and the side crosstalk simulation channel WS.
The M/S to L/R converter 474 is coupled to the L/R to M/S converter 462 and the subtractor 472. The M/S to L/R converter 474 receives the mid channel XM and the side crosstalk channel Gs, and generates the left output channel OL and the right output channel OR. The left output channel OL may be generated based on a sum of the mid channel XM and the side crosstalk channel Gs, and the right output channel OL may be generated based on a difference between the mid channel XM and the side crosstalk channel Gs. The left output channel OL is a left crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 408, and the right output channel OR is a right crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 408.
The crosstalk simulation processor 410 further includes an L/R to S converter 476 coupled to the side head shado low-pass filter 464, an M/S to L/R converter 478 coupled to the side head shadow gain 470, a combiner 480 coupled to the M/S to L/R converter 478, and a combiner 482 coupled to the M/S to L/R converter 478. The L/R to S converter 476 receives the left input channel XL and the right input channel XR, and generates the side channel XS based on a difference between the left input channel XL and the right input channel XR. The side channel XS is processed by the side head shadow low-pass filter 464, the side head shadow high-pass filter 466, the side crosstalk delay 468, and the side head shadow gain 470 to generate the side crosstalk simulation channel WS.
The M/S to L/R converter 478 receives the side crosstalk simulation channel WS and a zero mid channel, and generates a left crosstalk simulation channel WL and a right crosstalk simulation channel WR. The left crosstalk simulation channel WL may be generated based on a sum of the side crosstalk simulation channel WS and the zero mid channel, and the right crosstalk simulation channel WR may be generated based on a difference between the zero mid channel and the side crosstalk simulation channel WS.
The combiner 480 receives the left input channel XL and the right channel WR, and generates the left output channel OL by adding the left input channel XL and the right crosstalk simulation channel WR. The combiner 482 receives the right input channel XR and the left channel crosstalk simulation WL, and generates the right output channel OR by adding the right input channel XR and the left crosstalk simulation channel WL. The left output channel OL is a left crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 410, and the right output channel OR is a right crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 410.
The crosstalk simulation processor 412 further includes a subtractor 484 and a combiner 486, each coupled to the side head shadow gain 470. The subtractor 484 receives the left input channel XL and the side crosstalk simulation channel WS, and generates the left output channel OL based on a difference between the left input channel XL and the side crosstalk simulation channel WS. The combiner 486 receives the right input channel XR and the side crosstalk simulation channel WS, and generates the right output channel OR based on a sum of the right input channel XR and the side crosstalk simulation channel WS. The left output channel OL is a left crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 412, and the right output channel OR is a right crosstalk simulated channel of a crosstalk processed signal generated by the crosstalk simulation processor 412.
The crosstalk simulation processors shown in
Example Crosstalk Compensation Processor
The crosstalk compensation processor 500 includes an L/R to M/S converter 512, a side component processor 530, and an M/S to L/R converter 514. The L/R to M/S converter 512 receives the left input channel XL and the right input channel XR, generates the mid channel Xm based on a sum of the input channels XL, XR, and generates the side channel Xs based on a difference between the input channels XL, XR.
The side component processor 530 includes a plurality of filters 550, such as m side filters 550(a), 550(b) through 550(m). The side component processor 530 generates a side crosstalk compensation channel Zs by processing the spatial channel Xs. In some embodiments, a frequency response plot of the spatial Xs with crosstalk processing can be obtained through simulation. By analyzing the frequency response plot, any spectral defects such as peaks or troughs in the frequency response plot over a predetermined threshold (e.g., 10 dB) occurring as an artifact of the crosstalk processing can be estimated. The side crosstalk compensation channel Zs can be generated by the side component processor 530 to compensate for the estimated peaks or troughs. Specifically, based on the specific delay, filtering frequency, and gain applied in the crosstalk processing, peaks and troughs shift up and down in the frequency response, causing variable amplification and/or attenuation of energy in specific regions of the spectrum. Each of the side filters 550 may be configured to adjust for one or more of the peaks and troughs. In some embodiments, the side component processor 530 may include a different number of filters.
In some embodiments, the side filters 550 may include a biquad filter having a transfer function defined by Equation 16:
where z is a complex variable, and a0, a1, a2, b0, b1, and b2 are digital filter coefficients. One way to implement such a filter is the direct form I topology as defined by Equation 17:
where X is the input vector, and Y is the output. Other topologies may be used, depending on their maximum word-length and saturation behaviors.
The biquad can then be used to implement a second-order filter with real-valued inputs and outputs. To design a discrete-time filter, a continuous-time filter is designed, and then transformed into discrete time via a bilinear transform. Furthermore, resulting shifts in center frequency and bandwidth may be compensated using frequency warping.
For example, a peaking filter may have an S-plane transfer function defined by Equation 18:
where s is a complex variable, A is the amplitude of the peak, and Q is the filter “quality,” and the digital filter coefficients are defined by:
where ω0 is the center frequency of the filter in radians and
Furthermore, the filter quality Q may be defined by Equation 19:
where Δf is a bandwidth and fc is a center frequency.
The M/S to L/R converter 514 receives the mid channel Xm and the side crosstalk compensation channel Zs, and generates the left output channel OL and the right output channel OR. The left output channel OL may be generated based on a sum of the mid channel Xm and the side crosstalk compensation channel Zs. The right output channel OR may be generated based on a difference between the mid channel Xm and the side crosstalk compensation channel Zs. The left output channel OL is a left crosstalk compensated channel of a crosstalk compensated signal generated by the crosstalk compensation processor 500, and the right output channel OR is a right crosstalk compensated channel of a crosstalk compensated signal generated by the crosstalk simulation compensation processor 500.
Example Crosstalk Compensation
The line 902 is a white noise input signal. The line 904 is a mid channel of the input signal that bypasses the crosstalk cancellation and crosstalk compensation. The line 906 is a side channel of the input signal after the crosstalk cancellation and crosstalk compensation. The crosstalk compensation results in the line 906 being generated from the crosstalk canceled side channel shown by the line 806 in the plot 800. For the crosstalk compensation, two side filters are applied to the side channel including a first peaknotch filter having a 6830 Hz center frequency, an 4.0 dB gain, and 1.0 Q, and a second peaknotch filter having a 15500 Hz center frequency, a −2.5 dB gain, and 2.0 Q. In general, the number of side filters applied by the crosstalk compensation processor, as well as their parameters, may vary.
The line 1102 is a white noise input signal. The line 1104 is a mid channel of the input signal that bypasses the crosstalk cancellation and crosstalk compensation. The line 1106 is a side channel of the input signal after the crosstalk cancellation and crosstalk compensation. The crosstalk compensation results in the line 1106 being generated from the crosstalk canceled side channel shown by the line 1006 in the plot 1000. For the crosstalk compensation, three side filters are applied to the side channel including a first peaknotch filter having a 4,000 Hz center frequency, an 8.0 dB gain, and 2.0 Q, and a second peaknotch filter having an 8,800 Hz center frequency, a −2.0 dB gain, and 1.0 Q, and a third peaknotch filter having an 15,800 Hz center frequency, a 1.5 dB gain, and 2.5 Q. The number of side filters applied by the crosstalk compensation processor, as well as their parameters, may vary.
Example Processing
An audio processing system receives 1205 an audio signal including a left channel and a right channel. The audio signal may be a stereo audio signal X with the left channel being mixed for a left speaker and the right channel being mixed for or a right speaker.
The audio processing system applies 1210 a crosstalk processing to a side channel of the left and right channels to generate a crosstalk processed signal. The crosstalk processing may include a crosstalk cancellation or a crosstalk simulation. A mid channel of the side channels may bypass the crosstalk processing.
For crosstalk cancellation, the audio processing system may include a crosstalk cancellation processor, such as the crosstalk cancellation processors 302, 304, 306, 308, 312, and 314 shown in
For crosstalk simulation, the audio processing system may include a crosstalk simulation processor, such as the crosstalk simulation processors 402, 404, 406, 408, 410, and 412 shown in
The audio processing system applies 1215 a crosstalk compensation processing to the side channel to generate a crosstalk compensated signal. The crosstalk compensation processing applied to the side channel adjusts for spectral defects caused by the crosstalk processing applied to the side channel. The mid channel may bypass the crosstalk compensation processing. The audio processing system may include the crosstalk compensation processor 500 as shown in
The audio processing system generates 1220 a left output channel and a right output channel using the crosstalk processed signal and the crosstalk compensated signal. The left and right output channels may also be generated using the mid channel that bypasses the crosstalk processing and crosstalk processing and crosstalk compensation processing. For example, the left output channel may be generated based on a sum of the result of the crosstalk processing and crosstalk compensation processing applied to the side channel and the mid channel that bypasses the crosstalk processing and crosstalk compensation processing. The right output channel may be generated based on a difference between the mid channel that bypasses the crosstalk processing and crosstalk compensation and the result of the crosstalk processing and crosstalk compensation processing applied to the side channel.
In some embodiments, each of the crosstalk processed signal and the crosstalk compensated signal may include a left and right channel, which may be used to respectively generate the left and right out channels. In some embodiments, the crosstalk compensation may be performed after the crosstalk processing as shown by the audio processing system 200 in
In some embodiments, the crosstalk processing and crosstalk compensation are performed in parallel, with their left output channels being combined (e.g., by the combiner 206) to generate the left output channel and their right output channels being combined to generate the right output channel, as shown by the audio processing system 210 in
In some embodiments, the crosstalk compensation is performed prior to the crosstalk cancellation, as shown by the audio processing system 214 in
In some embodiments, the crosstalk compensation processing is not performed, and the left and right output channels of the crosstalk processing are used to generate the left output channel OL and the right output channel OR, respectively.
The audio processing system provides 1225 the left output channel to a left speaker and the right output channel to a right speaker. If the crosstalk processing is crosstalk cancellation, the left and right speakers may be loudpseakers 110L and 110R, respectively. If the crosstalk processing is crosstalk simulation, the left and right speakers may be headphones 130L and 130R, respectively.
Example Computer
The storage device 1308 includes one or more non-transitory computer-readable storage media such as a hard drive, compact disk read-only memory (CD-ROM), DVD, or a solid-state memory device. The memory 1306 holds instructions and data used by the processor 1302. The pointing device 1314 is used in combination with the keyboard 1310 to input data into the computer system 1300. The graphics adapter 1312 displays images and other information on the display device 1318. In some embodiments, the display device 1318 includes a touch screen capability for receiving user input and selections. The network adapter 1316 couples the computer system 1300 to a network. Some embodiments of the computer 1300 have different and/or other components than those shown in
The computer 1300 is adapted to execute computer program modules for providing functionality described herein. For example, some embodiments may include a computing device including one or more modules configured to perform the processing crosstalk processing or crosstalk cancellation processing as discussed herein. As used herein, the term “module” refers to computer program instructions and/or other logic used to provide the specified functionality. Thus, a module can be implemented in hardware, firmware, and/or software. In one embodiment, program modules formed of executable computer program instructions are stored on the storage device 1308, loaded into the memory 1306, and executed by the processor 1302.
Upon reading this disclosure, those of skill in the art will appreciate still additional alternative embodiments of the disclosed principles herein. Thus, while particular embodiments and applications have been illustrated and described, it is to be understood that the disclosed embodiments are not limited to the precise construction and components disclosed herein. Various modifications, changes and variations, which will be apparent to those skilled in the art, may be made in the arrangement, operation and details of the method and apparatus disclosed herein without departing from the scope described herein.
Any of the steps, operations, or processes described herein may be performed or implemented with one or more hardware or software modules, alone or in combination with other devices. In one embodiment, a software module is implemented with a computer program product comprising a computer readable medium (e.g., non-transitory computer readable medium) containing computer program code, which can be executed by a computer processor for performing any or all of the steps, operations, or processes described.
Number | Name | Date | Kind |
---|---|---|---|
10299056 | Walsh | May 2019 | B2 |
20140355797 | North | Dec 2014 | A1 |
20150172812 | Wu | Jun 2015 | A1 |
20170230772 | Johnson | Aug 2017 | A1 |
20170230777 | Seldess et al. | Aug 2017 | A1 |
Number | Date | Country |
---|---|---|
WO 2017127271 | Jul 2017 | WO |
Entry |
---|
PCT International Search Report and Written Opinion, PCT Application No. PCT/US19/44547, dated Oct. 28, 2019, 20 pages. |
Number | Date | Country | |
---|---|---|---|
20200107124 A1 | Apr 2020 | US |