The present invention relates to an audio device capable of achieving virtual acoustic image localization at any desired position using sounds reproduced from speakers.
Conventionally, an audio reproduction technique (referred to as “virtual acoustic image localization technique” from now on) has been known which uses only two speakers, and has a listener perceive as if a sound source were present at any desired position in space.
A method of carrying out the virtual acoustic image localization technique is shown in the following Non-Patent Document 1, for example.
According to Non-Patent Document 1, the virtual acoustic image localization technique measures (or estimates) a transfer characteristic from a desired position in a space to ears of a person at any desired position in the same space in advance, and generates signals considered to reach the ears by a convolution of the transfer characteristic into an input sound source.
The signals thus generated are called “binaural signals”, and can make a listener feel as if the sound source were present at any given position by providing the binaural signals to the ears using a reproduction device such as headphones.
However, when the reproduction device is speakers, the following cross-talk cancellation processing becomes necessary to bring the binaural signals to the ears properly.
For example, in the speaker reproduction, if the signal to be provided to a first ear (right ear, for example) is reproduced with a first speaker (right speaker, for example) directly, “crosstalk” will occur in which the sounds produced from the right speaker reach not only to the right ear via a space transfer function G11, but to the left ear via the space transfer function G12, thereby being unable to provide the binaural signals to both ears properly.
In the case of being unable to provide the binaural signals to both ears properly, a problem occurs in that the acoustic image is not localized at a target position.
To solve the problem, the virtual acoustic image localization technique based on the speaker reproduction generally carries out cross-talk cancellation processing to suppress the crosstalk.
As for the example shown in
However, the foregoing cross-talk cancellation processing often causes deterioration in the sound quality because center-localized components (such as speech or vocal components) to be localized at the center are perceived to be pulled back, and hence cannot be heard clearly or are perceived as having echoes.
In addition, since it weakens low frequency components, it detracts impressive low frequency feeling.
Here, as for the center-localized components and low frequency components, in-phase components are dominant in both of them. In the following, a reason why the in-phase components are dominant will be described.
When generating the binaural signals for causing a particular sound source to be localized at the center, it is natural that the binaural signals are generated on the assumption that the sound source is placed in front of the listener.
When the sound source is placed in front of the listener, the sounds will arrive at the left ear and right ear of the listener almost at the same time. This can be understood from a reason that since a human face is almost symmetrical, the transfer characteristic from the frontal sound source position to the right ear is nearly equal to the transfer characteristic from the frontal sound source position to the left ear.
Not only in the binaural signals, but also in an ordinary stereo sound source, the center-localized components are recorded in a nearly right and left in-phase manner.
Accordingly, in the binaural signals and ordinary stereo signals, the in-phase components are dominant in the center-localized components. There are many cases where they are completely in-phase signals.
Next, when generating binaural signals that will cause a sound source to be localized at a 90-degree right side of a listener, the binaural signals are generated on the assumption that the sound source is placed at a 90-degree right side of the listener.
When the sound source is placed at the 90-degree right side of the listener, sounds will arrive at the right ear, first, and then at the left ear with a delay corresponding to the width of the face (difference in distance between the right and left ears).
It is known that a low frequency component is apt to diffract in comparison with middle to high frequency components. Thus, sounds with their amplitude intensity being little attenuated as compared with the sounds arriving at the right ear bend around and arrive at the left ear, as well.
In other words, the binaural signals become signals in which the signal for the right ear is output first, and then the signal for the left ear is output after a fixed time period. As for the low frequency components, the amplitude intensity difference between the right and left is small.
Here, the fixed time period, which is a delay time of a sound wave of about the face width, corresponds to the delay time of about 20-30 samples in a DVD audio signal sampled at 48000 Hz, for example.
Consider the case where the low frequency signal is 100 Hz or less. Then, its wavelength becomes 480 samples or more for one period.
Accordingly, even if delaying the low frequency signal of 100 Hz by 30 samples corresponding to the delay time of the face width, its phase is delayed only 1/16λ or less (where λ is a wavelength), which can be considered to be almost an in-phase signal without any problem.
At angles other than the right side 90 degrees, it is natural that the phase delays at the right and left become smaller than that.
Thus, as for the binaural signals, the low frequency components can be considered nearly in-phase components. In ordinary stereo sources, although the low frequency components are sometimes recorded while providing amplitude difference between the right and left, they are usually recorded as nearly in-phase components.
For the foregoing reasons, as for the center-localized components and low frequency components, the in-phase components are dominant in both of them.
Here, a case where in-phase component signals are input to the foregoing cross-talk cancellation processing will be described.
Here, they are schematic diagrams when approximating a transfer characteristic Hd by impulses and when approximating a transfer characteristic Hx by impulses involving delay and attenuation. Even if such approximation is not made, a rough inclination of the time response is the same.
When the in-phase components are input, the output signals of the audio device have the same time response for the right and left as shown in
In
About the ears of the listener sitting at the position supposed in the design stage of the cross-talk cancellation processing (referred to as “standard position” from now on), the response portions (b) cancel out each other, and the crosstalk is canceled completely.
However, when the listener shifts from the standard position even slightly, the response portions (b) do not cancel out each other so that the listener perceives deterioration in the sound quality with echoes.
In an actual listening environment, a listener is seldom sitting at the standard position so that in many cases the center-localized component signals have echoes. Thus, the acoustic image is pulled back, and the sound quality deteriorates as well.
The frequency characteristics of the output signals of the cross-talk cancellation processing to which the in-phase components are input have a peak in a middle range component of about 1000 Hz-3000 Hz as shown in
It is found in
As described above, in the conventional cross-talk cancellation processing, the center-localized components are pulled back theoretically, which causes the sound quality deterioration such as provided with echoes and the sound quality deterioration such as a weakened low frequency signal.
Besides the cross-talk cancellation processing disclosed in Non-Patent Document 1, the cross-talk cancellation processing is disclosed in the following Patent Documents 1 and 2, for example.
However, since the cross-talk cancellation processing operates in the completely same trend when the in-phase signals are input, the center-localized components are pulled back theoretically, which causes the sound quality deterioration such as provided with echoes and the sound quality deterioration such as a weakened low frequency signal.
With the foregoing configuration, the conventional audio device can bring, when the reproduction device is speakers, the binaural signals to the ears properly by carrying out the cross-talk cancellation processing. However, it has a problem of bringing about the sound quality deterioration of the center-localized components or low frequency components.
The present invention is implemented to solve the foregoing problem. Therefore it is an object of the present invention to provide an audio device capable of achieving good quality cross-talk cancellation processing which does not bring about the sound quality deterioration of the center-localized components or low frequency components.
An audio device in accordance with the present invention is configured in such a manner that it includes a signal processing means for providing an anti-phase component signal extracted by an anti-phase component extracting means with a transfer characteristic for canceling a crosstalk component, and that a first adding means adds the phase-inverted signal of the anti-phase component signal which is provided with the transfer characteristic for canceling the crosstalk component by the signal processing means and an in-phase component signal extracted by an in-phase component extracting means, and a second adding means adds the anti-phase component signal which is provided with the transfer characteristic for canceling the crosstalk component by the signal processing means and the in-phase component signal extracted by the in-phase component extracting means.
According to the present invention, since the audio device in accordance with the present invention is configured in such a manner that it includes a signal processing means for providing an anti-phase component signal extracted by the anti-phase component extracting means with a transfer characteristic for canceling a crosstalk component, and that a first adding means adds the phase-inverted signal of the anti-phase component signal which is provided with the transfer characteristic for canceling the crosstalk component by the signal processing means and an in-phase component signal extracted by an in-phase component extracting means, and a second adding means adds the anti-phase component signal which is provided with the transfer characteristic for canceling the crosstalk component by the signal processing means and the in-phase component signal extracted by the in-phase component extracting means, it offers an advantage of being able to achieve good quality cross-talk cancellation processing without bringing about sound quality deterioration of a center-localized component or low frequency component.
The best mode for carrying out the invention will now be described with reference to the accompanying drawings to explain the present invention in more detail.
An anti-phase component extracting section 2 receives the right signal R and left signal L of the audio signal, and carries out processing of extracting the anti-phase component signal S of the right signal R and left signal L. Incidentally, the anti-phase component extracting section 2 constitutes an anti-phase component extracting means.
As for the right signal R and left signal L of the audio signal received by the in-phase component extracting section 1 and anti-phase component extracting section 2, although they are preferably binaural signals, they are not limited to them. For example, any audio signals such as a signal output from a CD player or DVD player, a broadcast voice signal received with a DTV receiver, and a signal obtained by A/D converting an analog audio signal can become an object.
A signal processing section 3 carries out processing of providing the anti-phase component signal S extracted by the anti-phase component extracting section 2 with a transfer characteristic used for crosstalk component cancellation. More specifically, when the transfer characteristic for the sound which is reproduced from a speaker on one side (right speaker, for example) of the stereo speakers and arrives at the listener's ear on the same side as the speaker on the one side (right ear, for example), is represented by Hd, and the transfer characteristic for the sound which is reproduced from the speaker on the one side and arrives at the listener's ear on the other side of the speaker on the one side (left ear, for example), is represented by Hx, it carries out the processing of providing the anti-phase component signal S extracted by the anti-phase component extracting section 2 with the transfer characteristic (Hd+Hx)/(Hd−Hx). Incidentally, the signal processing section 3 constitutes a signal processing means.
An adder 4 adds the phase-inverted signal of the anti-phase component signal S, to which the transfer characteristic (Hd+Hx)/(Hd−Hx) is provided by the signal processing section 3, and the in-phase component signal M extracted by the in-phase component extracting section 1, and carries out processing of outputting the sum signal of the phase-inverted signal of the anti-phase component signal S and the in-phase component signal M as a driving signal Rout of the right speaker. Incidentally, the adder 4 constitutes a first adding means.
An adder 5 adds the anti-phase component signal S, to which the transfer characteristic (Hd+Hx)/(Hd−Hx) is provided by the signal processing section 3, and the in-phase component signal M extracted by the in-phase component extracting section 1, and carries out processing of outputting the sum signal of the anti-phase component signal S and the in-phase component signal M as a driving signal Lout of the left speaker. Incidentally, the adder 5 constitutes a second adding means.
In
Next, the operation will be described.
The right signal R and left signal L of the audio signal are bifurcated when they are input to the audio device, and are input to the in-phase component extracting section 1 and to the anti-phase component extracting section 2.
The in-phase component extracting section 1, receiving the right signal R and left signal L of the audio signal, extracts the in-phase component signal M of the right signal R and left signal L.
Here, as a method for the in-phase component extracting section 1 to extract the in-phase component signal M of the right signal R and left signal L, a method is conceivable which adds the right signal R and left signal L, and extracts the result of the addition as the in-phase component signal M, for example. The method is characterized by a low operation cost.
It is assumed here that the present embodiment 1 employs the method of extracting the in-phase component signal M by adding the right signal R and the left signal L, but it is not limited to the method. For example, it can also employ a method of extracting the in-phase component signal M using an adaptive digital filter.
More specifically, assume that the adaptive digital filter employs the right signal R as the input signal and the left signal L as the target signal of the adaptive digital filter (the input signal and the target signal are interchangeable), then it learns filter coefficients adaptively, and uses the output signal from the adaptive digital filter as the in-phase component signal M.
Besides, any extracting method of the in-phase component signal can be employed.
The anti-phase component extracting section 2, receiving the right signal R and left signal L of the audio signal, extracts the anti-phase component signal S of the right signal R and left signal L.
Here, as a method for the anti-phase component extracting section 2 to extract the anti-phase component signal S of the right signal R and left signal L, a method is conceivable which subtracts the right signal R from the left signal L (the left signal L and right signal R are interchangeable), and extracts the result of the subtraction as the anti-phase component signal S, for example. The method is characterized by a low operation cost.
It is assumed here that the present embodiment 1 employs the method of extracting the anti-phase component signal S by subtracting the right signal R from the left signal L, but it is not limited to the method. For example, it can also employ a method of extracting the anti-phase component signal S using an adaptive digital filter.
More specifically, assume that the adaptive digital filter employs the right signal R as the input signal and the left signal L as the target signal of the adaptive digital filter (the input signal and the target signal are interchangeable), then it learns filter coefficients adaptively, and uses the error signal between the output signal from the adaptive digital filter and the target signal as the anti-phase component signal S.
Besides, any extracting method of the anti-phase component signal can be employed.
The signal processing section 3, receiving the anti-phase component signal S of the right signal R and left signal L from the anti-phase component extracting section 2, performs digital filter processing on the anti-phase component signal S, thereby carrying out the processing of providing the anti-phase component signal S with the transfer characteristic (Hd+Hx)/(Hd−Hx) which is the transfer characteristic for canceling the crosstalk component.
The adder 4, receiving the in-phase component signal M of the right signal R and left signal L from the in-phase component extracting section 1 and the anti-phase component signal S to which the transfer characteristic (Hd+Hx)/(Hd−Hx) from the signal processing section 3 is provided, adds the phase-inverted signal of the anti-phase component signal S and the in-phase component signal M, and outputs the sum signal of the phase-inverted signal of the anti-phase component signal S and the in-phase component signal M as the driving signal Rout of the right speaker.
In other words, the adder 4 generates the driving signal Rout of the right speaker by adding the anti-phase component signal S output from the signal processing section 3 in the inverted phase and the in-phase component signal M output from the in-phase component extracting section 1 in the same phase, and outputs it.
The adder 5, receiving the in-phase component signal M of the right signal R and left signal L from the in-phase component extracting section 1 and receiving the anti-phase component signal S to which the transfer characteristic (Hd+Hx)/(Hd−Hx) from the signal processing section 3 is provided, adds the anti-phase component signal S and the in-phase component signal M, and outputs the sum signal of the anti-phase component signal S and the in-phase component signal M as the driving signal Lout of the left speaker.
In other words, the adder 5 generates the driving signal Lout of the left speaker by adding the anti-phase component signal S output from the signal processing section 3 in the same phase and the in-phase component signal M output from the in-phase component extracting section 1 in the same phase, and outputs it.
The operation of the adders 4 and 5 is applied to the case where the anti-phase component extracting section 2 extracts the anti-phase component signal S by subtracting the right signal R from the left signal L. In contrast, when the anti-phase component extracting section 2 extracts the anti-phase component signal S by subtracting the left signal L from the right signal R, they operate as follows.
More specifically, the adder 4 generates the driving signal Rout of the right speaker by adding the anti-phase component signal S output from the signal processing section 3 in the same phase and the in-phase component signal M output from the in-phase component extracting section 1 in the same phase, and outputs it.
On the other hand, the adder 5 generates the driving signal Lout of the left speaker by adding the anti-phase component signal S output from the signal processing section 3 in the inverted phase and the in-phase component signal M output from the in-phase component extracting section 1 in the same phase, and outputs it.
The driving signal Rout of the right speaker output from the adder 4 of the audio device and the driving signal Lout of the left speaker output from the adder 5 can be given by the following expression (1).
When the driving signal Rout of the right speaker output from the adder 4 is supplied to the right speaker, and when the driving signal Lout of the left speaker output from the adder 5 is supplied to the left speaker, the sounds ER and EL, which are reproduced by the right speaker and left speaker and arrive at the listener's ears, can be given by the following expression (2).
It is obvious from the expression (2) that the crosstalk component is completely eliminated from the sounds EL and ER arriving at the listener's right and left ears. However, it is also found that the characteristic (Hd+Hx) is provided.
The characteristic (Hd+Hx), however, is equivalent to the characteristic naturally provided when the sounds are reproduced from ordinary stereo speakers, and hence it does not cause any sound quality deterioration.
As is clear from
Accordingly, even if the listener shifts from the standard position, it can offer the good quality center-localized component without providing echoes to the center-localized component.
Here,
As is clear from
Accordingly, the low frequency component does not thin down, and hence impressive low frequency feeling can be offered.
As is clear from the foregoing description, according to the present embodiment 1, when the transfer characteristic for the sound which is reproduced by a speaker on one side (right speaker, for example) and arrives at the listener's ear on the same side as the speaker on the one side (right ear, for example), is denoted by Hd, and the transfer characteristic for the sound which is reproduced by the speaker on the one side and arrives at the listener's ear on the opposite side to the speaker on the one side (left ear, for example), is denoted by Hx, the embodiment 1 is configured in such a manner that it has the signal processing section 3 for providing the anti-phase component signal S extracted by the anti-phase component extracting section 2 with the transfer characteristic (Hd+Hx)/(Hd−Hx), and that the adder 4 adds the phase-inverted signal of the anti-phase component signal S provided with the transfer characteristic by the signal processing section 3 and the in-phase component signal M extracted by the in-phase component extracting section 1, and the adder 5 adds the anti-phase component signal S provided with the transfer characteristic by the signal processing section 3 and the in-phase component signal M extracted by the in-phase component extracting section 1. As a result, it offers an advantage of being able to achieve good quality cross-talk cancellation processing without bringing about the sound quality deterioration of the center-localized component or low frequency component.
Although the present embodiment 1 describes the processing for canceling the spatial crosstalk, it is not limited to it. For example, it is applicable to acoustic coupling within a box, which occurs when a plurality of speakers are mounted in the same box.
In this case, it is applicable by using as the transfer characteristic Hd the transfer characteristic caused by the amplifier section/speaker section/box and the like, and as the transfer characteristic Hx the acoustic coupling characteristic by which a speaker is coupled to the speaker on the other side.
A signal processing section 10 carries out processing of providing the anti-phase component signal S extracted by the anti-phase component extracting section 2 with the transfer characteristic (Hd+Hx)/(Hd−Hx) for canceling the crosstalk component in the same manner as the signal processing section 3 of
An adder 11 of the signal processing section 10 is a first adder that adds the anti-phase component signal S extracted by the anti-phase component extracting section 2 and a feedback signal output from a multiplier 13, and outputs the sum signal S1 of the anti-phase component signal S and the feedback signal.
A delay section 12 carries out the processing of delaying the sum signal S1 output from the adder 11 by n samples.
The multiplier 13 carries out processing of multiplying the sum signal S1 delayed by the delay section 12 by a constant α (α<1), and of outputting the multiplication result of the sum signal S1 and the constant α as the feedback signal.
An adder 14 is a second adder that adds the sum signal S1 output from the adder 11 and the feedback signal output from the multiplier 13, and outputs the addition result of the sum signal S1 and the feedback signal to the adders 4 and 5 as the anti-phase component signal S(Hd+Hx)/(Hd−Hx).
Next, the operation will be described.
Since it is the same as the foregoing embodiment 1 except for the signal processing section 10, only the operation of the signal processing section 10 will be described.
The signal processing section 10, receiving the anti-phase component signal S of the right signal R and left signal L from the anti-phase component extracting section 2, carries out processing of providing the anti-phase component signal S with the transfer characteristic (Hd+Hx)/(Hd−Hx) in the same manner as the signal processing section 3 of
More specifically, the adder 11 of the signal processing section 10, receiving the anti-phase component signal S of the right signal R and left signal L from the anti-phase component extracting section 2, adds the anti-phase component signal S and the feedback signal output from the multiplier 13, and outputs the sum signal S1 of the anti-phase component signal S and the feedback signal to the delay section 12 and adder 14.
The delay section 12, receiving the sum signal S1 from the adder 11, delays the sum signal S1 by the preset n samples, and outputs the delayed sum signal S1 to the multiplier 13.
The multiplier 13, receiving the delayed sum signal S1 from the delay section 12, multiplies the delayed sum signal S1 by a preset number α (α<1) to attenuate the signal intensity, and outputs the multiplication result of the sum signal S1 and the constant α to the adders 11 and 14 as the feedback signal.
The adder 14, receiving the sum signal S1 from the adder 11 and the feedback signal from the multiplier 13, adds the sum signal S1 and the feedback signal, and outputs the addition result of the sum signal S1 and the feedback signal to the adders 4 and 5 as the anti-phase component signal S(Hd+Hx)/(Hd−Hx).
In the present embodiment 2, the transfer characteristics Hd and Hx are approximated to simpler functions as shown in the following expression (3) to reduce the operation cost required by the signal processing section 10.
where Δ is the difference between the distance from the speaker on the one side to the ear on the side closer to the speaker and the distance from the speaker on the one side to the ear on the other side of the speaker, Fs is the sampling frequency of the audio signal, and c is the speed of sound.
The approximation shown by expression (3) indicates the behavior of sound waves when disregarding the reproduction environment (walls, floor and furniture of the room) and the diffraction/reflection with the shape of the countenance of the listener.
More specifically, it supposes only the distance attenuation of the amplitude intensity and the delay on the assumption of the transfer characteristics Hd and Hx.
In the reproduction environment of the audio device, since the room and the countenance of the listener (oval face/round face, small face/large face, and so on) cannot be determined in general, the approximation shown by expression (3) can also be said to be a robust approximation independently from the room or countenance of the listener at the reproduction.
Here, the output signal S2 of the signal processing section can be expressed by the following expression (4).
where z−n denotes the delay of n samples.
As is clear from expression (4), it is found that the signal processing section 10 of the present embodiment 2 can also provide the anti-phase component signal S with the transfer characteristic (Hd+Hx)/(Hd−Hx) in the same manner as the foregoing embodiment 1.
According to the present embodiment 2, since the signal processing section 10 is composed of only two adders 11 and 14, one delay section 12, one multiplier 13, and one feedback path, it offers an advantage of being able to reduce the operation cost very much.
A high frequency attenuation section 20 carries out processing of attenuating a high frequency component contained in the sum signal S1 output from the adder 11.
Although
A multiplier 21 carries out the processing of multiplying the sum signal S1 output from the adder 11 by a prescribed constant b0.
A delay section 22 carries out the processing of delaying the sum signal S1 output from the adder 11 by one sample.
A multiplier 23 carries out the processing of multiplying the sum signal S1 delayed by the delay section 22 by a prescribed constant b1.
An adder 24 carries out the processing of adding the multiplication result of the multiplier 21 and the multiplication result of the multiplier 23.
Next, the operation will be described.
The present embodiment 3 differs from the foregoing embodiment 2 in that the high frequency attenuation section 20 is mounted.
The high frequency attenuation section 20, receiving the sum signal S1 from the adder 11, performs moving average processing on the sum signal S1, thereby carrying out the processing of attenuating the high frequency component contained in the sum signal S1.
The contents of the processing of the high frequency attenuation section 20 will be described concretely below.
The multiplier 21 of the high frequency attenuation section 20, receiving the sum signal S1 from the adder 11, multiplies the sum signal S1 by the prescribed constant b0.
In addition, receiving the sum signal S1 from the adder 11, the delay section 22 delays the sum signal S1 by one sample.
After the delay section 22 delays the sum signal S1 by one sample, the multiplier 23 multiplies the delayed sum signal S1 by the prescribed constant b1.
The adder 24 adds the multiplication result of the multiplier 21 and the multiplication result of the multiplier 23, and outputs the addition result to the delay section 12.
Here, although the device is described in which the high frequency attenuation section 20 attenuates the high frequency component contained in the sum signal S1 by performing second-order moving average processing on the sum signal S1, it is not limited to it. For example, a device is also possible that attenuates the high frequency component by performing higher-order moving average processing. Furthermore, instead of the moving average processing, it is also possible to use, for example, an IIR filter or a low frequency component extracting filter to attenuate the high frequency component.
The present embodiment 3 approximates the transfer characteristics Hd and Hx by simple functions as shown by the following expression (5), thereby being able to achieve the reduction in the operation cost necessary for the signal processing section 10 and to apply more sophisticated transfer characteristics.
where L represents the characteristics when the second-order moving average processing is performed, which can be replaced by the characteristics of the higher-order moving average processing. Alternatively, it can be replaced by the frequency characteristics of the IIR filter or of the low frequency component extracting filter.
In contrast to the approximation shown by expression (3) in the foregoing embodiment 2, the approximation shown by expression (5) becomes an approximation in which the diffraction characteristics of the countenance are reflected.
More specifically, the transfer characteristic Hx, which has its high frequency component attenuated owing to the diffraction of the countenance, approximates the high frequency attenuation characteristics by L.
Accordingly, it gives a more sophisticated approximation than the foregoing embodiment 2.
Here, the output signal S2 of the signal processing section 10 can be represented by the following expression (6).
As is clear from expression (6), it is found that the signal processing section 10 in the present embodiment 3 can provide the anti-phase component signal S with the transfer characteristic (Hd+Hx)/(Hd−Hx) in the same manner as the foregoing embodiment 1 or 2.
According to the present embodiment 3, it offers an advantage of being able to achieve the high quality cross-talk cancellation processing considering the diffraction characteristics due to the countenance at a low operation cost about the same level as the foregoing embodiment 2.
A signal output section 32 receives and bifurcates the left signal L of the audio signal, outputs a first left signal L as the driving signal Lout1 of the left main speaker, and outputs a second left signal L to the anti-phase component extracting section 33. Incidentally, the signal output section 32 constitutes a second signal output means.
As for the right signal R and left signal L of the audio signal input to the signal output sections 31 and 32, although they are preferably binaural signals, they are not limited to them. For example, any audio signals such as a signal output from a CD player or DVD player, a broadcast voice signal received with a DTV receiver, and a signal obtained by A/D converting an analog audio signal can become an object.
The anti-phase component extracting section 33 receives the right signal R and left signal L of the audio signal output from the signal output sections 31 and 32, and carries out the processing of extracting the anti-phase component signal S of the right signal R and left signal L. Incidentally, the anti-phase component extracting section 33 constitutes an anti-phase component extracting means.
A signal processing section 34 carries out the processing of providing the anti-phase component signal S extracted by the anti-phase component extracting section 34 with the transfer characteristic for canceling the crosstalk component. More specifically, when the transfer characteristic for the sound which is reproduced from a main speaker on one side (right main speaker, for example) and arrives at the listener's ear on the other side of the main speaker on the one side (left ear, for example), is HD X, the transfer characteristic for the sound which is reproduced from the canceling speaker on the one side (right canceling speaker, for example) and arrives at the listener's ear on the same side as the canceling speaker on the one side (right ear, for example), is HS D, and the transfer characteristic for the sound which is reproduced from the canceling speaker on the one side and arrives at the listener's ear on the other side of the canceling speaker on the one side (left ear, for example), is HS X, the signal processing section 34 carries out the processing of providing the anti-phase component signal S extracted by the anti-phase component extracting section 34 with the transfer characteristic HD X/(HS D−HS X). Incidentally, the signal processing section 34 constitutes a signal processing means.
A phase inverting section 35 inverts the phase of the anti-phase component signal S provided with the transfer characteristic by the signal processing section 34, and carries out the processing of outputting the phase-inverted anti-phase component signal as the driving signal Rout2 of the right canceling speaker. Incidentally, the phase inverting section 35 constitutes a third signal output means.
A signal output section 36 carries out the processing of outputting the anti-phase component signal S provided with the transfer characteristic by the signal processing section 34 as the driving signal Lout2 of the left canceling speaker. Incidentally, the signal output section 36 constitutes a fourth signal output means.
In
Incidentally, HD D represents the transfer characteristic for the sound which is reproduced from the main speaker on the one side (right main speaker, for example) and arrives at the listener's ear on the same side as the main speaker on the one side (right ear, for example).
Next, the operation will be described.
The signal output section 31, receiving the right signal R of the audio signal, bifurcates the right signal R, outputs the first right signal R as the driving signal Rout1 of the right main speaker, and outputs the second right signal R to the anti-phase component extracting section 33.
The signal output section 32, receiving the left signal L of the audio signal, bifurcates the left signal L, outputs the first left signal L as the driving signal Lout1 of the left main speaker, and outputs the second left signal L to the anti-phase component extracting section 33.
The anti-phase component extracting section 33, receiving the right signal R and left signal L of the audio signal from the signal output sections 31 and 32, extracts the anti-phase component signal S of the right signal R and left signal L in the same manner as the anti-phase component extracting section 2 of
The signal processing section 34, receiving the anti-phase component signal S of the right signal R and left signal L from the anti-phase component extracting section 33, performs the digital filter processing on the anti-phase component signal S, thereby carrying out the processing of providing the anti-phase component signal S with transfer characteristic for canceling the crosstalk component HD X/(HS D−HS X).
The phase inverting section 35, receiving the anti-phase component signal S provided with the transfer characteristic from the signal processing section 34, inverts the phase of the anti-phase component signal S, and outputs the phase-inverted anti-phase component signal S as the driving signal Rout2 of the right canceling speaker.
The signal output section 36, receiving the anti-phase component signal S provided with the transfer characteristic from the signal processing section 34, outputs the anti-phase component signal S as the driving signal Lout2 of the left canceling speaker.
Here, the description is made by way of example in which the phase inverting section 35 outputs the phase-inverted anti-phase component signal S as the driving signal Rout2 of the right canceling speaker and the signal output section 36 outputs the anti-phase component signal S as the driving signal Lout2 of the left canceling speaker, and in which this operation is applied to the case where the anti-phase component extracting section 33 extracts the anti-phase component signal S by subtracting the right signal R from the left signal L.
In the case where the anti-phase component extracting section 33 extracts the anti-phase component signal S by subtracting the left signal L from the right signal R, the phase inverting section 35 outputs the phase-inverted anti-phase component signal S as the driving signal Lout2 of the left canceling speaker, and the signal output section 36 outputs the anti-phase component signal S as the driving signal Rout2 of the right canceling speaker.
As for the driving signal Rout1 of the right main speaker, the driving signal Lout1 of the left main speaker, the driving signal Rout2 of the right canceling speaker, and the driving signal Lout2 of the left canceling speaker, which are output from the audio device, they can be given by the following expression (7).
When the driving signal Rout1 of the right main speaker output from the audio device is output to the right main speaker, the driving signal Lout1 of the left main speaker is output to the left main speaker, the driving signal Rout2 of the right canceling speaker is output to the right canceling speaker, and the driving signal Lout2 of the left canceling speaker is output to the left canceling speaker, the sounds ER and EL which arrive at the ears of the listener after reproduced from the right and left main speakers and from the right and left canceling speakers are given by the following expression (8).
As is clear from expression (8), it is found that the sounds EL and ER arriving at the listener's right and left ears have their crosstalk components eliminated completely. It is also found, however, that the characteristic (HD D+HD X) is provided.
The characteristic (HD D+HD X), however, is equivalent to the characteristic naturally provided when the sounds are reproduced from the main speaker, and hence it does not bring about the sound quality deterioration.
In the present embodiment 4, as is clear from
Accordingly, even if the listener shifts from the standard position, it can offer the good quality center-localized component without adding echoes to the center-localized component.
In addition, it is found that the frequency characteristics of the in-phase component always become flat, and that the attenuation of the low frequency component does not occur theoretically.
Accordingly, the low frequency component does not thin down, and hence it offers an advantage of being able to provide an impressive low frequency feeling.
A signal processing section 40 carries out the processing of providing the anti-phase component signal S extracted by the anti-phase component extracting section 33 with the transfer characteristic HD X/(HS D−HS X) in the same manner as the signal processing section 34 of
A delay section 41 (first delay section) of the signal processing section 40 carries out the processing of delaying the anti-phase component signal S extracted by the anti-phase component extracting section 33 by n samples.
A multiplier 42 (first multiplier) carries out the processing of multiplying the anti-phase component signal S delayed by the delay section 41 by a constant α (α<1).
An adder 43 adds the anti-phase component signal S multiplied by the constant α with the multiplier 42 and the feedback signal output from a multiplier 45, and carries out the processing of outputting the sum signal of the anti-phase-component signal S and the feedback signal to a delay section 44, and of outputting the sum signal to the phase inverting section 35 and signal output section 36 as the anti-phase component signal S·HD X/(HS D−HS X).
The delay section 44 (second delay section) carries out the processing of delaying the sum signal output from the adder 43 by m samples.
The multiplier 45 (second multiplier) multiplies the sum signal delayed by the delay section 44 by a constant β (β<1), and carries out the processing of outputting the multiplication result of the sum signal and the constant β to the adder 43 as the feedback signal.
Next, the operation will be described.
Except for the signal processing section 40, since it is the same as the foregoing embodiment 4, only the operation of the signal processing section 40 will be described.
In the same manner as the signal processing section 34 of
More specifically, the delay section 41 of the signal processing section 40, receiving the anti-phase component signal S from the anti-phase component extracting section 33, delays the anti-phase component signal S by preset n samples, and outputs the delayed anti-phase component signal S to the multiplier 42.
Receiving the delayed anti-phase component signal S from the delay section 41, the multiplier 42 multiplies the delayed anti-phase component signal S by the preset constant α (α<1) to attenuate the signal intensity, and outputs the multiplication signal S1 of the delayed anti-phase component signal S and the constant α to the adder 43.
Receiving the multiplication signal S1 from the multiplier 42 and the feedback signal from the multiplier 45, the adder 43 adds the multiplication signal S1 and the feedback signal, outputs the sum signal S2 of the multiplication signal S1 and the feedback signal to the delay section 44, and outputs the sum signal S2 to the phase inverting section 35 and signal output section 36 as the anti-phase component signal S·HD X/(HS D−HS X).
Receiving the sum signal S2 from the adder 43, the delay section 44 delays the sum signal S2 by the preset m samples, and outputs the delayed sum signal S2 to the multiplier 45.
Receiving the delayed sum signal S2 from the delay section 44, the multiplier 45 multiplies the delayed sum signal S2 by the preset constant β (β<1) to attenuate the signal intensity, and outputs the multiplication result of the sum signal S2 and the constant β to the adder 43 as the feedback signal.
The present embodiment 5 reduces the operation cost necessary for the signal processing section 40 by approximating the transfer characteristics HD X, HS D and HS X by simple functions as shown in the following expression (9).
where Δ1 is the difference between the distance from the canceling speaker (right canceling speaker, for example) to the ear on the side closer to the canceling speaker (right ear, for example) and the distance from the main speaker (right main speaker, for example) to the ear on the other side of the main speaker (left ear, for example), and Δ2 is the difference between the distance from the canceling speaker (right canceling speaker, for example) to the ear on the side closer to the canceling speaker (right ear, for example) and the distance from the canceling speaker to the ear on the other side of the canceling speaker (left ear, for example).
In addition, Fs denotes the sampling frequency of the audio signal, and c represents the speed of sound.
The approximation shown by expression (9) indicates the behavior of sound waves when disregarding the reproduction environment (walls, floor and furniture of the room) and the diffraction/reflection with the shape of the countenance of the listener just as the approximation shown by expression (3).
Here, the output signal S2 of the signal processing section 40 can be expressed by the following expression (10).
where z−n denotes the delay of n samples, and z−m denotes the delay of m samples.
As is clear from expression (10), it is found that the signal processing section 40 of the present embodiment 5 can also provide the anti-phase component signal S with the transfer characteristic HD X/(HS D−HS X) in the same manner as the foregoing embodiment 4.
According to the present embodiment 5, since the signal processing section 40 is composed of only one adder 43, two delay sections 41 and 44, two multipliers 42 and 45, and one feedback path, it offers an advantage of being able to reduce the operation cost very much.
As described above, the present invention is suitable for an audio device that achieves good quality cross-talk cancellation processing without involving the sound quality deterioration of the center-localized component or low frequency component.
Number | Date | Country | Kind |
---|---|---|---|
2007-211018 | Aug 2007 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2008/002193 | 8/11/2008 | WO | 00 | 12/8/2009 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2009/022463 | 2/19/2009 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4239937 | Kampmann | Dec 1980 | A |
4975954 | Cooper et al. | Dec 1990 | A |
5414774 | Yumoto | May 1995 | A |
5862228 | Davis | Jan 1999 | A |
5974153 | Cashion et al. | Oct 1999 | A |
6091894 | Fujita et al. | Jul 2000 | A |
7242782 | Kasai et al. | Jul 2007 | B1 |
7492906 | Matsuoka et al. | Feb 2009 | B2 |
7702111 | Gunnarsson | Apr 2010 | B2 |
8085950 | Suzuki | Dec 2011 | B2 |
20040170281 | Nelson et al. | Sep 2004 | A1 |
20050220312 | Kasai et al. | Oct 2005 | A1 |
20070189551 | Kimijima | Aug 2007 | A1 |
20110064243 | Katayama et al. | Mar 2011 | A1 |
20110293111 | Chang et al. | Dec 2011 | A1 |
Number | Date | Country |
---|---|---|
7-46700 | Feb 1995 | JP |
2000-50399 | Feb 2000 | JP |
2000-506691 | May 2000 | JP |
WO-9730566 | Aug 1997 | WO |
Number | Date | Country | |
---|---|---|---|
20100172505 A1 | Jul 2010 | US |