The invention relates to a method and an apparatus for mixing N information time signals which are respectively converted from the time domain to the frequency domain into one of N complex information signals, where N is an integer greater than 1. Such a method or such an apparatus is used, for instance, for interpolating or extrapolating microphone signals.
EP 2994094B1 discloses a method and an apparatus where an interpolated or extrapolated signal is generated from at least two microphone signals by mixing the microphone signals.
The known method relates to applications where microphones are in a sound field, where they convert a sound field value (e.g., the sound pressure) at their respective microphone positions into microphone signals, and where an estimate of the value of the sound field measure outside the microphone positions is desired, i.e., at a position interpolated or extrapolated from the microphone positions.
In the known method, the interpolated or extrapolated signal is similar to the sound field value at the interpolated or extrapolated position. The known method uses energy-based weighting of complex spectral values as well as a summation of the weighted complex spectral values which includes a correction to compensate for an energy error. As a result of the correction in the known method, the interpolated or extrapolated signal has the property of deviating only insignificantly in its mean energy from the sound field value at the interpolated or extrapolated position and retains this property even if the sound field is generated by sound waves of more than one sound source. The factors of the weighting in the known method are derived from the coefficients in the mathematical representation of the interpolated or extrapolated “virtual” position.
In the known method, the phase of the interpolated or extrapolated signal is not equal to the phase of the sound field value at the interpolated or extrapolated position. This is even the case in the known method if a direct sound field emanates from a single sound source. In the case where the sound field results from the sound waves from more than one sound source, the signal interpolated or extrapolated according to the known method differs even more in its phase from the sound field value at the interpolated or extrapolated position. Further, in the known method, extrapolation beyond more than one time the distance of the microphones is not possible. The microphone signals and the mentioned interpolated or extrapolated signals are complex-valued signals which, as is common, describe the state of a variable, in the present case the sound field value, with respect to a frequency.
An interpolated or extrapolated position is usually computed as a combination of the positions interpreted as vectors, in particular as a coefficient-weighted sum of the vectors, with the additional condition that the sum of the coefficients is equal to 1. Due to the additional condition, the number of dimensions of the interpolation or extrapolation becomes 1 less than the number of positions. This thus describes, for example, in the case of 2 positions, a one-dimensionally interpolated position on the straight line through the positions, or in the case of 3 positions, a two-dimensionally interpolated position in the plane through the positions, or in case of 4 positions a three-dimensional interpolated or extrapolated position in space.
The coefficients may be used as control parameters in regard to the object of the invention.
It should be pointed out that in the case of a direct sound field emanating from a single sound source, a meaningful statement about the phase of the sound field value at an interpolated or extrapolated position can be made as there is a physical relationship between the phase and the position in space, which can be approximated as a linear function by assuming a plane wave front.
It should be noted that, in the case of a diffuse sound field, a meaningful statement about the energy of the sound field value at an interpolated or extrapolated position is possible because there is a physical relationship between the energy and the position in space, which, assuming temporal averaging, can be approximated as constant.
In many practical applications, there is a sound field resulting from the sound waves from more than one sound source or from a superposition of direct sound and diffused sound.
It is the object of the invention to further improve the generation of an interpolated or extrapolated signal from at least two microphone signals. The microphones, which convert a sound field value into the microphone signals, are located at different microphone positions in a sound field.
The goal is that the interpolated or extrapolated signal deviates in its phase and in its energy at most insignificantly, as far as possible, from the value that the sound field value has at a position interpolated or extrapolated from the microphone positions.
The method according to the invention is characterized according to the features of claim 1. Preferable embodiments of the method according to the invention are defined by claims 2 to 10. The apparatus according to the invention is characterized according to claim 11. Preferable embodiments of the apparatus according to the invention are defined by claims 12 and 13.
The invention will be further described in the following description of the figures.
The invention will be described in more detail in the following description of the figures with reference to several embodiments.
The mixing method according to the invention will be further detailed with reference to
The result signal produced by the mixture may then be equalized, by interpolation, to a microphone signal of a fictitious microphone located at a location between the two microphones on the line connecting the two microphones. With an extrapolation, the result signal may then be equalized to a microphone signal of a fictitious microphone located at a location outside the two microphones on the connecting line through the two microphones.
The two microphone signals are illustrated in
Thereafter, frequency-matching complex spectral values v1(f1, t1) and v2(f1, t1) of the two transformed complex information signals are mixed in a mixing method to obtain a result spectral value m(f1, t1), as schematically indicated in
It should be noted here that the mixing methods indicated by blocks 100 and 101 in
After an inverse transformation of the complex output information signal m(f, t1) from the frequency domain into the time domain, such as, for example, by means of an inverse Fourier transform, the mixed time signal sc(t) in the time interval W1 is obtained.
The method now described may then be repeated for a subsequent time interval, as indicated by W2 in
According to the invention, each of the (N=) two complex spectral values is converted into a first component and a second component at block 208 (in
The method is thus performed for all frequency-matching spectral values of the (N=) two complex information signals until, at block 218, the complex output information signal m(f, t1) is obtained. Thereafter, at block 220, the complex output information signal is converted into the mixed time signal sc(t) by a back transformation from the frequency domain to the time domain.
As stated above, blocks 206 through 214 may be performed at the same time in parallel with each other in another embodiment of the flowchart for directly obtaining the complex output information signal m(f, t1).
The end points of the first components OP3 and OP7 and the second components OP4 and OP8 lie on a circle K. In this embodiment of the invention, this means that the amplitudes or vector lengths of the first and second components are equal. The radius of the circle K is dependent on the absolute values of the two spectral values v1(f1, t1) and v2(f1, t1). In particular, the following applies:
A first energy value E1(f1, t1) is equal to: ABS (v1(f1, t1))2.
A second energy value E2 (f1, t1) is equal to: ABS (v2(f1, t1))2.
The radius R of the circle K is thus equal to: SQRT {(E1+E2)/2}.
The root of the arithmetic mean of the energy values is hence a measure of the radius.
The determination of the radius in this first exemplary embodiment signifies the use of the assumption that the sound field consists of a superimposition of two direct sound fields, wherein the two assumed direct sound fields are equal, and thus causes that the estimate of the sound field value at the interpolated or extrapolated location is, as far as possible, independent of whether there is a direct sound field share in the sound field.
In this regard, this section first describes the determination of the coefficients of a mix, e.g., an interpolation or extrapolation. A position interpolated or extrapolated from given positions can, as is known, be represented mathematically, for example, by a linear combination, which is to be used in the following.
If a mixture is an interpolation or an extrapolation, then the sum of the coefficients of the linear combination is equal to 1. A mathematical representation of a one-dimensionally linearly interpolated or extrapolated position L from two given positions L1 and L2 is
L=L1*c1+L2*c2,
where c1 and c2 are coefficients, with
c1+c2=1.
If the microphone positions of the microphones giving the corresponding microphone signals s1(t) and s2(t) are used for L1 and L2, then c1 and c2 are the coefficients of the interpolation or extrapolation according to the invention.
An interpolation of the first components OP3 and OP7 results in a combination component OP9 in
(arc length of circle section P3−P9)/(arc length of circle section P9−P7)=c1/c2.
As already mentioned above in connection with
(arc length of circle section P4−P10)/(arc length of circle section P10−P8)=c1/c2.
The steps described above with reference to
It is additionally mentioned here that the radius calculation always must be performed anew for new pairs of complex spectral values, such as v1(f2, t1) and v2(f2, t1).
In the method described above, a mixing has been performed which has led to an interpolation of the two information-time signals. This is because c1 and c2 were both positive and less than one. The method described above could also lead to extrapolation. In this case, one of the two coefficients c1 or c2 would be negative and the other greater than one, wherein c1+c2=1 would still apply. This would mean that points P9 and P10 are still on the circle, but outside of the section P3-K-P7 or P4-K-P8.
In a second embodiment, which will be further detailed with reference to
The end points of the first components OP3 and OP7 lie on a circle K′. In this embodiment of the invention, this means that the amplitudes or vector lengths of the first components OP3 and OP7 are equal to one another. The end points of the first components OP4 and OP8 lie on a circle K″. In this embodiment of the invention, this means that the amplitudes or vector lengths of the second components OP4 and OP8 are equal to one another.
The radii of the two circles K′ and K″ are unequal, but dependent on the absolute values of the two spectral values v1(f1, t1) and v2(f1, t1).
In this second embodiment, it is assumed that one of the two assumed direct sound fields dominates and thus causes the estimation of the value of the sound field variable at the interpolated or extrapolated position to be as accurate as possible for the direct sound field component that dominates the sound field. In particular, the following applies to the calculation of the radii:
EA=(E1+E2)/2+Ed
EB=(E1+E2)/2−Ed
Ed should be greater than zero. On the other hand, Ed must not become too large, because then, the division of one of the two spectral values into components would no longer be possible. This would be the one with the smaller vector length, in this case OP5 in
The radius R′ of the circle K′ is now equal to: SQRT (EA).
The radius R″ of the circle K″ is now equal to: SQRT (EB).
The point P9 divides the circle section P3-K′-P7 into two parts such that:
(arc length of circle section P3−P9)/(arc length of circle section P9−P7)=c1/c2.
As already mentioned above in connection with
The steps described above with reference to
It is additionally mentioned here that the radii calculation always must be performed anew for new pairs of complex spectral values, such as v1(f2, t1) and v2(f2, t1).
In the method described above, a mixing has been performed which has led to an interpolation of the two information-time signals. This is because c1 and c2 were both positive and less than one. The method described above could also lead to extrapolation. In this case, one of the two coefficients c1 or c2 would be negative and the other one greater than 1, wherein c1+c2=1 would still apply.
Unit 540 is provided to calculate the radius of the circle K in
In the first embodiment, only one value of the radius of the circle K is derived in unit 540, see
For deriving the first combination component, the coefficients c1 and c2 are also needed. It should be noted, however, and it will be explained later with reference to
These two coefficients are supplied via inputs 560 and 562, respectively, or the one coefficient is supplied via only one input, either 560 or 562, to the mixing apparatus. These inputs are coupled to associated inputs 564 and 566, respectively, of unit 548. The first combination component OP9 is then available at output 556 of unit 548.
The mixing apparatus further includes a unit 550. In unit 550, the two second components OP4 and OP8, generated by unit 508 and 520, respectively, are combined, under the control of a control line 568 from control unit 530, into a second combination component OP10, as already explained with reference to
For deriving the second combination component, the coefficients c1 and c2 are also needed. Inputs 560 and 562 of the mixing apparatus are coupled to associated inputs 574 and 576, respectively, of unit 550. The second combination component OP10 is then available at output 578 of unit 550.
The mixing apparatus further includes unit 580. In unit 580, the first and second combination components OP9 and OP10 are combined, under control via a control line 582 from control unit 530, into a result spectral value OP11, as described above in connection with
Control unit 530 controls the units in the mixing apparatus such that two frequency-matching spectral values of two complex information signals are repeatedly processed in accordance with the steps of generating a result spectral value as described with reference to
In this case, we have N=3 and a computational representation of a position L which is two-dimensionally linearly interpolated or extrapolated from three given positions L1, L2 and L3 is
L=L1*c1+L2*c2+L3*c3,
where c1, c2 and c3 are coefficients, with
c1+c2+c3=1.
If L1, L2 and L3 are replaced by the microphone positions of the microphones providing the corresponding microphone signals s1(t), s2(t) and s3 (t), then c1, c2 and c3 are the coefficients of the interpolation or extrapolation according to the invention.
Inputs 602, 603 and 604 are envisaged for receiving the (N=) three complex information signals v1(f, t1), v2(f, t1) and v2(f, t1), respectively. Input 602 is coupled to input 606 of unit 608. Input 603 is coupled to input 607 of unit 617. Input 604 is coupled to input 618 of unit 620. Units 608, 617 and 620 form a first unit for converting each of the frequency-matching spectral values of the (N=) three complex information signals into a first and a second component as described with reference to
Unit 640 is provided to calculate the radius of the circle K in
A first energy value E1 (f1, t1) is equal to: ABS (v1(f1, t1))2.
A second energy value E2 (f1, t1) is equal to: ABS (v2(f1, t1))2.
A third energy value E3 (f1, t1) is equal to: ABS (v3 (f1, t1))2.
The radius R of the circle K is now equal to: SQRT {(E1+E2+E3)/3}.
The following applies to the derivation of K′ and K″.
In this case, unit 640 derives the radii of the circles K′ and K″ from the energy values EA and EB (see
EA=(E1+E2+E3)/3+Ed
EB=(E1+E2+E3)/3−Ed
Ed should be greater than zero. On the other hand, Ed must not become too large, because then, the division of one of the three spectral values into components would no longer be possible.
The radius R′ of the circle K′ is now equal to: SQRT (EA).
The radius R″ of the circle K″ is now equal to: SQRT (EB).
Output 638 of unit 540 is coupled to inputs 614, 615 and 626 of units 608, 617 and 620, respectively, for supplying the value of the radius of the circle K′ to units 608, 617 and 620. Output 636 of unit 640 is coupled to inputs 616, 619 and 628 of units 608, 617 and 620, respectively, for supplying the value of the radius of the circle K″ to units 608, 617 and 620.
In the first embodiment, only one value of the radius of the circle K is derived in unit 640, see
The mixing apparatus further includes unit 648. In unit 648, the three first components OP3, OP7 and OP12 generated by units 608 and 617 and 620, respectively, are combined, under the control of control line 658 from control unit 630, into a first combination component OP19. This will be further detailed with reference to
α4=c1*α1+c2*α2+c3*α3 formula(1) or
α4′=c2*α2′+c3*α3′ formula(2)
where α4′ is the angle between OP3 and OP19, α2′ is the angle between OP3 and OP7, and α3′ is the angle between OP3 and OP12.
If formula (2) is used to derive OP19, it is assumed that c1=0, so that c2+c3=1.
To this end, outputs 6100 of unit 608, 611 of unit 622 are coupled to associated inputs 652, 654 and 655, respectively, of unit 648. Unit 648 also needs the radius value of the circle K or K′, see
The derivation of the first combination component OP19 from OP3, OP7 and OP12 takes place in unit 648 as already described with reference to
These three or two coefficients are supplied to the mixing apparatus via inputs 660, 662, 663 or inputs 662, 663. These inputs are coupled to associated inputs 664, 666 and 667, respectively, of unit 648. The first combination component OP9 is then available at output 656 of unit 648.
The mixing apparatus further includes a unit 650. In unit 650, the three second components OP4, OP8 and OP13, generated by unit 608, 617 and 620, respectively, are combined, under the control of a control line 668 from the control unit 630, into a second combination component OP2o, as already explained with reference to
For deriving the second combination component OP2o, the coefficients ct, c2 and c3 are also needed. Inputs 660, 662 and 663 of the mixing apparatus are coupled to associated inputs 674, 676 and 667, respectively, of unit 650. The second combination component OP20 is then available at output 678 of unit 650.
The mixing apparatus further includes unit 680. In unit 680, the first and second combination components OP19 and OP20 are combined, under control via control line 682 from control unit 630, into a result spectral value OP21, as described above in connection with
Control unit 630 controls the units in the mixing apparatus such that three frequency-matching spectral values of three complex information signals are repeatedly processed in accordance with the steps of generating a result spectral value as described with reference to
It goes without saying that for N greater than 3, the apparatus can be extended accordingly for mixing N complex information signals, with N greater than three. Thus, for N=4, a device contains:
Analogously, as described above for N=2 and N=3, a computational representation of a position L which is three-dimensionally linearly interpolated or extrapolated from four given positions L1, L2, L3 and L4 is
L=L1*c1+L2*c2+L3*c3+L4*c4,
where c1, c2, c3 and c4 are coefficients, with
c1+c2+c3+c4=1.
If L1, L2, L3 and L4 are replaced by the microphone positions of the microphones providing the corresponding microphone signals s1(t), s2(t), s3(t) and s4(t), then c1, c2, c3 and c4 are the coefficients of the interpolation or extrapolation according to the invention. In summary, the following can be said.
Splitting the frequency-matching frequency values in first and second components, and combining the first and second components, respectively, is based on the assumption that the sound field consists of the superposition of two direct sound fields, wherein each of the components corresponds to one of the assumed direct sound fields. By this assumption, a mixture (interpolation or extrapolation) can be used for the components, which simulates the physical relationship of the sound field variable of a direct sound field and the position in space. Using this assumption results in the mixed (interpolated or extrapolated) signal being a good estimate of the value of the sound field measure at the interpolated or extrapolated position, as long as the sound field is caused by the sound waves of up to two sound sources.
Due to the equality of the amplitudes of all the first components and the equality of the amplitudes of all second components, the simulation of the physical relationship can be very simplistic, namely limited to a direct sound field with a planar wave front.
The equality of the mean energy of the interpolated or extrapolated components and the mean energy of all microphone signals means that a side assumption is used under which the mean energy of the sound field value in space is constant. As a result of this side assumption, the interpolated or extrapolated signal is still a useful estimate of the sound field value at the interpolated or extrapolated position as long as the assumption of at most two direct sound components deviates from reality.
The equality of the energies of all first components causes that the energies of the first components do not have to be interpolated or extrapolated, but the energy of the first interpolated or extrapolated component can simply be equated to them. The latter is so done. As a result, the first interpolation or extrapolation boils down to an interpolation or extrapolation of the phases of the first components.
The same applies analogously for the second components, the second interpolated or extrapolated component, the second interpolation or extrapolation and the phases of the second components.
Number | Date | Country | Kind |
---|---|---|---|
102017000040732 | Apr 2017 | IT | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/DE2018/100326 | 4/10/2018 | WO | 00 |