The invention relates to a method for mixing microphone signals of an audio recording with a plurality of microphones.
It is recognized (“Handbuch der Tonstudiotechnik” by Michael Dickreiter et al., ISBN 978-3598117657, pp. 211-212, 230-235, 265-266, 439, 479) to use several microphones instead of a single microphone in order to capture vast acoustic sceneries during the production of audio recordings for canned music, films, broadcasting, sound archives, computer games, multi-media presentations or websites. Therefore the term “multi-microphone audio recording” is generally used. A vast acoustic scenery may be, e.g., a concert hall with an orchestra of several musical instruments. In order to capture tonal details each individual instrument is recorded with an individual microphone positioned closely to the instrument and, in order to record the overall acoustics including the echoes in the concert hall and audience noises (applause in particular), additional microphones are positioned in a greater distance.
Another example of a vast acoustic scenery is a drum set consisting of several pulsatile instruments which is recorded in a recording studio. For a “multi-microphone audio recording” individual microphones are positioned near each pulsatile instrument and an additional microphone is installed above the drummer.
Such multi-microphone recordings allow for a maximized number of acoustic and tonal details along with the overall acoustics of the scenery to be captured in a high quality and to shape them aesthetically satisfactory. Each microphone signal of the several microphones is usually recorded as a multi-trace recording. During the following mixing of the microphone signals further creative work is done. In special cases it is possible to mix immediately “live” and only record the product of the mixing.
The creative goals of the mixing process are generally the balance of volumes of all sound sources, a natural sound and a reality-like spatial impression of the overall acoustics.
During the common mixing technique in an audio mixing console or in the mixer function of digital editing systems, a sum of the added microphone signals is produced, conducted by a summing unit (“bus”) which is a technical realization of a common mathematical addition. In
100 a first microphone signal
101 a second microphone signal
110 a summation level based on an addition
111 a sum signal
199 a result signal
200 an nth sum signal
201 an n+2th microphone signal
210 an n+1th summation level based on an addition
211 an n+1th sum signal
With the multi-microphone audio recording at least two microphone signals contain portions of sound which originate from the same sound source due to the ineluctable multipath propagation of sound. As these portions of sound reach the microphones with varying delays due to their varying sound paths a comb-filter effect occurs with the common mixing technique in the summing unit which can be heard as sound changes and which run counter to the intended natural sound. In the common mixing technique those sound changes based on comb-filter effects can be reduced by an adjustable amplification and a possible adjustable delay of the recorded microphone signals. However, such a reduction is only restrictively possible in case of a multipath propagation of sound from more than a single sound source. In any case a significant adjustment of the mixing console or the digital editing system is required for figuring out the best compromise.
In the earlier DE 10 2008 056 704 a down-mixing (so-called “downmixing”) for the production of a two-channel audio format from a multi-channel (e.g., five-channel) audio format is described which projects phantom audio sources. Here two input signals are summed up, wherein a loading with a corrective factor of the spectral coefficients of one of the two input signals to be summed up is conducted; the input signal which is loaded with the corrective factor is prioritized over the other input signal. The determination of the corrective factor as described in DE 10 2008 056 704, however, leads to possibly audible disturbing ambient noises in cases in which the amplitude of the prioritized signal over the non-prioritized signal is low. The likelihood of occurrence of such disturbances is low, but it cannot be manipulated.
A method of mixing microphone signals of an audio recording with several microphones is known from WO 2004/084 185 A1 in which spectral values of overlapping time windows of samples of a first microphone signal and a second microphone signal respectively are generated. The spectral values of the first microphone signal are distributed onto the spectral values of the second microphone signal in a first summation level, wherein a dynamic correction of the spectral values of one of the microphone signals is conducted. Spectral values of a result signal are made up of the spectral values of the first summation signal which are subject to an inverse Fourier-transformation and block junction. Thus, for every block of samples individual corrective factors can be determined. The dynamic correction by a signal depending loading of spectral coefficients instead of a common addition reduces unwanted comb-filter effects during multi-microphone mixing which occur in the summing element of the mixing console or editing system due to common addition. However, with this method disturbing ambient noises are audible if the amplitude of the prioritized signal is low compared to that of the non-prioritized signal.
The task of the invention is to compensate the tonal change which occurs due to multipath propagation of sound portions during the mixing of multi-microphone recordings as far as possible.
The invention is described by means of the embodiments given in the figures wherein:
The reference numbers of
100 a first microphone signal
101 a second microphone signal
199 a result signal
201 an n+2th microphone signal
300 spectral values of the first microphone signal
301 spectral values of the second microphone signal
310 a first summing level
311 spectral value of a first sum signal
320 a block-building and spectral transformation unit
330 an inverse spectral transformation and block junction unit
399 spectral values of a result signal
400 spectral values of an nth sum signal
401 spectral values of an n+2th microphone signal
410 an n+1th summing level
411 spectral values of an n+1th sum signal
500 allocation unit
501 spectral values A(k) of the prioritized signal
502 spectral values B(k) of the non-prioritized signal
510 calculation unit for corrective factor values
511 corrective factor values m(k)
520 multiplier-summer unit
700 an nth building group consisting of unit 320 and the n+1th summing level 410.
The block diagram shown in
eA(k)=Real(A(k))·Real(A(k))+Imag(A(k))·Imag(A(k))
x(k)=Real(B(k))·Real(B(k))+Imag(A(k))·Imag(A(k))
w(k)=D·x(k)/eA(k)
m(k)=(w(k)2+1)(1/2)−w(k)
or the corrective factor m(k) is calculated as follows:
eA(k)=Real(A(k))·Real(A(k))+Imag(A(k))·Imag(A(k))
eB(k)=Real(B(k))·Real(B(k))+Imag(B(k))·Imag(B(k))
x(k)=Real(B(k))·Real(B(k))+Imag(A(k))·Imag(A(k))
w(k)=D·x(k)/eA(k)+L·eB(k))
m(k)=(w(k)2+1)(1/2)−w(k)
wherein it means that
m(k) is the kth corrective factor
A(k) is the kth spectral value of the signal to be prioritized
B(k) is the kth spectral value of the signal not to be prioritized
D is the grade of compensation
L is the grade of the limitation of the compensation
Grade D of compensation is a numeric value which determines in how far the sound changes due to comb-filter effects are balanced. It is chosen according to the creative demand and the intended tonal effect and is advantageously in the rage of 0 to 1. If D=0 the sound equals exactly the sound of conventional mixing. If D=1 the comb-filter effect is completely removed. For values of D between 0 and 1 the tonal result is accordingly between the ones for D=0 and D=1.
Grade L of the limitation of the compensation is a numeric value which determines in how far the probability of the occurrence of disturbing ambient noises is reduced. Said probability is given when the amplitude of the microphone signal to be prioritized is low in contrast to the microphone signal not to be prioritized. L>=0 is valid. If L=0 not reduction of the probability of disturbing ambient noises is given. Grade L is to be chosen that according to experience just as no more ambient noises can be heard. Typically grade L is of the order of 0.5. The bigger grade L the smaller the probability of ambient noises, but the balance of tonal changes as adjusted by D may also be reduced.
The spectral value A(k) of the signal to be prioritized 501 is additionally lead to a multiplier 520, whereas the spectral values B(k) of the signal not to be prioritized 502 is additionally lead into a summer 530. Furthermore, the corrective factor values m(k) of the output signal 511 are fed into the calculation unit 510 where they are multiplied complexly (according to real part and imaginary part) with the spectral values A(k) 501. The resulting values of the multiplier 520 are fed into the summer 530 where they are added complexly (according to real part and imaginary part) to the spectral values B(k) of the signal not to be prioritized 502. This results in the spectral values 311 of the first sum signal of the first summing level 310.
What is important for the prioritization is the multiplication of the corrective factor m(k) with exactly one of the two summands of the addition conducted in the summer 530. Thus, the complete signal path of this summand is “prioritized” from the microphone signal input to the summer 530.
It is apparent that this invention does not only refer to microphone signals but generally to every audio signal facing the problem described above.
Accordingly the input signals can be general audio signals which originate from audio recordings, which are available in the form of audio files or sound tracks which were saved for further editing in a storage.
Additionally the invention can be implemented in different ways, such as, e.g., a software, which runs on a computer, hardware, a combination thereof and/or a special circuit.
Number | Date | Country | Kind |
---|---|---|---|
10 2009 052 992 | Nov 2009 | DE | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2010/066657 | 11/2/2010 | WO | 00 | 5/29/2012 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2011/057922 | 5/19/2011 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5228093 | Agnello | Jul 1993 | A |
6339758 | Kanazawa et al. | Jan 2002 | B1 |
6668062 | Luo et al. | Dec 2003 | B1 |
7171007 | Rajan | Jan 2007 | B2 |
7315623 | Gierl et al. | Jan 2008 | B2 |
7327852 | Ruwisch | Feb 2008 | B2 |
7995767 | Amada | Aug 2011 | B2 |
8090111 | Aichner et al. | Jan 2012 | B2 |
8170882 | Davis | May 2012 | B2 |
8275147 | Beavis et al. | Sep 2012 | B2 |
Number | Date | Country |
---|---|---|
1333994 | Jan 2002 | CN |
1761998 | Apr 2006 | CN |
1926607 | Mar 2007 | CN |
101484938 | Jul 2009 | CN |
1 133 899 | Aug 2008 | EP |
2004084185 | Sep 2004 | WO |
WO 2005086139 | Sep 2005 | WO |
Entry |
---|
Bernfried Runow, Automatischer Stereo-Downmix Von 5.1-Mehrkanalproduktionen, Diplomarbeit, Hochschule Der Medien Stuttgart, Jul. 6, 2008, Chapter 7, XP002568518, English translation. |
Bernfried Runow, Automatischer Stereo-Downmix Von 5.1-Mehrkanalproduktionen, Diplomarbeit, Hochschule Der Medien Stuttgart, Jul. 6, 2008, Seiten 1-150, XP002568518. |
International Search Report issued Apr. 13, 2011 in PCT Application No. PCT/EP2010/066657, filed Nov. 2, 2010. |
International Preliminary Report issued issued Apr. 13, 2011 in PCT Application No. PCT/EP2010/066657, filed Nov. 2, 2010 (English translation). |
Bernfried Runow, Automatic Stereo Down Mix of Multichannel Productions, Diplomarbeit, Jul. 6, 2008, pp. 42-117 (url:http://www.b-public.de/da/da—runow—downmix.pdf). |
Number | Date | Country | |
---|---|---|---|
20120237055 A1 | Sep 2012 | US |