BRIEF DESCRIPTION OF THE DRAWINGS
The invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:
FIG. 1 is a schematic diagram of a broadside small array microphone beamforming apparatus according to an embodiment of the invention;
FIG. 2 is a schematic diagram of a bidirectional polar pattern and an omni-directional polar pattern according to an embodiment of the invention;
FIG. 3 is a schematic diagram of two single main lobe polar patterns and an omni-directional polar pattern according to an embodiment of the invention;
FIG. 4 is a schematic diagram of a directional microphone forming unit according to an embodiment of the invention;
FIG. 5 is a schematic diagram of an adaptive channel decoupling unit according to another embodiment of the invention;
FIG. 6 is a schematic diagram of an adaptive channel decoupling unit according to another embodiment of the invention;
FIG. 7 is a schematic diagram of an adaptive channel decoupling unit according to another embodiment of the invention;
FIG. 8 is a schematic diagram of a broadside small array microphone beamforming apparatus according to another embodiment of the invention;
FIG. 9 is a schematic diagram of a directional microphone forming unit according to another embodiment of the invention;
FIG. 10 is a schematic diagram of an adaptive channel decoupling unit according to another embodiment of the invention; and
FIG. 11 is a schematic diagram of an adaptive channel decoupling unit according to another embodiment of the invention.
DETAILED DESCRIPTION OF THE INVENTION
The following description is of the best-contemplated mode of carrying out the invention. This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.
FIG. 1 is a schematic diagram of broadside small array microphone beamforming apparatus 100 according to an embodiment of the invention. Broadside small array microphone beamforming apparatus 100 comprises omni-directional microphones Mic1 and Mic2, microphone calibration unit 110, directional microphone forming unit 120, adaptive channel decoupling unit 140, transformer 150, noise suppression units 160 and 170 and inverse transformer 180. Omni-directional microphones Mic1 and Mic2 respectively convert voice from desired near-end talker 101 into first and second signals S1 and S2. Second and first omni-directional microphones Mic1 and Mic2 and desired near-end talker 101 are respectively arranged at three points of a triangle, referred to as a broadside way, as shown in FIG. 1. Microphone calibration unit 110 receives first and second signals S1 and S2 and correspondingly outputs first and second calibration signals X1 and X2. Directional microphone forming unit 120 receives first and second calibration signals X1 and X2 to generate first directional microphone signal d1 with a bidirectional polar pattern. Adaptive channel decoupling unit 140 receives first calibration signal X1 and first directional microphone signal d1 to generate first main channel signal m1 and first reference channel signal r1 for noise detection. In another embodiment of the invention, adaptive channel decoupling unit 140 receives the sum of the first calibration signal X1 and the second calibration signal X2 and receives first directional microphone signal d1 to generate first main channel signal m1 and first reference channel signal r1 for noise detection.
FIG. 2 is a schematic diagram of bidirectional polar pattern 201 and omni-directional polar pattern 203 according to an embodiment of the invention. Bidirectional polar pattern 201 comprises two main lobes. One lobe points left and another lobe points right, one lobe points up and another lobe points down, or one lobe points right up and another lobe points left down. Desired talker 205 faces the null of bidirectional polar pattern 201, as shown in FIG. 2. According to an embodiment of the invention, first and second omni-directional microphones Mic1 and Mic2 form a directional microphone with bidirectional polar pattern 201 for noise detection, and one of first and second omni-directional microphones Mic1 and Mic2 is used as a main microphone.
FIG. 3 is a schematic diagram of two single main lobe polar patterns 301 and 302 and omni-directional polar pattern 303 according to an embodiment of the invention. Two single main lobe polar patterns 301 and 302 can be formed by two omni-directional microphones. One lobe points left and another lobe points right, one lobe points up and another lobe points down, or one lobe points right up and another lobe points left down. Desired talker 205 faces the cross point or the equal gain point of two single lobes 301 and 302, as shown in FIG. 3.
FIG. 4 is a schematic diagram of directional microphone forming unit 120 according to an embodiment of the invention. Directional microphone forming unit 120 comprises phase adjustment units 401 and 402 and subtractor 407. Phase adjustment unit 401 shifts first calibration signal X1 phase P1 to generate first shifted signal XP1. Phase adjustment unit 402 shifts second calibration signal X2 phase P2 to generate second shifted signal XP2. Subtractor 407 subtracts second shifted signal XP2 from first shifted signal XP1 to generate first directional microphone signal d1 with a bidirectional polar pattern, as shown in FIG. 2. According to an embodiment of the invention, phase P1 is zero and Phase P2 is also zero. Thus, first directional microphone signal d1 is equal to second calibration signal X2 subtracted by first calibration signal X1 (d1=X1−X2). First microphone signal d1 is a signal with a bidirectional polar pattern.
FIG. 5 is a schematic diagram of adaptive channel decoupling unit 500 according to another embodiment of the invention. Adaptive channel decoupling unit 500 comprises first voice activity detector (VAD1) 511, first adaptive filter 501, second voice activity detector (VAD2) 512 and second adaptive filter 502. First voice activity detector 511 receives first calibration signal X1 and first directional microphone signal d1 to generate first voice activity signal V1 for indicating the presence of desired voice. First adaptive filter 501 receives first calibration signal X1, first directional microphone signal d1 and first voice activity signal V1 and suppresses the desired voice of first directional microphone signal d1 to generate first reference channel signal r1. Second voice activity detector 512 receives first voice activity signal V1, first reference channel signal r1 and first calibration signal X1 to generate second voice activity signal V2 for indicating the presence of noise or interference. Second adaptive filter 502 receives second voice activity signal V2, first calibration signal X1, and first reference channel signal r1 and suppresses noise of first calibration signal X1 to generate first main channel signal m1.
FIG. 6 is a schematic diagram of adaptive channel decoupling unit 600 according to another embodiment of the invention. The difference between adaptive channel decoupling units 600 and 500 is the presence of adder 620. Adaptive channel decoupling unit 600 comprises adder 620, first voice activity detector (VAD1) 611, first adaptive filter 601, second voice activity detector (VAD2) 612 and second adaptive filter 602. Adder adds first calibration signal X1 and second calibration signal X2 to output third calibration signal X3. First voice activity detector 611 receives third calibration signal X3 and first directional microphone signal d1 to generate first voice activity signal V1 for indicating the presence of desired voice. First adaptive filter 601 receives third calibration signal X3, first directional microphone signal d1, and first voice activity signal V1 and suppresses the desired voice of first directional microphone signal d1 to generate first reference channel signal r1. Second voice activity detector 612 receives first voice activity signal V1, first reference channel signal r1 and third calibration signal X3 to generate second voice activity signal V2 for indicating the presence of noise or interference. Second adaptive filter 602 receives second voice activity signal V2, third calibration signal X3 and first reference channel signal r1 and suppresses noise of third calibration signal X3 to generate first main channel signal m1.
FIG. 7 is a schematic diagram of adaptive channel decoupling unit 700 according to another embodiment of the invention. Adaptive channel decoupling unit 700 comprises first voice activity detector (VAD1) 711, first adaptive filter 701, second voice activity detector (VAD2) 702, second adaptive filter 702, third adaptive filter 703 and selection criteria unit 721. First voice activity detector 711 receives first calibration signal X1 and first directional microphone signal d1 to generate first voice activity signal for indicating the presence of desired voice. First adaptive filter 701 receives first calibration signal X1, first directional microphone signal d1 and first voice activity signal V1 and suppresses the desired voice of first directional microphone signal d1 to generate first reference channel signal r1. Second voice activity detector 712 receives first reference signal r1 and first calibration signal X1 to generate second voice activity signal V2 for indicating the presence of noise or interference. Second adaptive filter 702 receives second voice activity signal V2, first calibration signal X1 and first reference channel signal r1 and suppresses one side (right side, one lobe of bidirectional polar pattern) noise of first calibration signal X1 to generate first adaptive filter signal Xn1. Third adaptive filter 703 receives second voice activity signal V2, first calibration signal X1 and first reference channel signal r1 and suppresses another side (left side, another lobe of bidirectional polar pattern) noise of first calibration signal X1 to generate second adaptive filter signal Xn2. Selection criteria unit 721 does a selection from first adaptive filter signal Xn1 and second adaptive filter signal Xn2 to output first main channel signal m1 according to first calibration signal X1. For example, m1=a*Xn1+b*Xn2.
Referring to FIG. 1, a transformer, such as Fast Fourier Transformer, 150 transforms first main channel signal m1 and first reference channel signal from time domain to frequency domain to correspondingly output first main signal m1 and first reference signal R1. First noise suppression unit 160 comprises noise estimating unit 162 and noise suppression unit 164. Noise estimating unit 162 generate ambient noise signal N1 by estimating noise of first reference signal R1. Noise suppression unit receives ambient noise signal N1, suppresses low frequency internal noise caused by forming the bidirectional microphone and generates first ambient noise signal N1′. Second noise suppression unit 170 comprises entire estimating unit 172, frequency domain voice activity detector 171 and noise suppression unit 174. Entire noise estimating unit 172 generates entire ambient noise signal N2 by estimating entire noise from first main signal M1 and first ambient noise signal N1′. Frequency domain voice activity detector 171 receives first main signal M1 and entire ambient noise signal N2 to generate third voice activity signal V3 for indicating noise. Noise suppression unit 174 receives entire ambient noise signal N2, first main signal M1 and third voice activity signal V3 to generate first clean voice signal M0 with ambient noise suppression. Inverse transformer, such as Inverse Fast Fourier Transformer, 180 transforms first main signal from frequency domain to time domain to generate second clear voice signal m0.
FIG. 8 is a schematic diagram of broadside small array microphone beamforming apparatus 800 according to another embodiment of the invention. Broadside small array microphone beamforming apparatus 800 comprises omni-directional microphones Mic11 and Mic12, microphone calibration unit 810, directional microphone forming unit 820, adaptive channel decoupling unit 840, transformer 850, noise suppression units 860 and 870 and inverse transformer 880. Omni-directional microphones Mic3 and Mic2 respectively convert voice from desired near-end talker 801 into first and second signals S1 and S2. Second and first omni-directional microphones Mic12 and Mic11 and desired near-end talker 801 are respectively arranged at three points of a triangle, referred to as a broadside way, as shown in FIG. 8. Microphone calibration unit 810 receives first and second signals S1 and S2 and correspondingly outputs first and second calibration signals X1 and X2. Directional microphone forming unit 120 receives first and second calibration signals X1 and X2 to generate first directional microphone signal d1 with one side polar pattern and second directional microphone signal d2 with another side lobe polar pattern, as shown in FIG. 3. Adaptive channel decoupling unit 840 receives first calibration signal X1, first directional microphone signal d1 and second directional microphone signal d2 to generate first main channel signal m1 and first reference channel signal r1 for noise detection. In another embodiment of the invention, adaptive channel decoupling unit 840 receives the sum of the first calibration signal X1 and the second calibration signal X2 and receives first directional microphone signal d1 and second directional microphone signal d2 to generate first main channel signal m1 and first reference channel signal r1 for noise detection.
As shown in FIG. 8, first and second omni-directional microphones Mic11 and Mic12 form two directional microphones with single lobe polar patterns for noise detection, and one of the first and second omni-directional microphones is used as a main microphone. Desired near-end talker 801 faces a cross point or a point of equal gains of two single polar pattern.
FIG. 9 is a schematic diagram of directional microphone forming unit 820 according to another embodiment of the invention. Directional microphone forming unit 820 comprises phase adjustment units 901, 902, 903 and 904 and subtractors 907 and 908. Phase adjustment unit 901 shifts first calibration signal X1 phase P10 to generate first shifted signal XP10. Phase adjustment unit 902 shifts second calibration signal X2 phase P20 to generate second shifted signal XP20. Phase adjustment unit 911 shifts first calibration signal X1 phase P11 to generate third shifted signal XP 11. Phase adjustment unit 912 shifts second calibration signal X2 phase P21 to generate fourth shifted signal XP21. Subtractor 907 subtracts second shifted signal XP20 from first shifted signal XP10 to generate first directional microphone signal d1 with one side single polar pattern 301, as shown in FIG. 3. Subtractor 908 subtracts fourth shifted signal XP21 from third shifted signal XP11 to generate second directional microphone signal d2 with another side single polar pattern 302, as shown in FIG. 3.
According to an embodiment of the invention, phases P10 and P21 are zero and Phases P20 and P11 are T (the delay for sound propagation between two microphones). Thus, omni-directional microphones Mic11 and Mic12 can form a first directional microphone with a single lobe polar pattern and second directional microphone with another single lobe polar pattern.
FIG. 10 is a schematic diagram of adaptive channel decoupling unit 1000 according to another embodiment of the invention. Adaptive channel decoupling unit 1000 comprises voice activity detectors 1011, 1012, 1013 and 1014 and adaptive filter 1001, 1002, 1003 and 1004. First Voice activity detector (VAD1) 1011 receives first calibration X1 and first directional microphone signal d1 to generate first voice activity signal V1 for indicating desired voice. First adaptive filter 1001 receives first calibration signal X1, first directional microphone signal d1 and first voice activity signal V1, and suppresses the desired voice of first directional microphone signal d1 to generate reference channel signal r1′. Second voice activity detector (VAD2) 1012 receives first voice activity signal V1, reference channel signal r1′ and first calibration signal X1 to generate second voice activity signal V2 for indicating noise or interference. Second adaptive filter 1002 receives second voice activity signal V2, first calibration signal X1 and reference channel signal r1′ and suppresses noise of first calibration signal X1 to generate main channel signal m1′. Third voice activity detector (VAD3) 1013 receives reference channel signal r1′ and second directional microphone signal d2 to generate third voice activity signal V3 for indicating the desired voice. Third adaptive filter 1003 receives reference channel signal r1′, second directional microphone signal d2 and third voice activity signal V3, and suppresses the desired voice of second directional microphone d2 to generate first reference channel signal r1. Fourth voice activity detector (VAD4) 1014 receives third voice activity signal V3, first reference channel signal r1 and main channel signal m1′ to generate fourth voice activity signal V4 for indicating noise of interference. Fourth adaptive filter 1004 receives fourth voice activity signal V4, main channel signal m1′, first reference channel signal r1 and suppresses noise of main channel signal m1′ to generate first main channel signal m1.
FIG. 11 is a schematic diagram of adaptive channel decoupling unit 1100 according to another embodiment of the invention. The difference between adaptive channel decoupling units 1000 and 1100 is adder 1101. Adder 1101 adds first calibration signal X1 and second calibration signal X2 to output calibration signal X0. Since the operation of adaptive channel decoupling unit 1100 in FIG. 11 is similar to the operation of adaptive channel decoupling unit 1000 in FIG. 10, it is not detailed here.
Referring to FIG. 1, first noise suppression unit 860 comprises noise estimating unit 862 and noise suppression unit 864 and second noise suppression unit 870 comprises entire estimating unit 872, frequency domain voice activity detector 871 and noise suppression unit 874. The operation of transformer 850, noise suppression units 860 and 870 and inverse transformer 880 is the same as that of transformer 150, noise suppression units 160 and 170 and inverse transformer 180. Thus, it is not detailed here.
While the invention has been described by way of example and in terms of preferred embodiment, it is to be understood that the invention is not limited thereto. To the contrary, it is intended to cover various modifications and similar arrangements (as would be apparent to those skilled in the art). Therefore, the scope of the appended claims should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements.