Linear differential directional microphone array

Information

  • Patent Grant
  • 11902755
  • Patent Number
    11,902,755
  • Date Filed
    Tuesday, November 12, 2019
  • Date Issued
    Tuesday, February 13, 2024
Abstract
Apparatus and method provided herein are directed to a linear differential directional microphone array (LDDMA), which takes into account the directionality of the array elements. The LDDMA may be designed by generating a steering vector for a linear array (LA) having preselected parameters including parameters δ, p, θ, N, and M, generating a constraint matrix based on the steering vector, reformulating the constraint matrix based on a microphone response matrix and a steering matrix, obtaining a beamformer by applying a minimum norm solution in terms of the constraint matrix, verifying a desired characteristic of the LA by calculating the beamformer for a desired direction, and constructing the LA based on the preselected parameters and the beamformer.
Description

This Application claims priority to and is a national stage of International Application No. PCT/CN2019/117371, filed Nov. 12, 2019, which is incorporated herein by reference.


BACKGROUND

Speech enhancement technology is an indispensable part of many far-field sound capturing devices used in adverse environments. Both shotgun microphones (usually a super-cardioid capsule with a long, hollow, slotted interference tube) and microphone arrays are capable of attenuating ambient noise or interference due to their high directionality. Shotgun microphones are commonly used in applications requiring low noise, such as camera-specific, conference-only, or interview-specific situations. Although shotgun microphones can pick up sound from a certain direction in a noisy environment, making the picked-up sound clearer and less noisy, they have fixed beamforming properties and are not tunable. Additionally, the cost associated with designing and producing such microphones is relatively high. In comparison, a microphone array with an appropriate signal processing algorithm can provide more flexible solutions.


Among microphone arrays, the differential microphone array (DMA) has been gaining attention recently. One type of DMA, the linear differential microphone array (LDMA), has been extensively studied; however, many of the published LDMA designs appear to assume the use of omni-directional microphones. Although a robust LDMA design can improve the white noise gain (WNG) with a minimum-norm solution by using more microphone elements than the order of the LDMA, the WNG may still be relatively low, especially at low frequencies, causing the well-known white noise amplification problem in practical implementations. Additionally, the directivity factor (DF) of a conventional LDMA usually degrades as the frequency increases, and the beampattern also tends to deform at high frequencies.





BRIEF DESCRIPTION OF THE DRAWINGS

The detailed description is set forth with reference to the accompanying figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The use of the same reference numbers in different figures indicates similar or identical items or features.



FIG. 1A illustrates an example diagram of a uniform linear array (ULA) of microphones with M directional microphones.



FIG. 1B illustrates an example diagram of a non-uniform linear array (NULA) of microphones with M directional microphones.



FIG. 2 illustrates an example linear differential directional microphone array (LDDMA).



FIGS. 3A and 3B illustrate beampatterns for the second order cardioid and the third order pattern, respectively, at 1 kHz.



FIGS. 4A and 4B illustrate beampatterns for the second order cardioid and the third order pattern, respectively, at 3 kHz.



FIGS. 5A and 5B illustrate beampatterns for the second order cardioid and the third order pattern, respectively, at 6 kHz.



FIGS. 6A and 6B illustrate the comparison in the white noise gain (WNG) and the directivity index (DI), respectively, for the 2nd-order LDDMA design with different types of microphones (p=1, p=0.5, p=0).



FIGS. 7A and 7B illustrate the comparison in the WNG and the DI, respectively, for the 3rd-order LDDMA design with different types of microphones (p=1, p=0.5, p=0).



FIG. 8 illustrates an example process for constructing an LDDMA.





DETAILED DESCRIPTION

A design method for a linear differential directional microphone array (LDDMA), which takes into account the directionality of the array elements, is provided. Some directional microphone elements have inherent unique properties that may be advantageous over omni-directional elements. The LDDMA may be implemented as a high-performance shotgun sound capturing device.


Omni-directional and directional microphone elements are commonly used in the industry. An omni-directional microphone picks up sound with an equal gain from all directions, while a directional microphone picks up sound predominantly from some specific direction(s). Mathematically, the beampattern of a directional microphone can be expressed as u(p, θ, α)=p+(1−p)cos(θ−α), where θ is the sound incident angle, α is the steering direction of the microphone element, and p defines the property of the directional microphone. For instance, p=1 gives an omni-directional pattern, p=0.5 gives the well-known cardioid beampattern, and p=0 gives a dipole. The directional microphones may be any type of directional microphones, including omni-directional, cardioid, and dipole microphones, and the like.
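The first-order element pattern above can be evaluated directly. The following is a minimal sketch; the function name `element_response`, the `ELEMENT_TYPES` table, and `response_table` are illustrative names, not part of the original text.

```python
import math


def element_response(p, theta, alpha=0.0):
    """First-order element beampattern u(p, theta, alpha) = p + (1 - p) cos(theta - alpha)."""
    return p + (1.0 - p) * math.cos(theta - alpha)


# Element types named in the text, keyed by their p value.
ELEMENT_TYPES = {"omni": 1.0, "cardioid": 0.5, "dipole": 0.0}


def response_table(angles_deg):
    """Return {element name: [u at each angle]} for angles given in degrees."""
    return {
        name: [element_response(p, math.radians(a)) for a in angles_deg]
        for name, p in ELEMENT_TYPES.items()
    }


if __name__ == "__main__":
    # Responses toward the front (0 deg), the side (90 deg) and the rear (180 deg).
    for name, row in response_table([0, 90, 180]).items():
        print(name, [round(v, 3) for v in row])
```

The cardioid has its null at the rear (θ=π) and the dipole at the side (θ=π/2), which is what gives each element its built-in directivity.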


Two approaches may be utilized to implement a directional microphone: a dedicated directional microphone using a single microphone cartridge with two sound inlets, and a two-omnidirectional-element system with some appropriate digital signal processing. The dedicated directional microphone approach is known to yield a much better directional microphone in terms of signal-to-noise ratio (SNR) than the two-omnidirectional-element system approach. This performance advantage of the dedicated directional microphone is mainly due to the signal processing that creates the directivity being performed acoustically with the front and rear sound inlets. This unique property of the dedicated directional microphone may be utilized to achieve a better performance than the conventional LDMA. The dedicated directional microphone may come in the form of either Electret Condenser Microphones (ECMs) or Micro-Electro-Mechanical System (MEMS) microphones.



FIG. 1A illustrates an example diagram 100 of a uniform linear array (ULA) 102 with M directional microphones 104 and FIG. 1B illustrates an example diagram 106 of a non-uniform linear array (NULA) 108 with M directional microphones 110.


For the ULA 102 in FIG. 1A, the inter-element spacing is denoted as δ and all the directional microphones 104, 1 to M, are pointed rightward, i.e., α=0 (which will be omitted in the following description for simplicity). For the NULA 108 in FIG. 1B, the inter-element spacings vary and are denoted as δ1 . . . δM relative to the first directional microphone 110. All the directional microphones 110, 1 to M, are also pointed rightward. If a plane wave 112 impinges on the array 102 with an incident angle of θ, the steering vector, d, is then given by:

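$$\mathbf{d}(\omega,\theta) = \left[p + (1-p)\cos\theta\right]\left[1 \;\; e^{-j\omega\delta\cos\theta/c} \;\; \cdots \;\; e^{-j\omega(M-1)\delta\cos\theta/c}\right]^{T}, \qquad (1)$$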

where the superscript T is the transpose operator, j=√(−1) is the imaginary unit, ω=2πf is the angular frequency, f is the temporal frequency, and c is the speed of sound. For comparison, the steering vector for a conventional ULA with omni-directional microphones may be expressed as:

$$\mathbf{a}(\omega,\theta) = \left[1 \;\; e^{-j\omega\delta\cos\theta/c} \;\; \cdots \;\; e^{-j\omega(M-1)\delta\cos\theta/c}\right]^{T}. \qquad (2)$$

By combining the equation of the beampattern of a directional microphone with the equation for a conventional ULA with omni-directional microphones (2), the steering vector, d (ω, θ), may be expressed as:

d(ω,θ)=u(p,θ)a(ω,θ)  (3)
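Equations (1) to (3) translate into a few lines of code. This is a sketch under stated assumptions: the speed of sound is taken as 343 m/s (the text only names the symbol c), all elements share the same p and point along the array axis (α=0), and the function names are illustrative.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for c, not given in the text


def omni_steering_vector(freq_hz, theta, num_mics, spacing_m):
    """a(omega, theta) of equation (2) for a uniform linear array."""
    omega = 2.0 * math.pi * freq_hz
    # Delay between adjacent elements for a plane wave arriving from angle theta.
    delay = spacing_m * math.cos(theta) / SPEED_OF_SOUND
    return [cmath.exp(-1j * omega * m * delay) for m in range(num_mics)]


def directional_steering_vector(freq_hz, theta, p, num_mics, spacing_m):
    """d(omega, theta) = u(p, theta) a(omega, theta), equations (1) and (3)."""
    u = p + (1.0 - p) * math.cos(theta)  # element response with alpha = 0
    return [u * a_m for a_m in omni_steering_vector(freq_hz, theta, num_mics, spacing_m)]


if __name__ == "__main__":
    # Six cardioid elements spaced 1 cm apart, 1 kHz plane wave from 60 degrees.
    d = directional_steering_vector(1000.0, math.radians(60.0), 0.5, 6, 0.01)
    print([f"{abs(x):.3f}" for x in d])
```

Because u(p, θ) is a real scalar common to all elements, it scales the magnitude of every entry equally and leaves the inter-element phase differences unchanged.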


The beamforming problem may be interpreted as a spatial filter to estimate the signal from the desired look direction and suppress the signal from the undesired direction, by applying a complex weight vector:

$$\mathbf{h}(\omega) = \left[H_1(\omega) \;\; H_2(\omega) \;\; \cdots \;\; H_M(\omega)\right]^{T}. \qquad (4)$$


Given the signal model, in the desired look direction θ=0, the beamformer exhibits a distortionless response, i.e., d^H(ω, 0)h(ω)=1, where the superscript H is the conjugate-transpose operator. In other directions, the beamformer shows a certain distortion on the response, i.e., d^H(ω, θ)h(ω)<1.


The mathematical definitions of three widely used performance measures, i.e., white noise gain (WNG), beampattern, and directivity factor (DF), are provided as follows. WNG shows the ability of a beamformer to suppress spatially uncorrelated noise, and is also the most convenient way to evaluate the sensitivity of a beamformer to some of its imperfections such as sensor noise, position errors, etc. WNG is defined as W[h(ω)] = 1/[h^H(ω)h(ω)]. A beampattern illustrates the directional sensitivity of a beamformer to a plane wave 112 impinging on the array 102 from the incident angle θ, as illustrated in FIG. 1A, and is mathematically defined as B[h(ω), θ] = d^H(ω, θ)h(ω). A frequency-invariant beampattern is usually preferred for broadband speech processing.
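The WNG and beampattern definitions can be exercised on a simple weight vector. The sketch below uses delay-and-sum weights h = a(ω, 0)/M with omni-directional elements purely as an illustration (it is not the LDDMA design described here); the 343 m/s speed of sound and all function names are assumptions.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for c


def steering_vector(freq_hz, theta, p, num_mics, spacing_m):
    """d(omega, theta) of equation (1) for identical elements with alpha = 0."""
    omega = 2.0 * math.pi * freq_hz
    u = p + (1.0 - p) * math.cos(theta)
    delay = spacing_m * math.cos(theta) / SPEED_OF_SOUND
    return [u * cmath.exp(-1j * omega * m * delay) for m in range(num_mics)]


def white_noise_gain(h):
    """W[h] = 1 / (h^H h), valid when the look direction is distortionless."""
    return 1.0 / sum(abs(w) ** 2 for w in h)


def beampattern(h, freq_hz, theta, p, spacing_m):
    """B[h, theta] = d^H(omega, theta) h."""
    d = steering_vector(freq_hz, theta, p, len(h), spacing_m)
    return sum(d_m.conjugate() * h_m for d_m, h_m in zip(d, h))


def delay_and_sum_weights(freq_hz, num_mics, spacing_m):
    """Illustrative omni weights h = a(omega, 0) / M, steered to theta = 0."""
    a = steering_vector(freq_hz, 0.0, 1.0, num_mics, spacing_m)
    return [a_m / num_mics for a_m in a]


if __name__ == "__main__":
    h = delay_and_sum_weights(1000.0, 6, 0.01)
    print("WNG (dB):", 10.0 * math.log10(white_noise_gain(h)))
    for deg in (0, 90, 180):
        b = beampattern(h, 1000.0, math.radians(deg), 1.0, 0.01)
        print(deg, "deg:", round(abs(b), 3))
```

For these weights h^H h = 1/M, so the WNG equals the number of microphones, which is the upper bound that differential designs trade away in exchange for directivity.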


Directivity factor (DF) is defined as the ratio between the array output response power in the desired steering direction and the power averaged over all directions, i.e., DF is computed as

$$\mathrm{DF}[\mathbf{h}(\omega)] = \frac{1}{\int_{0}^{\pi} d\phi \int_{0}^{2\pi} d\theta \,\sin\phi\, |B(\omega,\phi,\theta)|^{2}},$$

where |B(ω, ϕ, θ)| is the beampattern in the spherical coordinate system, θ is the azimuth angle, and ϕ is the elevation angle. Directivity index (DI) is defined as DI[h(ω)] = 10 log10(DF[h(ω)]).
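A numerical sketch of the DF and DI follows. It rests on two assumptions that go beyond the text: the beampattern is taken to be rotationally symmetric about the array axis (so it depends on a single angle measured from that axis), and the spherical integral is normalized by 1/(4π) so that the denominator is a true average over all directions and an omni-directional pattern yields DF = 1. Function names are illustrative.

```python
import math


def directivity_factor(pattern, steps=2000):
    """DF of an axisymmetric pattern B(angle), normalized so that B(0) = 1.

    With axial symmetry, (1 / 4 pi) times the spherical integral of |B|^2
    reduces to (1 / 2) * integral over [0, pi] of |B(t)|^2 sin(t) dt.
    The integral is evaluated with the midpoint rule.
    """
    dt = math.pi / steps
    integral = 0.0
    for i in range(steps):
        t = (i + 0.5) * dt
        integral += abs(pattern(t)) ** 2 * math.sin(t) * dt
    return 2.0 / integral


def directivity_index(pattern, steps=2000):
    """DI = 10 log10(DF), in dB."""
    return 10.0 * math.log10(directivity_factor(pattern, steps))


if __name__ == "__main__":
    patterns = {
        "omni": lambda t: 1.0,
        "cardioid": lambda t: 0.5 + 0.5 * math.cos(t),
        "dipole": lambda t: math.cos(t),
    }
    for name, b in patterns.items():
        print(name, round(directivity_factor(b), 3), round(directivity_index(b), 2), "dB")
```

Under these assumptions a single first-order cardioid or dipole element already reaches DF = 3 (about 4.8 dB), which is consistent with directional elements contributing directivity before any array processing is applied.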


To design an Nth-order differential beamformer for a ULA with directional microphones, the problem may be formulated as the system of linear equations shown below.

R(ω,θ)h(ω)=c,  (5)

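where the constraint matrix R(ω, θ) of size (N+1)×M is given by:

$$\mathbf{R}(\omega,\boldsymbol{\theta}) = \begin{bmatrix} \mathbf{d}^{H}(\omega,0) \\ \mathbf{d}^{H}(\omega,\theta_1) \\ \vdots \\ \mathbf{d}^{H}(\omega,\theta_N) \end{bmatrix}, \qquad (6)$$

where d^H(ω, θ_n), n=1, 2, . . . , N, is the conjugate transpose of the steering vector of length M defined in the equation (1), and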

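$$\boldsymbol{\theta} = \left[0 \;\; \theta_1 \;\; \cdots \;\; \theta_N\right]^{T}, \qquad (7)$$
$$\mathbf{c} = \left[1 \;\; c_1 \;\; \cdots \;\; c_N\right]^{T}, \qquad (8)$$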

are vectors of size (N+1) containing the design parameters of the beamformer. θ (bold letter face) indicates a null-position constraint vector as defined in the equation (7), θ1 . . . θN usually define the desired null directions, and c1 . . . cN are the corresponding responses for these directions, i.e., 0 for a null or a small value if some attenuation is desired.


Combining the equations (3) and (6) yields:

R(ω,θ)=U(p,θ)A(ω,θ),  (9)

where a steering matrix A(ω, θ) is constructed based on the steering vectors a(ω, θ) as shown below:











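$$\mathbf{A}(\omega,\boldsymbol{\theta}) = \begin{bmatrix} \mathbf{a}^{H}(\omega,0) \\ \mathbf{a}^{H}(\omega,\theta_1) \\ \vdots \\ \mathbf{a}^{H}(\omega,\theta_N) \end{bmatrix}, \qquad (10)$$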








and U(p, θ) is called a microphone response matrix and expressed as a diagonal matrix:

$$\mathbf{U}(p,\boldsymbol{\theta}) = \mathrm{diag}\bigl(1,\; u(p,\theta_1),\; \ldots,\; u(p,\theta_N)\bigr). \qquad (11)$$
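The factorization in equation (9) can be checked row by row. The short derivation below assumes only that u(p, θ) is real-valued, which holds for the first-order pattern p + (1 − p) cos θ with real p; the index θ_0 = 0 for the look direction is notation introduced here for compactness.

```latex
\begin{aligned}
\mathbf{d}^{H}(\omega,\theta_n)
  &= \bigl[u(p,\theta_n)\,\mathbf{a}(\omega,\theta_n)\bigr]^{H}
   = u(p,\theta_n)\,\mathbf{a}^{H}(\omega,\theta_n),
   \qquad n = 0, 1, \ldots, N, \quad \theta_0 = 0, \\[4pt]
\mathbf{R}(\omega,\boldsymbol{\theta})
  &= \begin{bmatrix}
       u(p,0)\,\mathbf{a}^{H}(\omega,0) \\
       u(p,\theta_1)\,\mathbf{a}^{H}(\omega,\theta_1) \\
       \vdots \\
       u(p,\theta_N)\,\mathbf{a}^{H}(\omega,\theta_N)
     \end{bmatrix}
   = \mathrm{diag}\bigl(u(p,0),\, u(p,\theta_1),\, \ldots,\, u(p,\theta_N)\bigr)\,
     \mathbf{A}(\omega,\boldsymbol{\theta}), \\[4pt]
u(p,0) &= p + (1-p)\cos 0 = 1 .
\end{aligned}
```

The last line explains why the first diagonal entry of the microphone response matrix is 1 for every element type.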


To maximize the WNG of the array 102 and solve the linear system of the equation (5), a minimum-norm solution may be utilized to obtain an LDDMA beamformer as:

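$$\mathbf{h}(\omega) = \mathbf{R}^{H}(\omega,\boldsymbol{\theta})\left[\mathbf{R}(\omega,\boldsymbol{\theta})\,\mathbf{R}^{H}(\omega,\boldsymbol{\theta})\right]^{-1}\mathbf{c}, \qquad (12)$$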

where the LDDMA beamformer with the minimum-norm solution may be recognized as the same form as that of the LDMA.


The difference is reflected in R(ω, θ) which consists of the conventional far-field steering vectors for omnidirectional microphones and the proposed directional microphone response vectors, as shown in the equation (9). Combining the equations (9) and (12), the LDDMA beamformer may be reformulated as:

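$$\mathbf{h}(\omega) = \mathbf{A}^{H}(\omega,\boldsymbol{\theta})\,\mathbf{U}^{H}(p,\boldsymbol{\theta})\left[\mathbf{U}(p,\boldsymbol{\theta})\,\mathbf{A}(\omega,\boldsymbol{\theta})\,\mathbf{A}^{H}(\omega,\boldsymbol{\theta})\,\mathbf{U}^{H}(p,\boldsymbol{\theta})\right]^{-1}\mathbf{c}. \qquad (13)$$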


This equation neatly shows the relationship between the solutions of a conventional LDMA and the proposed LDDMA, which extends the LDMA by introducing another degree of freedom, U(p, θ). In other words, the LDMA is a special case of the LDDMA when the microphone response matrix U(p, θ) is reduced to an identity matrix when p=1 for all microphones in the equation (11), i.e., the LDDMA may be used as a more general framework to design an LDMA.
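The minimum-norm solution of equation (12) can be sketched with the standard library alone. Assumptions that are not from the text: c = 343 m/s, identical elements with α=0, a small hand-written Gauss-Jordan solver in place of a linear algebra library, and illustrative function names. The sketch also assumes R R^H is invertible, which requires every u(p, θ_n) to be nonzero, so the example avoids constraint angles that coincide with an element's own null.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for c


def steering_vector(freq_hz, theta, p, num_mics, spacing_m):
    """d(omega, theta) of equation (1) for identical elements with alpha = 0."""
    omega = 2.0 * math.pi * freq_hz
    u = p + (1.0 - p) * math.cos(theta)
    delay = spacing_m * math.cos(theta) / SPEED_OF_SOUND
    return [u * cmath.exp(-1j * omega * m * delay) for m in range(num_mics)]


def solve(matrix, rhs):
    """Solve a small square complex system by Gauss-Jordan elimination
    with partial pivoting. Fails with ZeroDivisionError if singular."""
    n = len(rhs)
    aug = [list(row) + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def min_norm_beamformer(freq_hz, angles, responses, p, num_mics, spacing_m):
    """h = R^H (R R^H)^{-1} c, with the rows of R equal to d^H(omega, theta_n)."""
    rows = [
        [x.conjugate() for x in steering_vector(freq_hz, th, p, num_mics, spacing_m)]
        for th in angles
    ]
    # Gram matrix R R^H of size (N+1) x (N+1).
    gram = [
        [sum(a * b.conjugate() for a, b in zip(r_i, r_j)) for r_j in rows]
        for r_i in rows
    ]
    y = solve(gram, responses)
    # h = R^H y: conjugating the rows of R recovers the steering vectors.
    return [
        sum(rows[n][m].conjugate() * y[n] for n in range(len(rows)))
        for m in range(num_mics)
    ]


if __name__ == "__main__":
    angles = [0.0, math.pi / 2, math.pi]
    h = min_norm_beamformer(1000.0, angles, [1.0, 0.0, 0.0], 1.0, 6, 0.01)
    wng_db = 10.0 * math.log10(1.0 / sum(abs(w) ** 2 for w in h))
    print("omni second-order WNG at 1 kHz (dB):", round(wng_db, 2))
```

Solving the (N+1)×(N+1) system for R R^H rather than working with the M×M matrix keeps the linear algebra small, since the design order N is typically much lower than the number of microphones M.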



FIG. 2 illustrates an example LDDMA 200. In this example, the LDDMA 200 is shown to have six microphones, 202, 204, 206, 208, 210, and 212 (M=6), linearly disposed with an inter-element spacing of 1 cm (δ=1 cm). The types of microphones that may be used are omni-directional (p=1), cardioid (p=0.5), and dipole (p=0). The plane wave 214 is shown to have an incident angle of θ.


To evaluate the effects of different types of directional microphones, i.e., p, on the performance of an LDDMA beamformer, three types of commonly used microphone elements, omnidirectional (p=1), cardioid (p=0.5), and dipole (p=0), are used to form a ULA with the array configuration of δ=1 cm and M=6. The comparison of their beampatterns at frequencies of 1 kHz, 3 kHz and 6 kHz is illustrated for two designs, i.e., a second-order cardioid with

$$\boldsymbol{\theta} = \left[0 \;\; \tfrac{\pi}{2} \;\; \pi\right]^{T}$$

and c=[1 0 0]^T, and a third-order pattern with

$$\boldsymbol{\theta} = \left[0 \;\; \tfrac{\pi}{2} \;\; \tfrac{2\pi}{3} \;\; \pi\right]^{T}$$

and c=[1 0 0 0]^T.
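As a worked example, the microphone response matrices of equation (11) for these two designs follow directly from u(p, θ) = p + (1 − p) cos θ. The subscripts "2nd" and "3rd" are labels introduced here for the two designs.

```latex
\begin{aligned}
&\text{Second order, } \boldsymbol{\theta} = \left[0 \;\; \tfrac{\pi}{2} \;\; \pi\right]^{T}: \\
&\qquad \mathbf{U}_{\text{2nd}}(1,\boldsymbol{\theta}) = \mathrm{diag}(1,\, 1,\, 1), \quad
        \mathbf{U}_{\text{2nd}}(0.5,\boldsymbol{\theta}) = \mathrm{diag}\bigl(1,\, \tfrac{1}{2},\, 0\bigr), \quad
        \mathbf{U}_{\text{2nd}}(0,\boldsymbol{\theta}) = \mathrm{diag}(1,\, 0,\, -1), \\[6pt]
&\text{Third order, } \boldsymbol{\theta} = \left[0 \;\; \tfrac{\pi}{2} \;\; \tfrac{2\pi}{3} \;\; \pi\right]^{T}: \\
&\qquad \mathbf{U}_{\text{3rd}}(1,\boldsymbol{\theta}) = \mathrm{diag}(1,\, 1,\, 1,\, 1), \quad
        \mathbf{U}_{\text{3rd}}(0.5,\boldsymbol{\theta}) = \mathrm{diag}\bigl(1,\, \tfrac{1}{2},\, \tfrac{1}{4},\, 0\bigr), \quad
        \mathbf{U}_{\text{3rd}}(0,\boldsymbol{\theta}) = \mathrm{diag}\bigl(1,\, 0,\, -\tfrac{1}{2},\, -1\bigr).
\end{aligned}
```

A zero on the diagonal means the element itself already has a null in that direction, so the corresponding row of R(ω, θ) vanishes and the constraint c_n = 0 holds for any h(ω). In that case the inverse in the equation (12) would presumably have to be read as a pseudo-inverse, or be applied to the remaining rows only; the text does not spell out this detail.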



FIGS. 3A and 3B illustrate beampatterns for the second-order cardioid 302 and the third-order pattern 304, respectively, at 1 kHz, and FIGS. 4A and 4B illustrate beampatterns for the second-order cardioid 402 and the third-order pattern 404, respectively, at 3 kHz. As can be observed, the LDDMA beamformers at the low frequencies, 1 kHz and 3 kHz, match the desired beampattern well. However, as shown in FIGS. 5A and 5B, which illustrate beampatterns for the second-order cardioid 502 and the third-order pattern 504, respectively, at 6 kHz, the beampatterns deviate from the desired beampattern at the higher frequency, i.e., 6 kHz. The LDDMA with cardioid elements (p=0.5) obtains the most sidelobe attenuation for the 2nd-order design, whereas the LDDMA with dipole microphones (p=0) has the most sidelobe attenuation for the 3rd-order design. It is noted that the LDDMA with p=1 becomes the conventional LDMA with omni-directional microphones.



FIGS. 6A and 6B illustrate the comparison in the WNG 602 and the DI 604, respectively, for the 2nd-order LDDMA design with different types of microphones (p=1, p=0.5, p=0) and FIGS. 7A and 7B illustrate the comparison in the WNG 702 and the DI 704, respectively, for the 3rd-order LDDMA design with different types of microphones (p=1, p=0.5, p=0).


As shown in FIG. 6A, the 2nd-order LDDMA with directional microphones (p=0.5 and p=0) exhibits a significantly higher WNG than the conventional LDMA (omni-directional microphones, p=1) at low frequencies, about 20 dB higher at frequencies below 400 Hz. In FIG. 6B, the 2nd-order LDDMA shows an identical DI for both types of directional microphones (p=0.5 and p=0), which is higher than the DI of the conventional LDMA at high frequencies above about 3 kHz.


As shown in FIG. 7A, the third-order LDDMA design with dipole microphones (p=0) obtains the best WNG at low frequencies, while the conventional LDMA (p=1) exhibits the worst. As shown in FIG. 7B, the LDDMA having either type of directional microphone (p=0.5 for cardioid and p=0 for dipole) yields a better DI than the one having omni-directional microphones (p=1) at high frequencies, above about 5.5 kHz, while the type of microphone makes little difference, less than about 0.5 dB in the DI, at low frequencies, below about 5.5 kHz.


Thus, the WNG and DI for the 3rd-order design behave similarly to those for the 2nd-order design; that is, given the same constraints, directional microphones are better suited than omni-directional microphones, in terms of WNG and DI performance, when constructing an LDMA.



FIG. 8 illustrates an example process 800 for constructing an LDDMA. The LDDMA may include uniform and non-uniform LDDMA.


At block 802, a steering vector d(ω, θ) for a proposed apparatus, an LDDMA, may be generated. That is, some desired parameters of the LDDMA, including parameters δ, p, θ, N, and M, may be preselected for generating the steering vector d(ω, θ). At block 804, a proposed constraint matrix R(ω, θ) may be generated based on the steering vector d(ω, θ). The constraint matrix R(ω, θ) may be reformulated, such as shown in the equation (9), based on a steering matrix and a microphone response matrix, such as the equations (10) and (11), respectively, and may be a matrix of size (N+1)×M, where N is the order of differential beamforming for the ULA and M is the number of microphones. The microphone response matrix may be derived based on a beampattern of a directional microphone with a sound incident angle θ, a steering direction α, and a property p of the directional microphone as described above. For example, p=1 indicates omni-directional microphones, p=0.5 indicates cardioid microphones, and p=0 indicates dipole microphones. Although omni-directional, cardioid, and dipole microphones are described, the directional microphones may be any type of directional microphones.


Based on a minimum-norm solution, such as the equation (12) for maximizing the white noise gain (WNG), an LDDMA beamformer, such as h(ω) of the equation (13), may be obtained at block 806. As can be seen, the beamformer h(ω) consists of frequency-dependent complex-valued weights.


At block 808, the LDDMA beamformer for a desired direction at a desired frequency may be calculated and stored in memory, and time domain frame-by-frame sensor signals through the LDDMA may be obtained at block 810. At block 812, all the time domain sensor signals may be transformed into the frequency domain sensor values. For each frame, the real value of signals in time domain will become a complex value in the frequency domain. The transformation method used may be short-time Fourier transform (STFT), filter-banks, wavelet transform, and the like. In the frequency domain, the LDDMA beamformer complex value weights may be loaded in a vector form (LDDMA beamformer vector) and a dot product of the frequency domain sensor signal complex values and the LDDMA beamformer vector may be obtained at block 814. Then the result of the dot product is a single complex value in the frequency domain, which may be transformed into a real value in the time domain signal by a corresponding inverse transform function.
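Blocks 810 to 814 amount to frequency-domain filter-and-sum beamforming. The sketch below processes a single frame under several simplifying assumptions that are not from the text: a naive DFT stands in for the STFT (no window or overlap-add), the "dot product" is taken as Y(k) = h^H(k) x(k), and the weights are supplied only for bins 0 to K/2, with the real output rebuilt by conjugate symmetry. All function names are illustrative.

```python
import cmath
import math


def rdft(frame):
    """DFT bins 0..K/2 of a real frame (naive O(K^2); an FFT would be used in practice)."""
    size = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * n / size) for n, x in enumerate(frame))
        for k in range(size // 2 + 1)
    ]


def irdft(spectrum, size):
    """Inverse of rdft: rebuild a real frame of length `size` via conjugate symmetry."""
    frame = []
    for n in range(size):
        acc = spectrum[0].real
        for k in range(1, len(spectrum)):
            term = (spectrum[k] * cmath.exp(2j * math.pi * k * n / size)).real
            # The Nyquist bin (even sizes only) has no mirrored partner.
            is_nyquist = size % 2 == 0 and k == size // 2
            acc += term if is_nyquist else 2.0 * term
        frame.append(acc / size)
    return frame


def beamform_frame(mic_frames, weights):
    """Apply per-bin weights to one frame from each microphone.

    mic_frames: list of M real-valued frames of equal length K.
    weights:    weights[k][m] is the complex weight of microphone m at bin k,
                for k = 0..K/2.
    """
    size = len(mic_frames[0])
    spectra = [rdft(frame) for frame in mic_frames]
    output_spectrum = [
        # Assumed convention: Y(k) = h^H(k) x(k).
        sum(weights[k][m].conjugate() * spectra[m][k] for m in range(len(mic_frames)))
        for k in range(size // 2 + 1)
    ]
    return irdft(output_spectrum, size)


if __name__ == "__main__":
    frame = [0.0, 1.0, 0.5, -0.25, -1.0, 0.3, 0.8, -0.6]
    # Two microphones carrying the same signal, averaged with equal weights.
    weights = [[0.5, 0.5] for _ in range(len(frame) // 2 + 1)]
    print([round(v, 3) for v in beamform_frame([frame, frame], weights)])
```

In a full implementation the weights for each bin would come from evaluating the beamformer of the equation (13) at that bin's frequency, and successive frames would be windowed and overlap-added.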


As discussed above, different types of directional microphones used to form a ULA, for example, omnidirectional (p=1), cardioid (p=0.5), and dipole (p=0), may be evaluated with different array configurations having various inter-element spacings δ and numbers of elements M, at different frequencies and for different order patterns, to assess their effects on the performance of the LDDMA beamformer, as illustrated in FIGS. 3A, 3B, 4A, 4B, 5A, 5B, 6A, 6B, 7A, and 7B. An actual LDDMA may then be constructed based on the beampattern selected from the evaluated beampatterns and the associated parameters δ, p, θ, N, and M.


Some or all operations of the methods described above can be performed by execution of computer-readable instructions stored on a computer-readable storage medium, as defined below. The term “computer-readable instructions,” as used in the description and claims, includes routines, applications, application modules, program modules, programs, components, data structures, algorithms, and the like. Computer-readable instructions can be implemented on various system configurations, including single-processor or multiprocessor systems, minicomputers, mainframe computers, personal computers, hand-held computing devices, microprocessor-based or programmable consumer electronics, combinations thereof, and the like.


The computer-readable storage media may include volatile memory (such as random-access memory (RAM)) and/or non-volatile memory (such as read-only memory (ROM), flash memory, etc.). The computer-readable storage media may also include additional removable storage and/or non-removable storage including, but not limited to, flash memory, magnetic storage, optical storage, and/or tape storage that may provide non-volatile storage of computer-readable instructions, data structures, program modules, and the like.


A non-transient computer-readable storage medium is an example of computer-readable media. Computer-readable media includes at least two types of computer-readable media, namely computer-readable storage media and communications media. Computer-readable storage media includes volatile and non-volatile, removable and non-removable media implemented in any process or technology for storage of information such as computer-readable instructions, data structures, program modules, or other data. Computer-readable storage media includes, but is not limited to, phase change memory (PRAM), static random-access memory (SRAM), dynamic random-access memory (DRAM), other types of random-access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disk read-only memory (CD-ROM), digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information for access by a computing device. In contrast, communication media may embody computer-readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave, or other transmission mechanism. As defined herein, computer-readable storage media do not include communication media.


The computer-readable instructions stored on one or more non-transitory computer-readable storage media, when executed by one or more processors, may perform operations described above with reference to FIG. 8. Generally, computer-readable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular abstract data types. The order in which the operations are described is not intended to be construed as a limitation, and any number of the described operations can be combined in any order and/or in parallel to implement the processes.


EXAMPLE CLAUSES

A. A method for constructing a linear array (LA) of microphones comprising: generating a steering vector for the LA having preselected parameters; generating a constraint matrix based on the steering vector; reformulating the constraint matrix based on a microphone response matrix and a steering matrix; obtaining a beamformer by applying a minimum norm solution in terms of the constraint matrix; verifying a desired characteristic of the LA by calculating the beamformer for a desired direction; and constructing the LA based on the preselected parameters and the beamformer.


B. The method as paragraph A recites, wherein the microphones of the LA are directional microphones and the LA is a linear differential directional microphone array (LDDMA).


C. The method as paragraph B recites, wherein the LDDMA is one of a uniform LDDMA or a non-uniform LDDMA.


D. The method as paragraph A recites, wherein the constraint matrix is a matrix of a size (N+1)×M, where N is an order of differential beam forming for the LA and M is a number of microphones.


E. The method as paragraph A recites, wherein the microphone response matrix is derived based on a beampattern of a directional microphone with a sound incident angle, a steering direction, and property of the directional microphone.


F. The method as paragraph E recites, wherein the property of the directional microphone includes omni-directional, cardioid, and dipole.


G. The method as paragraph A recites, wherein obtaining the beamformer by applying the minimum norm solution in terms of the constraint matrix includes maximizing a white noise gain (WNG).


H. The method as paragraph A recites, wherein calculating the beamformer for the desired direction includes calculating the beamformer for the desired direction at a desired frequency.


I. The method as paragraph H recites, wherein calculating the beamformer for the desired direction is based on time domain frame-by-frame sensor signals received through the LA.


J. The method as paragraph I recites, further comprising: transforming all of the time domain frame-by-frame sensor signals into frequency domain sensor values.


K. The method as paragraph J recites, further comprising: calculating a dot product of the frequency domain sensor values and a beamformer vector associated with complex value weights of the beamformer.


L. The method as paragraph K recites, wherein constructing the LA based on the preselected parameters and the beamformer includes constructing the LA based on the dot product.


M. A linear array (LA) comprising: a desired number of microphones linearly disposed and spaced with desired inter-microphone distances, the desired number of microphones and the desired inter-microphone distances verified by: generating a steering vector for the LA having preselected parameters; generating a constraint matrix based on the steering vector; reformulating the constraint matrix based on a microphone response matrix and a steering matrix; obtaining a beamformer by applying a minimum norm solution in terms of the constraint matrix; verifying a desired characteristic of the LA by calculating the beamformer for a desired direction; and constructing the LA based on the preselected parameters and the beamformer.


N. The LA as paragraph M recites, wherein the microphones of the LA are directional microphones and the LA is a linear differential directional microphone array (LDDMA).


O. The LA as paragraph N recites, wherein the LDDMA is one of a uniform LDDMA or a non-uniform LDDMA.


P. The LA as paragraph M recites, wherein the constraint matrix is a matrix of a size (N+1)×M, where N is an order of differential beam forming for the LA and M is a number of microphones.


Q. The LA as paragraph M recites, wherein the microphone response matrix is derived based on a beampattern of a directional microphone with a sound incident angle, a steering direction, and property of the directional microphone.


R. The LA as paragraph Q recites, wherein the property of the directional microphone includes omni-directional, cardioid, and dipole.


S. The LA as paragraph M recites, wherein obtaining the beamformer by applying the minimum norm solution in terms of the constraint matrix includes maximizing a white noise gain (WNG).


T. The LA as paragraph M recites, wherein calculating the beamformer for the desired direction includes calculating the beamformer for the desired direction at a desired frequency.


U. The LA as paragraph T recites, wherein calculating the beamformer for the desired direction is based on time domain frame-by-frame sensor signals received through the LA.


V. The LA as paragraph U recites, further comprising: transforming all of the time domain frame-by-frame sensor signals into frequency domain sensor values.


W. The LA as paragraph V recites, further comprising: calculating a dot product of the frequency domain sensor values and a beamformer vector associated with complex value weights of the beamformer.


X. The LA as paragraph W recites, wherein constructing the LA based on the preselected parameters and the beamformer includes constructing the LA based on the dot product.


Y. A computer-readable storage medium storing computer-readable instructions executable by one or more processors, that when executed by the one or more processors, cause the one or more processors to perform operations comprising: generating a steering vector for the LA having preselected parameters; generating a constraint matrix based on the steering vector; reformulating the constraint matrix based on a microphone response matrix and a steering matrix; obtaining a beamformer by applying a minimum norm solution in terms of the constraint matrix; verifying a desired characteristic of the LA by calculating the beamformer for a desired direction; and constructing the LA based on the preselected parameters and the beamformer.


Z. The computer-readable storage medium as paragraph Y recites, wherein the microphones of the LA are directional microphones and the LA is a linear differential directional microphone array (LDDMA).


AA. The computer-readable storage medium as paragraph Z recites, wherein the LDDMA is one of a uniform LDDMA or a non-uniform LDDMA.


AB. The computer-readable storage medium as paragraph Y recites, wherein the constraint matrix is a matrix of a size (N+1)×M, where N is an order of differential beam forming for the LA and M is a number of microphones.


AC. The computer-readable storage medium as paragraph Y recites, wherein the microphone response matrix is derived based on a beampattern of a directional microphone with a sound incident angle, a steering direction, and property of the directional microphone.


AD. The computer-readable storage medium as paragraph AC recites, wherein the property of the directional microphone includes omni-directional, cardioid, and dipole.


AE. The computer-readable storage medium as paragraph Y recites, wherein obtaining the beamformer by applying the minimum norm solution in terms of the constraint matrix includes maximizing a white noise gain (WNG).


AF. The computer-readable storage medium as paragraph Y recites, wherein calculating the beamformer for the desired direction includes calculating the beamformer for the desired direction at a desired frequency.


AG. The computer-readable storage medium as paragraph AF recites, wherein calculating the beamformer for the desired direction is based on time domain frame-by-frame sensor signals received through the LA.


AH. The computer-readable storage medium as paragraph AG recites, further comprising: transforming all of the time domain frame-by-frame sensor signals into frequency domain sensor values.


AI. The computer-readable storage medium as paragraph AH recites, further comprising: calculating a dot product of the frequency domain sensor values and a beamformer vector associated with complex value weights of the beamformer.


AJ. The computer-readable storage medium as paragraph AI recites, wherein constructing the LA based on the preselected parameters and the beamformer includes constructing the LA based on the dot product.


CONCLUSION

Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described. Rather, the specific features and acts are disclosed as exemplary forms of implementing the claims.

Claims
  • 1. A method for constructing a linear array (LA) of microphones comprising: generating a steering vector for the LA having preselected parameters; generating a constraint matrix based on the steering vector; reformulating the constraint matrix based on a microphone response matrix and a steering matrix; obtaining a beamformer by applying a minimum norm solution in terms of the constraint matrix; verifying a desired characteristic of the LA by calculating the beamformer for a desired direction; and constructing the LA based on the preselected parameters and the beamformer.
  • 2. The method of claim 1, wherein the constraint matrix is a matrix of a size (N+1)×M, where N is an order of differential beamforming for the LA and M is a number of microphones.
  • 3. The method of claim 1, wherein the microphone response matrix is derived based on a beampattern of a directional microphone with a sound incident angle, a steering direction, and property of the directional microphone.
  • 4. The method of claim 1, wherein obtaining the beamformer by applying the minimum norm solution in terms of the constraint matrix includes maximizing a white noise gain (WNG).
  • 5. The method of claim 1, wherein calculating the beamformer for the desired direction includes calculating the beamformer for the desired direction for a desired frequency.
  • 6. The method of claim 5, wherein calculating the beamformer for the desired direction is based on time domain frame-by-frame sensor signals received through the LA.
  • 7. The method of claim 6, further comprising: transforming all of the time domain frame-by-frame sensor signals into frequency domain sensor values; and calculating a dot product of the frequency domain sensor values and a beamformer vector associated with complex value weights of the beamformer.
  • 8. The method of claim 7, wherein constructing the LA based on the preselected parameters and the beamformer includes constructing the LA based on the dot product.
  • 9. A linear array (LA) comprising: a desired number of microphones linearly disposed and spaced with desired inter-microphone distances, the desired number of microphones and the desired inter-microphone distances verified by: generating a steering vector for the LA having preselected parameters; generating a constraint matrix based on the steering vector; reformulating the constraint matrix based on a microphone response matrix and a steering matrix; obtaining a beamformer by applying a minimum norm solution in terms of the constraint matrix; verifying a desired characteristic of the LA by calculating the beamformer for a desired direction; and constructing the LA based on the preselected parameters and the beamformer.
  • 10. The LA of claim 9, wherein the microphones of the LA are directional microphones and the LA is a linear differential directional microphone array (LDDMA).
  • 11. The LA of claim 10, wherein the LDDMA is one of a uniform LDDMA or a non-uniform LDDMA.
  • 12. The LA of claim 9, wherein the constraint matrix is a matrix of a size (N+1)×M, where N is an order of differential beamforming for the LA and M is a number of microphones.
  • 13. The LA of claim 9, wherein the microphone response matrix is derived based on a beampattern of a directional microphone with a sound incident angle, a steering direction, and property of the directional microphone.
  • 14. The LA of claim 9, wherein obtaining the beamformer by applying the minimum norm solution in terms of the constraint matrix includes maximizing a white noise gain (WNG).
  • 15. The LA of claim 9, wherein calculating the beamformer for the desired direction includes calculating the beamformer for the desired direction for a desired frequency.
  • 16. A computer-readable storage medium storing computer-readable instructions executable by one or more processors, that when executed by the one or more processors, cause the one or more processors to perform operations comprising: generating a steering vector for a linear array (LA) having preselected parameters; generating a constraint matrix based on the steering vector; reformulating the constraint matrix based on a microphone response matrix and a steering matrix; obtaining a beamformer by applying a minimum norm solution in terms of the constraint matrix; verifying a desired characteristic of the LA by calculating the beamformer for a desired direction; and constructing the LA based on the preselected parameters and the beamformer.
  • 17. The computer-readable storage medium of claim 16, wherein the constraint matrix is a matrix of a size (N+1)×M, where N is an order of differential beamforming for the LA and M is a number of microphones.
  • 18. The computer-readable storage medium of claim 16, wherein the microphone response matrix is derived based on a beampattern of a directional microphone with a sound incident angle, a steering direction, and property of the directional microphone.
  • 19. The computer-readable storage medium of claim 16, wherein obtaining the beamformer by applying the minimum norm solution in terms of the constraint matrix includes maximizing a white noise gain (WNG).
  • 20. The computer-readable storage medium of claim 16, wherein calculating the beamformer for the desired direction includes calculating the beamformer for the desired direction for a desired frequency.
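
For illustration only, the constraint-matrix and minimum-norm steps recited in the claims above may be sketched as follows. The sketch assumes NumPy, a uniform array with spacing δ, a first-order directional microphone response of the form p + (1 − p)·cos θ, a speed of sound of 343 m/s, and one constraint row per constrained direction, giving a matrix of size (N+1)×M. The function names, the response model, the sign convention of the phase term, and the example parameter values are all assumptions made for this example and are not asserted to be the formulation of the claims.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed value

def mic_response(thetas, p):
    """Assumed first-order directional pattern: p + (1 - p) * cos(theta)."""
    return p + (1.0 - p) * np.cos(thetas)

def constraint_matrix(freq_hz, delta, p, thetas, num_mics):
    """Build an (N+1) x M matrix with one row per constrained direction.

    Each row is a plane-wave steering row for a uniform linear array,
    scaled by the assumed microphone response at that direction.
    """
    omega = 2.0 * np.pi * freq_hz
    thetas = np.asarray(thetas, dtype=float)
    mic_index = np.arange(num_mics)
    delays = np.outer(np.cos(thetas), mic_index) * delta / SPEED_OF_SOUND
    steering = np.exp(1j * omega * delays)            # shape (N+1, M)
    return mic_response(thetas, p)[:, None] * steering

def minimum_norm_beamformer(D, beta):
    """Smallest-norm h with D h = beta, i.e. h = D^H (D D^H)^{-1} beta.

    Requires D to have full row rank (M >= N+1).
    """
    DH = D.conj().T
    return DH @ np.linalg.solve(D @ DH, beta)

# Hypothetical second-order design: N = 2, M = 4, unit gain at 0 degrees,
# nulls at 90 and 180 degrees, evaluated at 1 kHz with 1 cm spacing.
thetas = np.deg2rad([0.0, 90.0, 180.0])
D = constraint_matrix(freq_hz=1000.0, delta=0.01, p=0.7, thetas=thetas, num_mics=4)
beta = np.array([1.0, 0.0, 0.0], dtype=complex)
h = minimum_norm_beamformer(D, beta)
white_noise_gain = 1.0 / np.real(np.vdot(h, h))       # under the unit-gain constraint
```

Because the minimum norm solution has the smallest norm among all weight vectors that satisfy the constraints, it yields the largest white noise gain (WNG) among them under the unit-gain constraint, which is consistent with the relationship between the minimum norm solution and the WNG recited above.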
PCT Information
Filing Document Filing Date Country Kind
PCT/CN2019/117371 11/12/2019 WO
Publishing Document Publishing Date Country Kind
WO2021/092740 5/20/2021 WO A
US Referenced Citations (15)
Number Name Date Kind
6904152 Moorer Jun 2005 B1
6914854 Heberley Jul 2005 B1
7254199 Desloge Aug 2007 B1
8194880 Avendano Jun 2012 B2
9202475 Elko et al. Dec 2015 B2
9521484 Kim Dec 2016 B2
9591404 Chhetri Mar 2017 B1
9749745 Chen Aug 2017 B2
9812116 Ushakov Nov 2017 B2
9930448 Chen Mar 2018 B1
9980042 Benattar et al. May 2018 B1
10356514 Elko Jul 2019 B2
11159879 Chen Oct 2021 B2
20120093344 Sun et al. Apr 2012 A1
20140270245 Elko Sep 2014 A1
Foreign Referenced Citations (4)
Number Date Country
101344582 Jan 2009 CN
105223544 Jan 2016 CN
109633527 Apr 2019 CN
110166098 Aug 2019 CN
Non-Patent Literature Citations (3)
Entry
International Search Report dated Aug. 10, 2020, from corresponding PCT Application No. PCT/CN2019/117371, 3 pages.
Written Opinion dated Aug. 10, 2020, from corresponding PCT Application No. PCT/CN2019/117371, 4 pages.
Written Opinion dated Aug. 10, 2020, from corresponding PCT Application No. PCT/CN2019/117371, 4 pages.
Related Publications (1)
Number Date Country
20220408183 A1 Dec 2022 US