This application is a National Stage application under 35 U.S.C. § 371 of International Application No. PCT/JP2020/028433, having an International Filing Date of Jul. 22, 2020, which claims priority to Japanese Application Serial No. 2019-145975, filed on Aug. 8, 2019, the disclosure of both of which are considered part of the disclosure of this application, and are incorporated in their entirety into this application.
The present invention relates to a speaker array, a signal processing apparatus, a signal processing method, and a signal processing program.
In public viewing and concerts, audio and music are played through a plurality of speakers installed at a screening venue. Sound reproduction that reproduces a sound field of a recording venue as it is using a large number of speakers has been used in recent years.
There is also an acoustic technology for generating a direction in which sound strongly propagates (a bright zone) and a direction in which sound hardly propagates (a dark zone) using a large number of speakers. There are applications such as personal audio that utilize this technology to allow audio content to be enjoyed while protecting privacy at home or in public facilities such as museums.
A speaker array control technique such as a wave field synthesis technique has been widely used as a technique for generating such a sound field.
There is a method called wave field synthesis for sound reproduction technology for reproducing a sound field of a recording location in a screening space (PTL 1). In the method based on PTL 1, after an acoustic signal at a location where the acoustic signal is recorded is collected by microphones installed at a plurality of locations, the arrival directions of the acoustic signal in vertical and horizontal directions are analyzed to physically reproduce the acoustic signal of the recording venue using a plurality of speakers installed in the screening space.
There is a technology in which an acoustic sink in a target sound field to be reproduced is assumed and a drive signal derived from a first-type Rayleigh integral is given to a speaker array to generate a sound image in front of speakers (NPL 1).
There is also a multipole sound source as a method of controlling the directivity of sound radiated from speakers (NPL 2). A multipole sound source is a technique for expressing the directivity of sound using a combination of primitive directivities such as dipoles and quadrupoles. Each of the primitive directivities is realized with a combination of sound sources of different polarities close to each other.
However, the methods described in PTL 1 and NPL 1 require a large number of speaker arrays covering the screening venue in order to reproduce a sound field in the entire screening venue, which may increase the scale of the apparatus.
Therefore, it is an object of the present invention to provide a technique for reducing the scale of the apparatus that reproduces a sound field.
A speaker array of an aspect of the present invention includes a plurality of speakers arranged at intersections of a first plurality of virtual lines arranged parallel to each other at equal intervals and a second plurality of virtual lines that are perpendicular to the first plurality of virtual lines and are arranged parallel to each other at equal intervals, wherein multipoles of a given order are superimposed using the plurality of speakers to realize wave field synthesis.
A signal processing apparatus of an aspect of the present invention includes a filter coefficient determination unit configured to calculate weighting coefficients for a plurality of speakers according to positions of the plurality of speakers, the plurality of speakers being arranged at intersections of a first plurality of virtual lines arranged parallel to each other at equal intervals and a second plurality of virtual lines that are perpendicular to the first plurality of virtual lines and are arranged parallel to each other at equal intervals, and a convolution calculation unit configured to multiply the weighting coefficients for the plurality of speakers by input signals to calculate output signals to be reproduced by the plurality of speakers.
A signal processing apparatus of an aspect of the present invention includes a filter coefficient determination unit configured to calculate weighting coefficients to be assigned to a plurality of speakers from weighting coefficients to be assigned to multipoles of a given order superimposed by the plurality of speakers and positions of the plurality of speakers, the plurality of speakers being arranged at intersections of a first plurality of virtual lines arranged parallel to each other at equal intervals and a second plurality of virtual lines that are perpendicular to the first plurality of virtual lines and are arranged parallel to each other at equal intervals, and a convolution calculation unit configured to multiply the weighting coefficients for the plurality of speakers by input signals to calculate output signals to be reproduced by the plurality of speakers.
A signal processing apparatus of an aspect of the present invention includes a focal coordinate determination unit configured to determine, as positions of a plurality of focused sound sources used to superimpose multipoles of a given order, intersections of a first plurality of virtual lines arranged parallel to each other at equal intervals and a second plurality of virtual lines that are perpendicular to the first plurality of virtual lines and are arranged parallel to each other at equal intervals, a circular harmonic series conversion unit configured to calculate weighting coefficients to be assigned to the multipoles of the given order superimposed by the plurality of focused sound sources from a circular harmonic series, a filter coefficient calculation unit configured to calculate weighted driving functions to be assigned to a plurality of speakers constituting a linear speaker array from the positions of the plurality of focused sound sources and the weighting coefficients assigned to the multipoles, and a convolution calculation unit configured to convolve the weighted driving functions with input signals to calculate output signals to be given to the plurality of speakers.
A signal processing method of an aspect of the present invention includes calculating, by a computer, weighting coefficients for a plurality of speakers according to positions of the plurality of speakers, the plurality of speakers being arranged at intersections of a first plurality of virtual lines arranged parallel to each other at equal intervals and a second plurality of virtual lines that are perpendicular to the first plurality of virtual lines and are arranged parallel to each other at equal intervals, and multiplying, by a computer, the weighting coefficients for the plurality of speakers by input signals to calculate output signals to be reproduced by the plurality of speakers.
One aspect of the present invention is a signal processing program for causing a computer to operate as the signal processing apparatus.
According to the present invention, it is possible to provide a technique for reducing the scale of the apparatus that reproduces a sound field.
Hereinafter, embodiments of the present invention will be described with reference to the drawings. In the description of the drawings, the same parts are denoted by the same reference signs and the description thereof will be omitted.
As illustrated in
The plurality of speakers of the speaker array 1 according to the embodiment of the present invention are formed in an xy plane as illustrated in
The speakers illustrated in
In
Each of the plurality of speakers reproduces a signal calculated by a signal processing apparatus 2 which will be described later, thereby realizing a sound field that protrudes in front of the speakers and has directivity. The signal processing apparatus 2 weights multipoles realized in the speaker array 1 to calculate input signals to the speakers.
The speaker array 1 realizes multipoles of a given order distributed in a plane. In the embodiments of the present invention, a basis multipole which is a multipole having an m-th order in the x-axis direction and an n-th order in the y-axis direction is defined as a (m, n) multipole. Basis multipoles refer to basic multipoles which are generally called dipoles, quadrupoles, or the like.
Basis multipoles and the directivity of their sound fields will be described with reference to
In order to superimpose multipoles of up to an N-th order, speakers are arranged at coordinates obtained by calculating equation (1) for 0≤m≤N.
[Math. 1]
x=(m−2μ)d(0≤μ≤m)
y=(n−2v)d(0≤v≤n) Equation (1)
A speaker in the center of each diagram shown in
The speaker array 1 according to the first embodiment has a plurality of speakers closely arranged in a grid pattern. Thus, the speaker array 1 can reduce the area in which the speaker array 1 is disposed. By superimposing multipoles of a given order using a plurality of speakers closely arranged in a grid pattern, the speaker array 1 can also reproduce a sound field that can reproduce the phase of sound as well with high accuracy, similar to the wave field synthesis technique.
A second embodiment will be described with respect to a method of reproducing the same sound fields as those generated by the basis multipoles using the speaker array 1 according to the first embodiment.
Signals to be reproduced by the speakers are processed by the signal processing apparatus 2 illustrated in
The signal processing apparatus 2 includes a filter coefficient determination unit 11 and a convolution calculation unit 12.
The filter coefficient determination unit 11 calculates weighting coefficients for the plurality of speakers included in the speaker array 1 according to the first embodiment illustrated in
The weighting coefficients for the plurality of speakers are calculated using equation (2). In equation (2), each speaker illustrated in
The convolution calculation unit 12 multiplies weighting coefficients for the plurality of speakers by the input signals to calculate output signals to be reproduced by the plurality of speakers. An output signal is calculated for each speaker. The output signals are signals to be reproduced by the speakers. As shown in equation (3), the output signal of each speaker is obtained by multiplying the input signal by the weight calculated for the speaker using equation (2).
[Math. 3]
Sμ,v(ω)=gμ,vm,n·S(ω) Equation (3)
In the second embodiment, the same sound fields as those generated by the basis multipoles can be reproduced by inputting signals obtained by multiplying the input signals by the weighting coefficients calculated using equation (2) to the speakers.
A third embodiment will be described with respect to a method of realizing a sound field generated by a sound source having complicated directivity with the speaker array 1 according to the first embodiment.
Signals to be reproduced by the speakers are processed by a signal processing apparatus 2a illustrated in
A filter coefficient determination unit 11a calculates weighting coefficients for the plurality of speakers of the speaker array 1 according to the first embodiment from weighting coefficients wm,n assigned to multipoles of a given order superimposed by the plurality of speakers and the positions of the speakers. The filter coefficient determination unit 11a receives the order of the speaker array 1 from the outside and specifies the positions of the speakers of the speaker array 1 using equation (1). The filter coefficient determination unit 11a calculates the weighting coefficients for the speakers from the outside using the weighting coefficients of the multipoles realized by the speaker array 1.
The weighting coefficients for the plurality of speakers are calculated using equation (4) and the weighting coefficients wm,n assigned to the multipoles of a given order. Equation (4) is a transformation of equation (2) using the weighting coefficients assigned to the (m, n) multipoles.
A convolution calculation unit 12a multiplies weighting coefficients wm,n for the plurality of speakers by the input signals to calculate output signals to be reproduced by the plurality of speakers. An output signal is calculated for each speaker. The output signals are signals to be reproduced by the speakers. As shown in equation (5), the output signal of each speaker is obtained by multiplying the input signal by the weight calculated for the speaker using equation (4). Equation (5) shows that the input signal multiplied by the summation of the weighting coefficients wm,n calculated using equation (4) while varying m and n over the ranges of 0≤m≤N and 0≤m≤N−m for the speaker corresponding to the index (α, β) is the output signal to be reproduced by the speaker.
The weighting coefficient wm,n to be assigned to the (m, n) multipole can be obtained, typically for a directional sound source which is a target on the sound collecting side, using a least squares method or the like for signals observed at control points arranged on a unit circle surrounding the sound source. Values calculated using the least squares method for the signals observed at the control points arranged on the unit circle can be calculated as elements of a matrix W by solving equation (6).
[Math. 6]
W=(GHG+λI)−1GHS
W=[w0,0 . . . w0,N]T
S=[S0(ω) . . . SL-1(ω)]T
G=[g0 . . . gL-1]
gq=[u0,0q . . . u0,nq]T
um,nq=G2D(rq,k)·(2dk)m+n cosmϕq sinnϕq Equation (6)
In the third embodiment of the present invention, a sound field generated by a sound source having a desired directivity can be reproduced by reproducing the output signals calculated using equation (5) through the speakers.
The third embodiment has been described with reference to the case where the weighting coefficients wm,n to be assigned to the multipoles are obtained using the least squares method or the like. However, the least squares method sometimes fails to reliably obtain the weighting coefficients wm,n because the least squares method often causes an underdetermined problem. A fourth embodiment will be described with reference to the case where the weighting coefficients wm,n of the multipoles are analytically calculated from a circular harmonic series. The fourth embodiment is suitable when a circular harmonic series Cm+n can be obtained from a sound field generated by a directional sound source which is a target on the sound collecting side.
Signals to be reproduced by the speakers are processed by a signal processing apparatus 2b illustrated in
A filter coefficient determination unit 11b calculates weighting coefficients wm,n to be assigned to the multipoles of a given order superimposed by the plurality of speakers of the speaker array 1 according to the first embodiment from a circular harmonic series Cm+n. The filter coefficient determination unit 11b receives the order of the speaker array 1 from the outside and specifies the positions of the speakers of the speaker array 1 using equation (1). The filter coefficient determination unit 11b calculates the weighting coefficient of each multipole realized by the speaker array 1 from a circular harmonic series in a sound collecting environment from the outside.
The filter coefficient determination unit 11b calculates the weighting coefficient wm,n to be assigned to each multipole by substituting the circular harmonic series Cm+n into equation (7).
The filter coefficient determination unit 11b further calculates the weighting coefficient to be assigned to each speaker from the above equation (4) and the weighting coefficients wm,n assigned to the multipoles of a given order.
When the weighting coefficient to be assigned to each speaker has been calculated, the same processing as in the third embodiment is performed. Specifically, a convolution calculation unit 12b multiplies weighting coefficients wm,n for the plurality of speakers by the input signals to calculate output signals to be reproduced by the plurality of speakers. An output signal is calculated for each speaker. The output signals are signals to be reproduced by the speakers. As shown in the above equation (5), the output signal of each speaker is obtained by multiplying the input signal by the weight calculated for the speaker using equation (4). Equation (5) shows that the input signal multiplied by the summation of the weighting coefficients wm,n calculated using equation (4) while varying m and n over the ranges of 0≤m≤N and 0≤m≤N−m for the speaker corresponding to the index (α, β) is the output signal to be reproduced by the speaker.
In the fourth embodiment, the signal processing apparatus 2b calculates the weighting coefficients wm,n of the multipoles from a circular harmonic series and calculates a weight to be assigned to each speaker from the weighting coefficients wm,n of the multipoles as described above. The signal processing apparatus 2b can further multiply the input signal by the weight assigned to each speaker of the speaker array 1 to generate a signal to be input to the speaker. The fourth embodiment can reproduce a sound field accurately by avoiding the least squares method which often causes an underdetermined problem.
In a fifth embodiment, a linear speaker array in which a plurality of speakers are arranged in a straight line is used and multipoles are weighted to realize a multipole sound source that protrudes in front of speakers and has directivity. In the fifth embodiment, a linear speaker array is used and multipoles are weighted to realize a multipole speaker array in which a plurality of focused sound sources are arranged in a grid pattern as illustrated in
The fifth embodiment will be described with reference to the case where the speakers of the speaker array are arranged in a straight line, but the present invention is not limited to this. The speaker array includes a plurality of speakers, while the plurality of speakers do not have to be arranged in a straight line.
In the fifth embodiment, a multipole sound source is realized by creating two or more focused sound sources of different polarities at positions close to each other. A focused sound source is a combination of non-directional point sound sources (monopole sound sources) of different polarities.
Signals to be reproduced by the speakers are processed by a signal processing apparatus 2c illustrated in
The focal coordinate determination unit 13 determines the intersections of a first plurality of virtual lines arranged parallel to each other at equal intervals and a second plurality of virtual lines perpendicular to the first plurality of virtual lines and arranged parallel to each other at equal intervals as the positions of a plurality of focused sound sources used to superimpose multipoles of a given order. The positions of the focused sound sources are provided symmetrically with respect to the center coordinates of the multipoles.
The focal coordinate determination unit 13 determines the positions of the focused sound sources according to equation (8).
[Math. 8]
x=xc+(m−2μ)d(0≤μ≤m)
y=yc+(n−2v)d(0≤v≤n) Equation (8)
Multipole sound sources are constructed by creating focused sound sources at coordinates obtained by calculating equation (8) for 0≤m≤N and 0≤n≤N−m.
The circular harmonic series conversion unit 14 calculates weighting coefficients to be assigned to multipoles of a given order superimposed by the focused sound sources from a circular harmonic series.
The circular harmonic series conversion unit 14 calculates the weighting coefficients wm,n to be assigned to the multipoles by substituting the circular harmonic series Cm+n into equation (7) as in the fourth embodiment.
The filter coefficient determination unit 11c calculates weighted driving functions to be assigned to the plurality of speakers of the linear speaker array from the positions of the focused sound sources and the weighting coefficients assigned to the multipoles.
The filter coefficient determination unit 11c calculates weighted driving functions to be convolved with input signals for the speakers of the linear speaker array based on each pair of focal coordinates determined by the focal coordinate determination unit 13. The filter coefficient determination unit 11c calculates a driving function using each pair of focal coordinates. For each multipole, the filter coefficient determination unit 11c calculates a weighted driving function to be assigned to each speaker from a combined driving function that is calculated from the coordinates and driving functions of focused sound sources constituting the multipole and a weight assigned to the multipole. Here, the filter coefficient determination unit 11c calculates the combined driving function of the multipole by summing functions multiplied by driving functions for the coordinates of focused sound sources included in the multipole. The filter coefficient determination unit 11c calculates a weighted driving function to be assigned to each speaker by weighting combined driving functions calculated for the multipoles by weights assigned to the multipoles and summing the resulting combined driving functions.
First, when calculating a weighted driving function for a given speaker, the filter coefficient determination unit 11c calculates a driving function for each focused sound source using equation (9).
The filter coefficient determination unit 11c calculates a weighted driving function for each speaker using the weights calculated by the circular harmonic series conversion unit 14.
[Math. 10]
D(xi,ω)Σα=−NNΣβ=−N+|α|N−|α|Sα,β(ω)·D2.5D(xi,xα,β,ω) Equation (10)
Although equations (9) and (10) relate to driving functions obtained in a two-dimensional plane assuming a line sound source as a sound source, other driving functions may be used. For example, driving functions obtained in a two-dimensional plane assuming a point sound source as a sound source may be used as shown in equations (11) and (12). The filter coefficient determination unit 11c may calculate the driving functions using equation (11) and calculate weighted driving functions of equation (12).
When the filter coefficient determination unit 11c has calculated the weighted driving functions for the speakers of the linear speaker array, the convolution calculation unit 15 convolves the weighted driving functions with the input signals to calculate output signals to be given to the speakers.
In the fifth embodiment, the signal processing apparatus 2c obtains weighted driving functions to be assigned to the speakers of the linear speaker array and convolves the weighted driving functions with the input signals to calculate output signals to be given to the speakers. In the fifth embodiment, it is possible to realize a virtual multipole speaker array where focal coordinates are provided in a grid pattern, like the positions of speakers illustrated in
For example, a general-purpose computer system including a central processing unit (CPU) (a processor) 901, a memory 902, a storage 903 (a hard disk drive (HDD) or a solid state drive (SSD)), a communication device 904, an input device 905, and an output device 906 as illustrated in
Each signal processing apparatus 2 may include an input/output interface for receiving/outputting input signals, conditions for calculating output signals, and output signals.
The signal processing apparatus 2 may be implemented on one computer or may be implemented on a plurality of computers. The signal processing apparatus 2 may also be a virtual machine implemented on a computer.
The signal processing program may be stored in a computer-readable recording medium such as an HDD, an SSD, a universal serial bus (USB) memory, a compact disc (CD), a digital versatile disc (DVD), or may be distributed via a network.
The present invention is not limited to the above embodiments and many modifications can be made within the scope of the gist thereof.
Number | Date | Country | Kind |
---|---|---|---|
2019-145975 | Aug 2019 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2020/028433 | 7/22/2020 | WO |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2021/024806 | 2/11/2021 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
10433093 | Setiawan | Oct 2019 | B2 |
20130216070 | Keiler et al. | Aug 2013 | A1 |
Number | Date | Country |
---|---|---|
2011244306 | Dec 2011 | JP |
2012169895 | Sep 2012 | JP |
2013545391 | Dec 2013 | JP |
2019047478 | Mar 2019 | JP |
Entry |
---|
Haneda et al., “Directivity synthesis using multipole sources based on spherical harmonic expansion,” Journal of the Acoustical Society of Japan, 2013, 69(11):577-588, 25 pages (with English Translation). |
Spors et al., “Physical and Perceptual Properties of Focused Sources in Wave Field Synthesis,” 127th Audio Engineering Society Convention paper 7914, Oct. 9, 2009, 19 pages. |
Number | Date | Country | |
---|---|---|---|
20220321998 A1 | Oct 2022 | US |