SIGNAL PROCESSING APPARATUS, SIGNAL PROCESSING METHOD, AND SIGNAL PROCESSING PROGRAM

Abstract
A signal processing device 1 includes an expansion coefficient calculation unit 11 that, from an outward sound field to be reproduced, calculates a spherical harmonics expansion coefficient for reproducing the sound field; an expansion coefficient conversion unit 12 that converts the calculated spherical harmonics expansion coefficient into a weight factor for superposition of multipole sources; a filter coefficient calculation unit 13 that, from the weight factor, calculates a filter coefficient corresponding to each speaker included in a multipole speaker array, the speaker providing output outwardly; and a convolution operation unit 14 that convolves the filter coefficient corresponding to each speaker into an input acoustic signal to calculate an output acoustic signal for each speaker.
Description
TECHNICAL FIELD

The present invention relates to a signal processing device, a signal processing method, and a signal processing program.


BACKGROUND ART

In recent years, reproduction schemes with multiple arranged speakers have been popularized for public viewing and at home. With popularization of video technologies, such as 3D (three-dimensional) videos and wide videos, efforts have also been made to achieve acoustic reproduction with a higher sense of presence. Specifically, directions of arrival and loudness of sounds are controlled by the speakers for scenes of videos. In particular, a desired sound field is reproduced through a sound field reproduction technology using a speaker array including multiple disposed speakers.


As a general sound field reproduction technology, there is a Pressure-Matching based approach of solving an inverse problem of matching the desired sound field and a sound field to be reproduced. Inverse problems, however, are ill-conditioned problems, and tend to give unstable solutions. In contrast, approaches based on analytic approaches with a spherical speaker array using spherical harmonics or a linear speaker array using an angle spectrum may often give stabler solutions than the inverse problems, and many such approaches have been proposed.


There is an approach of reproducing directional characteristics of the spherical harmonics through superposition of multipole sources (see Patent Literature 1). Patent Literature 1 applies a spherical harmonics expansion coefficient to the directional characteristics of the spherical harmonics reproduced through the superposition of the multipole sources, and thus reproduces the directional characteristics generated by the spherical harmonics, through the superposition of the multipole sources. The spherical harmonics expansion coefficient is obtained through an inverse problem such as a least square method, or spherical harmonic expansion of the sound field.


There is a method of reproducing the desired sound field through a mode-matching approach (Non-Patent Literature 1). Non-Patent Literature 1 collects sounds with a spherical microphone array, and reproduces an expanded sound field with the spherical speaker array.


There is also the multipole source as a method of controlling directivity of the sounds emitted from the speakers (Non-Patent Literature 2). The multipole source is an approach of expressing the directivity of the sounds with a combination of primitive directivities such as a dipole and a quadrupole. Each primitive directivity is achieved with a combination of sound sources having different polarities in proximity to one another.


CITATION LIST
Patent Literature



  • Patent Literature 1: Japanese Patent Laid-Open No. 2012-169895



Non-Patent Literature



  • Non-Patent Literature 1: M. A. Poletti, “Three-Dimensional Surround Sound Systems Based on Spherical Harmonics,” Journal of the Audio Engineering Society 53.11 (2005): p. 1004-1025.

  • Non-Patent Literature 2: Yoichi Haneda, Kenichi Furuya, Suehiro Shimauchi, “Directivity synthesis using multipole sources based on spherical harmonic expansion,” The Journal of Acoustical Society of Japan, vol. 69, No. 11, pp 577-588, 2013.



SUMMARY OF THE INVENTION
Technical Problem

None of the literatures, however, discloses or suggests a method of reproducing the desired sound field with a multipole speaker array including multiple speakers that provide output outwardly. Patent Literature 1 and Non-Patent Literature 2 only reproduce the directional characteristics, and do not reproduce the sound field. In addition, Non-Patent Literature 1 uses the spherical speaker array, and does not use the multipole speaker array including the multiple speakers that provide the output outwardly.


In addition, the multipole source is not an orthogonal function, and thus cannot expand the sound field as with the spherical harmonics that are orthogonal functions. Accordingly, the reproduction of the sound field with the multipole speaker array requires derivation of a weight factor for superposition of multipoles. While the derivation through the Pressure-Matching based approach may be conceived as a general approach, the approach calculates the inverse problem and thus is likely to undesirably give unstable solutions.


In this way, the conventional art cannot derive the weight factor for the superposition of the multipoles, and thus cannot reproduce the desired sound field with the multipole speaker array including the multiple speakers that provide the output outwardly.


An object of the present invention, which has been accomplished in view of the above situation, is to provide a technology of reproducing the desired sound field with the multipole speaker array including the multiple speakers that provide the output outwardly.


Means for Solving the Problem

A signal processing device of one aspect of the present invention includes an expansion coefficient calculation unit that, from an outward sound field to be reproduced, calculates a spherical harmonics expansion coefficient for reproducing the sound field; an expansion coefficient conversion unit that converts the calculated spherical harmonics expansion coefficient into a weight factor for superposition of multipole sources; a filter coefficient calculation unit that, from the weight factor, calculates a filter coefficient corresponding to each speaker included in a multipole speaker array, the speaker providing output outwardly; and a convolution operation unit that convolves the filter coefficient corresponding to each speaker into an input acoustic signal to calculate an output acoustic signal for each speaker.


A signal processing method of one aspect of the present invention includes a step of calculating, by a computer from an outward sound field to be reproduced, a spherical harmonics expansion coefficient for reproducing the sound field; a step of converting, by the computer, the calculated spherical harmonics expansion coefficient into a weight factor for superposition of multipole sources; a step of calculating, by the computer from the weight factor, a filter coefficient corresponding to each speaker included in a multipole speaker array, the speaker providing output outwardly; and a step of convolving, by the computer, the filter coefficient corresponding to each speaker into an input acoustic signal to calculate an output acoustic signal for each speaker.


One aspect of the present invention is a signal processing program for causing a computer to function as the above described signal processing device.


Effect of the Invention

According to the present invention, it is possible to provide the technology of reproducing the desired sound field with the multipole speaker array including the multiple speakers that provide the output outwardly.





BRIEF DESCRIPTION OF DRAWINGS


FIG. 1 is a diagram illustrating a sound collection environment and a reproduction environment in an embodiment of the present invention.



FIG. 2 is a block diagram showing a configuration of a signal processing device.



FIG. 3 is a diagram illustrating polar coordinates.



FIG. 4 is a diagram illustrating an example of spherical harmonics up to degree n=3.



FIG. 5 is a diagram illustrating an example of positions and sound pressures of point sources constituting multipole sources up to μ+ν=2.



FIG. 6 is a flowchart illustrating processing in the signal processing device.



FIG. 7 is a diagram illustrating a hardware configuration of a computer used as the signal processing device.





DESCRIPTION OF EMBODIMENT

An embodiment of the present invention will be described below with reference to the drawings. In the drawings, the same reference signs are attached to the same portions and the description thereof will be omitted.


A signal processing device 1 according to the embodiment of the present invention generates, from an input acoustic signal, an output acoustic signal that reproduces a desired sound field with a multipole speaker array.


With reference to FIG. 1, a sound collection environment for the desired sound field and a reproduction environment for the desired sound field will be described.


As shown in FIG. 1(a), a spherical microphone array collects sounds from a desired sound source O. The sound source O is an outward sound field that provides output outwardly. The spherical microphone array is configured with microphones disposed around the sound source O. Data of a sound field to be achieved by the desired sound source O is identified through the sound collection. It should be noted that the data of the sound field does not need to be identified through the sound collection, and may be identified through modeling of a sound field to be reproduced.


The signal processing device 1 reproduces the desired sound field identified in FIG. 1(a), with the multipole speaker array as shown in FIG. 1(b). The multipole speaker array includes multiple speakers P that provide output outwardly. The signal processing device 1 generates an output acoustic signal to be output to each speaker P constituting the multipole speaker array.


The embodiment of the present invention derives an analytic expansion coefficient of spherical harmonics for reproducing the outward sound field with spherical harmony (spherical speaker array). The reproduction of the outward sound field with the multipole speaker array is achieved through analytic conversion of the derived expansion coefficient into a weight factor for superposition of multipole sources.


With reference to FIG. 2, the signal processing device 1 according to the embodiment of the present invention will be described. The signal processing device 1 includes an expansion coefficient calculation unit 11, an expansion coefficient conversion unit 12, a filter coefficient calculation unit 13, and a convolution operation unit 14.


The expansion coefficient calculation unit 11 calculates, from the outward sound field to be reproduced, a spherical harmonics expansion coefficient for reproducing this outward sound field.


The sound field to be reproduced is calculated in accordance with Expression (1). Expression (1) expresses the sound field in polar coordinates as shown in FIG. 3. It should be noted that an x-axis direction and a y-axis direction are two axes orthogonal to each other on a plane where the multipole speaker array is disposed.









[

Math
.

1

]











S

(

θ
,
ϕ
,
ω

)

=




n
=
1







m
=

-
n


n




A
n
m

(
ω
)




γ
n
m

(

θ
,
ϕ

)





,




Expression



(
1
)








  • θ,ϕ: arguments indicating an arbitrary control point in the polar coordinates

  • ω: an angular frequency (=2πf: f denotes frequency)

  • m,n: an order and a degree of a multipole in each of the x-axis direction and the y-axis direction where −n≤m≤n, n≥0

  • Ynm(θ,ϕ): the spherical harmonics

  • Anm: the spherical harmonics expansion coefficient



The spherical harmonics in Expression (1) is defined in Expression (2).









[

Math
.

2

]












γ
n
m

(

θ
,
ϕ

)

=





2

n

+

1



(

n
-
m

)

!




4



π

(

n
+
m

)

!







P
n
m

(

cos

θ

)



e

j

m

ϕ




,




Expression



(
2
)








  • Pnm(⋅): an associated Legendre function

  • j: an imaginary number



The spherical harmonics expansion coefficient in Expression (1) is defined in Expression (3). Expression (3) is referred to as spherical harmonic expansion. The spherical harmonics expansion coefficient is obtained through the spherical harmonic expansion.





[Math. 3]






A
n
m(ω)=∫00πS(θ,ϕ,ω)Ynm(θ,ϕ)*sin θdϕdθ.   Expression (3)


It should be noted that an example of the spherical harmonics up to degree n=3 is shown in FIG. 4. In FIG. 4, dotted hatching and diagonal hatching denote positive phases and negative phases, respectively. Parts of order m greater than or equal to 0 denote real parts, while parts of order m less than 0 denote imaginary parts.


The expansion coefficient conversion unit 12 converts the spherical harmonics expansion coefficient, which has been calculated by the expansion coefficient calculation unit 11, into the weight factor for the superposition of the multipole sources.


The multipole sources will be described here. The multipole sources are sound sources including an opposite phase distribution of point sources having the same amplitude in positions extremely close to the origin. By way of example, Expression (4) expresses a sound pressure distribution of the multipole sources where the point sources are arranged at very small intervals of 2d on an x-y plane, as follows.









[

Math
.

4

]














M
μ
v

(

r
,
k

)

=




(
d
)


μ
+
v








μ
+
v






x
μ






y
v






G

3

D


(

r
,
k

)









=



-

jk

4

π






h
0

(
2
)


(
kr
)




(

-
jdk

)


μ
+
v




cos
μ



ϕsin
v


θ








Expression



(
4
)










    • Mμν(r,k): the sound pressure distribution of the multipole sources

    • k: a wave number (k=ω/c)
      • ω: angular frequency (=2πf)
      • c: sound speed, f: frequency

    • 2d the interval between the point sources

    • j: the imaginary number (=√{square root over (−1)})

    • h0(2): a spherical Hankel function of the second kind of order 0

    • μ,ν: the numbers of differentials in the x-axis direction and the y-axis direction where n≥μ+ν≥0 (μ≥0, ν≥0), |m|=μ+ν






FIG. 5 shows an example of the positions and sound pressures of the point sources constituting the multipole sources up to μ+ν=2. In FIG. 5, “∘” denotes g=1, “●” denotes g=−1, and “▴” denotes g=−2. The position of each sound source is expressed in Expression (5).





[Math. 5]






x
μ,α
=x
c+(μ−2α)d . . . (0≤α≤μ)






y
ν,β
=y
c+(ν−2β)d . . . (0≤β≤ν)   Expression (5)

    • xμ,α: the position (x-coordinate) of the point source
    • yν,β: the position (y-coordinate) of the point source
    • (xc,yc): the central coordinate of the multipole sources


The sound pressure of the point source with respect to the x-axis direction is defined in Expression (6). The sound pressure of the point source with respect to the y-axis direction is also defined similarly.









[

Math
.

6

]










g

μ
,
α


=

{



1



(

α
=
0

)







g


μ
-
1

,
α


-

g


μ
-
1

,

α
-
1







(

0
<
α
<
μ

)






-

g


μ
-
1

,

α
-
1







(

α
=
μ

)









Expression



(
6
)










    • gμ,α: the sound pressure of the point source with respect to the x-axis direction





The sound pressure of the point source constituting the multipole sources of order (μ,ν) is defined below.





[Math. 7]






g
μ,α
ν,β
=g
μ,α
·g
ν,β  Expression (7)

  • gμ,αν,β: the sound pressure of the point source
  • gμ,α: the sound pressure of the point source with respect to the x-axis direction
  • gν,β: the sound pressure of the point source with respect to the y-axis direction


The sound field obtained through the superposition of the multipole sources is expressed in Expression (8).









[

Math
.

8

]










S

(

r
,
ω

)

=




μ
=
0







v
=
0






D
μ
v

(
ω
)




M
μ
v

(

θ
,
ϕ
,
ω

)








Expression



(
8
)












M
μ
v

(

r
,
ω

)

=


-

jk

4

π








α
=
0

μ





β
=
0

v



g

μ
,
α


v
,
β





h
0

(
2
)


(

k




"\[LeftBracketingBar]"


r
-

r

μ
,
α


v
,
β





"\[RightBracketingBar]"



)











    • D: the weight factor for the multipole sources





In accordance with Expression (4) and Expression (8), the sound field with the multipole speaker array is defined by Expression (9).











[

Math
.

9

]











S

(

r
,
ω

)

=




μ
=
0







v
=
0






D
μ
v

(
ω
)



{


-

jk

4

π






h
0
2

(
kr
)




(

-
jdk

)


μ
+
v




cos
μ



ϕsin
v


θ

}








Expression



(
9
)










    • k: the wave number (k=ω/c)
      • ω: angular frequency (=2πf)
      • c: sound speed, f: frequency

    • j: the imaginary number (=√{square root over (−1)})

    • h0(2): the spherical Hankel function of the second kind of order 0

    • μ,ν: the numbers of differentials in the x-axis direction and the y-axis direction where n≥μ+ν≥0 (μ∞0, ν≥0), |m|=μ+ν

    • D: the weight factor for the multipole sources





In addition, the outward sound field is defined with the spherical harmonics in Expression (10).











[

Math
.

10

]














S

(

r
,
ω

)

=





n
=
0







m
=

-
n


n




A
n
m

(
ω
)




h
n

(
2
)


(
kr
)




Y
n
m

(


π
2

,
ϕ

)










=




h
n

(
2
)


(
kr
)

[








n
=
0




{


A
n
0



F
n
0



(

π
2

)


}


+




n
=
1






m
=
1

n








{



A
n
m




Y
n
m

(


π
2

,
ϕ

)


+


A
n

-
m





Y
n

-
m


(


π
2

,
ϕ

)



}




]







=




h
0

(
kr
)

[








n
=
0




{


j
n



A
n
0




F
n
0

(

π
2

)


}


+




n
=
1







m
=
1

n



j
n




F
n
m

(

π
2

)










{



A
n
m



e

j

m

ϕ



+



(

-
1

)

m



A
n

-
m




e


-
jm


ϕ




}




]








Expression



(
10
)














F
n
m

(
θ
)

=






2

n

+
1


4

π






(

n
-
m

)

!



(

n
+
m

)

!







P
n
m

(

cos

θ

)









    • hn(2)(⋅): the spherical Hankel function of the second kind of order n

    • Yn−m(θ,ϕ)=(−1)mYnm(θ,ϕ)

    • hn(2)(kr)=jnh0(2)(kr) (kr is sufficiently large)





With respect to Expression (10), Euler's theorem shown in Expression (11) and a binominal theorem may be used to modify Expression (10) as shown in Expression (12).











[

Math
.

11

]













e

jn

ϕ


=


(


cos

ϕ

+

j

sin

ϕ


)

m






Expression



(
11
)















[

Math
.

12

]












S

(

r
,
ω

)

=



h
0

(
kr
)

[





n
=
0




{


j
n



A
n
0




F
n
0

(

π
2

)


}


+




n
=
1







m
=
1

n





v
=
0

m



j

n
+
v





F
n
m

(

π
2

)



(



m




v



)



{


A
n
m

+



(

-
1

)


m
+
v




A
n

-
m




}



cos

m
-
v



ϕ


sin
v


ϕ







}





Expression



(
12
)









In addition, let μ in Expression (9) be m−ν. Then comparison of coefficients of cosm−νϕ sinν ϕ between in Expression (9) and in Expression (12) derives Expression (13).











[

Math
.

13

]













D

m
-
v

v

(

-

jk

4

π



)




(

-
jdk

)

m


=

{







n
=
m





j
n



A
n
0




F
n
0

(

π
2

)






(

m
=
0

)












n
=
m





j

n
+
v




F
n


m




(

π
2

)



(



m




v



)








{


A
n
m

+



(

-
1

)


m
+
v




A
n

-
m




}







(

m
>
0

)









Expression



(
13
)








Furthermore, let m in Expression (13) be μ+ν. Then the expression is arranged to give Expression (14). The expansion coefficient conversion unit 12 converts the spherical harmonics expansion coefficient into the weight factor for the superposition of the multipole sources, in accordance with Expression (14).











[

Math
.

14

]











D
μ
v

=


-


4

π

jk




{







n
=
0





j
n



A
n
0




F
n
0

(

π
2

)






(


μ
+
v

=
0

)










1


(

-
jdk

)

μ







n
=

μ
+
v






j

n
+
v





F
n



μ
+
v



(

π
2

)



(




μ
+
v





v



)









{


A
n

μ
+
v


+



(

-
1

)


μ
+

2

v





A
n


-
μ

-
v




}







(


μ
+
v

>
0

)










Expression



(
14
)










    • Dμν: the weight factor for the superposition of the multipole sources

    • A: the spherical harmonics expansion coefficient

    • m,n: the order and the degree of the multipole in each of the x-axis direction and the y-axis direction where −n≤m≤n, n≥0, m=μ+ν

    • j: an imaginary unit

    • d: an interval between neighboring speakers

    • k: the wave number (k=2πf/c)
      • f and c denote frequency and sound speed of a sound signal to be controlled, respectively





The filter coefficient calculation unit 13 calculates, from the weight factor, a filter coefficient corresponding to each speaker that is included in the multipole speaker array and provides the output outwardly. The filter coefficient calculation unit 13 obtains the filter coefficient corresponding to each speaker by multiplying the weight factor for the superposition of the multipoles, which has been output by the expansion coefficient conversion unit 12, by the gain of each speaker constituting the multipole sources.


The convolution operation unit 14 convolves the filter coefficient corresponding to each speaker into the input acoustic signal to calculate the output acoustic signal for each speaker. The convolution operation unit 14 calculates the output acoustic signal for each speaker, from the input acoustic signal that is input, and the filter coefficient corresponding to each speaker constituting the multipole speaker array.


The output acoustic signal output by the signal processing device 1 is input to each speaker constituting the multipole speaker array. The output acoustic signal is reproduced at each speaker to thereby reproduce the desired sound field.


(Signal Processing Method)


With reference to FIG. 6, a signal processing method according to the embodiment of the present invention will be described.


At step S1, the signal processing device 1 first acquires data of the sound field to be reproduced.


At step S2, from the data of the sound field acquired at step S1, the signal processing device 1 next calculates the spherical harmonics expansion coefficient. At step S3, the signal processing device 1 converts the spherical harmonics expansion coefficient calculated at step S2, into the weight factor for the superposition of the multipole sources.


At step S4, the signal processing device 1 calculates the filter coefficient for each speaker from the weight factor for the superposition of the multipole sources calculated at step S3. At step S5, the signal processing device 1 convolves the filter coefficient for each speaker calculated at step S4, into the input acoustic signal to calculate the output acoustic signal for each speaker.


Instead of deriving the weight factor directly from the multipole sources, the signal processing device 1 according to the embodiment of the present invention compares the sound field expressed with the spherical harmony to the sound field expressed with the multipole sources, and thereby analytically converts the sound field with the spherical harmonics into the weight factor for the superposition of the multipoles. The signal processing device 1 can thus generate the acoustic signal that reproduces the desired sound field with the multipole speaker array.


As the above described signal processing device 1 of the present embodiment, for example, a general-purpose computer system is used, which includes a CPU (Central Processing Unit, processor) 901, a memory 902, a storage 903 (HDD: Hard Disk Drive, SSD: Solid State Drive), a communication device 904, an input device 905, and an output device 906. In this computer system, each function of the signal processing device 1 is implemented by the CPU 901 executing a predetermined signal processing program loaded on the memory 902.


It should be noted that the signal processing device 1 may be implemented in one computer, or may be implemented in multiple computers. The signal processing device 1 may also be a virtual machine implemented in the computer.


The signal processing program for implementing each function of the signal processing device 1 may be stored in a computer-readable recording medium, such as an HDD, an SSD, a USB (Universal Serial Bus) memory, a CD (Compact Disc), and a DVD (Digital Versatile Disc), and may also be delivered via a network.


It should be noted that the present invention is not limited to the above described embodiment, and numerous modifications can be made within the scope of the gist of the present invention.


REFERENCE SIGNS LIST






    • 1 Signal processing device


    • 11 Expansion coefficient calculation unit


    • 12 Expansion coefficient conversion unit


    • 13 Filter coefficient calculation unit


    • 14 Convolution operation unit


    • 901 CPU


    • 902 Memory


    • 903 Storage


    • 904 Communication device


    • 905 Input device


    • 906 Output device

    • M Microphone

    • O Sound source

    • P Speaker




Claims
  • 1. A signal processing device, comprising: an expansion coefficient calculation unit, implemented using one or more computing devices, configured to, from an outward sound field to be reproduced, calculate a spherical harmonics expansion coefficient for reproducing the sound field;an expansion coefficient conversion unit, implemented using one or more computing devices, configured to convert the calculated spherical harmonics expansion coefficient into a weight factor for superposition of multipole sources;a filter coefficient calculation unit, implemented using one or more computing devices, configured to, from the weight factor, calculate a filter coefficient corresponding to each speaker included in a multipole speaker array, the each speaker providing output outwardly; anda convolution operation unit, implemented using one or more computing devices, configured to convolve the filter coefficient corresponding to the each speaker into an input acoustic signal to calculate an output acoustic signal for the each speaker.
  • 2. The signal processing device according to claim 1, wherein the expansion coefficient conversion unit converts the spherical harmonics expansion coefficient into the weight factor for the superposition of the multipole sources, in accordance with Expression (1):
  • 3. A signal processing method, comprising: calculating, by a computer from an outward sound field to be reproduced, a spherical harmonics expansion coefficient for reproducing the sound field;converting, by the computer, the calculated spherical harmonics expansion coefficient into a weight factor for superposition of multipole sources;calculating, by the computer from the weight factor, a filter coefficient corresponding to each speaker included in a multipole speaker array, the each speaker providing output outwardly; andconvolving, by the computer, the filter coefficient corresponding to the each speaker into an input acoustic signal to calculate an output acoustic signal for the each speaker.
  • 4. The signal processing method according to claim 3, wherein converting the calculated spherical harmonics expansion coefficient comprises converting the spherical harmonics expansion coefficient into the weight factor for the superposition of the multipole sources, in accordance with Expression (2):
  • 5. A non-transitory recording medium storing a signal processing program, wherein execution of the signal processing program causes one or more computers to perform operations comprising: calculating, an outward sound field to be reproduced, a spherical harmonics expansion coefficient for reproducing the sound field;converting the calculated spherical harmonics expansion coefficient into a weight factor for superposition of multipole sources;calculating, from the weight factor, a filter coefficient corresponding to each speaker included in a multipole speaker array, the each speaker providing output outwardly; andconvolving the filter coefficient corresponding to the each speaker into an input acoustic signal to calculate an output acoustic signal for the each speaker.
  • 6. The recording medium according to claim 5, wherein converting the calculated spherical harmonics expansion coefficient comprises converting the spherical harmonics expansion coefficient into the weight factor for the superposition of the multipole sources, in accordance with Expression (3):
PCT Information
Filing Document Filing Date Country Kind
PCT/JP2019/033870 8/29/2019 WO