This application is based upon and claims the benefit of priority from prior Japanese Patent Application No. 2023-146343, filed Sep. 8, 2023, the entire contents of which are incorporated herein by reference.
Embodiments described herein relate generally to an acoustic control apparatus, a storage medium and an acoustic control method.
Various services using voices are utilized. Examples are guidance using voices such as a voice guidance in a public space and a car navigation system, and a voice conversation using an online meeting system. Voices in these kinds of services are useful for a person who requires the guidance and a person who wants to have a conversation, but can be mere noise for a person who does not require the guidance and a person who does not want to have the conversation. That is, an area to which the voice is to be transmitted and an area to which the voice is not to be transmitted changes in accordance with a place using the service and a time zone. Accordingly, demands have arisen for an acoustic control technique that facilitates transmitting a sound in only a specific direction.
In general, according to one embodiment, an acoustic control apparatus includes a processor. The processor calculates acoustic filter coefficients which are smaller in number than a plurality of sound sources, based on a first relational expression between acoustic filters applied to a voice signal containing information of voice reproduced by each of the sound sources, which is based on a sound increase control law that increases voice pressure at a sound increase control point by voices reproduced by the sound sources, and a second relational expression between the acoustic filters, which is based on an acoustic power minimization control law that minimizes acoustic power for voices reproduced by the sound sources. The processor applies the calculated acoustic filter coefficients to an input voice signal, and branch a voice signal to which a common acoustic filter coefficient is applied, into at least two of the sound sources.
Embodiments will be described below with reference to the accompanying drawings.
A first embodiment will be described below.
The acoustic control apparatus 100 combines sound increase control and acoustic power minimization control using a plurality of speakers to silence a sound decrease area A1 around the speakers and facilitating transmission of sound to only a specific sound increase area A2. The sound increase control is to control the amplitudes of sounds emitted from the speakers to increase sound pressure in a specific direction. The acoustic power minimization control is to control the amplitudes and phases of sounds emitted from the speakers to minimize acoustic power when the speakers are regarded as one speaker.
The voice signal input unit 101 inputs a voice signal to the voice signal processing device 102. The voice signal input unit 101 may also input a voice signal to the control device 103. The voice signal contains sound information. The voice signal is prepared for the purpose of, e.g., playback. The voice signal may be generated each time playback is performed and may also be input by a user or the control device 103.
The voice signal processing device 102 processes the voice signal. The voice signal processing device 102 includes an amplifier 1021 and acoustic filters 1022L, 1022C and 1022R.
The amplifier 1021 amplifies the voice signal input from the voice signal input unit 101 using a gain G. The gain G may be a fixed value, e.g., 1 and may also be designated by the control device 103.
The acoustic filter 1022L filters the voice signal output from the amplifier 1021 in accordance with an acoustic filter coefficient qL designated by the control device 103. Then, the acoustic filter 1022L branches the filtered voice signal into signals and outputs them to the speakers 104L1 and 104L2. In this case, the speakers 104L1 and 104L2 generate sounds having the same amplitude and the same phase. The acoustic filter 1022C filters the voice signal output from the amplifier 1021 in accordance with an acoustic filter coefficient qC designated by the control device 103. Then, the acoustic filter 1022C outputs the filtered voice signal to the speaker 104C. The acoustic filter 1022R filters the voice signal output from the amplifier 1021 in accordance with an acoustic filter coefficient qR designated by the control device 103. Then, the acoustic filter 1022R outputs the filtered voice signal to the speaker 104R. These acoustic filters each transmit only the sound in a specific band of the voice signal. The acoustic filter coefficients qL, qC and qR can be set to be equal to the complex volume velocities of the speakers 104L, 104C and 104R.
The control device 103 calculates the acoustic filter coefficients qL, qC and qR to be given to the acoustic filters 1022L, 1022C and 1022R, based on the frequency of the voice signal and the interval between the speakers 104L, 104C and 104R. The control device 103 may set the gain G. The control device 103 will be described in detail later.
Each of the speakers 104L, 104C and 104R is a sound source for emitting sound corresponding to the filtered voice signal output from the corresponding acoustic filter. The speaker 104L operates as a left-side speaker, the speaker 104C operates as a center speaker, and the speaker 104R operates as a right-side speaker. In the configuration shown in
The control device 103 will now be described.
The acquisition unit 1031 acquires various types of information necessary to calculate an acoustic filter coefficient. Then, the acquisition unit 1031 supplies the acquired information to the acoustic filter coefficient calculation unit 1032. The information acquired by the acquisition unit 1031 contains, e.g., a frequency, speaker intervals and transfer functions.
The frequency is the frequency of a voice signal to be input from the voice signal input unit 101. The acquisition unit 1031 acquires information of the frequency from the voice signal input unit 101, for example. Note that the frequency can be converted into a wavenumber if sound velocity c is known. Since the wavenumber is used to calculate an acoustic filter coefficient, the acquisition unit 1031 may acquire information of the wavenumber from the voice signal input unit 101. In addition, if the frequency of the voice signal has a fixed value, the acquisition unit 1031 may supply information of the frequency having a prestored fixed value to the acoustic filter coefficient calculation unit 1032.
The speaker intervals are intervals between a plurality of speakers. The acquisition unit 1031 acquires the speaker intervals based on, e.g., user's input. The speaker intervals may be equal to each other or different from each other. Note that if the speakers are fixed, the speaker intervals can be handled as fixed values. In this case, the acquisition unit 1031 may supply information of speaker intervals having prestored fixed values to the acoustic filter coefficient calculation unit 1032.
The transfer functions are functions representing the transmission characteristics between a sound increase control point and the speakers 104L, 104C and 104R, and determined by the positional relationship between the sound increase control point and the speakers 104L, 104C and 104R. The sound increase control point is a control target position of the sound increase control. The transfer functions are represented by matrices containing, as elements, a spatial transmission characteristic CL of sound transmitted from the speaker 104L to the sound increase control point, a spatial transmission characteristic CC of sound transmitted from the speaker 104C to the sound increase control point and a spatial transmission characteristic CR of sound transmitted from the speaker 104R to the sound increase control point. The spatial transmission characteristics can be measured in, e.g., an anechoic room or audio-visual room having little sound reflection, from microphone acquisition signals obtained by radiating sounds based on a random signal or a time stretched pulse (TSP) signal from the speakers 104L, 104C and 104R and then collecting the sounds using a microphone placed at the sound increase control point. The acquisition unit 1031 acquires the transfer functions thus measured. Note that if the positions of the speakers 104L, 104C and 104R and the position of the sound increase control point are fixed, the transfer functions are handled as fixed transfer functions. In this case, the acquisition unit 1031 may supply prestored fixed transfer functions to the acoustic filter coefficient calculation unit 1032.
The acoustic filter coefficient calculation unit 1032 receives various types of information from the acquisition unit 1031 and receives the acoustic filter coefficient of at least one speaker from the acoustic filter coefficient storage unit 1033 to calculate acoustic filter coefficients of the remaining speakers. Then, the acoustic filter coefficient calculation unit 1032 supplies the acoustic filter coefficients to the acoustic filter setting unit 1034. The acoustic filter is calculated as given by the following equation (1) from a first relational expression and a second relational expression which are obtained based on sound increase control law and acoustic power minimization control law, respectively.
In the equation (1), qC is the acoustic filter coefficient of the speaker 104C, qR is the acoustic filter coefficient of speaker 104R and qL is the acoustic filter coefficient of the speaker 104L. In addition, n is the multiplication factor of sound pressure energy by the sound increase control. DL is a transfer function between the sound increase control point and the speaker 104L, DC is a transfer function between the sound increase control point and the speaker 104C, DR is a transfer function between the sound increase control point and the speaker 104R. In addition, k is a wavenumber, c is the sound velocity, dLC is an interval between the speakers 104L and 104C, dRL is an interval between the speakers 104L and 104R, and dCR is an interval between the speakers 104C and 104R.
The acoustic filter coefficient storage unit 1033 stores the acoustic filter coefficient of at least one of the speakers 104L, 104C and 104R. For example, the acoustic filter coefficient storage unit 1033 stores the acoustic filter coefficient of the speaker 104C. The acoustic filter coefficient storage unit 1033 can be installed if the number of speakers is three or more. If the number of speakers is l (l≥3), the acoustic filter coefficient storage unit 1033 stores at least (l−2) acoustic filter coefficients. In the first embodiment, the speakers 104L1 and 104L2 are regarded as one speaker 104L, as described above. In this case, the acoustic filter coefficient storage unit 1033 has only to store one acoustic filter coefficient, e.g., the acoustic filter coefficient of the speaker 104C.
The acoustic filter setting unit 1034 sets the acoustic filter coefficients, which are calculated by the acoustic filter coefficient calculation unit 1032, to their respective
On the basis of the sound increase control law, the amount of sound increase is determined by the superposition of sounds from the sound source. If, therefore, the number of sound sources increases, the sound increase effect is improved. In the first embodiment, two speakers 104L1 and 104L2 function as one left-side speaker 104L. The sound increase effect can thus be improved more than the fact that one speaker is caused to have the function of the left-side speaker.
On the basis of the acoustic power minimization control law, the amount of reduction in acoustic power is determined by the sum of amounts of reduction in acoustic power among sound sources. Even if the total volume velocities of the sound sources are the same, the two divided sound sources increase a new sum (radiation resistance), with the result that the advantage of reduction in acoustic power increases.
That is, since the left-side speaker is divided into two speakers 104L1 and 104L2, both the advantages of sound increase control and acoustic power minimization control will be improved.
As shown in
As shown in
As shown in
As described above, in the first embodiment, the sound increase control and the acoustic power minimization control are combined. Thus, a sudden sound pressure difference can be made between the sound increase and decrease areas. In the first embodiment, the left-side speaker is divided into speakers that emit sounds of the same amplitude and the same phase. Therefore, the advantages of the sound increase control and acoustic power minimization control is improved more than those of the sound increase control and acoustic power minimization control in the case where the number of speakers and the number of acoustic filter coefficients are the same.
The configuration of the first embodiment also has advantages in terms of commercial materials and implementation. That is, in the first embodiment, great advantages of sound increase control and acoustic power minimization control can be expected because the number of speakers is increased while maintaining the number of acoustic filter coefficients, namely, the number of control channels of the speakers. Since the number of control channels can be decreased, the configuration can be simplified. If the speaker interval is constant, the number of speakers is increased to improve the advantages of sound increase control and acoustic power minimization control, in other words, the speaker interval can be shortened if the number of speakers is the same with the advantages of sound increase control and acoustic power minimization control. If, therefore, a voice signal filtered by the same acoustic filter coefficient is branched into two speakers to perform sound increase control and acoustic power minimization control as in the first embodiment, the speaker interval can be shortened compared with a configuration in which a voice signal filtered by the same number of acoustic filter coefficient as the number of speakers is input to a separate speaker. This results in system miniaturization.
In the first embodiment, the left-side speaker 104L is divided into the speakers 104L1 and 104L2. The voice signal processing device 102 branches a voice signal to which the acoustic filter coefficient qL is applied, into the speakers 104L1 and 104L2. The left-side speaker may be divided into three or more speakers. In this case, the voice signal processing device 102 may branch a voice signal to which the acoustic filter coefficient qL is applied, into three or more speakers.
As described above, in the sound increase control and acoustic power minimization control in the first embodiment, the amount of decrease in the sound pressure level of the sound decrease area 2 that is symmetrical to the sound increase area with reference to the speakers is smaller than that of the sound decrease areas 1 and 3.
If a phase shifter is located, for example, at a stage preceding the acoustic filters 1022L, 1022C and 1022R to shift the phase of a voice signal and thus increase the amount of decrease in the sound pressure level in the sound decrease area 2 that is symmetrical to the sound increase area. In this case, the phase of the voice signal input to the acoustic filter 1022L is, for example, 240 degrees, the phase of the voice signal input to the acoustic filter 1022C is, for example, 0 degrees, and the phase of the voice signal input to the acoustic filter 1022R is, for example, 120 degrees. As the phase of the voice signal is shifted by 120 degrees by the phase shifter, the phase of sound emitted from each of the speakers 104L, 104C and 104R is also shifted by 120 degrees. Accordingly, the sound emitted from the speakers 104L, 104C and 104R becomes a traveling wave and flows toward the sound increase area, for example. Then, the sound pressure interference in the sound decrease areas 1 and 3 is reduced. As a result, the amount of decrease in the sound pressure level in the sound decrease area 2 becomes large.
In addition, instead of phase correction to the voice signal, the gain of a voice signal input to the speaker nearest to the sound increase control point is further lowered to obtain an advantage equivalent to that in the phase correction. In the example of
In the first embodiment, four speakers are used for the sound increase control and acoustic power minimization control. However, the number of speakers is not limited to four. If there are three or more speakers, the number of sound increase control points can be set to two or more. Specifically, if the number of speakers is l, the number of sound increase control points may be set to at most (l−1).
In the example of
If four speakers are arranged on a circumference with the speaker 104C centered, the acoustic filter coefficients qL, qC, qR and qU are each calculated as give by the following equation (2).
In the above equation (2), n is a multiplication factor of sound pressure energy at a first sound increase control point, for example, a sound increase control point at the azimuth angle of 0°, and m is a multiplication factor of sound pressure energy at a second sound increase control point, for example, a sound decrease control point at the azimuth angle of 180°. In the equation (2), DL1 is a transfer function between the first sound increase control point and the speaker 104, DL2 is a transfer function between the second sound increase control point and the speaker 104L, DC1 is a transfer function between the first sound increase control point and the speaker 104C, DC2 is a transfer function between the second sound increase control point and the speaker 104C, DR1 is a transfer function between the first sound increase control point and the speaker 104R, DR2 is a transfer function between the second sound increase control point and the speaker 104R, DU1 is a transfer function between the first sound increase control point and the speaker 104U, and DU2 is a transfer function between the second sound increase control point and the speaker 104U. Furthermore, in the equation (2), d is an interval between the speaker 104C and each of the speakers 104R, 104L and 104U and corresponds to the radius of a circle in which the speakers 104R, 104L and 104U are arranged, r is an interval between the speakers 104U and 104L, an interval between the speakers 104U and 104R, an interval between the speakers 104D and 104L and an interval between the speakers 104D and 104R.
Compare
In modification 2 of the first embodiment, the speakers are arranged in a two-dimensional shape. However, the speakers may be arranged in a three-dimensional shape, for example, at the center of a sphere and on the sphere as illustrated in
For example, in the arrangement of
A second embodiment will be described below. In the first embodiment, a voice signal filtered by a common acoustic filter is branched and input to a plurality of speakers to simplify the system and improve the advantages of sound increase control and acoustic power minimization control. The second embodiment is directed to an example in which a plurality of speaker groups are spaced to improve the directivity of sound to the sound increase area.
The first speaker group includes speakers 104L1, 104C1 and 104R1 arranged on a straight line. Similarly, the second speaker group includes speakers 104L2, 104C2 and 104R2 arranged on a straight line. The sound increase control point of the first speaker group is set alongside the speaker 104R1 on a straight line where the speakers 104L1, 104C1 and 104R1 are arranged. The sound increase control point of the second speaker group is set alongside the speaker 104R2 on a straight line where the speakers 104L2, 104C2 and 104R2 are arranged.
In the first speaker group, the speakers 104R1, 104C1 and 104L1 are arranged in this order from the front to the rear. In the second speaker group, the speakers 104R2, 104C2 and 104L2 are arranged in this order from the front to the rear. The interval between the speakers 104R1 and 104R2, the interval between the speakers 104C1 and 104C2, and the interval between the speakers 104L1 and 104L2 become longer in this order. In other words, the first and second speaker groups are arranged obliquely from the front toward the rear. In the following, the interval between the speakers 104C1 and 104C2 may be referred to as distance L between the first and second speaker groups. If the first and second speaker groups are spaced apart as shown in
The voice signal processing device 102a has the same configuration as that of the voice signal processing device 102 shown in
The voice signal processing device 102b has the same configuration as that of the voice signal processing device 102 shown in
The acoustic filter coefficients qL1, qc1, qR1, qL2, qC2 and qR2 are calculated as given by the following equation (3).
In the equation (3), n is a multiplication factor of sound pressure energy at a sound increase control point set for each of the first and second speaker groups. DLL1 is a transfer function between the sound increase control point set for the first speaker group and the speaker 104L1. DLC1 is a transfer function between the sound increase control point set for the first speaker group and the speaker 104C1. DLR1 is a transfer function between the sound increase control point set for the first speaker group and the speaker 104R1. DLL2 is a transfer function between the sound increase control point set for the first speaker group and the speaker 104L2. DLC2 is a transfer function between the sound increase control point set for the first speaker group and the speaker 104C2. DLR2 is a transfer function between the sound increase control point set for the first speaker group and the speaker 104R2. DRL1 is a transfer function between the sound increase control point set for the second speaker group and the speaker 104L1. DRC1 is a transfer function between the sound increase control point set for the second speaker group and the speaker 104C1. DRR1 is a transfer function between the sound increase control point set for the second speaker group and the speaker 104R1. DRL2 is a transfer function between the sound increase control point set for the second speaker group and the speaker 104L2. DRC2 is a transfer function between the sound increase control point set for the second speaker group and the speaker 104C2. DRR2 is a transfer function between the sound increase control point set for the second speaker group and the speaker 104R2. In addition, d is an interval between the speakers 104C1, 104R1 and 104L1 and an interval between the speakers 104C2, 104R2 and 104L2.
As described above, the two speaker groups are arranged obliquely from the front to the rear and thus the sound increase areas of the two speaker groups intersect. The sound pressure level increases rapidly at the intersection of the sound increase areas. That is, the directivity of sound is enhanced by arranging the two speaker groups such that their sound increase areas intersect.
sound pressure levels when sound is radiated with the acoustic filter coefficients set in accordance with the results of the equation (3). The distribution indicates that the sound pressure level varies according to the concentration As shown in
In
In
In each of
If the results of 3ch 1set are compared among
If the optimum inclination angle is determined, the first and second speaker groups can be inclined and spaced apart by the inclination angle. At this time, as shown in
The equation (3) represents an acoustic filter coefficient calculated by taking into account the transmission characteristics of sound radiated from a speaker of one speaker group to a sound increase control point of another speaker group. As a simpler method, two speaker groups are arranged whose acoustic filter coefficients are calculated independently on the basis of the equation (1) to improve the directivity of sound.
With the foregoing search method, the user can install two speaker groups such that he or she can hear the loudest sound while listening to sound. In this method, no acoustic filter coefficients need to be recalculated.
As described above, according to the second embodiment, the directivity of sound is enhanced by spacing two speaker groups. That is, a more localized sound increase area can be set.
In the second embodiment, the speakers of each speaker group are three speakers arranged on a straight line. However, the speakers of each speaker group need not necessarily be three speakers arranged on a straight line. For example, each speaker group may include four speakers including two speakers arranged vertically as described with reference to
In the example shown in
The first and second speaker groups are the same as those in
In modification 2, two voice signal input units 101a and 101b are provided. The voice signal input unit 101a inputs one of the stereo voice signals, for example, a right voice signal, to the voice signal processing device 102a. The voice signal input unit 101b inputs the other stereo voice signal, for example, a left voice signal, to the voice signal processing device 102b. The stereo voice signals may be generated in sequence each time it is reproduced or may be input, for example, by a user or the control device 103.
The voice signal processing device 102a has the same structure as that of the voice signal processing device 102 shown in
The voice signal processing device 102b has the same configuration as that of the voice signal processing device 102 shown in
The acoustic filter coefficients qL1, qC1, qR1, qL2, qC2 and qR2 may be the same acoustic filter coefficients as given by the equation (3).
In the configuration of modification 2 according to the second embodiment described above, the advantage obtained by spacing the two speaker groups while maintaining the advantage of the monaural sound source is combined with a stereo sound image advantage. Thus, the spatial directivity distribution of stereo sound sources is achieved.
An example of a hardware configuration of the acoustic control apparatus 100 described in each of the foregoing embodiments will be described below with reference to
As shown in
The control unit 209 includes, for example, a central processing unit (CPU), a random access memory (RAM), and/or a read only memory (ROM) to control each component in accordance with information processing. The control unit 209 may operate as a voice signal input unit 101, a voice signal processing device 102 and a control device 103. The control unit 209 may read an execution program stored in the storage unit 210 to perform a process.
The storage unit 210 is a medium that stores information such as programs so that it can be read by a computer, a machine and the like. The storage unit 210 can store information of a speaker interval, information of the frequency of a voice signal, and information of a transfer function. The storage unit 210 may be, for example, an auxiliary storage device such as a hard disk drive and a solid state drive. The storage unit 210 may also include a drive. The drive is a device that reads data from another auxiliary storage device, a recording medium, and the like, and includes, for example, a semiconductor memory drive (flash memory drive), a compact disk CD) drive, and a digital versatile disk (DVD) drive. The type of the drive may be selected as appropriate in accordance with the type of the storage medium.
The power supply unit 211 supplies power to each element of the acoustic control apparatus 100. The power supply unit 211 may also supply power to each element of a device including the acoustic control apparatus 100. The power supply 211 may include, for example, a secondary battery or an AC power supply.
The timing device 212 is a device that measures time. The timing device 212 may be, for example, a clock including a calendar, and provides the control unit 209 with information of the current year and month and/or date and time. The timing device 212 may be used to date and time a voice signal to be reproduced.
The communication interface 205 is, for example, a near-field communication (e.g., Bluetooth (registered trademark)) module, a wired local area network LAN) module, a wireless LAN module, or the like, and is an interface that performs wired or wireless communications via a network. Note that the network may be an internetwork including the Internet or other types of network such as an in-house LAN. In addition, the communication interface 205 may perform one-to-one communication using a universal serial bus (USB) cable or the like. The communication interface 205 may also include a micro USB connector. The communication interface 205 is an interface that connects an acoustic control apparatus to an external device such as electric devices of automobiles, trains and houses, and various types of communication devices. The communication interface 205 is controlled by the control unit 209 and receives various types of information from an external device via a network or the like. The various types of information include, for example, speaker interval information, voice signal frequency information and transfer function information, which are set in an external device.
The input unit 206 is a device that receives input signals, and may be, for example, a touch panel, a physical button, a mouse and a keyboard. The output device 207 is a device that outputs information by display, voice, and the like, such as a display and a speaker. The speaker interval information, voice signal frequency information and transfer function information may be input through the input unit 206.
The external interface 208 mediates between the main body of the acoustic control apparatus and the external device. The external device may be, for example, a printer, a memory and a communication device.
While certain embodiments have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. Indeed, the novel embodiments described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the embodiments described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
Number | Date | Country | Kind |
---|---|---|---|
2023-146343 | Sep 2023 | JP | national |