The present invention relates to a sound collection/reproduction method and device capable of collecting sound with directivity in an arbitrary direction by using an array of microphones which are arranged in proximity to one another and of reproducing the collected sound in an arbitrary reproduction system with a different number of channels and a different reproduction device.
A microphone array device which employs a plurality of microphones as sound collection devices in a sound field is known. A technology which targets sound signals which are to be collected in targeted positions from among sound signals which are collected from microphones which are actually arranged instead of actually installing microphones and collecting sound in that manner has been proposed with the object of reducing the number of microphones in the microphone array device. The invention of Patent Document 1 is representative of this technology and estimates the sound collection signals in arbitrary positions in dimensional directions in the number of two microphones for each dimension.
According to the invention of Patent Document 1, as shown in
Patent Document 1: Japanese Unexamined Patent Publication No. 2001-45590
Furthermore, the invention of Patent Document 1 performs signal processing which approximates the signals arriving at the two microphones from the sound source as planar waves. However, the sound waves in the actual sound field are not limited to being planar waves, meaning that the estimated positions of the received sound signals cannot be accurately obtained.
In addition, although detecting the phase difference, time difference, and frequency difference of the sounds collected by a plurality of microphones and estimating the received sound signals in an arbitrary position based on these differences may also be considered, in this case, when the plurality of microphones are arranged in proximity to one another, the detected value for the phase difference or the like is diminished and erroneous effects are readily introduced, making an accurate estimate difficult.
Furthermore, generally speaking, in the field of sound collection or the editing field, the setting of the sound amount is in a suitable state and the setting conditions for the sound collection and editing cannot be improved and must always be confirmed. For example, in cases where sound collection for a 5.1-channel surround system is carried out, a 5.1-channel reproduction system must actually be prepared and monitored. However, it is difficult to prepare a bulky reproduction system of this kind in the sound collection field and monitoring has generally been performed using headphones or two-channel monitor speakers. However, it has hitherto not been possible to confirm the status (the magnitude of the sound and the quality of each image fixed position) of each channel (the sound in each direction) of multichannel collected sound using conventional two-channel headphones or speakers.
The present invention was proposed in order to solve the above problems of the prior art and an object of the present invention is to provide a sound collection system which makes it possible to enhance and sample sounds arriving from arbitrary directions in a state where a plurality of microphones are arranged in proximity to one another (the curvature of the arriving wave plane is not limited to a parallel planar wave and can arbitrarily correspond to a spherical wave of an arbitrary curvature).
More specifically, the present invention provides a sound collection system which is capable of collecting sounds with an emphasis on sounds from arbitrary directions (the directivity of the microphones is directed in arbitrary directions) by signal-processing signals which are input to the plurality of microphones arranged in proximity to one another.
A further object of the present invention is to provide a sound collection system which is capable of obtaining a two-channel output as a result of inputting, to a virtual sound source reproduction processing unit, the output of N channels obtained as a result of signal-processing the signals which are input via each of the microphones, and monitoring this two-channel output by means of headphones or two-channel speakers.
Yet another object of the present invention is to provide a sound collection/reproduction system which is compatible with generally widespread reproduction systems or reproduction systems that will be developed in the future such as a 5.1-channel surround system or stereo system, or a virtual sound source reproduction processing unit, for example, by connecting the sound collection system to an arbitrary reproduction system with a different number of channels and a different reproduction device,
The invention of claim 1 is a sound collection/reproduction system according to which a plurality of sound collection devices are arranged in proximity to one another, control filters in a number corresponding to the number of reproduction channels are connected to each of the microphones, the output signals from the control filters of each of the channels are added for each of the channels and output from each of the reproduction channels, wherein the control filters are obtained by setting a plurality of control points in a sound field around the plurality of sound collection devices arranged in proximity to one another, determining a desired response function matrix and a transfer function matrix between the control points and each of the sound collection devices on the basis of measurement values, and, in cases where the directivity of the sound collection devices is designated, determining values of the control filters on the basis of the desired response function matrix and transfer function matrix between the control points corresponding to the designated directivity and each of the sound collection devices.
The invention of claim 2 is the invention according to claim 1, wherein the control filter is represented by H(ω)=[C(ω)T·C(ω)]−1C(ω)T·A(ω), where the control filter matrix is H(ω), the desired response function matrix is A(ω), and the transfer function is C(ω), and is obtained by solving the inverse matrix [C(ω)T·C(ω)]−1C(ω)T of the transfer function matrix C(ω).
The invention of claim 3 is the invention according to claim 1 or 2, wherein only the signal of the channel for which monitoring is performed is designated and extracted from among the signals of each of the channels from the adders of each of the channels, and the extracted channel signal is output to a two-channel speaker or headphones.
The invention of claim 4 is the invention of claim 3, wherein the virtual sound source reproduction processing unit divides the signals of each of the channels for use as left and right output signals of the reproduction device, and outputs the divided left and right signals of each of the channels to left and right reproduction devices via the control filters conforming to the characteristic of the reproduction device.
The invention of claim 5 is a sound collection/reproduction device having a plurality of sound collection devices arranged in proximity to one another, a digital signal processing unit which processes sounds collected by each of the sound collection devices, and a reproduction output unit which outputs a speech signal which is output by the digital signal processing unit, wherein the reproduction output unit is provided with reproduction devices of one or a plurality of channels, and the digital signal processing unit is provided with control filters in a number corresponding to the number of reproduction channels connected to each of the plurality of sound collection devices, and adders in a number corresponding to the number of channels, which add the outputs of the control filters of each of the reproduction channels connected to each of the sound collection devices for each channel, the outputs of the adder of each of the channels being connected to the reproduction devices of each of the channels of the reproduction processing unit.
In addition, the control filters are obtained by setting a plurality of control points in a sound field around the plurality of sound collection devices arranged in proximity to one another, determining a desired response function matrix and a transfer function matrix between the control points and each of the sound collection devices on the basis of measurement values, and, in cases where the directivity of the sound collection devices is designated, determining values of the control filters on the basis of the desired response function matrix and transfer function matrix between the control points corresponding to the designated directivity and each of the sound collection devices, and the digital signal processing unit is provided with a directivity control unit to which directivity control data are input in order to determine the directivity during sound collection by controlling the control filters.
The invention of claim 6 is the invention of claim 5, wherein the control filter is represented by H(ω)=[C(ω)T·C(ω)]−1C(ω)T·A(ω), where the control filter matrix is H(ω), the desired response function matrix is A(ω), and the transfer function is C(ω), and is obtained by solving the inverse matrix [C(ω)T·C(ω)]−1C(ω)T of the transfer function matrix C(ω).
The invention of claim 7 is the invention of claim 5 or 6, wherein a monitoring processing unit is connected to the digital signal processing unit, the monitoring processing unit being provided with a virtual sound source reproduction processing unit which converts the signal from the adder of each of the channels provided in the digital signal processing unit into an output signal of a two-channel monitoring device.
The invention of claim 8 is the invention of claim 7, wherein the virtual sound source reproduction processing unit comprises a control filter which divides the outputs from the adder of each of the channels provided in the digital signal processing unit into two to match two left and right channels of the monitoring device, and sets a filter coefficient corresponding to the monitoring device for each of the two divided left and right signals of each of the channels, left and right adders which add the outputs from the control filters of each of the channels, and an output unit which outputs signals from the left and right adders to each of the channels of the monitoring device.
The present invention constituted as described hereinabove affords the following effects. (1) Because microphones are arranged in closer proximity to one another than is the case with the existing systems, the physical scale of the whole system can be minimized. (2) Miniaturization is also possible from the perspective of data storage. (3), Since, information on the whole of the space sound field can be saved, compatibility with the existing sound field reproduction systems as well as the sound field reproduction systems that will be developed in the future is possible. (4) Cooperation with a virtualized (virtual sound source reproduction) system is straightforward and effective. (5) The sound collection status of each channel can be monitored by means of two-channels or headphones in a multichannel sound collection system which collects sound with an emphasis on sounds in a plurality of directions.
An embodiment of the sound collection/reproduction system of the present invention will be described next in specific terms with reference to the drawings.
(1) Example of Sound Collection Device:
The intervals between the respective microphones M1 to M4 are desirably intervals shorter than one quarter wavelength of the collected sound wave from the standpoint of the spatial sampling and the respective microphones M1 to M4 are disposed at a gap of about 10 mm in cases where the collected sound waves are in the audio bandwidth. However, the measurements are not restricted to those of this embodiment and may also range from about 100 mm to from 50 to 1 mm depending on the field of application. The number of channels from which sound is collected (the number of microphones) may be two or more.
(2) Reproduction/Equalizing Circuit:
An example of an algorithm which is used in the sound collection/reproduction system of the present invention will be described by means of the reproduction/equalizing circuit shown in
The desired response A is determined by the transfer function matrix A(ω) which is rendered by Equation (1) below.
A(ω)=[A1(ω) A2(ω) . . . Aq(ω)] [Equation 1]
Here, the desired response matrix A(ω) is acquired in a state where microphones M1 to M4 are arranged in the sound collection position of the sound field space by setting q control points around the microphones M1 to M4 as shown in
This transmission system C is determined, by means of the transfer function matrix C(ω) which is expressed by Equation 2 below.
Here, C11(ω) . . . C1M(ω) indicates the transmission coefficient between the first control point and each of the microphones and M indicates the number of control points. In addition, CN1(ω) . . . CNM(ω) indicates the transmission coefficient between the Nth control point and each of the microphones. The transmission coefficient C11(ω) . . . C1M(ω) is determined by measuring the transmission characteristic (attenuation and lag and so forth) between each of the microphones M1 to M4 and each of the control points.
The control filter H is determined by Equation 3 below on the basis of the desired response transfer function matrix A(ω) and the transfer function matrix C(ω).
H(ω)=[C(ω)T·C(ω)]−1C(ω)T·A(ω) [Equation 3]
In other words, as is evident from each of the above equations, because A(ω) which is contained in the control filter H is subtracted from the desired response transfer function matrix A(ω) by the adder Σ in the reproduction/equalizing circuit of
(3) Overall Constitution of the Sound Collection/Reproduction System:
As shown in
In
The digital signal processing unit 2 comprises control filters H11 to HMN which are connected to the output sides of the sound-collecting microphones I1 to IM respectively. In other words, control filters H in a number corresponding to the number of reproduction channels N are connected to each of the sound-collecting microphones I1 to IM. In addition, the control filters H used for the respective channels which are connected to the respective microphones are connected to the adders Σ1 to ΣN used for the respective reproduction channels.
The respective control filters H11 to HMN of the digital signal processing unit 2 have a directivity control unit 21 for inputting control data for determining the directivity of the sound-collecting microphones I1 to IM connected thereto. In other words, in order to perform sound collection with an emphasis on a sound which is produced in a desired direction and position among the sounds which are recorded in the sound field by the respective sound-collecting microphones H11 to HMN, the directivity control unit 21 inputs the direction and position as control data to the digital signal processing unit 2.
The directivity control unit 21 has control data directly input thereto manually by the user via the encoder and keyboard and so forth and has control data which change as time elapses input thereto by the computer program. In this case, q control points for measuring the desired response and one or a plurality of locations for the control points with which the desired response is obtained by subjecting the control points to a supplementary calculation are designated as the directivity control data to be input.
For example, only one control point may be designated in cases where the output channel is a single channel, and control points of a quantity and direction corresponding to the quantity and direction of the output channels are input as control data in cases where there are multiple channels.
When control points from which sound collection is to be performed are input by the operator, the directivity control unit 21 performs a calculation to determine the values of the respective control filters H11 to HMN in accordance with the algorithm shown in (2) on the basis of the desired response transfer function matrix A(ω) and transfer function matrix C(ω) pertaining to the control points obtained from measurement values and outputs the calculation result to the digital signal processing unit 2.
The monitoring processing unit 3 comprises two-channel monitoring output units O1 and O2 such as headphones or two-channel speakers. The signals from the adders Σ1 to ΣN of the respective reproduction channels are output via a virtual sound source reproduction processing unit 31 to the monitoring output units O1 and O2.
In other words, the virtual sound source reproduction processing unit 31 divides the signals from the adders Σ1 to ΣN of the respective reproduction channels into left and right speakers or headphones and, after the left and right signals resulting from the division have been transmitted by each of the control filters S1, C1 to Sn, and Cn, adds the outputs of the right-hand control filters S1 to Sn of the respective reproduction channels by means of the adder Σ01 before outputting the result to the monitoring output unit O1 and adds the outputs of the left-hand control filters C1 to Cn of the respective reproduction channels by means of the adder Σ02 before outputting the result to the monitoring output unit O2.
In this case, because the control filters S1, C1 to Sn, and Cn have different filter coefficients depending on the device such as speakers or headphones used as monitoring output units O1 and O2, signals which are adapted to listening using both ears of the listener are generated for each device.
In addition, the monitoring processing unit 3 is provided with the channel designation unit 32 for designating whether sound is to be monitored on either channel in cases where predetermined control points at which sound is to be collected are designated by the digital signal processing unit 2. The channel designation unit 32 designates only the signal of the channel being monitored among the signals of the respective channels output by the digital signal processing unit 2 and inputs this signal to the virtual sound source reproduction processing unit 31.
Reproduction processing unit 4 has reproduction output units O1 to ON for the respective channels which output the signals of the respective channel adders Σ1 to ΣN of the digital signal processing unit 2. The reproduction output units O1 to ON are connected to the inputs of arbitrary reproduction systems such as a stereo system, a 5.1-channel surround system, or a virtual sound source reproduction processing unit.
(4) Action of this Embodiment:
The action of the sound collection/reproduction system of this embodiment which is constituted as described hereinabove is as follows. First, prior to the sound collection, a plurality of each of the sound collection devices are arranged in the sound field space in a state of proximity to one another and a plurality of control points are set around the sound collection devices. In this state, the desired response function matrix A(ω) and the transfer function matrix C(ω) between the respective control points and the sound collection devices are determined from the measurement values by using each of the sound collection devices to record the sounds which are produced from each of these control points and the desired response function matrix A(ω) and the transfer function matrix C(ω) are stored in the directivity control unit 21.
However, each time the reproduction processing is carried out, it is determined on how many channels reproduction is to be performed, whereupon the corresponding quantity of reproduction devices is prepared for the reproduction processing unit 4 and connected to the reproduction output units O1 to ON of the respective channels provided in the reproduction device digital signal processing unit 2. The control filters H11 to HMN are also prepared in the quantity corresponding to the number of reproduction channels for each of the sound collection devices I1 to IM which are arranged in proximity to one another.
There is no need to determine the number of reproduction channels beforehand. The sounds recorded by each of the sound collection devices are stored in a storage device and, after the number of reproduction channels has been determined, the digital signal processing unit 2 which comprises the required number of control filters and adders as well as the reproduction devices can also be prepared.
In this state, the sounds recorded by each of the sound collection devices I1 to IM are input to the control filters H11 to HMN which are connected in a quantity corresponding to the number of channels to each of the sound collection devices.
Here, when the operator makes an input to the directivity control unit 21 indicating that sound collection is to be performed with an emphasis on sound in a particular direction, a control point which is obtained by measuring the desired response function and transfer function beforehand (or determined through calculations from the measurement values) is selected by the directivity control unit 21 on the basis of the direction and position thus input (the distance from the sound collection device), whereupon the directivity control unit 21 calls the desired response function matrix and transfer function matrix for the control point q and, by substituting the desired response function matrix and transfer function matrix in Equation 3, the values of the control, filters H11 to HMN are found by way of calculation.
In this case, because the distance and direction differ between the respective sound collection, devices I1 to IM and control points q, the desired response function and transfer function also differ. In addition, in cases where there is a plurality of reproduction channels, the direction of the directivity afforded to the sound collection devices for each channel (the direction in which the sound collection device performs sound collection with emphasis) differs and the values of the respective control filters are also different.
Thus, when the values of the respective control filters are determined, only the sound in the desired direction among the sounds of the respective sound collection devices is emphasized for each channel by the control filters H11 to HMN. Thereafter, the signals from the respective control filters are added by the adders Σ1 to ΣN for each channel and the result is output from the output unit reproduction units O1 to ON of the respective channels to the reproduction devices of each channel.
According to this embodiment, a designation of the channel to be monitored is issued by the channel designation unit 32 to the monitoring processing unit 3 in order to perform reproduction channel monitoring. Thus, only the signal of the desired channel is selected from among the signals of the adders Σ1 to ΣN of the respective channels which are provided in the digital signal processing unit and this signal is output to two-channel speakers or headphones which constitute the monitoring reproduction device via the control filters S1, C1 to Sn, and Cn. In this case, the optimum output can be obtained irrespective of the type of reproduction device by setting the coefficients of the control filters S1, C1 to Sn, and Cn in accordance with the reproduction device which is making the output.
(5) Effect of the Embodiment:
As mentioned earlier, this embodiment makes it possible to perform extraction with an emphasis on only the sound in the desired direction from among the collected sounds and reproduce the sound by collecting all of the sounds received in the sound field space and using a control filter to process these collected sounds without giving a particular directivity to the plurality of sound collection devices themselves.
In particular, because it is determined, using control data supplied to the control filters, which sound in which direction is emphasized in the sound reproduction, a sound collection/reproduction system with a high degree of freedom with which it is possible to freely determine the direction in which the sound is collected and the number of reproduction channels by changing the control data and which is not limited to parallel planar waves such as those of the prior art but rather which is also compatible with spherical waves with an arbitrary curvature.
Furthermore, the present invention finds the desired response and transfer function for preset control points q by measuring same or calculating same on the basis of the measurement values and determines control filters on the basis of the data based on the measurement values. Hence, even in cases where the directivity is applied in either direction of the sound collection device, an output which is an approximation of the desired response can be obtained by using an approximation method such as the least-squares method to solve the inverse matrix [C(ω)T·C(ω)]−1C(ω)T of the transfer function matrix C constituting the control filters H.
In addition, the sound collection direction that is input to the directivity control unit 21 can also be set manually by the operator. However, a value which varies moment by moment as time elapses can also be input by means of a computer program or the like. In this case, the values of the control filters H11 to HMN change as the control data thus input vary and a sound in the desired direction can be output to the desired channel.
In addition, this embodiment is constituted such that the outputs from the digital signal processing unit 2 are introduced to the monitoring processing unit 3 before being output to a two-channel reproduction device. Hence, the outputs to the reproduction channels can both be distinguished clearly from another channel sound and heard simply by operating the channel selection unit 32 provided in the monitoring processing unit 3. Naturally, although only the sound of a single reproduction channel can be monitored in this case also, the sounds of a plurality of channels which are output by the adders Σ1 to ΣN can also be simultaneously output to the monitoring device.
M1 to M4 . . . microphone 1 . . . sound collection device 2 . . . digital signal processing unit 21 . . . directivity control unit 3 . . . monitoring processing unit 31 . . . virtual sound source reproduction processing unit 32 . . . channel designation unit 4 . . . reproduction processing unit A . . . desired response C . . . transfer function H . . . control filters I1 to IM . . . sound-collecting microphones H11 to HMN . . . sound collection system control filters Σ1 to ΣN . . . adders O1 to ON . . . reproduction output units S1, C1 to Sn, Cn . . . control filters O1, O2 of virtual sound source reproduction processing unit . . . monitoring output unit
Number | Date | Country | Kind |
---|---|---|---|
2005-351359 | Dec 2005 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2006/323953 | 11/20/2006 | WO | 00 | 9/25/2009 |