The present technique pertains to an information processing apparatus and an information processing method, and particularly pertains to an information processing apparatus and an information processing method that enable a BRIR to be accurately reproduced.
It is possible to use a binaural room transfer function (BRIR: Binaural Room Impulse Response), which mathematically indicates how a sound reaches ears from a sound source in a certain space, to three-dimensionally reproduce a sound image by headphones.
For example, there is a case in which a BRIR is used to reproduce the acoustics of a studio for producing movie audio and the acoustics are used in the production of the movie audio. In this case, the BRIR used in order to reproduce the acoustics of the studio is measured by outputting audio corresponding to a measurement signal from each speaker in the studio, and collecting this audio by microphones worn on user's ears.
Typically, multiple studios are used separately according to production title or work content. Accordingly, BRIRs corresponding to respective studios become necessary in order to reproduce the acoustics of the respective studios. In order to measure BRIRs corresponding to respective studios, it is not efficient for a user to visit and make a precise measurement of all of the studios.
In contrast to this, PTL 1 describes a technique in which parameters for a direct sound, an initial reflected sound, and late reverberation which are obtained from a BRIR that is able to reproduce a space that will serve as a reference are each independently controlled and a BRIR for a space of interest is generated.
However, it has been difficult to use the technique described in PTL1 to accurately reproduce a BRIR measured in an actual space.
The present technique is made in the light of such a situation, and enables a BRIR to be accurately reproduced.
An information processing apparatus according to one aspect of the present technique is provided with a generation unit that generates a BRIR for a first measurement position on the basis of an RIR, the RIR being obtained on the basis of a sound outputted from a first sound source disposed in a first measurement environment, and an HRIR that corresponds to a positional relation between the first sound source and a first measurement position for the RIR within the first measurement environment.
In an information processing method according to one aspect of the present technique, an information processing apparatus generates a BRIR for a first measurement position on the basis of an RIR, the RIR being obtained on the basis of a sound outputted from a first sound source disposed in a first measurement environment, and an HRIR that corresponds to a positional relation between the first sound source and a first measurement position for the RIR within the first measurement environment.
In one aspect of the present technique, a BRIR for a first measurement position is generated on the basis of an RIR, the RIR being obtained on the basis of a sound outputted from a first sound source disposed in a first measurement environment, and an HRIR that corresponds to a positional relation between the first sound source and the first measurement position for the RIR within the first measurement environment.
Description is given below regarding aspects for implementing the present technique. The description is given in the following order.
It is possible to use a binaural indoor transfer function (BRIR: Binaural Room Impulse Response), which indicates a transfer characteristic of audio from a sound source to both ears in a certain indoor sound field, to three-dimensionally reproduce a sound image by headphones. The BRIR is an indoor impulse response that is measured by a dummy head (HATS: Head and Torso Simulators) or microphones worn on user's ears, and is divided into a head-related impulse response (HRIR) and a room impulse response (RIR).
In a space RM1, audio emitted from sound sources P1 through P3 reaches a user U1 as a direct sound, an initial reflected sound, and late reverberation, for example.
Among audio emitted from the sound sources P1 through P3, a direct sound, which is indicated by broken-line arrows in
The transfer characteristics of a direct sound, an initial reflected sound, and late reverberation are described by an RIR. The RIR indicates impact of the space RM1 in a BRIR, and represents, in the time domain, an indoor transfer function (RTF: Room Transfer Function), which is frequency domain information indicating a transfer characteristic of audio from the sound sources P1 through P3 to both ears of the user U1.
An HRIR indicates impact of the user U1 in the BRIR, and represents, in the time domain, a head-related transfer function (HRTF: Head Related Transfer function) that is frequency domain information indicating a transfer characteristic of audio which has reached a spherical surface SP1 centered on the user U1 and reaches both ears of the user U1.
In a case of using a BRIR to reproduce the acoustics of the space RM1, as illustrated in
An HRIR 0l and an HRIR 0r with respect to virtual sound sources corresponding to a direction of arrival of the direct sound are respectively convolved with the acoustic signal resulting from the RIR being convolved with the direct sound. An HRIR 1l through an HRIR Nl and an HRIR 1r through an HRIR Nr with respect to virtual sound sources corresponding to a direction of arrival of respective reflected sounds are convolved with acoustic signals resulting from RIRs being convolved with the reflected sounds 1 through N.
The HRIR 0l through HRIR Nl indicate transfer characteristics of audio from the virtual sound sources to the left ear of the user U1, and the HRIR 0r through HRIR Nr indicate transfer characteristics of audio from the virtual sound sources to the right ear of the user U1.
Acoustic signals to which the HRIR 0l through HRIR Nl are convolved are added together and reproduced from the left ear side of the headphones, and acoustic signals to which the HRIR 0r through HRIR Nr are convolved are added together and reproduced from the right ear side of the headphones, whereby audio emitted from the sound sources P1 through P3 in the space RM1 is reproduced.
Typically, in order to reproduce the acoustics of a certain space, a BRIR obtained by means of a HATS is used. It is known that, by using a BRIR for a user themselves in place of a BRIR obtained by means of an HATS, it is possible to more accurately reproduce the acoustics of the space.
For example, there is a case in which a BRIR is used to reproduce the acoustics of a studio for producing movie audio and the acoustics are used in the production of the movie audio. In this case, the BRIR used in order to reproduce the acoustics of the studio is measured by outputting audio corresponding to a measurement signal from each speaker in the studio, and collecting this audio by microphones worn on user's ears.
Typically, multiple studios are used separately according to production title or work content. Accordingly, BRIRs corresponding to respective studios become necessary in order to reproduce the acoustics of the respective studios. In order to measure BRIRs corresponding to respective studios, it is not efficient for a user to visit and make a precise measurement of all of the studios.
In contrast to this, PTL 1 describes a technique in which parameters for a direct sound, an initial reflected sound, and late reverberation which are obtained from a BRIR that is able to reproduce a space that will serve as a reference are each independently controlled and a BRIR for a space of interest is generated.
In addition, there is proposed a mechanism in which BRIRs for a direct sound and other audio are each measured in a different environment and these BRIRs are combined, whereby a BRIR for reproducing the acoustics of a certain space is generated. However, it has been difficult to accurately reproduce a BRIR measured in an actual space, even in the technique described in PTL 1 or the above-described mechanism.
An embodiment of the present technique has been conceived of after paying attention to the abovementioned point, and proposes a technique that enable a BRIR for reproducing the acoustics in a space of interest to be accurately reproduced without a user visiting the space. Description is given in detail below regarding the present embodiment.
The acoustic processing system in
Audio for a movie includes audio made by people such as narration or dialogue by actors, as well as various types of sounds such as sound effects, an environmental sound, and BGM. In a case where there is no need to distinguish the types of sounds from each other, as appropriate, description is given by collectively referring to audio below, but, in practice, a movie sound also includes types of sounds that are not audio.
As illustrated in
The RIR measurement apparatus 1 obtains an RIR that indicates a transfer characteristic of audio in an RIR measurement environment. The RIR measurement environment is referred to as a dubbing stage or the like, and is a movie theater that is used for audio production. A screen as well as multiple speakers are provided in the movie theater.
The HRIR measurement apparatus 2 obtains an HRIR that indicates a transfer characteristic of audio for both ears of a user in an HRIR measurement environment such as an anechoic room or a listening room. A producer of movie audio will be a user, and an HRIR that has been personalized for the user is measured.
The information processing apparatus 3 combines the RIR obtained by the RIR measurement apparatus 1 with the HRIR that is for the user and is obtained by the HRIR measurement apparatus 2, to thereby generate a BRIR. Reproduction using this BRIR is performed, whereby audio outputted from the speakers in the movie theater that corresponds to the RIR measurement environment is reproduced.
The RIR measurement apparatus 1, the HRIR measurement apparatus 2, and the information processing apparatus 3 each include a PC, for example. Note that the RIR measurement apparatus 1, the HRIR measurement apparatus 2, and the information processing apparatus 3 may be configured as one apparatus.
A speaker 11 that serves as a sound source is disposed in a studio RM11, which corresponds to the RIR measurement environment illustrated in A in
In this state, a reproduced sound is outputted from the speaker 11, and the reproduced sound is collected by the no-ears HATS 21, whereby a BRIR that is for the no-ears HATS 21 and indicates a transfer characteristic of audio from the speaker 11 in the studio RM11 to both ears of the no-ears HATS 21 is measured. The position in the studio RM11 at which the no-ears HATS 21 is disposed becomes a measurement position for the BRIR for the no-ears HATS 21. On the basis of the BRIR for this no-ears HATS 21, an RIR that indicates a transfer characteristic of audio in the studio RM11 is obtained. A method of obtaining the RIR is described below.
A speaker 31 that serves as a sound source is disposed in an anechoic room RM12, which corresponds to an HRIR obtainment environment illustrated in B in
In this state, a reproduced sound is outputted from the speaker 31, and the reproduced sound is collected by the microphones, whereby an HRIR which is for the user and indicates a transfer characteristic of audio from the speaker 31 to both ears of the user U1 is measured. The position of the user U1 within the anechoic room RM12 becomes a measurement position for an HRIR for the user U1.
Next, description is given regarding a BRIR generation method by the information processing apparatus 3.
In the studio RM11, it is possible to obtain a characteristic of the speaker 11 itself by a measurement by a microphone 41 disposed at a position that is very close to the speaker 11.
It is also possible to obtain a characteristic that corresponds to the sound field of the studio RM11, by a measurement by a microphone 41 which is disposed at a predetermined position in the studio RM11. This sound field characteristic includes a characteristic of the speaker 11 and a sound quality of the studio RM11.
The BRIR for the no-ears HATS 21 is obtained according to a measurement using the no-ears HATS 21 which is disposed at a predetermined position in the studio RM11. The BRIR for the no-ears HATS 21 includes a characteristic of the speaker 11, the sound quality of the studio RM11, and impact of the head and body portion of the HATS.
According to a measurement using a HATS (has-ears HATS) 42 on which pinna sections are provided and which is disposed at a predetermined position in the studio RM11, a BRIR that is for the has-ears HATS 42 and indicates a transfer characteristic of audio from the speaker 11 in the studio RM11 to both ears of the has-ears HATS 42 is obtained. The BRIR for the has-ears HATS 42 includes a characteristic of the speaker 11, the sound quality of the studio RM11, impact of the head and body portion of the HATS, and impact of the pinna sections of the HATS.
Typically, the BRIR for the has-ears HATS 42 is used in order to reproduce the acoustics of the studio RM11. However, the BRIR for the has-ears HATS 42 can be insufficient as data having an intended use of producing movie audio. This is because the BRIR for the has-ears HATS 42 includes the impact of the pinnas, head, and body portion of the has-ears HATS 42 instead of those belonging to a user and, when the BRIR for the has-ears HATS 42 is used, the reproducibility of the acoustics of the studio RM11, which is important in the production of movie audio, decreases.
In contrast to this, an objective of the acoustic processing system of the present technique is to measure, in an HRIR measurement environment, an HRIR that is for a user and includes the impact of the user's pinnas, head, and body, and use the HRIR for the user to obtain a BRIR that reproduces the acoustics of the studio RM11 in a manner that is more accurate than that of the BRIR for the has-ears HATS 42.
As described above, the BRIR for the no-ears HATS 21 which is measured in the studio RM11 illustrated in the left side in
In contrast, the HRIR for the user U1 that is measured in the anechoic room RM12 illustrated on the right side of
Accordingly, in a case of combining the BRIR for the no-ears HATS 21 and the HRIR for the user U1 unchanged to thereby generate a BRIR for reproducing the acoustics of the studio RM11, the generated BRIR includes the ITD and the ILD of the HATS, as well as a characteristic of the speaker 11, the sound quality of the studio RM11, and characteristics of the user. In order to obtain a BRIR that is equivalent to a BRIR that is actually measured in the studio RM11 by the user U1, it is necessary to cancel the ITD and the ILD of the HATS from the BRIR for the no-ears HATS 21.
Accordingly, as illustrated by a hollow arrow #11 in
Here, the left ear among the left and right ears is set as a sound recording point that is close to the speaker 31, which serves as a sound source. The information processing apparatus 3 calculates, as an ITD, a time difference between an amplitude peak for an HRIR that is for the left ear and is illustrated in the upper level of
BRIR_HL that is the BRIR for the left ear of the no-ears HATS 21 is illustrated in the frequency domain (BRTF) in the upper level of A in
In formulas (1) and (2), HRIR_HL is the HRIR for the left ear of the no-ears HATS 21, and HRIR_HR is the HRIR for the right ear of the no-ears HATS 21. HRIR_HL is illustrated in the frequency domain (HRTF) in the upper level of B in
On the basis of formulas (1) and (2), RIR_HL that is the RIR for the left ear of the no-ears HATS 21 and an RIR that is for the right ear of the no-ears HATS are obtained as in the following formulas (3) and (4). In the following formulas (3) and (4), HRIR_HL (−1) is an inverse function of HRIR_HL, and HRIR_HR (−1) is an inverse function of HRIR_HR. HRIR_HL (−1) is illustrated in the frequency domain in the upper level of C in
The information processing apparatus 3 uses the calculations indicated by the formulas (3) and (4) to cancel the HATS ILD from BRIR_HL and BRIR_HR, and extracts RIR_HL and RIR_HR. RIR_HL is illustrated in the frequency domain (RTF) in the upper level of D in
BRIR_UL that is a BRIR for the left ear of the user U1 and BRIR UR that is a BRIR for the right ear of the user U1 are respectively obtained by convolving RIR_HL with HRIR_UL and convolving RIR_HR with HRIR_UR, as indicated by the following formulas (5) and (6). HRIR_UL is an HRIR for the left ear of the user U1, and HRIR_UR is an HRIR for the right ear of the user U1.
As above, the information processing apparatus 3 cancels the ITD and the ILD of the HATS from the BRIR for the no-ears HATS 21.
Described with reference to
There is a method in which the user U1 uses a BRIR measured in a stage 2, to obtain a BRIR for reproducing the acoustics of a stage 1.
For example, this method is a method in which, as illustrated on the right side of
In a case where the distance or the disposed position of a speaker with respect to the measurement position for the BRIR for the has-ears HATS 42 at the stage 1 is treated as being completely equal to the distance or disposed position of a speaker with respect to the measurement position of the has-ears HATS 42 and the user U1 at the stage 2, it is expected that a BRIR having a comparatively high accuracy will be generated. However, in a case where the disposed positions of the speakers differ even slightly, a BRIR having a low accuracy may be generated, as indicated by a hollow arrow #22.
As illustrated in
Accordingly, data regarding the difference between the BRIR for the has-ears HATS 42 and the BRIR for the user U1 measured at the stage 2 does not match data regarding the difference between the BRIR for the has-ears HATS 42 and the BRIR for the user U1 measured at the stage 1. Therefore, it is difficult to accurately reproduce a BRIR that the user U1 has measured at the stage 1, with the conventional BRIR generation method described above.
Note that, in a case where there is completely the same manner of change, in conjunction with change of the disposed position of a speaker, for the HRIR for the user U1 and the has-ears HATS 42, it is possible to accurately reproduce a BRIR that is measured by the user U1 at the stage 1, with the conventional BRIR generation method described above.
In the acoustic processing system of the present technique, an HRIR measurement is performed in such a manner that a direction of the speaker 11 with respect to a BRIR measurement position in an RIR measurement environment is the same as a direction of the speaker 31 with respect to an HRIR measurement position in an HRIR measurement environment.
In addition, when an HRIR and an RIR are combined in the acoustic processing system, a gain for the HRIR for the user U1 is adjusted according to the distance from the measurement position for the BRIR for the no-ears HATS 21 to the speaker 11 in the RIR measurement environment. By adjusting the gain, it is possible to cause the position (direction and distance) of the speaker 11 with reference to the measurement position for the BRIR for the no-ears HATS 21 to virtually match the position (direction and distance) of the speaker 31 with reference to the measurement position for the HRIR for the user U1.
In such a manner, the acoustic processing system combines an RIR with an HRIR that corresponds to the positional relation between the speaker 11 and the measurement position for the BRIR for the no-ears HATS 21 in the RIR measurement environment, making it possible to accurately reproduce a BRIR that the user U1 has measured at the stage 1. Accordingly, if a user visits an HRIR measurement environment only once to thereby measure HRIR, the acoustic processing system can obtain, on the basis of an HRIR for the user, a BRIR for reproducing the acoustics of a location that differs to the HRIR measurement environment.
Thus far, it has been necessary for a user to visit each time, inter alia, a movie theater necessary for the production of movie audio and then measure BRIR, but it is sufficient if the user visits the HRIR measurement environment only once to perform a measurement, and thus, is becomes possible to remarkably reduce a load on the user.
As illustrated in
The input unit 101 is provided with a speaker setting obtainment unit 111, a tracking information obtainment unit 112, and a measurement data obtainment unit 113.
The speaker setting obtainment unit 111 obtains, from a setting file that indicates measurement conditions for an RIR or an HRIR, an audio signal used in the measurement of the BRIR for the no-ears HATS 21, and supplies the audio signal to a speaker control unit 121 in the control unit 102.
The tracking information obtainment unit 112 obtains, from the setting file, information indicating a direction of the no-ears HATS 21 when an BRIR is measured, and supplies the information to a HATS control unit 122 in the control unit 102.
The measurement data obtainment unit 113 obtains a BRIR that is measured using the no-ears HATS 21, and causes the storage unit 103 to store the BRIR.
The control unit 102 is provided with the speaker control unit 121 and the HATS control unit 122.
The speaker control unit 121 causes the speaker 11 to output a reproduced sound that corresponds to the audio signal supplied from the speaker setting obtainment unit 111.
The HATS control unit 122 controls, according to the information supplied from the tracking information obtainment unit 112, a HATS control mechanism 131 for controlling a direction of the no-ears HATS 21.
As illustrated by arrows in
Even in the HRIR measurement environment, an HRIR is measured in a state where the user U1 faces multiple directions. An RIR based on the BRIR for the no-ears HATS 21 corresponding to each direction is combined with an HRIR that is for the user and corresponds to a respective direction, whereby it becomes possible to generate a BRIR available when the user faces each direction in the RIR measurement environment. Reproduction is performed using this BRIR, whereby it is possible to reproduce the acoustics of the RIR measurement environment while supporting head tracking.
Note that the HRIR available when the user U1 faces multiple directions may be obtained using an acoustic simulation.
As illustrated in
The input unit 151 is provided with a speaker setting obtainment unit 161 and a measurement data obtainment unit 162.
The speaker setting obtainment unit 161 obtains, from a setting file, an audio signal that is used to measure the HRIR for the user U1 and the HRIR for the no-ears HATS 21 and information that indicates the positional relation between the speaker 11 and the measurement position for the BRIR for the no-ears HATS 21 in the RIR measurement environment, and supplies the audio signal and the information to a speaker control unit 171 in the control unit 152.
The positional relation between the speaker 11 and the measurement position for the BRIR for the no-ears HATS 21 in the RIR measurement environment includes, for example, the direction of the speaker 11 with respect to the measurement position for the BRIR for the no-ears HATS 21 and the distance from the measurement position to the speaker 11.
The positional relation between the measurement position and the speaker 11 is calculated from a three-view drawing or a CAD (Computer Aided Design) drawing of the RIR measurement environment, or obtained according to sensing using a 3D scanner, a laser distance meter, an angle measuring device, or the like. For example, the position of the speaker 11 with reference to the measurement position for the BRIR for the no-ears HATS 21 is obtained on the basis of an image captured by a point cloud scanner. In addition, a device that combines a laser distance meter and a rotary table can simultaneously measure the distance to and direction of the speaker 11 with reference to the measurement position for the BRIR for the no-ears HATS 21.
The measurement data obtainment unit 162 obtains the HRIR that is for the no-ears HATS 21 and is measured using the no-ears HATS 21, as well as an HRIR that is for a user and is measured using the microphones 181, and causes the storage unit 153 to store these HRIRs. The microphones 181 are worn on both ears of the user.
The control unit 152 is provided with the speaker control unit 171. The speaker control unit 171 causes the speaker 31 to output a reproduced sound that corresponds to the audio signal supplied from the speaker setting obtainment unit 161. In addition, the speaker control unit 171, according to information supplied from the speaker setting obtainment unit 161, controls a speaker control mechanism 182 for controlling the position of the speaker 31.
For example, the speaker control mechanism 182 includes a movable multi-speaker rig 191 in which multiple speakers 31 are provided on a spherical surface that surrounds a user or the no-ears HATS 21, as illustrated in A in
Alternatively, for example, the speaker control mechanism 182 includes a movable speaker apparatus 192 as illustrated in B in
The speaker control unit 171 controls the movable multi-speaker rig 191 or the movable speaker apparatus 192 such that the speaker 31 is disposed in a direction that is the same as the direction of the speaker 11 with respect to the measurement position for the BRIR for the no-ears HATS 21, with reference to a measurement position for the HRIR for the user U1 or the no-ears HATS 21.
Note that, as illustrated in
When a reproduced sound is outputted from each speaker 31 and an HRIR is measured, HRIRs with respect to the speakers 31 which are disposed omnidirectionally and centered on the HRIR measurement position are measured, as illustrated in
In this case, the information processing apparatus 3 selects, from among these multiple HRIRs, an HRIR with respect to the speaker 31 that is disposed at coordinates closest to coordinate information of the speaker 11 which is set with reference to the measurement position for the BRIR for the no-ears HATS 21, and uses the selected HRIR to generate a BRIR.
As illustrated in
The input unit 201 is provided with a BRIR obtainment unit 211 and an HRIR obtainment unit 212.
The BRIR obtainment unit 211 obtains the BRIR for the no-ears HATS 21 that is stored in the storage unit 103 in the RIR measurement apparatus 1, for example, and supplies the BRIR to an RIR extraction unit 221 in the data computation unit 202.
The HRIR obtainment unit 212 obtains an HRIR for the no-ears HATS 21 and an HRIR for a user, which are stored in the storage unit 153 in the HRIR measurement apparatus 2, for example. Here, an HRIR corresponding to the direction of the speaker 11 with respect to the measurement position for the BRIR for the no-ears HATS 21 is obtained. The HRIR obtainment unit 212 supplies the HRIR for the no-ears HATS 21 to the RIR extraction unit 221, and supplies the HRIR for the user to a combining unit 222 in the data computation unit 202.
The data computation unit 202 is provided with the RIR extraction unit 221 and the combining unit 222.
From the BRIR for the no-ears HATS 21 supplied from the BRIR obtainment unit 211, the RIR extraction unit 221 cancels the ITD and the ILD which are included in the HRIR for the no-ears HATS 21 supplied from the HRIR obtainment unit 212, to thereby extract an RIR for the RIR measurement environment. The RIR extraction unit 221 supplies the RIR for the RIR measurement environment to the combining unit 222.
The combining unit 222 adjusts, according to the distance from the measurement position for the BRIR for the no-ears HATS 21 in the RIR measurement environment to the speaker 11, the gain of the HRIR that is for the user and is supplied from the HRIR obtainment unit 212. Specifically, the combining unit 222 causes the gain of the HRIR for the user to attenuate according to a distance difference between the distance from the measurement position for the BRIR for the no-ears HATS 21 to the speaker 11 and the distance from the measurement position for the HRIR for the user to the speaker 31.
The combining unit 222 combines the RIR which is for the RIR measurement environment and is supplied from the RIR extraction unit 221 with the HRIR which is for the user and for which the gain has been adjusted, to thereby generate a BRIR for the user in the RIR measurement environment.
Next, with reference to the flow chart in
In step S1, the BRIR obtainment unit 211 obtains a BRIR for the no-ears HATS 21.
In step S2, the HRIR obtainment unit 212 obtains an HRIR for the no-ears HATS 21.
In step S3, the RIR extraction unit 221 cancels the ITD and the ILD which are included in the HRIR for the no-ears HATS 21, from the BRIR for the no-ears HATS 21 to thereby extract an RIR for the RIR measurement environment.
In step S4, the HRIR obtainment unit 212 obtains an HRIR for a user. The combining unit 222 adjusts, according to the distance from the measurement position for the BRIR for the no-ears HATS 21 in the RIR measurement environment to the speaker 11, the gain of the HRIR that is for the user.
In step S5, the combining unit 222 combines the RIR which is for the RIR measurement environment with the HRIR which is for the user and for which the gain has been adjusted, and generates a BRIR for the user in the RIR measurement environment.
By virtue of the above process, the information processing apparatus 3 can accurately reproduce a BRIR which is for reproducing acoustics of an RIR measurement environment, on the basis of an RIR and an HRIR that are obtained in a state where the position of the speaker 11 with reference to a measurement position for BRIR in the RIR measurement environment matches the position of the speaker 31 with reference to a measurement position for HRIR in an HRIR measurement environment.
The HRIR for a user may be, instead of actually measured data, data that is obtained by an estimation using an image in which the pinnas of the user appear, or data that is calculated according to an acoustic simulation using pinna sections modeled on the basis of a result of scanning the user's head. In addition, the HRIR for a user may be data measured using a HATS that is provided with pinna sections modeled on the basis of a result of scanning the user's head.
An HRIR that is not personalized to the user may be combined with an RIR for the RIR measurement environment. This HRIR is selected according to a recommendation that uses a database having many actual measurements, for example. The recommendation database may be, instead of actually measured data, data that is obtained using any one of an estimation using an image in which pinnas appear, an acoustic simulation, and an estimation or acoustic simulation that uses pinna sections that are randomly modeled.
Example of Reproducing BRIR for has-Ears BRIR 42 in RIR Measurement Environment
As illustrated in
A in
A in
Note that the transfer characteristic for the has-ears HATS 42 in the studio RM11 includes a characteristic of the speaker 11, the sound quality of the studio RM11, and the ITD and the ILD of the HATS. The transfer characteristic for the has-ears HATS 42 in the anechoic room RM12 includes the ITD and the ILD of the HATS as well as the ITD and the ILD of the HATS. Upon combining the transfer characteristics for the has-ears HATS 42 in the studio RM11 and the anechoic room RM12, the ITD and the ILD of the HATS overlap, and the ITD and the ILD of the HATS included in either transfer characteristic are canceled.
A in
It is also possible to measure an HRIR for a user in a 9.1.6 ch speaker system, for example, and, on the basis of the measured HRIR for the user and an image in which pinnas of the user appear, estimate an HRIR corresponding to a speaker that is virtually disposed in a direction different to a direction of a speaker included in the speaker system, with reference to an HRIR measurement position.
A series of processing described above can be executed by hardware, and can be executed by software. In a case where the series of processing is executed by software, a program which makes up this software is installed from a program recording medium onto a computer incorporated in dedicated hardware, a general-purpose personal computer, etc.
A CPU (Central Processing Unit) 501, a ROM (Read Only Memory) 502, and a RAM (Random Access Memory) 503 are mutually connected by a bus 504.
An input/output interface 505 is further connected to the bus 504. An input unit 506 including a keyboard, a mouse, or the like and an output unit 507 including a display, a speaker, or the like are connected to the input/output interface 505. In addition, a storage unit 508 including a hard disk, a non-volatile memory, or the like, a communication unit 509 including a network interface or the like, and a drive 510 for driving a removable medium 511 are connected to the input/output interface 505.
In a computer configured as above, the series of processes described above is performed by the CPU 501, for example, loading a program stored in the storage unit 508 into the RAM 503 via the input/output interface 505 and the bus 504 and executing the program.
For example, the program executed by the CPU 501 is provided by being recorded to the removable medium 511 or is provided via a wired or wireless transmission medium such as a local area network, the internet, or digital broadcasting, and is installed into the storage unit 508.
A program executed by a computer may be a program for which processes are performed in chronological order following the order described in the present specification, or may be a program for which processes are performed in parallel or at necessary timings such as when a call is performed.
Note that, in the present specification, a system means an aggregate of multiple components (apparatuses, modules (components), etc.), and it does not matter whether or not all components are within the same case. Accordingly, multiple apparatuses which are accommodated in separate cases and are connected to each other via a network and one apparatus in which multiple modules are accommodated in one case are both systems.
Note that effects described in the present specification are merely exemplary and are not limitative, and there may also be other effects.
An embodiment of the present technique is not limited to the embodiment described above, and various modifications are possible within a scope that does not deviate from the substance of the present technique.
For example, the present technique can have a cloud computing configuration in which one function is apportioned among multiple apparatuses via a network to jointly perform processes.
In addition, each step described above in the flow charts described above, in addition to being executed by one apparatus, can be executed by being apportioned among multiple apparatuses.
Further, in a case where multiple processes are included in one step, the multiple processes included in the one step, in addition to being executed by one apparatus, can be executed by being apportioned among multiple apparatuses.
The present technique can have such configurations as described below.
(1)
An information processing apparatus including: a generation unit that generates a BRIR for a first measurement position on the basis of an RIR, the RIR being obtained on the basis of a sound outputted from a first sound source disposed in a first measurement environment, and an HRIR that corresponds to a positional relation between the first sound source and a first measurement position for the RIR within the first measurement environment.
(2)
The information processing apparatus according to (1) above, in which
The information processing apparatus according to (2) above, in which
The information processing apparatus according to (2) or (3) above, in which
The information processing apparatus according to any one of (1) to (4) above, further including:
The information processing apparatus according to (5) above, in which
The information processing apparatus according to any one of (1) to (6) above, in which
The information processing apparatus according to (7) above, in which
The information processing apparatus according to (7) above, in which
The information processing apparatus according to any one of (1) to (6) above, in which
The information processing apparatus according to any one of (1) to (10) above, in which
The information processing apparatus according to any one of (1) to (11) above, in which
An information processing method including:
| Number | Date | Country | Kind |
|---|---|---|---|
| 2022-037170 | Mar 2022 | JP | national |
| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/JP2023/006370 | 2/22/2023 | WO |