1. Field of the Invention
The present invention relates generally to audio sounds and, more specifically, to a method and apparatus for customizing HRTFs (Head Related Transfer Functions) of listeners to provide more convincing and pleasurable 3D (three dimensional) audio works.
2. Description of Related Art
Some newly emerging consumer audio devices provide the option for 3D sound, allowing a more realistic experience when listening to sound. In some applications, 3D sound allows a listener to perceive motion of an object from the sound played back on a 3D audio system.
Extensive research has established that human localize sound source location by using three major acoustic cues, the interaural time difference (ITD), interaural intensity difference (IID), and head-related transfer functions (HRTFs). These cues, in turn, are used in generating 3D sound in 3D audio systems. Among these three cues, ITD and IID occur when sound, from a source in space, arrive at both ears of a listener. When the source is at an arbitrary location in space, the sound wave arrives at both ears with different time delays due the unequal path length of wave propagation. This creates the ITD. Also, due to the head shadowing effects, the intensity of the sound waves arriving at both ears can be unequal. This creates the IID.
In order to synthesize a positioned 3D audio source, a particular set of ITD, IID and a pair of HRTFs for two ears are directly measured to construct an HRTF database for listeners. In order to simulate the motion of the sound source, in addition to the varying ITD and IID, many HRTF pairs have to be used to obtain a continuous moving sound image. In the prior arts, hundreds or thousands of measured HRTFs are used to fulfill this purpose. This increases storage space of an apparatus for accommodating the HRTF database. Furthermore, when the HRTF database receives an input signal, it needs to execute complicated comparisons and calculations to make sure a listener heard a 3D audio work, which increases execution time of the apparatus. There is thus a need for a more accurate HRTF model which provides a suitable HRTF for source locations in a continuous auditory space, without annoying mass calculation and storage space.
The present invention relates, in one aspect, to a method for providing a 3D audio work. The method includes providing a one-ear HRTF filter storing a one-ear HRTF therein and a related function synthesizer storing a related function therein, and inputting sound signals into the one-ear HRTF filter. The sound signals are converted into one-ear output sound signals by the one-ear HRTF filter. The one-ear output signals are sent out and synthesized to output sound signals for the other ear by the related function synthesizer. A method for providing the related function includes inputting sound signals into HRTF filters of two opposite ears and obtaining output sound signals from the HRTF filters which are respectively acting as raw signals and target signals. The raw signals enter into and are synthesized by a synthesizer to output sound signals. The output sound signals from the synthesizer compare with the target signals in a comparator and a related function registered in the synthesizer is accordingly regulated so as to obtain the related function which satisfies a minimum difference between the output sound signals from the synthesizer and the target signals.
The present invention relates, in another aspect, to an apparatus for providing a 3D audio work. The apparatus includes a one-ear HRTF filter and a related function synthesizer. The one-ear HRTF filter stores a one-ear HRTF therein. The one-ear HRTF filter converts input sound signals into output signals for the one-ear. The related function synthesizer stores a related function reflecting signal correlation between two opposite ears. The related function synthesizer receives and synthesizes the one-ear output signals to output signals for the other ear.
Other advantages and novel features of the present invention will become more apparent from the following detailed description of preferred embodiments when taken in conjunction with the accompanying drawings, in which:
Reference will now be made to the drawing figures to describe the preferred embodiment in detail.
The present invention provides a method and apparatus for providing a 3D (three-dimensional) audio work. The apparatus uses an HRTF filter to convert input sound signals into output sound signals which are received by one ear. The one-ear output sound signals are synthesize by a related function synthesizer to output sound signals, which are received by the other ear at the time the one-ear output signals are received by the one ear. Accordingly, the input sound signals are converted to two-ear output sound signals which are simultaneously received by the two ears and a 3D audio work for listeners is provided. During the synthesis of the one-ear output sound signals, the related function is stimulated by the one-ear output signals to obtain a related value corresponding to the one-ear output signals, and the related value adds to the one-ear output signals to obtain the output signals for the other ear.
Referring to
The present method and apparatus establish a one-ear HRTF database in the HRTF filter and a related function reflecting signals correlation between two ears in the related function synthesizer. Therefore, the present apparatus does not need measure too much data as the conventional apparatus which directly measures HRTFs of two ears. This reduces database storage space and amount of calculations required by the present apparatus. Details hereinafter show measurements and calculations for the HRTF database and the related function in accordance with the present invention.
The one-ear HRTF database is recorded with a set of HRTF coefficients (impulse responses) for the one-ear. The one-ear HRTF database is obtained by presenting a stimulus through a loudspeaker (the sound source) positioned at many locations in a 3D space, and at the same time collecting the impulse responses from a microphone embedded in one ear of a mannequin head or a real human subject. To simulate a moving sound, a set of continuous impulse responses that vary with respect to the sound source location are needed. The set of continuous impulse responses construct the one-ear HRTF database.
Referring to
In the above-mentioned method and apparatus for obtaining the related function, the synthesizer and the comparator cooperatively construct a wiener filter. The output signals being dealt with the wiener filter are FIRs (finite impulse responses), which are steadier and have more efficiency than IIFs (infinite impulse responses) when they have the same tab. The related function obtained from this method and apparatus not only includes characteristics of ITD (interaural time difference) and IID (interaural intensity difference), but also includes parameters of head, torso and shoulder effect, external ear effect, and 3D ambience. In calculation of the related function, there is no need to divide the ipsilateral or contralateral ear HRTF by the contralateral or ipsilateral ear HRTF. Therefore, the ipsilateral and contralateral ear HRTFs are not needed to be predigested and some parameters reflecting sound characteristics can not be abnegated during the predigestion. Thus, the related function obtained from the difference between the ipsilateral and the contralateral ear HRTFs is more accurately than the related function obtained from the predigestion between the ipsilateral and contralateral ear HRTFs. Accordingly, the output sound signals of the other ear (the ipsilateral or contralateral ear), which are synthesized from the one-ear (the contralateral or ipsilateral ear) sound signals by the related function synthesizer, more realistically reflect the sound heard by the other ear of the listeners. The output sound signals of the two ears are simultaneously heard by the two ears of the listeners, which provides a 3D audio work for the listeners.
The present method and apparatus obtains a 3D audio work via inputting sound signals into the one-ear HRTF filter and the related function synthesizer. The input sound signals are filtered to the one-ear output signals by the one-ear HRTF filter, and the one-ear output signals are synthesized to the two-ear output sound signals by the related function synthesizer via simple calculation of addition between the one-ear output signals and the related function. This reduces amount and complexity of calculation of the present apparatus. In addition, the one-ear HRTF filter reduces data storage space required by the present apparatus as compared to the conventional apparatus containing two-ear HRTF filters. Thus, the present apparatus has a lower cost and a better performance than the conventional apparatus which directly measures the two-ear HRTFs. Table 1 shows the amount of calculation and the storage space for the present apparatus and the conventional apparatus which directly measures the two-ear HRTFs. MAC represents multiply-add calculation of the present and conventional apparatuses.
Studying the data shown in table 1, the MAC of the present apparatus is lower than about 60% of that of the conventional apparatus. Thus, the present apparatus saves about 40% of the amount of calculation as compared to the conventional apparatus. This results in the data storage space of the present apparatus being saved.
It is to be understood, how ever, that even though numerous characteristics and advantages of the present invention have been set forth in the foregoing description, together with details of the structure and function of the invention, the disclosure is illustrative only, and changes may be made in detail, especially in matters of shape, size, and arrangement of parts within the principles of the invention to the full extent indicated by the broad general meaning of the terms in which the appended claims are expressed.
Number | Date | Country | Kind |
---|---|---|---|
200710075612.7 | Aug 2007 | CN | national |