Headtracked processing for headtracked playback of audio signals

Information

  • Patent Grant
  • 6766028
  • Patent Number
    6,766,028
  • Date Filed
    Tuesday, January 16, 2001
    24 years ago
  • Date Issued
    Tuesday, July 20, 2004
    20 years ago
Abstract
A method of simulating a spatial sound environment to a listener over headphones is disclosed comprising inputting a series of sound signals having spatial components; determining a current orientation of the headphones around the listener; determining a mapping function from a series of spatially static virtual speakers placed around the listener to each ear of the listener; utilising the current orientation to determine a current panning of the sound signals to the series of virtual speakers so as to produce a panned sound input signal for each of the virtual speakers; utilising the mapping function to map the panned sound input signal to each ear of the listener, and combining the mapped panned sound input signals to produce a left and right output signal for the headphones.
Description




FIELD OF THE INVENTION




The present invention relates to the creation of spatialized sounds utilizing a headtracked set of headphones.




BACKGROUND OF THE INVENTION




Methods for localizing sounds utilizing headphones and a headtracking unit are known. For example, in U.S. patent Ser. No. 08/723,614 entitled “Methods and Apparatus for Processing Spatialized Audio”, there is disclosed a system for virtual localization of a sound field around a listener utilizing a pair of headphones and a headtracking unit which determines the orientation of the headphones relative to an external environment. Unfortunately, the disclosed arrangement requires a high computational power or resource for real time rotation of a sound field environment so as to take into account any headphone movement relative to the desired sound field output.




Alternatively, without headtracking, a virtual speaker system over headphones can be simulated by using a pair of filters for each virtual sound source and then a post mixing of the results to produce left and right signals. For example, turning initially to

FIG. 1

, if it is desired to simulate to a user


1


over headphones eg.


2


,


3


a virtual sound environment, with, for example, the environment comprising the popular Dolby DIGITAL (Trade Mark) environment which includes a left,


5


, and right,


6


sound source in addition to a center cell source


7


and back left and right sound sources


8


and


9


, then one form of suitable arrangement may be as illustrated


10


in FIG.


2


. The arrangement


10


includes, for each channel eg.


11


providing a head related transfer function filter eg.


12


,


13


for each input channel which maps the sound source to each of the left and right ears so as to form left and right headphone channels


16


,


17


. Similarly, each of the other channels is similarly processed and the output summed to each head channel. The arrangement


10


in

FIG. 2

is provided for a system that does not utilize headtracking. The arrangement of

FIG. 2

requires significant length filters eg.


12


,


13


for each channel. Of course, many filter optimisations are possible in respect of the non treadtracked arrangement. An example of these optimisations include those disclosed in PCT Patent Application No. PCT AU99/00002 filed 6 Jan., 1999 by the present applicant entitled “Audio Signal Processing Method and Apparatus”.




One possible method utilized by others to perform headtracking is to use an enormous amount of computational memory for storing a large number of sets of filter coefficients. For example, a set of filter coefficients could be stored for every angle around a listener (for full 360 coverage), then, each time the listener rotated their head the filter coefficients could be updated to reflect the new angle. A cross fade to the new filter coefficients would remove any unwanted artefacts. This technique has the significant disadvantage that it requires an enormous amount of memory to store the large number of filtered coefficients.




An alternative technique is disclosed in U.S. Pat. No. 5,659,619 by Abel which utilizes a process of principle component analysis where the head related transfer function is assumed to consist of several individual filter structures which are all modified from a look-up table according to a current head angle. This method provides for a reduction in memory requirements.




However, it is only practical for short filters (short HRTF length) which provide for directionality of a sound source and it is not practical for a full room reverberant response in addition to the effective simulation of a full room.




It would be desirable to provide for a more efficient form of simulation of a sound surround environment over headtracked headphones in addition to the effective simulation of a full room reverberant response.




SUMMARY OF THE INVENTION




It is an object of the present invention to provide for a more efficient form of simulation of a surround sound environment over headtracked headphones.




In accordance with a first aspect of the present invention, there is provided a method of simulating a spatial sound environment to a listener over headphones comprising inputting a series of sound signals having spatial components; determining a current orientation of the headphones around the listener; determining a mapping function from a series of spatially static virtual speakers placed around the listener to each ear of the listener; utilising the current orientation to determine a current panning of the sound signals to the series of virtual speakers so as to produce a panned sound input signal for each of the virtual speakers; utilising the mapping function to map the panned sound input signal to each ear of the listener; and combining the mapped panned sound input signals to produce a left and right output signal for the headphones.




Preferably, the virtual speakers include a set of simulated speakers placed at substantially equal angles around the listener which can be placed substantially in a horizontal plane around a listener or placed so as to fully surround a listener in three dimensions. The present invention has particular application wherein the series of sound signals comprise a Dolby DIGITAL encoding of a sound environment.




In accordance with a second aspect of the present invention, there is provided an apparatus for simulating a spatial sound environment to a listener over headphones comprising input means for inputting a series of signals comprising a spatial sound environment; panning means for panning the series of signals amongst a predetermined number of virtual output signals to produce a plurality of virtual output speakers signals; head related transfer function mapping means for mapping the virtual output speaker signals to left and right headphone channel signals; and combining means for combining each of the left and right headphone channel signals into combined left and right headphone signals for playback over the headphones.




Preferably, the panning means, the head related transfer function mapping means and the combining means are implemented in the form of a suitably programmed digital signal processor.











BRIEF DESCRIPTION OF THE DRAWINGS




Notwithstanding any other forms which may fall within the scope of the present invention, preferred forms of the invention will now be described, by way of example only, with reference to the accompanying drawings in which:





FIG. 1

illustrates the concept of a surround sound system;





FIG. 2

illustrates a prior art arrangement for creating a surround sound environment over headphones;





FIG. 3

illustrates the utilization of a virtual speaker system in accordance with the preferred embodiment;





FIG. 4

is a schematic block diagram of the structure of the preferred embodiment;





FIGS. 5 and 6

illustrate the extension of the preferred embodiment to three dimensions; and





FIG. 7

illustrates one form of implementation of the preferred embodiment.











DESCRIPTION OF PREFERRED AND OTHER EMBODIMENTS




In the preferred embodiment, a fixed filter and coefficient structure is utilized to simulate a stationary virtual speaker array and then a speaker panner is utilized to position the virtual sound sources at desired positions. The preferred embodiment will be discussed with reference to a Surround Sound implementation of the popular Dolby DIGITAL format.




Turning to

FIG. 3

, there is illustrated a method of the preferred embodiment. The method of the preferred. embodiment comprises utilizing a set of virtual speakers


21


-


26


arranged around a listener


27


. A head related transfer function to each ear of the listener


27


is calculated for each of the virtual speakers


21


-


26


arranged around a listener


27


. The techniques utilized can be substantially the same as those described previously with reference to FIG.


2


and known in the prior art.




A series of virtual surround sound speakers


31


-


35


are then utilized having a stable external reference frame relative to the user


27


. Hence, as the user


27


turns their head, the virtual speaker


32


for example is panned between speakers


21


-


22


so as to locate the speaker


32


at the requisite point between speakers


21


and


22


. Similar panning occurs for each of the other virtual surround sound speakers


32


-


35


. Hence, each of the surround sound channel sources eg.


32


is panned between speakers so as to provide for the directionality of each sound source. The directionality of each sound source can be updated depending on the rotation of a listener's head and the speaker panning technique can be totally flexible and compatible with prior art panning techniques for conventional loudspeakers.




Turning now to

FIG. 4

, there is illustrated one form of arrangement of the preferred embodiment


40


. The preferred embodiment is based around two parts including a speaker panning section


41


and HRTF section


42


. The HRTF section


42


includes the usual series of filters eg. 43, 44 which map each of the virtual speakers


21


-


26


to the left and right ear of the listener


27


. The filter coefficients being substantially static.




The input channels for each of the surround sound sources


31


-


35


are input to an N input to M output speaker panner


46


. The speaker panner


46


also having as an input


47


the headtracking input signal from a listener's headphone. The speaker panner


46


can then be set to provide panning between the virtual output speakers


21


-


26


which are output eg.


49


.




The technique of the preferred embodiment can be extended to provide for headtracking of elevation and roll of a user's head position where such information is available from the headtracking unit. This can be achieved by extending the location of the stationary virtual speakers to be in a three-dimensional cube around a listener. For example, if eight virtual speakers are simulated representing the eight corners of a cube around a listener, then any panning system can also compensate for head movements around a Y and Z plane. Hence, in addition to yaw, elevation and roll can also be taken into account. Of course, the more virtual speakers utilized to create the virtual speaker space around a listener, the better the accuracy of the system. Once again, panning can be provided by means of a front end system that utilizes the headtracked yaw, elevation and roll position to determine the panning effect between speakers. For example, as illustrated in

FIG. 5

, the elevation of a listener


55


can be determined via a standard headtracking unit and utilized to pan three-dimensional sound sources


56


-


59


around speakers


50


-


53


in accordance with the requirements. Similarly, as illustrated in

FIG. 6

, the roll of a user's head


55


can be utilized for panning the virtual sound sources


66


-


69


between virtual speakers


61


-


64


again as a pre-processing step.




Turning now to

FIG. 7

, there is illustrated an example system


70


for implementation of the preferred embodiment. The system


70


includes a standard DVD digital input source


71


which is fed to an DIGITAL decoder


72


which again can be standard. The DIGITAL decoder outputs center channel


73


, front left and right channels


74


, and surround or back left and right channels


75


. The outputs


73


-


75


are fed to a DSP processing board


76


which operates with an attached memory


77


. One form of suitable DSP processing board can be the Motorola 56002 EVM evaluation board card designed to be inserted into a PC type computer and directly programmed therefrom and having suitable Analogue/Digital and Digital/Analogue converters.




A set of headphones


79


are provided which include headtracking capabilities in the form of an angular position circuit


80


. The angular position circuit


80


determines the yaw, elevation and roll and can comprise a Polhemus 3 space Insidetrak Tracking system available from Polhemus, 1 Hercules Drive, PO Box 560, Colchester, Vt. 05446, USA. The output from the angular position circuit


80


is converted to a digital form


81


for inputting to DSP chip


76


. The DSP chip


76


is responsible for implementing the core functionality of

FIG. 4

, outputting two digital channels to digital to analogue converter


82


which in turn outputs analogue left and the right headphone speaker channel data which can be amplified


83


,


84


in accordance with the requirements. The DSP chip


76


also implements the speaker panner mixing which pans the input sources


73


-


75


according to the input angular position. Further, a filter array is provided within the DSP


76


which simulates the virtual speaker array of six speakers in accordance with the previously known prior art techniques.




It would be therefore evident that the preferred embodiment provides for a simplified form of providing for full surround sound capabilities of the headtracked headphones in the presence of movement of the listener's head.




It would be appreciated by a person skilled in the art that numerous variations and/or modifications may be made to the present invention as shown in the specific embodiment without departing from the spirit or scope of the invention as broadly described. The present embodiment is, therefore, to be considered in all respects to be illustrative and not restrictive.



Claims
  • 1. A method of simulating a spatial sound environment to a listener over headphones comprising:inputting a series of sound signals having spatial components; determining a current orientation of said headphones around said listener; determining a mapping function from a series of spatially static virtual speakers placed around the listener to each ear of the listener; utilising said current orientation to determine a current panning of said sound signals to said series of virtual speakers so as to produce a panned sound input signal for each of said virtual speakers; utilising said mapping function to map said panned sound input signal to each ear of said listener; and combining said mapped panned sound input signals to produce a left and right output signal for said headphones.
  • 2. A method as claimed in claim 1 wherein said virtual speakers include a set of simulated speakers placed at substantially equal angles around said listener.
  • 3. A method as claimed in claim 1 wherein said virtual speakers are substantially in a horizontal plane around a listener.
  • 4. A method as claimed in claim 1 wherein said virtual speakers are placed so as to fully surround a listener in three dimensions.
  • 5. A method as claimed in claim 1 wherein said series of sound signals comprise a Dolby DIGITAL encoding of a sound environment.
  • 6. An apparatus for simulating a spatial sound environment to a listener over headphones comprising:input means for inputting a series of signals comprising a spatial sound environment for listening in a first reference frame; panning means for panning said series of signals amongst a predetermined number of virtual output signals to produce a plurality of panned virtual output speakers signals in a second reference frame that is fixed relative to the orientation of said headphones, said panning means accepting a signal indicative of the orientation of said headphones to said first reference fame; head related transfer function mapping means for mapping said panned virtual output speaker signals to left and right headphone channel signals; and combining means for combining each of said left and right headphone channel signals into combined left and right headphone signals for playback over said headphones, such that the head related transfer function mapping means and the means for combining need not vary for different orientations of said headphones to said first reference frame.
  • 7. An apparatus as claimed in claim 6 wherein said panning means, said head related transfer function mapping means and said combining means are implemented in the form of a suitably programmed digital signal processor.
  • 8. An apparatus for simulating a spatial sound environment to a listener over headphones comprising:an input device adapted to input a series of signals comprising a spatial sound environment for listening in a first reference frame; a panning module adapted to pan said series of signals amongst a predetermined number of virtual output signals to produce a plurality of panned virtual output speakers signals in a second reference frame that is fixed relative to the orientation of said headphones, said panning module accenting a signal indicative of the orientation of said headphones to said first reference frame; a head related transfer output mapping module adapted to map said panned virtual output speaker signals to left and right headphone channel signals; and a combining module adapted to combine each of said left and right headphone channel signals into combined left and right headphone signals for playback over said headphones, such that the head related transfer function mapping module and the combining module need not vary for different orientations of said headphones to said first reference fame.
  • 9. An apparatus as claimed in claim 8, wherein said panning module, said head related transfer function mapping module and said combining module are implemented in the form of a suitable programmed digital signal processor.
  • 10. An apparatus as claimed in claim 8, wherein said virtual output speaker signals correspond to virtual speakers which include a set of simulated speakers placed at substantially equal angles around said listener.
  • 11. An apparatus claimed in claim 10, wherein said virtual speakers are substantially in a horizontal plane around a listener.
  • 12. An apparatus as claimed in claim 10, wherein said virtual speakers are placed so as to fully surround a listener in three dimensions.
  • 13. A method as claimed in claim 8, wherein said series of signals comprise a Dolby DIGITAL encoding of a sound environment.
Priority Claims (1)
Number Date Country Kind
PP 2715 Mar 1998 AU
PCT Information
Filing Document Filing Date Country Kind
PCT/AU99/00242 WO 00
Publishing Document Publishing Date Country Kind
WO99/51063 10/7/1999 WO A
US Referenced Citations (2)
Number Name Date Kind
5809149 Cashion et al. Sep 1998 A
5822438 Sekine et al. Oct 1998 A
Foreign Referenced Citations (6)
Number Date Country
0 827 361 Mar 1998 EP
0932324 Jul 1999 EP
2340705 Feb 2000 GB
2339127 Dec 2000 GB
09093700 Apr 1997 JP
PCTUS9504839 May 1995 WO
Non-Patent Literature Citations (2)
Entry
Nick Flaherty, 3D Audio: New Directions in Rendering Realistic Sound, Electronic Engineering, Apr., 1998, pp. 49-52.
Floyd E. Toole. Two-Speaker Techniques for Three-Dimensional Sound, Audio, Jun.,1997, pp. 34-39.