This disclosure relates to the field of audio processing. In particular, the disclosure relates to a method of generating at least two audio channels from audio in an immersive audio format comprising a plurality of front and rear height audio channels and a plurality of front and rear non-height audio channels for playback the plurality of audio channels with an immersive loudspeaker system. The disclosure further relates to an apparatus comprising a processor configured to carry out the method, to a vehicle comprising the apparatus, to a program and a computer-readable storage medium.
Vehicles usually contain loudspeaker systems for audio playback. Loudspeaker systems in vehicles may be used to playback audio from, for example, tapes, CDs, audio streaming services or applications executed in an automotive entertainment system of the vehicle or remotely via a device connected to the vehicle. The device may be, e.g., a portable device connected to the vehicle wirelessly or with a cable. For example, most recently, streaming services such as Spotify and Tidal have been integrated into the automotive entertainment system, either directly in the vehicle's hardware (usually known as the “head unit”) or via a smart phone using Bluetooth or Apple CarPlay or Android Auto. The loudspeaker systems in vehicles may also be used to playback terrestrial and/or satellite radio. Conventional loudspeaker systems for vehicles are stereo loudspeakers systems. Stereo loudspeaker systems may include a total of four loudspeakers: a front pair of loudspeakers and a rear pair of loudspeakers, for the front and rear passengers, respectively. However, in more recent years, with the introduction of DVD players in vehicles, surround loudspeaker systems have been introduced in vehicles to support playback of DVD audio format.
Immersive audio is becoming mainstream in cinemas and homes listening environments. With immersive audio becoming mainstream in the cinema and the home, it is natural to assume that immersive audio will be played back also inside vehicles. Dolby Atmos Music is already available via various streaming services. Immersive audio is often differentiated from surround audio format by the inclusion of one or more overhead or height audio channels. Therefore, for playing back immersive audio, overhead or height loudspeakers are used.
While high end vehicles may contain multiple overhead or height loudspeakers, there are still vehicles which use a limited number of height stereo loudspeakers. In fact, height loudspeakers dramatically increase complexity of the loudspeaker system in the vehicles. The height loudspeaker needs to be placed in the roof of the vehicle which is usually not adapted for this purpose. For example, vehicles have usually a low roof which limits the available height for placement of height loudspeaker. Furthermore, vehicles are often sold with the option to mount a sunroof to uncover a window in the vehicle's roof, making a difficult industrial design challenge to integrate or place many height loudspeakers in the roof. Additional audio cables may also be required for such height loudspeakers. Placement of multiple overhead or height loudspeakers is also a concern when considering safety of the vehicle and the placement of, e.g., airbags, in the same or close location of overhead or height loudspeakers. For all these reasons, integration of multiple height loudspeakers in vehicles may be costly due to space and industrial design constraints. On the other hand, using only a limited number of height loudspeakers dramatically limit the spatial resolution of the sound in the height plane of the vehicle.
It would be advantageous to playback immersive audio content in an immersive loudspeaker system with at least one height loudspeaker. In the context of the present disclosure an “immersive loudspeaker system” is a loudspeaker/speaker system that comprises at least one height loudspeaker (e.g. located above a listener's head, i.e. in a so-called height plane) and a plurality of front and rear non-height loudspeakers (e.g. located below the listener's head, i.e. in a so-called base plane).
It would be advantageous to create a perception of sound height with a satisfactory spatial resolution by playing back immersive audio content into an immersive loudspeaker system including at least one height loudspeaker such that the user's audio experience is enhanced even without using more than two overhead loudspeakers.
An aspect of this disclosure provides a method of generating a plurality of audio channels from audio in an immersive format comprising a plurality of front and rear height audio channels and a plurality of front and rear non-height audio channels, for playing back the plurality audio channels with an immersive loudspeaker system comprising at least one height loudspeaker and a plurality of front and rear non-height loudspeakers. The method comprises applying a virtual height filter to a portion of each front and rear height audio channel of the plurality of front and rear height channels. The virtual height filter is configured for, when the at least one of the front and rear heigh audio channels is played back by a corresponding loudspeaker of the front and rear non-height loudspeakers, at least partially attenuating spectral components of the at least one height channel directly emanating from the loudspeaker from which the height channel is played back. The virtual height filter is also configured for at least partially amplifying spectral components of the height channel reflected from a roof or an area close to the roof inside the listening environment, to generate a plurality of front and rear virtual height filtered audio signals. The method further comprises adding each front and rear virtual height filtered audio signal of the plurality of front and rear virtual height filtered audio signals to the corresponding front and rear non-height audio channel of the plurality of front and rear non-height audio channels, to generate added audio signals for playback by the corresponding front or rear non-height loudspeaker. The method further comprises playing back the remaining portion of each front and rear height audio channel of the plurality of front and rear height channels with the at least one height loudspeaker.
In the context of the present disclosure the term “channel” means an audio signal plus optionally metadata in which the position is coded as a channel identifier, e.g., left-front or right-top surround; “channel-based audio” is audio formatted for playback through a pre-defined set of loudspeaker zones with associated nominal locations, e.g., 5.1, 7.1, and so on; the term “object” or “object-based audio” means one or more audio channels with a parametric source description, such as apparent source position (e.g., 3D coordinates), apparent source width, etc.
When the portion of height channel is played back from one of the non-height loudspeakers without filtering, sound may radiate along different paths. Some sound may radiate along a direct path from the non-height loudspeaker to a listening position (e.g., to a passenger's or driver's ears). Some other sound may radiate along a reflected path from the on-height loudspeaker to the listening position. For example, some sound may be reflected from the roof or area close to the roof inside the listening and therefore radiate from the roof or area close to the roof, to the listening position. The sound that radiates along the direct path is undesired when the height channel is played back by the non-height loudspeakers. By applying the virtual height filter to portions of height channel, the spectral components of the portions of the height channels reflected from the roof or the close to the roof are amplified while the spectral components of the portions of height channels directly emanated to the loudspeaker are attenuated. Configured as above the method compensates for the undesired direct sound and introduces perceptual height cues into the audio signal being fed to non-height loudspeakers, thereby improving the positioning and perceived quality of the virtual height signal. For example, a directional hearing model has been developed to create a virtual height filter, which when used to process audio being reproduced by the non-height loudspeakers, improves that perceived quality of the reproduction. In addition, by playing back the remaining portions of height audio channels directly with the at least one height loudspeaker a desired balance of the front and rear height audio channels among the non-height loudspeakers and the at least one height loudspeaker can be achieved for an improved spatial resolution in the height plane which provides an improved immersive listening experience. The improved spatial resolution in the height plane is achieved by just using a single height loudspeaker or only two height loudspeakers.
In an embodiment, the portion of each of the plurality of front and rear height channels may be a function of one or more of: a location of the at least one height loudspeaker along a length of the listening environment or a desired balance distribution of the plurality of audio channels among the loudspeakers of the immersive loudspeaker system.
In an embodiment, applying the virtual height filter to a portion of each of the plurality of front and rear height audio channels may comprise applying the virtual height filter to a portion of the front height audio channels which is proportional to a location of the at least one height loudspeaker moving away from the front and getting close to the rear of the listening environment.
In an embodiment, applying the virtual height filter to a portion of each of the plurality of front and rear height audio channels may comprise applying the virtual height filter to a portion of the rear height audio channels which is proportional to a location of the at least one height loudspeaker moving away from the rear and getting close to the front of the listening environment.
In an embodiment, playing back the remaining portion of each of the plurality of front and rear height audio channels with the at least one height loudspeaker may comprise playing back a remaining portion of each of the plurality of the front height audio channels which is proportional to a location of the at least one height loudspeaker moving away from the rear and getting close to the front of the listening environment.
In an embodiment, playing back the remaining portion of each of the plurality of front and rear height channels with the at least one height loudspeaker may comprise playing back a remaining portion of each of the plurality of the back channels which is proportional to a location of the at least one height loudspeaker moving away from the front and getting close to the rear of the listening environment.
In an embodiment, the audio in the immersive audio format may further comprise a front right and a front left non-height audio channels, a rear right and a rear left non-height audio channels, a top front right and a top front left height audio channels and a top rear right and a top rear left height audio channels. The front virtual height filtered audio signals may be added to the corresponding front right and front left non-height audio channels. The rear virtual height filtered audio signals may be added to the corresponding rear right and rear left non-height audio channels. These additions generate four added audio signals for playback by corresponding front right and front left, and rear right and rear left non-height loudspeakers.
In an embodiment, the immersive loudspeaker system may comprise a single top (overhead) loudspeaker.
In an embodiment, the remaining portions of the top front right height audio channel, top rear right height audio channel, top front left height audio channel and top rear left height audio channel may be summed up together and playback by the top (overhead) single top loudspeaker.
In an embodiment, the immersive loudspeaker system may comprise two top (overhead) loudspeakers, e.g., a top (overhead) right height loudspeaker and top (overhead) left height loudspeaker.
In an embodiment, the remaining portions of the top front right height audio channel and top rear right height audio channel may be summed up together and played back by top right height loudspeaker. Similarly, the remaining portions of the top front left height audio channel and top rear left height audio channel may be summed up together and played back by the top left height loudspeaker.
In an embodiment, the virtual height filter may have a filter transfer function and wherein the method further comprises determining the filter transfer function of the virtual height filter from one or more parameters identifying the filter transfer function.
In an embodiment, the method may further comprise storing the one or more parameters in a processor as a look-up table or as an analytical function.
In an embodiment, the virtual height filter may have a filter transfer function having a peak at a first frequency and a notch at a second frequency higher than the first frequency.
In an embodiment, at least two of the front and rear non-height loudspeakers may be laterally spaced with respect to a listening position. The method may further comprise determining a filter transfer function for the virtual height filter based on a relative distance of the at the least two loudspeakers from the listening position and on an elevation of the roof or area close to the roof relative to the listening position.
In an embodiment, at least two of the front and rear non-height loudspeakers may be laterally spaced with respect to a listening position. The method may further comprise obtaining a plurality of filter transfer functions for a plurality of virtual height filters based on a range of relative distances of said two non-height loudspeakers from the listening position and on a range of elevations of the roof or area close to the roof relative to the listening position and selecting one filter transfer function from the plurality of filter transfer functions.
In an embodiment, the selected filter transfer function may be the average of the plurality of filter transfer functions.
In an embodiment, selecting one filter transfer function from the plurality of filter transfer functions may comprise selecting one or more parameters identifying the selected filter transfer function based on an average distance of the at the least two loudspeakers from the listening position and based on an average elevation of the roof or area close to the roof relative to the listening position.
In an embodiment, the steps of obtaining, selecting, applying adding and playing back of the method described above may be iteratively applied for each selected filter transfer function until the filter transfer function provides a playback of the at least two channels with maximum perception of sound elevation.
In an embodiment, the method may further comprise applying a gain to the virtual height filter. In an embodiment, the gain may be user configurable.
In an embodiment, the listening environment is a vehicle.
Another aspect of this disclosure provides an apparatus comprising a processor and a memory coupled to the processor, wherein the processor is configured to carry out any of methods described in the present disclosure.
Another aspect of this disclosure provides a vehicle comprising such apparatus.
Other aspects of the present disclosure provide a program comprising instructions that, when executed by a processor, cause the processor to carry out the method of processing audio and further a computer-readable storage medium storing such program.
Embodiments of the present disclosure are illustrated by way of example, and not by way of limitation, in the accompanying drawings, wherein like reference numerals refer to similar elements, and in which:
Numerous specific details are described below to provide a thorough understanding of the present disclosure. However, the present disclosure may be practiced without these specific details. In addition, well-known parts may be described in less exhaustive detail. The figures are schematic and comprise parts relevant for understanding the present disclosure, whereas other parts may be omitted or merely suggested.
For example, with reference to
The immersive loudspeaker system may be for example a surround loudspeaker system with two additional height loudspeakers as shown with reference to
Alternatively, the immersive loudspeaker system may be for example a surround loudspeaker system with a single height loudspeaker as shown with reference to
Alternatively, in another example, the immersive loudspeaker system may be for example a surround loudspeaker system with two height loudspeakers as shown with reference to
With reference to
In an embodiment the audio in the immersive audio format may be audio rendered in the immersive audio format.
The immersive audio format of (e.g. rendered) audio may comprise at least one height channel. In an embodiment, the immersive audio format may be an object-based audio format supporting elevation, e.g. a Dolby Atmos format. In another embodiment, the immersive audio format may be channel-based audio format supporting elevation, e.g. a X.Y.Z audio format, where X≥2 is the number of front or surround audio channels, Y≥0 is, when present, a Low Frequency Effects or subwoofer audio channel, and Z≥1 is the at least one height audio channel. In an embodiment, the object-based audio format (e.g., supporting elevation) may be rendered or pre-rendered to a corresponding channel-based audio format for generating loudspeaker feeds corresponding to the channels of the channel-based audio format.
The method schematically illustrated in
With reference to
With reference to
In an embodiment, the portion of the front and rear height channels to which the virtual height filter is applied is a function of a location of the at least one height loudspeaker along a length of the vehicle. In an embodiment, the portion of the front and rear height channels to which the virtual height filter is applied is a linear function of the location of the at least one height loudspeaker along a length of the vehicle. For example, as shown in
In an embodiment, applying the virtual height filter to a portion of each of the plurality of front and rear height channels comprises applying the virtual height filter to a portion of the front height channels which is proportional to a location of the at least one height loudspeaker moving away from the front and getting close to the rear of the listening environment. For example, with reference to
In yet an embodiment, applying the virtual height filter to a portion of each of the plurality of front and rear height channels comprises applying the virtual height filter to a portion of the rear height channels which is proportional to a location of the at least one height loudspeaker moving away from the rear and getting close to the front of the listening environment. For example, with reference to
Similarly, in an embodiment, playing back the remaining portion of each of the plurality of front and rear height audio channels with the at least one height loudspeaker comprises playing back a remaining portion of each of the plurality of the front height audio channels which is proportional to a location of the at least one height loudspeaker moving away from the rear and getting close to the front of the listening environment. For example, with reference to
Similarly, in an embodiment, playing back the remaining portion of each of the plurality of front and rear height audio channels with the at least one height loudspeaker comprises playing back a remaining portion of each of the plurality of the rear height audio channels which is proportional to a location of the at least one height loudspeaker moving away from the front and getting close to the rear of the listening environment. For example, with reference to
It is understood that the embodiment described with reference to
In an embodiment, the portion of the front and rear height channels to which the virtual height filter is applied is a function of a desired balance distribution of the plurality of audio channels among the loudspeakers of the immersive loudspeaker system. The desired balance distribution may provide the best immersive audio experience for the listener. For example, during a calibration or test phase, it may be found that a particular ratio of the portion of height channel which is virtual filtered versus the remaining portion of height channel which is played back by the height loudspeaker provides a desired height spatial resolution, i.e. spatial resolution in the height plane.
In another embodiment, the portion of the front and rear height channels to which the virtual height filter is applied is both a function of a desired balance distribution of the plurality of audio channels among the loudspeakers of the immersive loudspeaker system and the location of the at least one height loudspeaker along a length of the vehicle. The desired spatial resolution in the height plane may be a function of the location of the height (or overhead) loudspeaker.
In general, as explained above, the choice of the portion of the front and rear height channels which is filtered by the virtual height filter is made on empirical data or experiments which provide the best performance of the immersive loudspeaker system.
In an embodiment, the portion of the front or rear height channels to which the virtual height filter is applied may be a function of a position of the listener (driver or passenger), relative to the position of the height (or overhead) loudspeaker. In an embodiment, the portion of the front or rear height channels to which the virtual height filter is applied may be dynamically adjusted/triggered in function of a change of the position of the listener relative to the position of the height (or overhead) loudspeaker. For example, the position of the listener may be adjusted in the height plane or base plane and the portion of the front or rear height channels to which the virtual height filter is applied may be changed based on this adjustment. For example, the adjustment of the portion can be triggered by an adjustment of the seat in any direction of the vehicle.
Virtual height filter 1300 is configured to at least partially attenuate spectral components of a portion of height channel 1010 which are directly emanating from loudspeaker 2, when such a portion of height channel 1010 is played back by such loudspeaker 2. Virtual height filter 1300 is further configured to at least partially amplify spectral components of the portion of height channel 1010 reflected from a roof or an area close to the roof inside the vehicle, to generate virtual height filtered audio signal 1175. Virtual height filter 1400 is configured to at least partially attenuate spectral components of a portion of height channel 1020 which are directly emanating from loudspeaker 1, when such portion of height channel 1020 is played back by such loudspeaker 1. Virtual height filter 1400 is further configured to at least partially amplify spectral components of the portion of height channel 1020 reflected from a roof or an area close to the roof inside the vehicle, to generate virtual height filtered audio signal 1200. Virtual height filter 2500 is configured to at least partially attenuate spectral components of a portion of height channel 1030 which are directly emanating from loudspeaker 3, when such portion of height channel 1030 is played back by such loudspeaker 3. Virtual height filter 2500 is further configured to at least partially amplify spectral components of the portion of height channel 1030 reflected from a roof or an area close to the roof inside the vehicle, to generate virtual height filtered audio signal 1225. Virtual height filter 2600 is configured to at least partially attenuate spectral components of a portion of height channel 1030 which are directly emanating from loudspeaker 4, when such portion of height channel 1040 is played back by such loudspeaker 4. Virtual height filter 2600 is further configured to at least partially amplify spectral components of the portion of height channel 1040 reflected from a roof or an area close to the roof inside the vehicle, to generate virtual height filtered audio signal 1250.
To explain further, reference is made to
To compensate for the undesired direct sound, it has been shown that incorporating signal processing to introduce perceptual height cues into the audio signal being fed to loudspeaker 6000 improves the positioning and perceived quality of the virtual height signal. For example, a directional hearing model has been developed to create a virtual height filter, which when used to process audio being reproduced by a loudspeaker, improves that perceived quality of the reproduction. In an embodiment, the virtual height filter is derived from both a physical loudspeaker location and a virtual loudspeaker location (above the listening position) with respect to the listening position. For the physical loudspeaker location, a first directional filter is determined based on a model of sound travelling directly from the loudspeaker location to the ears of a listener at the listening position. Such a filter may be derived from a model of directional hearing such as a database of HRTF (head related transfer function) measurements or a parametric binaural hearing model, pinna model, or other similar transfer function model that utilizes cues that help perceive height. Although a model that takes into account pinna models is generally useful as it helps define how height is perceived, the filter function is not intended to isolate pinna effects, but rather to process a ratio of sound levels from one direction to another direction, and the pinna model is an example of one such model of a binaural hearing model that may be used, though others may be used as well.
An inverse of this filter is next determined and used to remove the directional cues for audio travelling along a path directly from the physical loudspeaker location to the listening position. Next, for the virtual loudspeaker location, a second directional filter is determined based on a model of sound travelling directly from the virtual loudspeaker location to the ears of a listener at the same listening position using the same model of directional hearing. This filter is applied directly, imparting the directional cues the ear would receive if the sound were emanating from the virtual loudspeaker location above the listening position. In practice, the first directional filter and the second directional filter may be combined in a way that allows for a single filter that both at least partially removes (attenuates) the directional cues from the physical loudspeaker location, and at least partially inserts (amplify) the directional cues from the virtual loudspeaker location. Such a single filter provides a frequency response curve that is referred to herein as a “height filter transfer function,” “virtual height filter response curve,” “desired frequency transfer function,” “height cue response curve,” or similar words to describe a filter or filter response curve that filters, e.g., attenuate, direct sound components from height sound components in an audio loudspeaker system.
With regard to the filter model, if P1 represents the frequency response in dB of the first filter modeling sound transmission from the physical loudspeaker location and P2 represents the frequency response in dB of the second filter modeling sound transmission from the virtual loudspeaker location, then the total response of the virtual height filter PT in dB can be expressed as: PT=α(P2−P1), where a is a scaling factor or gain that controls the strength of the filter. With α=1, the filter is applied maximally, and with α=0, the filter does nothing (0 dB response). In practice, a may be set somewhere between 0 and 1 (e.g. α=0.5) based on the relative balance of reflected to direct sound. As the level of the direct sound increases in comparison to the reflected sound, so should a in order to more fully impart the directional cues of the virtual loudspeaker location to this undesired direct sound path. However, a should not be made so large as to damage the perceived timbre of audio travelling along the reflected path, which already contains the proper directional cues. In general, the exact values of the filters P1 and P2 will be a function of the azimuth of the physical loudspeaker location with respect to the listening position and the elevation of the reflected speaker location. This elevation is in turn a function of the distance of the physical loudspeaker location from the listening position and the difference between the height of the roof or area close to the roof (surface 6500 in
Curves 7200, 7300 and 7400 represent filter transfer functions for three different virtual height filters.
In an embodiment, as shown with reference to
The two front and/or the two rear non-height loudspeakers are laterally spaced with respect to a listening position (e.g., a driver or a passenger position in case of a vehicle). In an embodiment, as shown with reference to
For example, in one embodiment, one or more sensors may be located at or close to the listening positions to measure such relative distance of the at the least two front or rear loudspeakers from the listening position and the elevation of the roof or area close to the roof, relative to the listening position. For example, in an embodiment, such sensors may be embedded in the head rest of each seat of the vehicle approximatively at the same height of the listener's head. Said measurements may be performed at an initial calibration stage of the method or, alternatively, substantially real-time with playback of the audio.
Alternatively, additionally or optionally the filter transfer function of the virtual height filter may be based on predetermined absolute distances between the one or more listening positions and each of the non-height front and/or rear loudspeakers and predetermined elevation of the roof relative to the listening position. For example, distances between the one or more listening positions (for example any of the positions at seats 4100, 4200, 4300 or 4400 of
Alternatively, additionally or optionally, in an embodiment as shown with reference to
In an embodiment, still with reference to
In an embodiment, still with reference to
In an embodiment, said one or more seat sensors or a different set of sensors may be used to detect a new listening position, e.g., a new location of the listener's head (or location of the listener's hears). For example, the driver or passenger may adjust his own seat horizontally and/or vertically for a more comfortable seating position in the vehicle. In this embodiment, the method may retrieve/obtain (the filter transfer function of) the virtual height filter according to the new detected listening position. In this way the correct information, either based on a correct set of predetermined listener to loudspeakers distance information and set of predetermined roof elevation information, or based on actual measurements, may be used according to the new listening position. For example, if/when the predetermined one or more parameters identifying (the filter transfer function of) the virtual height filter are stored as an analytical function or a look up table (LUT), a different analytical function or a different LUT may correspond to a different (e.g. detected) seat or listening position. These embodiments in which step 1800 may be triggered upon detection of a movement of a listener may be combined with the embodiments described above in which the portion of the front or rear height channels to which the virtual height filter is applied may be dynamically adjusted/triggered in function of a change of the position of the listener relative to the position of the height (or overhead) loudspeaker. For example, both the one or more parameters identifying a particular filter transfer function and the portion of the front or rear height to which the particular filter transfer function is applied may be determined upon detection of a new listener position and/or based on the new listener position. The new listener position may be determined with any sensor or set of sensors described above. Therefore, yet a more accurate and targeted determination of the filter transfer functions and ratio of the portions of height channels which are to be filtered and the remaining portions of the height channels which are to be played back by the height loudspeaker can be achieved.
As explained above, the immersive loudspeaker system may comprise two height loudspeakers.
For example, with reference to
Virtual height filter 1300 is applied to a portion of height channel 1010 to generate virtual height filtered signal 1175. Virtual height filtered signal 1175 is added with non-height channel 1100 to generate signal 1018. Virtual height filter 1400 is applied to a portion of height channel 1020 to generate virtual height filtered signal 1200. Virtual height filtered signal 1200 is added with non-height channel 1050 to generate signal 1011. Virtual height filter 2500 is applied to a portion of height channel 1030 to generate virtual height filtered signal 1225. Virtual height filtered signal 1225 is added with non-height channel 1125 to generate signal 1033. Virtual height filter 2600 is applied to a portion of height channel 1040 to generate virtual height filtered signal 1250. Virtual height filtered signal 1250 is added with non-height channel 1150 to generate signal 1063. Channel signals 1011, 1018, 1033 and 1063 are fed to loudspeakers 1, 2, 3, and 4, respectively, for playback. The remaining portion of height channel 1020 is fed together with the remaining portion of height channel 1030 to height loudspeaker 7. The remaining portion of height channel 1010 is fed together with the remaining portion of height channel 1040 to height loudspeaker 6. By using two height loudspeakers 6 and 7 and the redistribution of the height channels among the height and the non-height loudspeakers of the immersive loudspeaker system, the spatial resolution in the height plane is further improved. This is better explained with reference to
It is understood that the various ways of determining the portions of the height channels based of height loudspeaker(s) which are virtual filtered and which have been described with reference to the embodiment of
It is also understood that similarly to the embodiment described with reference to
In an embodiment, the portion of the front and rear height channels to which the virtual height filter is applied is a function of a desired balance distribution of the plurality of audio channels among the loudspeakers of the immersive loudspeaker system. The desired balance distribution may provide the best immersive audio experience for the listener. For example, during a calibration or test phase, it may be found that a particular ratio of the portion of height channel which is virtual filtered versus the remaining portion of height channel which is played back by the height loudspeaker provides a desired height spatial resolution, i.e. a desired spatial resolution in the height plane.
In another embodiment, the portion of the front and rear height channels to which the virtual height filter is applied is both a function of a desired balance distribution of the plurality of audio channels among the loudspeakers of the immersive loudspeaker system and the location of the at least one height loudspeaker along a length of the vehicle. The desired spatial resolution in the height plane may be a function of the location of the height (or overhead) loudspeaker.
In general, as explained above, the choice of the portion of the front and rear height channels which is filtered by the virtual height filter is made on empirical data or experiments which provide the best performance of the immersive loudspeaker system.
In an embodiment, the portion of the front or rear height channels to which the virtual height filter is applied may be a function of a position of the listener (driver or passenger), relative to the position of the height (or overhead) loudspeaker. In an embodiment, the portion of the front or rear height channels to which the virtual height filter is applied may be dynamically adjusted/triggered in function of a change of the position of the listener relative to the position of the height (or overhead) loudspeaker. For example, the position of the listener may be adjusted in the height plane or base plane and the portion of the front or rear height audio channels to which the virtual height filter is applied may be changed based on this adjustment. For example, the adjustment of the portion can be triggered by an adjustment of the seat in any direction of the vehicle.
Any other suitable immersive audio format and/or speaker configuration can be envisaged, suitable for the specific implementation.
For example, in addition to the (input) channels of the examples shown with reference to
In some embodiments (not shown in the Figures), when the center channel is present in the input immersive audio format, the center channel may be mixed/added together with the non-height Front Left and Front Right channels. In such embodiments, adding the virtual filtered height channel(s) to the non-height audio channel(s) (i.e. the Front Left and/or the Front Right channels) may be performed after mixing the Front Left and Front Right channels with the Center channel. In other words, in such embodiments, the non-height channel signal that is added to the virtual filtered signal has been already mixed with the center channel.
Similar considerations are applicable for loudspeaker configurations as shown in
In some embodiments, the non-height channels, e.g. the Front Left and Front Right channels and/or the Rear Left and Rear Right channels, are processed prior to be mixed with the corresponding virtual filtered portions of the height channels. For example, the Front Left and Front Right channels and/or the Rear Left and Rear Right channels, may be processed to compensate for the off-center listening position of the passenger(s)/driver in the vehicle. Compensation of the off-center listening position may be performed with the algorithm described in EP1994795B1, which is hereby incorporated by reference in its entirety. In EP1994795B1 it was shown that it is possible to ‘virtual center’ two listening positions symmetrically off-center from the same pair of (stereo) loudspeakers at the same time. This follows the same principle of reducing the phase differences of an interaural phase difference (IDP) of a single listening position. In case of two listening positions, the phase differences of the IDP obtained for each of the two listening positions are simultaneously reduced such that each IDP at each listening position has across the desired frequency range values between −90 and 90 degrees. By compensating for the off-center listening positions and mixing the filtered portions of the height channels with the corresponding compensated Front and/or Rear non-height channels, panning of the content of the portion of height channels across the Front and/or Rear loudspeakers may be prevented.
A method of generating a plurality of audio channels from audio in an immersive audio format for playing back the plurality of audio channels with an immersive loudspeaker system comprising at least one height loudspeaker and a plurality of front and rear non-height loudspeakers has been described. Additionally, the present disclosure also relates to an apparatus for carrying out these methods. Furthermore, the present disclosure relates to a vehicle which may comprise an apparatus for carrying out these methods. An example of such apparatus 1440 is schematically illustrated in
The apparatus 1440 may be a server computer, a client computer, a personal computer (PC), a tablet PC, a set-top box (STB), a personal digital assistant (PDA), a cellular telephone, a smartphone, a web appliance, a network router, switch or bridge, or any machine capable of executing instructions (sequential or otherwise) that specify actions to be taken by that apparatus. Further, while only a single apparatus 1440 is illustrated in
The present disclosure further relates to a program (e.g., computer program) comprising instructions that, when executed by a processor, cause the processor to carry out some or all of the steps of the methods described herein.
Yet further, the present disclosure relates to a computer-readable (or machine-readable) storage medium storing the aforementioned program. Here, the term “computer-readable storage medium” includes, but is not limited to, data repositories in the form of solid-state memories, optical media, and magnetic media, for example.
Embodiments described herein may be implemented in hardware, software, firmware and combinations thereof. For example, embodiments may be implemented on a system comprising electronic circuitry and components, such a computer system. Examples of computer systems include desktop computer systems, portable computer systems (e.g. laptops), handheld devices (e.g. smartphones or tablets) and networking devices. Systems for implementing the embodiments may for example comprise at least one of an integrated circuit (IC), a programmable logic device (PLD) such as a field programmable gate array (FPGA), a digital signal processor (DSP), an application specific IC (ASIC), a central processing unit (CPU), and a graphics processing unit (GPU).
Certain implementations of embodiments described herein may comprise a computer program product comprising instructions which, when executed by a data processing system, cause the data processing system to perform a method of any of the embodiments described herein. The computer program product may comprise a non-transitory medium storing said instructions, e.g. physical media such as magnetic data storage media including floppy diskettes and hard disk drives, optical data storage media including CD ROMs and DVDs, and electronic data storage media including ROMs, flash memory such as flash RAM or a USB flash drive. In another example, the computer program product comprises a data stream comprising said instructions, or a file comprising said instructions stored in a distributed computing system, e.g. in one or more data centers.
The present disclosure is not restricted to the embodiments and examples described above. Numerous modifications and variations can be made without departing from the scope of the present disclosure, defined by the accompanying claims.
Various aspects of the present invention may be appreciated from the following enumerated example embodiments (A-EEEs and B-EEEs):
A-EEE1. A method of generating discrete channels from an immersive bitstream, comprising
B-EEE 1. A method (1000) of generating at least two audio channels from audio in an immersive audio format comprising at least one height audio channel (1010) and at least two non-height audio channels (1050, 1100), for playing back the at least two audio channels with a non-immersive loudspeaker system of at least two audio loudspeakers (1,2) inside a vehicle, the method comprising:
B-EEE 2. The method (1000) of B-EEE 1, wherein the audio in the immersive audio format further comprises at least two further non-height audio channels (1125,1150) and wherein the virtual height filtered audio signal (1175) is mixed with each one of the non-height audio channels (1050, 1100, 1125, 1150) to generate four audio channels (1008, 1016, 1032, 1064).
B-EEE 3. The method of any of the previous B-EEEs, wherein the audio in the immersive audio format comprises at least two height audio channels (1010,1020), and wherein the virtual height filter (1300, 1400) is applied to each one of the at least two height audio channels (1010, 1020) to generate at least two virtual height filtered audio signal (1175, 1200) and wherein each one of the virtual height filtered audio signals (1175, 1200) is mixed with one of the at least two non-height channels (1100, 1050).
B-EEE 4. The method of any one of the previous B-EEEs, wherein the audio in the immersive audio format comprises four height audio channels (1010,1020, 1030, 1040) and four non-height audio channels (1050, 1100, 1125, 1150), and wherein the virtual height filter (1300, 1400, 2500, 2600) is applied to each one of the four height audio channels (1010, 1020, 1030, 1040) to generate four virtual height filtered audio signals (1175, 1200, 1225, 1250) and wherein each one of the virtual height filtered audio signals (1175, 1200, 1225, 1250) is mixed with one of the four non-height channels (1100, 1050, 1125, 1150).
B-EEE 5. The method of any one of the previous B-EEEs, wherein the non-immersive loudspeaker system is a stereo or surround loudspeaker system.
B-EEE 6. The method of any one of the previous B-EEEs, wherein the virtual height filter has a filter transfer function and wherein the method further comprises determining the filter transfer function of the virtual height filter from one or more parameters identifying the filter transfer function.
B-EEE 7. The method of any one of the previous B-EEEs, wherein the virtual height filter has a filter transfer function having a peak at a first frequency and a notch at a second frequency higher than the first frequency.
B-EEE 8. The method of B-EEEs 6 and 7, wherein the one or more parameters are indicative of at least one value of: a peak, a first frequency, a notch, and a second frequency of the filter transfer function.
B-EEE 9. The method of any one of the previous B-EEEs, wherein the at least two audio loudspeakers (1,2) are laterally spaced with respect to a listening position.
B-EEE 10. The method of B-EEE 9, further comprising determining (1800) a filter transfer function for the virtual height filter based on a relative distance of the at the least two loudspeakers from the listening position and on an elevation of the roof or area close to the roof relative to the listening position.
B-EEE 11. The method of B-EEE 9, further comprising obtaining (1900) a plurality of filter transfer functions for a plurality of virtual height filters based on a range of relative distances of the at the least two loudspeakers from the listening position and on a range of elevations of the roof or area close to the roof relative to the listening position and selecting (2000) one filter transfer function from the plurality of filter transfer functions.
B-EEE 12. The method of claim 11, wherein the selected filter transfer function is the average of the plurality of filter transfer functions.
B-EEE 13. The method of claim 11 as far dependent on any of the claim 6 to 8, wherein selecting one filter transfer function from the plurality of filter transfer functions comprises selecting one or more parameters identifying the selected filter transfer function based on an average distance of the at the least two loudspeakers from the listening position and based on an average elevation of the roof or area close to the roof relative to the listening position.
B-EEE 14. The method of any of the B-EEEs 11 to 13, wherein the steps obtaining (1900), selecting (2000), applying (1500) and mixing (1700) are iteratively applied for each selected filter transfer function until the filter transfer function provides a playback of the at least two channels with maximum perception of sound elevation.
B-EEE 15. The method of any one of the B-EEEs 6 to 14, further comprising storing the one or more parameters in a processor as a look-up table or as an analytical function.
B-EEE 16. The method of any one of the preceding B-EEEs, further comprising applying a gain to the virtual height filter.
B-EEE 17. The method of B-EEE16, wherein the gain is user configurable.
B-EEE 18. The method of any one of the previous B-EEEs, wherein the audio in the immersive audio format is audio rendered in the immersive audio format and/or wherein the immersive audio format is Dolby Atmos, or any X.Y.Z audio format where X≥2 is the number of front or surround audio channels, Y≥0 is, when present, a Low Frequency Effects or subwoofer audio channel, and Z≥1 is the at least one height audio channel.
B-EEE 19. An apparatus configured to perform the method of any of B-EEEs 1-18.
B-EEE 20. A vehicle including a loudspeaker system of at least two audio loudspeakers (1,2), further comprising the apparatus of B-EEE 19.
B-EEE 21. A program comprising instructions that, when executed by a processor, cause the processor to carry out the method according to any one of the B-EEEs 1-18.
B-EEE 22. A computer-readable storage medium storing the program according to B-EEE 21.
Number | Date | Country | Kind |
---|---|---|---|
22179943.0 | Jun 2022 | EP | regional |
This application claims priority of the following priority application: U.S. provisional application 63/291,598 (reference: D21147AUSP1), filed 20-12-2021, U.S. provisional application 63/353,778 (reference: D21147AUSP2), filed 20-6-2022 and EP application EP22179943.0 (reference: D21147AEP), filed 20-6-2022.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2022/081955 | 12/19/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63353778 | Jun 2022 | US | |
63291598 | Dec 2021 | US |