This application claims priority from European Application No. 16305199.8, entitled “METHOD, COMPUTE READABLE STORAGE MEDIUM, AND APPARATUS FOR MULTICHANNEL AUDIO PLAYBACK ADAPTATION FOR MULTIPLE LISTENING POSITIONS”, filed Feb. 19, 2016, the contents of which is hereby incorporated by reference in its entirety.
The present solution relates to a method for multichannel audio playback adaptation for multiple listening positions. Further, the solution relates to a computer readable storage medium having stored therein instructions enabling multichannel audio playback adaptation for multiple listening positions. Furthermore, the solution relates to an apparatus configured to perform multichannel audio playback adaptation for multiple listening positions.
For multichannel audio playback in a home environment the number of listeners and their positions are flexible. This makes it difficult to optimize the playback for all possible listener arrangements with only one optimization setting. State of the art playback systems handle this problem by optimizing the playback for a larger listening region. Unfortunately, this approach is not optimal in case only one listener is present.
One additional problem, which typically occurs in small living rooms, are listeners sitting close to a loudspeaker. This situation can be very annoying for these listeners.
A solution that enables a user to adapt the playback level of the loudspeakers in accordance with the actual positions of all listeners would be desirable in order to create a pleasant listening experience for all listeners.
According to one aspect, a method for multichannel audio playback adaptation comprises:
Similarly, a computer readable storage medium has stored therein instructions enabling multichannel audio playback adaptation, wherein the instructions, when executed by a computer, cause the computer to:
Also, in one embodiment an apparatus for multichannel audio playback adaptation comprises:
In another embodiment, an apparatus for multichannel audio playback adaptation comprises a processing device and a memory device having stored therein instructions, which, when executed by the processing device, cause the apparatus to:
According to the present solution the users defines one or more listening regions, e.g. using a user interface of the playback device or a dedicated input device. A listening region is the area where listeners are located relatively to the loudspeaker positions. A process adapts the output parameters, such as level/gain and delay, for the signals that are routed to loudspeakers that are too close to certain listeners. The critical distance may be determined based on the precedence or Haas effect. The whole process may be performed by the actual playback device, by the input device, or by another connected device. The output parameters are adapted such that sound produced by those loudspeakers is less dominant relative to sound produced by the other loudspeakers. This ensures a pleasant listening experience also for those listeners that sit close to the loudspeakers.
Preferably, the user input is obtained by a touch pad of an input device, e.g. of a tablet or smartphone. Today smartphones or tablets are available in many households. These devices are already equipped with a touch pad and are thus well suited as an input device. The listening regions are preferably specified by the user relative to the loudspeaker positions using a top view of the listening room. Only listening regions inside the area bounded by the loudspeakers are valid. Regions behind the loudspeakers are invalid. This aspect is preferably taken into account when the listening regions are specified by the user by checking whether the user tries to specify an invalid listening region. In such a case a warning is preferably given to the user.
In one embodiment, a further user input selecting a listening region among one or more defined listening regions is received. The output parameters of the loudspeakers are then adapted such that audio playback is optimized for the selected listening region. In this way the user can select the right listening region from a menu that best matches the actual positioning of the listeners.
For a better understanding the principles of embodiments of the invention shall now be explained in more detail in the following description with reference to the figures. It is understood that the invention is not limited to these exemplary embodiments and that specified features can also expediently be combined and/or modified without departing from the scope of the present invention as defined in the appended claims. In the drawings, the same or similar types of elements or respectively corresponding parts are provided with the same reference numbers in order to prevent the item from needing to be reintroduced.
In
For example, the processing device 32 can be a processor adapted to perform the steps according to one of the described methods. In an embodiment said adaptation comprises that the processor is configured, e.g. programmed, to perform steps according to one of the described methods.
A processor as used herein may include one or more processing units, such as microprocessors, digital signal processors, or combination thereof.
The storage unit 26 and the memory device 31 may include volatile and/or non-volatile memory regions and storage devices such as hard disk drives, DVD drives, and solid-state storage devices. A part of the memory is a non-transitory program storage device readable by the processing device 32, tangibly embodying a program of instructions executable by the processing device 32 to perform program steps as described herein according to the principles of the invention.
In the following further implementation details shall be described.
The proposed solution consists basically of three steps. In the first step the user creates multiple listening regions, e.g. using a user interface of a playback device. In the second step one set of loudspeaker correction gain and delay values is computed for each listening region. Both steps need to be performed only once because the gain and delay values are constant for each listening region. In the third step the gain and delay values for a selected listening region are applied to the signals of the corresponding loudspeakers.
In order to handle different listener arrangements, the user defines different listening regions for typical arrangements. A listening region specifies the area (region) where the listeners are located in the room. In the simplest case a listening region is described by one or more polygons or concatenated rectangles in a plane that is parallel to the floor and at the height of the listener's ears. This assumes that the ears of all listeners are approximately on the same level. If the height of the ears should also be variable a listening region can alternatively be defined by a three dimensional mesh grid or by concatenated cuboids. In the following for simplicity the 2D plane is used to describe the further processing.
The listening regions are preferably specified by the user relative to the loudspeaker positions using a top view of the listening room. It has to be assured that the distances between the loudspeakers and the borders of the listening region match the real distances in the room. Therefore, it is assumed that the absolute position of each loudspeaker, for example given in Cartesian coordinates, is known by the playback device. A grid of a variable scale, for example of 20 cm×20 cm, can be used to help the user to approximate the right dimensions of the listening region in relation to the positions of the loudspeakers.
One example of a listening region for one listener is illustrated in
For each listening region the following processing is applied. First the minimal distance between the listening region and each loudspeaker is computed. For all loudspeakers that have a minimal distance to the listening region that is smaller than a critical distance, correction gain and delay values are computed. The critical distance defines the distance for which the delay and level of the loudspeaker signal has to be adapted because it would otherwise be annoying for the closest listener. It is preferably computed from the well-known precedence or Haas effect. This effect describes the dominance of one sound source over another with respect to the relative delay and level differences between the sources. Thus a delay and a gain value are obtained to make the dominant sound from the closest loudspeaker less dominant relative to other loudspeaker signals.
In the following an exemplary solution for the computation of the critical distance rcrit, the gain value αi, and the time delay value Δti for the adaptation of the i-th loudspeaker signal xi at a minimal distance ri between the listener area and the loudspeaker shall be described. In a first step the vector vpointing to the geometric centroid of the loudspeaker setup is computed:
where L is the total number of loudspeakers and xi describes the position of the i-th loudspeaker by a vector in Cartesian coordinates. Also computed is the average distance of the given loudspeaker setup
where |v−x1| defines the Euclidean distance between the two vectors.
The critical distance can be computed from the precedence effect. The precedence effect says that if the time delay between two identical signals that arrive from two different directions at the listener is less than 5 ms, the listener will perceive only one source at a position in between the two impinging directions. Therefore, one source is not perceived as dominant over the other if the precedence effect is valid. As a compromise for all listeners it is assumed that a loudspeaker is perceived as dominant if its signal arrives Δtmax=5 ms earlier at the position v than a signal of a loudspeaker that has a distance of rmean to v. The critical distance between a loudspeaker and the listening array is then defined by
r
crit=max(rmean−c·Δtmax,0)
with the speed of sound
The pressure level of a spherical source, which is used to model the loudspeaker, is inversely proportional to the distance from the source. The idea is to correct the pressure level of the loudspeaker to the level at the distance rcrit by:
Under consideration of some boundary values the gain values are computed from:
For example, the gain value may be corrected to the level at the average loudspeaker distance if the critical distance is equal to zero.
The signal of a loudspeaker that has a minimal distance smaller than the critical distance is delayed in a way that the total runtime from the loudspeaker to the listening array is equal to
Therefore, the additional time delay values are determined by:
Finally, the corrected speaker signals {circumflex over (x)}i(t) are determined from
{circumflex over (x)}
i(t)=αi·xi(t−Δti).
For playback the user selects the listening region that best matches the actual positioning of the listeners. The gain and delay values that have been computed for the selected listening region are then applied to the corresponding loudspeaker signals.
Number | Date | Country | Kind |
---|---|---|---|
16305199.8 | Feb 2016 | EP | regional |