The present application claims priority to European Patent Application No. EP14194268, entitled “AUDIO SYSTEM AND METHOD,” and filed on Nov. 21, 2014, the entire contents of which are hereby incorporated by reference for all purposes.
Various embodiments relate to an audio system and to a corresponding method. In particular, various embodiments relate to techniques of routing channels of a multi-channel audio data to output channels of the audio system.
It is known to provide multi-channel audio data where audio tracks are provided for, e.g., two, five, seven, or an even larger number of channels. Playback of the multi-channel audio data can be achieved by employing a respectively configured audio system which typically comprises at least one processor having a respective number of output channels and possibly an amplifier as an end stage to which speakers may be connected.
Typically, the multi-channel audio data is provided with respect to a certain standard arrangement of speaker positions. If the positions of the speakers connected to the output channels do not deviate significantly from this standard arrangement, a good listening experience may be achieved. In particular, it may be possible to compensate to some extent for a misalignment of the positions of the speakers with respect to the standard arrangement. For example, differences in the distance between the various speakers with respect to an audio sweet spot may be compensated for. Typically, the audio sweet spot is therefore defined with respect to the positions of the speakers. At the audio sweet spot, playback of the plurality of speakers may be synchronized with respect to each other. Playback of surround sound becomes possible. In particular, typically a particularly good listening experience may be provided if the listening position of a user coincides with an audio sweet spot.
According to reference implementations, it is therefore known to compensate for deviations of the distance of the actual positions of the speakers with respect to audio sweet spot. Yet, such techniques face certain restrictions. According to the reference implementations, it may not be possible or only possible to a limited degree to compensate for deviations in other degrees of freedom in the setup of the speakers and/or changes of the listening position.
Thus, the disclosure provides advanced techniques of processing multi-channel audio data which remedy or alleviate at least some of the above-mentioned restrictions. For example, the disclosure provides for such techniques which enable to flexibly modify the routing of the audio data in view of changes of the listening position.
According to an embodiment, an audio system is provided. The audio system comprises at least one processor. The at least one processor is configured to receive multi-channel audio data from an audio source. The multi-channel audio data includes a plurality of channels. The at least one processor comprises a plurality of output channels. Each output channel is configured to be connected to a respective speaker. The at least one processor is coupled to a memory. The at least one processor is further configured to receive, from the memory, preset positions of the plurality of speakers and virtual positions. The virtual positions are associated with the plurality of channels of the multi-channel audio data. The at least one processor is further configured to establish control instructions based on the preset positions and the virtual positions. The at least one processor is further configured to route the channels of the multi-channel audio data to the output channels based on the control instructions.
For example, the multi-channel audio data may comprise two, three, four, five, six, seven, or even more channels. Each channel of the multi-channel audio data may relate to a specific audio track which is specified by digital or analogue audio data. For example the audio source may be a storage medium which stores the multi-channel audio data. Alternatively or additionally it is also possible that the audio source is a recording entity which records the multi-channel audio data.
Each one of the plurality of output channels may comprise a respective connector to which a speaker can be connected using, e.g., a wired connection. Generally, it is also possible that the speakers are connectable to the output channels via a wireless data communication. In this case, the output channels may each comprise a wireless interface. Generally, it is possible that the audio system also comprises the speakers. However, it is also possible that speakers are separate entities.
It is possible that the routing comprises audio processing of at least one channel of the multi-channel audio data is executed based on the control data. For example, the audio processing may employ techniques of digital audio processing and/or analogue audio processing. The audio processing may include applying filters, effects such as echo, fade, etc., delays, and/or phase shifts, etc.
In reference implementations, each channel of the multi-channel audio data is fixedly routed to a given output channel. Thus, according to reference implementations, each channel of the multi-channel audio data is fixedly associated with a certain speaker and is audibly perceived at the respective preset position of the certain speaker. From this routing according to reference implementations, the audio sweet spot may result.
According to various embodiments, the routing of the channels of the multi-channel audio data is flexibly set based on the control instructions. Thus, the virtual positions of the speakers—corresponding to the different channels of the multi-channel audio data—can be flexibly set. Here, the routing can include a phase shift or delay for the various output channels, e.g., audio processing. There may not be a fixed one-to-one correspondence between channels of the multi-channel audio data and the output channels.
The preset positions of the speakers may correspond to the actual positions of the speakers; e.g., the preset positions may be provided measuring the actual position of the speakers and/or retrieving the actual position of the speakers from a user input and/or estimating the actual position of the speakers, etc. Thus, the preset positions may be an approximation of the actual positions of the speakers.
The virtual positions may correspond to positions where the audio content of the associated channel of the multi-channel audio data is audibly perceived by a listener; in other words, different channels of the multi-channel audio data may be perceived at the different virtual positions. Here, in contrast to the above-mentioned reference implementations, it becomes possible that a channel of the multi-channel audio data is audibly perceived at the virtual position which may be flexibly set. The virtual positions may be set to conform with a listening position which may be defined in terms of location and/or orientation. In other words, the virtual position may be referred to as an emulated position of a respective virtual speaker. In general, the virtual positions may coincide or deviate from the preset positions. The virtual positions may be flexibly set or may be varied. This allows to account for changes in the listening position within or even outside of the audio sweet spot.
There may be a one-to-one correspondence between channels of the multi-channel audio data and output channels. For example, each channel of the multi-channel audio data may be routed to a different output channel. However, it is also possible that at least one channel of the multi-channel audio data is routed to a plurality of output channels.
The number of channels of the multi-channel audio data and output channels may vary. For example, the at least one processor may be configured to route a first number of channels of the multi-channel audio data to a second number of output channels based on the control instructions. The first number may be larger or smaller than the second number.
In such a scenario where the first number is smaller than the second number, it is possible to rely on otherwise unused output channels of the audio system to emulate the virtual positions. For example, the audio system may be a 7.1-audio system, providing seven output channels and an additional bass output channel. The multi-channel audio data may provide five audio channels. Then it may be possible to use all seven output channels to flexibly emulate various virtual positions which, e.g., deviate from the actual positions of the speakers.
Generally, the routing may include routing at least one channel of the multi-channel audio data to more than one output channel. The at least one processor may be configured to route at least one channel of the multi-channel audio data to two or more output channels based on the control instructions. For example, the control instructions may specify the amplitudes of each output channel to which the at least one channel of the multi-channel audio data is routed.
If playback of a given channel of the multi-channel audio data is executed for a plurality of output channels, then the corresponding virtual position deviates from the preset positions of the speakers. This allows to flexibly set the virtual position.
Likewise, it is possible that the least one processor is configured to mix at least two channels of the multi-channel audio data to a given output channel based on the control instructions. Thus, at a given output channel, playback of a superposition of a plurality of channels of the multi-channel audio data may be implemented. Then, it becomes possible to flexibly specify the virtual positions for a plurality of channels of the multi-channel audio data.
Said mixing may correspond to adding the amplitudes of the at least two channels of the multi-channel audio data in a defined manner; in particular, an amplitude or relative contribution to the signal output via the given output channel may be specified for each one of the at least two channels. Such information may be provided in the control instructions.
By determining the control instructions and by executing said routing based on the control instructions, it is possible to tailor the virtual positions. In particular, it becomes possible to flexibly set the virtual positions. This flexibly setting of the virtual positions may be—to some degree—independent of the preset positions. Thus, a larger flexibility in the actual positioning of the speakers may be achieved; in particular, it may not be required to physically position the speakers according to a certain standard arrangement for which the multi-channel audio data is provided. It may be possible to account for a varying listening position.
As mentioned above, it may be possible to account for changes of the position and/or the orientation of the listening position of a user. This may be flexibly done by a user according to the user's needs. Generally, it may be possible to adjust the virtual positions of the speakers such that a virtual stage defined by the channels of the multi-channel audio data conforms with the listening position. The virtual stage may be defined in terms of left-right/front-rear directions of the content of the multi-channel audio data. Audible perception of surround sound may be positioned with respect to the virtual stage. For example, if the listening position is rotated by 180°, in a scenario where the virtual positions are not adapted, left and right perception, as well as front and rear perception will be interchanged, respectively. Then, the listening position is rotated, but the virtual stage remains fixed. A degraded listening experience would result. By correspondingly adjusting the virtual positions, e.g., by turning the virtual stage by 180°, it is possible to compensate for this.
It should be noted that even when the virtual positions are adjusted, certain pre-defined settings associated with the speakers may be preserved. In particular, such settings may be persevered which correspond to implementation of the audio sweet spot. For example, adjusting the virtual positions may account for changes in the listening position within the audio sweet spot.
As can be seen from the above, by processing the multi-channel audio signal according to techniques as presented above, the channels can be distributed among the speakers in a manner that the virtual positions are perceived by a listener at positions that are optimal for the given listening situation. This allows to compensate, e.g., for off-center positioning of center speakers or main speakers which are not symmetrically placed.
The audio system may further comprise a human machine interface (HMI). The HMI may be configured to receive a user input from a user of the audio system. The at least one processor may be configured to determine and write to the memory at least one of the preset positions and/or at least one of the virtual positions based on the user input.
For example, the HMI may comprise elements selected from the group comprising: a keyboard, a mouse, a display, a remote control, a wireless transceiver configured to wirelessly receive the user input from a portable user equipment, a Local Area Network (LAN) transceiver configured to receive the user input from a LAN. For example, the user may be able to specify the virtual positions and/or the preset positions in an application executed on a smartphone.
Generally, the user input may take various forms. In one scenario, the user input may specify, for each one of the speakers, a coordinate corresponding to the preset position and a coordinate corresponding to the virtual position; the coordinates may be defined in a reference coordinate system.
In such a scenario, the user may be able to flexibly set and modify all of the preset positions and the virtual positions according to his needs. For example, the user may be able to manually move the virtual positions using the remote control and/or a smartphone application.
It is also possible that the user input is restricted to a more specific information. In particular, the user input may relate to a specific mapping or offset between virtual positions and preset positions. For example, the user input may indicate, for each one of the plurality of speakers, a rotation of a corresponding one of the virtual positions with respect to a reference direction. The reference direction may be optionally defined with respect to an orientation of the preset positions of the plurality of speakers. Alternatively or additionally, the reference direction may be defined by the preset positions, e.g., by the preset position of a center speaker or the like.
In such a scenario as mentioned above, the user may adjust the virtual positions based on the preset positions. This may make it comparably simple to achieve a desired listening experience.
For example, the user input may indicate a uniform rotation of the virtual positions with respect to the reference direction. Thereby, it becomes possible to specifically adapt the orientation of the virtual stage; this may be of value where the orientation of the listening position changes, e.g., without a change of the position of the listening position. This may be the case where the user employs two or more displays for synchronized playback of accompanying visual content. In such a scenario, the user can manually turn the audibly perceived stage.
For example, in such scenarios as mentioned above with respect to the rotation of one or more virtual positions, the preset positions for the speakers may correspond to the actual positions of the speakers. It is possible that the reference direction is defined by means of the actual positions of the speakers; e.g., the reference direction may be defined as a center direction symmetrically located between left and right directions defined by the respective speakers connected to the corresponding output channels. Then, it is possible that by specifying the rotation of the respective virtual positions with respect to the reference direction, particular ones or all of the virtual positions are offset against the reference direction.
Such scenarios that rely on a user input may be combined with presets. For example, the user input may indicate a selection of a given one of a plurality of candidate virtual positions as the at least one virtual position determined by the at least one processor based on the user input. For example, the user may define and store the plurality of candidate virtual positions as the presets. This allows fast and simple selection of the virtual positions. For example, it may be possible that the presets are provided for certain preferred or reoccurring listening positions, e.g., on a couch, a writing table, etc. For example, for the different listening positions different screens for playback of an accompanying visual content may be used. The listener may be orientated in another direction. The listener may switch between different listening positions, e.g., when moving from one part of a room to another part of the room.
It is also possible that the user input indicates a radial offset between the at least one virtual position determined by the at least one processor based on the user input and a reference radial distance. As discussed above with respect to the reference direction, it is possible that the reference radial distance is defined with respect to the preset positions.
In this manner, it is possible that the listener experiences the modified at least one virtual position to be closer or more remote. Thus a re-positioning of the listening position may accounted for. The radial offset may be achieved by appropriately adjusting amplitudes, phase, and/or delay of the routing of the various channels of the multi-channel audio data.
Above, primarily scenarios have been discussed where the user input parameterizes the virtual positions in some manner. However, sometimes it may be desirable to freely move the virtual position of a specific channel; this may be particularly relevant where an immediate audible feedback of the moved virtual position is provided.
The at least one processor may be configured to execute a positioning routine. The positioning routine may comprise the at least one processor routing a given channel of the multi-channel audio data at two or more output channels based on the control instructions established for a present virtual position. The positioning routine further comprises the HMI receiving the user input indicating an offset value for the present virtual position and the at least one processor adjusting said routing of the given channel of the multi-channel audio data based on the offset value. Channels of the multi-channel audio data other than the given channel may be muted. In such a manner, it is possible that the user can manually move the virtual sound sources and receive audible feedback as part of the positioning routine. In such a manner, it is possible that the user can accurately position the virtual position of a speaker.
In a further scenario, the user input may indicate an assignment of the at least one virtual position determined by the at least one processor based on the user input to the at least one preset position determined by the at least one processor based on the user input. In other words, the virtual positions of a specific channel of the multi-channel audio data may be mapped to certain preset positions. Here, the virtual positions may not deviate from the preset positions; nonetheless, a specific routing of channels of the multi-channel audio data to output channels is implemented depending on the user input. In such a scenario, the user may easily switch the routing of the output channels between different channels of the multi-channel audio data.
As can be seen from the above, generally the user input may specify one or more parameters. In particular, the parameters specified by the user input may vary in various scenarios. Generally, based on the user input it is possible to determine at least one of the preset positions and/or at least one of the virtual positions. In this regard, the user input may indicate the at least one virtual position of the at least one speaker and/or the at least one preset position determined by the at least one processor based on the user input with respect to at least one of a position of a display, a position of a user, a head orientation of the user, and a reference coordinate system. In this manner, it becomes possible that the user can easily select the appropriate preset position(s) and/or virtual position(s).
Generally, it is also possible that the virtual positions are determined on input parameters other than the user input. For example, it is possible that the virtual positions are determined automatically or semi-automatically. For example, for this purpose a calibration routine may be executed by the at least one processor. The calibration routine may comprise the at least one processor routing a reference signal via the output channels and the at least one processor controlling at least two microphone interfaces to each receive a recording of a playback of the reference signal as a respective calibration track. Then the at least one processor may be configured to determine the virtual positions based on the detected calibrations signals. By providing the at least two microphone interfaces, stereo recording becomes possible. In particular, it is possible to determine a direction with respect to a reference coordinate system of the various speakers, respectively the preset positions. The playback may be executed serially via the output channels; only one channel may be activated at a time. For example, the reference signal may be a pulse train signal and/or a signal of a given frequency or frequency bandwidth.
Then it may be possible to, e.g., rotate the virtual positions against the preset positions automatically such that the virtual positions align with a reference direction. The reference direction may, in turn, be defined with the reference coordinate system and/or a user input. For example, the reference coordinate system may be defined by a setup of microphones connected to the microphone interfaces.
Above, primarily techniques of determining the virtual positions and/or the preset positions have been discussed. Once the virtual positions and the preset positions have been established, it is possible to determine the control instructions. The control instructions may allow to appropriately route the channels of the multi-channel audio data to the output channels of the audio system such that the virtual positions are emulated based on the preset positions. In this regard, various techniques of determining the control instructions may be employed.
It is possible that the at least one processor is configured to determine the control instructions for each one of the virtual positions based on a spatial difference between the respective virtual position and at least one respective neighboring one of the preset positions. The at least one processor may be alternatively or additionally configured to determine the control instructions based on a difference vector between the virtual positions and the position of the listening position. Here, a length and/or orientation of the difference vector may be taken into account.
For example, if a virtual position is located in between two preset positions of two speakers, it is possible that both speakers contribute to the playback of the respective channel of the multi-channel audio data. For example, an amplitude of the routing to the different output channels may be determined by the spatial distance between said virtual position and each one of the two preset positions. Further, in order to appropriately account for a direction at which the user is located, the difference vector between the virtual positions and the listening position may be taken into account.
Above, primarily techniques are discussed where the preset positions are determined based on the user input. However, alternatively or additionally, it is also possible, e.g., to measure or otherwise establish the preset positions of the speakers.
This may be done as part of the above-mentioned calibration routine. For example, the audio system may further comprise the at least two microphone interfaces. The at least one processor may be configured to execute the calibration routine. The at least one processor may further be configured to determine the preset positions of the plurality of speakers based on the detected calibration signals.
The determining of the virtual positions and/or the preset positions may involve detecting time differences between the detecting of the playback at each one of the at least two microphone interfaces. Triangulation techniques may be employed.
In such a manner, it is possible to determine the preset positions at a comparably high accuracy. Moreover, automatically determining the preset positions becomes possible. In such scenarios, it may be unnecessary to rely on a user input.
According to an embodiment, a method is provided. The method comprises at least one processor of an audio system receiving multi-channel audio data from an audio source. The multi-channel audio data includes a plurality of channels. The method further comprises the at least one processor receiving preset positions of a plurality of speakers connectable to the audio system via respective output channels. The method further comprises the at least one processor receiving virtual positions associated with the plurality of channels of the multi-channel audio data. The method further comprises the at least one processor establishing control instructions based on the virtual positions and further based on the preset positions. The method further comprises the at least one processor routing the channels of the multi-channel audio data to the output channels based on the control instructions.
For example, the audio system according to a further embodiment can be configured to execute the method according to the presently discussed aspect.
For such a method, effects may be achieved which can be achieved for the audio system according to a further embodiment.
It is to be understood that the features mentioned above and features yet to be explained below can be used not only in the respective combinations indicated, but also in other combinations or in isolation, without departing from the scope of the present disclosure. Features of the above-mentioned aspects and embodiments may be combined with each other in other embodiments.
In the following, embodiments will be described in detail with reference to the accompanying drawings. It is to be understood that the following description of embodiments is not to be taken in a limiting sense. The scope of the disclosure is not intended to be limited by the embodiments described hereinafter or by the drawings, which are taken to be illustrative only.
The drawings are to be regarded as being schematic representations and elements illustrated in the drawings are not necessarily shown to scale. Rather, the various elements are represented such that their function and general purpose become apparent to a person skilled in the art. Any connection or coupling between functional blocks, devices, components, or other physical or functional units shown in the drawings or described herein may also be implemented by an indirect connection or coupling. A coupling between components may also be established over a wireless connection. Functional blocks may be implemented in hardware, firmware, software, or a combination thereof.
Hereinafter, techniques of routing multi-channel audio data at an audio system which comprises one or more processors having a plurality of output channels are discussed. For example, the audio system may be an audio-video receiver or the like. The processor is capable of routing the various channels of the multi-channel audio data to one or more output channels. The routing may include audio processing, e.g., applying filters, adding effects, adding delay, and/or adding a phase shift. Speakers can be connected to the output channels.
This routing is based on control instructions. The control instructions are determined based on preset positions of the speakers and further based on virtual positions associated with the channels of the multi-channel audio data. For example, the preset positions can correspond to actual positions of the speakers while the virtual positions of the speakers can correspond to positions where a virtual speaker corresponding to the associated channel of the multi-channel audio data is audibly perceived by a user. The virtual positions may or may not deviate from the preset positions.
Such a scenario is illustrated in
In
Namely, typically the 5.1 audio data is compiled with respect to a certain standard arrangement of speakers. In the standard arrangement, e.g., the actual positions of left and right front speakers should be symmetrically arranged with respect to a front center speaker according to some standards. As can be seen from
This may have particular impact where the multi-channel audio data is accompanied by visual content. Then, the listening position 150 may be positioned and orientated such that the playback of the multi-channel audio data fits to the playback of the accompanying visual content on a display 160. The virtual stage and the display 160 may be aligned. In this regard, the listening position 150 may be associated with an orientation (indicated in
To preserve virtual stage even when the preset positions 120-1-120-5 deviate from the standard arrangement of speakers, techniques according to various embodiments can be employed. These techniques rely on the emulation of virtual positions 110-1-110-5. These virtual positions 110-1-110-5 define the positions where the audio content of the channels of the audio data is audibly perceived.
As can be seen, in the scenario of
The virtual positions 110-1-110-4 are emulated by routing the channels of the multi-channel audio data to the output channels based on respectively determined control instructions. The control instructions are determined based on the preset positions 120-1-120-5 and based on the virtual positions 110-1-110-5. In particular, when determining the control instructions, the distance 180 between the virtual positions 110-1-110-4 and neighboring preset positions 120-1-120-5 can be taken into account.
The emulation of a given virtual position 110-1 is shown at greater detail in
As can be seen from
In
In
Referring further to
In
The virtual positions 110-1-110-5 may be set by a user input received via a HMI of the audio system and/or automatically set, e.g., according to a calibration routine. The virtual positions 110-1-110-5 may be set, according to the user input 410, such that the first listening position 150 is transitioned to the second listening position 350 (cf.
To compensate for the increased distance between the second listening position 350 and the center preset position 120-2 corresponding to the center speaker, the amplitude of playback for the left/right/center virtual positions 110-1-110-3 is increased (illustrated in
From a comparison of
The user input 410 can take various forms. For example, the user input 410 can be defined with respect to the position of the display 160, the listening position 150, 350, a head orientation of the user, or a reference coordinate system. By such techniques, the user may easily specify the desired virtual positions 110-1-110-5.
In
Generally, the user input 410 may operate according to presets. For example, the user may predefine certain listening positions 150, 350. Then, by selecting a specific listening position 150, 350, respectively by selecting specific candidate virtual positions from the list of presets, a simple and fast control becomes possible.
In
In
Thus, in the scenario of
For example, in the scenario of
In
The sound system 600 further comprises the memory 611. The memory 611 stores control data for the first processor 610. Executing the control data causes the first processor 610 to execute techniques according to various embodiment explained above. In particular, executing the control data causes the processor 610 to execute techniques associated with the establishing of the control instructions 215 based on the present positions 120-1-120-5 and virtual positions 110-1-110-5. The first processor 610 provides the control data to the second processor 620 to route the channels 610-1-610-4 of the multi-channel audio data 211 to the output channels 291-1-291-5 based on the control instructions 215.
The second processor 620 in the scenario of
Further shown in
In the scenario of
The sound system 600 further comprises two microphone interfaces which are connected to microphone 640-1, 640-2. The processors 610, 620 are configured to execute a calibration routine. The calibration routine comprises routing reference signals via the output channels 291-1-291-5. Via each one of the microphone interfaces the second processor 620 receives a recording of the playback of the reference signal as a respective calibration track. Based on the calibration tracks, the first processor 610 is configured to determine the preset positions 120-1-120-5 of the plurality of speakers 290-1-290-4. Employing such a calibration routine, it becomes possible to determine the preset positions 120-1-120-5 as the actual positions of the plurality of speakers 290-1-290-4 in an accurate manner. Further, the first processor 610 is configured to determine the virtual positions 110-1-110-5 based on the calibration tracks; e.g., the first processor 610 can be configured determine the virtual positions 110-1-110-5 such that the virtual stage aligns with respect to a reference direction which may be defined by a setup of the microphones 640-1, 640-2 and/or a corresponding user input. Asymmetries between left and right orientations can be compensated for. A center image may be aligned with the reference direction.
In
At S2, the preset positions 120-1-120-5 of the plurality of speakers 290-1-290-5 are received. For example, the preset positions can be stored in the memory 611. Further, at S2, the virtual positions 110-1-110-5 for the plurality of channels 210-1-210-4 are received. For example, the virtual positions can be stored in the memory 611.
Next, at S3, the control instructions 215 are determined based on the virtual positions 110-1-110-5 and based on the preset positions 120-1-120-5.
Next, at S4, the channels 210-1-210-4 of the multi-channel audio data 211 are routed to the output channels 291-1-291-5 based on the control instructions 215.
Although the disclosure has been shown and described with respect to certain preferred embodiments, equivalents and modifications will occur to others skilled in the art upon the reading and understanding of the specification. The present disclosure includes all such equivalents and modifications and is limited only by the scope of the appended claims.
For example, above the techniques of routing have been primarily discussed in terms of forwarding an audio track associated with a given channel of the multi-channel audio data. It is possible that the routing includes techniques of audio processing. Here, it is possible that to some larger or smaller degree the audio tracks associated with the different channels of the multi-channel audio data are re-computed or modified according to the audio processing.
Number | Date | Country | Kind |
---|---|---|---|
EP14194268 | Nov 2014 | EP | regional |