A system and method for controlling the directivity of dialogue channels separate from music and effects channels in a piece of sound program content is described. Other embodiments are also described.
Sound program content, including movies and television shows, are often composed of several distinct audio components, including dialogue of characters/actors, music and sound effects. Each of these component parts called stems may include multiple spatial channels and are mixed together prior to delivery to a consumer. For example, a production company may mix a 5.1 channel dialogue stream or stem, a 5.1 music stream, and a 5.1 effects stream into a single master 5.1 audio mix or stream. This master stream may thereafter be delivered to a consumer through a recordable medium (e.g., DVD or Blu-ray) or through an online streaming service. Although mixing dialogue, music, and effects to form a single master mix or stream is convenient for purposes of distribution, this process often results in poor audio reproduction for the consumer. For example, intelligibility of dialogue may become an issue because the dialogue component for a piece of sound program content must be played back using the same settings as music and effects components since each of these components are unified in a single master stream. Dialogue intelligibility has become a growing and widely perceived problem, especially amongst movies played through television sets where dialogue may be easily lost amongst music and effects.
An embodiment of the invention is related to an audio system that receives a piece of sound program content for playback from a content distribution system. The piece of sound program content may include multiple components or stems. For example, the piece of sound program content may include a multi-channel dialogue signal, a multi-channel music signal, and a multi-channel effects signal. In one embodiment, the multi-channel music signal may be combined or mixed with the multi-channel effects signal to form a combined multi-channel music and effects signal.
In one embodiment, the audio system or the content distribution system may determine a first set of directivity patterns for the multi-channel dialogue signal and a second set of directivity patterns for the combined multi-channel music and effects signal. Each of the directivity patterns in the first and second sets of directivity patterns may be characterized by a directivity index. The directivity index of a beam pattern defines the ratio of sound emitted at a target (e.g., a listener) in comparison to sound emitted generally into a listening area. In one embodiment, the first set of directivity patterns associated with channels of the dialogue signal have higher directivity indexes than the second set of directivity patterns associated with corresponding channels of the combined music and effects signal. By associating dialogue components with a higher directivity than music and effects components, the system described herein increases the intelligibility of dialogue for a piece of sound program content while allowing music and effects to retain conventional directivity having a typical ratio of direct-to-reverberant sound energy.
The above summary does not include an exhaustive list of all aspects of the present invention. It is contemplated that the invention includes all systems and methods that can be practiced from all suitable combinations of the various aspects summarized above, as well as those disclosed in the Detailed Description below and particularly pointed out in the claims filed with the application. Such combinations have particular advantages not specifically recited in the above summary.
The embodiments of the invention are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” embodiment of the invention in this disclosure are not necessarily to the same embodiment, and they mean at least one.
Several embodiments are described with reference to the appended drawings are now explained. While numerous details are set forth, it is understood that some embodiments of the invention may be practiced without these details. In other instances, well-known circuits, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
As noted above, the loudspeaker arrays 3A-3F emit sound into the listening area 1. The listening area 1 is a location in which the loudspeaker arrays 3A-3F are located and in which a listener 4 is positioned to listen to sound emitted by the loudspeaker arrays 3A-3F. For example, the listening area 1 may be a room within a house or a commercial establishment or an outdoor area (e.g., an amphitheater).
The loudspeaker arrays 3A-3F shown in
Although six channel audio content is used as an example (e.g., 5.1 audio), the systems and methods described herein for optimizing sound reproduction may be similarly applied to any type of sound program content, including monophonic sound program content, stereophonic sound program content, eight channel sound program content (e.g., 7.1 audio), and eleven channel sound program content (e.g., 9.2 audio).
The loudspeaker arrays 3A-3F may be coupled to the audio receiver 2 through the use of wires and/or conduit. For example, as shown in
In other embodiments, the loudspeaker arrays 3A-3F may be coupled to the audio receiver 2 using wireless protocols such that the loudspeaker arrays 3A-3F and the audio receiver 2 are not physically joined but maintain a radio-frequency connection. For example, as shown in
As noted above, the loudspeaker arrays 3A-3F may include one or more transducers 5 housed in a single cabinet 6. For example,
Each transducer 5 may be individually and separately driven to produce sound in response to separate and discrete audio signals received from an audio source (e.g., the audio receiver 2). By allowing the transducers 5 in the loudspeaker arrays 3A-3F to be individually and separately driven according to different parameters and settings (including delays and energy levels), the loudspeaker arrays 3A-3F may produce numerous beam patterns with varied directivity indexes. For example,
The audio receiver 2 may include multiple inputs 7A-7D for receiving sound program content using electrical, radio, and/or optical signals from an external device or system. The inputs 7A-7D may be a set of digital inputs 7A and 7B and analog inputs 7C and 7D including a set of physical connectors located on an exposed surface of the audio receiver 2. For example, the inputs 7A-7D may include a High-Definition Multimedia Interface (HDMI) input, an optical digital input (Toslink), and a coaxial digital input. In one embodiment, the audio receiver 2 receives audio signals through a wireless connection with an external system or device. In this embodiment, the inputs 7A-7D include a wireless adapter for communicating with an external device using wireless protocols. For example, the wireless adapter may be capable of communicating using one or more of Bluetooth, IEEE 802.3, the IEEE 802.11 suite of standards, cellular Global System for Mobile Communications (GSM), cellular Code Division Multiple Access (CDMA), or Long Term Evolution (LTE).
General signal flow from the inputs 7A-7D will now be described. Looking first at the digital inputs 7A and 7B, upon receiving a digital audio signal through an input 7A or 7B, the audio receiver 2 uses a decoder 8A or 8B to decode the electrical, optical, or radio signals into a set of audio channels representing sound program content. For example, the decoder 8A may receive a single signal containing six audio channels (e.g., a 5.1 signal) and decode the signal into six audio signals for each of the six audio channels. The six audio channels/signals may respectively correspond to front left, front center, front right, left surround, right surround, and low-frequency effect audio channels. In another embodiment, the decoder 8A may receive multiple multi-channel audio signals corresponding to separate components of a single piece of sound program content. For example, the multiple signals decoded by the decoder 8A may correspond to a multi-channel dialogue signal/stem and a combined multi-channel music and effects signal/stem for a piece of sound program content. The decoder 8A may decode each of the received signals into corresponding channels for the piece of sound program content. The decoders 8A and 8B may be capable of decoding audio signals encoded using any codec or technique, including Advanced Audio Coding (AAC), MPEG Audio Layer II, and MPEG Audio Layer III.
Turning to the analog inputs 7C and 7D, each analog signal received by analog inputs 7C and 7D represents a single audio channel of the sound program content. Accordingly, multiple analog inputs 7C and 7D may be needed to receive each channel of a piece of multichannel sound program content (e.g., each channel of a multi-channel dialogue stream/stem and/or a multi-channel music and effects stream/stem). The analog audio channels may be digitized by respective analog-to-digital converters 9A and 9B to form digital audio channels.
The digital audio channels from each of the decoders 8A and 8B and the analog-to-digital converters 9A and 9B are output to the multiplexer 10. The multiplexer 10 selectively outputs a set of audio channels based on a control signal 11. The control signal 11 may be received from a control circuit or processor in the audio receiver 2 or from an external device. For example, a control circuit controlling a mode of operation of the audio receiver 2 may output the control signal 11 to the multiplexer 10 for selectively outputting a set of digital audio channels from one or more of the inputs 7A-7D.
The multiplexer 10 feeds the selected digital audio channels to an array processor 12 for processing. The channels output by the multiplexer 10 are processed by the array processor 12 to produce a set of processed audio signals for driving each loudspeaker array 3A-3F. In one embodiment, the array processor 12 may process the channels output by the multiplexer 10 using input from the directivity adjustment logic 13. As will be discussed in greater detail below, the directivity adjustment logic 13 may determine a set of beam patterns for a multi-channel dialogue signal of a piece of sound program content and a set of beam patterns for a combined multi-channel music and effects signal of the piece of sound program content. Each beam pattern in these sets of beam patterns may be characterized by separate directivity indexes, which are selected to improve the intelligibility of dialogue and overall reproduction of the sound program content.
The array processor 12 may operate in both the time and frequency domains using transforms such as the Fast Fourier Transform (FFT). The array processor 12 may be a special purpose processor such as an application-specific integrated circuit (ASIC), a general purpose microprocessor, a field-programmable gate array (FPGA), a digital signal controller, or a set of hardware logic structures (e.g., filters, arithmetic logic units, and dedicated state machines). As shown in
Turning now to
The method 16 may commence an operation 17 with the receipt of a piece of sound program content. The piece of sound program content may include multiple audio components or stems. For example, the sound program content may be an audio track for a movie and the audio components may include a multi-channel dialogue signal, a multi-channel music signal, and a multi-channel effects signal. As shown in
At operation 18, the multi-channel music signal and the multi-channel effects signal received at operation 17 are mixed together to generate a combined multi-channel music and effects signal. This combination may be performed for each set of channels that comprise the multi-channel music signal and the multi-channel effects signal. For example, as shown in
As shown in
Following combination of the multi-channel music signal with the multi-channel effects signal to produce a combined multi-channel music and effects signal, operation 19 transmits the multi-channel dialogue signal and the combined multi-channel music and effects signal to the receiver 2. As shown in
In one embodiment, the receiver 2 may receive the multi-channel dialogue signal and the combined multi-channel music and effects signal using one or more of the inputs 7A-7D. For example, in an embodiment in which the input 7A is a digital network interface, the receiver 2 may receive the multi-channel dialogue signal and the combined multi-channel music and effects signal using one or more network protocols.
Upon receiving the multi-channel dialogue signal and the combined multi-channel music and effects signal, operation 20 may determine a set of directivity patterns for the multi-channel dialogue signal and a separate set of directivity patterns for the combined multi-channel music and effects signal. In one embodiment, each directivity pattern determined at operation 20 may correspond to a separate channel of the multi-channel dialogue signal and the combined multi-channel music and effects signal. For example, for a 5.1 dialogue signal and a 5.1 combined music and effects signal, operation 20 may produce twelve directivity patterns (i.e., six directivity patterns for the six channels of the 5.1 dialogue signal and six directivity patterns for the six channels of the 5.1 combined music and effects signal).
In some embodiments, operation 20 may determine directivity patterns for a subset of channels in the multi-channel dialogue signal and the combined music and effects signal. For example, operation 20 may ignore a subwoofer channel such that separate directivity patterns are only generated for each mid and high range channel in the multi-channel dialogue signal and in the combined multi-channel music and effects signal. In this embodiment, the loudspeaker array 3F may be driven using a subwoofer channel of the dialogue and music and effects signals and/or low-frequency content of each other channel without directivity adjustment.
Each of the directivity patterns generated at operation 20 may be characterized by a directivity index. As noted above, directivity indexes describe the ratio of sound emitted at a target (e.g., the listener 4) in comparison to sound emitted generally into the listening area 1. For example, the directivity index for a beam pattern associated with the front center channel of the multi-channel dialogue signal may be 8 dB while the directivity index for a beam pattern associated with the front center channel of the combined multi-channel music and effects signal may be 3 dB. In this fashion, each channel of the dialogue signal and the combined music and effects signal may be separately adjusted according to audio preferences. For example, each channel of the dialogue signal may have a beam pattern with a higher directivity index than a corresponding channel of the music and effects signal. By associating dialogue components with a higher directivity than music and effects components, the method 16 increases the intelligibility of dialogue in a piece of sound program content while allowing music and effects to retain conventional directivity having a typical ratio of direct-to-reverberant sound energy.
In one embodiment, operation 20 may be performed by the directivity adjustment logic 13. The directivity adjustment logic 13 may be any set of hardware and software components that may determine directivity patterns with specified directivity indexes. In one embodiment, the directivity adjustment logic 13 may generate directivity patterns according to preferences of the user and/or based on the content or genre of the sound program content.
Although shown and described as operation 20 being performed by the receiver 2, in some embodiments operation 20 may be performed by the content distribution server 23. In these embodiments, data describing the beam patterns determined at operation 20 may be transported to the receiver 2 along with the multi-channel dialogue signal and the combined multi-channel music and effects signal. This beam pattern data may be stored as metadata for each of the dialogue and combined music and effects signals.
Following determination of a set of directivity patterns for each channel of both the multi-channel dialogue signal and the combined multi-channel music and effects signal, operation 21 may drive one or more loudspeakers 3A-3E to produce the directivity patterns from operation 20. In one embodiment, driving the loudspeaker arrays 3A-3E to produce the directivity patterns may include passing the generated directivity patterns to the array processor 12 of the receiver 2. The array processor 12 may generate a set of processed audio signals based on the directivity patterns and the audio signals/channels received from the multiplexer 10. In one embodiment, the array processor 12 may produce a set of processed audio signals for each channel of the multi-channel dialogue signal and each channel of the combined multi-channel music and effects signal. The processed audio signals may be transmitted at operation 21 to one or more transducers 5 in one or more of the loudspeakers 3A-3E using the digital-to-analog converters 14 and the power amplifiers 15 of the receiver 2. For example, as shown in
Although shown in
As noted above, directivity adjustment may be performed for a subset of channels in the multi-channel dialogue signal and the combined music and effects signal. For example, the method 16 may ignore a subwoofer channel such that separate directivity patterns are only generated for each mid and high range channel in the multi-channel dialogue signal and in the combined multi-channel music and effects signal. In this embodiment, the loudspeaker array 3F may be driven using a subwoofer channel of the dialogue and music and effects signals and/or low-frequency content of each other channel without directivity adjustment.
As shown in
As explained above, an embodiment of the invention may be an article of manufacture in which a machine-readable medium (such as microelectronic memory) has stored thereon instructions which program one or more data processing components (generically referred to here as a “processor”) to perform the operations described above. In other embodiments, some of these operations might be performed by specific hardware components that contain hardwired logic (e.g., dedicated digital filter blocks and state machines). Those operations might alternatively be performed by any combination of programmed data processing components and fixed hardwired circuit components.
While certain embodiments have been described and shown in the accompanying drawings, it is to be understood that such embodiments are merely illustrative of and not restrictive on the broad invention, and that the invention is not limited to the specific constructions and arrangements shown and described, since various other modifications may occur to those of ordinary skill in the art. The description is thus to be regarded as illustrative instead of limiting.
This application is a U.S. National Phase Application under 35 U.S.C. § 371 of International Application No. PCT/US2014/057829, filed Sep. 26, 2014, which claims the benefit of the earlier filing date of U.S. provisional application No. 62/000,226, filed May 19, 2014.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2014/057829 | 9/26/2014 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2015/178950 | 11/26/2015 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20030125933 | Saunders et al. | Jul 2003 | A1 |
20080212805 | Fincham | Sep 2008 | A1 |
20100183156 | Park | Jul 2010 | A1 |
20100296678 | Kuhn-Rahloff | Nov 2010 | A1 |
20110069850 | Harma | Mar 2011 | A1 |
Number | Date | Country |
---|---|---|
WO 2014036085 | Mar 2014 | WO |
Entry |
---|
PCT International Search Report and Written Opinion for PCT International Appln No. PCT/US2014/057829 dated Jan. 20, 2015 (9 pages). |
Number | Date | Country | |
---|---|---|---|
20170105084 A1 | Apr 2017 | US |
Number | Date | Country | |
---|---|---|---|
62000226 | May 2014 | US |