The present invention relates to technology for encoding and decoding digital audio, and more particularly to a method and apparatus for encoding and decoding scalable digital audio, which can efficiently package high-multichannel digital audio into a digital audio packet.
As the demand for enjoying movies using high-quality images and sounds increases, the importance of more dynamic and realistic sounds continues to increase. Accordingly, the number of consumers who spare no expense to purchase not only video apparatuses, such as projectors or large-sized displays, but also high-multichannel speaker systems is gradually increasing.
Representative digital audio formats include a format for 5.1 channel audio and a format for 7.1 channel sound.
5.1 channel sound is a method in which 5 speakers are disposed on left, center and right sides in front of a listener and on left and right sides behind the listener and a (0.1 channel) subwoofer adapted to augment low-pitched sound is separately configured. The front left and right and rear left and right speakers function to provide a three-dimensional (3D) sound effect for a fixed listener, and the front center speaker functions to fix the conversation of a performer on a screen on the front screen for the sake of a moving listener. Representative 5.1 channel sound methods include Dolby digital 5.1, known as AC-3 format, and DTS (Digital Theater Systems).
7.1 channel sound is a surround sound listening method composed of 7 directional speakers and a single subwoofer. 7.1 channel sound is a method that improves a 3D effect by adding 2 speakers on either side of conventional 5.1 channels composed of 5 directional speakers and a single subwoofer.
Referring to
Apart from the 5.1 channel sound and the 7.1 channel sound shown in
Referring to
In order to output sounds in association with objects via the array speakers behind the screen, conventional 5.1 channel sound and 7.1 channel sound are insufficient, and more audio channels must be assigned.
For example, if the number of speakers arranged behind the screen is 15, 12 additional audio channels are required compared to conventional 5.1 channels in which 3 speakers are arranged behind a screen.
Furthermore, one of the important issues of high-multichannel audio systems is compatibility with a conventional 5.1 channel sound system or 7.1 channel sound system. That is, even when the same digital audio packet is provided to a theater, sound must be able to be output via a larger number of channels, i.e., 5.1 channels, 7.1 channels or 16 or more channels in accordance with the sound system environment of the theater.
Accordingly, there is an urgent need for new encoding/decoding technology that can efficiently encode high-multichannel audio as well as conventional 5.1 channel audio and conventional 7.1 channel audio.
An object of the present invention is to effectively package high-multichannel sound data into a digital audio packet while maintaining compatibility with conventional sound systems, such as a 5.1 channel sound system and a 7.1 channel sound system.
Another object of the present invention is to provide audio data in an optimum form while maintaining compatibility with a sound system environment capable of supporting only a conventional sound system and a high-end sound system environment capable of real-time high-multichannel sound system decoding.
A further object of the present invention is to immediately output sounds without an inconvenient operation, such as decoding or decompression, in a sound system environment supporting only a conventional sound system and to rapidly restore high-multichannel audio data and then output realistic sounds in a high-multichannel high-end sound system environment.
In order to accomplish the above objects, according to the present invention, there is provided a method of decoding scalable digital audio, including: receiving a digital audio packet, including a plurality of pieces of direct audio channel data mapped to respective physical channels in a one-to-one correspondence and a plurality of pieces of indirect audio channel data indirectly mapped to respective physical channels; extracting the pieces of direct audio channel data from the digital audio packet; and matching the pieces of direct audio channel data to the respective physical channels in a one-to-one correspondence and then performing output.
Furthermore, according to an embodiment of the present invention, there is provided a method of decoding scalable digital audio, including: receiving a digital audio packet, including a plurality of pieces of direct audio channel data mapped to respective physical channels in a one-to-one correspondence and a plurality of pieces of indirect audio channel data indirectly mapped to respective physical channels; extracting the digital audio packet from the pieces of indirect audio channel data; generating pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of indirect audio channel data using the pieces of indirect audio channel data; and matching the pieces of audio channel data to the respective physical channels in a one-to-one correspondence and then performing output.
In this case, the pieces of direct audio channel data may be pulse code modulated wave files, and may be maintained in a state of being uncompressed while the digital audio packet is being packaged, distributed, and stored.
In this case, the digital audio packet may include a plurality of channel containers each adapted to record digital audio channel data; and each of the pieces of direct audio channel data and the pieces of indirect audio channel data may be mapped to one of the plurality of channel containers in a one-to-one correspondence.
In this case, the pieces of indirect audio channel data may be recorded in at least part of the remainder of the channel containers in which the pieces of direct audio channel data have not been recorded.
In this case, generating the pieces of audio channel data may include generating the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of indirect audio channel data by performing a decompression process on the pieces of indirect audio channel data.
In this case, the digital audio packet may include 16 channel containers; the number of the pieces of direct audio channel data may be equal to or more than 6 and equal to or less than 11; and the number of pieces of indirect audio channel data may be 6.
In this case, the number of pieces of direct audio channel data may be even.
In this case, the pieces of audio channel data may be output as respective audio outputs of a high-multichannel audio system for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
In this case, the method may further include receiving channel assignment information corresponding to the digital audio packet, and generating the pieces of audio channel data may be performed based on the channel assignment information.
Furthermore, according to an embodiment of the present invention, there is provided a method of coding scalable digital audio, including: generating a plurality of pieces of direct audio channel data mapped to respective physical channels in a one-to-one correspondence; generating pieces of indirect audio channel data that are synchronized with video sources corresponding to the pieces of direct audio channel data and that are indirectly mapped to respective physical channels; and generating a digital audio packet using the pieces of direct audio channel data and the pieces of indirect audio channel data.
In this case, the number of pieces of indirect audio channel data may be less than the number of physical channels corresponding to the pieces of indirect audio channel data.
In this case, the method may further include providing channel assignment information corresponding to the digital audio packet, together with the digital audio packet, so that the channel assignment information can be considered when the digital audio packet is decoded in a decoding stage.
Furthermore, according to an embodiment of the present invention, there is provided an apparatus for decoding scalable digital audio, including: an audio packet reception unit configured to receive a digital audio packet, including a plurality of pieces of direct audio channel data mapped to respective physical channels in a one-to-one correspondence and a plurality of pieces of indirect audio channel data indirectly mapped to respective physical channels; a direct audio channel extraction unit configured to extract the pieces of direct audio channel data from the digital audio packet; and an audio channel output unit configured to match the pieces of direct audio channel data to the respective physical channels in a one-to-one correspondence and then perform output.
Furthermore, according to an embodiment of the present invention, there is provided an apparatus for decoding scalable digital audio, including: an audio packet reception unit configured to receive a digital audio packet, including a plurality of pieces of direct audio channel data mapped to respective physical channels in a one-to-one correspondence and a plurality of pieces of indirect audio channel data indirectly mapped to respective physical channels; an indirect audio channel extraction unit configured to extract the digital audio packet from the pieces of indirect audio channel data; an indirect audio channel decoding unit configured to generate pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of indirect audio channel data using the pieces of indirect audio channel data; and an audio channel output unit configured to match the pieces of audio channel data to the respective physical channels in a one-to-one correspondence and then perform output.
Furthermore, according to an embodiment of the present invention, there is provided an apparatus for coding scalable digital audio, including: a direct audio channel generation unit configured to generate a plurality of pieces of direct audio channel data mapped to respective physical channels in a one-to-one correspondence; an indirect audio channel generation unit configured to generate pieces of indirect audio channel data that are synchronized with video sources corresponding to the pieces of direct audio channel data and that are indirectly mapped to respective physical channels; and a digital audio packet generation unit configured to generate a digital audio packet using the pieces of direct audio channel data and the pieces of indirect audio channel data.
Furthermore, according to an embodiment of the present invention, there is provided a method of decoding scalable digital audio, including: receiving a digital audio packet, including a plurality of pieces of uncompressed audio channel data maintained in a state of being uncompressed and a plurality of pieces of compressed audio channel data generated by compressing pieces of audio channel data corresponding to respective physical channels; extracting the pieces of uncompressed audio channel data from the digital audio packet; and matching the pieces of uncompressed audio channel data to the respective physical channels in a one-to-one correspondence and then performing output.
Furthermore, according to an embodiment of the present invention, there is provided a method of decoding scalable digital audio, including: receiving a digital audio packet, including a plurality of pieces of uncompressed audio channel data maintained in a state of being uncompressed and a plurality of pieces of compressed audio channel data generated by compressing pieces of audio channel data corresponding to respective physical channels; extracting the pieces of compressed audio channel data from the digital audio packet; generating the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of compressed audio channel data using the pieces of compressed audio channel data; and matching the pieces of audio channel data to the respective physical channels in a one-to-one correspondence and then performing output.
In this case, the pieces of uncompressed audio channel data may be pulse code modulated wave files, and may be maintained in a state of being uncompressed while the digital audio packet is being packaged, distributed, and stored.
In this case, the digital audio packet may include a plurality of channel containers each adapted to record digital audio channel data; and each of the pieces of uncompressed audio channel data and the pieces of compressed audio channel data may be mapped to one of the plurality of channel containers in a one-to-one correspondence.
The pieces of compressed audio channel data may be recorded in at least part of the remainder of the channel containers in which the pieces of uncompressed audio channel data have not been recorded.
In this case, generating the pieces of audio channel data may include generating the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of compressed audio channel data by performing a decompression process on the pieces of compressed audio channel data.
In this case, the digital audio packet may include 16 channel containers; the number of the pieces of uncompressed audio channel data may be equal to or more than 6 and equal to or less than 11; and the number of pieces of compressed audio channel data may be 6.
In this case, the number of pieces of uncompressed audio channel data may be even.
In this case, the pieces of audio channel data may be output as respective audio outputs of a high-multichannel audio system for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
In this case, the method may further include receiving channel assignment information corresponding to the digital audio packet; generating the pieces of audio channel data may be performed based on the channel assignment information.
Furthermore, according to an embodiment of the present invention, there is provided a method of coding scalable digital audio, including: generating a plurality of pieces of uncompressed audio channel data that is maintained in a state of being uncompressed; generating pieces of compressed audio channel data that are synchronized with video sources corresponding to the pieces of uncompressed audio channel data and that are generated by compressing pieces of audio channel data corresponding to respective physical channels; and generating a digital audio packet using the pieces of uncompressed audio channel data and the pieces of compressed audio channel data.
In this case, the number of pieces of compressed audio channel data may be less than the number of physical channels corresponding to the pieces of compressed audio channel data.
In this case, the method may further include providing channel assignment information corresponding to the digital audio packet, together with the digital audio packet, so that the channel assignment information can be considered when the digital audio packet is decoded in a decoding stage.
Furthermore, according to an embodiment of the present invention, there is provided an apparatus for decoding scalable digital audio, including: an audio packet reception unit configured to receive a digital audio packet, including a plurality of pieces of uncompressed audio channel data maintained in a state of being uncompressed and a plurality of pieces of compressed audio channel data generated by compressing pieces of audio channel data corresponding to respective physical channels; an uncompressed audio channel extraction unit configured to extract the pieces of uncompressed audio channel data from the digital audio packet; and an audio channel output unit configured to match the pieces of uncompressed audio channel data to the respective physical channels in a one-to-one correspondence and then perform output.
Furthermore, according to an embodiment of the present invention, there is provided an apparatus for decoding scalable digital audio, including: an audio packet reception unit configured to receive a digital audio packet, including a plurality of pieces of uncompressed audio channel data maintained in a state of being uncompressed and a plurality of pieces of compressed audio channel data generated by compressing pieces of audio channel data corresponding to respective physical channels; a compressed audio channel extraction unit configured to extract the pieces of compressed audio channel data from the digital audio packet; a compressed audio channel decoding unit configured to generate the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of compressed audio channel data using the pieces of compressed audio channel data; and an audio channel output unit configured to match the pieces of audio channel data to the respective physical channels in a one-to-one correspondence and then perform output.
Furthermore, according to an embodiment of the present invention, there is provided an apparatus for coding scalable digital audio, including: an uncompressed audio channel generation unit configured to generate a plurality of pieces of uncompressed audio channel data that is maintained in a state of being uncompressed; a compressed audio channel generation unit configured to generate pieces of compressed audio channel data that are synchronized with video sources corresponding to the pieces of uncompressed audio channel data and that are generated by compressing pieces of audio channel data corresponding to respective physical channels; and a digital audio packet generation unit configured to generate a digital audio packet using the pieces of uncompressed audio channel data and the pieces of compressed audio channel data.
According to the present invention, it may be possible to effectively package high-multichannel sound data into a digital audio packet while maintaining compatibility with conventional sound systems, such as a 5.1 channel sound system and a 7.1 channel sound system.
Furthermore, according to the present invention, it may be possible to provide audio data in an optimum form while maintaining compatibility with a sound system environment capable of supporting only a conventional sound system and a high-end sound system environment capable of real-time high-multichannel sound system decoding.
Moreover, according to the present invention, it may be possible to immediately output sounds without an inconvenient operation, such as decoding or decompression, in a sound system environment supporting only a conventional sound system and to rapidly restore high-multichannel audio data and then output realistic sounds in a high-multichannel high-end sound system environment.
The present invention will be described in detail below with reference to the accompanying drawings. Redundant descriptions and descriptions of well-known functions and configurations that may make the gist of the present invention unnecessarily obscure will be omitted below. The embodiments of the present invention are provided to fully describe the present invention to persons having ordinary knowledge in the art to which the present invention pertains. Accordingly, the shapes, sizes and the like of components in the drawings may be exaggerated for clearer description.
Preferred embodiments according to the present invention will be described in detail with reference to the accompanying drawings.
Referring to
In the high-multichannel sound system shown in
In the sound system shown in
In this case, the heights of the surround speakers must be flush with the middle level of the screen. In the case of stadium seating, the height may be maintained in proportion to the arrangement of seats.
The ceiling speakers arranged in two rows may be horizontally aligned with the speakers of front speakers in second rows from the left and right sides. Furthermore, the ceiling speakers may be located in three side surround regions.
High-multichannel audio data, such as 30.2 channel data shown in
Referring to
A left (L) channel in front of a screen corresponds to the leftmost loudspeaker behind the screen on the basis of an auditorium.
A center (C) channel in front of the screen corresponds to a center loudspeaker behind the screen on the basis of the auditorium.
A right (R) channel in front of the screen corresponds to the rightmost loudspeaker behind the screen on the basis of the auditorium.
Screen sub (LFE) channels below the screen correspond to bandwidth-limited low frequency-dedicated loudspeakers located at an end of the screen.
Left surround (LS) channels on the left side of the auditorium correspond to sets of loudspeaker located along a left wall between the screen and a rear wall facing the theater screen.
Right surround (RS) channels on the right side of the auditorium correspond to sets of loudspeakers located along a right wall between the screen and the rear wall facing the theater screen.
Center Surround (CS) channels correspond to sets of loudspeakers located at the center of the rear wall facing the screen.
A left center (LC) channel in front of the screen corresponds to a loudspeaker located between the center of the screen and a left end.
A right center (RC) channel screen in front of the screen corresponds to a loudspeaker located between the center and a right end.
A VERTICAL HEIGHT LEFT (VHL), a VERTICAL HEIGHT CENTER (VHC) and a VERTICAL HEIGHT RIGHT (VHR) channels correspond to loudspeakers located at and suspended from the upper end of the screen.
A top center surround (TS) channel corresponds to a loudspeaker located at a horizontal and vertical center above the auditorium.
A left wide (LW) channel corresponds to a loudspeaker exposed on the left side of the screen.
A right wide (RW) channel corresponds to a loudspeaker exposed on the right side of the screen.
A rear left surround (RLS) channel corresponds to a loudspeaker on the left side of a rear wall facing the screen.
A rear right surround (RRS) channel corresponds to a loudspeaker on the right side of the rear wall facing the screen.
In addition to the channels shown in
As described in conjunction with
In particular, decoding or decompressing transmitted data and output sounds is not a heavy load in a theater in which high-multichannel audio system equipment has been installed because expensive equipment has been installed, whereas performing decoding or decompression for audio output may be a heavy load in a theater in which only a conventional sound system, such as a 5.1 channel system, has been installed.
Accordingly, the present invention enables the generation of optimum audio data packets to be achieved by recording direct audio channel data, such as 5.1 channel data, in channel containers without compression and reducing the amount of indirect audio channel data corresponding to high-multichannel audio, such as 16 or more channel audio, through compression and then recording the compressed indirect audio channel data in channel containers.
Furthermore, the present invention enables the generation of optimum audio data packets to be achieved by recording uncompressed audio channel data, such as 5.1 channel data, in channel containers without compression and reducing the amount of compressed audio channel data corresponding to high-multichannel audio, such as 16 or more channel audio, through compression and then recording the compressed indirect audio channel data in channel containers.
Referring to
Since the 5.1 channel sound corresponds to channels (direct audio channels) that are mapped to physical channels in a 1:1 correspondence, channels (indirect audio channels) that are not mapped to physical channels in a 1:1 correspondence may be assigned to the remaining 10 channel containers.
In this case, each of the physical channels refers to a channel used in a theater sound system, and may correspond to a single loudspeaker or a plurality of loudspeakers. That is, an audio channel with which a theater provided with a digital audio packet can immediately output sound is a direct audio channel, and an audio channel with which a theater must perform decoding or decompression to output sound is an indirect audio channel.
Furthermore, since the 5.1 channel sound corresponds to channels (uncompressed audio channels) that are mapped to physical channels in a 1:1 correspondence and are kept uncompressed, channels (compressed audio channels) generated by compressing pieces of audio channel data corresponding to respective physical channels may be assigned to the remaining 10 channel containers.
That is, an audio channel with which a theater provided with a digital audio packet can immediately output sound without a decompression or decoding operation is an uncompressed audio channel, and an audio channel with which a theater must perform decoding or decompression to output sound is a compressed audio channel.
Referring to
Since 7.1 channel sound corresponds to channels (direct audio channels) that are mapped to physical channels in a 1:1 correspondence, channels (indirect audio channels) that are not mapped to physical channels in a 1:1 correspondence may be assigned to the remaining 8 channel containers.
Furthermore, since 7.1 channel sound corresponds to channels (uncompressed audio channels) that are mapped to physical channels in a 1:1 correspondence and are kept uncompressed, channels (compressed audio channels) generated by compressing pieces of audio channel data corresponding to respective physical channels may be assigned to the remaining 8 channel containers.
Referring to
More specifically, in the case of providing 5.1 channels, HI/VI and motion data, 6 pieces of 5.1 channel audio data may be contained and transmitted in channel containers 1 to 6 in the sequence of L, R, C, LFE, Ls and Rs, the HI and VI channels, i.e., channels for hearing-impaired persons and visually impaired persons, may be contained and transmitted in channel containers 7 and 8, and a channel for motion data may be contained and transmitted in channel container 13. In this case, channel containers for containing indirect audio channel data (or compressed audio channel data) may be any one or more of channel containers 9 to 12 and 14 to 16. Furthermore, in an embodiment, one or more of channels for providing HI/VI and motion data may be omitted, in which case channel containers corresponding to the omitted channels may be assigned for indirect audio channel data (or compressed audio channel data).
In the case of providing 7.1 channel SDDS, HI/VI and motion data, 8 pieces of 7.1 channel SDDS audio data may be contained and transmitted in channel containers 1 to 6 and 9 to 10 in the sequence of L, R, C, LFE, Ls, Rs, Lc and Rc, HI and VI channels, i.e., channels for hearing-impaired persons and visually impaired persons, may be contained and transmitted in channel containers 7 and 8, and a channel for the motion data may be contained and transmitted in channel container 13. In this case, channel containers for containing indirect audio channel data (or compressed audio channel data) may be any one or more of channel containers 11 to 12 and 14 to 16. Furthermore, in an embodiment, one or more of the channels for providing HI/VI and motion data may be omitted, in which case channel containers corresponding to the omitted channels may be assigned for indirect audio channel data (compressed audio channel data).
In the case of providing 7.1 channel DS, HI/VI and motion data, 8 pieces of 7.1 channel DS audio data may be contained and transmitted in channel containers 1 to 6, 11 and 12 in the sequence of L, R, C, LFE, Lss, Rss, Lrs and Rrs, HI and VI channels, i.e., channels for hearing-impaired persons and visually impaired persons, may be contained and transmitted in channel containers 7 and 8, and a channel for motion data may be contained and transmitted in channel container 13. In this case, channel containers for containing indirect audio channel data (compressed audio channel data) may be any one or more of channel containers 9, 10 and 14 to 16. Furthermore, in an embodiment, one or more of the channels for providing HI/VI and motion data may be omitted, in which case channel containers corresponding to the omitted channels may be assigned for indirect audio channel data (compressed audio channel data).
As shown in
Referring to
More specifically, in the case of providing 5.1 channels and HI/VI, 6 pieces of 5.1 channel audio data may be contained and transmitted in channel containers 1 to 6 in the sequence of L, R, C, LFE, Ls and Rs, and HI and VI channels, i.e., channels for hearing-impaired persons and visually impaired persons, may be contained and transmitted in channel containers 7 and 8. In this case, channel containers for containing indirect audio channel data (or compressed audio channel data) may be any one or more of channel containers 9 to 16.
Furthermore, in the case of providing 7.1 channel DS and HI/VI, 8 pieces of 7.1 channel DS audio data may be contained and transmitted in channel containers 1 to 6 and 11 to 12 in the sequence of L, R, C, LFE, Lss, Rss, Lrs and Rrs, and HI and VI channels, i.e., channels for hearing-impaired persons and visually impaired persons, may be contained and transmitted in channel containers 7 and 8. In this case, channel containers for containing indirect audio channel data (compressed audio channel data) may be any one or more of channel containers 9, 10, and 13 to 16.
Pieces of audio channel data that are indirectly mapped to respective physical channels are assigns to channel containers to which indirect direct audio channel data has not been assigned. Furthermore, pieces of compressed audio channel data generated by compressing pieces of audio channel data corresponding to respective physical channel are assigned to channel containers to which uncompressed audio channel data has not been assigned.
In the example shown in
As described above, according to the present invention, a digital audio packet is generated using not only direct audio channel data (uncompressed audio channel data) not requiring decoding or compression but also indirect audio channel data (compressed audio channel data) enabling high-multichannel audio data to be contained in a relatively small number of channel containers via decoding or compression using high-end hardware, and thus backward compatibility can be ensured and also realistic 3D sound can be effectively provided.
In this case, the digital audio packet is a set of pieces of digital data, and may correspond to a bundle of packets, frames or pieces of data.
Referring to
As described above, both direct audio channel data and indirect audio channel data are all included in a single digital audio packet, and thus optimum backward compatibility can be achieved and also 16 or higher-channel realistic 3D sound can be provided.
Referring to
As described above, both uncompressed audio channel data and compressed audio channel data are all included in a single digital audio packet, and thus optimum backward compatibility can be achieved and also 16 or higher-channel realistic 3D sound can be provided.
Referring to
In this case, the physical channels refer to channels that are used for sound output in a theater. For example, the physical channels may be 6 channels, i.e., L, R, C, LFE, Ls and Rs channels, in the case of a 5.1 channel sound system, and may be 8 channels, i.e., L, R, C, LFE, Ls, Rs, Lc and Rc channels, in the case of a 7.1 channel sound system.
That is, the pieces of direct audio channel data may be pieces of data corresponding to L, R, C, LFE, Ls and Rs channels in a 5.1 channel sound system, and may be pieces of data corresponding to L, R, C, LFE, Ls, Rs, Lc and Rc channels in the case of a 7.1 channel sound system.
In this case, the pieces of direct audio channel data may be pulse code modulated wave files, and may be maintained in the state of being uncompressed while the digital audio packet is being packaged, distributed, and stored.
Furthermore, in the method of coding scalable digital audio according to the present embodiment of the present invention, pieces of indirect audio channel data that are synchronized with video sources corresponding to the respective pieces of direct audio channel data and that are indirectly mapped to respective physical channels are generated at step S1020.
In this case, the number of pieces of indirect audio channel data may be less than the number of physical channels corresponding to the pieces of indirect audio channel data.
For example, the pieces of indirect audio channel data may have been compressed such that original 32 (the number of physical channels) pieces of channel audio data can be contained in 6 (the number of pieces of indirect audio channel data) channels.
Furthermore, in the method of coding scalable digital audio according to the present embodiment of the present invention, a digital audio packet is generated using the pieces of direct audio channel data and the pieces of indirect audio channel data at step S1030.
In this case, the digital audio packet may include a plurality of channel containers each adapted to record digital audio channel data. Each of the pieces of direct audio channel data and the pieces of indirect audio channel data may be mapped to one of the plurality of channel containers in a one-to-one correspondence.
In this case, the pieces of indirect audio channel data may be recorded in at least part of the remainder of the channel containers in which the pieces of direct audio channel data have not been recorded.
In this case, the pieces of indirect audio channel data may be compressed in order to be recorded in a number of channel containers less than the number of physically required channel containers, and then may be recorded in the at least part of the remainder.
In this case, the digital audio packet may include 16 channel containers, the number of pieces of direct audio channel data may be equal to or more than 6 and equal to or less than 11, and the number of pieces of indirect audio channel data may be 6.
In this case, the number of pieces of direct audio channel data may be even. As described above, the number of corresponding channel containers may be made even by providing an even number of pieces of direct audio channel data, and thus an AES pair may be formed.
In this case, the pieces of indirect audio channel data may correspond to high-multichannel audio sources for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
In this case, the method of coding scalable digital audio, which is shown in
Referring to
In this case, physical channels refer to channels that are used for sound output in a theater. For example, the physical channels may be 6 channels, i.e., L, R, C, LFE, Ls and Rs channels, in the case of a 5.1 channel sound system, and may be 8 channels, i.e., L, R, C, LFE, Ls, Rs, Lc and Rc channels, in the case of a 7.1 channel sound system.
That is, the pieces of uncompressed audio channel data may be pieces of data corresponding to L, R, C, LFE, Ls and Rs channels in the case of a 5.1 channel sound system, and may be pieces of data corresponding to L, R, C, LFE, Ls, Rs, Lc and Rc channels in the case of a 7.1 channel sound system.
In this case, the pieces of uncompressed audio channel data may be respective pulse code modulated wave files, and may be maintained in the state of being uncompressed while the digital audio packet is being packaged, distributed, and stored.
Furthermore, in the method of coding scalable digital audio according to the present embodiment of the present invention, pieces of compressed audio channel data that are synchronized with video sources corresponding to the pieces of uncompressed audio channel data and that are generated by compressing pieces of audio channel data corresponding to respective physical channels are generated at step S2020.
In this case, the number of pieces of compressed audio channel data may be less than the number of physical channels corresponding to the pieces of compressed audio channel data.
For example, the pieces of compressed audio channel data may be have been compressed such that original 32 (the number of physical channels) pieces of channel audio data can be contained in 6 (the number of pieces of compressed audio channel data) channels.
Furthermore, in the method of coding scalable digital audio according to the present embodiment of the present invention, a digital audio packet is generated using the pieces of uncompressed audio channel data and the pieces of compressed audio channel data at step S2030.
In this case, the digital audio packet may include a plurality of channel containers each adapted to record digital audio channel data. Each of the pieces of uncompressed audio channel data and the pieces of compressed audio channel data may be mapped to one of the plurality of channel containers in a one-to-one correspondence.
In this case, the pieces of compressed audio channel data may be recorded in at least part of the remainder of the channel containers in which the pieces of uncompressed audio channel data have not been recorded.
In this case, the pieces of compressed audio channel data may be compressed in order to be recorded in a number of channel containers less than the number of physically required channel containers, and then may be recorded in the at least part of the remainder.
In this case, the digital audio packet may include 16 channel containers, the number of pieces of uncompressed audio channel data may be equal to or more than 6 and equal to or less than 11, and the number of pieces of compressed audio channel data may be 6.
In this case, the number of pieces of uncompressed audio channel data may be even. As described above, the number of corresponding channel containers may be made even by providing an even number of pieces of uncompressed audio channel data, and thus an AES pair may be formed.
In this case, the pieces of compressed audio channel data may correspond to high-multichannel audio sources for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
In this case, the method of coding scalable digital audio, which is shown in
Referring to
Furthermore, in the method of decoding scalable digital audio, the pieces of direct audio channel data are extracted from the digital audio packet at step S1120.
Furthermore, in the method of decoding scalable digital audio, the pieces of direct audio channel data are matched to the respective physical channels in a one-to-one correspondence and are then output at step S1130.
Referring to
Furthermore, in the method of decoding scalable digital audio, the pieces of uncompressed audio channel data are extracted from the digital audio packet at step S2120.
Furthermore, in the method of decoding scalable digital audio, the pieces of uncompressed audio channel data are matched to the respective physical channels in a one-to-one correspondence and are then output at step S2130.
Referring to
Furthermore, in the method of decoding scalable digital audio, the pieces of indirect audio channel data are extracted from the digital audio packet at step S1220.
Furthermore, in the method of decoding scalable digital audio, pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of indirect audio channel data are generated using the pieces of indirect audio channel data at step S1230.
In this case, at step S1230, the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of indirect audio channel data may be generated by performing a decompression process on the pieces of indirect audio channel data.
In this case, the pieces of audio channel data may be output as respective audio outputs of a high-multichannel audio system for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
Furthermore, in the method of decoding scalable digital audio, the pieces of audio channel data are matched to the physical channels in a one-to-one correspondence and are then output at step S1240.
Referring to
Furthermore, in the method of decoding scalable digital audio, the pieces of compressed audio channel data are extracted from the digital audio packet at step S2220.
Furthermore, in the method of decoding scalable digital audio, the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of compressed audio channel data are generated using the pieces of compressed audio channel data at step S2230.
In this case, at step S2230, the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of compressed audio channel data may be generated by performing a decompression process on the pieces of compressed audio channel data.
In this case, the pieces of audio channel data may be output as respective audio outputs of a high-multichannel audio system for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
Furthermore, in the method of decoding scalable digital audio, the pieces of audio channel data are matched to the respective physical channels in a one-to-one correspondence and are then output at step S2240.
In the methods of decoding scalable digital audio, which are shown in
In the methods of decoding scalable digital audio, which are shown in
In this case, the pieces of indirect audio channel data (the pieces of compressed audio channel data) may be recorded in at least part of the remainder of the channel containers in which the pieces of direct audio channel data (the pieces of uncompressed audio channel data) have not been recorded.
In this case, the digital audio packet may include 16 channel containers, the number of pieces of direct audio channel data (the pieces of uncompressed audio channel data) is equal to or more than 6 and equal to or less than 11, and the number of pieces of indirect audio channel data (the number of pieces of compressed audio channel data) may be 6.
In this case, the number of pieces of direct audio channel data (the number of pieces of uncompressed audio channel data) may be even.
Although not shown in
Referring to
The direct audio channel generation unit 1310 generates a plurality of pieces of direct audio channel data mapped to respective physical channels in a one-to-one correspondence.
In this case, the pieces of direct audio channel data may be pulse code modulated wave files, and may be maintained in the state of being uncompressed while the digital audio packet is being packaged, distributed, and stored.
In this case, the number of pieces of direct audio channel data may be even.
The indirect audio channel generation unit 1320 generates pieces of indirect audio channel data that are synchronized with video sources corresponding to the pieces of direct audio channel data and that are indirectly mapped to respective physical channels.
In this case, the number of pieces of indirect audio channel data may be less than the number of physical channels corresponding to the pieces of indirect audio channel data.
In this case, the pieces of indirect audio channel data may correspond to high-multichannel audio sources for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
The digital audio packet generation unit 1330 generates a digital audio packet using the pieces of direct audio channel data and the pieces of indirect audio channel data.
In this case, the digital audio packet may include a plurality of channel containers each adapted to record digital audio channel data. Each of the pieces of direct audio channel data and the pieces of indirect audio channel data may be mapped to one of the plurality of channel containers in a one-to-one correspondence.
In this case, the pieces of indirect audio channel data may be recorded in at least part of the remainder of the channel containers in which the pieces of direct audio channel data have not been recorded.
In this case, the pieces of indirect audio channel data may be compressed in order to be recorded in a number of channel containers less than the number of physically required channel containers, and then may be recorded in the at least part of the remainder.
In this case, the digital audio packet may include 16 channel containers, the number of pieces of direct audio channel data may be equal to or more than 6 and equal to or less than 11, and the number of pieces of indirect audio channel data may be 6.
In this case, the apparatus for coding scalable digital audio, which is shown in
Referring to
The uncompressed audio channel generation unit 2310 generates a plurality of pieces of uncompressed audio channel data that is maintained in the state of being uncompressed.
In this case, the pieces of uncompressed audio channel data may be pulse code modulated wave files, and may be maintained in the state of being uncompressed while the digital audio packet is being packaged, distributed, and stored.
In this case, the number of pieces of uncompressed audio channel data may be even.
The compressed audio channel generation unit 2320 generates pieces of compressed audio channel data that are synchronized with video sources corresponding to the pieces of uncompressed audio channel data and that are generated by compressing pieces of audio channel data corresponding to respective physical channels.
In this case, the number of pieces of compressed audio channel data may be less than the number of physical channels corresponding to the pieces of compressed audio channel data.
In this case, the pieces of compressed audio channel data may correspond to high-multichannel audio sources for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
The digital audio packet generation unit 2330 generates a digital audio packet using the pieces of uncompressed audio channel data and the pieces of compressed audio channel data.
In this case, the digital audio packet may include a plurality of channel containers each adapted to record digital audio channel data. Each of the pieces of uncompressed audio channel data and the pieces of compressed audio channel data may be mapped to one of the plurality of channel containers in a one-to-one correspondence.
In this case, the pieces of compressed audio channel data may be recorded in at least part of the remainder of the channel containers in which the pieces of uncompressed audio channel data have not been recorded.
In this case, the pieces of compressed audio channel data may be compressed in order to be recorded in a number of channel containers less than the number of physically required channel containers, and then may be recorded in the at least part of the remainder.
In this case, the digital audio packet may include 16 channel containers, the number of pieces of uncompressed audio channel data may be equal to or more than 6 and equal to or less than 11, and the number of pieces of compressed audio channel data may be 6.
In this case, the apparatus for coding scalable digital audio, which is shown in
Referring to
The audio packet reception unit 1410 receives a digital audio packet, including a plurality of pieces of direct audio channel data mapped to respective physical channels in a one-to-one correspondence and a plurality of pieces of indirect audio channel data indirectly mapped to respective physical channels.
In this case, the digital audio packet may include a plurality of channel containers each adapted to record digital audio channel data. Each of the pieces of direct audio channel data and the pieces of indirect audio channel data may be mapped to one of the plurality of channel containers in a one-to-one correspondence.
The direct audio channel extraction unit 1420 extracts the pieces of direct audio channel data from the digital audio packet.
In this case, the pieces of direct audio channel data may be pulse code modulated wave files, and may be maintained in the state of being uncompressed while the digital audio packet is being packaged, distributed, and stored.
In this case, the number of pieces of direct audio channel data may be even.
The indirect audio channel extraction unit 1430 extracts the pieces of indirect audio channel data from the digital audio packet.
In this case, the pieces of indirect audio channel data may be recorded in at least part of the remainder of the channel containers in which the pieces of direct audio channel data have not been recorded.
In this case, the digital audio packet may include 16 channel containers, the number of pieces of uncompressed audio channel data may be equal to or more than 6 and equal to or less than 11, and the number of pieces of compressed audio channel data may be 6.
The indirect audio channel decoding unit 1440 generates pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of indirect audio channel data the pieces of indirect audio channel data.
In this case, the indirect audio channel decoding unit 1440 may generate the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of indirect audio channel data by performing a decompression process on the pieces of indirect audio channel data.
In this case, the pieces of audio channel data may be output as respective audio outputs of a high-multichannel audio system for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
The audio channel output unit 1450 matches the pieces of audio channel data to the physical channels in a one-to-one correspondence and then performs output.
Although not shown in
Referring to
The audio packet reception unit 2410 receives a digital audio packet, including a plurality of pieces of uncompressed audio channel data maintained in the state of being uncompressed and a plurality of pieces of compressed audio channel data generated by compressing pieces of audio channel data corresponding to respective physical channels.
In this case, the digital audio packet may include a plurality of channel containers each adapted to record digital audio channel data. Each of the pieces of uncompressed audio channel data and the pieces of compressed audio channel data may be mapped to one of the plurality of channel containers in a one-to-one correspondence.
The uncompressed audio channel extraction unit 2420 extracts the pieces of uncompressed audio channel data from the digital audio packet.
In this case, the pieces of uncompressed audio channel data may be pulse code modulated wave files, and may be maintained in the state of being uncompressed while the digital audio packet is being packaged, distributed, and stored.
In this case, the number of pieces of uncompressed audio channel data may be even.
The compressed audio channel extraction unit 2430 extracts the pieces of compressed audio channel data from the digital audio packet.
In this case, the pieces of compressed audio channel data may be recorded in at least part of the remainder of the channel containers in which the pieces of uncompressed audio channel data have not been recorded.
In this case, the digital audio packet may include 16 channel containers, the number of pieces of uncompressed audio channel data may be equal to or more than 6 and equal to or less than 11, and the number of pieces of compressed audio channel data may be 6.
The compressed audio channel decoding unit 2440 generates the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of compressed audio channel data using the pieces of compressed audio channel data.
In this case, the compressed audio channel decoding unit 2440 may generate the pieces of audio channel data corresponding to a number of physical channels more than the number of pieces of compressed audio channel data by performing a decompression process on the pieces of compressed audio channel data.
In this case, the pieces of audio channel data may be output as respective audio outputs of a high-multichannel audio system for a theater, which are equal to or more than 16 channels and equal to or fewer than 256 channels.
The audio channel output unit 2450 matches the pieces of audio channel data to the physical channels in a one-to-one correspondence and then performs output.
Although not shown in
The methods of decoding scalable digital audio and the methods of encoding scalable digital audio according to the present invention may be implemented in the form of program instructions that can be executed by various computer means, and may be recorded on a computer-readable storage medium. The computer-readable storage medium may include program instructions, data files, and data structures solely or in combination. Program instructions recorded on the storage medium may have been specially designed and configured for the present invention, or may be known to or available to those who have ordinary knowledge in the field of computer software. Examples of the computer-readable storage medium include all types of hardware devices specially configured to record and execute program instructions, such as magnetic media, such as a hard disk, a floppy disk, and magnetic tape, optical media, such as compact disk (CD)-read only memory (ROM) and a digital versatile disk (DVD), magneto-optical media, such as a floptical disk, ROM, random access memory (RAM), and flash memory. Examples of the program instructions include machine code, such as code created by a compiler, and high-level language code executable by a computer using an interpreter.
The above-described methods and apparatuses for decoding/encoding scalable digital audio according to the present invention are not limited to the configurations and methods of the above-described embodiments, but some or all of the embodiments may be configured to be selectively combined such that the embodiments can be modified in various manners.
Number | Date | Country | Kind |
---|---|---|---|
10-2013-0044985 | Apr 2013 | KR | national |
10-2013-0044987 | Apr 2013 | KR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/KR2014/003466 | 4/21/2014 | WO | 00 |