1. Statement of the Technical Field
The inventive arrangements relate generally to methods and apparatus for video systems, and more particularly to digital video disc (DVD) storage media.
2. Description of the Related Art
DVD (digital video disc or digital versatile disc) is an optical disc technology which can store much more data than a CD-ROM. In particular, using conventional red laser technology, a single layer DVD can hold 4.7 GB of data on each of its two sides and a dual layer DVD can hold 9.0 GB of data on each of its two sides. In comparison, a CD-ROM can hold approximately 600 MB of data. Due to their large storage capacity and their convenience of use, DVD's have quickly become the preferred storage medium for video replay, replacing both video cassette tapes and laser discs. In particular, a typical DVD-video can hold a 133-minute movie on one of its two sides using MPEG-2 file compression. The other side of the DVD oftentimes is used for DVD identification and labeling purposes.
There are potentially two primary presentation formats for movies recorded on DVD's, namely standard definition (SD) which has either a 16:9 or 4:3 aspect ratio, and high definition (HD) which has a 16:9 aspect ratio. However, because a typical single layer DVD only can hold up to a 133 minute SD movie on a side, DVD-videos are usually provided only with one of the two presentation formats. In some instances, however, both sides of the DVD are used to provide both presentation formats. Specifically, a standard definition version of a movie can be provided on one side of the DVD while a high definition version of the movie could be provided on the other side. Unfortunately, when both sides of the DVD are used for data storage, there is little room for DVD identification and labeling. Accordingly, what is needed is a DVD storage technology which can be used to store both SD and HD versions of a movie on a single side of a DVD. Moreover, such a DVD should be compatible with existing SD-DVD players.
The present invention relates to a method of providing multiple versions of a digital recording by multiplexing a base layer with an enhancement layer on a storage medium. The base layer can have base data representing a first version of the digital recording and the enhancement layer can have enhancement data which can be combined with the base data to represent a second version of the digital recording. The first version of the digital recording can include standard definition program content, and the second version of the digital recording can include high definition program content.
The multiplexing step can include the step of interleaving the base layer with the enhancement layer. The interleaving step can further include the steps of dividing video objects within the base layer into base interleave units and dividing video objects within the enhancement layer into enhancement interleave units. The base interleave units and enhancement interleave units can be stored on the storage medium in an alternating fashion. The base layer and the enhancement layer can further be stored on a single side of the storage medium, which can be a digital video disc (DVD).
The number of base interleave units associated with the base layer can be approximately equal to the number of enhancement interleave units associated with the enhancement layer. Also, the playback time correlating to the base interleave units can be approximately equal to a playback time correlating to the enhancement interleave units.
The base data can be stored in a format substantially similar to MPEG-2 and the enhancement data can be stored in a format substantially similar to H.264. Further, one or more time stamps can be provided with the base layer. For example, a decoder time stamp and a presentation time stamp can be provided with the base layer. Time stamps also can be provided for the enhancement layer.
The method also can include the step of providing a first program chain which links together cells associated with base interleave units. Further, a second program chain can be provided which can be merged with the first program chain to form a hybrid program chain. The hybrid program chain can link together the cells associated with the base interleave units and cells associated with enhancement interleave units in an order appropriate for generating the second version of the digital recording.
Alternatively, a first program chain can be provided which links together cells associated with base interleave units to generate a first bitstream. Further, a second program chain can be provided which links together cells associated with enhancement interleave units to generate a second bitstream. The first and second bitstreams can be merged during playback to generate the second version of the digital recording.
The present invention also relates to a DVD medium including a base layer and an enhancement layer. The base layer can include base data representing a first version of a digital recording and the enhancement layer can include enhancement data which can be combined with the base data to represent a second version of the digital recording. The second version of the digital recording can include high definition program content. The base layer and the enhancement layer can be interleaved.
Video objects within the base layer can be divided into base interleave units and video objects within the enhancement layer can be divided into enhancement interleave units. The base interleave units and the enhancement interleave units can be stored on the storage medium in an alternating fashion. A number of base interleave units can be approximately equal to a number of enhancement interleave units. Moreover, a playback time correlating to the base interleave units can be approximately equal to a playback time correlating to the enhancement interleave units.
The base data can be stored on the DVD medium in a format substantially similar to MPEG-2 and the enhancement data can be stored on the DVD medium in a format substantially similar to H.264. Further, the base layer and the enhancement layer can be stored on a single side of the DVD medium. The base layer and the enhancement layer can include at least one time stamp, for example a decoder time stamp and/or a presentation time stamp.
The DVD medium can include a first program chain which links together cells associated with base interleave units. The DVD medium also can include a second program chain which can be merged with the first program chain to form a hybrid program chain. The hybrid program chain can link together the cells associated with the base interleave units and cells associated with enhancement interleave units in an order appropriate for high definition playback.
Alternatively, the DVD medium can include a first program chain which links together cells associated with base interleave units to generate a first bitstream. The DVD medium also can include a second program chain which links together cells associated with enhancement interleave units to generate a second bitstream. The first and second bitstreams can be merged during playback to generate the second version of the digital recording.
An embodiment of the present invention describes a scalable video encoding scheme that can allow multiple versions of a title, such as a movie or other presentation, to be stored on a single storage medium. For example, one standard definition (SD) version and one high definition (HD) version of a title can be stored on a single digital video disc (DVD). In particular, SD data representing the SD version can be coded and stored on the DVD using a compression scheme compatible with existing SD-DVD players, for example MPEG-2. The HD version can be coded as enhancement data which, when combined with the SD data, can be used to generate HD playback in a hybrid HD-DVD player. Accordingly, DVD's can be produced which are compatible both with SD-DVD players and hybrid HD-DVD players. Notably, the SD data can be coded as a base layer on the DVD while the enhancement data can be coded as an enhancement layer. The base layer and the enhancement layer can be interleaved using the multi-angle/seamless branching feature of DVD.
Referring to
The decomposition unit 110 can parse the original HD sequence into base pixels and enhancement pixels. The base pixels can be pixels representing an SD version of the sequence. The enhancement pixels can be pixels which can be recombined with the base pixels to represent an HD version of the sequence.
The base encoder 112 can encode the base pixels into a format recognizable by an SD-DVD player and output a base data bitstream. Similarly, the enhancement encoder 124 can encode the enhancement pixels using a suitable coding scheme and output an enhancement data bitstream. In the preferred arrangement, the compression scheme provided by the encoders 112, 124 should provide adequate coding efficiency to store both the SD data and the enhancement data on a single side of a DVD. Accordingly, one side of the DVD can be available for labeling purposes. For example, the base encoder 112 can encode the base pixels using an MPEG-2 format. Compression schemes which can be used for coding the enhancement data include modified versions of H.264, also known as JVT, MPEG-4 Part 10 or AVC, or any other suitable compression scheme. Although MPEG-2 also can be used for coding the enhancement data, MPEG-2 may not provide as much compression as other compression schemes, such as H.264. In consequence, the use of MPEG-2 for coding the enhancement data may limit the length or quality of an HD version of a title stored on a DVD.
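As a rough illustration of the coding-efficiency constraint, the average bit rate available on one side of a disc can be estimated from the capacity and running time cited in the background. The sketch below assumes that 4.7 GB denotes 4.7 x 10^9 bytes and ignores audio, navigation data and file system overhead. The function name and the even split between layers are hypothetical and are chosen only for illustration.

```python
def average_bitrate_mbps(capacity_bytes: float, minutes: float) -> float:
    """Average bit rate (Mbit/s) that fills capacity_bytes in the given time."""
    return capacity_bytes * 8 / (minutes * 60) / 1e6

# Figures from the background: 4.7 GB on a single layer side, 133-minute title.
total = average_bitrate_mbps(4.7e9, 133)

# Hypothetical split: the base layer takes half of the budget and the
# enhancement layer takes the rest. The real split is an encoder design choice.
base_share = 0.5
enhancement = total * (1 - base_share)
```

Under these assumptions the whole side averages a little under 5 Mbit/s, so any budget given to the enhancement layer has to come out of the base layer's share or out of the running time.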
In the instance that a modified version of H.264 is used, the modification can include an extra filtering step which can be used during motion compensation. Further, transform coefficients can be scanned in an order that differs from the order in which transform coefficients are scanned for non-scalable H.264. The new scan order can place base layer coefficients representing blocks of pixels before associated enhancement layer coefficients. Restrictions also can be made on coding modes.
It should be noted that audio/video compression schemes are known to the skilled artisan. Notably, both MPEG and H.26x are evolving sets of standards for video and audio compression. MPEG standards are generated by the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC), and currently include MPEG-1, MPEG-2, MPEG-4, and MPEG-7. MPEG-21 is currently under development. H.26x standards are generated by the Telecommunication Standardization Sector of the International Telecommunication Union (ITU-T) and currently include H.261, H.262, H.263 and H.264. Of course, video and audio coding standards are continually evolving. Accordingly, those skilled in the art will recognize that the present invention is not limited to the particular coding standards identified herein.
Continuing in
The interpolator 116 can interpolate blocks of reconstructed base pixels into blocks of pixels that are size compatible with HD pixel blocks. For example, the interpolator 116 can interpolate 11×9 blocks of reconstructed base pixels into 16×16 blocks of reconstructed base pixels. The decomposition unit 110 then can subtract the 16×16 blocks of reconstructed base pixels from correlating blocks of pixels in the original HD sequence to generate the enhancement pixels.
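The subtraction described above can be sketched as follows. This is a minimal illustration, assuming that blocks are plain lists of rows of integer samples and that the base block has already been interpolated to the size of the HD block. The function name is hypothetical and the interpolation itself is not shown.

```python
def enhancement_residual(hd_block, interp_base_block):
    """Subtract an interpolated, reconstructed base block from the
    co-located original HD block, sample by sample."""
    if len(hd_block) != len(interp_base_block):
        raise ValueError("blocks must have the same number of rows")
    residual = []
    for hd_row, base_row in zip(hd_block, interp_base_block):
        if len(hd_row) != len(base_row):
            raise ValueError("rows must have the same length")
        residual.append([h - b for h, b in zip(hd_row, base_row)])
    return residual

# Tiny 2x2 stand-in for a 16x16 block.
hd = [[120, 130], [125, 128]]
base = [[118, 133], [125, 120]]
residual = enhancement_residual(hd, base)  # [[2, -3], [0, 8]]
```

The resulting enhancement samples are signed, since the interpolated base can overshoot as well as undershoot the original.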
Further, the summing block 118 can sum the 16×16 blocks of reconstructed base pixels with reconstructed enhancement pixel blocks generated by the enhancement encoder 124 to generate summed pixel blocks. The summed pixel blocks can be trimmed to usable values by the clipper 120. For example, the clipper 120 can limit 8 bit summed pixels to values between −128 and 127. The HD frame buffer 122 can buffer the summed pixel blocks for use by the enhancement encoder 124 in creating groups of pictures in the enhancement data bitstream.
Lastly, a data store 126 can be provided for storing the base data bitstream and the enhancement data bitstream. The data store 126 can be an optical storage medium, a magnetic storage medium, a magneto-optical storage medium, an electronic storage medium, or any other storage medium which can store digital data. For example, in one arrangement, the data store 126 can be a DVD. The DVD can be single layer or multi-layer. Moreover, the DVD can contain data on one or two sides. In another arrangement, the data store 126 can be another storage type, such as a hard disc drive (HDD), RAM, and so on. In such an arrangement, the base and enhancement data streams can be transferred to one or more DVD's from the data store 126.
The base data bitstream can be recorded onto the DVD as a base layer and the enhancement data bitstream can be recorded onto the DVD as an enhancement layer. The base layer and enhancement layer can be multiplexed such that SD-DVD players can read and decode the base layer and a hybrid HD-DVD player can read and decode both layers.
In a preferred arrangement, the base layer and enhancement layer are multiplexed by interleaving the layers. Referring to
For example, a video object within the base layer can comprise base interleave units ILVUB1, ILVUB2, ILVUB3, etc. Likewise, a video object within the enhancement layer can comprise enhanced interleave units ILVUE1, ILVUE2, ILVUE3, and so on. In such an instance, the interleave units can be interleaved into the following order: ILVUB1, ILVUE1, ILVUB2, ILVUE2, ILVUB3, ILVUE3.
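The alternating order given above can be sketched in a few lines. The example assumes equal numbers of base and enhancement interleave units and represents each unit only by its label; the function name is hypothetical.

```python
from itertools import chain

def interleave_units(base_units, enhancement_units):
    """Alternate base and enhancement interleave units, base first."""
    if len(base_units) != len(enhancement_units):
        raise ValueError("this sketch assumes equal unit counts")
    return list(chain.from_iterable(zip(base_units, enhancement_units)))

base = ["ILVUB1", "ILVUB2", "ILVUB3"]
enh = ["ILVUE1", "ILVUE2", "ILVUE3"]
layout = interleave_units(base, enh)
# layout == ["ILVUB1", "ILVUE1", "ILVUB2", "ILVUE2", "ILVUB3", "ILVUE3"]

# A player that follows only the base units would see:
sd_view = [u for u in layout if u.startswith("ILVUB")]
```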
Time stamps can be added to the base layer and to the enhancement layer. For example, a decoder time stamp (DTS) can be added to the base and enhancement layers. The DTS can synchronize decoding of video to ensure that the video can be played back at an appropriate time. A presentation time stamp (PTS) also can be added to the base and enhancement layers. The PTS can be used to coordinate presentation of pictures within a video stream. Accordingly, presentation of video can be synchronized with correlating audio portions of a title. The DTS and PTS time stamps can be stored in a header of each packetized elementary stream (PES). There can be one header associated with each physical sector on the storage medium. These time stamps can be associated with correlating intra (I) pictures.
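The different roles of the two time stamps can be illustrated with a small sketch. It assumes, for illustration only, that every picture carries both stamps, that the stamps count ticks of the 90 kHz clock used by MPEG-2 systems, and that the stream contains bidirectionally predicted (B) pictures. The class name, picture labels and frame rate are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class AccessUnit:
    picture: str   # picture label, e.g. "I0"
    dts: int       # decoder time stamp, in 90 kHz ticks
    pts: int       # presentation time stamp, in 90 kHz ticks

# 90000 * 1001 / 30000 ticks per frame at 29.97 frames/s (assumed rate).
TICKS_PER_FRAME = 3003

# Stored order: the P picture is decoded before the B pictures that
# reference it, but it is displayed after them.
stream = [
    AccessUnit("I0", dts=0 * TICKS_PER_FRAME, pts=1 * TICKS_PER_FRAME),
    AccessUnit("P3", dts=1 * TICKS_PER_FRAME, pts=4 * TICKS_PER_FRAME),
    AccessUnit("B1", dts=2 * TICKS_PER_FRAME, pts=2 * TICKS_PER_FRAME),
    AccessUnit("B2", dts=3 * TICKS_PER_FRAME, pts=3 * TICKS_PER_FRAME),
]

decode_order = [u.picture for u in sorted(stream, key=lambda u: u.dts)]
display_order = [u.picture for u in sorted(stream, key=lambda u: u.pts)]
```

Sorting by DTS gives the order in which the decoder consumes pictures, while sorting by PTS gives the order in which they are shown.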
The seamless branching and multi-angle aspects of DVD typically operate by providing multiple program chains. Each program chain can provide an alternate version of video playback. Program chains typically contain programs, which are ordered collections of pointers to cells within video objects. Each cell can be associated with one or more interleave units. The program chains can link cells together and indicate what order the cells are to be played. Importantly, individual cells can be used by more than one program chain.
In the present invention, at least two program chains (PGC's) can be created, one for SD playback of the DVD, and one enhancement program chain which can be used for HD playback of the DVD. The SD program chain can be recognizable by an SD-DVD player and can link together the cells associated with the base interleave units. Accordingly, the SD program chain can be used to generate an SD bitstream for playback.
The enhancement program chain can be recognized by hybrid HD-DVD players. In one arrangement, the enhancement program chain can be provided without an assigned entry point. The enhancement program chain can be designed such that the enhancement program chain can be merged with the SD program chain to form a hybrid program chain. The hybrid program chain can be formed once the HD-DVD player recognizes that a particular disc is a hybrid HD-DVD. In this arrangement, the hybrid program chain can link together the cells associated with the base interleave units and the cells associated with the enhancement interleave units in an order appropriate for HD playback.
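One way to picture the hybrid program chain is to model each program chain as an ordered list of pointers to cells. The sketch below is only a data-structure illustration and does not reflect the on-disc DVD navigation format. The cell identifiers, the one-to-one pairing of base and enhancement cells, and the function name are all assumptions.

```python
# Hypothetical cell identifiers; each cell maps to one interleave unit.
cells = {
    "B1": "ILVUB1", "E1": "ILVUE1",
    "B2": "ILVUB2", "E2": "ILVUE2",
    "B3": "ILVUB3", "E3": "ILVUE3",
}

# A program chain is modeled as an ordered list of pointers to cells.
sd_pgc = ["B1", "B2", "B3"]
enhancement_pgc = ["E1", "E2", "E3"]  # no entry point of its own

def merge_program_chains(sd_chain, enh_chain):
    """Form a hybrid chain that visits each base cell followed by its
    enhancement cell. Assumes the chains pair up one to one."""
    if len(sd_chain) != len(enh_chain):
        raise ValueError("chains must pair up one to one in this sketch")
    hybrid = []
    for base_cell, enh_cell in zip(sd_chain, enh_chain):
        hybrid.extend([base_cell, enh_cell])
    return hybrid

hybrid_pgc = merge_program_chains(sd_pgc, enhancement_pgc)
hd_units = [cells[c] for c in hybrid_pgc]
sd_units = [cells[c] for c in sd_pgc]
```

The base cells appear in both the SD chain and the hybrid chain, which matches the observation that individual cells can be used by more than one program chain.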
In an alternate arrangement, the enhancement program chain can link together the cells associated with the enhancement interleave units and can be used to generate an enhancement bitstream. The enhancement bitstream can be merged with the SD bitstream to generate an HD bitstream for HD playback.
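The text does not specify how the two bitstreams are merged during playback. One plausible approach, shown here purely as an assumption, is to merge them by decoder time stamp so that each base unit is immediately followed by the enhancement unit that shares its time stamp. The tuple layout and function name are hypothetical.

```python
import heapq

# (dts, layer, payload) tuples; each stream is already in DTS order.
sd_bitstream = [(0, "base", "b0"), (3003, "base", "b1"), (6006, "base", "b2")]
enh_bitstream = [(0, "enh", "e0"), (3003, "enh", "e1"), (6006, "enh", "e2")]

def merge_bitstreams(sd, enh):
    """Merge two DTS-ordered streams into one. Ties are resolved in favor
    of the first input, so a base unit precedes its enhancement unit."""
    return list(heapq.merge(sd, enh, key=lambda unit: unit[0]))

hd_bitstream = merge_bitstreams(sd_bitstream, enh_bitstream)
layers = [layer for _, layer, _ in hd_bitstream]
```

Because both inputs are consumed lazily and in order, a merge of this kind needs to buffer only one pending unit per stream.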
Referring to
All of the transformations shown in
The matrices that are provided for the horizontal and vertical downsampling will depend on which coefficients are coded into the base layer. Accordingly, if the coefficient selection is adaptive, either multiple versions of the matrices will be required, or additional processing steps will be required to complete the downsampling.
Referring now to
Referring to
As with the decomposition, the interpolation can be provided in two steps using a matrix transformation. The first step can include a pre-multiplication by a first interpolation matrix to interpolate vertically, and then a second step can be performed which includes a post-multiplication by a second interpolation matrix to interpolate horizontally. This two step process provides greater flexibility in the ratio of resolutions between the enhancement and base layers. The pre-multiplication and post-multiplication can be performed in any order, although the order in which the multiplications are performed should be the same for both the encoder and the decoder. Again, if the coefficient selection is adaptive, either multiple versions of the matrices will be required, or additional processing steps will be required to complete the interpolation.
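The two step interpolation can be sketched with small matrices. The interpolation matrices below are hypothetical linear interpolators that turn a 2x2 block into a 3x3 block; they are not the matrices of the described arrangement, whose values depend on which coefficients are coded into the base layer. Exact rational arithmetic is used so that the two multiplication orders can be compared directly.

```python
from fractions import Fraction

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

half = Fraction(1, 2)

# Hypothetical 3x2 vertical interpolation matrix: 2 rows in, 3 rows out,
# with the middle output row the average of the two input rows.
V = [[1, 0],
     [half, half],
     [0, 1]]

# Matching 2x3 horizontal matrix (the transpose of V): 2 columns in, 3 out.
H = [list(col) for col in zip(*V)]

block = [[10, 20],
         [30, 40]]

# Vertical first (pre-multiplication), then horizontal (post-multiplication).
vertical_first = matmul(matmul(V, block), H)
# Horizontal first, then vertical.
horizontal_first = matmul(V, matmul(block, H))
```

In exact arithmetic both orders give the same 3x3 block because matrix multiplication is associative. If intermediate results are rounded to integers after each step, the two orders can differ slightly, which is one plausible reason for requiring the same order in the encoder and the decoder.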
Referring to
The base decoder 610 can receive the base data bitstream, for example after it has been read from a DVD by a playback interface, and decode the base data bitstream into an uncompressed format. For example, the base decoder 610 can decode the base data bitstream from an MPEG-2 format into a luminance and chrominance video stream having SD frames formed of pixels reconstructed in the decoding process. The luminance and chrominance video stream then can be forwarded to a display, or further encoded into another format, for example into an NTSC, PAL, SECAM, S-video, or any other suitable format. The base decoder 610 can be coupled to an SD frame buffer 612 for buffering standard-definition frames during the decoding process. The SD frame buffer 612 can further provide an output suitable for an SD display.
The enhancement decoder 616 can receive an enhancement data bitstream, for example after it has been read from a DVD by a playback interface, and decode the enhancement data bitstream into an uncompressed format. For example, the enhancement decoder 616 can decode the enhancement data bitstream from an H.264 format into blocks of enhancement pixels which can be combined with blocks of reconstructed SD pixels for use in generating an HD video stream.
The interpolator 614 can receive reconstructed base pixels from the base decoder 610 and interpolate blocks of the reconstructed base pixels into blocks of pixels that are size compatible with HD pixel blocks. For example, the interpolator 614 can interpolate 11×9 blocks of base pixels into 16×16 blocks of base pixels. The interpolation process can follow an interpolation algorithm which is much the same as the interpolation process described for
The summing block 618 can sum blocks of base pixels generated by the interpolator 614 with blocks of pixels generated by the enhancement decoder 616 to generate reconstructed HD frames. Pixel values within the reconstructed HD frames can be trimmed to usable values by the clipper 620. For example, the clipper 620 can limit 8 bit pixels to values between −128 and 127. The HD frame buffer 622 can buffer the reconstructed HD frames for use by the enhancement decoder 616 in creating the enhancement pixels during the decoding process. The HD frame buffer 622 also can provide an output suitable for an HD display. For example, the output can provide a luminance and chrominance video stream having the reconstructed HD frames.
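The summing and clipping steps can be sketched as follows, assuming blocks are lists of rows of signed integer samples of equal size. The function names are hypothetical; the clipping range is the 8 bit example given above.

```python
def clip(value, low=-128, high=127):
    """Limit a sample to the inclusive range [low, high]."""
    return max(low, min(high, value))

def reconstruct_block(interp_base_block, enhancement_block):
    """Add enhancement samples to interpolated base samples, then clip."""
    return [[clip(b + e) for b, e in zip(base_row, enh_row)]
            for base_row, enh_row in zip(interp_base_block, enhancement_block)]

# Tiny 2x2 stand-in for a 16x16 block.
base = [[100, -120], [5, 127]]
enh = [[40, -20], [-3, 0]]
hd = reconstruct_block(base, enh)  # [[127, -128], [2, 127]]
```

Sums that fall outside the range, such as 140 and -140 in the first row, are pulled back to the nearest limit, while in-range sums pass through unchanged.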
The present invention can be realized in hardware, software, or a combination of hardware and software. The present invention can be realized in a centralized fashion in one computer system, or in a distributed fashion where different elements are spread across several interconnected computer systems. Any kind of computer system or other apparatus adapted for carrying out the methods described herein is suitable. A typical combination of hardware and software can be a general purpose computer system with a computer program that, when being loaded and executed, controls the computer system such that it carries out the methods described herein.
The present invention also can be embedded in a computer program product, which comprises all the features enabling the implementation of the methods described herein, and which when loaded in a computer system is able to carry out these methods. Computer program in the present context means any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after either or both of the following: a) conversion to another language, code or notation; b) reproduction in a different material form.
This invention can be embodied in other forms without departing from the spirit or essential attributes thereof. Accordingly, reference should be made to the following claims, rather than to the foregoing specification, as indicating the scope of the invention.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 60/396,397, entitled “Hybrid MPEG-2/H.26L Scalability for HD-DVD” and filed Jul. 16, 2002, and U.S. Provisional Patent Application Ser. No. 60/430,558, entitled “Hybrid Scalable CODEC For Single Disc SD/HD-DVD” and filed Dec. 3, 2002, both of which are incorporated by reference herein in their entirety.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US03/21943 | 7/15/2003 | WO | | 1/14/2005 |
Number | Date | Country | |
---|---|---|---|
60396397 | Jul 2002 | US | |
60430558 | Dec 2002 | US |