This invention relates in general to digital systems and more specifically to a system and method for allowing fast angle or scene changing in playback of video productions such digital versatile disc (DVD) playback devices.
Today's video playback systems often provide several new features. One feature, called “multi-angle,” allows a user to select a desired camera angle from which to view a currently displayed scene. For example, if a user is watching a music video and the user would like a close-up of the singer, instead of the currently presented medium shot, the user can depress a button on a remote control device and select the singer's close-up angle. A user can select a different scene or shot, entirely, such as requesting that a view of a guitar player, rather than the singer, be displayed.
Any number and type of angle (or other scene selections) can be made as long as they are within the performance ability of the video playback system and as long as they are provided by the video content. Additional content, such as audio, menu, sub-picture or other information, can also be the subject of angle or scene changes. For example, in the case where a user selects a shot of the guitar player, the audio track can be changed so that the guitar sound is louder in the audio mix.
Typically, today's playback systems limit the number of possible angles that can be selected by a user at any point in time. For example, up to 9 different angles are provided in a standard DVD-Video specification published by the DVD Format/Logo Licensing Corporation, entitled “DVD Specifications for Read-Only Disc/Part 3: Video Specifications, Version 1.13” that is hereby incorporated by reference as if set forth in full in this specification.
Another limitation of today's playback systems is that the angle switch is not instantaneous. Typically there can be anywhere from one-half to 6 seconds, or so, of delay after a user makes an angle change selection until the selected angle actually appears on a display screen.
This delay is due to a number of factors. One factor is the time required to “flush” or update various buffers and other components in the playback system to remove data relating to the deselected track and fill the buffers with data relating to the selected track. Depending on the speed of the system, and the number and size of the buffers, the updating of buffers can take several seconds.
Another factor in the delay is that the video information on the DVD is stored in compressed form, and includes other associated data (e.g., parity, error correction, channel assignment, header information, etc.). The DVD data must be decompressed, or decoded, and the associated data may require additional processing before the video information is made available in a format that is fit for display. The delay due to decoding and other processing can typically be about one-half of a second.
Embodiments of the present invention provide faster user-selected scene switching during playback of a video production. In one embodiment, several channels of video (e.g., several video streams each depicting a different camera angle) are read from interleaved data on a DVD disc. Each channel's data is maintained in a channel buffer that is input to a video decoder via a selector. When a signal from a user input device indicates that a user has selected a new channel, the selector can immediately provide the selected channel's data from the channel buffer to the video decoder, thus providing an increase in switching speed.
In one embodiment the invention provides a method for providing fast scene switching in a video playback device, wherein the video playback device includes one or more buffers coupled to a decoder via a selector, the method comprising obtaining two or more encoded video streams; storing the two or more encoded video streams into the one or more buffers; accepting a signal from a user input device to select a video stream; and using the selector to direct the selected video stream to the decoder.
In another embodiment the invention provides an apparatus for providing fast scene switching, the apparatus comprising a video playback device, wherein the video playback device includes one or more buffers coupled to a decoder via a selector; an input for obtaining two or more encoded video streams; one or more instructions stored in a machine readable medium for storing the two or more encoded video streams into the one or more buffers; an input for receiving a signal from a user input device; and one or more instructions stored in a machine readable medium for using the selector to direct a selected video stream to the decoder, wherein the selected video stream is selected by the signal from the user input device to the decoder.
In another embodiment the invention provides a method for providing fast scene switching in a video playback device, wherein the video playback device includes a buffer coupled to a decoder, the method comprising reading information from two or more interleaved video streams; determining which of the two or more interleaved video streams is a current stream; storing information for the determined stream into the buffer; and using the selector to direct the selected video stream information to the decoder.
In another embodiment the invention provides an apparatus for providing fast scene switching, the apparatus comprising a video playback device, wherein the video playback device includes a buffer coupled to a decoder; an input for obtaining information from two or more interleaved video streams; one or more instructions stored in a machine readable medium for determining which of the two or more interleaved video streams is a current stream; one or more instructions stored in a machine readable medium for storing information for the determined stream into the buffer; and one or more instructions stored in a machine readable medium for using the selector to direct the selected video stream information to the decoder.
Two basic embodiments are presented. In each embodiment, data for multiple streams (e.g., multi-angle, scene branching, etc.) of video are read from a DVD disc or other content source. In a first embodiment, each separate stream's data is sent to a demultiplexer for sorting and storing in a buffer for later selection and presentation to a decoder and ultimately to a display device. Thus, in this first embodiment, data for each stream is assembled in sequence in a buffer. In a second embodiment, although data for each stream is still read from the DVD and provided to a demultiplexer, the demultiplexer only provides a currently single stream to be buffered, decoded and displayed.
In
Note that although the present invention is directed to DVD applications that any other source of video or image information can be acceptable. For example, a compact disc (CD) read-only memory (CDROM), hard disk drive, solid state memory, digital network (including satellite, digital subscriber line (DSL), cable), etc. can be a source for video information.
Demux/Mux 120 includes circuitry to detect different streams from the video stream source (in this case DVD 100 and read hardware 110) and segregate the information into separate video streams. In a preferred embodiment, a DVD playback system is designed to process video data compressed according to a Motion Picture Experts Group (MPEG) standard, with interleaving and other format characteristics as specified in the DVD standards described above, and in related DVD specifications published by the same standards body. In the present standard, up to 9 different video streams, audio and subpicture information can be obtained from the DVD.
Demux/Mux 120 can include any type of circuitry or process for performing stream information segregation. In a preferred embodiment, a demultiplexer is used to separate the different interleaved blocks and a multiplexer is used to direct blocks belonging to a specific stream, or channel, to a predetermined location in stream buffer 130.
The video streams, audio and subpicture information are segregated into different areas in stream buffer 130. Note that stream buffer 130 can be one or more separate blocks of memory, chips, systems, etc. It is not strictly necessary to place all information into a single physical buffer, but a typical design may realize benefits of speed, lower manufacturing cost and simplicity of design by using a single buffer. Any number of channels can be accommodated. In
Channels may have associated other data such as an audio track or subpicture information. This associated data can be unique to each channel in which case the associated data can be segregated into one or more areas in the stream buffer, similarly to the channel information. Or, as shown in
At the output of stream buffer 130 is selector 140. Selector circuitry is responsive to a signal from a user input device, such as remote control 170, to direct specific stream or channel information from stream buffer 130 to the input of decoder 150. Decoder 150 can be a standard DVD decoder for decompressing the channel information into image information for display 160. In a preferred embodiment, selector 140 includes a demultiplexer for selecting a portion of the stream buffer for output to the decoder. Note that different approaches to the design of
A second embodiment uses a demultiplexer with additional functionality. The demultiplexer is provided with multiple streams' data, as before, but the demultiplexer sends only the data for a current stream (i.e., the stream being viewed) to the track or angle buffer (or other buffer). This approach may have an advantage in that more of a standard DVD player's hardware can be used without modification.
This approach can utilize intelligence in the demultiplexer to filter the desired angle data to the decoder from a high-rate input stream combining all angle data. An enlarged track buffer feeds the demultiplexer at a rate of at least n * r, where n is the number of angles and r is the data rate of a single angle's bitstream. This embodiment may take advantage of existing back-end chipsets with programmable demultiplexers allowing for implementation without back-end hardware modifications.
Typically, a demultiplexer works at the “pack level” of data. Packs can include video, audio, subpicture, navigation packets and other data. Thus, there may not be defining characteristics in a video pack that declare to which angle the pack data belongs. In such cases, a modification can be made to either the pack or packet header of the data source (e.g., to the DVD data). Each header allows for a non-zero amount of stuffing, or padding data. The first byte of this padding data could be used to designate the angle number the pack or packet belongs to, allowing the demultiplexer to identify the stream's context on the fly. For a DVD source this modification can be done during the authoring stage, using a customized authoring or multiplexing tool. Additional details on pack structure can be found in, e.g., DVD Specifications for Read-Only Disc/Part 3 v1.1, sec. VI5.
In a preferred embodiment, the demultiplexer is assured of having access to the next requested angle at the next “Group of Pictures” (GOP) boundary in order to prevent decoder video input buffer underflow. This may require data transfer between the track buffer and demultiplexer at higher than n * r immediately following a new angle requests. It is possible to add greater intelligence to the front-end in order to parse the data and eliminate redundant data (such as audio and subpicture packs which are replicated among all angles) and/or to re-interleave data at the pack level to shorten distance between data belonging to any particular angle.
Although the invention has been described with reference to specific embodiments thereof, these embodiments are merely illustrative, and not restrictive, of the invention. For example, although embodiments of the invention has been discussed primarily with respect to a DVD medium, any other type of media or source of audio tracks can be used to equal advantage. A storage medium can include magnetic, optical, memory, etc. Embodiments of the invention can be used with a streaming audio source such as from a satellite, cable television network, telephone modem, or Internet or other digital network system or communication channel. Any digital transmission system, format, encoding, encryption or compression approaches can be used with the present invention.
Although a user input device is a preferred way of obtaining an angle change selection, other embodiments may use other triggers for an angle change selection. For example, a change selection or other command or input can be automatically generated by a device, obtained from a network or other data source, or obtained in other ways. Other embodiments can also use multiple decoders so that multiple streams are decoded simultaneously by different decoders. It may also be possible to use multiple decoders on a single stream as where different decoders process different pieces of a buffered stream. Other variations are possible.
Any suitable programming language can be used to implement the routines of the present invention including C, C++, Java, assembly language, etc. Different programming techniques can be employed such as procedural or object oriented. The routines can execute on a single processing device or multiple processors. Although the steps, operations or computations may be presented in a specific order, this order may be changed in different embodiments. In some embodiments, multiple steps shown as sequential in this specification can be performed at the same time. The sequence of operations described herein can be interrupted, suspended, or otherwise controlled by another process, such as an operating system, kernel, etc. The routines can operate in an operating system environment or as stand-alone routines occupying all, or a substantial part, of the system processing.
In the description herein, numerous specific details are provided, such as examples of components and/or methods, to provide a thorough understanding of embodiments of the present invention. One skilled in the relevant art will recognize, however, that an embodiment of the invention can be practiced without one or more of the specific details, or with other apparatus, systems, assemblies, methods, components, materials, parts, and/or the like. In other instances, well-known structures, materials, or operations are not specifically shown or described in detail to avoid obscuring aspects of embodiments of the present invention.
A “computer-readable medium” or “machine-readable medium” for purposes of embodiments of the present invention may be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, system or device. The computer readable medium can be, by way of example only but not by limitation, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, system, device, propagation medium, or computer memory.
A “processor” or “process” includes any human, hardware and/or software system, mechanism or component that processes data, signals or other information. A processor can include a system with a general-purpose central processing unit, multiple processing units, dedicated circuitry for achieving functionality, or other systems. Processing need not be limited to a geographic location, or have temporal limitations. For example, a processor can perform its functions in “real time,” “offline,” in a “batch mode,” etc. Portions of processing can be performed at different times and at different locations, by different (or the same) processing systems. Although specific media (e.g., DVD, CD, CDROM) may be discussed, any type of machine-readable media can be used.
Reference throughout this specification to “one embodiment”, “an embodiment”, or “a specific embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention and not necessarily in all embodiments. Thus, respective appearances of the phrases “in one embodiment”, “in an embodiment”, or “in a specific embodiment” in various places throughout this specification are not necessarily referring to the same embodiment. Furthermore, the particular features, structures, or characteristics of any specific embodiment of the present invention may be combined in any suitable manner with one or more other embodiments. It is to be understood that other variations and modifications of the embodiments of the present invention described and illustrated herein are possible in light of the teachings herein and are to be considered as part of the spirit and scope of the present invention.
Embodiments of the invention may be implemented by using a programmed general purpose digital computer, by using application specific integrated circuits, programmable logic devices, field programmable gate arrays, optical, chemical, biological, quantum or nanoengineered systems, components and mechanisms may be used. In general, the functions of the present invention can be achieved by any means as is known in the art. Distributed, or networked systems, components and circuits can be used. Communication, or transfer, of data may be wired, wireless, or by any other means.
It will also be appreciated that one or more of the elements depicted in the drawings/figures can also be implemented in a more separated or integrated manner, or even removed or rendered as inoperable in certain cases, as is useful in accordance with a particular application. It is also within the spirit and scope of the present invention to implement a program or code that can be stored in a machine-readable medium to permit a computer to perform any of the methods described above.
Additionally, any signal arrows in the drawings/Figures should be considered only as exemplary, and not limiting, unless otherwise specifically noted. Furthermore, the term “or” as used herein is generally intended to mean “and/or” unless otherwise indicated. Combinations of components or steps will also be considered as being noted, where terminology is foreseen as rendering the ability to separate or combine is unclear.
As used in the description herein and throughout the claims that follow, “a”, “an”, and “the” includes plural references unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise.
The foregoing description of illustrated embodiments of the present invention, including what is described in the Abstract, is not intended to be exhaustive or to limit the invention to the precise forms disclosed herein. While specific embodiments of, and examples for, the invention are described herein for illustrative purposes only, various equivalent modifications are possible within the spirit and scope of the present invention, as those skilled in the relevant art will recognize and appreciate. As indicated, these modifications may be made to the present invention in light of the foregoing description of illustrated embodiments of the present invention and are to be included within the spirit and scope of the present invention.
Thus, while the present invention has been described herein with reference to particular embodiments thereof, a latitude of modification, various changes and substitutions are intended in the foregoing disclosures, and it will be appreciated that in some instances some features of embodiments of the invention will be employed without a corresponding use of other features without departing from the scope and spirit of the invention as set forth. Therefore, many modifications may be made to adapt a particular situation or material to the essential scope and spirit of the present invention. It is intended that the invention not be limited to the particular terms used in following claims and/or to the particular embodiment disclosed as the best mode contemplated for carrying out this invention, but that the invention will include any and all embodiments and equivalents falling within the scope of the appended claims.
Although specific examples of standard or prior art hardware have been described, embodiments of the invention can be adapted for use with any suitable hardware including variations from the standard hardware, or future playback devices.
The scope of the invention is to be determined solely by the appended claims.
This invention claims priority from U.S. Provisional Patent Application Ser. No. 60/548,207 [Attorney Doc. No. 021572-001100US] filed on Feb. 27, 2004 which is hereby incorporated by reference as if set forth in full in this document.
Number | Date | Country | |
---|---|---|---|
60548207 | Feb 2004 | US |