The present invention relates generally to data processing systems and more particularly to an active stream format for holding multiple media streams.
Conventional file and/or stream formats for transmitting multiple data streams of varying media are limited in several respects. First, these formats are generally limited in the packet sizes that are available for encapsulating data. Such formats, if they specify packets, specify the packets as a given fixed size. Another limitation of such formats is that they do not facilitate the use of error correction codes. A further weakness of these conventional formats is that they do not provide flexibility in timing models for rendering the data encapsulated within the format. An additional limitation with such formats is that they are not well adapted for different transport mediums that have different levels of reliability and different transmission capabilities.
In accordance with a first aspect of the present invention, a computer system has a logical structure for encapsulating multiple streams of data that are partitioned into packets for holding samples of data from the multiple data streams. A method of incorporating error correction into the logical structure is performed on the computer system. In accordance with this method, a portion of at least one packet is designated for holding error correcting data. The error correcting data is then stored in the designated portion of the packet.
In accordance with another aspect of the present invention, multiple streams of data are stored in packets and error correcting data is stored in at least some of the packets. The packets are encapsulated into a larger stream and information regarding what error correcting methods are employed for the packets is also stored in the packets.
In accordance with yet another aspect of the present invention, samples of data from multiple data streams are stored in packets, and replicas of information are stored in at least some of the packets. A flag is set in each of the packets that holds replicas to indicate that the packets hold the replicas. The packets are encapsulated into a larger logical structure and transmitted to a destination.
In accordance with a further aspect of the present invention, a logical structure is provided for encapsulating multiple streams of data where the streams of data are stored in packets. Clock licenses that dictate advancement of a clock are stored in multiple ones of the packets. The logical structure is transmitted from a source computer to a destination computer. The clock is advanced at the destination computer as dictated by the clock license for each packet that holds a clock license in response to the receipt or processing of the packet at the destination computer.
In accordance with an additional aspect of the present invention, a stream format is provided for encapsulating multiple streams of data. The stream format includes a field for specifying a packet size for holding samples of the multiple streams of data. In a logical structure that adopts the stream format, a value is stored in the field that corresponds to the desired packet size. Packets of the desired size are stored within the logical structure and the logical structure is transmitted over a transport medium to the destination.
In accordance with a further aspect of the present invention, a stream format is provided for encapsulating multiple streams of data. A field is included in a logical structure that adopts the stream format for holding a value that specifies a maximum bit rate at which the multiple streams may be rendered at the destination. A value is stored in the field and the logical structure is transmitted over a transport medium to a destination.
In accordance with another aspect of the present invention, a stream format is provided for encapsulating multiple data streams and a new media type is dynamically defined. An identifier of the media type is stored in a logical structure that adopts the stream format and packets of the new media type are stored in the logical structure.
The preferred embodiment of the present invention employs an active stream format (ASF) for holding multiple media streams. ASF is well suited for storage of multimedia streams as well as transmission of multiple media streams over a transport medium. ASF is constructed to encapsulate diverse multimedia streams and facilitates optimal interleaving of respective media streams. ASF specifies the packetization of data and provides flexibility in choosing packet sizes. In addition, ASF enables the specification of a maximum data transmission rate. As such, the packetization and transmission of media streams may be tailored to facilitate the bandwidth limitations of the system on which media streams are stored or transmitted.
ASF facilitates the use of error correction and error concealment techniques on the media streams. In unreliable transport mediums, such error correction and error concealment is highly beneficial. ASF is independent of media types and is extensible to handle newly defined media types. ASF supports flexible timing approaches and allows an author of an ASF stream to specify the synchronization of events. ASF supports synchronized rendering using a variety of synchronization clock types and provides index information which can be used as markers for lookup to provide playback features such as fast forward and fast reverse.
The header_object 32 is followed in the header section 28 by a properties_object 34, such as depicted in
The properties_object 34 also holds a created field 64 that holds a timestamp that specifies when the ASF stream was created. A num_packet field 65 holds a 64-bit value that defines the number of packets in the data section 30. A play_duration field 66 holds a 32-bit number that specifies the play duration of the entire ASF stream in 100-nanosecond units. For example, if the ASF stream 16 holds a movie, the duration field 66 may hold the duration of the movie. The play_duration field 66 is followed by a send_duration field 67 that corresponds to send the ASF stream in 100-nanosecond units. A preroll field 68 specifies the amount of time to buffer data before starting to play, and the flags field 70 holds 32-bits of bit flags.
The properties object 34 includes a min_packet_size field 72 and a max_packet_size field 74. These fields 72 and 74 specify the size of the smallest and largest packets 48 in the data section 30, respectively. These fields help to determine if the ASF stream 16 is playable from servers that are constrained by packet size. For constant bit rate streams, these values are set to have the same values. A maximum_bit_rate field 76 holds a value that specifies the maximum instantaneous bit rate (in bits per second) of the ASF stream.
One of the beneficial features of ASF is its ability for facilitating different packet sizes for data of multiple media streams.
As mentioned above, the properties_object 34 holds a value in the maximum_bit_rate field 76 that specifies an instantaneous maximum bit rate in bits per second that is required to play the ASF stream 16. The inclusion of this field 76 helps to identify the requirements necessary to play the ASF stream 16.
The header section 28 (
The stream_properties_object 36 holds at least three fields 92, 98 and 104 for holding information relating to error concealment strategies. In general, ASF facilitates the use of error concealment strategies that seek to reduce the effect of losing information regarding a given sample of media data. An example of an error concealment strategy is depicted in
The error_concealment_strategy field 92 holds a UUID that identifies the error concealment strategy that is employed by the associated stream. The error_concealment_len field 98 describes the number of bytes in an error concealment data block that is held in the error_concealment_data entries 104. The properties associated with the error concealment strategy are placed in the error_concealment_data entries 104. The number of entries will vary depending upon the error concealment strategy that is adopted.
The stream_properties_object 36 includes a stream_number field 100 that holds an alias to a stream instance. The stream_properties_object 36 also includes an offset field 94 that holds an offset value to the stream in milliseconds. This value is added to all of the timestamps of the samples in the associated stream to account for the offset of the stream with respect to the timeline of the program that renders the stream. Lastly, the stream_properties_object 36 holds a type_specific_len field 96 that holds a value that describes the number of bytes in the type_specific_data entries 102. The type_specific_data entries 102 hold properties values that are associated with the stream type.
The header section 28 (
The header section 28 may include a marker_object 40. The marker_object 40 holds a pointer to a specific time within the data section 30. The marker_object enables a user to quickly jump forward or backward to specific data points (e.g., audio tracks) that are designated by markers held within the marker_object 40.
The header section 28 may also include an error_correction_object 42 for an error correction method that is employed in the ASF stream. Up to four error correction methods may be defined for the ASF stream 16 and, thus, up to four error_correction_objects 42 may be stored within the header section 28 of the ASF stream 16.
The error_correction_object 42 includes an object_id field 150 and a size field 152, like those described above for the other objects in the header section 28. The error_correction_object 42 also includes an error_correction_id 154 that holds UUID that identifies the error correcting methodology associated with the object 42. The error_correction_data_len field 156 specifies the length in bytes of the error_correction_data entries 158 that hold octets for error correction. The error_correction_object 42 is used by the destination computer 12 (
The header section 28 of the ASF stream 16 may also hold a clock_object 44 that defines properties for the timeline for which events are synchronized and against which multimedia objects are presented.
The ASF stream 16 enables script commands to be embedded as a table in the script_command_object 45. This object 45 may be found in the header section 28 of the ASF stream 16. The script commands ride the ASF stream 16 to the client where they are grabbed by event handlers and executed.
The num_commands field 184 specifies the total number of script commands that are to be executed. The num_types field 186 specifies the total number of different types of script_command types that have been specified. The type_names field 188 is an array of type_names_struc data structures.
The command_entry field 190 identifies what commands should be executed at which point in the timeline. The command_entry field 190 is implemented as a table of script commands. Each command has an associated command_entry element 198 as shown in
The script commands may be of a URL type that causes a client browser to be executed to display an indicated URL. The script command may also be of a file name type that launches another ASF file to facilitate “continuous play” audio or video presentations. Those skilled in the art will appreciate that other types of script commands may also be used.
The header section 28 of the ASF stream 16 may also include a codec_object 46. The codec_object 46 provides a mechanism to embed information about a codec dependency that is needed to render the data stream by that codec. The codec object includes a list of codec types (e.g., ACM or ICM) and a descriptive name which enables the construction of a codec property page on the client.
As mentioned above, the data section 30 follows the header section 28 in the ASF stream 16. The data section includes a data_object 47 and interleave_packets 48. A data_object 47 marks the beginning of the data section 30 and correlates the header section 28 with the data section 30. The packets 48 hold the data payloads for the media stream stored within the ASF stream 16.
Each packet 48 has a format like that depicted in
The initial_structure 244 may also include opaque data 300 if the opaque_data bit 286 is set. The initial structure includes a byte of flags 302. The most significant bit is a reserved bit 304 that is set to a value of “0.” The next two bits constitute the packet_len_type field 306 that indicate the size of the packet_len field 256. The next subsequent two bits constitute the padding_len_type field 272 that indicate the size of the padding_len field 274. These two bits are followed by another 2-bit field that constitutes the sequence_type of field 276 that specifies the size of the sequence field 258. The last bit is the multiple_payloads_present bit 278 that specifies whether are not multiple payloads are present.
The initial_structure 244 is followed by a stream_flag field 246 that holds a byte consisting of four 2-bit fields. The first two bits constitute a stream_id_type field 248 that specifies the size of the stream_id field 314 within the payload_struc 266. The second most significant bits constitute the object_id_type field 250 and indicate the number of bits in the object_id field 316 of the payload_struc 266 as either 0-bits, 8-bits, 16-bits or 32-bits. The third most significant two bits constitute the offset_type field 252, which specifies the length of the offset field 318 within the payload_struc 266 as either 0-bits, 8-bits, 16-bits or 32-bits. The least two significant bits constitute the replicated_data_type field 254 and these bits indicate the number of bits that are present for the replicated_data_len field 320 of the payload_struc 266.
The packet 48 also includes a packet_len field 256 that specifies the packet length size. The sequence field 258 specifies the sequence number for the packet. The padding_len field 260 contains a number that specifies the number of padding bytes that are present at the end of the packet to pad out the packet to a desirable size.
The packet 48 also contains a clock_data field 262 that contains data representing time information. This data may include a clock license that contains a system clock reference that drives the progression of the time line under the timing model and a duration that specifies the effective duration of the clock license. The duration field limits the validity of the license to a time specified in milliseconds. Under the model adopted by the preferred embodiment of the present invention, the source computer 10 issues a clock license to the destination computer 12 that allows the clock of the destination computer 12 to progress forward for a period of time. The progression of time is gated by the arrival of a new piece of data that contains a clock value with a valid clock license that is not expired.
The packet 48 also includes a payload-flag field 264 that specifies a payload length type and a designation of the number of payloads present in the packet. The payload-flag field 264 is followed by one or more payload_strucs 266. These structures contain payload information which will be described in more detail below. The final bits within the packet 48 may constitute padding 268.
The payload_struc 266 may also include a replicated_data_len field 320 that specifies the number of bytes of replicated data present in the replicated_data field 322. As was discussed above, for protection against possible errors, the packet 48 may include replicated data. This replicated data is stored within the replicated_data field 322.
The payload_len field 323 specifies the number of payload bytes present in the payload held within the payload_data field 325. The payload_data field 326 holds an array of payloads (i.e., the data).
The ASF stream may also include an index_object 49 that holds index information regarding the ASF stream 16.
The index_object includes a time_delta field 330 that specifies a time interval between index entries. The time represents a point on the timeline for the ASF stream 16. A max_packets field 332 specifies a maximum value for packet_count fields, which will be described in more detail below. A num_entries field 334 is a 32-bit unsigned integer that describes the maximum number of index entries that are defined within the index_info array 336. This array 336 is an array of index_information structures. Each index_info structure holds a packet field that holds a packet number associated with the index entry and a packet_count field specifies the number of the packet to send with the index entry so as to associate the index entries with the packets. In
While the present invention has been described with reference to a preferred embodiment thereof, those skilled in the art will appreciate that various changes in form and detail may be made without departing from the intended scope of the invention as defined in the appended claims. For example, the present invention may be practiced with a stream format that differs from the format described above. The particulars described above are intended merely to be illustrative. The present invention may be practiced with stream formats that include only a subset of the above-described fields or include additional fields that differ from those described above. Moreover, the length of the values held within the fields and the organization of the structures described above are not intended to limit the scope of the present invention.
The present application claim priority under 35 U.S.C. §120 as a continuation of U.S. patent application Ser. No. 10/376,428, filed Feb. 28, 2003, which is a divisional of U.S. patent application Ser. No. 09/510,565, filed on Feb. 22, 2000, which is a divisional of U.S. patent application Ser. No. 08/813,151, filed on Mar. 7, 1997, now U.S. Pat. No. 6,041,345, which claims priority from Provisional Application Ser. No. 60/013,029, filed on Mar. 8, 1996, and which claims priority from Provisional Application Ser. No. 60/028,789, filed on Oct. 21, 1996, all of which are incorporated herein in their entireties by reference.
Number | Name | Date | Kind |
---|---|---|---|
4819226 | Beckner et al. | Apr 1989 | A |
4893306 | Chao et al. | Jan 1990 | A |
4914527 | Asai et al. | Apr 1990 | A |
5053946 | Jain | Oct 1991 | A |
5086402 | Sterling, II | Feb 1992 | A |
5278848 | Yamaguchi | Jan 1994 | A |
5296643 | Kuo et al. | Mar 1994 | A |
5361334 | Cawley | Nov 1994 | A |
5388264 | Tobias, II et al. | Feb 1995 | A |
5450248 | Van Eijck et al. | Sep 1995 | A |
5452297 | Hiller et al. | Sep 1995 | A |
5454297 | Phillips | Oct 1995 | A |
5491685 | Klein et al. | Feb 1996 | A |
5506844 | Rao | Apr 1996 | A |
5519780 | Woo et al. | May 1996 | A |
5566330 | Sheffield | Oct 1996 | A |
5588009 | Will | Dec 1996 | A |
5610653 | Abecassis | Mar 1997 | A |
5633880 | Lokhoff | May 1997 | A |
5689518 | Galand et al. | Nov 1997 | A |
5701300 | Jeon et al. | Dec 1997 | A |
5715176 | Mobini | Feb 1998 | A |
5719786 | Nelson et al. | Feb 1998 | A |
5729471 | Jain et al. | Mar 1998 | A |
5734589 | Kostreski et al. | Mar 1998 | A |
5740463 | Oshima et al. | Apr 1998 | A |
5754782 | Masada | May 1998 | A |
5781534 | Perlman et al. | Jul 1998 | A |
5815707 | Krause et al. | Sep 1998 | A |
5832219 | Pettus | Nov 1998 | A |
5872923 | Schwartz et al. | Feb 1999 | A |
5933603 | Vahalia et al. | Aug 1999 | A |
5956454 | Shinohara et al. | Sep 1999 | A |
6044396 | Adams | Mar 2000 | A |
6052507 | Niida et al. | Apr 2000 | A |
6424714 | Wasilewski et al. | Jul 2002 | B1 |
7206822 | Levi et al. | Apr 2007 | B2 |
7342924 | Levi et al. | Mar 2008 | B2 |
Number | Date | Country | |
---|---|---|---|
20050058133 A1 | Mar 2005 | US |
Number | Date | Country | |
---|---|---|---|
60028789 | Oct 1996 | US | |
60013029 | Mar 1996 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 09510565 | Feb 2000 | US |
Child | 10376428 | US | |
Parent | 08813151 | Mar 1997 | US |
Child | 09510565 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10376428 | Feb 2003 | US |
Child | 10975287 | US |