1. Field of the Invention
The present invention is directed to persistent data storage and in particular to persistent storage of real-time data.
2. Background Information
A wide variety of media have been employed for storing real-time-type data persistently. In addition to the media traditionally employed to store such information in an analog manner, compact disks, digital versatile disks, and magnetic disks are all currently employed to store such data digitally. Because real-time data tends to be voluminous, a great degree of effort has been dedicated to storing it digitally in a format that makes data storage and use economical and convenient.
As a consequence, persistent real-time-data storage has achieved a high degree of sophistication and effectiveness.
But I have recognized that a further advance in such storage will greatly contribute to the ease with which such storage and the resultant playback can be performed. Specifically, in storing real-time data, I format the data as payloads of RTP packets, i.e., as payloads of packets substantially of the type described in the Internet community's Request for Comments 1889 (“RFC 1889”). The payloads are stored with the RTP header's accompanying timestamp information so as to specify the data's relative playback timing explicitly rather than only implicitly, from their positions in the data stream.
Since real-time data tend to be transmitted and received in RTP packets, such storage makes it quite convenient and computationally inexpensive to store received real-time information and to transmit such information when it has been retrieved from storage. Additionally, the fact that the RTP format lends itself to separate storage of video and audio information yields significant versatility to such data's playback.
The invention description below refers to the accompanying drawings, of which:
The transmitter and receiver will typically be implemented in personal-computer or similar processors, which operate in accordance with stored programs contained in respective persistent stores 22 and 24, typically magnetic disks. Operating in accordance with this programming, such processors can format data for storage in accordance with the present invention's teachings and act as drivers that operate the players 20 to play back data thus stored. In one application of the present invention's teachings the receiver 18 may record the received information in the persistent store 24 instead of, or contemporaneously with, driving the player 20 with that information.
So, although the drawing's legend for block 18 is “receiver,” that block not only receives packets but also operates the persistent store 24 and drives the players 20. Similarly, the “transmitter” 14 performs several functions in addition to transmitting.
The transmitter 14 will typically break up the sampler 12's output sequence into discrete packets according to the RFC 1889 specification, so the receiver 18 will receive the real-time information in the form of RTP packets.
If the Ethernet frame indicates that, say, FIG. 1's receiver 18 is the proper recipient node, that receiver further consults the Ethernet header to determine which of its processes is to receive the Ethernet frame's payload. In the illustrative scenario, that process is the Internet Protocol (“IP”) process, whose purpose is to determine whether—and, if so, how—to forward the received information along other links to which the recipient node may be coupled.
The IP process interprets the first part of the Ethernet frame's payload as an IP header, as
The receiver 18 thereupon further inspects the IP header for an indication of the process to which the IP payload's contents should be delivered. In the illustrated scenario, that process is the User Datagram Protocol (“UDP”) process. The UDP process is the IP suite's transport-level protocol for delivering without providing reliability services. Its header, like the previous headers, includes demultiplexing information to identify the process that can appropriately interpret the UDP datagram's payload.
In this case, that process is the RTP process. That process interprets the UDP datagram's payload as beginning with an RTP header, whose format
The time duration represented by a single increment of the timestamp value depends on the particular type of data stream being transmitted, but the receiver typically infers that value from the contents of the RTP header's PT field, which is a code for the type of data that the RTP payload represents. If, as
To play the information thus received, the receiver 18 drives the player 20 with this information at times that correspond to the timestamps that accompany the information. That is, it establishes a relation between the advancing timestamp values and values on a local real-time clock, and it does not apply data to its player until its local wall-clock time reaches the time that corresponds to the timestamp value.
Now, although the transmitted information is ordinarily to be played in the fashion just described, the receiver 18 may instead, or additionally, record the transmitted information in persistent storage. In accordance with the present invention, the format with which real-time information is recorded is largely that set forth in the above-mentioned RTP specification. Specifically, the file containing the real-time information includes timestamp-containing headers to indicate the relative timing with which the various stored information is to be played.
A typical file for containing such information may, for instance, consist of a number of segments of which each has a format similar to that of
In any event, when the information is to be played or transmitted, the system for doing so plays or transmits it accordance with the timestamps. That is, it reads the timestamp for a given segment and then does not play or transmit it until the wall-clock time that the timestamp value dictates. It the system retransmits the stored media packets, it must replace the stored timestamp with a value consistent with the sender's wall clock as specified in RFC 1889. It must also replace the received synchronization-source identifier with one appropriate for the sender. In some cases the stored data will follow other data in the same session, and it will be preferable in such situations for the stored sequence number to be so modified as to indicate a discontinuity in the current media stream. A difference greater than one between successive sequence numbers is such a discontinuity indicator.
The present invention's teachings will often be employed in multimedia situations, in which associated audio information is to be played with video information. As those skilled in the art will recognize, audio and associated video are usually transmitted in separate RTP sessions, and the present invention's implementations will typically store the information in that fashion. For instance, suppose that a multimedia presentation entitled “Scene 1” is to be stored. The video information may be stored in RTP format in a file called, say, “scene 1.261,” where the “.261” extension may indicate that the format of the file's information is that specified by the International Telecommunications Union's H.261 standard. The audio may be stored in a file called, for instance, “scene 1.711” to indicate that it conforms to the G.711 audio-data standard.
Now, the timestamp information contained in the video file will not in general correspond to that in the audio file. As was mentioned above, timestamp values are assigned randomly at the beginning of a session, and the video and audio information will have been transmitted in separate sessions. Moreover, the time-interval length represented by one increment of the video timestamp values will not in general be the same as the length that an audio-timestamp-value increment represents. It may therefore be considered desirable in some cases to add information that represents the relative timing between the two files.
But most implementers will not find this necessary. Ordinarily, the first samples from two related files can be counted on to have been taken close enough in time to be considered simultaneous, so timing can be based on this assumption. Additionally, the time-interval length represented by a single time-stamp-value increment may be inferred from, say, the file-name extension.
The present invention's approach to persistent storage lends itself to a wide variety of uses. As was just explained, a receiver can use it to keep a record of received information. For example, a receiver of a so-called voice-over-IP signal or of a videoconference signal may use the present invention to record the substance of a conference or other telephone call. This is clearly a simple way to store information that has been received in RTP format.
But the present invention is also advantageous for storing information that has not yet been transmitted. FIG. 1's transmitter 14 may place the samples received from the sampling equipment 12 into the RTP format. Acting as a persistent-store driver, transmitter 14 stores the resultant packets in store 22 and then retrieves them for subsequent transmission or playback on local equipment (not shown). Transmitter 14 may, for instance, be a video- and/or audio-clip server, which sends stored information on demand to client network nodes. Storage in the RTP format greatly simplifies this process.
Additionally, the type of separate audio and video storage to which the RTP format lends itself, together with the fact that such information tends to be sent in different RTP sessions, affords a significant degree of flexibility. As was stated above, a given storage directory may contain a file called “scene 1.261” to represent audio information and another called “scene 1.711” to represent the related video information. The same directory may include a further file called, say, “scene 1.723,” containing RTP information whose payload conforms to a different audio standard, which some clients may prefer. So the present invention makes it convenient to store different audio versions for the same video clip, and vice versa. This applies not only to storage formats but also to, say, languages. For example, German, French, and Spanish versions of the audio for a film can be stored in different files and used with the same video file.
This mechanism also provides a simple, efficient method to insert stored media clips into a real-time media stream. A typical example of this function is to broadcast a message such as “Conference will end in five minutes” to all participants in a scheduled conference.
The present invention's teachings will also find considerable use for testing.
From the foregoing description, it is apparent that the present invention's teachings can be employed in a wide range of embodiments and thus constitute a significant advance in the art.
The present application is a Divisional of application Ser. No. 11/525,434, filed Sep. 22, 2006, which is a continuation of U.S. patent application Ser. No. 09/606,582, which was filed on Jun. 29, 2000, the entire contents of both of which are hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 11525434 | Sep 2006 | US |
Child | 13233840 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 09606582 | Jun 2000 | US |
Child | 11525434 | US |