The present invention relates to encoding data and more particularly to manipulating audio data so that it can be encoded along with video data.
Typically a movie includes a sequence of video frames together with a corresponding sequence of audio frames (i.e., a video track and an audio track). Synchronization of these frames on playback is crucial for an audience's appreciation of the movie. However, these sequences are generally processed separately because of characteristic differences between video and audio data. Compression is an example of a processing step that is performed separately for video and audio data.
The nature of video data requires that compression be performed separately. Video data is typically a frame corresponding to a two-dimensional display. For example, a DVD (Digital Video Disk) typically employs a 720×480 array of pixels where each pixel contains a multi-bit value, such as 16-bit, 24-bit or 32-bit, that corresponds to an enumerated color.
Audio data on the other hand, is typically time-varying waveform data that represents a voltage or current rather than color. The data can be 16-bit values or higher bit values that correspond to the voltage or current that will drive a speaker.
Because of these characteristic differences, separate encoders and decoders are used for video and audio data. Having two separate encoders and decoders is an inefficient use of resources and costly. Further, synchronization between the separate encoders and decoders may not be maintained. It would, therefore, be desirable to use one encoder and decoder for both video and audio data. The present invention provides a mechanism for allowing audio data to be manipulated so that it can be concurrently encoded and decoded with video data.
A method for representing audio data in a format that can be operated upon independently, or merged with video data. The method includes replacing each audio information element in an audio sequence with a corresponding color from a color palette.
The present invention is illustrated by way of example, and not limitation, in the figures of the accompanying drawings in which like references denote similar elements, and in which:
a illustrates a representative audio signal;
b illustrates a representative digitally sampled audio signal;
Methods and apparatus for manipulating audio data so that it may be encoded and decoded along with video data are described. In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be evident, however, to one skilled in the art that the present invention may be practiced with a variety of data, especially audio and video, without these specific details. In other instances, well-known operations, steps, functions and elements are not shown in order to avoid obscuring the invention.
Various operations will be described as multiple discrete steps performed in turn in a manner that is most helpful in understanding the present invention. However, the order of description should not be construed as to imply that these operations are necessarily performed in the order that they are presented, or even order dependent. Lastly, repeated usage of the phrases “in one embodiment,” “an alternative embodiment,” or an “alternate embodiment” does not necessarily refer to the same embodiment, although it may.
a illustrates a representative audio signal. Before an audio signal can be digitally encoded and transmitted it needs to be transformed into a digital signal, although implementation of the present invention will typically occur on audio signals that have previously been transformed into digital signals. To transform audio signal 100 into a digital signal, audio signal 100 is typically sampled by an analog to digital converter at a predetermined rate to produce snapshots of the value of the audio signal at equally spaced intervals, as is conventionally known. Depending on the audio scheme being implemented a certain number of samples make up a frame. Typically, samples are encoded or processed using frames.
b illustrates a representative digitally sampled audio signal. Digitally sampled audio signal 104 is a sequence of digital values, also termed digital audio signal elements, that are spaced apart by the same time interval. The sequence of digital audio signal elements can be represented in a two column table in which each row contains the time a sample was taken and the digital value of the sampled audio signal at the sample time. Table 106 shows such a table or data.
Since audio and video data have different formats, audio data is not conventionally appended to video data and encoded with it. The present invention provides a mechanism for manipulating audio data so that it can be appended to video data for later encoding concurrently with the video data.
The color audio frame of process 300 is added to a corresponding video frame to produce an augmented video frame that can be encoded and later decoded. In order to not obscure the present invention, methods and apparatus of adding the color audio frame to a corresponding video frame, and then operating upon the augmented frame will not be described in greater detail. Conventional methods, as well as methods described in co-pending applications entitled “Method And Apparatus For Determining Patterns Within Adjacent Blocks Of Data bearing attorney reference number 042503/0259665 and filed on the same day as this application in the U.S. Patent Office, and U.S. application Ser. No. 09/727,096 entitled “Method And Apparatus For Encoding Information Using Multiple Passes And Decoding In A Single Pass” filed on Nov. 29, 2000, both of which are assigned to the same assignee as the present invention, can be used for adding the color audio frame to the video frame to obtain the augmented frame and then operating upon the augmented frame, although it will be understood that the present invention can be used independently and without reference to a video frame.
After the augmented video frame is decoded, the color audio can be separated from the augmented video frame and the digital audio signal elements recovered.
Thus, methods and apparatus for manipulating audio data so that it may be encoded independently in a different form, and, more preferably, along with video data have been described. Although the present invention has been described with reference to specific exemplary embodiments, it will be evident to one of ordinary skill in the art that various modifications and changes may be made to these embodiments without departing from the broader spirit and scope of the invention as set forth in the claims. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.
| Number | Name | Date | Kind |
|---|---|---|---|
| 5191319 | Kiltz | Mar 1993 | A |
| 6411289 | Zimmerman | Jun 2002 | B1 |
| 6507742 | Lin et al. | Jan 2003 | B1 |
| Number | Date | Country |
|---|---|---|
| 0 675 478 | Oct 1995 | EP |
| Number | Date | Country | |
|---|---|---|---|
| 20030081146 A1 | May 2003 | US |