1. Field of the Invention
The present invention relates to a method of encoding and decoding audio-visual information and a recording medium stored with formatted audio-visual information. More particularly, the present invention relates to appropriately formatting synchronization data, control data, audio information, and video information for being stored in a recording medium with a small storage capacity or bandwidth, thereby achieving economical and beneficial reproduction of the audio-visual information.
2. Description of the Related Art
Among various currently used recording media, optical storage media are able to provide relatively large storage capacity with a high density through using an extremely short wavelength of a laser beam. The most commonly used optical storage media are compact disks (CD), which may be categorized as a compact disk-digital audio (CD-DA), a compact disk-read only memory (CD-ROM), a compact disk-interactive (CD-I), a video compact disk (VCD), and a digital versatile disk (DVD). The CD-DA may be used to record music data. The CD-ROM has two data formats of “Mode 1” for storing computer data and “Model 2” for storing audio-visual information. The CD-I provides a real-time interactive function and stores sound, still picture, and motion picture data. The VCD and DVD employ a technique of motion picture experts group (MPEG) to compress audio-visual information.
Although the VCD and DVD can store a large capacity of audio-visual information and achieves high quality real-time music play back and image reproduction, which is a remarkable success in industrial and entertainment business, the applications of the VCD and DVD to recording, playing back, and reproducing the audio-visual information are unfortunately subjected to the following disadvantages.
In order to store a tremendous amount of audio-visual information within a finite space on the VCD and DVD, it is necessary to compress the audio-visual information through using the complicated MPEG technique. As a result, the method of encoding the data as well as the encoder that executes such encoding method become much more complicated. Additionally, a complicated decoder and a specially designed audio-visual reproducing device are required for reproducing the compressed audio-visual information stored on the VCD and DVD. For example, a DVD player, instead of a CD-DA player, is necessary for playing the DVD in order to reproduce the stored audio-visual information. As well known by people, the DVD player is more expensive than the CD-DA player. Such difference in price obviously results from the complicated decoding method and decoder employed within the DVD player.
The complication and high cost of the current audio-visual information apparatus have already prevented the circulation and usage of the audio-visual information. Especially for entertainment and education applications serving children and young people, it is desired to provide an economical and beneficial solution to the recording, playing back, and reproducing of the audio-visual information.
Therefore, an object of the present invention is to provide a method of encoding and a method of decoding audio-visual information for easily, economically, and effectively recording, playing back, and reproducing the audio-visual information.
Another object of the present invention is to provide a recording medium stored with formatted audio-visual information, for achieving easy, economical, and effective applications of recording, playing back, and reproducing the audio-visual information.
Although the present invention is usually applied to store a reduced amount of audio-visual information in a small-capacity recording medium, an acceptable degree of audio-visual reproducing quality is successively obtained. In one embodiment of the present invention, the methods of encoding and decoding the audio-visual information may be applied to the CD-DA. Conventionally, the CD-DA can record no information but the normal music data, and the CD-DA player can play back no optical media but the CD-DA. However, the present invention discloses an appropriate format that is named “universal audio-video frame format” by the Inventors, for effectively storing the audio-visual information in the CD-DA. Consequently, the circulation of the audio-visual information is facilitated and there will be much more applications developed on the basis of the present invention since the high quality play back and reproduction of the audio-visual information can be performed by simply using the low-cost CD-DA player.
The methods of encoding and decoding audio-visual information according to the present invention are preferably used for the recording medium with a small storage capacity or bandwidth, such as the CD-DA, the flash memory of the cellular phone, and the like. The recording medium according to the present invention is preferably used for storing the video information to be reproduced on an image display device with a small size or resolution, such as a 216-pixel by 160-pixel liquid crystal display.
According to one aspect of the present invention, a method of encoding audio-visual information is provided. Audio information having a plurality of bytes is prepared. Video information having a plurality of bytes is prepared. At least one synchronization field is configured in the audio information to form at least one synchronization-audio packet (SAP). Each of the at least one SAP has at least one byte of the audio information. At least one control field is configured in the audio information to form at least one control-audio packet (CAP). Each of the at least one CAP has at least one byte of the audio information. At least one video field is configured and the audio information and the video information are merged to form at least one video-audio packet (VAP). Each of the at least one VAP has at least one byte of the audio information. The at least one SAP, the at least one CAP, and the at least one VAP are combined to form at least one universal audio-video frame (UAVF). The at least one UAVF is recorded in a recording medium. The at least one synchronization field stores at least one synchronization data for marking a start of the at least one UAVF. The at least one control field stores at least one control data for reproducing the video information.
According to another aspect of the present invention, a recording medium of audio-visual information is provided. Plural bytes of audio information are recorded in the recording medium for playing back as sound. Plural bytes of video information are recorded in the recording medium for reproducing as image. At least one synchronization-audio packet (SAP) is recorded in the recording medium. Each of the at least one SAP has a synchronization field and a first audio field. The first audio field stores at least one byte of the audio information. At least one control-audio packet (CAP) is recorded in the recording medium. Each of the at least one CAP has a control field and a second audio field. The second audio field stores at least one byte of the audio information. At least one video-audio packet (VAP) is recorded in the recording medium. Each of the at least one VAP has a video field and a third audio field. The third audio field stores at least one byte of the audio information. The at least one SAP, the at least one CAP, and the at least one VAP are combined to form the at least one UAVF.
According to still another aspect of the present invention, a method of decoding audio-visual information is provided. The audio-visual information is formatted by at least one universal audio-video frame (UAVF) having at least one synchronization-audio packet (SAP), at least one control-audio packet (CAP), and at least one video-audio packet (VAP). Data stored in at least one synchronization field of the at least one SAP is detected for determining a start of the at least one UAVF. A first portion of the audio information is accessed from the at least one SAP. Data stored in at least one control field of the at least one CAP is detected. A second portion of the audio information is accessed from the at least one CAP. The video information stored in at least one video field of the at least one VAP is accessed. A third portion of the audio information is accessed from the at least one VAP. The video information stored in the at least one video field is reproduced in response to the data stored in the at least one control field. The first to third portions of the audio information are played back.
The above-mentioned and other objects, features, and advantages of the present invention will become apparent with reference to the following descriptions and accompanying drawings, wherein:
The preferred embodiments according to the present invention will be described in detail with reference to the drawings.
In a step ES2, the digital video information 20 may be generated from a video source 201 by performing a video signal processing step ES2′. The video source 201 may include an analog source and/or a digital source. For example, the video signal processing step ES2′ may consist of sampling, sub-sampling, tuning for the video quality, and the like, which are well known by one skilled in the art. The video signal processing step ES2′ may also include a conventional video compression technique such that the digital video information 20 is generated by compression. In one embodiment of the present invention, the video source 201 may be 24-bit bitmap format video data, and converted into 4-bit bitmap format video data through the sub-sampling of the video signal processing step ES2′. In a case where the digital video information 20 is directly provided, i.e. the video source 201 is the 4-bit bitmap format video data, the additional video signal processing step ES2′ becomes unnecessary.
In a step ES3, at least one synchronization field is configured in the digital audio information 10 and then filled with synchronization data, thereby generating audio information 30 containing at least one synchronization-audio packet (SAP).
In a step ES4, at least one control field is configured in the digital audio information 30 containing the SAP, and then filled with control data, thereby generating audio information 40 containing both of the SAP and at least one control-audio packet (CAP).
In a step ES5, at least one video field is configured while the digital audio information 40 containing the SAP and the CAP is merged with the digital video signal 20, thereby generating an audio-visual information 50 formatted by at least one universal audio-video frame (UAVF) consisting of at least one SAP, at least one CAP, and at least one video-audio packet (VAP).
In one embodiment of the present invention, the recording medium is implemented by a CD-DA with a diameter of 108 mm for storing the audio-visual information formatted by the UAVF. Typically, the specification of the CD-DA output is 16 bits per channel at a rate of 44.1 K samples per second. Due to dual channels (i.e. right and left channels) the CD-DA provides a bandwidth of 44,100*16*2/8=1 76,400 byte/sec, provided that each byte has eight bits. When the frame rate is set as 9 frames per second, the storage capacity of the CD-DA is 176,400/9=1 9,600 bytes during one frame, i.e. 1/9 seconds. When a display with a resolution of 216-pixel by 160-pixel is employed, the video information required for displaying one frame is 216*160*4/8=17,280 bytes if each pixel is expressed by a 4-bit data. When the audio information is stored in the CD-DA under a condition that every ten bytes of data contains one byte of the audio information, 1,960 bytes of the audio information can be stored during one frame ( 1/9 seconds). That is, the sampling rate of the audio information is 1,960*9=17.64K per second.
Because the audio information and the video information are mixed together and then recorded within the two channels of the CD-DA, it is necessary to use the synchronization data for identifying the start of each UAVF and the position of the audio information. As described above, the storage capacity of the CD-DA during one frame ( 1/9 seconds) is 19,600 bytes wherein 17,280 bytes are arranged to store the video information and 1,960 bytes are arranged to store the audio information. As a result, 360 bytes are available for storing the synchronization data and/or the control data, such as the gamma table or other parameters regarding the play back and reproduction of the audio-visual information.
It should be noted that although the encoding method according to the present invention may effectively store the audio-visual information on the CD-DA with the diameter of 108 mm, the present invention is not limited to this and may be applied to store the audio-visual information on various types of recording media, including a cassette tape, a floppy disk, a semiconductor memory, a game card, a compact disk with an arbitrary diameter, and so on.
While the invention has been described by way of examples and in terms of preferred embodiments, it is to be understood that the invention is not limited to the disclosed embodiments. To the contrary, it is intended to cover various modifications. Therefore, the scope of the appended claims should be accorded the broadest interpretation so as to encompass all such modifications.
Number | Date | Country | Kind |
---|---|---|---|
093114916 | May 2004 | TW | national |