The present invention relates to processing of dynamic graphic content, in particular, to a method and apparatus for encoding/decoding dynamic graphic content.
Dynamic graphic content is rapidly prevailing with the rapid development of television meeting, VCD, digital TV and HDTV in recent years. The graphic content mentioned herein is a combination of text and pictures. The dynamic graphic content features such elements as forms, buttons, and targeted information, whose appearance is determined by the device on behalf of internal states and of its user.
As shown in
However, it is costly to add said processing capability to the user device. Typically, it demands more powerful CPUs, graphic co-processors, additional memory for code and data, and pixel-based picture memory. So dynamic graphic content is not accessible to low cost devices.
Another way as depicted in
However, this method still has defects. In such a method, as many views as variants should be created according to the number of dynamic elements included in the dynamic graphic content. Suppose there are N dynamic elements in a dynamic graphic content, denoted as eI, . . . ,eN. Element ei has Mi different appearance states, denoted by 0, . . . ,Mi-I. Thus, the number of static views to create is equal to the product of Mi (i=I˜N), denoted as Mi in
Therefore, a novel method for providing dynamic graphic content is required to compress dynamic pictures economically and effectively, and to save bandwidth and memory without large modifications to the user device.
An object of the present invention is to solve the above-mentioned technical problems residing in the related art.
An aspect of the present invention provides a method for encoding dynamic graphic content in a block-based video predict-encoding scheme, comprising: encoding a view in which all of the plurality of dynamic elements being in a first state as a reference picture; encoding the views in which at least one of the plurality of dynamic elements being in a state other than the first state as differential pictures with regards to said reference picture, to form a differential picture sequence; multiplexing said reference picture and said differential picture sequence together, and providing the result video signals.
Preferably, the method for encoding dynamic graphic content of the invention is implemented in a MPEG encoding scheme.
Another aspect of the present invention provides a method for decoding video signals resulted from the method for encoding dynamic graphic content of the invention, comprising: decoding the reference picture; decoding the differential pictures corresponding to the state of dynamic elements that have changed with respect to said reference picture.
Preferably, the decoding method of the invention further comprising a step of skipping the differential pictures corresponding to the state of dynamic elements that has not changed with respect to said reference picture.
Still another aspect of the present invention provides a device for implementing the methods of the invention for encoding/decoding dynamic graphic content.
Still another aspect of the invention provides a broadcasting system and a video signals offering apparatus comprising the graphic encoding device of the invention.
Still another aspect of the invention provides a video player and a user device comprising the decoding device of the invention.
It will be appreciated that the method of the present invention can be applied to variant-predict encoding scheme, such as MPEG-1, 2, 4, DivX, H261, H262, H263, and H264, and the like.
The above and other objects, features and advantages of the invention will be more clearly understood from the following detailed description taken in conjunction with the accompanying drawings, in which:
A detailed description to the embodiments of the present invention will be o provided as follows.
In a block (object)-based predict-encoding scheme, pictures are segmented into blocks (or objects), with each block occupying a constant area in the pictures. In the present invention, the pictures are segmented so that different dynamic elements are positioned in different blocks (objects). Each dynamic element occupies a constant area regardless of its states. This allows keeping the same layout in all variant views. The elements are non-overlapping not only in the pixel domain, but also in the coded domain. For example, MPEG-1 and MPEG-2 use block grids in encoding process, and different elements should fall on distinct blocks.
The preferred embodiment of the present invention will be described in detail by taking MPEG video encoding standards as an example for convenience sake. Please note that the MPEG encoding process scheme merely serves to explain the invention as a example, and is not intended to limit the invention. The method of the present invention can be applied to variant-predict encoding scheme, such as MPEG-1, 2, 4, DivX, H261, H262, H263, and H264, and the like.
In the method of the present invention, view of (e1=0, e2=0, . . . ,eN=0) is encoded as an Intra-picture (I-picture). Then views (e1=l, e2=0, . . . ,eN=0), . . . ,(e1=M1−l, e2=0, . . . ,eN=0) are encoded as differential pictures with regards to the encoded view (e1=0, e2=0, . . . ,eN=0). Here, differential encoding is the base of most video coding schemes, and especially the MPEG. Differential pictures are called P-pictures (predicted-pictures) in MPEG. The process continues for (e1=0, e2=l e3=0, . . . ,eN=0), . . . , (e1=0, e2=M2−1, e3=0, . . . , eN=0) till (e1=0, .. ., eN-1=0, eN=1), . . . , (e1=0, . . . , eN-1=0, eN=MN−1), see
In encoding scheme using differential (or predictive) encoding, the above processing can be optimized by using a single encoder. This process is depicted in
This leads to an encoded video sequence formed by a group of 1 intra-picture +{Σ(Mi−1) I=1, . . . ,N} predicted pictures. This sequence is short, so it is typically repeated in time until its content is outdated.
To further reduce bandwidth, preferably, the video signal contains an intra-picture !o no less than every predetermined time period. Predicted pictures, whose encoded forms are very compact, which simply indicate “no change with regards to previous picture”, can be added to the sequence if it is less than the predetermined time period. For example, for predetermined time period of ½ second at the rate of 25 pictures per second, the number of P-picture, {Σ(Mi−1) i=1, . . . ,N}, should be 11. Here, the ½ second refers to the maximum latency for switching between views.
Table 1 below shows the comparison between the methods for dynamic graphic pre-processing of the present invention and the prior art, for a same latency between view switching at the receiver end.
As can be seen, not only {Σ(Mi−I) i=I, . . . , N) is significantly less than {product of Mi i=1, . . . , N}, but also the size of P-pictures is an order of magnitude (10×) less than I-pictures. Thus, the pre-processing of dynamic graphic content of the prevent invention allows significant bandwidth savings.
The decoding method of the present invention will be explained with reference to
A legacy video decoder can play back the video signal encoded according to the method of the present invention.
To display the view corresponding to (e1, e2, . . . ,eN) (where ei is a value within 0, . . . ,Mi−I denoting the appearance of the element), the decoder should first decode the I-picture before decoding P-picture. P-pictures encoding a state change in one of the elements can be denoted as the size N vector (0, . . . ,0, fi≠0, 0, . . . ,0) where i is an index within 1˜N and fi is the appearance of the element to within 0, . . . ,Mi−I. Then, for all i such as ei≠0, P-pictures (0, . . . ,0, fi=ei, 0, . . . ,0) will be decoded while other P-pictures will be skipped.
This decoding process can be performed in the decoder for encoding schemes based on block/object coding and differential encoding shown in
The state machine depicted in
The state machine depicted in
If the encoding scheme is MPEG, the decoding process can be performed thanks to slight modifications to the legacy MPEG decoder shown in
Once the desired view is constructed, it should be frozen on the screen until the graphic content changes. Typically, freezing a picture in decoding process is to conceal an erroneous stream, but in the present invention, it is a normal processing. For example, in a MPEG decoder, the VLD will wait for the synchronization word of the next picture while the last picture being frozen. The state machine in
So, the benefit of the decoding process of the present invention is that user device doesn't need to be re-designed significantly. In particular, this process can be performed in legacy video decoders.
Although the invention has been explained by taking MPEG encoding scheme as an example, it should be understood that, the MPEG scheme merely serves to explain the invention as an example, and is not intended to limit the invention. The invention can be conveniently applied to other block (object)-based predict-coding schemes. In addition, the details set above should not be deemed limitation to the invention. It is apparent for those skilled in the art that there are different substitutions, modifications and changes for the invention.
Number | Date | Country | Kind |
---|---|---|---|
02158390.0 | Dec 2002 | CN | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB03/06249 | 12/29/2003 | WO | 1/18/2006 |