This invention relates to video compression techniques and in one example, to MPEG2.
It is well understood that the selection of an appropriate group of pictures (GOP) structure is important to achieving quality encoding at a given bit rate. At lower bit rates it will generally be necessary to employ relatively large numbers of predicted (P and B) pictures and a relatively small number of Intra-coded (I) pictures. There are, however, lower limits upon the number of I-pictures, dictated in different applications by requirements to contain drift and the propagation of errors and to ensure an acceptable lock-on period when switching to a particular bitstream. An additional factor lies in the visibility of the GOP structure, particularly at low bit rates. Thus the introduction of an I-picture, dictated by the requirements that have been mentioned, can itself represent an artefact in a generally quiet picture sequence.
It is an object of this invention to address these apparently conflicting requirements.
Accordingly, the present invention consists in one aspect in a method of compression coding a sequence of pictures, the method comprising the steps of generating a measure of picture difference; comparing said measure with a picture difference threshold; and utilising this comparison in selecting between the use of predicted and non-predicted compressed pictures (I-pictures), wherein the threshold has an initial value adapted to promote the use of an I-picture at or close to a scene change in the picture sequence, the threshold changing in time in such a manner as to increase the probability of selection of an I-picture with an increase in the time since selection of the last I-picture.
In a further aspect, the invention consists in apparatus for compression coding of a sequence of pictures, comprising measuring means for generating a measure of picture difference; generator means for generating a picture difference threshold; comparator means for comparing said measure with the picture difference threshold; and adaption means for utilising this comparison in selecting between the use of predicted and non-predicted compressed pictures (I-pictures), the generator means being capable of adapting the picture difference threshold to have an initial value which promotes the use of an I-picture at or close to a scene change in the picture sequence; and of changing the threshold in time in such a manner as to increase the probability of selection of an I-picture with an increase in the time since selection of the last I-picture.
In this way it can be arranged that with relatively frequent scene changes, every I-picture is located at or close to a scene change. This has the advantages of improved coding efficiency since predictive coding will be inefficient across a scene change, and of low visibility of the compressed picture structure, since any noticeable difference in character between predicted and non-predicted compressed pictures will be masked by the scene change.
Where the interval between scene changes is large, and it becomes desirable to select an I-picture within a scene, the change in threshold according to this invention will promote the selection of an I-picture at a discontinuity in the picture sequence, such as the movement into shot of a large object. As the time since the last I-picture increases, the amount of discontinuity necessary to provoke the selection of an I-picture will progressively reduce.
In one embodiment, the threshold reduces linearly from a value which is expected to be achieved only by a “true” scene change, to a value at which the selection of an I-picture is inevitable, irrespective of picture content. The period of this threshold variation might typically be 120 fields or 2 seconds. A different period could of course be chosen and the threshold variation may be non-linear.
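By way of illustration only, the linearly reducing threshold of this embodiment might be sketched as follows. The sketch assumes that one picture difference measure is available per field and that the measure is normalised to the range 0 to 1; the function names, the starting value of 0.8 and the convention that a threshold of zero makes selection inevitable are assumptions made for the example and are not taken from the description.

```python
def threshold(fields_since_i, start=0.8, period=120):
    """Threshold that falls linearly from `start` to 0 over `period` fields.

    `start` stands for a value expected to be reached only by a true scene
    change (0.8 is an assumed figure). Once `period` fields have elapsed the
    threshold is 0, so any non-negative difference measure selects an I-picture.
    """
    remaining = max(period - fields_since_i, 0)
    return start * remaining / period


def mark_i_pictures(differences, start=0.8, period=120):
    """Return one boolean per field: True where an I-picture is selected.

    `differences` holds an assumed normalised picture difference measure
    (0.0 to 1.0) for each field of the sequence.
    """
    marks = []
    fields_since_i = period  # assumption: force an I-picture at the very start
    for difference in differences:
        is_i = difference >= threshold(fields_since_i, start, period)
        marks.append(is_i)
        # Restart the count after an I-picture, otherwise keep counting.
        fields_since_i = 1 if is_i else fields_since_i + 1
    return marks


if __name__ == "__main__":
    # A quiet sequence with a single large discontinuity at field 10.
    sequence = [0.0] * 200
    sequence[10] = 0.9
    selected = [i for i, is_i in enumerate(mark_i_pictures(sequence)) if is_i]
    print(selected)
```

In this sketch the discontinuity at field 10 selects an I-picture, and in the quiet fields that follow no further I-picture is selected until the threshold has fallen to zero, 120 fields later. A non-linear variation could be obtained by replacing the body of `threshold`.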
The location of an I-picture at a discontinuity will of course provide the same advantages as location at a scene change, to a degree dependent upon the amount of the discontinuity.
The measure of picture difference can take a wide variety of forms. It can operate in the video domain and take the form of an accumulated sum of pixel by pixel differences. More complex methods for detecting scene changes are known and can be applied in accordance with this invention, for example through modification of the threshold for scene change identification. Reference is for example directed to EP-A-0 748 560 which describes methods for detecting scene changes or cuts, utilising a lack of correlation between successive pictures rather than an arithmetical picture difference.
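A video domain measure of the kind mentioned above might, as a minimal sketch, be computed as follows. The use of absolute differences, the representation of a picture as a list of rows of 8-bit luminance values, the normalisation by peak value and the function names are all assumptions made for the example.

```python
def picture_difference(current, previous):
    """Accumulated sum of absolute pixel-by-pixel differences.

    Both pictures are given as equally sized lists of rows of integer
    luminance values (an assumed representation).
    """
    if len(current) != len(previous):
        raise ValueError("pictures must have the same number of rows")
    total = 0
    for row_current, row_previous in zip(current, previous):
        if len(row_current) != len(row_previous):
            raise ValueError("rows must have the same length")
        total += sum(abs(a - b) for a, b in zip(row_current, row_previous))
    return total


def normalised_difference(current, previous, peak=255):
    """Scale the accumulated difference to the range 0.0 to 1.0.

    `peak` is the assumed maximum pixel value (255 for 8-bit samples).
    """
    pixels = sum(len(row) for row in current)
    return picture_difference(current, previous) / (pixels * peak)


if __name__ == "__main__":
    earlier = [[10, 25], [20, 40]]
    later = [[10, 20], [30, 40]]
    print(picture_difference(later, earlier))
    print(normalised_difference(later, earlier))
```

Normalising the sum makes the measure independent of picture size, so that the same threshold values can be used for different picture formats.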
It will also be possible to provide a measure of picture difference in the MPEG domain, utilising the amount of the prediction error.
The invention will now be described by way of example with reference to the accompanying drawings, in which:
In a lower bit rate MPEG2 encoder, it is useful to employ a GOP structure such as that shown in
At a discontinuity which exceeds the current threshold value (signified in 30
There are advantages in the pattern of B- and P-pictures remaining generally constant. The number (here two) of successive B-pictures determines the amount of delay required to accommodate the re-ordering of pictures, necessary to ensure that each B-picture arrives after both of the two pictures from which it is predicted. Changes in the number of successive B-pictures are therefore not desirable.
In consequence, if a discontinuity which exceeds the current threshold value occurs immediately before a B-picture, selection of an I-picture is deferred until the next location at which a P-picture would otherwise have been sent. This is illustrated in
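The deferral described above might be sketched as follows, purely by way of example. The sketch assumes pictures in display order, a fixed pattern in which every third picture is an anchor (I or P) separated by two B-pictures, and a per-picture flag indicating that the discontinuity immediately before that picture exceeded the current threshold; the function name and these conventions are assumptions, and the resetting of the threshold is not modelled.

```python
def assign_picture_types(triggers, b_run=2):
    """Assign I, P or B to each picture in display order.

    `triggers[i]` is True when the discontinuity immediately before picture i
    exceeded the current threshold. `b_run` is the constant number of
    successive B-pictures (two in the example pattern).
    """
    types = []
    pending = False  # an I-picture has been requested but not yet sent
    for index, triggered in enumerate(triggers):
        pending = pending or triggered
        is_anchor_slot = index % (b_run + 1) == 0
        if not is_anchor_slot:
            # Keep the B-picture pattern constant; any request stays pending.
            types.append("B")
        elif pending or index == 0:
            # Send the I-picture where a P-picture would otherwise be sent.
            types.append("I")
            pending = False
        else:
            types.append("P")
    return "".join(types)


if __name__ == "__main__":
    # Discontinuity immediately before the B-picture at position 4.
    print(assign_picture_types([False] * 4 + [True] + [False] * 2))
```

Because the I-picture only ever replaces a P-picture, the number of successive B-pictures, and hence the re-ordering delay, is unchanged.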
In an embodiment, the apparatus illustrated in
It will be understood that this invention has been described by way of example only and that a wide variety of further modifications are possible without departing from the scope of the invention. For example, the invention extends to a computer program or computer program product for carrying out any of the methods described herein.
Number | Date | Country | Kind |
---|---|---|---|
0018628.8 | Jul 2000 | GB | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/GB01/03402 | 7/27/2001 | WO | 00 | 10/23/2003 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO02/11453 | 2/7/2002 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5592226 | Lee et al. | Jan 1997 | A |
5774593 | Zick et al. | Jun 1998 | A |
6057893 | Kojima et al. | May 2000 | A |
6714594 | Dimitrova et al. | Mar 2004 | B1 |
6731684 | Wu | May 2004 | B1 |
6804301 | Wu et al. | Oct 2004 | B1 |
20010014121 | Kaye et al. | Aug 2001 | A1 |
Number | Date | Country |
---|---|---|
WO 9524095 | Sep 1995 | WO |
WO 0019726 | Apr 2000 | WO |
Number | Date | Country |
---|---|---|
20040179590 A1 | Sep 2004 | US |