This application claims the benefit, under 35 U.S.C. §119 of French Patent Application 0759829, filed Dec. 13, 2007.
The present invention relates to a method and a device for video picture format conversion. It can be implemented in a video coder or decoder or directly in any type of display device.
There are currently two formats associated with standard HDTV (High Definition TV). These two formats are the 720p format and the 1080i format. The 720p format produces progressive pictures comprising 720 lines and 1280 columns and the 1080i format produces interlaced pictures comprising 1080 lines and 1920 columns. Each of these formats has advantages that are specific to it. The 720p format presents a better temporal resolution and reproduces fast moving objects without creating a blur effect while the 1080i format presents a better spatial resolution. The programme broadcasters have chosen to use one or other of these formats but not both together. There is therefore a real necessity to process, prior to display, the format of broadcast programmes, to transform it into a format supported by the display device used to display said programmes.
It is known in the art to change the format of a video picture by first carrying out a detection of the local edge orientation in the input picture then carrying out an interpolation based on the detected orientations. A distance representative of the edge orientation is calculated at a point of the input picture having the same spatial position (i.e. the same spatial coordinates) as the pixel of the output picture to be interpolated and this distance is then used to calculate the value of this pixel of the output picture. This method is relatively complex and consuming in terms of calculation resources.
The purpose of the present invention is to improve this method.
The present invention relates to a format conversion method of an input picture to an output picture, comprising the following steps:
According to a particular embodiment, the neighbouring pixels of the first set belong to at least two consecutive lines or columns of pixels of the input picture.
The calculation of the distance representative of the edge orientation is carried out on a single point of the zone of the input picture to reduce the number of calculations. This point is thus advantageously situated at the centre of said zone.
According to the invention, the distance representative of the edge orientation at a point of the input picture depends on a local gradient of video components of the input picture at said point.
According to a particular embodiment, if the calculated distance exceeds a predefined maximum value, the distance is capped at said maximum value.
According to another particular embodiment, if the calculated distance exceeds a predefined maximum value, the distance is adjusted to a null value.
According to the invention, the value of the pixel of the output picture is advantageously determined by bilinear interpolation of pixels of the second set.
The invention also relates to a device for format conversion of an input picture into an output picture, the input picture and the output picture comprising the pixels, comprising:
The invention will be better understood upon reading the following description, provided as a non-restrictive example and referring to the annexed drawings wherein:
The method of the invention is used to carry out a format conversion of an input picture. The method produces an output picture having a format different to that of the input picture, the output picture comprising a larger or smaller number of lines and/or columns than the input picture. The lines and/or columns of the output picture are generated by interpolation of the lines and/or columns of the input picture.
According to the invention, the interpolation is an “edge orientated interpolation”. This interpolation is more particularly realised when based on a detection of local edge orientations in the input picture. This detection is carried out by calculating the distances representative of the orientation of these edges. An example of the calculation of this distance will be described later in the description.
An example of the calculation of the representative distance d of the edge orientation at a point of the input picture is described hereafter. This distance d, is calculated at a point P of the input picture as shown in
where I(x,y) designates the luminance component of the pixel at coordinates (x,y).
The distance d to point P is then taken to equal:
The distance d calculated for the pixel P is shown in
According to the invention, this distance d calculated at the point P is used to determine the value of all the pixels of the output picture belonging to the zone of the output picture corresponding to the zone delimited by the points at coordinates (xm,yn), (xm+1,yn), (xm+1,yn+1) and (xm,yn+1) in the input picture. It should be noted that the coordinates (xm,yn), (xm+1,yn), (xm+1,yn+1), (xm,yn+1) are the coordinates in the input picture. The output picture having a number of columns and a number of lines different to those of the input picture, the pixels delimiting this zone have different coordinates in the output picture. Naturally it is possible to extend this zone to any zone surrounding the point P having a different size and/or form.
The format conversion modifies the number and position of lines and/or columns of pixels of the picture. For example, in the output picture, a line y″ of pixels is present between the lines yn and yn+1 of the input picture and a column of pixels x″ is present between the columns of pixels xm and xm+1 of the input picture. A pixel Pout of the output picture is present at the intersection of the line y″ and the column x″ as shown in
With reference again to
A vertical phase φ=|y″−yn| representing the distance between the line y″ of the pixel Pout and the line yn of the input picture is defined. φ is comprised between 0 and 1.
The columns of pixels xm and xm+1 of the input picture closest to the column xout are also determined.
X″=r·xout
xm=E(x″)
A horizontal phase θ=x″−xm−0.5 representing the distance between the line x′ of the pixel Pout and the pixel P of the input picture is defined. θ is comprised between −0.5 and 0.5.
If the line segment S is translated horizontally so that it passes through the pixel Pout, this segment then links the point at coordinates (xyn,yn) and the point at coordinates (xyn+1,yn+1).
Then xyn=x″+2·φ·d+θ=x″+dsup and xyn+1=x″+2·(1−φ)·d+θ=x″−dinf are obtained.
It should be noted that |dinf+dsup|=2d.
The second set of pixels determined at step 120 is then for example:
In the example of
With reference again to
α=|E(xyn)−xyn|
β=|E(xyn+1)−xyn+1|
The value of the pixel P at coordinates (xout, yout) in the output picture is then calculated using the following formula that corresponds to a bilinear interpolation:
Iout(xout,yout)=(1−φ)·└(1−α)·Iin(E(xyn), yn)+α·Iin(E(xyn)+sgn(d), yn)┘+φ[(1−β)·Iin(E(xyn+1), yn+1)+β·Iin(E(xyn+1)+sgn(d), yn+1)]
The calculation is carried out on each video component of the pixel. The distance d calculated for the luminance video component is used for the calculation of all the video components of the pixel from the output picture.
Other formulas for the calculation of the video components of pixels of the output picture can be considered. In the example above, the interpolation is bilinear. The use of a linear interpolation or an interpolation by selection of the closest neighbour or a combination of linear or bilinear interpolations can be envisaged. These interpolations are well known by those in the profession. Use of a greater number of lines or columns can also be envisaged. Finally changing the interpolation formula according to the distance module d can be provided for.
The filter used is for example the following 5×5 coefficients filter:
In the method of the invention previously defined, the distance calculated at step 110 or 510 is used as is to determine the pixels of the second set. As a variant, if the distance calculated exceeds a predefined maximum value, this distance is capped at this maximum value. This maximum value is for example equal to 4 pixels for an SD (Standard Definition) picture. In fact, beyond this maximum value, the value of the distance d cannot be low thus introducing errors into the interpolation. According to another variant, if the distance calculated exceeds this maximum value, the distance is adjusted to a null value. This second variant enables even further reduction of the risk of errors while being however less precise in the interpolation of some edges.
Naturally, the invention is not limited to the aforementioned embodiments. Any format conversion method or device in which distances representative of the edge orientations are calculated at points of the input picture independently of the grid of the output picture, each point being associated with a zone of the picture, and in which the values of pixels in a zone of the output picture are calculated from the distance calculated for this zone and from the position of these output pixels with respect to the point of the input picture falls into the field of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
0759829 | Dec 2007 | FR | national |
Number | Name | Date | Kind |
---|---|---|---|
5532751 | Lui | Jul 1996 | A |
6122017 | Taubman | Sep 2000 | A |
6771835 | Han et al. | Aug 2004 | B2 |
7023487 | Adams | Apr 2006 | B1 |
7075580 | Jiang | Jul 2006 | B2 |
7116372 | Kondo et al. | Oct 2006 | B2 |
7126643 | Song et al. | Oct 2006 | B2 |
7242412 | Sellers | Jul 2007 | B2 |
7259794 | Chang et al. | Aug 2007 | B2 |
7319797 | Hung | Jan 2008 | B2 |
7321400 | Chou et al. | Jan 2008 | B1 |
7379625 | Wang et al. | May 2008 | B2 |
7379626 | Lachine et al. | May 2008 | B2 |
7382929 | Cooper | Jun 2008 | B2 |
7397973 | Lee | Jul 2008 | B2 |
7403234 | Lertrattanapanich et al. | Jul 2008 | B2 |
7406208 | Chiang | Jul 2008 | B2 |
7412096 | Neuman et al. | Aug 2008 | B2 |
7414671 | Gallagher et al. | Aug 2008 | B1 |
7440016 | Keshet et al. | Oct 2008 | B2 |
7474789 | Wang et al. | Jan 2009 | B2 |
7519232 | Neuman et al. | Apr 2009 | B2 |
7548275 | Honda | Jun 2009 | B1 |
7554559 | Lu et al. | Jun 2009 | B2 |
7590307 | Wang et al. | Sep 2009 | B2 |
7782401 | Chou | Aug 2010 | B1 |
7876979 | Lee et al. | Jan 2011 | B2 |
7945120 | Tanaka | May 2011 | B2 |
8139153 | Kervec et al. | Mar 2012 | B2 |
20030206667 | Wang et al. | Nov 2003 | A1 |
20040119884 | Jiang | Jun 2004 | A1 |
20050073607 | Ji et al. | Apr 2005 | A1 |
20050122426 | Winger et al. | Jun 2005 | A1 |
20060268168 | Au et al. | Nov 2006 | A1 |
20070052845 | Adams | Mar 2007 | A1 |
20080024658 | Wang et al. | Jan 2008 | A1 |
20080068498 | Chang et al. | Mar 2008 | A1 |
20080231746 | Lertrattanapanich et al. | Sep 2008 | A1 |
20090207304 | Kervec | Aug 2009 | A1 |
20090214132 | Yamada et al. | Aug 2009 | A1 |
Number | Date | Country |
---|---|---|
1 313 312 | May 2003 | EP |
2 422 264 | Jul 2006 | GB |
Number | Date | Country | |
---|---|---|---|
20090167936 A1 | Jul 2009 | US |