Method for motion estimation using a low-bit edge image

Description

BACKGROUND OF THE INVENTION

1. Field of Invention

The present invention relates to a method for a motion estimation algorithm. More particularly, the present invention relates to a method for a motion estimation algorithm based on block matching. The invention uses a low bit resolution oriented edge image, by which appropriate difference blocks for DCT coding are generated and encoding efficiency is enhanced.

2. Description of Related Art

Block-based motion estimation (ME) have been adopted to eliminate temporal redundancy of image sequence by MPEP1/2 and H.261, matched difference block for DCT coding is obtained under certain matching criterion. Due to high operation load of full search (FS), ME has been the main bottleneck in real time image encoding system, many fast ME algorithms have been developed to reduce the computation cost.

In most ME algorithms, sum of absolute difference (SAD) of luminance pixel is used as searching criterion, and results in the best matched difference block with global or local minimum energy, i.e., minimum direct current (DC) coefficients after DCT, but the alternating current (AC) coefficients might be large and encoding efficiency is reduced.

SUMMARY OF THE INVENTION

The invention provides a method for a motion estimation algorithm. The motion estimation algorithm using low bit resolution integrated edge image (LRIEI) instead of luminance image to obtain difference block with small AC coefficients, edge image created by Sobel filters is employed to improve encoding quality, on the other hand, operation cost is reduced to low bit resolution.

The invention also provides a method for a motion estimation algorithm, in which a new algorithm using low-bit resolution oriented edge image (LROEI) is described in this invention. Using low-bit resolution oriented edge in motion estimation can result in flatter image blocks which is in demanded by texture-compress unit (such as DCT), as a result, the encoding efficiency is improved, and the operation load is reduced by low-bit resolution.

As embodied and broadly described herein, the invention provides a method for motion estimation. In the method, an oriented edge image is generaged by an edge-detect-filtering process in according with contour and texture of an original luminance image. An subsampled edge image is generated by performing an low-bit resolution oriented edge image process upon the edge image. A wide-range search is performed upon the subsampled edge image to generate a motion vector. A small-range full search is performed upon the generated motion vector to obtain a final motion vector by using decoded luminance pixels.

In the above-mentioned method for motion estimation, the edge-detect-filtering process including performing a x-direction filtering process and a y-direction filtering process to generate a x-direction oriented edge pixels and a y-direction oriented edge pixels; and generating the oriented edge image by adding the x-direction edge pixels and the y-direction edge pixels.

In the above-mentioned method for motion estimation, the x-direction edge pixels and the y-direction edge pixels are generated in accordance with following equations, respectively:

- where Cgx/Cgy and Rgx/Rgy are Gx/Gy edge intensity of a current block and a reference block respectively;
  
  SADgxy(i, j)=SADgx+SADgy=x+y

SADgxy is the sum of absolute difference of the original luminance image and is used for a search criterion.

In the above-mentioned method for motion estimation, the low-bit resolution oriented edge image process including performing a non-uniform quantization process upon the edge image. The non-uniform quantization process including providing a plurality of quantization levels in accordance with a predetermined bit-length; and quantizing the x-direction edge pixels and the y-direction edge pixels in accordance with the quantization levels to generate the subsampled edge image. The bit-length is depended upon encoding quality, the more the bit-length, the better the encoding quality. The bit-length can be 8-bit, 6-bit, or 4-bit, for example.

As embodied and broadly described herein, the invention provides a method for motion estimation. In the method for motion estimation, an oriented edge image is generated by an edge-detect-filtering process in according with contour and texture of an original luminance image. The edge-detect-filtering process including performing a x-direction filtering process and a y-direction filtering process to generate a x-direction oriented edge pixels and a y-direction oriented edge pixels; and generating the oriented edge image by adding the x-direction edge pixels and the y-direction edge pixels. In the method, an subsampled edge image is generated by performing an low-bit resolution oriented edge image process upon the edge image. The low-bit resolution oriented edge image process including performing a non-uniform quantization process upon the edge image. The non-uniform quantization process including providing a plurality of quantization levels in accordance with a predetermined bit-length, wherein the bit-length is depended upon encoding quality, the more the bit-length, the better the encoding quality; and quantizing the x-direction edge pixels and the y-direction edge pixels in accordance with the quantization levels to generate the subsampled edge image. In the method further includes using the subsampled edge image to generate a motion vector for the original luminance image.

In the above-mentioned method for motion estimation, the x-direction edge pixels and the y-direction edge pixels are generated in accordance with following equations, respectively:

- where Cgx/Cgy and Rgx/Rgy are Gx/Gy edge intensity of a current block and a reference block respectively;
  
  SADgxy(i, j)=SADgx+SADgy=x+y

SADgxy is the sum of absolute difference of the original luminance image and is used for a search criterion.

It is to be understood that both the foregoing general description and the following detailed description are exemplary, and are intended to provide further explanation of the invention as claimed.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings are included to provide a further understanding of the invention, and are incorporated in and constitute a part of this specification. The drawings illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention. In the drawings,

FIG. 1 showing an average percentage statistical distribution of the an edge-detect-filtering process in according with contour and texture of an original luminance image of a preferred embodiment of the invention; and

FIG. 2A is a diagram of a conventional method for a motion estimation algorithm;

FIG. 2B is a diagram of a preferred embodiment of a method for an algorithm using low bit resolution oriented edge image in the invention; and

FIG. 3 is a diagram of a preferred embodiment of a hierarchical subsampling telescopic search using the motion estimation algorithm of the invention.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The invention also provides a method for a motion estimation algorithm, in which a new algorithm using low bit resolution oriented edge image (LROEI) is described in this paper. Compared with LRIEI, LROEI is more accurate to denote the feature of objects in image. As a result, encoding quality is improved further. As LRIEI, low bit resolution guarantees low hardware cost. The algorithm is regular for very large scaled integrated circuit (VLSI) realization and easily combined with other ME algorithm such as a hierarchical subsampling telescopic search. Mismach between DCT and conventional ME

DCT, the texture-compression unit of H.261 and MPEG 1/2, transforms 8×8 pixel block to corresponding frequency domain and packs energy in low-frequency coefficients (LFC). Flatter the block (fewer edge points in the block), that is more energy concentrates in LFC, results in fewer bit amount of quantified AC coefficients, so that encoding efficiency is improved.

However, the conventional ME algorithm using sum of absolute difference (SAD) of luminance pixel only guarantee minimum DC coefficient, but not the flattest one. As shown in Table 1 as followed:

TABLE 1

Table 1 shows two 6*6 luminance difference image M & N, and two 4*4 extracted centro-blocks M₀& N₀as sample:

M₀(1:4, 1:4)=M(2:5, 2:5)
N₀(1:4, 1:4)=N(2:5, 2:5)

The sums of M₀& N₀are 48 and 336 respectively. Apparently, conventional ME algorithms select block M₀as the best match, but the DCT/Q results in Table 2 shows that block N₀requires less encoding bit amount and should be selected as the best match.

TABLE 2

DCT/Q

Results of M₀& N₀

embedded image

Edge Image and Sobel Filter

Edge image, which is employed for ME to search flatter block, includes both contour and texture of objects and denotes high frequency component of the image. Sobel edge-detect-filters are engaged to create edge image due to smoothness and low noise. Sobel filters are composed of both x-direction and y-direction filters Sx and Sy respectively.

$\begin{matrix} S_{y} = [\begin{matrix} \begin{matrix} 1 \\ 0 \end{matrix} \\ - 1 \end{matrix}] \times [\begin{matrix} \begin{matrix} 1 \\ 2 \end{matrix} \\ 1 \end{matrix}] = [\begin{matrix} - 1 & - 2 & - 1 \\ 0 & 0 & 0 \\ 1 & 2 & 1 \end{matrix}] & (1) \\ S_{x} = [\begin{matrix} \begin{matrix} 1 \\ 0 \end{matrix} \\ - 1 \end{matrix}] \times [\begin{matrix} \begin{matrix} 1 \\ 2 \end{matrix} \\ 1 \end{matrix}] = [\begin{matrix} - 1 & 0 & 1 \\ - 2 & 0 & 2 \\ - 1 & 0 & 1 \end{matrix}] & (2) \end{matrix}$

Gy=Filter2(Sy, L) (3)
Gx=Filter2(Sx, L) (4)

where Filter2 is 2D digital filter, L is 8-bit luminance image. Gx and Gy are 11 bit x-direction and y-direction edge intensity respectively. For LRIEI, Sobel edge intensity is integration of Gx and Gy:

G=|Gx|+|Gy| (5)
Gx and Gy are used respectively (oriented edge intensity) to describe the edge feature in detail. For example, according to equation (5), (Gx, Gy)=(1,3) or (−3,1) will result in same integrated edge intensity G. Obviously, using oriented edge intensity Gx and Gy respectively are more accurate for ME. To extract the oriented edge intensity of one pixel, 8 adjacent pixels and 8 addition operations are needed.

Due to oriented edge intensity is employed, the function of ME should be

$\begin{matrix} x = \sum_{k = 0}^{N - 1} \sum_{l = 0}^{N - 1} \langle C g x (x + k, y + 1) - R g x (x + i + k, y + j + l) \rangle \\ y = \sum_{k = 0}^{N - 1} \sum_{l = 0}^{N - 1} \langle C g y (x + k, y + 1) - R g y (x + i + k, y + j + l) \rangle \\ S A D g x = \sum_{k = 0}^{N - 1} \sum_{l = 0}^{N - 1} \langle C g x - R g x (i, j) \rangle \\ S A D g y = \sum_{k = 0}^{N - 1} \sum_{l = 0}^{N - 1} \langle C g y - R g y (i, j) \rangle \\ S A D g x y (i, j) = S A D g x + S A D g y \end{matrix}$

Where Cgx/Cgy and Rgx/Rgy are Gx/Gy edge intensity of current and reference block respectively, SADgxy is the SAD of them and used as searching criterion. For above example, the SADgxy of block M₀, N₀are 436 and 146, therefore, the flatter block N₀is selected.

LROEI & Non-Uniform Quantization

LROEI is generated by non-uniform quantization (NUQ) to reduce operation load at the minimal quality degradation. The average percentage statistical distribution (shown in Table 3 and FIG. 1) of the Sobel processed oriented edge intensity is done first for five benchmarked video sequences (flower), FTBL (football) and MBCL (mobile & Calendar). Each one is composed with 60 fields.

The data in Table 3 and FIG. 1 show the average percentage distribution of pixels. For example, 1.4% x-direction edge pixels (Gx) are less than −2⁸(−256), and 3.2% Gx are between −2⁸and −2⁷.

TABLE 3

The average distribution (%) of edge intensity

Val
−2⁸
−2⁷
−2⁶
−2⁵
−2⁴
−2³
−2²
−2⁰
2²
2³
2⁴
2⁵
2⁶
2⁷
2⁸
2¹⁰

Gx
1.4
3.2
5.2
6.9
7.9
8.4
6.9
9.6
10
7.4
8.9
7.9
6.6
4.9
3
1.5

Gy
4.6
5.7
7.2
8.3
8.1
6.2
4.1
5.5
6.8
4.6
6.5
7.9
7.9
6.5
5.3
4.8

In the proposed algorithm, the mail goal of quantization is to make the quantized pixels distribute as uniformly as possible in all levels. Obviously, NUQ performs better than uniform quantization because edge pixels distribute non-uniform. Unlike luminance pixels, most edge pixels distribute in low bits. If 7 bit least significant bits of Gx are truncated directly to achieve 4-bit resolution quantization, 48% pixels (that between 0 and 2⁷) will be quantized to 0000, and the only 3% pixels distribute in other 14 levels.

FIG. 1 shows if logarithm axis used for Gx/Gy, the edge pixels distribute evenly. In order to study how the bit-length influence encoding quality, different bit-length NUQ shemes are designed based on the logarithm distribution. 8-Bit NUQ

More quantization levels are, more information are kept, to guarantee encoding quality, 8-bit NUQ scheme is designed (4-bit for Gx and 4-bit for Gy) as in Table 4:

Table 4

GGx/GGy
Corresponding

Levels
Gx/Gy

0
−2¹⁰to −2⁸

1
−2⁸to −2⁷

2
−2⁷to −2⁶

3
−2⁶to −2⁵

4
−2⁵to −2⁴

5
−2⁴to −2³

6
−2³to −2²

7
−2²to 0

8
0 to 2²

9
2²to 2³

10
2³to 2⁴

11
2⁴to 2⁵

12
2⁵to 2⁶

13
2⁶to 2⁷

14
2⁷to 2⁸

15
2⁸to 2¹⁰

Where Gx/Gy are 11-bit-signed oriented edge pixel, and GGx/Ggy are quantized 4-bit oriented edge pixels. The quantization schemes for Gx and Gy are same.

6-Bit NUQ

To reduce the computation load, more redundance of edge pixel are Q scheme is designed (3 bit for Gx and 3 bit for Gy) as in Table 5:

TABLE 5

Ggx/Ggy Level
Corresponding Gx
Corresponding Gy

0
−2¹⁰to −2⁶
−2¹⁰to 2⁷

1
−2⁶to −2⁴
−2⁷to −2⁵

2
−2⁴to −2²
−2⁵to −2³

3
−2²to 0
−2³to 0

4
0 to 2²
0 to 2³

5
2²to 2⁴
2³to 2⁵

6
2⁴to 2⁶
2⁵to 2⁷

7
2⁶to 2¹⁰
2⁷to 2¹⁰

The quantization schemes for Gx and Gy in Gxy6 are different due to the different distribution of Gx and Gy shown in Table 5.

4-Bit NUQ

Based on Gxy6, a more contracted 4-bit NUQ scheme is designed (2 bits for Gx and 2 bits for Gy) as in Table 6:

TABLE 6

GGx/GGy

Level
Corresponding Gx
Corresponding Gy

0
−2¹⁰to −2⁴
−2¹⁰to −2⁵

1
−2⁴to 0
−2⁵to 0

2
0 to 2⁴
0 to 2⁵

3
2⁴to 2¹⁰
2⁵to 2¹⁰

The quantization schemes for Gx and Gy are also different. Combined ME Algorithm

Hierarchical subsampling telescopic search (HSTS), as shown in FIG. 3, is combined to achieve large range search with low operation load. The HSTS composes two steps:

(1). Wide-range search is processed in the first step, subsampling telescope algorithm using LROEI is employed to find a coarse motion vector (MV). 4:1 subsampling reduces both block size and search area, telescope search based on temporal correlation of motion is adopted to reduce operation load further.
(2). To refine the results, in the second step, small range FS is processed over centering on the MV obtained from the first step, decoded luminance pixels are used.

Simulation and Comparison

Five sets of CCIR601 video sequences are used for simulation, each set is composed of 28 frames/56 fields (720*480 pixels/frame, 30 frames/sec.). Only luminance component is considered. Field prediction is adopted, M=2. The maximal possible search range of HSTS is 210 pixels in horizontal direction and 56 pixels in vertical direction. Constant quantization matrixes are used to generate bit-stream.

Combined with HSTS, five scheme using different images are studied:

L8: 8 bit luminance image.
UQL4: 4-bit uniform quantized luminance image (truncating 4 bit LSB of L8)
Gxy8: 8-bit LROEI
Gxy6: 6-bit LROEI
Gxy4: 4-bit LROEI
The PSNR simulation results of five schemes are listed in Table 7.

TABLE 7

PSNR Comparison of Various Algorithm Scheme

L8
UQL4
Gxy8
Gxy6
Gxy4

BUS
30.158
30.082
30.239
30.243
30.228

Cher
29.065
29.058
29.122
29.118
29.114

Flower
29.256
29.234
29.289
29.289
29.285

Ftbl
35.0222
34.933
35.005
35.009
35.001

Mbcl
27.411
27.401
27.454
27.451
27.445

Avg.
30.183
30.142
30.222
30.222
30.125

Compared with L8, the performance of LROEI algorithm is improved, and the PSNR is not sensitive to bit-length of the oriented edge pixels, the results of Gxy8, Gxy6 and Gxy4 are nearly same. Taking notice of that thee PSNR of UQL4 is the lowest one among Table 7. The results show that oriented edge image are superior to luminance image, even 4-bit oriented edge image can achieve better encoding quality.

A new ME algorithm using LROEI is proposed in the invention. Simulation results show that LROEI are appropriate to search MV and difference block for DCT-based video compressing system. Combined with HSTS, Gxy8/6/4 can improve encoding efficiency. The hardware cost, bandwidth and power consumption of motion estimation are reduced due to low-bit resolution. It is very easy to be combined with other fast ME algorithm. Therefore, this algorithm is useful for real-time wide range search video encoding applications such as HDTV. The hardware can be modified through reducing the bit number directly and changing the computation components slightly, only Sobel filter and NUQ are required.

It will be apparent to those skilled in the art that various modifications and variations can be made to the structure of the present invention without departing from the scope or spirit of the invention. In view of the foregoing, it is intended that the present invention cover modifications and variations of this invention provided they fall within the scope of the following claims and their equivalents.

Claims

1. A method for motion estimation, comprising: generating an oriented edge image by an edge-detect-filtering process in according with contour and texture of an original luminance image;generating a subsampled edge image by performing a low-bit resolution oriented edge image process upon the edge image;performing a wide-range search upon the subsampled edge image to generate a motion vector;performing a small-range full search upon the generated motion vector to obtain a final motion vector by using decoded luminance pixels.
2. The method for motion estimation of claim 1, wherein the edge-detect-filtering process comprising: performing a x-direction filtering process and a y-direction filtering process to generate a x-direction oriented edge pixels and a y-direction oriented edge pixels; andgenerating the oriented edge image by adding the x-direction edge pixels and the y-direction edge pixels.
3. The method for motion estimation of claim 1, wherein the x-direction edge pixels and the y-direction edge pixels are generated in accordance with the following equations, respectively:
4. The method for motion estimation of claim 3, wherein the low-bit resolution oriented edge image process including performing a non-uniform quantization process upon the edge image, and the non-uniform quantization process including: providing a plurality of quantization levels in accordance with a predetennined bit-length; andquantizing the x-direction edge pixels and the y-direction edge pixels in accordance with the quantization levels to generate the subsampled edge image.
5. The method for motion estimation of claim 3, wherein the bit-length is 8-bit.
6. The method for motion estimation of claim 3, wherein the bit-length is 6-bit.
7. The method for motion estimation of claim 3, wherein the bit-length is 4-bit.
8. The method for motion estimation of claim 3, wherein the bit-length is depended upon encoding quality, wherein the more the bit-length, the better the encoding quality.
9. A method for motion estimation, comprising: generating an oriented edge image by an edge-detect-filtering process in according with contour and texture of an original luminance image, the edge-detect-filtering process including: performing AN x-direction filtering process and a y-direction filtering process to generate AN x-direction oriented edge pixels and a y-direction oriented edge pixels; andgenerating the oriented edge image by adding the x-direction edge pixels and the y-direction edge pixels;generating A subsampled edge image by performing A low-bit resolution oriented edge image process upon the edge image, the low-bit resolution oriented edge image process including performing a non-uniform quantization process upon the edge image, and the non-uniform quantization process including:providing a plurality of quantization levels in accordance with a predetermined bit-length, wherein the bit-length is depended upon encoding quality, the more the bit-length, the better the encoding quality.; andquantizing the x-direetion edge pixels and the y-direction edge pixels in accordance with the quantization levels to generate the subsampled edge image; andusing the subsampled edge image to generate a motion vector for the original luminance image.
10. The method for motion estimation of claim 9, wherein the x-direction edge pixels and the y-direction edge pixels are generated in accordance with the following equations, respectively:

US Referenced Citations (5)

Number	Name	Date	Kind
5512956	Yan	Apr 1996	A
6370279	Paik	Apr 2002	B1
6961466	Imagawa et al.	Nov 2005	B1
20030142750	Oguz et al.	Jul 2003	A1
20040066964	Neubauer et al.	Apr 2004	A1

Related Publications (1)

	Number	Date	Country
	20040081240 A1	Apr 2004	US

Method for motion estimation using a low-bit edge image

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

US Classifications

Field of Search

US

International Classifications

Term Extension

Abstract

Description

Claims

US Referenced Citations (5)

Related Publications (1)