Certain embodiments of the invention relate to communications. More specifically, certain embodiments of the invention relate to a method and system for mitigating motion trail artifacts and improving low contrast contours in temporal filtering based noise reduction.
Analog video may be received through broadcast, cable, and VCRs. The reception is often corrupted by noise, and therefore to improve the visual quality, noise reduction may be needed. Digital video may be received through broadcast, cable, satellite, Internet, and video discs. Digital video may be corrupted by noise, which may include coding artifacts, and to improve the visual quality, noise reduction may be beneficial. Various noise filters have been utilized in video communication systems such as set top boxes and TVs. However, inaccurate noise characterization, especially during scenes with motion, may result in artifacts caused by the filtering, which are more visually detrimental than the original noise.
In video system applications, random noise present in video signals, such as NTSC or PAL analog signals, for example, may result in images that are less than visually pleasing to the viewer. To address this problem, noise reduction (NR) operations may be utilized to remove or mitigate the noise present.
Further limitations and disadvantages of conventional and traditional approaches will become apparent to one of skill in the art, through comparison of such systems with some aspects of the present invention as set forth in the remainder of the present application with reference to the drawings.
A system and/or method is provided for mitigating motion trail artifacts and improving low contrast contours in temporal filtering based noise reduction, substantially as shown in and/or described in connection with at least one of the figures, as set forth more completely in the claims.
These and other advantages, aspects and novel features of the present invention, as well as details of an illustrated embodiment thereof, will be more fully understood from the following description and drawings.
Certain embodiments of the invention may be found in a system and/or method for mitigating motion trail artifacts and improving low contrast contours in temporal filtering based noise reduction. Aspects of the method and/or system may comprise one or more processors and/or circuits that are operable to generate one or more noise reduced current output pictures of video data utilizing a generated motion compensated blended picture of video data and a corresponding generated non-motion compensated blended picture of the video data. In this regard, a motion compensated blended picture of video data may be generated utilizing one or more motion compensated partitioned windows of picture data. Also, a corresponding non-motion compensated blended picture of video data may be generated utilizing one or more non-motion compensated partitioned windows of the picture data. In this regard, one or more of windows of picture data from a first picture of video data and one or more motion compensated windows of picture data from a motion compensated previously generated output picture of video data may be partitioned. The partitioned widows of picture data from the first picture of the video data may be compared with corresponding ones of the partitioned motion compensated windows of picture data from the motion compensated previously generated output picture of video data. During temporal filtering based noise reduction, the first picture of video data may be blended with the motion compensated previously generated output picture of the video data based on the comparison. In this regard, a blending factor may, for example, be shifted based on the comparison. The shifted blending factor may enable a reduction in motion trail artifacts in the blended first picture of video data with the motion compensated previously generated motion compensated output picture of video data in temporal filtering based noise reduction.
Furthermore, during temporal filtering based noise reduction, one or more non-motion compensated windows of picture data from a motion compensated previously generated output picture of the video data may be partitioned. The partitioned widows of picture data from the first picture of video data may be compared with corresponding ones of the partitioned non-motion compensated windows of picture data from the motion compensated previously generated output picture of the video data. In this regard, the first picture of video data may be blended without utilizing motion compensation with the motion compensated previously generated output picture of video data based on the comparison. Accordingly, a blending factor may be shifted based on the comparison. The shifted blending factor may, for example, enable a reduction in motion trail artifacts in the non-motion compensated blended first picture of video data with the motion compensated previously generated output picture of video data.
One or more non-motion compensated windows of picture data from a non-motion compensated previously generated output picture of the video data may be partitioned. The partitioned widows of picture data from the first picture of video data may be compared with corresponding ones of the partitioned windows of picture data from the non-motion compensated previously generated output picture of the video data. In this regard, the first picture of video data may be blended without utilizing motion compensation with the non-motion compensated previously generated output picture of video data based on the comparison. A blending factor may be shifted based on the comparison. The shifted blending factor may, for example, enable a reduction in motion trail artifacts in the non-motion compensated blended first picture of video data with the non-motion compensated previously generated output picture of video data of temporal filtering based noise reduction.
A blending factor may be determined for blending the blended first picture of video data and a corresponding motion compensated previously generated output picture of the video data with the blended first picture of the video data and the non-motion compensated previously generated output picture of the video data. The determined blending factor may be utilized when blending the blended pictures of video data.
The video processing block 102 may be enabled to receive a video input stream and, in some instances, to buffer at least a portion of the received video input stream in the input buffer 112. In this regard, the input buffer 112 may comprise suitable logic, circuitry, and/or code that may be enabled to store at least a portion of the received video input stream. Similarly, the video processing block 102 may be enabled to generate a filtered video output stream and, in some instances, to buffer at least a portion of the generated filtered video output stream in the output buffer 114. In this regard, the output buffer 114 may comprise suitable logic, circuitry, and/or code that may be enabled to store at least a portion of the filtered video output stream.
The filter 116 in the video processing block 102 may comprise suitable logic, circuitry, and/or code that may be enabled to perform an IIR filtering operation with noise reduction (IIR-NR) on the current pixel. In this regard, the filter 116 may be enabled to operate in a plurality of filtering modes, where each filtering mode may be associated with one of a plurality of supported filtering operations. The filter 116 may utilize video content, filter coefficients, threshold levels, and/or constants to generate the filtered video output stream in accordance with the filtering mode selected. In this regard, the video processing block 102 may generate blending factors to be utilized with the appropriate filtering mode selected. One or more of the blending factors may be adjusted to reduce unwanted video processing effects. For example, motion trails and/or smearing distortions that may be artifacts of excessive filtering of images comprising low contrast contours, for example, images comprising areas of flesh tones in a human face, may be reduced. The registers 110 in the video processing block 102 may comprise suitable logic, circuitry, and/or code that may be enabled to store information that corresponds to filter coefficients, threshold levels, and/or constants, for example. Moreover, the registers 110 may be enabled to store information that corresponds to a selected filtering mode.
The processor 104 may comprise suitable logic, circuitry, and/or code that may be enabled to process data and/or perform system control operations. The processor 104 may be enabled to control at least a portion of the operations of the video processing block 102. For example, the processor 104 may generate at least one signal to control the selection of the filtering mode in the video processing block 102. Moreover, the processor 104 may be enabled to program, update, and/or modify filter coefficients, threshold levels, and/or constants in at least a portion of the registers 110. For example, the processor 104 may generate at least one signal to retrieve stored filter coefficients, threshold levels, and/or constants that may be stored in the memory 106 and transfer the retrieved information to the registers 110 via the data/control bus 108. The memory 106 may comprise suitable logic, circuitry, and/or code that may be enabled to store information that may be utilized by the video processing block 102 to reduce noise in the video input stream. The processor 104 may also be enabled to determine noise levels for a current video picture based on an early-exit algorithm (EEA) or an interpolation estimate algorithm (IEA), for example. The memory 106 may be enabled to store filter coefficients, threshold levels, and/or constants, for example, to be utilized by the video processing block 102. U.S. application Ser. No. 11/313,592 filed Dec. 20, 2005, provides a detailed description of the early-exit algorithm (EEA) and the interpolation estimate algorithm (IEA), and is hereby incorporated by reference in its entirety.
In operation, the processor 104 may select a filtering mode of operation and may program the selected filtering mode into the registers 110 in the video processing block 102. Moreover, the processor 104 may program the appropriate values for the filter coefficients, threshold levels, and/or constants into the registers 110 in accordance with the selected filtering mode. The video processing block 102 may receive the video input stream and may filter pixels in a video picture in accordance with the selected filtering mode. In some instances, the video input stream may be stored in the input buffer 112 before processing. The video processing block 102 may generate the appropriate blending factors needed to perform the noise reduction filtering operation selected by the processor 104. The video processing block 102 may generate the filtered video output stream after performing the noise reduction filtering operation. In some instances, the filtered video output stream may be stored in the output buffer 114 before being transferred out of the video processing block 102.
Pixels in consecutive video pictures are said to be collocated when having the same picture location, that is, . . . , Pn−1(x,y), Pn(x,y), Pn+1(x,y), . . . , where Pn−1 indicates a pixel value in the previous video picture 202, Pn indicates a pixel value in the current video picture 204, Pn+1 indicates a pixel value in the next video picture 206, and (x,y) is the common picture location between pixels. As shown in
Operations of the video processing block 102 in
The IIR-NR block 318 may comprise suitable logic, circuitry, and/or code that may be enabled to IIR filter the current pixel, Pn. The IIR-NR block 318 may also be enabled to generate an IIR-blended current pixel given by the expression:
P′n,out
where the IIR blending factor, αiir, controls the contribution of the previously generated output picture Out(−1) to the IIR-blended current pixel. The delay block 320 may comprise suitable logic, circuitry, and/or code that may be enabled to delay by one video picture the transfer of the recursive feedback from the output of the IIR-NR block 318 to the MM calculation block 312b and to the input of the IIR-NR block 318. In this regard, both the MM calculation block 312b and the IIR-NR block 318 may utilize a recursive feedback operation based on the previously generated output picture Out(−1) or a previously generated non-MC output signal Out_nmc(−1).
In operation, the current pixel, Pn and the previously generated non-MC output signal Out_nmc(−1) may be received by the MM calculation block 312b and the IIR-NR block 318. The MM calculation block 312b may generate the IIR blending factor, αiir. The IIR-NR block 318 may IIR filter the current pixel, Pn, and may utilize the current pixel and the previously generated output picture Out(−1) to perform the operation described by equation (3).
Motion-adaptive IIR filtering methods may achieve significant noise reduction but may result in artifacts such as motion trails and/or blurring of moving objects. To avoid these motion artifacts, IIR noise reduction operations may be configured conservatively, limiting, in some instances, the ability to reduce noise components.
The video converter 532 may comprise suitable logic, circuitry and/or code that may be enabled to receive video data from a video source in YCbCr 4:2:2 format, for example. The video converter 532 may be enabled to convert the received video data to YCbCr in the 4:4:4 format before motion estimation and motion compensation operations are performed to facilitate motion estimation and motion compensation of chroma components. The chroma samples may be interpolated to the same sample density as the luma samples. The 4:2:2 to 4:4:4 interpolation may utilize a 4-tap filter, for example. The even indexed 4:4:4 chroma pixels may be generated from the half-value indexed 4:2:2 chroma samples. The odd indexed 4:4:4 chroma samples may be interpolated using four 4:2:2 chroma samples, for example, two to the left and two to the right of the current position.
The MC filter block 522 may comprise suitable logic, circuitry and/or code that may be enabled to perform motion compensation, motion estimation, and temporal filtering operation on the incoming video data. The MC filter block 522 may be enabled to receive a previous input picture In(−1), a previous output picture Out(−1) from memory 526, and a current input picture In(0) from the video converter 532. The MC filter block 522 may be enabled to utilize the received previous input picture In(−1), a previous output picture Out(−1) from memory 526, and a current input picture In(0) from the video converter 532 to generate a current output signal Out_mc(0) to the blend block 518 and the blend calculation block 520.
The non-MC filter block 524 may comprise suitable logic, circuitry and/or code that may be enabled to perform motion adaptive temporal filtering (MATF). The non-MC filter block 524 may comprise an IIR filter. The non-MC filter block 524 may be enabled to receive a current input picture In(0) from the video converter 532. The non-MC filter block 524 may be enabled to receive the previous input picture In(−1), and a previous output picture Out(−1) from the memory 526. The previous output picture Out(−1) may be recursively fed back to the non-MC filter block 524. The non-MC filter block 524 may be enabled to utilize the received previous input picture In(−1), a previous output picture Out(−1), and a current input picture In(0) to generate an output signal Out_nmc(0) to the blend block 528 and the blend calculation block 530.
The memory 526 may comprise suitable logic, circuitry, and/or code that may be enabled to store at least a portion of consecutive video pictures. The memory 526 may be enabled to store a previous input picture In(−1), and a previous output picture Out(−1). The memory 526 may be enabled to receive a current input picture In(0) from the video converter 532, and a current output picture Out(0) from the blend block 528.
The blend calculation block 530 may comprise suitable logic, circuitry and/or code that may be enabled to receive the output signals generated from the MC filter block 522 and the non-MC filter block 524 and generate a blend control signal to the blend block 528. The blend calculation block 530 may be enabled to blend together the 4:2:2 outputs of the MC filter block 522 and the non-MC filter block 524. The blend calculation block 530 may be enabled to generate a blend control variable that may represent the confidence metric that a MV may represent the motion of the content at the current pixel In(0). The MV selected for the MC operation may be referred to as MV#0, or the MV with the lowest measured cost.
The blend calculation block 530 may be enabled to estimate the confidence metric of MV#0 by utilizing a combination of three metrics. For example, a first metric (cost_MV#1−cost_MV#0), which indicates how much better MV#0 is than the next lowest cost MV, or MV#1, a second metric (cost_zero_MV−cost_MV#0), which indicates how much better MV#0 is compared to the zero (0,0) vector, and a third metric may be the horizontal edge strength, edge_strength_adj. These three metrics may be combined as indicated by the following expressions:
confidence—mv=max((cost_zero—MV−cost—MV#0),(cost—MV#1−cost—MV#0) (11a)
confidence=max(0,confidence—mv−edge_strength—adj) (11b)
The value of MV#1 may not affect the result except when MV#0 is the zero MV, since for other cases of MV#0, (cost_zero_MV>=cost_MV#1) and (cost_zero_MV−cost_MV#0)>=(cost_MV#1−cost_MV#0). Therefore, the MV#1 may not be calculated except when MV#0=zero_MV. The motion estimation (ME) may be performed once for every non-overlapping pixel block of size 3×1, for example. For the 3 pixels in the pixel block (of 3×1), MV#0, MV#1 and the (0,0)-MV may be equal. The edge_strength_adj value may be computed once for every 3×1 pixel block, for example, and the same confidence value may be utilized for blending each of the 3 pixels in the pixel block. The confidence value may be processed through a non-linearity to generate a blending control variable blend_MC_NonMC. The non-linearity may be in the form of K4*(1−K5/(d*d)), where d=4*confidence/(size of window), for example, and K4 and K5 are parameters that may be set according to a degree of filtering corresponding to a noise level of an input video and/or the expectations of a subsequent encoder.
The blend block 528 may comprise suitable logic, circuitry and/or code that may be enabled to receive a plurality of input signals: a MC out signal (Out_mc(0)), a blend control signal from the blend calculation block 530, and a non-MC out signal (Out_nmc(0)) from the non-MC filter block 524. The blend block may be enabled to generate a current output picture Out(0) utilizing the received signals: a MC out signal (Out_mc(0)), a blend control signal from the blend calculation block 530, and a non-MC out signal (Out_nmc(0)).
The output of the blend block 528 may be generated by blending the outputs of the MC filter block 522 (Out_mc(0)), and the output of the non-MC filter block 524 (Out_nmc(0)) as follows:
Out(0)=(blendmc
The blending factor blendmc
The blending factor for the non-MC path and the MC path including non-linearities may be calculated based on the following expressions:
where Kmc
Combining equations (11b) and (13a), the normalized confidence value may be approximated based on the following expression:
m>>max(0,(a0*(confidence—mv−edge_strength—adj))>>10) (13d)
where
a0=((4*1024)/(me_window_size—w*me_window_size—h)) (13e)
In the above implementation, a0 may be a programmable unsigned integer value, for example, since the window size may be programmable. The constants in equations (13d) and (13e) may be set to facilitate fixed point operations; they are not a restriction for the invention.
The non-MC path block 524 may be enabled to receive a current input picture In(0) from the video converter 532. The non-MC path block 524 may be enabled to receive the previous input picture In(−1), and a previous non-MC output signal (Out_nmc(−1)) from the memory 526. The previous non-MC output signal (Out_nmc(−1)) may be recursively fed back to the non-MC filter block 522. The memory bandwidth utilized by this embodiment may be higher than the bandwidth utilized by the system of
The memory 526 may be enabled to store a previous input picture In(−1), a previous output picture Out(−1), and a previous non MC output signal (Out_nmc(−1)). The memory 526 may be enabled to receive a current input picture In(0) from the video converter 532, a current non-MC output signal (Out_nmc(0)) and a current output picture Out(0) from the blend block 528.
The non-MC path block 524 may be enabled to receive a current input picture In(0) and a previously generated output picture Out(−1) from the video converter 532. The non-MC filter block 524 may be enabled to utilize the received current input picture In(0), and the previously generated output picture Out(−1) to generate an output signal Out_nmc(0) to the blend block 528 and the blend calculation block 530.
The memory 526 may be enabled to receive a current output picture Out(0) from the blend block 528. The memory 526 may be enabled to store a previously generated output picture Out(−1) and feed back the previously generated output picture Out(−1) to the video converter 532.
In operation, the MC path of the MCTF may perform motion estimation (ME) to determine a motion vector (MV) to represent motion of image content at each current block. The value of a metric representing the confidence of the validity of the resulting motion vector may be determined. Motion compensation of each current block may be performed from the previous MCTF output image in the search range 676 of previous output picture Out(−1) 672 by utilizing a motion vector. The amount of residual signal and noise after motion compensation may be measured at each pixel. The MCTF may include performing IIR temporal filtering of the input image or current input picture In(0) 674 in conjunction with the motion compensated previous output image or previous output picture Out(−1). The IIR filter may be controlled by the measurement of the MC residual after processing by a non-linear transfer function.
The window filter cost 602 may comprise dimensions w×h comprising a width of w pixels and a height of h pixels where each pixel may be represented by (i, j). The filter cost window 602 may have a center pixel (i=x, j=y) and may be partitioned into three sections horizontally comprising the section of i less than x pixels 612, the section of i equals x pixels 614 and the section of i greater than x pixels 616. The filter cost window 602 may be partitioned into three sections vertically comprising the section of j less than y pixels 604, the section of j equals y pixels 606 and the section of j greater than y pixels 608.
The filter control block 682 may comprise suitable logic, circuitry and/or code that may be enabled to calculate the differences between two pictures of video over a measurement window. This measurement window may be referred to as a filter cost window of the non-MC IIR filter block 684.
In accordance with an embodiment of the invention, the filter cost window size may be configurable, for example, to be 7×5 or 5×3. The non-MC IIR filter block 684 cost window size may also be independently configurable. The MC filter 522 (
The filter control block 682 may be enabled to compare the current input picture In(0) at each pixel with the previous output picture Out(−1) without motion compensation. The filter cost metric, when configured to use chroma SSD (sum of signed differences), may be represented, for example, based on the following expression:
costnon
Similarly, the filter cost metric, when configured to use chroma SAD (sum of absolute differences), may be represented, for example, based on the following expression:
costnon
The weighting factors in equations (20a) and (20b) may favor luma more than each chroma as luma may carry more information.
The output of the non-MC IIR filter block 684 may be specified, for example, as indicated by the following expression:
Out—nmc(0)=(αIIR*In(0))+(256−αIIR)*(Out(−1))+128)/256 (21)
where αIIR is the output of the filter control block 682, and Out(−1) is the previous output picture. In another embodiment of the invention, the even indexed samples may be filtered although both In(0) and Out(−1) have been converted to the 4:4:4 format for the chroma sample filtering operations, which may lower the cost of the filter without any significant impact on visual quality.
The non-MC IIR filter block 684 may comprise suitable logic, circuitry and/or code that may be enabled to generate an output Out_nmc(0) to the blend block 528 (
αIIR=K2(1−(K3/d2)) (22)
where d=4*filter_cost/(size of window), for example, and K2 and K3 are parameters that may be set according to the desired degree of filtering corresponding to the noise level of the input video and/or the expectations of a subsequent encoder.
The non-MC path blending factor including non-linearities may be calculated, for example, based on the following expressions:
where LOW_THD may be a lower threshold value, Knon
where x, y, and z may be suitable rational values.
Adjustments may be made for alpha for static pixels relative to the low threshold. This may be useful for noise reduction in a static noisy background of relatively noisy video. For example, in instances when the x component and y component of MV#0 are equal to zero, the adjustment may be determined utilizing the following exemplary pseudocode.
In this regard, nmc_alpha_low_adj and mc_alpha_low_adj may be programmable values, for example, in the range of [0, 127], inclusive. The values mnc_alpha_adj and mc_alpha_adj may be utilized to lower the blending factor lower bounds for both the MC and nonMC paths, as shown below.
The adjusted non-MC path blending factor may be determined based on the following expressions:
where LOW_THD_nonMC may be a programmable value in the range of [0, 256], and the function clip3( ) may be specified, for example, based on the following pseudocode:
In the equations (23a), (23b) and (23c), the cost and confidence values may be normalized by scaling. The division by the window size may be costly. Combining equations (20a) and (20b) with equation (23a), the normalized cost for the non-MC path may be approximated, for example, utilizing the following expressions:
m=(c0*luma_SAD+c1*abs(Cb_SSD)+c1*abs(Cr_SSD))>>14 (23m)
or
m=(c0*luma_SAD+c1*abs(Cb_SAD)+c1*abs(Cr_SAD))>>14 (23n)
where
c0=((4*8192)/(cost_window_size—w*cost_window_size—h) (23o)
c1=((4*4096)/(cost_window_size—w*cost_window_size—h) (23p)
In the above implementation, c0 and c1 may be programmable unsigned integer values, for example, since the window size is programmable and may be constrained, for example, based on the following expressions:
c0+2*c1=(int)(65536/(cost_window_size—w*cost_window_size—h)) (23q)
In various embodiments of the invention, an adjustment, for example, a shift, may be made to the filter cost calculation for the non-MC path. The adjustment may decrease unwanted artifacts, for example, motion trails. In this regard, the luma signed difference at a co-located position (i, j) may be defined as the luma difference, which may be specified using the following expression:
SDiffn(i,j)=In—Y(i,j,0)−Out—Y(i,j,−1) (23r)
A sum of signed differences for four regions of the w×h filter cost window 602 described with respect to
When the filter cost window size has dimensions w×h=7×5, the value nmc_shift for luma may be defined utilizing the following expression:
nmc_shift=max(|Ssum_right|,|Ssum_left|,|Ssum_lower|,|Ssum_upper|)/16. (23w)
When the filter cost window size has dimensions w×h that are smaller than 7×5 the value nmc_shift for luma may be defined utilizing the following expression:
nmc_shift=max(|Ssum_right|,|Ssum_left|,|Ssum_lower|,|Ssum_upper|)/8 (23x)
where nmc_shift may be clipped within [0, 255] for an 8 bit system or [0, 1023] for a 10-bit system.
In various embodiments of the invention, nmc_shift for chroma components may be similarly defined utilizing corresponding chroma components.
The non-MC path cost may be adjusted utilizing nmc_shift. For example, in instances when the x component or the y component of the motion vector mv#0 does not equal zero, the non-MC cost may be approximated utilizing the following expressions.
m≈[(c0*luma_SAD+c1*abs(Cb_SSD)+c1*abs(Cr_SSD))>>14]+nmc—adj*nmc_shift (23y)
or
m≈[(c0*luma_SAD+c1*abs(Cb_SAD)+c1*abs(Cr_SAD))>>14]+nmc—adj*nmc_shift (23z)
where nmc_adj may be a programmable weight in the range of [0, 31], inclusive. In (23y) or (23z), m may be clipped to [0, 1024] for 8 bit systems or to [0, 4096] for 10-bit systems. In this manner, noise may be reduced without causing motion artifacts such as motion trails.
The motion estimation (ME) block 620 may comprise suitable logic, circuitry and/or code that may be enabled to search a previous MCTF output picture Out(−1) after being converted to the 4:4:4 format in the search range 676 (
Each candidate motion vector may be evaluated using a cost metric measured over a window 678 of pixels. The size of the window 678 (
cost=(2*(luma_SAD)+abs(Cb_SSD)+abs(Cr_SSD)+2)/4 (24a)
where Cb_SSD and Cr_SSD are the SSD values of Cb and Cr components respectively. Notwithstanding, the cost metric may also be denoted as follows:
cost=(2*luma_SAD+Cb_SAD+Cr_SAD+2)/4,
where Cb_SAD and Cr_SAD are the SAD values of Cb and Cr components respectively. Alternative cost metrics and combinations of cost metrics may be used. For example SAD may be used for both luma and chroma.
The weighting factors in equation (24a) may favor luma more than each chroma as luma may carry more information. The motion estimation block 620 may be enabled to search a reference picture, for example, Out(−1) within a search range 676. The first stage of this search may include integer MV positions. The lowest and second lowest cost MV's may be determined, and they may be labeled as MV#0 and MV#1, respectively.
The cost metric for neighboring half-pixel position MV's with a +/−½ pixel MV offset in with respect to the vector MV#0 may be determined. In accordance with an embodiment of the invention, the cost metric for eight neighboring half-pixel position MV's with a +/−½ pixel MV offset in both horizontal and vertical axes of the MV#0 may be determined. Alternatively, vectors along only the horizontal axis may be evaluated. The lowest cost MV and the second lowest cost MV may be updated during the half-pel search. The updated lowest cost MV and second lowest cost MV may be labeled as MV#0 and MV#1 respectively. The MV#1 may be utilized in a subsequent step for determining a confidence metric of the choice of MV#0. The half-pixel positions may be created using two-tap interpolation filters, for example.
The motion compensation (MC) block 622 may comprise suitable logic, circuitry and/or code that may be enabled to generate motion-compensated pixels from a reference image or previous output picture Out(−1), in the 4:4:4 format, by utilizing the lowest cost half-pel MV or MV#0 generated by the MC block 622.
The filter control block 624 may comprise suitable logic, circuitry and/or code that may be enabled to compare the current input image or picture In(0) at each pixel with the motion compensated result from the ME function using MV#0 on the previous output picture Out(−1), for example. These comparisons may be performed in the 4:4:4 format or alternatively in another format, for example the 4:2:2 format. The comparison may be performed using another measurement window that is similar in principle to the ME window 678. However, the filter control window may have a different size such as 5×3 or 7×5, for example. A cost metric may be generated over the measurement window, using a cost metric that may be similar to the ME cost metric. The filter cost metric may be represented utilizing the following expression:
costmc
The filter cost may be calculated for each pixel in the input picture In(0), and the ME cost may be calculated using windows of the input picture that may step by 3 pixels every MV, for example. The window for the filter cost, costmc
In accordance with an embodiment of the invention, the filter cost window size may be configurable, for example, to be 7×5 or 5×3. The non-MC path filter cost window may also be configurable. The MC path and non-MC path filter cost window sizes may be independently configurable.
The filter cost value (filter_cost) may be mapped to filter coefficients of the IIR temporal filter in the MC path by utilizing the non-linear transfer function specified by the following expression:
αmc
where d=16*cost_mc_iir/(size of window). The value 16 may be changed and other values may be utilized accordingly, to facilitate fixed point operations. K0 and K1 are parameters that may be set according to the desired degree of filtering corresponding to the noise level of the input video and/or the expectations of a subsequent encoder.
The MC path blending factor including non-linearities may be calculated utilizing the following expressions:
where Kmc
Adjustments may be made for alpha for static pixels relative to the low threshold. This may be useful for noise reduction in a static noisy background of relatively noisy video. For example, in instances when the x component and y component of MV#0 are equal to zero, the following adjustment may be determined utilizing the following pseudocode:
In this regard, mc_alpha_low_adj may be a programmable value, for example, in the range of [0, 127], inclusive. The values mnc_alpha_adj and mc_alpha_adj may be utilized to lower the blending factor lower bounds for both the MC and non-MC paths as shown below:
The adjusted MC path blending factor may be determined utilizing the following expressions:
where LOW_THD_MC may be a programmable value in the range of [0, 256]. A factor for blending the non-MC and MC paths may be determined utilizing the following expressions:
Combining equation (24b) with equation (26a), the normalized cost for the MC path may be approximated utilizing the following expression:
m>>(b0*luma_SAD+b1*abs(Cb_SSD)+b1*abs(Cr_SSD))>>14 (26j)
where
b0=((16*8192)/(cost_window_size—w*cost_window_size—h) (26k)
b1=((16*4096)/(cost_window_size—w*cost_window_size—h) (26l)
In the above implementation, b0 and b1 may be programmable unsigned integer values, for example, since the window size may be programmable and may be constrained as indicated by the following expression:
b0+2*b1=(int)(262144/(cost_window_size—w*cost_window_size—h)) (26m)
In various embodiments of the invention, an adjustment, for example, a shift, may be made to the filter cost calculation for the motion compensation (MC) path. The adjustment may decrease unwanted artifacts, for example, motion trails. In this regard, the luma signed difference at a co-located position (i, j) may be defined as the luma difference between the current picture and the MC recursive feedback picture, utilizing the following expression:
SDiffn(i,j)=In—Y(i,j,0)−Out—Y(i+mvx,j+mvy,−1) (26n)
In instances when motion estimation (ME) may be done only in the horizontal direction, the y component of the motion vector mvy may be equal to zero.
A sum of signed differences for four regions of the w×h filter cost window 602 described with respect to
When the filter cost window size has dimensions w×h=7×5, the value mc_shift for luma may be defined utilizing the following expression.
mc_shift=max(|Ssum_right|,|Ssum_left|,|Ssum_lower|,|Ssum_upper|)/16 (26s)
When the filter cost window size has dimensions w×h that are smaller than 7×5 the value mc_shift for luma may be defined as specified by the following expression:
mc_shift=max(|Ssum_right|,|Ssum_left|,|Ssum_lower|,|Ssum_upper|)/8 (26t)
where nmc_shift may be clipped within [0, 255] for an 8 bit system or [0, 1023] for a 10-bit system.
In various embodiments of the invention, mc_shift for chroma components may be similarly defined utilizing corresponding chroma components.
A value determined based on mc_shift may be utilized to adjust the MC path filter cost as follows. For example, in instances when the x component or the y component of the motion vector mv#0 does not equal zero, the MC filter cost may be approximated as specified utilizing the following expression:
m≈[(b0*luma_SAD+b1*abs(Cb_SSD)+b1*abs(Cr_SSD))>>14]+mc—adj*mc_shift (26u)
where mc_adj may be a programmable weight in the range of [0, 31], inclusive. In (23o) or (230), the approximation m may be clipped to [0, 1024] for 8 bit systems or to [0, 4096] for 10-bit systems.
The temporal filter 626 may comprise suitable logic, circuitry and/or code that may be enabled to generate a motion compensated output out_mc(0) to the blend block 528. The temporal filter in the MC path may be an IIR filter, for example. The feedback term or the output picture Out(−1) may be the previously generated output from the entire MCTF. The MC temporal filter 626 output may be specified utilizing the following expression:
Out—mc(0)=((αMC
where αMC
In another embodiment of the invention, the even indexed samples may be filtered although both In(0) and Out(−1) have been converted to the 4:4:4 format for the chroma sample filtering operations. This may lower the cost without any significant impact on visual quality. The temporal filter 626 may be enabled to generate the resulting output in the 4:2:2 format, for example.
In another embodiment of the invention, the horizontal and vertical edge gradients may be calculated and utilized to adjust the blending control to combine the MC path output Out_mc(0) and non-MC path output Out_nmc(0). The vertical gradient may be utilized to decrease the confidence level, and the horizontal gradient may offset the effect introduced by the vertical gradient, in order to reduce the flickering effect of near horizontal edges in the combined result. The edge strength calculations may be performed on the difference of the luma components of the current unfiltered picture In(0) and the previous output reference or filtered picture Out(−1).
The vertical gradient or the horizontal edge strength may be calculated by applying a plurality of filter templates to a neighborhood of the luma component that may correspond to the difference between the current input picture In(0) and the previous output picture Out(−1). The filter templates may be centered at the center pixel of a 3×1 pixel block, for example. The gradient may be calculated once for each 3×1 pixel block, for example. The plurality of filter templates may be represented, for example, as follows:
The horizontal edge strength may be calculated, for example, as indicated utilizing the following expression:
h_edge—diff=max(abs(temp1),abs(temp2))/2 (29)
where temp1 and temp2 are the output values generated by applying the plurality of filter templates to a neighborhood of the luma component of the difference between the current input picture In(0) and the previous output picture Out(−1).
The horizontal gradient or the vertical edge strength may be calculated by applying the following two templates to the neighborhood of the luma difference between the current input picture In(0) and the previous output picture Out(−1), centered at the center pixel of the 3×1 pixel block, for example. The plurality of filter templates may be represented, for example, as follows:
The vertical edge strength may be calculated for example, as indicated utilizing the following expression:
v_edge—diff=max(abs(temp3),abs(temp4))/2 (31)
where temp3 and temp4 are the output values generated by applying the plurality of filter templates to a neighborhood of the luma component of the difference between the current input picture In(0) and the previous output picture Out(−1). The final value of the edge strength that may be utilized to adjust the confidence level may be calculated, for example, utilizing the following expression:
edge_strength—adj=max(0,h_edge—diff−v_edge—diff) (32)
In order to improve noise reduction effectiveness it may be necessary to achieve both significant noise reduction in areas of little or no motion and be free of motion artifacts such as motion trails, motion blur, jittering or wobbling in areas where there is motion. In this manner, noise may be reduced without causing motion artifacts such as motion trails.
The motion estimation (ME) block 620 may comprise suitable logic, circuitry and/or code that may be enabled to search a previous MCTF output picture Out(−1) in the search range 676 (
Each candidate motion vector may be evaluated using a cost metric measured over a window 678 of pixels. The size of the window 678 (
cost=(2*luma_SAD+abs(Cb_SSD)+abs(Cr_SSD)+2)/4 (24a)
where Cb_SSD and Cr_SSD are the SSD values of Cb and Cr components respectively. Alternatively, the cost metric may be defined as follows: cost=(2*luma_SAD+Cb_SAD+Cr_SAD+2)/4, where Cb_SAD and Cr_SAD are the SAD values of Cb and Cr components respectively.
The weighting factors in equation (24a) may favor luma more than each chroma as luma may carry more information. The motion estimation block 620 may be enabled to search a reference picture, for example, Out(−1) within a search range 676. The first stage of this search may include integer MV positions. The lowest and second lowest cost MV's may be determined, and they may be labeled as MV#0 and MV#1, respectively.
The edge strength confidence and blending calculation block 625 may comprise suitable logic, circuitry and/or code that may be enabled to receive the cost metrics for the MV#0, MV#1 and the (0,0) motion vector, and the luma components of the current input picture In(0) and the previous output picture Out(−1). The edge strength adjustment value may be calculated substantially as in equations (28), (29), (30), (31) and (32). A confidence value may be calculated substantially as in equations (11a) and (11b).
The motion compensation (MC) block 622 may comprise suitable logic, circuitry and/or code that may be enabled to generate motion-compensated pixels from a reference image or previous output picture Out(−1), by utilizing the lowest cost half-pel MV or MV#0 generated by the MC block 622.
The filter control block 624 may be substantially as described in
Out—mc(0)=((αMC
where αMC
In accordance with an embodiment of the invention, a method and system for motion compensated temporal filtering using infinite impulse response (IIR) filtering may comprise at least one circuit, for example, a non-MC filter block 524 that enables generation of a corresponding non-motion compensated output picture of video data (Out_nmc(0)) from at least one infinite impulse response (IIR) filtered output picture of video data (IIR_out(0)). The MC filter module may be enabled to generate a motion compensated output picture of video data (Out_mc(0)) by blending at least one motion compensated picture of video data mc(0) and at least one current input picture of video data (in(0)). In this regard, mc(0) may be motion compensated from out(−1).
The blend block 528 may enable blending of the motion compensated output picture of video data (Out_mc(0)) and the generated corresponding non-motion compensated output picture of video data (Out_nmc(0)). At least one output picture of video data (Out(0)) may be generated by utilizing the generated motion compensated output picture of video data (Out_mc(0)) and the generated corresponding non-motion compensated output picture of video data (Out_nmc(0)).
The MC filter block 522 may enable utilization of the previously generated portion of the output picture of video data (Out(−1)) to determine at least one motion vector (MV) based on a cost metric to represent a motion of the video data of at least one input picture of the video data (In(0)). The motion compensated output picture of video data (Out_mc(0)) may be generated by utilizing at least one determined motion vector to represent the motion of video data. The MC filter block may also determine a confidence metric of the determined motion vector.
The blend calculation block 530 (
confidence—mv=max((cost_zero—MV−cost—MV#0),(cost—MV#1−cost—MV#0) (11a)
confidence=max(0,confidence—mv−edge_strength—adj) (11b)
The motion compensated output picture of video data (Out_mc(0)) may be generated by temporal filtering of video data with the determined motion vector (MV). The temporal filter in the MC path may be an IIR filter, for example. The feedback term or the output picture Out(−1) may be the previously generated output from the entire MCTF. The MC temporal filter 626 output may be specified, for example, utilizing the following expression:
Out—mc(0)=((αMC
The cost metric may comprise a combination of a sum of absolute differences (SAD) for luma components of video data and a sum of signed differences (SSD) for chroma components of video data. A cost metric may be generated over the measurement window, using a cost metric that may be similar to the ME cost metric. The filter cost metric may be represented, for example, utilizing the following expression:
costmc
The filter cost may be calculated for each pixel in the input picture In(0), and the ME cost may be calculated using windows of the input picture that may step by 3 pixels every MV, for example.
The motion vector (MV) may be determined based on a lowest cost of a plurality of candidate MVs that may be applied to at least one input picture of video data (In(0)). The motion estimation (ME) block 620 (
The previously generated portion of the output picture of video data (Out(−1)) may be fed back to the MC filter block 522 to determine the motion compensated output picture of video data (Out_mc(0)). The previously generated portion of the output picture of video data (Out(−1)) may be fed back to the non-MC filter block 524 to determine the generated corresponding non-motion compensated output picture of video data (Out_nmc(0)).
A blending factor, αblend may be determined for blending the motion compensated output picture of video data (Out_mc(0)) and the generated corresponding non-motion compensated output picture of video data (Out_nmc(0)). A horizontal edge gradient and/or a vertical edge gradient may be calculated to adjust the determined blending factor αblend. The vertical gradient may be utilized to decrease the confidence level, and the horizontal gradient may offset the effect introduced by the vertical gradient, in order to reduce the flickering effect of near horizontal edges in the combined result. At least one IIR filtered output picture of video data (IIR_out(0) may be generated based on an IIR blending factor αiir. The IIR blending factor αiir may be dynamically modified based on a motion metric parameter.
In an embodiment of the invention, one or more current output pictures of video data Out(0) from the blend block 528, for example, may be generated utilizing a generated motion compensated blended picture of video data, MCout, from the motion compensated filter 522 and a corresponding generated non-motion compensated blended picture of the video data, non-MCout, from the non-motion compensated filter 524, for example. In this regard, a motion compensated blended picture of video data, MCout, may be generated utilizing one or more motion compensated partitioned windows of picture data, for example, an exemplary partitioned window of picture data is illustrated in
With regard to output from the MC filter 522, MCout, one or more of windows of picture data from a first picture of video data, for example, In(0) and one or more motion compensated windows of picture data from a motion compensated previously generated output picture of video data, for example, Out(−1) may be partitioned. The partitioned widows of picture data from the first picture of video data In(0) may be compared with corresponding ones of the partitioned motion compensated windows of picture data from the motion compensated previously generated output picture of video data Out(−1) in the MC filter 522. The first picture of video data In(0) may be blended with the motion compensated previously generated output picture of the video data Out(−1) based on the comparison. Out(−1) is a final output from the blend block 528 that may receive data from the output of the MC filter 522. In this regard, Out(−1) may be referred to as the motion compensated (MC) previously generated output picture of video data. Furthermore, a blending factor that may be utilized by the MC filter 522, may be shifted based on the comparison. The shifted blending factor may enable, for example, a reduction in motion trail artifacts in the non-motion compensated blended first picture of video data with the motion compensated previously generated output picture of video data.
With regard to the output of the non-MC filter 524, non-MCout, one or more non-motion compensated windows of picture data from a motion compensated previously generated output picture of video data, Out(−1) may be partitioned, as shown, for example, in
A blending factor, for example, output from the blend calculation block 530 may be determined for blending by the blend block 528, the blended first picture of video data In(0) and a corresponding motion compensated previously generated output picture of the video data, MCout, with the blended first picture of the video data In(0) and the non-motion compensated previously generated output picture of the video data, non-MCout, for example. The determined blending factor may be utilized when blending the blended pictures of video data comprising MCout and non-MCout shown in
Another embodiment of the invention may provide a machine and/or computer readable storage and/or medium, having stored thereon, a machine code and/or a computer program having at least one code section executable by a machine and/or a computer, thereby causing the machine and/or computer to perform the steps as described herein for mitigating motion trail artifacts and improving low contrast contours in temporal filtering based noise reduction.
Accordingly, the present invention may be realized in hardware, software, or a combination of hardware and software. The present invention may be realized in a centralized fashion in at least one computer system, or in a distributed fashion where different elements are spread across several interconnected computer systems. Any kind of computer system or other apparatus adapted for carrying out the methods described herein is suited. A typical combination of hardware and software may be a general-purpose computer system with a computer program that, when being loaded and executed, controls the computer system such that it carries out the methods described herein.
The present invention may also be embedded in a computer program product, which comprises all the features enabling the implementation of the methods described herein, and which when loaded in a computer system is able to carry out these methods. Computer program in the present context means any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after either or both of the following: a) conversion to another language, code or notation; b) reproduction in a different material form.
While the present invention has been described with reference to certain embodiments, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted without departing from the scope of the present invention. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the present invention without departing from its scope. Therefore, it is intended that the present invention not be limited to the particular embodiment disclosed, but that the present invention will include all embodiments falling within the scope of the appended claims.
This application makes reference to, claims priority to, and claims the benefit of U.S. Provisional Application Ser. No. 61/228,417, filed on Jul. 24, 2009. This application also makes reference to U.S. application Ser. No. 11/619,444, which was filed on Jan. 3, 2007. Each of the above referenced applications is hereby incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
20070069980 | MacInnis | Mar 2007 | A1 |
20070139560 | Zhong | Jun 2007 | A1 |
20070139568 | Zhong et al. | Jun 2007 | A1 |
20080062327 | MacInnis et al. | Mar 2008 | A1 |
20080063064 | MacInnis et al. | Mar 2008 | A1 |
20080101469 | Ishtiaq et al. | May 2008 | A1 |
20090016451 | Chen | Jan 2009 | A1 |
Number | Date | Country | |
---|---|---|---|
20110019093 A1 | Jan 2011 | US |
Number | Date | Country | |
---|---|---|---|
61228417 | Jul 2009 | US |