1. Field of the Invention
The present invention is generally related to a technique for enhancing the quality of an image. More particularly, the present invention relates to a region-of-interest (ROI) video-coding algorithm based on fuzzy control method for a video encoder, for example, a H.263+ type video encoder.
2. Description of the Related Art
The demand for applications of the digital video communication, such as videoconferencing and videophone, has increased considerably. However, the transmission rates over network are restricted, hence very low bit-rate video coding for such applications is an important technology to reduce the data rate of picture sequence without losing much of its subjective quality. Most implementations of these standards give equal importance to each block. While different blocks within the same picture may be coded with different modes, no one block is more important than the other is. This model is not appropriate for any region-of-interest (ROI) application on video sequence. In H.263+ standard, the distortion weight parameter and the signal variance at macro-block (MB) layer are adjusted to control the qualities at different regions. The blocks correspond to some focus areas are more important than the blocks in the background or unwanted areas. Allocating more bandwidth towards the quality of areas that user focuses on, while sacrificing background or unwanted areas quality is a better coding strategy for video sequences like video conferencing. Except the ROI has more high quality, it may discard some background information to improve the encoding speed. Like maximum bit transfer (MBT), the background is always encoded with the coarsest quantization level as in. A region-based blurring algorithm to reduce bit-rate in very low bit-rate video coding is adopted. Another method improves quality at ROI significantly by three fixed factors to each ROI MBs and non-ROI MBs in order to enhance the quality of ROI regions, and reduce the bits for coding the background. The present invention can improve ROI quality adaptively according to fuzzy logic rate control and it is suitable for real time videoconferencing.
Fuzzy logic was first proposed by L. A. Zadeh working at Berkeley in 1965 and it is modeled after the natural way people arrive at solutions in three points. The first point: applying different solution methodologies to the same problem. The second point: applying more than one of our rules to the same problem at the same time. The third point: accepting a certain amount of imprecision, which is very important at helping us arrive at workable solutions. Obviously, normal rate control algorithms in different standard test models, such as TMN5, TMN8, and etc., are conformed to these three points. In each test models, there are particular mathematical solutions to determine the quantization parameters for each MB and a few inaccuracies are acceptable to estimate the bit rate for the next MB. It seems that a fuzzy logic control could play a suitable role in solving the rate control in video coding.
a shows a block diagram of a conventional feedback control system 100. This controller makes its decisions about what to do based on either a mathematical model of the process or a fixed set of mathematical relationship.
b shows a block diagram of a fuzzy logic control system 150. The fuzzy logic controller 150 uses as its guide a set of response rules established by the knowledgeable operators or system engineers. Referring to
To summarize, as the information highway has already begun, and with a limited transmission rate, a method for enhancing an image is needed. Currently, a region-of-interest (ROI) method that can improve an image's quality is already existed. However, the present solutions for the ROI methods still have barriers in the performance. Therefore and for the foregoing reasons, there is a desperate need for a method or algorithm that is able to obtain a high quality video image.
The present invention is directed to a method and apparatus that satisfies the need to enhance the quality of an image in applications such as videophone and videoconferencing. To achieve these and other advantages and in accordance with the purpose of the invention, as embodied and broadly described herein, a new method and apparatus based on region-of-interest (ROI) and fuzzy logic control are provided.
First, the method separates a plurality of region-of-interest regions from a plurality of non-region-of-interest regions of an image. Then, an input from the region-of-interest regions is sent to a fuzzy logic controller, wherein the fuzzy logic controller is used for enhancing the quality of the region-of-interest regions and the overall quality of an output image.
In one preferred embodiment of the present invention, the input from the region-of-interest regions is calculated from a first control input and a second control input from the region-of-interest regions. Wherein, the first control input and the second control input comprise a first variance from a present (i)th macro-block and a variance difference, respectively. The variance difference is calculated by subtracting a second variance of a previous (i−1)th macro-block from the first variance and then dividing by the first variance. The (i)th macro-block and the (i−1)th macro-block represent a sequence of macro-block within one of the region-of-interest regions and the (i−1)th macro-block is a previous macro-block of the (i)th macro-block.
In another preferred embodiment of the present invention, the fuzzy logic control includes a methodology to convert the control inputs to fuzzy predicates
In another preferred embodiment of the present invention, the fuzzy logic control includes a controlling function to calculate a linguistic membership function for determining a fuzzy situation of the main control input. The controlling function uses center of area (COA) method to determine the linguistic membership function.
In another embodiment of the present invention, the fuzzy logic control includes a plurality of lookup tables for making a decisional level and producing a weighted factor to emphasize the qualities of one of the region-of-interest regions.
In yet another embodiment of the present invention, the lookup tables comprise a plurality of scaled lookup tables for providing a priority-like quality for one of the region-of-interest regions. Wherein, the scaled lookup tables are formed by using a one-fixed and one-various membership function.
To summarize, a fuzzy controlled ROI video coding is provided. The fuzzy controlled ROI video coding has the capability of adjusting the output quality of an image adaptively. The approach can enhance the quality of ROI easily, maintain the constant bit-rate to avoid buffer overflow, and achieve good quality easily with fewer bit-rates than previous works. The multiple ROI video coding can also enhance each ROI's output quality significantly without complex computation.
It is to be understood that both the foregoing general description and the following detailed description are exemplary, and are intended to provide further explanation of the invention as claimed.
The accompanying drawings are included to provide a further understanding of the invention, and are incorporated in and constitute a part of this specification. The drawings illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention.
a illustrates a conventional feedback control algorithm.
b illustrates a conventional fuzzy logic control algorithm.
The present invention now will be described more fully hereinafter with reference to the accompanying drawings, in which preferred embodiments of the invention are shown. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art. Like numbers refer to like elements throughout.
To begin with, a region-of-interest video coding by fuzzy control, consisted of two main components: (1) a region-of-interest, and (2) a fuzzy control. Referring to
Also referring to
Referring to
In one embodiment of the present invention, a set of different output fuzzy tables is scaled by the original output fuzzy in order to have different priorities to different ROI regions 330.
As an experimentation for one embodiment of the present invention shows the embodiment of the present invention has a better performance than other existing methodologies. In the experimental results, three sequences: Carphone; Claire; and Foreman are tested. In order to define the ROI regions in a frame, a face detection is used to select ROI automatically. Four different methods in the test sequences are compared. The four different methods are: coding a frame without ROI (WR), coding the ROI regions by multiplying a weighted factor (WA) α, coding the ROI regions by three factors (TF), and the presnet invention (Fuzzy). The four different methods are all set to the similar average bit-rate. In an implementation, QP is set to 5 and 3 for I-frame and P-frame at target bit-rate 64 kbits/sec, and 15 and 13 for I-frame and P-frame at target bit-rate 32 kbits/sec, respectively. In WA, the weighted factor is set to be 450. In TF, the three factors are set to be 450, 2, and 10, respectively. In order to compare the other two methods in similar weights, ZE13 is set to be 450 and LP1˜LN25 are set to be in 350˜550.
As illustrated from
The present invention is suitable in any image processing. It is particular useful for real-time video coding. Accordingly, the present invention can enhance the quality of ROI easily and maintain the constant bit-rate to avoid buffer overflow. It can achieve good quality easily with fewer bit-rates than previous works. The multiple ROI video coding can also enhance each ROI's quality significantly without complexity computation.
It will be apparent to those skilled in the art that various modifications and variations can be made to the structure of the present invention without departing from the scope or spirit of the invention. In view of the foregoing, it is intended that the present invention cover modifications and variations of this invention provided they fall within the scope of the following claims and their equivalents.