1. Field of the Invention
The present invention relates to techniques for intelligent video surveillance, more particularly to a method and a device for detecting stationary targets.
2. Description of Related Art
A conventional video surveillance system often relies on human operators to determine whether suspicious incidents happen in surveillance video. The disadvantage of the conventional video surveillance is that the operator cannot maintain a high level of vigilance all the time so that the surveillance accuracy cannot always be ensured. An intelligent video surveillance system is provided to determine whether suspicious incidents happen in surveillance video by intelligently analyzing and processing the surveillance video data.
Stationary targets are a type of important monitoring targets in the video surveillance. Accidents, e.g. illegal parking on sidewalk, illegal moving of valuables, or retention targets appearing on a defense region etc., can be captured via the stationary target detection.
Generally, the conventional method for detecting stationary targets is based on a method for detecting and tracking moving targets. The conventional method comprises: establishing a background image in which all objects being long-term stationary in the surveillance video; detecting and tracking any new objects in subsequent videos; determining these new objects as the stationary targets if the sizes and the positions of the moving targets do not change for a period of time.
However, the conventional method for detecting stationary targets may make an erroneous conclusion if the stationary targets are shadowed. Additionally, the conventional method for detecting stationary targets is sensitive to noise interference.
Thus, improved techniques for method and device for detecting stationary targets are desired to overcome the above disadvantages.
This section is for the purpose of summarizing some aspects of the present invention and to briefly introduce some preferred embodiments. Simplifications or omissions in this section as well as in the abstract or the title of this description may be made to avoid obscuring the purpose of this section, the abstract and the title. Such simplifications or omissions are not intended to limit the scope of the present invention.
In general, the present invention is related to techniques for detecting stationary targets in videos or frame images. According to one aspect of the present invention, a sequence of frame images is being received from a video system. Each of the frame images into a plurality of image blocks, and dividing a background image is divided into a plurality of corresponding background image blocks. Characteristic values of the image blocks in each of the frame images are calculated. A plurality of characteristic value sequences is then formed, each of the characteristic value sequences comprises a predefined number of characteristic values for each of the image blocks in the frame images. A histogram of each of the characteristic value sequences is computed to determine whether one of the image blocks in one of the frame images contains a stationary target.
One of the features, benefits and advantages in the present invention is to provide techniques for detecting motionless or stationary targets in a surveillance system.
Other objects, features, and advantages of the present invention will become apparent upon examining the following detailed description of an embodiment thereof, taken in conjunction with the attached drawings.
These and other features, aspects, and advantages of the present invention will become better understood with regard to the following description, appended claims, and accompanying drawings where:
The detailed description of the present invention is presented largely in terms of procedures, steps, logic blocks, processing, or other symbolic representations that directly or indirectly resemble the operations of devices or systems contemplated in the present invention. These descriptions and representations are typically used by those skilled in the art to most effectively convey the substance of their work to others skilled in the art.
Reference herein to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the invention. The appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Further, the order of blocks in process flowcharts or diagrams or the use of sequence numbers representing one or more embodiments of the invention do not inherently indicate any particular order nor imply any limitations in the invention.
Embodiments of the present invention are discussed herein with reference to
Improved techniques for detecting stationary targets according to one embodiment of the present invention are provided to divide each frame image into a plurality of image blocks, compute a characteristic value of each of the blocks in each frame image, and detect whether each image block contains a stationary target by analyzing the characteristic values of corresponding image blocks in different frame images. Thereby, it has a higher reliability and is insensitive to noise interference. Furthermore, the stationary targets can be detected even if it is shadowed.
At 101, a monitoring area of a video surveillance system is divided into a plurality of monitoring blocks. Accordingly, each frame image captured by the video surveillance system is divided into a plurality of image blocks. A background image is also divided into a plurality of corresponding (background) image blocks. The image blocks in different frame images correspond to each other and are assigned same block numbers.
At 102, a characteristic value of each image block of each frame image is computed. At 103, the characteristic values of the image blocks with the same block number in frame images are stored to form a characteristic value sequence, thereby a plurality of characteristic value sequences are formed. Each characteristic value sequence corresponds to one monitoring block and comprises a predefined number of characteristic values. When a new characteristic value is stored into the characteristic value sequence, the new characteristic value will push out a characteristic value firstly stored, namely first-in-first-out order.
At 104, the background image is initialized by assigning an initial characteristic value to each of the corresponding background image blocks according to a corresponding characteristic value sequence if the background image isn't initialized; otherwise, it is detected whether an image block of a current frame image contains a stationary target by comparing the characteristic value sequences with the characteristic values of corresponding background image blocks. If the image block of the current frame image contains the stationary target, the image block is outputted; otherwise, the background image block is updated according to the corresponding characteristic value sequence.
At 206, a characteristic value of a current image block in the current frame image is computed. After the current image block is processed, a next image block of the video sequence may be used as the current image block to continue the method 200. The characteristic value may be an average (gray) value of all pixels in the current image block in one embodiment. One advantage of using the average gray value as the characteristic value is simple and fast, and the disadvantage is that it is sensitive to changes in lighting. In order to avoid the effect of lighting, some complicated characteristic values, e.g. an angle characteristic value or a gray characteristic value being insensitive to changes in lighting, can be used.
Referring to the formula (1), GM is the average gray value of the image block, N is the pixel length and width of the image block, I(i,j) is a gray value of a pixel (i,j) in the image block. Referring to the formula (2), GA is the angle characteristic value.
The characteristic values of the image blocks with the same block number in different frame images are stored to form a characteristic value sequence, thereby a plurality of characteristic value sequences are formed. Each characteristic value sequence corresponds to one monitoring block and comprises a predefined number of characteristic values. When a new characteristic value is stored into the characteristic value sequence, the characteristic value firstly stored (in a buffer) is discarded for the newly calculated characteristic value sequence.
After 206, the characteristic value of the current image block is stored in the corresponding characteristic value sequence. The process 200 is then taken to 208, where a histogram of the current image block is obtained by counting the corresponding characteristic value sequence. At 210, a peak distribution of the histogram of the current image block is analyzed. At 212, it is determined whether there is only one peak and an accumulated distribution probability of areas adjacent to the peak is larger than a first predefined threshold. If yes, that indicates that the current image block may be stationary area, the method 200 is taken to 214; otherwise, that indicates that the current image block may be an area with motion targets, the method 200 returns to 206.
At 214, it is determined whether a background image block corresponding to the current image block is initialized. If yes, the method 200 proceeds at 216; otherwise, the method 200 enters 220, where the background image block corresponding to the current image block is initialized by assigning the peak characteristic value of the histogram of the current image block thereto. After 220, the method 200 is taken to 224. At 224, it is determined whether all image blocks of the current frame image are processed. If yes, the method 200 is taken to 226; otherwise, the method 200 is taken to 206. It is noted that all background image blocks forming one whole background image are initialized by repeatedly performing the operations 206, 208, 210, 212, 214, 220 and 224.
At 216, it is determined whether a difference between the peak characteristic value of the histogram of the current image block and the characteristic value of corresponding background image block is larger than a second predefined threshold. If yes, the method 200 is taken to 218, where it is determined that the current image block contains a stationary target; otherwise, the method 200 is taken to 222, where the background image block is updated by reassigning the peak characteristic value of the histogram of the current image block thereto. Then, the method 200 is taken to 224.
At 226, the image blocks of the current frame image containing the stationary targets are outputted. Then, the method 200 returns to 204 to continue the stationary target detection.
When a target moving slowly and having identical color appears in the monitoring scene, the target may be updated or initialized to the background image because the histogram thereof may have only one peak. In a preferred embodiment, the operation of detecting and tracking moving targets 202 is introduced into the method 200 to exclude the image blocks containing the moving targets during initializing or updating the background image, thereby minimizing effects of the moving targets on the background image.
The features and advantages in the present invention comprise: (1) the stationary targets can be detected even if the stationary targets is shadowed because of dividing the monitoring area into the plural monitoring blocks in the present invention; (2) it is insensitive to noise interference because a sequence of characteristic values of one image block are considered to determine whether the image block contains the stationary targets.
The dividing module 301 is configured to divide each frame image into a plurality of image blocks. The image blocks in different frame images corresponding to the same position have the same block number.
The computing module 302 is configured to compute a characteristic value of each image block of each frame image.
The statistic module 303 is configured to form a plurality of characteristic value sequences each comprising a predefined number of characteristic values of the image blocks with the same block number and generate a histogram according to each characteristic value sequence.
The processing module 304 is configured to initialize each background image block according to corresponding histogram and determine whether one image block contains a stationary target by comparing corresponding histogram with corresponding background image block. If the image block contains the stationary target, the image block is outputted; otherwise, the background image block is updated according to corresponding histogram.
In a preferred embodiment, the device further comprises an excluding module configured to exclude the characteristic values of the image blocks containing the moving targets from corresponding characteristic value sequences.
The present invention has been described in sufficient details with a certain degree of particularity. It is understood to those skilled in the art that the present disclosure of embodiments has been made by way of examples only and that numerous changes in the arrangement and combination of parts may be resorted without departing from the spirit and scope of the invention as claimed. Accordingly, the scope of the present invention is defined by the appended claims rather than the foregoing description of embodiments.
Number | Date | Country | Kind |
---|---|---|---|
200910079279.6 | Jun 2009 | CN | national |