Embodiments of the present invention comprise methods and systems for automatically detecting regions in digital images.
The content of a digital image can have considerable impact on the compression of the digital image, both in terms of compression efficiency and compression artifacts. Pictorial regions in an image are not efficiently compressed using compression algorithms designed for the compression of text. Similarly, text images are not efficiently compressed using compression algorithms that are designed and optimized for pictorial content. Not only is compression efficiency affected when a compression algorithm designed for one type of image content is used on a different type of image content, but the decoded image may exhibit annoying compression artifacts.
Further, image enhancement algorithms designed to sharpen text, if applied to pictorial image content, may produce visually annoying artifacts in some areas of the pictorial content. In particular, those areas of the pictorial content containing strong edges may be affected. While smoothing operations may enhance a natural image, the smoothing of text regions is seldom desirable.
The detection of regions of a particular content type in a digital image can improve compression efficiency, reduce compression artifacts, and improve image quality when used in conjunction with a compression algorithm or image enhancement algorithm designed for the particular type of content.
The semantic labeling of image regions based on content is also useful in document management systems and image databases.
Reliable and efficient detection of regions of pictorial content type and other image regions in digital images is desirable.
Embodiments of the present invention comprise methods and systems for classifying an image into regions using a masking condition and an entropy measure.
The foregoing and other objectives, features, and advantages of the invention will be more readily understood upon consideration of the following detailed description of the invention taken in conjunction with the accompanying drawings.
Embodiments of the present invention will be best understood by reference to the drawings, wherein like parts are designated by like numerals throughout. The figures listed above are expressly incorporated as part of this detailed description.
It will be readily understood that the components of the present invention, as generally described and illustrated in the figures herein, could be arranged and designed in a wide variety of different configurations. Thus, the following more detailed description of the embodiments of the methods and systems of the present invention is not intended to limit the scope of the invention but it is merely representative of the presently preferred embodiments of the invention.
Elements of embodiments of the present invention may be embodied in hardware, firmware and/or software. While exemplary embodiments revealed herein may only describe one of these forms, it is to be understood that one skilled in the art would be able to effectuate these elements in any of these forms while resting within the scope of the present invention.
An exemplary region-detection system 20 is shown in
The effectiveness and reliability of a region-detection system may depend on the feature or features used for the classification.
Embodiments of the present invention comprise methods and systems for region detection in a digital image. Some embodiments of the present invention comprise methods and systems for region detection in a digital image wherein the separation between feature values corresponding to image regions may be accomplished by masking, prior to feature extraction, pixels in the image for which a masking condition is met. In some embodiments, the masked pixel values may not be used when extracting the feature value from the image.
In exemplary embodiments of the present invention shown in
In the exemplary embodiments of the present invention shown in
When a pixel is accumulated in the histogram 74, a counter for counting the number of non-mask pixels in the block of the masked image is incremented 75. When all pixels in a block have been examined 78, 79, the histogram may be normalized 80. The histogram may be normalized 80 by dividing each bin count by the number of non-mask pixels in the block of the masked image. In alternate embodiments, the histogram may not be normalized and the counter may not be present.
An entropy measure 55 may be calculated 56 for the histogram 53 of a block of the masked image. The entropy measure 55 may be considered an image feature of the input image. The entropy measure 55 may be considered any measure of the form:
where N is the number of histogram bins, h(i) is the accumulation or count of bin i, and ƒ(·) may be a function with mathematical characteristics similar to a logarithmic function. The entropy measure 55 may be weighted by the proportion of pixels that would have been counted in a bin, but were masked. The entropy measure is of the form:
where w(i) is the weighting function. In some embodiments of the present invention, the function ƒ(h(i)) may be log2(h(i)).
In the embodiments of the present invention shown in
In some embodiments of the present invention, a moving window of pixel values centered, in turn, on each pixel of the image, may be used to calculate the entropy measure for the block containing the centered pixel. The entropy may be calculated from the corresponding block in the masked image. The entropy value may be used to classify the pixel at the location on which the moving window is centered.
In other embodiments of the present invention, the entropy value may be calculated for a block of the image, and all pixels in the block may be classified with the same classification based on the entropy value.
In some embodiments of the present invention, the masking condition may be based on the edge strength at a pixel.
In some embodiments of the present invention, a level of confidence in the degree to which the masking condition is satisfied may be calculated. The level of confidence may be used when accumulating a pixel into the histogram. Exemplary embodiments in which a level of confidence is used are shown in
In exemplary embodiments of the present invention shown in
In the exemplary embodiments of the present invention shown in
When a pixel is accumulated in the histogram 124, a counter for counting the number of non-mask pixels in the block of the masked image is incremented 125. When all pixels in a block have been examined 128, 129, the histogram may be normalized 130. The histogram may be normalized 130 by dividing each bin count by the number of non-mask pixels in the block of the masked image. In alternate embodiments, the histogram may not be normalized and the counter not be present.
An entropy measure 105 may be calculated 106 for the histogram of a neighborhood of the masked image as described in the previous embodiments. In the embodiments of the present invention shown in
In some embodiments of the present invention, the masking condition may comprise a single image condition. In some embodiments, the masking condition may comprise multiple image conditions combined to form a masking condition.
In some embodiments of the present invention, the entropy feature may be used to separate the image into two regions. In some embodiments of the present invention, the entropy feature may be used to separate the image into more than two regions.
The terms and expressions which have been employed in the foregoing specification are used therein as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding equivalence of the features shown and described or portions thereof, it being recognized that the scope of the invention is defined and limited only by the claims which follow.
Number | Name | Date | Kind |
---|---|---|---|
4414635 | Gast et al. | Nov 1983 | A |
4741046 | Matsunawa | Apr 1988 | A |
5001767 | Yoneda et al. | Mar 1991 | A |
5034988 | Fujiwara | Jul 1991 | A |
5157740 | Klein et al. | Oct 1992 | A |
5280367 | Zuniga | Jan 1994 | A |
5293430 | Shiau | Mar 1994 | A |
5339172 | Robinson | Aug 1994 | A |
5348474 | Pasini | Sep 1994 | A |
5353132 | Katsuma | Oct 1994 | A |
5379130 | Wang et al. | Jan 1995 | A |
5481622 | Gerhardt et al. | Jan 1996 | A |
5581667 | Bloomberg | Dec 1996 | A |
5588072 | Wang | Dec 1996 | A |
5642137 | Kitazumi | Jun 1997 | A |
5649025 | Revankar | Jul 1997 | A |
5682249 | Harrington et al. | Oct 1997 | A |
5689575 | Sako et al. | Nov 1997 | A |
5694228 | Peairs et al. | Dec 1997 | A |
5696842 | Shirasawa | Dec 1997 | A |
5767978 | Revankar | Jun 1998 | A |
5768403 | Suzuki | Jun 1998 | A |
5778092 | MacLeod et al. | Jul 1998 | A |
5809167 | Al-Hussein | Sep 1998 | A |
5848185 | Koga et al. | Dec 1998 | A |
5854853 | Wang | Dec 1998 | A |
5867277 | Melen et al. | Feb 1999 | A |
5900953 | Bottou et al. | May 1999 | A |
5903363 | Yaguchi et al. | May 1999 | A |
5923775 | Snyder et al. | Jul 1999 | A |
5943443 | Itonori et al. | Aug 1999 | A |
5946420 | Noh | Aug 1999 | A |
5949555 | Sakai et al. | Sep 1999 | A |
5956468 | Ancin | Sep 1999 | A |
5987171 | Wang | Nov 1999 | A |
5995665 | Maeda | Nov 1999 | A |
6020979 | Zeck et al. | Feb 2000 | A |
6084984 | Ishikawa | Jul 2000 | A |
6175427 | Lembeck et al. | Jan 2001 | B1 |
6175650 | Sindhu et al. | Jan 2001 | B1 |
6178260 | Li et al. | Jan 2001 | B1 |
6198797 | Majima et al. | Mar 2001 | B1 |
6215904 | Lavallee | Apr 2001 | B1 |
6222932 | Rao et al. | Apr 2001 | B1 |
6233353 | Danisewicz | May 2001 | B1 |
6246791 | Kurzweil et al. | Jun 2001 | B1 |
6256413 | Hirabayashi | Jul 2001 | B1 |
6272240 | Li et al. | Aug 2001 | B1 |
6298173 | Lopresti | Oct 2001 | B1 |
6301381 | Hayashi | Oct 2001 | B1 |
6308179 | Petersen et al. | Oct 2001 | B1 |
6347153 | Triplett et al. | Feb 2002 | B1 |
6360009 | Li et al. | Mar 2002 | B2 |
6373981 | de Queiroz et al. | Apr 2002 | B1 |
6389164 | Li et al. | May 2002 | B2 |
6400844 | Fan et al. | Jun 2002 | B1 |
6473522 | Lienhart et al. | Oct 2002 | B1 |
6522791 | Nagarajan | Feb 2003 | B2 |
6526181 | Smith et al. | Feb 2003 | B1 |
6577762 | Seeger et al. | Jun 2003 | B1 |
6594401 | Metcalfe et al. | Jul 2003 | B1 |
6661907 | Ho et al. | Dec 2003 | B2 |
6718059 | Uchida | Apr 2004 | B1 |
6728391 | Wu et al. | Apr 2004 | B1 |
6728399 | Doll | Apr 2004 | B1 |
6731789 | Tojo | May 2004 | B1 |
6731800 | Barthel et al. | May 2004 | B1 |
6766053 | Fan et al. | Jul 2004 | B2 |
6778291 | Clouthier | Aug 2004 | B1 |
6782129 | Li et al. | Aug 2004 | B1 |
6901164 | Sheffer | May 2005 | B2 |
6950114 | Honda et al. | Sep 2005 | B2 |
6993185 | Guo et al. | Jan 2006 | B2 |
7020332 | Nenonen et al. | Mar 2006 | B2 |
7027647 | Mukherjee et al. | Apr 2006 | B2 |
7062099 | Li et al. | Jun 2006 | B2 |
7079687 | Guleryuz | Jul 2006 | B2 |
7133565 | Toda et al. | Nov 2006 | B2 |
7181059 | Duvdevani et al. | Feb 2007 | B2 |
7190409 | Yamazaki et al. | Mar 2007 | B2 |
7206443 | Duvdevani et al. | Apr 2007 | B1 |
7221805 | Bachelder | May 2007 | B1 |
7483484 | Liu et al. | Jan 2009 | B2 |
7518755 | Gotoh et al. | Apr 2009 | B2 |
20010016077 | Oki | Aug 2001 | A1 |
20010050785 | Yamazaki | Dec 2001 | A1 |
20020031268 | Prabhakar et al. | Mar 2002 | A1 |
20020037100 | Toda et al. | Mar 2002 | A1 |
20020064307 | Koga et al. | May 2002 | A1 |
20020076103 | Lin et al. | Jun 2002 | A1 |
20020110283 | Fan et al. | Aug 2002 | A1 |
20020168105 | Li | Nov 2002 | A1 |
20030086127 | Ito et al. | May 2003 | A1 |
20030107753 | Sakamoto | Jun 2003 | A1 |
20030133612 | Fan | Jul 2003 | A1 |
20030133617 | Mukherjee | Jul 2003 | A1 |
20030156760 | Navon et al. | Aug 2003 | A1 |
20040001624 | Curry et al. | Jan 2004 | A1 |
20040001634 | Mehrotra | Jan 2004 | A1 |
20040042659 | Guo et al. | Mar 2004 | A1 |
20040083916 | Isshiki | May 2004 | A1 |
20040096102 | Handley | May 2004 | A1 |
20040179742 | Li | Sep 2004 | A1 |
20040190027 | Foster et al. | Sep 2004 | A1 |
20040190028 | Foster et al. | Sep 2004 | A1 |
20040205568 | Breuel et al. | Oct 2004 | A1 |
20040240733 | Hobson et al. | Dec 2004 | A1 |
20050008221 | Hull et al. | Jan 2005 | A1 |
20050100219 | Berkner et al. | May 2005 | A1 |
20050100220 | Keaton et al. | May 2005 | A1 |
20050129310 | Herley | Jun 2005 | A1 |
20050163374 | Ferman et al. | Jul 2005 | A1 |
20050174586 | Yoshida et al. | Aug 2005 | A1 |
20050180647 | Curry et al. | Aug 2005 | A1 |
20050281474 | Huang | Dec 2005 | A1 |
20050286758 | Zitnick et al. | Dec 2005 | A1 |
20060072830 | Nagarajan et al. | Apr 2006 | A1 |
20060133690 | Bloomberg et al. | Jun 2006 | A1 |
20060153441 | Li | Jul 2006 | A1 |
20060221090 | Takeshima et al. | Oct 2006 | A1 |
20060229833 | Pisupati et al. | Oct 2006 | A1 |
20060269159 | Kim et al. | Nov 2006 | A1 |
20070291120 | Campbell et al. | Dec 2007 | A1 |
20080212864 | Bornefalk | Sep 2008 | A1 |
Number | Date | Country |
---|---|---|
07-107275 | Apr 1995 | JP |
08-065514 | Mar 1996 | JP |
2002-325182 | Nov 2002 | JP |
2003-123072 | Apr 2003 | JP |
2004-110606 | Apr 2004 | JP |
2007-235953 | Sep 2007 | JP |
2005067586 | Jul 2005 | WO |
2006066325 | Jun 2006 | WO |
Number | Date | Country | |
---|---|---|---|
20070206855 A1 | Sep 2007 | US |