The present application is related to co-pending patent application Ser. No. 14/310,799, entitled “SYSTEM FOR RANKING AND SELECTING EVENTS IN MEDIA COLLECTIONS”, which is hereby incorporated in their entirety by this reference.
The invention relates generally to the field of digital image processing, and in particular to methods and systems for ranking and selecting events in consumer media collections.
The proliferation of digital cameras and scanners has lead to an explosion of digital images, creating large personal image databases. Since taking digital pictures is easy and practically free, consumers no longer restrict picture-taking to important events and special occasions. Images are being captured frequently, and of day-to-day occurrences in the consumers' life. Since a typical user has already accumulated many years of digital images, browsing the collection to find images taken during important events is a time-consuming process for the consumer.
There has been work in grouping images into events. U.S. Pat. No. 6,606,411, assigned to A. Loui and E. Pavie, entitled “A method for automatically classifying images into events,” issued Aug. 12, 2003 and U.S. Pat. No. 6,351,556, assigned to A. Loui, and E. Pavie, entitled “A method for automatically comparing content of images for classification into events,” issued Feb. 26, 2002, disclose algorithms for clustering image content by temporal events and sub-events. According to U.S. Pat. No. 6,606,411 events have consistent color distributions, and therefore, these pictures are likely to have been taken with the same backdrop. For each sub-event, a single color and texture representation is computed for all background areas taken together. The above two patents teach how to cluster images and videos in a digital image collection into temporal events and sub-events. The terms “event” and “sub-event” are used in an objective sense to indicate the products of a computer mediated procedure that attempts to match a user's subjective perceptions of specific occurrences (corresponding to events) and divisions of those occurrences (corresponding to sub-events). Another method of automatically organizing images into events is disclosed in U.S. Pat. No. 6,915,011, assigned to A. Loui, M. Jeanson, and Z. Sun, entitled “Event clustering of images using foreground and background segmentation” issued Jul. 5, 2005. The events detected are chronologically ordered in a timeline from earliest to latest.
Using the above methods, it is possible to reduce the amount of browsing required by the user to locate a particular event by viewing representatives of the events along a timeline, instead of each image thumbnail. However, a typical user may still generate hundreds of such events over a few year period, and more prolific picture-takers can easily exceed a few thousands detected events. It will be a very tedious task for the user to browse through their collection to pick various events or sub-events to create a photo product such as a collage or photobook. Hence, there is a need for new methods and systems to automatically rank the events and to select the preferred set of events based on some relevant criteria. In addition, the present invention also teaches how to select events from the ranked list of events based on a calculated target distribution, which can be computed using the distribution of one or more event attributes of the events derived from the media collection. Further, event ranking and selection can also be tied to social networks, where different user input such as tags and comments, will be used for aid in the ranking and selection.
There has been other work in event clustering using metadata. U.S. Pat. No. 7,860,866, assigned to Kim el at., entitled “Heuristic event clustering of media using metadata,” issued Dec. 28, 2010, discloses algorithms for clustering an media collection into event based on time difference and location difference between consecutive media files. However the above patent does not teach how to rank or select event from a media collection, which is the main idea in the present invention. The '866 patent only teaches how to cluster media files into separate events with no ranking information. There also has been work in identifying media assets using contextual information. U.S. Pat. No. 8,024,311, assigned to Wood and Hibino, entitled “Identifying media assets from contextual information,” issued on Sep. 20, 2011, discloses a method to select media assets by identifying an event using the received contextual information such as text data, gesture data, or audio data. The above patent clearly depends on a user to first provide some contextual information as input before it can identify the appropriate event, and the subsequent selection of the media assets. This is a different application as it requires user input and direction, whereas the present invention teaches how to automatically rank and select events without user input. Further, the '311 patent only identify one event (see
The organization and retrieval of images and videos is a problem for the typical consumer. It is useful for the user to be able to browse an overview of important events in their collection. Technology disclosed in prior art allows the classification of images in a collection into events, but not the ability to ascertain the importance or ranking of such events. As a result, these include uninteresting or common day-to-day events that inflate the number of events to the point where it is difficult to find more important events even when browsing a list of events. This invention teaches a method and system for automatically ranking events that have been detected from a media collection. In addition, it also discloses how to select events from a ranked list of events based on a calculated target distribution, which can be computed using the distribution of one or more event attributes of the events derived from the media collection.
In accordance with the present invention, there is provided a method and system for ranking events in media collections comprising designating a media collection, using a processor to cluster the media collection items into a hierarchical event structure, using the processor to identify and count visually similar sub-events within each event in the hierarchical event structure, using the processor to determine a ranking of events based on the count of sub-events within each event, and associating the determined ranking with each event in the media collection.
In another embodiment of the present invent, there is provide a method for selecting events from media collections comprising designating a media collection, using a processor to cluster the media collection items into a hierarchical event structure, using the processor to identify and count visually similar sub-events within each event in the hierarchical event structure, using the processor to determine a ranked list of events based on the count of sub-events within each event, using the processor to calculate a target distribution that is based on the distribution of one or more event attributes of the events derived from the media collection, and selecting events from the ranked list of events based on the calculated target distribution.
The present invention can be implemented in computer systems as will be well known to those skilled in the art. In the following description, some embodiments of the present invention will be described as software programs. Those skilled in the art will readily recognize that the equivalent of such a method may also be constructed as hardware or software within the scope of the invention.
Because image manipulation algorithms and systems are well known, the present description will be directed in particular to algorithms and systems forming part of, or cooperating more directly with, the method in accordance with the present invention. Other aspects of such algorithms and systems, and hardware or software for producing and otherwise processing the image signals involved therewith, not specifically shown or described herein can be selected from such systems, algorithms, components, and elements known in the art. Given the description as set forth in the following specification, all software implementation thereof is conventional and within the ordinary skill in such arts. Videos in a collection are included in the term “images” in the rest of the description.
The present invention can be implemented in computer hardware and computerized equipment. For example, the method can be performed in a digital camera, a multimedia smart phone, a digital printer, on an internet server, on a kiosk, and on a personal computer. Referring to
It should also be noted that the present invention can be implemented in a combination of software or hardware and is not limited to devices which are physically connected or located within the same physical location. One or more of the devices illustrated in
Referring to
Referring to
The events detected continue to be chronologically ordered in a timeline from earliest to latest. Using the method described above, it is not possible to detect single events that span a long period of time (days) and encompass a variety of activities and settings (for example, a long vacation covering multiple destinations) or events that occur in distinct parts separated by some hours from each other (for example, a sporting event with many matches or a wedding). Gaps in photo-taking corresponding to the overnight period also cause breaks in event continuity. Further processing is needed to detect these super-events, defined as a grouping of multiple contiguous events that may span multiple days. Inter-event duration, defined as the time duration between the last image of one event and the first image of the next event on a continuous timeline, is computed for each event. The events are then treated as single points on a time axis, separated by the inter-event durations. A density-based clustering method is applied to these points (ref. Data Mining Concepts and Techniques by Han and Kamber, Elsevier, 2006, supra, pp. 418-420) to cluster events into super-events when they are separated by relatively small duration gaps (for example, less than 18 hours). The final three-level hierarchical event representation includes super-events, events and sub-events. After this point, the term “event” refers to the top-level of the hierarchical event representation˜which can be a super-event or an event. Referring to
Referring to
Referring to
In one aspect of the present invention, the number of sub-events in the event is used to rank events in descending order of importance. Since each sub-event extracted using the method disclosed in U.S. Pat. No. 6,606,411 has consistent color distribution as determined by block-level color histogram similarity; more sub-events in an event indicates that these pictures are likely to have been taken with diverse backgrounds that increase the scope of the event. This justifies a higher ranking when there are more sub-events. In another embodiment, the significance score, defined as the residual divided by the variance (σ), is used to rank the events, with a higher score getting a higher rank. The significance score generated at the end of the significant event detection described earlier indicates how well the event fits into the estimated model, with a higher score indicating a lower fit, and therefore, the event is more likely to be something unusual and important in the collection.
In another aspect of the present invention, the interestingness of an event can be modeled as shown in
In another aspect of the present invention, the albums of images a user uploads for sharing to social networks are gathered, along with social interactions such as “likes”, comments, and tags associated with each image. The images in these albums are treated as a set of images that have no capture date-time information, but are in a list arranged by the time of upload. This list of images is merged into a user's personal image collection that resides on their private storage (which can be on a personal computer, mobile device or online storage) using the method described in U.S. Pat. No. 7,831,599 “Additive clustering of images lacking individualized date-time information” by Das et al issued Sep. 11, 2010. This patent describes a method that uses a dynamic programming-based formulation to merge images lacking capture time into an organized collection where events have already been computed and capture date-time information exists. The method computes image similarity scores based on image content, and ensures that the ordering of the incoming list of images is maintained. After merging the shared images into the user's personal collection, the number of social interactions (“likes”, comments and tags) derived from the shared images are counted for each event in the user collection that contains shared images from the merging process. The events are ranked in decreasing order of number of social interactions.
In another aspect of the present invention, the number of images that are marked by the user is counted for each event, and the events are ranked in decreasing order of the number of user markings, The user markings can take different forms including being marked a “favorite” either at time of capture on the capture device itself, or later on the site of storage (computer or online storage); marked as to be used for sharing; or marked with a star rating system provided by the capture device or storage site with the maximum star rating allowed.
Referring to
Event size refers to the number of assets (images or video) contained in the top-level event (i.e., a super-event or an event). The media type of an event refers to the ratio of videos to images in the event, discretized into a pre-specified number of bins. The media type indicates the mix of video and images in an event.
Referring to
Referring to
Referring to
Referring to
A method for ranking events in media collections comprises designating a media collection, using a processor to cluster the media collection items into a hierarchical event structure, using the processor to identify and count visually similar sub-events within each event in the hierarchical event structure, using the processor to determine a ranking of events based on the count of sub-events within each event, and associating the determined ranking with each event in the media collection.
The ranking of events can be based on the significance score of the event, on a distribution that models the importance of an event over an elapsed time period, on a score or distribution that models the interestingness of an event over an elapsed time period, on metadata from social networks such as number of likes and comments, on metadata from social networks through the analysis of user tags and comments, or on the number of images in the event that have been marked by the user as being a favorite or to be used for sharing.
A method for selecting events from media collections comprises designating a media collection, using a processor to cluster the media collection items into a hierarchical event structure, using the processor to identify and count visually similar sub-events within each event in the hierarchical event structure, using the processor to determine a ranked list of events based on the count of sub-events within each event, using the processor to calculate a target distribution that is based on the distribution of one or more event attributes of the events derived from the media collection, and selecting events from the ranked list of events based on the calculated target distribution.
The event attribute used in the target distribution can be the event class, the event size, or the media type of the event. The ranking of events is based on the significance score of the event, on a distribution that models the importance of an event over an elapsed time period, on scores or a distribution that models the interestingness of an event over an elapsed time period, on metadata from social networks such as number of likes and comments, on metadata from social networks through the analysis of user tags and comments, or on the number of images in the event that have been marked by the user as being a favorite or to be used for sharing.
A system for ranking events in media collections comprises a processor-accessible memory for storing a media collection, and a processor for clustering the media collection items into a hierarchical event structure, for identifying and visually counting similar sub-events within each event in the hierarchical event structure, for determining a ranking of events based on the count of sub-events within each event, and for associating the determined ranking with each event in the media collection.
A system for selecting events from media collections comprises a processor-accessible memory for storing a media collection and a processor for clustering the media collection items into a hierarchical event structure, for identifying and visually counting similar sub-events within each event in the hierarchical event structure, for determining a ranked list of events based on the count of sub-events within each event, for calculating a target distribution based on the distribution of one or more event attributes of the events derived from the media collection, and for selecting events from the ranked list of events based on the calculated target distribution.
The invention has been described in detail with particular reference to certain preferred embodiments thereof, but it will be understood that variations and modifications can be effected within the spirit and scope of the invention.
Number | Name | Date | Kind |
---|---|---|---|
6351556 | Loui | Feb 2002 | B1 |
6460036 | Herz | Oct 2002 | B1 |
6606411 | Loui | Aug 2003 | B1 |
6915011 | Loui | Jul 2005 | B2 |
7406436 | Reisman | Jul 2008 | B1 |
7831599 | Das | Nov 2010 | B2 |
7860866 | Kim | Dec 2010 | B2 |
8024311 | Wood | Sep 2011 | B2 |
8108408 | Kondo | Jan 2012 | B2 |
8340436 | Das | Dec 2012 | B2 |
8392430 | Hua | Mar 2013 | B2 |
8429026 | Kolawa | Apr 2013 | B1 |
8718386 | Das | May 2014 | B2 |
20040208377 | Loui | Oct 2004 | A1 |
20060104520 | Kraus | May 2006 | A1 |
20060271526 | Charnock | Nov 2006 | A1 |
20070100563 | Webb | May 2007 | A1 |
20070127783 | Kuramoto | Jun 2007 | A1 |
20080205772 | Blose | Aug 2008 | A1 |
20090094518 | Lawther | Apr 2009 | A1 |
20090109298 | Wan | Apr 2009 | A1 |
20090148071 | Ohwa | Jun 2009 | A1 |
20090161962 | Gallagher | Jun 2009 | A1 |
20090319560 | Cheng | Dec 2009 | A1 |
20100070448 | Omoigui | Mar 2010 | A1 |
20100124378 | Das | May 2010 | A1 |
20110213655 | Henkin | Sep 2011 | A1 |
20110317982 | Xu | Dec 2011 | A1 |
20120014560 | Obrador | Jan 2012 | A1 |
20120143921 | Wilson | Jun 2012 | A1 |
20120191716 | Omoigui | Jul 2012 | A1 |
20120210229 | Bryant | Aug 2012 | A1 |
20130054620 | Stokes | Feb 2013 | A1 |
20130230230 | Ajemba | Sep 2013 | A1 |
20140002342 | Fedorovskaya | Jan 2014 | A1 |
20140365506 | Gong | Dec 2014 | A1 |
20150169641 | Alldrin | Jun 2015 | A1 |
Number | Date | Country | |
---|---|---|---|
20150006523 A1 | Jan 2015 | US |
Number | Date | Country | |
---|---|---|---|
61840031 | Jun 2013 | US |