The invention pertains to discovery of information from video data, and particularly to finding items disclosed in the information. More particularly, the invention pertains to determining relationships among the items.
The invention is a system for information discovery of items, such as individuals or objects, from video-based tracks. The system may compute similarities of characteristics of the items and present the results in a matrix form. A similarity portrayal may have nodes representing the items with edges between the nodes. The edges may have weights in the form of vectors indicating similarities of the characteristics between the nodes situated at the ends of the edges. The edges may be augmented with temporal and spatial properties from the tracks which cover the items. These properties may play a part in a multi-objective presentation of information about the items in terms of a negative or supportive basis. The presentation may be partitioned into clusters which may lead to a merger of items or tracks. The system may pave a way for good group discovery in things like video-based social networks.
A key challenge that needs to be addressed by nodal video data analysis is to enable robust cross-data analysis in the presence of node ambiguity. This may be due to the uncertainty that typically results from tracking entities in the presence of occlusions, stops and spatial and temporal gaps.
A crucial step is node disambiguation, which correlates subjects across cameras and time (e.g., if a subject leaves the view of a camera and later returns). This step may be crucial to enable integrated data mining or analyses across time and space. The primary means one may use to correlate subjects is to compare results of a face and/or body similarity computation. Given two images of subjects, the similarity computation may compute a score that specifies how similar the two images are. Therefore, if a single image is compared against all other images in the image database, an ordered list of images may be generated for it.
The similarity computation may have a number of disadvantages. First, due to the non-linear nature of the computation, only order can be derived from the results, but not comparative magnitude. E.g., assume image A is compared to images B and C and results in similarity metrics of 10 and 20, respectively. It does not necessarily follow then that B is twice as likely as C to be a match to A. While B is more similar to A than C, nothing more can really be said regarding the relative similarity. Another disadvantage is that general threshold values cannot necessarily be used across images. E.g., one cannot necessarily create a static rule that any pair of images with a similarity score over one hundred are to be considered different subjects. For some images, one hundred may be a good score. For others, it may be a poor match. Therefore, using only a similarity measure between images may be insufficient for node disambiguation.
The present invention is based on the following observations. The same subject cannot be observed in different places at the same time. In order for a subject to be observed at different locations, the time to travel to that location should be sufficient. Two tracks of similar subjects are more likely to belong to the same person if they are (almost) contiguous. That is, it appears more advantageous to cluster two similar tracks if they are also similar in time and space then to cluster two similar tracks that are not close in time and space.
The present node disambiguation approach may rely on multi-objective partitioning algorithms to cluster together tracks that are likely to represent the same person that a company, such as Honeywell International Inc., may apply to multi-modal data arising from a video recognition domain, including face and body similarity data, kinematic data, archived social network data, and so forth, to detect, correlate, and disambiguate individuals and groups across space and time.
One may use exclusivity constraints to indicate that two nodes may not refer to the same subject. Subjects that are observed at different locations at about the same time may not necessarily be clustered together. In addition, subjects observed at different location may not necessarily be clustered together if the temporal gap between observations is not sufficient for the subject to travel from one location to another.
Additionally, the similarity weights to connect two subjects may be dynamically adjusted based on temporal and spatial proximity. The more closely in time and space the subjects are the more importance one may put on similarity of those two subjects. Thus, the subjects observed over large temporal and spatial gap should only be clustered together if their similarity measure is extremely strong.
Multi-objective graph partitioning may compute clusters given graphs that have multiple types of edge and nodes, whose edge weights cannot be meaningfully combined.
Information in a graph may also or instead be in a form of a portrayal, rendition, presentation, depiction, layout, representation, or the like.
After the similarity graph 85 construction, a graph augmentation at symbol 86 may bring in the track special and temporal properties and tie them into the graph already having vectors for the characteristics. A result may be a multi-objective graph 87 of the items, tracks, nodes or persons in a form of vector edges with the characteristics in terms of similarity values between the nodes. A multi-objective graph partitioner 88 may take the values of the edge vectors and determine which nodes belong in the same cluster with a similarity score calculated by an algorithm. The result may be clusters 89. From these cluster 89 indications, tracks 81 may be a merge track process 90 accordingly resulting in merged tracks 91.
In flow diagram 80, similarity computation 82 and similarity matrix may be in a similarity module 101. Graph constructor 84 and similarity graph 85 may be in a graph module 102. Graph augmentation 86 and multi-objective graph 87 may be in an augmentation module 103. Multi-objective graph partitioner 88 and clusters 89 may be in a cluster module 104. Merge tracks 90 may be a merger module 90.
The following applications may be relevant. U.S. patent application Ser. No. 12/547,415, filed Aug. 25, 2009, and entitled “Framework for Scalable State Estimation Using Multi Network Observations”, is hereby incorporated by reference. U.S. patent application Ser. No. 12/369,692, filed Feb. 11, 2009, and entitled “Social Network Construction Based on Data Association”, is hereby incorporated by reference. U.S. patent application Ser. No. 12/187,991, filed Aug. 7, 2008, and entitled “System for Automatic Social Network Construction from Image Data”, is hereby incorporated by reference. U.S. patent application Ser. No. 12/124,293, filed May 21, 2008, and entitled “System Having a layered Architecture for Constructing a Dynamic Social Network from Image Data”, is hereby incorporated by reference.
In the present specification, some of the matter may be of a hypothetical or prophetic nature although stated in another manner or tense.
Although the present system has been described with respect to at least one illustrative example, many variations and modifications will become apparent to those skilled in the art upon reading the specification. It is therefore the intention that the appended claims be interpreted as broadly as possible in view of the prior art to include all such variations and modifications.
Number | Name | Date | Kind |
---|---|---|---|
6266053 | French et al. | Jul 2001 | B1 |
7016307 | Vasudev et al. | Mar 2006 | B2 |
7249123 | Elder et al. | Jul 2007 | B2 |
7366759 | Trevithick et al. | Apr 2008 | B2 |
7512628 | Chess et al. | Mar 2009 | B2 |
7519200 | Gokturk et al. | Apr 2009 | B2 |
7800616 | Said et al. | Sep 2010 | B2 |
8041082 | Baluja et al. | Oct 2011 | B1 |
20070174272 | Carter et al. | Jul 2007 | A1 |
20070239764 | Song et al. | Oct 2007 | A1 |
20070250898 | Scanlon et al. | Oct 2007 | A1 |
20070296863 | Hwang et al. | Dec 2007 | A1 |
20080010304 | Vempala et al. | Jan 2008 | A1 |
20080091723 | Zuckerberg et al. | Apr 2008 | A1 |
20090290755 | Ma et al. | Nov 2009 | A1 |
20090292549 | Ma et al. | Nov 2009 | A1 |
20090307772 | Markham et al. | Dec 2009 | A1 |
20100036875 | Miezianko et al. | Feb 2010 | A1 |
Number | Date | Country |
---|---|---|
2008039401 | Apr 2008 | WO |
Entry |
---|
Apostal et al., “Checkmate Network Security Modeling, IEEE Computer Society, Proceedings of DARPA Information Survivability Conference & Exposition II,” pp. 214-226, Jun. 12-14, 2001. |
U.S. Appl. No. 12/816,314, filed Jun. 15, 2010. |
Djidjev, “A Scalable Multilevel Algorithm for Graph Clustering and Community Structure Detection,” Algorithms and Models for the Web-Graph, Fourth International Workshop, 17 pages, Nov. 30-Dec. 1, 2006. |
Goldszmidt et al., “Qualitative Probabilities for Default Reasoning, Belief Revision, and Casual Modeling,” Artificial Intelligence, vol. 84, pp. 57-112, 1996. |
Hanneman et al., “Introduction to Social Network Methods,” Online at http://faculty.ucr.ed/-hanneman/nettext/, 296 pages, printed Apr. 4, 2008. |
Heimerdinger, “Scyllarus Intrusion Detection Report Correlator and Analyzer,” Proceedings of the DARPA Information Survivability Conference and Exposition (DISCEX '03), vol. II, 12 pages, 2003. |
Karypis et al., “Chameleon: Hierarchial Clustering Using Dynamic Modeling,” IEEE Computer Society, vol. 32, No. 8, pp. 68-74, Aug. 1999. |
Karypis et al., “Multilevel Algorithms for Multi-Constraint Graph Partitioning,” 25 pages, May 5, 1998. |
Karypis et al., “Parallel Graph Partitioning and Sparse Matrix Ordering Library Version 3.1,” Parametis, 29 pages, Aug. 15, 2003. |
Kenyon, “Adaptive Response Tool Foils Hacker Intrusion,” Signal, pp. 33-35, Aug. 1999. |
Ketkar et al., “Subdue: Compression-Based Frequent Pattern Discovery in Graph Data,” OSDM 2005, Proceedings of the First International Workshop on Open Source Data Mining, pp. 71-76, 2005. |
Musliner et al., “World Modeling for the Dynamic Construction of Real-Time Control Plans,” Artificial Intelligence vol. 74, pp. 83-127, 1995. |
Schloegel et al., “A New Algorithm for Multi-Objective Graph Partitioning,” Euro-Par' 99 Parallel Processing, 5th International Euro-Par Conference Proceedings, pp. 322-331, Aug. 31-Sep. 3, 1999. |
Schloegel et al., “Graph Partitioning for High-Performance Scientific Simulations,” Sourcebook of Parallel Computing, pp. 491-541, 2002. |
Schloegel et al., “Parallel Static and Dynamic Multi-Constraint Graph Partitioning,” Concurrency and Computation: Practice and Experience, vol. 14, pp. 219-240, 2002. |
Number | Date | Country | |
---|---|---|---|
20120005208 A1 | Jan 2012 | US |