Thanks to advances in imaging technologies, people take more pictures than ever before. Further, the proliferation of media sharing applications has increased the demand for picture sharing to a greater degree than ever before. Yet the flood of photos, and the need to sort through them to find relevant pictures, has actually increased the time and effort required for sharing pictures. As a result, it is often the case that either pictures that are less than representative of the best pictures, or no pictures at all, end up getting shared.
Most people have many pictures or videos that include different people (or other entities) including family, friends, acquaintances, and strangers. In many cases, not all entities are treated equally when deciding the importance of image. For example, images that show family and friends are typically treated as more important than images that show mostly strangers or the like. But sorting through today's large number of images to select the more important ones can be prohibitively time-consuming.
The summary provided in this section summarizes one or more partial or complete example embodiments of the invention in order to provide a basic high-level understanding to the reader. This summary is not an extensive description of the invention and it may not identify key elements or aspects of the invention, or delineate the scope of the invention. Its sole purpose is to present various aspects of the invention in a simplified form as a prelude to the detailed description provided below.
The invention encompasses technologies for grouping images, and ranking the images and the groupings, based on entities shown in the images. Images may be grouped based on faces shown in the images. Different images with faces that indicate the same entity (e.g., Adam) may be automatically grouped together. Different images with faces that indicate the same multiple entities (e.g., the people in my family) may also be automatically grouped together. Such automatic grouping may be based on facial recognition technologies. Further, images and groups of images may be automatically ranked based on the faces shown and entities represented. Such rankings may also be influenced by adjacent data that indicates family and friends and the like, and that can be used to identify such entities in the images.
Many of the attendant features will be more readily appreciated as the same become better understood by reference to the detailed description provided below in connection with the accompanying drawings.
The detailed description provided below will be better understood when considered in connection with the accompanying drawings, where:
like-numbered labels in different figures are used to designate similar or identical elements or steps in the accompanying drawings.
The detailed description provided in this section, in connection with the accompanying drawings, describes one or more partial or complete example embodiments of the invention, but is not intended to describe all possible embodiments of the invention. This detailed description sets forth various examples of at least some of the technologies, systems, and/or methods invention. However, the same or equivalent technologies, systems, and/or methods may be realized according to examples as well.
Although the examples provided herein are described and illustrated as being implementable in a computing environment, the environment described is provided only as an example and not a limitation. As those skilled in the art will appreciate, the examples disclosed are suitable for implementation in a wide variety of different computing environments.
Computing environment 100 typically includes a general-purpose computing system in the form of a computing device 101 coupled to various components, such as peripheral devices 102, 103, 101 and the like. These may include components such as input devices 103, including voice recognition technologies, touch pads, buttons, keyboards and/or pointing devices, such as a mouse or trackball, that may operate via one or more input/output (“I/O”) interfaces 112. The components of computing device 101 may include one or more processors (including central processing units (“CPU”), graphics processing units (“GPU”), microprocessors (“μP”), and the like) 107, system memory 109, and a system bus 108 that typically couples the various components. Processor(s) 107 typically processes or executes various computer-executable instructions and, based on those instructions, controls the operation of computing device 101. This may include the computing device 101 communicating with other electronic and/or computing devices, systems or environments (not shown) via various communications technologies such as a network connection 114 or the like. System bus 108 represents any number of bus structures, including a memory bus or memory controller, a peripheral bus, a serial bus, an accelerated graphics port, a processor or local bus using any of a variety of bus architectures, and the like.
System memory 109 may include computer-readable media in the form of volatile memory, such as random access memory (“RAM”), and/or non-volatile memory, such as read only memory (“ROM”) or flash memory (“FLASH”). A basic input/output system (“BIOS”) may be stored in non-volatile or the like. System memory 109 typically stores data, computer-executable instructions and/or program modules comprising computer-executable instructions that are immediately accessible to and/or presently operated on by one or more of the processors 107.
Mass storage devices 104 and 110 may be coupled to computing device 101 or incorporated into computing device 101 via coupling to the system bus. Such mass storage devices 104 and 110 may include non-volatile RAM, a magnetic disk drive which reads from and/or writes to a removable, non-volatile magnetic disk (e.g., a “floppy disk”) 105, and/or an optical disk drive that reads from and/or writes to a non-volatile optical disk such as a CD ROM, DVD ROM 106. Alternatively, a mass storage device, such as hard disk 110, may include non-removable storage medium. Other mass storage devices may include memory cards, memory sticks, tape storage devices, and the like.
Any number of computer programs, files, data structures, and the like may be stored in mass storage 110, other storage devices 104, 105, 106 and system memory 109 (typically limited by available space) including, by way of example and not limitation, operating systems, application programs, data files, directory structures, computer-executable instructions, and the like.
Output components or devices, such as display device 102, may be coupled to computing device 101, typically via an interface such as a display adapter 111. Output device 102 may be a liquid crystal display (“LCD”). Other example output devices may include printers, audio outputs, voice outputs, cathode ray tube (“CRT”) displays, tactile devices or other sensory output mechanisms, or the like. Output devices may enable computing device 101 to interact with human operators or other machines, systems, computing environments, or the like. A user may interface with computing environment 100 via any number of different I/O devices 103 such as a touch pad, buttons, keyboard, mouse, joystick, game pad, data port, and the like. These and other I/O devices may be coupled to processor 107 via I/O interfaces 112 which may be coupled to system bus 108, and/or may be coupled by other interfaces and bus structures, such as a parallel port, game port, universal serial bus (“USB”), fire wire, infrared (“IR”) port, and the like.
Computing device 101 may operate in a networked environment via communications connections to one or more remote computing devices through one or more cellular networks, wireless networks, local area networks (“LAN”), wide area networks (“WAN”), storage area networks (“SAN”), the Internet, radio links, optical links and the like. Computing device 101 may be coupled to a network via network adapter 113 or the like, or, alternatively, via a modem, digital subscriber line (“DSL”) link, integrated services digital network (“ISDN”) link, Internet link, wireless link, or the like.
Communications connection 114, such as a network connection, typically provides a coupling to communications media, such as a network. Communications media typically provide computer-readable and computer-executable instructions, data structures, files, program modules and other data using a modulated data signal, such as a carrier wave or other transport mechanism. The term “modulated data signal” typically means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communications media may include wired media, such as a wired network or direct-wired connection or the like, and wireless media, such as acoustic, radio frequency, infrared, or other wireless communications mechanisms.
Power source 190, such as a battery or a power supply, typically provides power for portions or all of computing environment 100. In the case of the computing environment 100 being a mobile device or portable device or the like, power source 190 may be a battery. Alternatively, in the case computing environment 100 is a desktop computer or server or the like, power source 190 may be a power supply designed to connect to an alternating current (“AC”) source, such as via a wall outlet.
Some mobile devices may not include many of the components described in connection with
Those skilled in the art will realize that storage devices utilized to provide computer-readable and computer-executable instructions and data can be distributed over a network. For example, a remote computer or storage device may store computer-readable and computer-executable instructions in the form of software applications and data. A local computer may access the remote computer or storage device via the network and download part or all of a software application or data and may execute any computer-executable instructions. Alternatively, the local computer may download pieces of the software or data as needed, or distributively process the software by executing some of the instructions at the local computer and some at remote computers and/or devices.
Those skilled in the art will also realize that, by utilizing conventional techniques, all or portions of the software's computer-executable instructions may be carried out by a dedicated electronic circuit such as a digital signal processor (“DSP”), programmable logic array (“PLA”), discrete circuits, and the like. The term “electronic apparatus” may include computing devices or consumer electronic devices comprising any software, firmware or the like, or electronic devices or circuits comprising no software, firmware or the like.
The term “firmware” typically refers to executable instructions, code, data, applications, programs, program modules, or the like maintained in an electronic device such as a ROM. The term “software” generally refers to computer-executable instructions, code, data, applications, programs, program modules, or the like maintained in or on any form or type of computer-readable media that is configured for storing computer-executable instructions or the like in a manner that is accessible to a computing device. The term “computer-readable media” and the like as used herein is strictly limited to one or more apparatus, article of manufacture, or the like that is not a signal or carrier wave per se. The term “computing device” as used in the claims refers to one or more devices such as computing device 101 and encompasses client devices, mobile devices, one or more servers, network services such as an Internet service or corporate network service, and the like, and any combination of such.
In summary, system 200 is configured for detecting faces in input images 211, generating a face identifier for each detected face in the set of images, grouping images that include faces of the same entity, and ranking images and/or groups of images based on the faces detected in the images. A set of images is typically provided by one or more sources as input 212 to the system. Such sources include camera phones, digital cameras, digital video recorders (“DVRs”), computers, digital photo albums, social media applications, image and video streaming web sites, and any other source of digital images. Note that one or more actual images may be input, or references to images, or any combination of such. Further, the phrase “grouping images”, adding an image to a group”, and the like as used herein include grouping actual images, grouping references to images, and/or any combination of the foregoing.
Facial recognition engine 210 is a module that accepts an image as input 212, detects one or more faces in the image, and detects various features in recognized faces. In one example, the functionality of module 210 may be provided in the form of a software development kit (“SDK”). In one example, facial recognition engine 210 may provide facial recognition data as one or more outputs, each of which may be stored in data store 220. One output may be in the form of a face identifier that identifies a detected face in an image 212. Given multiple detected faces in an image, a unique face identifier is typically provided for each face detected in the image. In one example, a face identifier may be a RECT data structure or the like that bounds certain aspects of the face it identifies. Such a RECT data structure may indicate a position in the image of the face it identifies, and/or may indicate a size or relative size of the identified face in the image. Any face identifier(s) that are output 212 may be accepted as input by data store 220, grouping engine 230, and/or ranking engine 240.
Another output 212 of facial recognition engine 210 may be in the form of a set of facial feature descriptors that describe facial features detected in a face corresponding to the face's identifier. Given multiple face identifiers as input, a corresponding set of facial feature descriptors is typically provided for each face identifier. In one example, the set of facial feature descriptors may be in the form of coordinates for each detected facial feature, such as the eyes, eyebrows, nose, and mouth of the face.
Another output 212 of facial recognition engine 210 may be in the form of a face score corresponding to a face identifier. Such a face score may be an overall quality score for the face that is based on facial feature analysis. In one example, the score may be a value between zero and one. In other examples, the score may be represented by a value within a continuous range, or by a quantization such as high, medium, or low, or the like. In one example, the face score may represent an overall measure of the quality of the face in the image, and may be based on a combination of analyzed aspects such as face sharpness, face expression, face pose, proximity to image edges, open/closed state of the face's eyes and mouth, and/or other aspects.
Another output 212 of facial recognition engine 210 may be in the form of a face signature that, across the images in the set, uniquely identifies an entity that the face represents, at least within the scope of the detected features. For example, if various face shots of Adam appear in several images in a set, then each of Adam's face shots will have the same face signature that uniquely identifies the entity “Adam”, at least within the scope of the detected features. Such a face signature is typically based on analysis of a face's identifier and corresponding set of facial feature descriptors. Such face signatures may be used to determine other faces in other images of the set that represent the same entity, and thus may be used to determine a frequency that a particular entity appears in the image set. The term “same entity” as used herein typically refers to a particular entity (e.g., a person).
Another output 212 of facial recognition engine 210 may be the set of images provided as input 211. One or more of these images, and/or references to them, may be stored in data store 220. Further, such images may also be retrieved as input 211 to facial recognition engine 210, as may any other of the outputs 212 stored in data store 220. Note that the movement of an image described herein, such as providing, retrieving, or the like, refers to movement of the actual image itself and/or to a reference to the actual image.
One example of facial recognition engine 210 is provided as system 200 described in U.S. patent application Ser. No. 14/266,795 entitled “RATING PHOTOS FOR TASKS BASED ON CONTENT AND ADJACENT SIGNALS” that was filed on Apr. 30, 2014 and that is incorporated herein by reference in its entirety.
Data store 220 is a persistent data store such as one or more database and/or other storage system. Data store 220 may be integral to system 200 or may be separate. Further, adjacent information input 221 from adjacent information sources may be provided to system 200 via data store 220, or may be provided directly to system 200 without necessarily being stored in data store 220. Adjacent information may be obtained from sources that are generally unrelated or indirectly related to the images in the set. In general, any system or data source that can be accessed by system 200 may be an adjacent information source. Non-limiting examples of adjacent information sources include calendars, social media applications, news sources, blogs, email, location tracking information, and any other source.
Grouping engine 230 is a module that accepts one or more of the outputs of facial recognition engine 210 directly and/or from data store 220. Grouping engine 230 may group images that include one or more faces with the same face signature. For example, if the first and third images in a set of images both include a face with the same face signature, then those two images may be grouped together, along with any other images in the set that include a face with the same face signature. Thus, each image in the group includes at least one face that represents the same entity as the other images in the group. Faces with a face score below a certain threshold may be excluded from the grouping as if not present in the image in which they are detected.
Grouping engine may also analyze face information from an image, such as a face identifier and/or the corresponding set of facial feature descriptors. The results of such analysis may be used separate from or in addition to face signatures in the grouping of images.
Groups that share faces of a single entity in common may be described as single-entity groups. Groups that share faces of two different entities in common may be described as double-entity groups, and so forth. For example, each image in a set may each show the same family of five people. In this example, all images in the set may be grouped together in a five-entity group because each image share five face signatures in common, one for each member of the family. Such a five-entity group may be described as a larger-entity group than a single-entity group or a four-entity group, and so forth.
Images without detected faces may be grouped together. Images with one or more detected faces that do not share face signatures with faces in any other images may be grouped by themselves.
Groupings of images produced by grouping engine 230 may be provided as output 232. Such provided groups may comprise the images themselves, or may be comprised of references to the images, or any combination of the foregoing. Such groupings may be automatically provided, such as being presented in photo albums, shared via social media applications, or the like. Priority in presenting such groupings may be given to groupings with larger numbers of images, and/or to larger-entity groups.
Ranking engine 240 is a module that accepts one or more of the outputs of facial recognition engine 210 directly and/or from data store 220, and/or groupings of images provided by grouping engine 230. Ranking engine 240 may rank images that include one or more faces based on face scores of the faces detected in the image. Faces with a score below a certain threshold may be excluded from the rankings as if not present in the image in which they are detected. Such scores may be weighted to reflect the relative importance of various faces and/or face aspects in the image. For example, faces with a higher frequency of appearance in the set of images may be weighted higher than those with a lower frequency of appearance, such as determined by face signatures and/or groupings. Further, faces of entities that are determined to be friends or family or the like of a person providing the set of images may be weighted higher than faces of entities that are not so determined. In one example, such a determination may be based on adjacent information input 221, or based on other input to system 200 such as input provided by the person or other entity.
Ranking engine 240 may also rank groupings of images based on the faces of the common entities in the images in the groupings. For example, larger-entity groupings may be ranked higher than smaller-entity groupings. Further, groupings with a larger number of faces of entities that are determined to be friends or family or the like may be ranked higher than groupings with a lesser number of such.
Rankings of images produced by ranking engine 240 may be provided as output 242. Such provided rankings may comprise the images themselves, or may be comprised of references to the images, or any combination of the foregoing. Such rankings may be automatically provided, such as being presented in photo albums, shared via social media applications, or the like.
Block 320 typically indicates receiving facial recognition data for a face detected in an image. Such data may include a face identifier, a set of facial feature descriptors, and face score, and/or a face signature. The image and/or a reference to the image may also be provided. Once the facial recognition data is received, method 300 typically continues at block 330.
Block 330 typically indicates considering a face score of the face corresponding to the received face identifier. Such a face score may be received at step 320 as part of the facial recognition data. In one example, the face score may be weighted. If the face score, with or without its weight, is below a certain threshold, then the face may be dropped from grouping consideration. In this case, method 300 continues at block 360. Otherwise, method 300 typically continues at block 340.
Block 340 typically indicates considering a face signature of the face corresponding to the received face identifier. Such a face signature may be received at step 320 as part of the facial recognition data. Note that the face signature indicates an entity that the face represents. In one example, the face signature is compared to that of any existing group(s). If no existing group is associated with the face signature, then a new group is created for that face signature, and the image is added to that group (step 350). For each existing group that is associated with the face signature, the image is added to that group (step 350). In this example, single-entity groups are created and filled.
For example, if the face signature indicates the entity “Adam”, and no group for images with faces of Adam exists, then a group is created for images with faces of Adam, and the image is added. If a group for faces of Adam already exists, then the image with Adam's face is added to the Adam group (step 350). The term “single-entity group” as used herein refers to a group of images where each image in the group includes a detected face that represents the same entity.
In another example, if the image with the face signature indicating Adam is also associated with a face signature indicating “Mary”, then the image is also added (step 350) to any existing group for Adam and Mary, or to a newly-created group if one does not already exist, and so forth for each additional face signature associated with the image. In this example, multi-entity groups are created and filled. Once the face signature(s) is considered and the image is added to the appropriate group(s), then the method typically continues at step 360. The term “multi-entity group” as used herein refers to an n-entity group of images where each image in the group includes a detected face that represents each of the same a entities. In one example, given a three-entity group with several images, each image includes three detected faces, one that represents each of Adam, Mary, and Jim.
Block 360 typically indicates determining if any more faces are present in the image. In one example, this determination is made based on the facial recognition data received at step 320. If there are additional faces in the image that have not yet been considered (e.g., based on a face identifier), then the method typically continues at step 330 for one of the yet-to-be considered faces in the image. Otherwise, the method typically continues at step 370.
Block 370 typically indicates determining if any more images are present in the set of images. If there are additional images in the set that have not yet been considered, then the method typically continues at step 310 for one of the yet-to-be considered images in the set. Otherwise, the method is typically done.
Block 420 typically indicates receiving facial recognition data for a face detected in an image. Such data may include a face identifier, a set of facial feature descriptors, and face score, and/or a face signature. The image and/or a reference to the image may also be provided. Once the facial recognition data is received, method 400 typically continues at block 430.
Block 430 typically indicates considering a face score of the face corresponding to the received face identifier. Such a face score may be received at step 420 as part of the facial recognition data. In one example, the face score may be weighted. If the face score, with or without its weight, is below a certain threshold, then the face may be dropped from ranking consideration. In this case, method 400 continues at block 460. Otherwise, method 400 typically continues at block 440.
Block 440 typically indicates considering a face signature of the face corresponding to the received face identifier. Such a face signature may be received at step 420 as part of the facial recognition data. Note that the face signature indicates an entity that the face represents. The face signature may be used to determine a frequency at which faces of a particular entity appears in an image or group of images. In one example, the face signature of an entity is associated with a count of detected faces in an image and/or a group of images that correspond to the entity. Facial recognition engine 210 may provide such a count given the face signature and an image, set of images, and/or grouping of images, or references thereto.
For example, if the face signature indicates the entity “Adam”, and the face signature is associated with a larger number of detected faces in the image and/or group of images, then the image and/or group may be ranked higher than if the face signature is associated with a lesser number of such detected faces (step 450).
In another example, if the image with the face signature indicating Adam is also associated with a face signature indicating “Mary”, then the image is a multi-entity image. In this example, multi-entity images with detected faces of a larger number of entities may be ranked higher than multi-entity images with a lesser number of such detected faces (step 450). In a similar fashion, multi-entity groups with a larger number of entities may be ranked higher than multi-entity groups with a lesser number of entities.
In yet another example, the rankings may be weighted by or based on the face signature indicating an entity determined to be a friend or family or the like. In this example, images and/or groups of images with a larger number of friends or family or the like may be ranked higher that images and/or groups of images with a lesser number of such.
The ranking of images or groups of images may be in the form of a relative rank between the images or groups. In another example, the ranking may be in the form of a score or priority assigned to each image or group.
Once the face signature is considered, then the method typically continues at step 460. Further, rankings may be finalized after all detected faces in an image and/or group of images, or all face signatures associated with an image and/or group of images, have been processed. For example, ranking information may have been accumulated for each considered face and/or each considered face signature. Such accumulated rankings may then be consolidated into a single ranking for an image and/or group of images. In one example, a sum or product of such accumulated rankings may be assigned as a finalized ranking to an image and/or group of images.
Block 460 typically indicates determining if any more faces are present in the image. In one example, this determination is made based on the facial recognition data received at step 420. If there are additional faces in the image that have not yet been considered (e.g., based on a face identifier), then the method typically continues at step 430 for one of the yet-to-be considered faces in the image. Otherwise, the method typically continues at step 470.
Block 470 typically indicates determining if any more images need to be considered, such as when processing a group of images as opposed to an individual image. If there are additional images that have not yet been considered, then the method typically continues at step 410 for one of the yet-to-be considered images. Otherwise, the method is typically done.
In view of the many possible embodiments to which the invention and the forgoing examples may be applied, it should be recognized that the examples described herein are meant to be illustrative only and should not be taken as limiting the scope of the present invention. Therefore, the invention as described herein contemplates all such embodiments as may come within the scope of the following claims and any equivalents thereto.
This Application is a Continuation of, and claims benefit from or priority of U.S. patent application Ser. No. 14/264,619 that was filed on Apr. 29, 2014, and that is incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
4868653 | Golin | Sep 1989 | A |
5475425 | Przyborski | Dec 1995 | A |
5544258 | Levien | Aug 1996 | A |
5687011 | Mowry | Nov 1997 | A |
6028960 | Graf | Feb 2000 | A |
6283858 | Hayes, Jr. | Sep 2001 | B1 |
6297825 | Madden | Oct 2001 | B1 |
6389181 | Shaffer | May 2002 | B2 |
6683982 | Kohn | Jan 2004 | B1 |
6757027 | Edwards | Jun 2004 | B1 |
6879709 | Tian | Apr 2005 | B2 |
7027054 | Cheiky et al. | Apr 2006 | B1 |
7082211 | Simon | Jul 2006 | B2 |
7155037 | Nagai et al. | Dec 2006 | B2 |
7194114 | Schneiderman | Mar 2007 | B2 |
7200561 | Moriya | Apr 2007 | B2 |
7254257 | Kim | Aug 2007 | B2 |
7337112 | Moriya | Feb 2008 | B2 |
7551754 | Steinberg | Jun 2009 | B2 |
7577295 | Constantin | Aug 2009 | B2 |
7577297 | Mori | Aug 2009 | B2 |
7580952 | Logan | Aug 2009 | B2 |
7639877 | Shiota | Dec 2009 | B2 |
7680327 | Weiss | Mar 2010 | B2 |
7715598 | Li | May 2010 | B2 |
7751599 | Chen | Jul 2010 | B2 |
7804982 | Howard et al. | Sep 2010 | B2 |
7864967 | Takeuchi | Jan 2011 | B2 |
7978925 | Souchard | Jul 2011 | B1 |
8019060 | Martino | Sep 2011 | B2 |
8024564 | Bassani et al. | Sep 2011 | B2 |
8078623 | Chou | Dec 2011 | B2 |
8121408 | Omori | Feb 2012 | B2 |
8130281 | Kaneda et al. | Mar 2012 | B2 |
8150098 | Gallagher | Apr 2012 | B2 |
8154384 | Hirai | Apr 2012 | B2 |
8155400 | Bronstein | Apr 2012 | B2 |
8165352 | Mohanty | Apr 2012 | B1 |
8170298 | Li | May 2012 | B2 |
8212294 | Hoke | Jul 2012 | B2 |
8212894 | Nozaki | Jul 2012 | B2 |
8213690 | Okada et al. | Jul 2012 | B2 |
8224036 | Maruyama | Jul 2012 | B2 |
8306280 | Nozaki | Nov 2012 | B2 |
8326000 | Jung et al. | Dec 2012 | B2 |
8331632 | Mohanty | Dec 2012 | B1 |
8335851 | Vendrow | Dec 2012 | B1 |
8345934 | Obrador | Jan 2013 | B2 |
8358811 | Adams et al. | Jan 2013 | B2 |
8364802 | Keagy et al. | Jan 2013 | B1 |
8384791 | Porter | Feb 2013 | B2 |
8410903 | Hirai | Apr 2013 | B2 |
8418139 | Dhanakshirur et al. | Apr 2013 | B2 |
8421874 | Okamoto | Apr 2013 | B2 |
8488847 | Nozaki | Jul 2013 | B2 |
8526683 | Maruyama | Sep 2013 | B2 |
8532347 | Bourdev | Sep 2013 | B2 |
8538091 | Kaneda | Sep 2013 | B2 |
8553037 | Smith et al. | Oct 2013 | B2 |
8559722 | Tsuji | Oct 2013 | B2 |
8611678 | Hanson | Dec 2013 | B2 |
8620649 | Gao | Dec 2013 | B2 |
8624994 | Kaneda et al. | Jan 2014 | B2 |
8649604 | Steinberg et al. | Feb 2014 | B2 |
8680439 | Shei et al. | Mar 2014 | B2 |
8705806 | Nakano | Apr 2014 | B2 |
8761512 | Buddemeier | Jun 2014 | B1 |
8839222 | Brandwine et al. | Sep 2014 | B1 |
8880439 | Archambeau | Nov 2014 | B2 |
8924315 | Archambeau | Dec 2014 | B2 |
9262152 | Kurian et al. | Feb 2016 | B1 |
9330630 | Kerofsky | May 2016 | B2 |
9373179 | Negrila et al. | Jun 2016 | B2 |
9645808 | Turpie | May 2017 | B1 |
20030068100 | Covell | Apr 2003 | A1 |
20040240711 | Hamza | Dec 2004 | A1 |
20050163372 | Kida | Jul 2005 | A1 |
20060034542 | Aoyama | Feb 2006 | A1 |
20060088209 | Yu | Apr 2006 | A1 |
20060155777 | Shih et al. | Jul 2006 | A1 |
20060244845 | Craig | Nov 2006 | A1 |
20060253491 | Gokturk | Nov 2006 | A1 |
20060280341 | Koshizen | Dec 2006 | A1 |
20060290705 | White | Dec 2006 | A1 |
20070002478 | Mowry | Jan 2007 | A1 |
20070053607 | Mitsunaga | Mar 2007 | A1 |
20070058878 | Gomilla | Mar 2007 | A1 |
20070136324 | Xu et al. | Jun 2007 | A1 |
20070172099 | Park | Jul 2007 | A1 |
20080014563 | Visani | Jan 2008 | A1 |
20080089561 | Zhang | Apr 2008 | A1 |
20080183751 | Cazier et al. | Jul 2008 | A1 |
20080204598 | Maurer et al. | Aug 2008 | A1 |
20080212894 | Demirli | Sep 2008 | A1 |
20090010623 | Date et al. | Jan 2009 | A1 |
20090028380 | Hillebrand | Jan 2009 | A1 |
20090087099 | Nakamura | Apr 2009 | A1 |
20090116749 | Cristinacce | May 2009 | A1 |
20090180671 | Lee | Jul 2009 | A1 |
20090185723 | Kurtz | Jul 2009 | A1 |
20090193445 | Thakker | Jul 2009 | A1 |
20090219405 | Kaneda et al. | Sep 2009 | A1 |
20090313546 | Katpelly | Dec 2009 | A1 |
20100008233 | Ee et al. | Jan 2010 | A1 |
20100027663 | Dai | Feb 2010 | A1 |
20100054544 | Arguelles | Mar 2010 | A1 |
20100082316 | Chawla et al. | Apr 2010 | A1 |
20100189313 | Prokoski | Jul 2010 | A1 |
20100205177 | Sato | Aug 2010 | A1 |
20100329533 | Omi | Dec 2010 | A1 |
20110007174 | Bacivarov | Jan 2011 | A1 |
20110010319 | Harada | Jan 2011 | A1 |
20110031113 | Lopatin et al. | Feb 2011 | A1 |
20110052081 | Onoe | Mar 2011 | A1 |
20110064331 | Andres del Valle | Mar 2011 | A1 |
20110087666 | Chou et al. | Apr 2011 | A1 |
20110091113 | Ito | Apr 2011 | A1 |
20110129159 | Cifarelli | Jun 2011 | A1 |
20110135166 | Wechsler | Jun 2011 | A1 |
20110158536 | Nakano | Jun 2011 | A1 |
20110176058 | Biswas et al. | Jul 2011 | A1 |
20120014560 | Obrador | Jan 2012 | A1 |
20120027311 | Cok | Feb 2012 | A1 |
20120076427 | Hibino | Mar 2012 | A1 |
20120084655 | Gallagher | Apr 2012 | A1 |
20120106859 | Cheatle | May 2012 | A1 |
20120148159 | Kaneda et al. | Jun 2012 | A1 |
20120188382 | Morrison | Jul 2012 | A1 |
20120236184 | Jia et al. | Sep 2012 | A1 |
20120308124 | Belhumeur | Dec 2012 | A1 |
20130148864 | Dolson et al. | Jun 2013 | A1 |
20130151441 | Archambeau | Jun 2013 | A1 |
20130156275 | Amacker | Jun 2013 | A1 |
20130179172 | Nakamura et al. | Jul 2013 | A1 |
20130226587 | Cheung | Aug 2013 | A1 |
20130227415 | Gregg | Aug 2013 | A1 |
20130243328 | Irie | Sep 2013 | A1 |
20130266196 | Kono | Oct 2013 | A1 |
20130297769 | Chang et al. | Nov 2013 | A1 |
20130305210 | Sharma et al. | Nov 2013 | A1 |
20130311423 | Price et al. | Nov 2013 | A1 |
20140006420 | Sparrow et al. | Jan 2014 | A1 |
20140029859 | Libin | Jan 2014 | A1 |
20140046914 | Das | Feb 2014 | A1 |
20140050419 | Lerios | Feb 2014 | A1 |
20140072242 | Wei | Mar 2014 | A1 |
20140087355 | Henry et al. | Mar 2014 | A1 |
20140211065 | Sudheendra | Jul 2014 | A1 |
20140304297 | Lian et al. | Oct 2014 | A1 |
20140341443 | Cao | Nov 2014 | A1 |
20150160961 | Johnson et al. | Jun 2015 | A1 |
20150234725 | Cillis et al. | Aug 2015 | A1 |
20150302040 | Amigud et al. | Oct 2015 | A1 |
20150347734 | Beigi | Dec 2015 | A1 |
20160379343 | Suri et al. | Dec 2016 | A1 |
20170364737 | Chan et al. | Dec 2017 | A1 |
20190354748 | Lee et al. | Nov 2019 | A1 |
Number | Date | Country |
---|---|---|
1337619 | Feb 2002 | CN |
1649420 | Aug 2005 | CN |
101930595 | Dec 2010 | CN |
102368194 | Mar 2012 | CN |
102722364 | Oct 2012 | CN |
102884526 | Jan 2013 | CN |
1965389 | Sep 2008 | EP |
2 096 577 | Sep 2009 | EP |
2 267 655 | Dec 2010 | EP |
2 312 462 | Apr 2011 | EP |
2009082814 | Jul 2009 | WO |
2009128021 | Oct 2009 | WO |
2011014138 | Feb 2011 | WO |
Entry |
---|
“Non-Final Office Action Issued in U.S. Appl. No. 14/264,619”, dated Nov. 2, 2016, 11 Pages. |
“Final Office Action Issued in U.S. Appl. No. 14/266,795”, dated Dec. 21, 2017, 17 Pages. |
“Non Final Office Action Issued in U.S. Appl. No. 15/497,423”, dated Jun. 25, 2018, 8 Pages. |
Friedman, Jerome H., “Greedy Function Approximation: A Gradient Boosting Machine”, In the Proceeding of Annals of Statistics, vol. 29, Issue 5, Feb. 24, 1999, 34 Pages. |
Lee, et al., “Deterministic Coreference Resolution Based on Entity-Centric, Precision-Ranked Rules”, In Journal of Computational Linguistics, vol. 39 Issue 4, Dec. 1, 2013, 32 Pages. |
Sparck Jones, et al., “Report on the Need for and Provision of an ‘Ideal’ Information Retrieval Test Collection”, Retrieved From «http://sigir.org/files/museum/pub-14/pub_14.pdf», Dec. 11, 1975, 44 Pages. |
Wu, et al, “Who's Mary Jane? Understanding Fictional Characters”, In the Proceedings of 23rd International World Wide Web Conference, Apr. 7, 2014, 212 Pages. |
Kumar, “Attribute and Simile Classifiers for Face Verification”, Proceedings of the 12th IEEE International Conference on Computer Vision (ICCV), Oral Session 14: Recognition, Detection and Matching, Sep. 27-Oct. 4, 2009. |
Kumar, “Describable Visual Attributes for Face Verification and Image Search”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 33, Issue 10, Oct. 2011. |
Kumar, “Face Recognition Using Gabor Wavelets”, Proceedings of the 40th IEEE Asilomar Conference on Signals, Systems and Computers, Oct. 29-Nov. 1, 2006. |
Lei, “Face Recognition by Exploring Information Jointly in Space, Scale and Orientation”, IEEE Transactions on Image Processing, vol. 20, No. 1, Jan. 2011. |
Li, “Probabilistic Models for Inference about Identity”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, No. 1, Jan. 2012 (published online May 13, 2011). |
Liang, “Face Alignment via Component-based Discriminative Search”, Proceedings of the 10th European Conference on Computer Vision (ECCV'08), Part II, Oct. 12-18, 2008. |
Moghaddam, “Bayesian face recognition”, The Journal of Pattern Recognition Society, vol. 33, No. 11, Nov. 2000. |
Nguyen, “Cosine Similarity Metric Learning for Face Verification”, Proceedings of the 10th Asian Conference on Computer Vision (ACCV'10), Part II, Nov. 8-12, 2010. |
Jala, “A Generalized Local Binary Pattern Operator for Multiresolution Gray Scale and Rotation Invariant Texture Classification”, Proceedings of the 2nd International Conference on Advances in Pattern Recognition (ICAPR'01), Mar. 11-14, 2001. |
Phillips, “The FERET Evaluation Methodology for Face-Recognition Algorithms”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, No. 1 0, Oct. 2000. |
Ramanan, “Local Distance Functions: A Taxonomy, New Algorithms, and an Evaluation”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 33, No. 4, Apr. 2011 (published online Jun. 28, 2010). |
Seo, “Face Verification Using the LARK Representation”, IEEE Transactions on Information Forensics and Security, vol. 6, No. 4, Dec. 2011. |
Susskind, “Modeling the joint density of two images under a variety of transformations”, Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR'11), Jun. 20-25, 2011. |
Taigman, “Leveraging Billions of Faces to Overcome Performance Barriers in Unconstrained Face Recognition”, arXiv:1108.1122v1, Aug. 4, 2011. |
Taigman, “Multiple One-Shots for Utilizing Class Label Information”, Proceedings of the British Machine Vision Conference (BMVC'09), Sep. 7-10, 2009. |
Wang, “A Unified Framework for Subspace Face Recognition”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, No. 9, Sep. 2004. |
Wang, “Boosted Multi-Task Learning for Face Verification with Applications to Web Image and Video Search”, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'09), Jun. 20-25, 2009. |
Wang, “Subspace Analysis Using Random Mixture Models”, Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), Jun. 20-25, 2005. |
Wang, “Bayesian Face Recognition Using Gabor Features”, Proceedings of the 2003 ACM SIGMM Workshop on Biometrics Methods and Applications (WBMA'03), Nov. 8, 2003. |
European Patent Office, Officer Cristina Darolti, PCT International Preliminary Report on Patentability for Application No. PCT/US2015/026971, 8 pages, dated Aug. 10, 2016, Netherlands. |
European Patent Office, Officer Matthieu Denoual, PCT International Preliminary Report on Patentability for Application No. PCT/US2015/027689, dated Jul. 18, 2016, 9 Pages, Netherlands. |
European Patent Office, Officer Arnaud Maetz, PCT International Preliminary Report on Patentability for Application No. PCT/US2015/036595, 8 pages, dated Oct. 7, 2016, Netherlands. |
European Patent Office, Officer Paula Ipinazar, PCT International Preliminary Report on Patentability for Application No. PCT/US2015/035219, pp. 1-9, dated Jun. 23, 2016, Germany. |
PCT Second Written Opinion for Application No. PCT/US2015/036595, dated May 31, 2016. |
PCT Second Written Opinion for Application No. PCT/US2015/027689, dated Apr. 1, 2016. |
PCT Second Written Opinion for Application No. PCT/US2015/035219, dated Apr. 26, 2016. |
PCT International Preliminary Report on Patentability for Application No. PCT/US2015/027688, dated Apr. 26, 2016. |
PCT Second Written Opinion for Application No. PCT/US2015/026971, dated Mar. 30, 2016. |
“International Search Report & Written Opinion Issued in PCT Application No. PCT/US2015/036595”, dated Sep. 24, 2015, 10 Pages. |
PCT International Search Report and Written Opinion for Application No. PCT/US2015/027689, dated Jul. 8, 2015. |
European Patent Office, Officer Cristina Darolti, PCT International Search Report and Written Opinion for Application No. PCT/US2015/026971, dated Jul. 24, 2015, 15 Pages, Netherlands. |
Tian, “Handbook of Face Recognition, Chapter 11, Facial Expression Analysis”, Springer, 2005. |
Cootes, “Handbook of Face Recognition, Chapter 3, Modeling Facial Shape and Appearance”, Springer, 2005. |
Weinberger, “Distance Metric Learning for Large Margin Nearest Neighbor Classification”, Proceedings of the Conference on Advances in Neural Information Processing Systems 18 (NIPS'05), Dec. 5-8, 2005. |
Yin, “An Associate-Predict Model for Face Recognition”, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'11), Jun. 20-25, 2011. |
Ying, “Distance Metric Learning with Eigenvalue Optimization”, Journal of Machine Learning Research, vol. 13, No. 1, Jan. 2012. |
Zhang, “Two-Dimensional Bayesian Subspace Analysis for Face Recognition”, Proceedings of the 4th International Symposium on Neural Networks (ISNN'07), Part II, Jun. 3-7, 2007. |
Zhu, “A Rank-Order Distance based Clustering Algorithm for Face Tagging”, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'11), Jun. 20-25, 2011. |
Li, “Joint and Implicit Registration for Face Recognition”, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'09), Jun. 20-25, 2009. |
PCT International Search Report and Written Opinion for Application No. PCT/US2015/035219, dated Sep. 29, 2015. |
Cvetkovic, “Image enhancement circuit using non-linear processing curve and constrained histogram range equalization”, In Proceedings of the SPIE and IS&T Conference on Visual Communications and Image Processing, vol. 5308, Jan. 7, 2004. |
PCT International Search Report and Written Opinion for Application No. PCT/US2015/027688, dated Jul. 9, 2015. |
PCT Written Opinion for Application No. PCT/US/2015/027688, dated Feb. 9, 2016. |
Tsay, et al., “Personal Photo Organizer based on Automated Annotation Framework”, In 5th International Conference on Intelligent Information Hiding and Multimedia Signal Processing, Sep. 12, 2009, 4 pages. |
Van Zwol, et al., “Prediction of Favourite Photos using Social, Visual, and Textual Signals”,In Proceedings of the International Conference on Multimedia, Oct. 25, 2010, 4 pages. |
De Neve, et al., “Face Recognition for Personal Photos using Online Social Network Context and Collaboration”, In Guest Lecture at KAIST, Dec. 14, 2010,54 pages. |
Mavridis, et al., “Friends with Faces: How Social Networks Can Enhance Face Recognition and Vice Versa”, In Proceedings of Computational Social Networks Analysis: Trends, Tools and Research Advances, May 24, 2010, 30 pages. |
Choi, et al., “Face Annotation for Personal Photos Using Collaborative Face Recognition In Online Social Networks”, In 16th International Conference on Digital Signal Processing, Jul. 5, 2009,8 pages. |
Starr, Michelle, “Facial recognition app matches strangers to online profiles”, Published on: Jan. 7, 2014, Available at: http://www.cnet.com.au/facial-recognition-app-matches-strangers-to-on line-profiles-339346355. htm. |
Ding, “Handbook of Face Recognition, Second Edition, Chapter 12, Facial Landmark Localization”, Springer, 2011. |
Shah, “All Smiles: Automatic Photo Enhancement by Facial Expression Analysis”, In Proceedings of the Conference on Visual Media Production (CVMP'12), Dec. 5-6, 2012. |
Perenson, “In-depth look at Google+ photo update with the team that designed it”, dpreview.com, May 17, 2013. |
Lanitis, “Toward Automatic Simulation of Aging Effects on Face Images”, In Proceedings of the IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, No. 4, Apr. 2002. |
Lu, “Context-Aware Textures”, In Journal of ACM Transactions on Graphics, vol. 26, No. 1, Jan. 2007. |
Gooch, “Color2Gray: Salience-Preserving Color Removal”, In Proceedings of the ACM SIGGRAPH Transactions on Graphics, vol. 24, No. 3, Jul. 2005. |
Dong, “Image Retargeting by Content-Aware Synthesis”, arXiv:1403.6566v1 [cs.GR], Mar. 26, 2014. |
Shan, “Image-Based Surface Detail Transfer”, In IEEE Computer Graphics and Applications, vol. 24, No. 3, May-Jun. 2004. |
Papadopoulos, “Image Clustering Through Community Detection on Hybrid Image Similarity Graphs”, In Proceedings of the 17th IEEE International Conference on Image Processing (ICIP'1 0), Sep. 26-29, 2010. |
Choudhury, “A framework for robust online video contrast enhancement using modularity optimization”, In Proceedings of the IEEE Transactions on Circuits and Systems for Video Technology, vol. 22, No. 9, Sep. 2012. |
“Interest Point Detection”, From Wikipedia, the free encyclopedia, Last modified on Jun. 1, 2013. |
Belhumeur, “Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, No. 7, Jul. 1997. |
Cao, “Face Recognition with Learning-based Descriptor”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR'10), Jun. 13-18, 2010. |
Chen, “Bayesian Face Revisited: A Joint Formulation”, Proceedings of the 12th European Conference on Computer Vision (ECCV'12), Part III, Oct. 7-13, 2012. |
Chen, “Supplemental material for Bayesian Face Revisited: A Joint Formulation”, Feb. 2013. |
Davis, “Information-Theoretic Metric Learning”, Proceedings of the 24th International Conference on Machine Learning (ICML'07), Jun. 20-24, 2007. |
Diez, “Optimization of a Face Verification System Using Bayesian Screening Techniques”, Proceedings of the 23rd IASTED International Multi-Conference on Artificial Intelligence and Applications, Feb. 14-16, 2005. |
Guillaumin, “Is that you? Metric Learning Approaches for Face Identification”, Proceedings of the IEEE 12th International Conference on Computer Vision (ICCV'09), Sep. 29-Oct. 2, 2009. |
Huang, “Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments”, Proceedings of the 10th European Conference on Computer Vision (ECCV'08), Oct. 12-18, 2008. |
IOFFE, “Probabilistic Linear Discriminant Analysis”, Proceedings of the 9th European Conference on Computer Vision (ECCV'06), May 7-13, 2006. |
“Second Office Action Issued in Chinese Patent Application No. 201580032034.1”, dated Apr. 22, 2019, 13 Pages. |
“Second Office Action Issued in Chinese Patent Application No. 201580034192.0”, dated Apr. 22, 2019, 10 Pages. |
“Office Action Issued in European Patent Application No. 15721468.5”, dated May 24, 2019, 8 Pages. |
“First Office Action Issued in Chinese Patent Application No. 201580018659.2”, dated Dec. 25, 2018, 19 Pages. |
“Office Action Issued in Chinese Patent Application No. 201580018659.2”, dated Aug. 23, 2019, 15 Pages. |
“Office Action Issued in Chinese Patent Application No. 201580024796.7”, dated Aug. 26, 2019, 18 Pages. |
“Office Action Issued in Chinese Patent Application No. 201580032034.1”, dated Aug. 30, 2019, 6 Pages. |
“Office Action Issued in Chinese Patent Application No. 201580034192.0”, dated Sep. 2 , 2019, 7 Pages. |
“Second Office Action And Search Report Issued in Chinese Application No. 201580028549.4”, dated Jul. 3, 2019, 10 Pages. |
“Non Final Office Action Issued in U.S. Appl. No. 16/019,939”, dated Sep. 10, 2019, 12 Pages. |
“Second Office Action Issued in Chinese Patent Application No. 201580029370.0”, dated Sep. 6, 2019, 6 Pages. |
“Office Action Issued in European Patent Application No. 15745006.5”, dated Dec. 20, 2019, 4 Pages. |
“Office Action Issued in Chinese Patent Application No. 201580018659.2”, dated Dec. 25, 2019, 11 Pages. |
Number | Date | Country | |
---|---|---|---|
20170364737 A1 | Dec 2017 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 14264619 | Apr 2014 | US |
Child | 15692494 | US |