The current state of media-capture technology allows users to generate and store a large number of digital media items, such as photographs, videos, voice recordings, and so on. For example, a user may use his or her smartphone or wearable computing device to produce dozens of media items in the course of a single day. The user may then transfer these media items to a personal computer and/or a cloud storage service.
However, the proliferation of digital media makes it difficult for users to later retrieve media items of interest. In some cases, a user may simply forget that certain media items exist. In other cases, a user may have a vague recollection of generating the media items, but the user may have difficulty finding them again. In traditional practice, a user may manually organize collections of media items into meaningful folders. The user may then manually search through a directory of folders to find the desired media items. In addition, or alternatively, a user may add descriptive tags to the media items. The user may then use a keyword-based search interface to attempt to find media items of interest, that is, by finding media items having tags which match specified search terms. These approaches, however, offer poor user experience. For instance, these approaches are labor-intensive and cumbersome in nature, and are not always successful in retrieving the desired media items.
The above potential drawbacks in existing retrieval strategies are cited by way of illustration, not limitation; existing retrieval strategies may have further shortcomings.
A Media Presentation System (MPS) is described herein which receives and analyzes a plurality of media items pertaining to a user. The MPS then attempts to match the user's current activity with at least one pattern of previous user activity which is exhibited by the media items. The MPS then generates and delivers a user interface presentation to the user that conveys at least one media item that pertains to the pattern of previous user activity.
By virtue of the above approach, the user will receive media items that are relevant to his or her current circumstance, in a timely fashion, and without having to manually hunt for the media items, or without even having to remember that the media items exist. The media items may allow the user to enjoyably reminiscence about previous events that are relevant to his or her current situation.
Consider one concrete example. A user may visit her grandmother every year, around the same time, and in the same city. In a current visit, the MPS can detect that the user is engaged in a particular activity, namely, visiting her grandmother. The MPS can then determine that the current activity matches a pattern of prior conduct by the user—that is, visiting her grandmother on a yearly basis over the course of several prior years. The MPS can then deliver a collection of digital photographs to the user which captures her prior trips to visit her grandmother. The user may enjoy the retrospective provided by the collection, particularly since it coincides with her current activity.
The MPS can formulate the user interface presentation in different ways, such as a timeline-type format, a collage-type format, a time lapse animation sequence, and so on. In one particular case, the MPS can also present the user interface presentation in the context of an ongoing conversation between two more users, conducted via a communication system (such as a video communication system). The media items that are displayed may show snapshots or video clips taken from prior communication sessions between the two users, and/or other media items that are relevant to the two users. The media items in that context may facilitate conversation between the two users, as well as add to the enjoyment of the two users.
The above approach can be manifested in various types of systems, devices, components, methods, computer readable storage media, data structures, graphical user interface presentations, articles of manufacture, and so on.
This Summary is provided to introduce a selection of concepts in a simplified form; these concepts are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.
The same numbers are used throughout the disclosure and figures to reference like components and features. Series 100 numbers refer to features originally found in
This disclosure is organized as follows. Section A provides an overview of a Media Presentation System (MPS). Section B sets forth processes which describe one manner of operation of the MPS of Section A. Section C describes illustrative computing functionality that can be used to implement any aspect of the features described in Sections A and B.
As a preliminary matter, some of the figures describe concepts in the context of one or more structural components, variously referred to as functionality, modules, features, elements, etc. The various components shown in the figures can be implemented in any manner by any physical and tangible mechanisms, for instance, by software running on computer equipment, hardware (e.g., chip-implemented logic functionality), etc., and/or any combination thereof. In one case, the illustrated separation of various components in the figures into distinct units may reflect the use of corresponding distinct physical and tangible components in an actual implementation. Alternatively, or in addition, any single component illustrated in the figures may be implemented by plural actual physical components. Alternatively, or in addition, the depiction of any two or more separate components in the figures may reflect different functions performed by a single actual physical component.
Other figures describe the concepts in flowchart form. In this form, certain operations are described as constituting distinct blocks performed in a certain order. Such implementations are illustrative and non-limiting. Certain blocks described herein can be grouped together and performed in a single operation, certain blocks can be broken apart into plural component blocks, and certain blocks can be performed in an order that differs from that which is illustrated herein (including a parallel manner of performing the blocks). The blocks shown in the flowcharts can be implemented in any manner by any physical and tangible mechanisms, for instance, by software running on computer equipment, hardware (e.g., chip-implemented logic functionality), etc., and/or any combination thereof.
As to terminology, the phrase “configured to” encompasses any way that any kind of physical and tangible functionality can be constructed to perform an identified operation. The functionality can be configured to perform an operation using, for instance, software running on computer equipment, hardware (e.g., chip-implemented logic functionality), etc., and/or any combination thereof.
The term “logic” encompasses any physical and tangible functionality for performing a task. For instance, each operation illustrated in the flowcharts corresponds to a logic component for performing that operation. An operation can be performed using, for instance, software running on computer equipment, hardware (e.g., chip-implemented logic functionality), etc., and/or any combination thereof. When implemented by computing equipment, a logic component represents an electrical component that is a physical part of the computing system, however implemented.
The following explanation may identify one or more features as “optional.” This type of statement is not to be interpreted as an exhaustive indication of features that may be considered optional; that is, other features can be considered as optional, although not expressly identified in the text. Further, any description of a single entity is not intended to preclude the use of plural such entities; similarly, a description of plural entities is not intended to preclude the use of a single entity. Finally, the terms “exemplary” or “illustrative” refer to one implementation among potentially many implementations.
A. Overview of the Media Presentation System
The media items can include any type of content, or any combination of digital content types. For example, a media item can include any combination of: static image content; video content; audio content; graphic content (e.g., produced by a game application, simulator, etc.); textual content, and so on. A user may use one or more media sources (106, 108, . . . , 110) to produce the media items. For example, the user may use a digital camera to generate digital photographs. The user may use a video camera to produce digital videos. The user may use one or more audio recording devices to produce audio items. The user may use a game console to produce graphical items, and so on. In some cases, a media source may represent a device that is designed for the main purpose of recording digital media. A digital camera is one such type of device. In other cases, a media source may correspond to a device that performs multiple functions, one of which corresponds to recording digital media. A smartphone is an example of one such device.
In other cases, a media source may represent an archive data store at which the user stores media items, such as a cloud-based data store. In other cases, a media source may correspond to a user's social network profile or the like at which the user maintains media items, and so on. Generally, the assumption here is that the user creates his or her media items, e.g., by taking his or her own digital photographs. But in other cases, at least some of the media items may be selected by the user, but produced by others.
A data receiving component 112 receives media items from the various media sources (106, 108, . . . 110). The data receiving component 112 can collect media items using a push-based approach, a pull-based approach, or some combination thereof. In a push-based approach, a user may expressly and manually upload media items to the data receiving component 112. Or a media source may automatically initiate the transfer of media items to the data receiving component 112. In a pull-based approach, the data receiving component 112 may poll the various media sources (106, 108, . . . 110) and collect any new media items they may provide.
The data receiving component 112 may store the media items and the supplemental data in a data store 114. More specifically, the data store 114 can store media items for a plurality of users, not just the single user depicted in
A media analysis component 116 analyzes the media items to provide an analysis result. The following description will provide a detailed explanation of one manner of operation of the media analysis component 116. By way of overview, the media analysis component 116 can first filter out low-quality media items and redundant media items. The media analysis component 116 can then perform content analysis on each media item to determine the characteristics of the media item. The media analysis component 116 can then update an index provided in a data store 118 to reflect the results of its analysis. The index serves as a mechanism that can be used to later retrieve media items that have desired characteristics. The media analysis component 116 can also store a corpus of processed media items in a data store 120. The processed media items may correspond to the original set of collected media items, minus the media items that have been assessed as having low quality and/or being redundant. The media analysis component 116 can also optionally transform some of the original media items in any manner to produce the processed media items, such as performing cropping, resizing, etc. on the original media items.
An event detection component 122 detects an input event. The input event reflects a current activity of the user. Here, the term “current activity” is intended to have broad meaning. The current activity generally refers to behavior by the user that is centered on or associated with the current time, but does not necessarily occur at the current time. For example, the current activity may describe behavior that has occurred, is presently occurring, or is about to occur, with respect to a current point in time. Further, the current activity may describe a wide variety of actions, such as performing a task, attending an event, visiting a location, and so on.
For example, in one case, an input event may indicate that the user is about to participate in an event that is scheduled to occur on a specified future date, or is currently taking part in the event on the specified date, or has recently taken part in the event. The event detection component 122 can detect this type of event based primarily on a determination of the current date in relation to the specified date. For example, the event detection component 122 can generate an input event when the calendar date reaches December 23rd, based on the presumption that the user may be about to engage in, holiday-related activities.
In another case, an input event may indicate that the user is about to visit a particular location. Or the input event may indicate that the user is currently visiting that location, or has recently visited the location. The event detection component 122 can detect this type of event in different ways, such by identifying geographical reference coordinates that are associated with media items that the user is currently uploading. In one case, the user may be actually present at the identified location at the time he or she has uploaded the media items. In another case, the user may no longer be at that site. In addition, or alternatively, the event detection component 122 can determine the location of the user via one or more location determination mechanisms that are incorporated into the user's mobile computing device or which are otherwise accessible to these devices, such as satellite-based location-determination services, triangulation mechanisms, dead-reckoning mechanisms, proximity-to-signal-source-based mechanisms, etc. The motion of the computing device (as assessed by accelerometers, gyroscopes, etc.) may also be relevant to the user's current activity.
In another case, an input event may indicate that the user is performing a specific activity, where that activity is not necessarily tied to a telltale location or time. For example, the input event may indicate that the user is currently playing a particular sport. The event detection component 122 can detect such an activity in different ways, such as by receiving a media item from the user that depicts the activity, and leveraging the media analysis component 116 to recognize the activity that is taking place in the media item. The event detection component 122 can alternatively, or in addition, detect the location of the user in the above-described manner, e.g., by comparing objects that appear in the user's recently uploaded media items with telltale landmark objects, associated with particular places.
In another case, an input event may indicate that the user is currently interacting with another user via a communication system of any type, such as a video communication system, an instant messaging system, an Email system, and so on. Or the input event may indicate that the user is preparing to take part in a communication session, or has just taken part in such a session. The event detection component 122 can detect this type of input event based on information supplied by the communication system. The event detection component 122 can also receive information from the communication system which reveals the identities of the people involved in the communication session.
In another case, an input event may indicate that the user is currently performing a particular activity in an online space, by himself or herself, or with another person. For example, the user may be currently shopping for a certain item, reading a particular news article on a particular topic, performing a financial transaction, playing a game, and so on. The event detection component 122 can detect the above types of actions, with the appropriate permission of the user, based on information exchanged between the user and one or more online services.
The above-described input events are cited by way of example, not limitation.
A presentation processing component 124 performs two functions. It first determines whether the current presumed user activity, reflected by the input event(s), matches a previous pattern of user activity, as exhibited by the media items that have been processed by the media analysis component 116. These media items are referred to below as relevant media items. If such a pattern is detected, then the presentation processing component 124 generates a user interface presentation that conveys one or more of the relevant media items to the user. If there are no relevant media items, then the presentation processing component 124 will refrain from generating a user interface presentation. Alternatively, in the absence of relevant media items, the presentation processing component 124 may present other default content, such as randomly selected photo items drawn from a user's personal archive of items, etc. Or the presentation processing component 124 can make a low confidence guess as to the user's current circumstance, and present media items that match that guess.
The description will later explain in greater detail how the presentation processing component 124 performs the above two tasks. By way of overview, with respect to the first task, the presentation processing component 124 attempts to find one or more previously-captured (historical) media items that have one or more characteristics in common with the user's current presumed activity.
Further, the presentation processing component 124 may choose only those matching patterns of previous user activity that represent significant events, from the standpoint of the user, based on one or more significance-based considerations. For example, the presentation processing component 124 may determine that the user is currently riding the bus to her work. The presentation processing component 124 may further determine that the user's current activity matches a previous pattern of conduct exhibited by the user—namely, repeatedly riding the bus to work. However, based on one configuration of the MPS 102, the presentation processing component 124 may label this previous pattern of conduct as insignificant, and thus refrain from generating a user interface presentation under this circumstance. That is, the presentation processing component 124 makes the rebuttable/correctable assumption that an event that is extremely common is also uninteresting.
A delivery framework 126, while not considered part of the MPS 102 itself, may deliver the user interface presentation, generated by the presentation processing component 124, to at least the user. In one case, the delivery framework 126 may send a notification to the user's computing device (e.g., which may correspond to a tablet-type device, a smartphone, etc.). The user may then affirmatively respond to an invitation provided by the notification to invoke the media experience provided by the MPS 102. In another case, the delivery framework 126 corresponds to an Email system, or the like. For example, the delivery framework 126 can include the relevant media items as an attachment to an Email. In another case, the delivery framework 126 can provide a link which the user can activate to view the relevant media items, and so on.
In another case, the delivery framework 126 can integrate the relevant media items into a more encompassing user interface presentation provided by any communication system, such as a video communication system, an instant messaging communication system, and so on. For example, in the context of a video communication system (such as the Skype™ communication system provided by Microsoft® Corporation of Redmond, Wash.), the overall user interface presentation may include a first portion devoted to displaying a real-time picture of a first and second user who are communicating with each other, or just the remote user (from the vantage point of a local user who is interacting with the user interface presentation). The user interface presentation may devote a second portion for displaying related media items that pertain to the first user and/or the second user. For example, in one particular case, the relevant media items may correspond to snapshots or video clips extracted from previous communication sessions between the first user and the second user.
In another case, the MPS 102 can incorporate the user interface presentation into a screensaver or the like, presented by the user's computing device. In another case, the MPS 102 can incorporate the user interface presentation into a tile or widget, etc. The user's computing device can present the tile or widget in any display context. Furthermore, this tile or widget may give a preview of the full user interface presentation, which may entice the user to select it to view the full presentation.
Finally, an optional sharing component 128 may allow the user (who receives the user interface presentation) to share the relevant media items with one or more other users. For example, the user may interact with the sharing component 128 to post the collection of relevant media items to the user' social network page. Or the user may interact with the sharing component 128 to send the relevant media items to another person (such as the user's spouse or friend), particularly in those cases in which the media items also pertain to the other person. For example, the user may send the relevant media items via an Email system, a video communication system, etc. In one case, the relevant media items contain a visual indicator (e.g., a digital watermark or the like) which indicates that they originated from a service associated with the MPS 102.
In another case, any other user, with the permission of the user who owns the digital media items, may subscribe to the collections of relevant items that are provided to the user. For example, the user's friend may subscribe to the user's collections of media items, generated by the MPS 102, especially when the friend is included in the collection. In response to the subscription, the MPS 102 will deliver any collection to both the user and his or her friend.
Other implementations of the MPS 102 can omit one or more features described above, and illustrated in
Advancing to
In another case, the functions performed by the MPS 102 are distributed between the remote computing functionality 304 and local computing functionality 302. Local MPS functionality 312 runs on the local computing functionality 302, in conjunction with one or more local data stores 314. For example, the local MPS functionality 312 may perform some media analysis functions, while the remote MPS functionality 308 may perform other media analysis functions. For instance, the local computing functionality 302 may rely on the remote computing functionality 304 to perform image analysis functions that are resource-intensive in nature, and are therefore more efficiently performed by the remote computing functionality 304 (which may have more robust computing resources compared to the local computing functionality 302).
The local computing functionality 302 may correspond to any computing device described above with reference to
The MPS 102 may interact with the video communication system 402 in delivering relevant media items to the user. For example, in interaction path 408, the MPS 102 may receive media items from the video communication system 402. Those media items may correspond to snapshots and/or video clips of the first user and the second user over a span of time, and over several video communication sessions. The MPS 102 may also independently receive media items uploaded by the first user and/or the second user, which do not necessarily originate from prior video sessions.
In the interaction path 410, the video communication session may send call setup data to the MPS 102. The call setup data indicates that the first user and the second user have initiated a current communication session. More specifically, the call setup data can identify the first user and the second user based on user credentials submitted by the first user and the second user, e.g., in the course of setting up the communication session.
The call setup data may also constitute an input event that triggers the MPS 102 to generate a user interface presentation. That is, in response to the input event, the MPS 102 generates one or more relevant media items. In an interactive path 412, the MPS 102 delivers the relevant media items to the video communication system, which, in turn, integrates the relevant media items into the overall user interface presentation that it provides to the first user and the second user. One or more networks 414 communicatively couple the above-identified components together.
A filtering component 502 may perform processing on an original set of media items to remove noise from the set of media items. For example, the filtering component 502 can apply known blur detection techniques to identify and remove media items that exhibit blurry images. In addition, or alternatively, the filtering component 502 can apply known frame analysis techniques to identify and remove any media item in which a presumed focal point of interest is not well centered within the frame.
In addition, or alternatively, the filtering component 502 can apply known image analysis techniques to reduce the number of similar media items in the set of original media items. For example, the filtering component 502 can extract image features associated with each of the media items in the set of original media items. The filtering component 502 may then use the image features to cluster the media items into groups of having presumed similar content. The filtering component 502 can then use any technique (such as a random technique) to choose one or more representative media items from each cluster.
A content analysis component 504 analyzes the content of the media items which survive the filtering operation performed by the filtering component 502. More specifically, an image analysis component 506 can perform image-related analysis on the media items (assuming, that is, that the media items have image or video content). A tag analysis component 508 may provide analysis on any supplemental information which accompanies the media items, such as textual metadata or the like. Although not shown, the content analysis component 504 can include yet other content analysis modules, such as an audio analysis component which analyzes information extracted from audio media items.
The image analysis component 506, in turn, can encompass plural techniques for analyzing the image content of the media items. For example, the image analysis component 506 can use known face recognition technology to determine the presence of human faces in the media items. The image analysis component 506 can also use the face recognition technology to identify the number of people present in each media item, if any.
Further, the image analysis component 506 may provide a database of telltale objects that may appear in the media items, and which users typically regard as significant, and/or which a particular user regards as significant. For example, one such object may correspond to a birthday cake. Another such object may correspond to a Christmas tree, and so on. The image analysis component 506 can then compare the image content of the media items with the objects to determine whether the media items contain any of these objects. The image analysis component 506 can leverage the same technique to determine the identities of people who appear in the media items, e.g., by comparing the media items to reference images or feature signatures associated with people having established identities.
Further, the image analysis component 506 may compare the media items with a corpus of other reference media items that have been tagged by one or more other users, e.g., using a crowdsourcing technique or the like. For example, one such reference media item may include metadata which identifies the media item as a picture of the Fish Market, which is a well-known tourist attraction in the city of Seattle. Upon finding a match between a newly received media item and at least one reference media item, the image analysis component 506 may annotate the new media items with the same tags as the reference media item. That is, upon finding a new media item that resembles a previous media item of the Fish Market, the image analysis component 506 may annotate the new media item with a tag that indicates that it pertains to the Fish Market.
The image analysis component 506 may perform yet other image analysis techniques to analyze the content of the media items. The above techniques are cited by way of example, not limitation.
The tag analysis component 508 can perform any linguistic analysis on the supplemental information that is associated with the media items. For example, the tag analysis component 508 can select media items that are tagged with keywords that are regarded as particularly noteworthy, such as the words “birthday,” “anniversary,” “vacation,” and so on. The tag analysis component 508 can perform other linguistic analysis tasks, such as by expanding the supplemental information to include synonyms, etc. The tag analysis component 508 can also perform any techniques to extract the underlying meaning of the supplemental information, such as Latent Semantic Analysis (LSA), etc.
An optional editing interface 510 may allow a user to manually assist the content analysis component 504 in interpreting the media items. For example, the user may use the editing interface 510 to indicate that a particular media item includes a particular person. In one case, the content analysis component 504 may prompt the user for the above type of assistance whenever it cannot automatically interpret a media item with a sufficient degree of confidence.
An indexing component 512 updates an index provided in a data store 118 based on the characteristics of the media items identified by the content analysis component 504. For example, the index may correspond to an inverted index. The inverted index may identify different characteristics that the media items may potentially possess. For each such characteristic, the index may then identify the media items that actually possess that characteristic. For example, one characteristic may correspond to a birthday cake. The inverted index may identify those media items that have been determined to include image content that resembles a birthday cake. The indexing component 512 updates the index by establishing links between the media items analyzed by the content analysis component 504 and the identified characteristics of those media items.
As a result of its analysis, the media analysis component 116 produces analysis results. The analysis results may reflect, in part, the updated indexing information stored in the index.
In one implementation, for instance, the trigger-determination component 602 can identify one or more characteristics of an input event. The trigger-determination component 602 can then use those characteristics as lookup keys to find the media items (if any) which share one or more of those characteristics. The trigger-determination component 602 may use the index to perform this task. A group of media items that share the identified characteristics correlates to a previous pattern of user activity. For example, assume that the input event corresponds to a cruise that a couple takes on their wedding anniversary. A group of media items that capture this event from prior years establishes a prior pattern of user activity for that couple.
The trigger-determination component 602 can also apply one or more significance-based considerations to determine whether the input event is significant to the user, and therefore warrants delivery of related media items to the user. In one case, for example, the trigger-determination component 602 can identify the number of media items that match the input event. The trigger-determination component 602 may conclude that a very small number of media items evinces a not-yet-significant event, as the identified media items fail to establish a meaningful pattern at this point in time. On the opposite extreme, the trigger-determination component 602 may conclude that a very large number of matching media items indicates that the items may also be insignificant, as their very ordinariness may suggest that the events will not interest the user. However, such configuration choices are application-specific in nature. In other cases, for instance, the trigger-determination component 602 may choose to deliver related media items predicated on only a small number of matching media items.
In addition, or alternatively, the trigger-determination component 602 can attach different weights to different characteristics. For example, an administrator and/or a user may establish, in advance, the events and objects that are considered particularly significant, such as birthdays, anniversaries, annual vacations, etc. An administrator and/or user may also establish, in advance, that any media item that includes two or more people is potentially significant. Moreover, an administrator and/or user can define the events and objects that considered particularly insignificant, such as trips to the supermarket. In application, the trigger-determination component 602 can indicate that an event is significant if it has one or more characteristics that have been labeled as significant.
A user can specify significance-related information in any manner. For example, the user may explicitly specify that information via a setup/configuration page provided by the MPS 102. In addition, or alternatively, the MPS 102 can extract such information from other sources that pertain to the user, such as the user's calendar system, social network profile, and on. In addition, or alternatively, the trigger-determination component 602 may leverage crowdsourcing resources to identify significant and insignificant events and objects, and hence to establish the weights assigned to these features.
More generally stated, the trigger-determination component 602 can apply any computation(s) to assess the relevance of a candidate media item, with respect to the user's current circumstance. For example, the trigger-determination component 602 may apply any discrete mathematical equation(s), any algorithm(s), a linear or non-linear model of any type produced by a machine-learning process, an expert system, any clustering-based algorithm(s), and so on, or any combination thereof. For example, in one non-limiting case, the trigger-determination component 602 can generate a relevance score based on a linear combination of weighted factors, including any of the factors described above, including priorities expressed by the system, the user, plural users, etc.
Further note that, in some situations, the trigger-determination component 602 may use any of the above techniques to match the user's present circumstance to plural different patterns of conduct that are happening at the same time, along with plural sets of associated media items. For example, the user may be performing two significant telltale activities at generally the same time, such as celebrating a birthday at a favorite vacation location (where the birthday celebration constitutes one significant event, regardless of where it occurs, and the visit to the vacation location constitutes another significant event, regardless of when and why that visit occurs). The trigger-determination component 602 can use any of the above techniques (and factors) to rank the relevance of the matching media items. In addition, in one implementation, the trigger-determination component 602 can apply one or more factors to ensure that the selected collection of media items forms a cohesive theme or narrative; in other cases, the trigger-determination component 602 may accommodate a more varied collection of media items that matches different patterns associated with the user's present circumstance.
A presentation generation component 604 generates a user interface presentation, when prompted to do so by the trigger-determination component 602. The user interface presentation shows one or more media items that are determined to be related to the user's current activity.
A synchronization component 606 synchronizes a collection of video media items that have been determined to pertain to the same pattern of user activity. For example, the synchronization component 606 can identify a common reference event in each of the video media items, such as the start of the event. The synchronization component 606 can then configure the group of video media items such that they simultaneously run, starting from the reference event. In addition, or alternatively, the synchronization component 606 can match up the frames of a first video media item with the frames of a second video media item, e.g., based on an analysis of similar content in those frames. The synchronization component 606 can use the resultant comparison results to further synchronize the video media items, e.g., such that similar frames are controlled to play back at the same times.
A learning component 608 receives feedback information from the user. Based on that information, the learning component 608 modifies the operation of the presentation processing component 124, and/or any other component of the MPS 102. For example, the user may use various techniques to evaluate the output of the MPS 102, such as by indicating whether the media items presented to him or her are interesting. The learning component 608 can use those evaluation results to promote the future generation of media items that the user likes, and to discourage the generation of media items that the user does not like. The learning component 608 can also take into account the likes and dislikes of other users, based on the presumption that general trends in user preferences may apply to the particular user under consideration.
In one particular case, the learning component 608 can specifically use feedback from the user to adjust the weight that it associated with different features, and/or by adjusting other tunable parameters. For example, assume that the user repeatedly indicates that he or she is not interested in media items having Christmas trees, despite the fact that this feature may have had a high original default weight. In response, the learning component 608 can lower the relevance weight of the Christmas tree feature. The user will thereafter see fewer or no media items having Christmas trees. In other cases, the MPS 102 can use models produced by machine learning techniques to perform its functions. The MPS 102 can use the feedback provided by the user to dynamically retrain its models.
Starting with
The user interface presentation 702 displays a collection of media items that were taken on previous instances of the anniversary date. Here, the user interface presentation 702 arranges the collection of media items using a timeline format. That is, a timeline 704 spans a period of time that begins on the user's wedding day and ends on the current date. The user has selected one such media item 706 within the timeline 704, e.g., corresponding to an instance of the anniversary that occurred in the year 2011. A feedback control mechanism 708 allows the user to evaluate whether the particular media item is interesting or not interesting, or whether the entire collection is interesting or not interesting. One or more components of the MPS 102 can modify their manner of operation based on the feedback information provided via such a feedback control mechanism 708.
The user interface presentation 802 displays a collection of media items that were taken on previous annual visits to the site in question. Here, the user interface presentation 802 arranges the collection of media items in a collage 804, but the user interface presentation 802 could use any other format (such as a timeline format) to convey the same collection.
The user interface presentation 802 may also include one or more control mechanisms. A first control mechanism 806 allows the user to instruct the MPS 102 to upload the collection of media items to a social network service. A second control mechanism 808 allows the user to instruct the MPS 102 to send the collection of media items to specified person, such as a particular person (“Joanne”) who also is prominently featured in the media items, e.g., corresponding to the user's spouse. Alternatively, the user interface presentation 802 may include a control mechanism that allows the user to share the collection of media items with any specified person, as manually specified by the user. A third control mechanism 810 allows the user to store the collection of media items in an archive data store. The MPS 102 can treat the user's interaction with any of the above-described control mechanisms (806, 808, 810, etc.) as an implicit approval of the collection of media items presented by the user interface presentation 802. The MPS 102 may use such actions as feedback information to improve the operation of one or more of its components. The user interface presentation may also include the explicit feedback control mechanism 708 described above with reference to
In some cases, the MPS 102 may automatically invoke its service upon the start of any communication session. In other cases, the MPS 102 may automatically invoke its service for only some communication sessions, such as communication sessions that last longer than a prescribed conversation length, e.g., because the reminiscing fostered by the MPS 102 may be more desirable and useful in the context of longer calls. Further, the MPS 102 can allow any participant to control the triggering factors that will invoke the MPS service, or to disable the MPS service entirely. A user can make these selections via a configuration tool or the like. Further, upon the occurrence of each triggering event, the MPS 102 can optionally ask each participant to accept or decline the MPS service. In the example of
Assume that the user “David” is interacting with a local version of the user interface presentation 902 (while “Philip” is interacting with another local version of the user interface presentation 902). The user interface presentation 902 that is presented to David may include a first section 904 that at least shows a real time video of the second user, Philip. Or the first section 904 may show a real time video of both David and Philip (as in the particular case of
The user interface presentation 902 also includes a second section 906 which shows a collection of media items that are determined to be relevant to the communication session, arranged in any format. For example, the user interface presentation 902 arranges the media items in a timeline format. But the user interface presentation 902 could have alternatively used the collage format to present the media items, and/or some other format. In one case, the MPS 102 may draw the media items from a remote common data store, e.g., to which both David and Philip have previously uploaded their photo items. In another case, the MPS 102 may draw the media items that it presents to David from a local (or remote) personal data store associated with David, and draw the media items that it presents to Philip from a local (or remote) personal data store associated with Philip. David's data store may contain the same content as Philip's data store, or different content.
Further, regardless of where the media items originate, the set of media items that are presented to David may be the same as the set of media items that are presented to Philip, or different from the media items that are presented to Philip. For example, consider the scenario in which the MPS 102 draws from the personal data stores of David and Philip, where those data stores contain different photo items. The MPS 102 can present the most relevant photo items from David's data store for presentation to David, and present the most relevant photo items from Philip's data store for presentation to Philip, there being no expectation that the two sets of media items will be the same. Further, either David or Philip can optionally decide to share personal photo items with each other during their conversation. Or these users can configure the MPS service to automatically share the photo items. In the following explanation, however, assume that the MPS service presents the same collection of media items to both David and Philip.
In one case, the media items that are presented are considered relevant because they depict either the first user or the second user, or preferably both the first user and the second user. For example, the media items may include a collection of digital photographs taken by David or Philip which include both David and Philip. One such digital photograph may show these two friends on their graduation from college. Another such digital photograph may show these friends at a recent conference, and so on.
Alternatively, or in addition, the media items may correspond to media items captured by the video communication system itself. For example, the video communication system may have taken one or more snapshots and/or video clips (and/or audio clips) during each interaction session between David and Philip, over the course of many previous interactions between David and Philip.
In addition, the presentation processing component 124 can present the above-described media items to the two users (David and Philip) outside the context of their communication session. For example, the presentation processing component 124 can present the related media items to the users within a prescribed window of time, starting from the end of the communication session. Alternatively, the presentation processing component 124 can present the related media items only when some milestone has been reached, such as after determining that the users have been communicating with each other over a span of time that has reached some threshold, such as a three-year mark, etc.
In another case, the MPS 102 can detect the subject matter of the user's conversation in the current communication session, or over the course of two or more previous communication sessions. The MPS 102 can perform this task, for instance, based on extracting keywords from the users' speech, and/or based on explicit topic selections made by the users, and/or by based on implicit topic selections made by the users (e.g., when either David or Philip browses to a particular site in the course of the communication session), etc. The MPS 102 can then present media items that are most pertinent to the identified topic, while also depicting one or more of the users (David and Philip). For example, if one of the user's visits a football-related site, the MPS 102 can present digital photographs of David and Philip attending football games, if those photographs exist.
More specifically, a section 1104 can devote plural sections for playing back plural video media items, each devoted to a separate race that has occurred in particular year. A start/stop control button 1106 allows the user to initiate the playback of all the video media items, simultaneously. The video media items are synchronized such that they play back in coordinated fashion, e.g., starting from a point in time at which the races begin. By virtue of this type of presentation, the user (Bob) can compare his performance over the years, e.g., by identifying trends in his performance. A timer 1108 may display the amount of time that has elapsed in the playback of the video media items.
B. Illustrative Processes
Starting with
C. Illustrative Computing Functionality
The computing functionality 1502 can include one or more processing devices 1504, such as one or more central processing units (CPUs), and/or one or more graphical processing units (GPUs), and so on.
The computing functionality 1502 can also include any storage resources 1506 for storing any kind of information, such as code, settings, data, etc. Without limitation, for instance, the storage resources 1506 may include any of: RAM of any type(s), ROM of any type(s), flash devices, hard disks, optical disks, and so on. More generally, any storage resource can use any technology for storing information. Further, any storage resource may provide volatile or non-volatile retention of information. Further, any storage resource may represent a fixed or removable component of the computing functionality 1502. The computing functionality 1502 may perform any of the functions described above when the processing devices 1504 carry out instructions stored in any storage resource or combination of storage resources.
As to terminology, any of the storage resources 1506, or any combination of the storage resources 1506, may be regarded as a computer readable medium. In many cases, a computer readable medium represents some form of physical and tangible entity. The term computer readable medium also encompasses propagated signals, e.g., transmitted or received via physical conduit and/or air or other wireless medium, etc. However, the specific terms “computer readable storage medium” and “computer readable medium device” expressly exclude propagated signals per se, while including all other forms of computer readable media.
The computing functionality 1502 also includes one or more drive mechanisms 1508 for interacting with any storage resource, such as a hard disk drive mechanism, an optical disk drive mechanism, and so on.
The computing functionality 1502 also includes an input/output module 1510 for receiving various inputs (via input devices 1512), and for providing various outputs (via output devices 1514). The input devices 1512 can include any of key entry devices, mouse entry devices, touch-enabled entry devices, voice entry devices, and so on. One particular output mechanism may include a presentation device 1516 and an associated graphical user interface (GUI) 1518. The computing functionality 1502 can also include one or more network interfaces 1520 for exchanging data with other devices via one or more networks 1522. One or more communication buses 1524 communicatively couple the above-described components together.
The network(s) 1522 can be implemented in any manner, e.g., by a local area network, a wide area network (e.g., the Internet), point-to-point connections, etc., or any combination thereof. The network(s) 1522 can include any combination of hardwired links, wireless links, routers, gateway functionality, name servers, etc., governed by any protocol or combination of protocols.
Alternatively, or in addition, any of the functions described in the preceding sections can be performed, at least in part, by one or more hardware logic components. For example, without limitation, the computing functionality 1502 can be implemented using one or more of: Field-programmable Gate Arrays (FPGAs); Application-specific Integrated Circuits (ASICs); Application-specific Standard Products (ASSPs); System-on-a-chip systems (SOCs); Complex Programmable Logic Devices (CPLDs), etc.
In closing, the functionality described above can employ various mechanisms to ensure the privacy of user data maintained by the functionality, in accordance with user expectations and applicable laws of relevant jurisdictions. For example, the functionality can allow a user to expressly opt in to (and then expressly opt out of) the provisions of the functionality. The functionality can also provide suitable security mechanisms to ensure the privacy of the user data (such as data-sanitizing mechanisms, encryption mechanisms, password-protection mechanisms, etc.).
Further, the description may have described various concepts in the context of illustrative challenges or problems. This manner of explanation does not constitute a representation that others have appreciated and/or articulated the challenges or problems in the manner specified herein. Further, the claimed subject matter is not limited to implementations that solve any or all of the noted challenges/problems.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.