This disclosure relates to modifying an interest curve that quantifies interesting moments within videos based on activities and sub-activities captured within the videos.
Manually identifying interesting moments within videos for video summary generation may be difficult and time consuming. Improving automatic identification of interesting moments within videos may encourage capture, use, and/or sharing of the videos.
This disclosure relates to identifying interesting moments within videos. Video information defining video content may be accessed. The video content may have a progress length and include a capture of an activity at one or more moments in the progress length. The capture of the activity may include a capture of a sub-activity associated with the activity. Portions of the video content may be associated individually with values of an interest metric such that the values of the interest metric of the individual portions of the video content as a function of progress through the video content form an interest curve for the video content. The activity captured within the video content may be identified. The sub-activity captured within the video content may be identified based on the identification of the activity. An activity metric modifier for the interest curve at the one or more moments in the progress length may be determined based on the identification of the activity and the identification of the sub-activity. The interest curve may be modified at the one or more moments based on the activity metric modifier.
A system that identifies interesting moments within videos may include one or more processors, and/or other components. The processor(s) may be configured by machine-readable instructions. Executing the machine-readable instructions may cause the processor(s) to facilitate identifying interesting moments within videos. The machine-readable instructions may include one or more computer program components. The computer program components may include one or more of a video information component, an activity component, a metric component, an interest curve component, and/or other computer program components. In some implementations, the computer program components may include an emotion component and an orientation component.
The video information component may be configured to access video information defining video content. Accessing the video information may include one or more of acquiring, analyzing, determining, examining, locating, obtaining, receiving, retrieving, reviewing, and/or otherwise accessing the video information. The video information component may access video information from one or more locations. The video information component may be configured to access video information defining video content during capture of the video content and/or after capture of the video content by one or more image sensors.
Video content may refer to media content that may be consumed as one or more videos. Video content may include one or more videos stored in one or more formats/containers, and/or other video content. Video content may have a progress length. Video content may include a capture of an activity at one or more moments in the progress length. The capture of the activity may include a capture of a sub-activity associated with the activity. In some implementations, video content may include one or more of spherical video content, virtual reality content, and/or other video content.
Portions of the video content may be associated individually with values of an interest metric. The values of the interest metric of the individual portions of the video content as a function of progress through the video content may form an interest curve for the video content. The video content may include video frames. In some implementations, the portions of the video content being associated individually with values of the interest metric may include the video frames being associated individually with the values of the interest metric.
The activity component may be configured to identify the activity captured within the video content. In some implementations, the activity may be identified with an activity accuracy.
The activity component may be configured to identify the sub-activity captured within the video content based on the identification of the activity and/or other information. In some implementations, the sub-activity may be identified with a sub-activity accuracy. In some implementations, the identification of the sub-activity may be characterized by a probability distribution and/or other information. In some implementations, the probability distribution may include identification of a set of sub-activities associated with the activity, accuracies associated with the identification of individual sub-activities, and/or other information. In some implementations, the probability distribution may be characterized by a degree of unimodality.
The emotion component may be configured to identify an emotion captured within the video content at the one or more moments in the progress length. In some implementations, the emotion may be identified with an emotion accuracy. In some implementations, the identification of the emotion may be characterized by a probability distribution and/or other information.
The orientation component may be configured to determine an orientation of an image sensor and/or other information. The image sensor may have captured the video content. The orientation component may determine the orientation of the image sensor when the image sensor captured the activity at the one or more moments in the progress length.
The metric component may be configured to determine an activity metric modifier for the interest curve at the one or more moments in the progress length based on one or more of the identification of the activity, the identification of the sub-activity, and/or other information. In some implementations, the activity metric modifier may be determined further based on the activity accuracy of the activity identification and the sub-activity accuracy of the sub-activity identification. In some implementations, the activity metric modifier may be determined further based on the probability distribution of the sub-activity identification. In some implementations, the individual sub-activities may be associated with individual modifier values and/or other information. In some implementations, the activity metric modifier may be determined further based on the degree of unimodality of the probability distribution of the sub-activity identification.
In some implementations, the metric component may be configured to determine an emotion metric modifier for the interest curve at the one or more moments in the progress length based on the emotion captured within the video content and/or other information. In some implementations, the emotion metric modifier may be determined further based on the emotion accuracy of the emotion identification and the probability distribution of the emotion identification.
In some implementations, the metric component may be configured to determine an orientation metric modifier for the interest curve at the one or more moments in the progress length based on the orientation of the image sensor that captured the video content and/or other information.
The interest curve component may be configured to modify the interest curve at the one or more moments based on the activity metric modifier and/or other information. In some implementations, the interest curve component may be configured to modify the interest curve at the one or more moments further based on the emotion metric modifier. In some implementations, the interest curve component may be configured to modify the interest curve at the one or more moments further based on the orientation metric modifier.
These and other objects, features, and characteristics of the system and/or method disclosed herein, as well as the methods of operation and functions of the related elements of structure and the combination of parts and economies of manufacture, will become more apparent upon consideration of the following description and the appended claims with reference to the accompanying drawings, all of which form a part of this specification, wherein like reference numerals designate corresponding parts in the various figures. It is to be expressly understood, however, that the drawings are for the purpose of illustration and description only and are not intended as a definition of the limits of the invention. As used in the specification and in the claims, the singular form of “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise.
The electronic storage 12 may include electronic storage media that electronically store information. The electronic storage 12 may store software algorithms, information determined by the processor 11, information received remotely, and/or other information that enables the system 10 to function properly. For example, the electronic storage 12 may store information relating to video information, video content, interest metric, interest curve, activity, sub-activity, activity/sub-activity identification, emotion, emotion identification, image sensor orientation, image sensor orientation identification, activity metric modifier, emotion metric modifier, orientation metric modifier, and/or other information.
Referring to
The video information component 102 may be configured to access video information and/or other information. The video information may define video content. Accessing the video information may include one or more of acquiring, analyzing, determining, examining, locating, obtaining, receiving, retrieving, reviewing, and/or otherwise accessing the video information. The video information component 102 may access video information from one or more locations. For example, the video information component 102 may access the video information from a storage location, such as the electronic storage 12, electronic storage of one or more image sensors (not shown in
The video information component 102 may be configured to access video information defining video content during capture of the video content and/or after capture of the video content by one or more image sensors. For example, the video information component 102 may access video information defining video content while the video content is being captured by one or more image sensors. The video information component 102 may access video information defining video content after the video content has been captured and stored in memory (e.g., the electronic storage 12, buffer memory).
Video content may refer to media content that may be consumed as one or more videos. Video content may include one or more videos stored in one or more formats/containers, and/or other video content. A video may include a video clip captured by a video capture device, multiple video clips captured by a video capture device, and/or multiple video clips captured by separate video capture devices. A video may include multiple video clips captured at the same time and/or multiple video clips captured at different times. A video may include a video clip processed by a video application, multiple video clips processed by a video application, and/or multiple video clips processed by separate video applications.
Video content may have a progress length. A progress length may be defined in terms of time durations and/or frame numbers. For example, video content may include a video having a time duration of 60 seconds. Video content may include a video having 1800 video frames. Video content having 1800 video frames may have a play time duration of 60 seconds when viewed at 30 frames/second. Other time durations and frame numbers are contemplated.
Video content may include a capture of one or more activities at one or more moments in the progress length. For example, video content may include a capture of a surfing activity that spans a duration in the progress length of the video content. The capture of an activity may include a capture of one or more sub-activities associated with the activity. For example, a surfing activity may be associated with one or more sub-activities, such as paddling, standing on a surfboard, performing a trick, entering a wave tunnel, surfing in a particular way, falling in a particular way, and/or other sub-activities. A capture of a surfing activity may include capture of one or more sub-activities associated with surfing. Other types of activities and sub-activities are contemplated.
Portions of the video content may be associated individually with values of one or more interest metrics. Portions of the video content may include one or more video frames of the video content. An interest metric may refer to one or more measurements indicating whether portions of the video content include capture of visuals that are of interest to one or more users. Capture of visuals may include capture of one or more objects (e.g., animate/inanimate objects), scenes, actions, and/or other visuals. Values of the interest metric(s) may be measured in numbers, characters, and/or other measurements. In some implementations, values of interest metric(s) may be determined based on analysis of the visual, audio, and/or other information relating to the capture of the video content. Analysis of the visual, audio, and/or other information may be performed using the video content and/or reduced versions of the video content (e.g., having lower resolution, lower framerate, higher compression).
In some implementations, values of the interest metric(s) may indicate one or more measurements of one or more characteristics of the portions of the video content that quantify user interest. For example, values of the interest metric(s) may indicate measurement(s) of intensities of visuals and/or audio captured within the portions, activities/events captured within the portions, and/or other information. Other measurements are contemplated.
In some implementations, values of the interest metric(s) may indicate a probability that the portions of the video content include a visual/audio capture of a particular visual, a particular object, a particular scene, a particular action, a particular activity, and/or other information. For example, values of the interest metric(s) may indicate a probability that the portion of the video content includes a particular person, a particular movement, a particular sporting event, a particular emotion (e.g., laughing, smiling, excited), and/or other information. Other probabilities are contemplated.
Values of the interest metric(s) may provide a “goodness” measure that is agnostic as to the particular visuals, objects, scenes, actions, and/or activities captured within the video content. Values of the interest metric(s) may provide a measure that indicates whether the visuals captured within the video content are those that are likely to be of interest to a user regardless of the particular visuals, objects, scenes, actions, and/or activities captured within the video content. For example, a “goodness” measure may indicate the extent to which the video content includes visuals with particular brightness, contrast, color histogram, blur/sharpness, number of faces, particular faces, salient objects, smooth motion, steady shot versus shakiness, scene composition, framing, and/or other measures that indicate “goodness” regardless of the activity captured within the video content.
Values of the interest metric(s) providing a “goodness” measure may provide a base measure of interest in the video content. Criteria for determining general “goodness” in video content may not identify relevant moments (e.g., highlight moments, moments of interest) for particular visuals, objects, scenes, actions, and/or activities. For example, a “goodness” measure (base measure of interest) may not identify relevant moments for a surfing activity. Falling in a spectacular way may be of interest in a surfing activity, but the motion associated with such falls may lead to the corresponding portion of the video content having a poor “goodness”/base measure.
The values of the interest metric of the individual portions of the video content as a function of progress through the video content may form an interest curve for the video content. For example,
In some implementations, the portions of the video content being associated individually with values of the interest metric may include video frames of the video content being associated individually with the values of the interest metric. For example, referring to
As shown in
The activity component 104 may be configured to identify one or more activities and one or more sub-activities captured within the video content. The activity component 104 may identify one or more sub-activities captured within the video content based on the identification of the activity and/or other information. Analysis of the video content for activity/sub-activity identification may be performed using the video content and/or reduced versions of the video content (e.g., having lower resolution, lower framerate, higher compression). The activities and sub-activities may be identified using computer vision and/or other techniques.
For example, activities may be identified using a coarse-grained classifier and sub-activities may be identified using a fine-grained classifier. A coarse-grained classifier may provide one or more predictions of what activities/types of activities are captured within the video content, and a fine-grained classifier may provide one or more predictions of what sub-activities/types of sub-activities are captured within the video content. An activity may be associated with a fine-grained classifier for identifying sub-activities associated with the activity. Based on the most likely activity captured within the video content, the fine-grained classifier associated with the most likely activity may be used to identify the sub-activities. For example, referring to
Identification of the activities/sub-activities captured within the video content may be performed on a per-frame basis and/or a per-group-of-frames basis. For example, identification of the activities/sub-activities captured within the video content may be performed per “time slice” of the video content, where a time slice represents a certain number of frames/duration. For example, a time slice may represent ten frames and/or a sixth of a second. Other time slices are contemplated.
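For illustration, the coarse-to-fine identification over a time slice may be sketched as follows. This is a minimal sketch; coarse_classifier and fine_classifiers are hypothetical stand-ins for trained models, and the disclosure does not prescribe a particular model or API:

```python
def identify_time_slice(frames, coarse_classifier, fine_classifiers):
    """Identify the activity and its sub-activities for one time slice."""
    # Coarse stage: probabilities over activity types for this time slice,
    # e.g., {"surfing": 0.60, "mountain biking": 0.25, ...}.
    activity_probs = coarse_classifier(frames)
    activity = max(activity_probs, key=activity_probs.get)

    # Fine stage: the fine-grained classifier associated with the most likely
    # activity predicts its sub-activities, e.g., {"paddling": 0.2, ...}.
    sub_activity_probs = fine_classifiers[activity](frames)
    return activity, activity_probs[activity], sub_activity_probs
```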
In some implementations, exponential moving average/smoothing may be used for identification of the activities/sub-activities captured within the video content. Using exponential moving average/smoothing may prevent the coarse and/or fine-grained classifiers from jumping between identification of different activities/sub-activities. For example, without the use of the exponential moving average/smoothing, the classifiers may jump from identifying the activity captured within the video content as mountain biking to surfing, then back from surfing to mountain biking.
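Such smoothing may be sketched as follows (a minimal sketch; the smoothing factor alpha is a hypothetical parameter not specified by the disclosure):

```python
def smooth_predictions(prob_sequence, alpha=0.3):
    """Exponentially smooth per-time-slice class probabilities.

    prob_sequence: one dict of class label -> probability per time slice.
    """
    smoothed, state = [], {}
    for probs in prob_sequence:
        for label, p in probs.items():
            prev = state.get(label, p)  # seed with the first observation
            state[label] = alpha * p + (1 - alpha) * prev
        smoothed.append(dict(state))
    return smoothed
```

With smoothing, a one-slice spike of “surfing” probability within an otherwise mountain-biking video decays rather than flipping the identified activity for that slice.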
The activities and/or sub-activities may be identified with a given amount of accuracy (activity accuracy, sub-activity accuracy). Accuracy may indicate the probability of the prediction of the activity/sub-activity identification being correct. For example, a surfing activity within video content may be identified with a certain amount of accuracy (e.g., 60%). The sub-activities within the video content may be identified with certain amount(s) of accuracy.
In some implementations, the identification of the sub-activities may be characterized by a probability distribution and/or other information. The probability distribution may include identification of a set of sub-activities associated with the activity, accuracies associated with the identification of individual sub-activities, and/or other information. For example,
A probability distribution may be characterized by a degree of unimodality and/or other information. A degree of unimodality may indicate the extent to which the probability distribution includes a unique mode (only a single highest/maximum value). For example, a probability distribution including probability values for eight different sub-activities may have a smaller degree of unimodality than a probability distribution including probability values for four different sub-activities.
The emotion component 106 may be configured to identify one or more emotions captured within the video content at one or more moments in the progress length. Analysis of the video content for emotion identification may be performed using the video content and/or reduced versions of the video content (e.g., having lower resolution, lower framerate, higher compression). The emotions may be identified using computer vision and/or other techniques. For example, faces captured within the video content may be detected to determine bounding boxes for the faces. Visuals within the individual bounding boxes may be fed through one or more neural networks to obtain predictions of emotions captured within the video content. Different types of emotions may be identified by the emotion component 106. As a non-limiting example, the emotion component 106 may be configured to identify anger, disgust, fear, happy, sad, surprise, neutral, and/or other emotions.
For example, the emotion component 106 may identify emotion(s) expressed by people captured within the video content at the moment 322 and/or other moments. The emotions may be identified with given amounts of accuracy (emotion accuracy). In some implementations, the identification of the emotions may be characterized by a probability distribution and/or other information. The probability distribution may include identification of a set of emotions, and/or other information. The probability distribution may include different probabilities of prediction for different emotions identified at the moment in the progress of the video content.
The orientation component 108 may be configured to determine one or more orientations of one or more image sensors and/or other information. The image sensor(s) may have captured the video content. The orientation component 108 may determine the orientation of the image sensor(s) when the image sensor(s) captured the activities/sub-activities at one or more moments in the progress length. The orientation component 108 may determine the orientation of the image sensor(s) based on sensor information (e.g., orientation sensor, accelerometer, gyroscope) generated at the particular moments, and/or based on analysis of the visuals captured within the video content (e.g., detection and orientation of horizon) at the particular moments.
The metric component 110 may be configured to determine one or more activity metric modifiers for the interest curve at one or more moments in the progress length based on one or more of the identification of the activity, the identification of the sub-activity, and/or other information. For example, referring to
An activity metric modifier may include a parameter that decreases the values of an interest curve based on the activity accuracy and/or other information, and a parameter that increases the values of the interest curve based on one or more of the activity accuracy, the identification of the sub-activities, the sub-activity accuracy, and the degree of unimodality of the identified sub-activities' probability distribution. For example, values of an interest curve modified by an activity metric modifier may be given by I′=(1−maxC)·I+maxC·A, where I′=modified interest curve value; I=original interest curve value (e.g., base measure of interest); maxC=prediction accuracy of the most likely activity from the coarse-grained classifier (e.g., 0.60 for 60% accuracy in prediction of a surfing activity); and A=activity-specific modifier value.
The modified interest curve values may be an interpolation between the original interest curve value (e.g., base measure of interest) and one or more values that boost or suppress the original values. An activity metric modifier may modify the values of the interest curve such that (1) higher probability of activity identification (maxC) results in greater impact on the modified interest curve values by the particular activity-specific modifier value (A) and lower impact on the modified interest curve values by the original interest curve values (I); and (2) lower probability of activity identification (maxC) results in lower impact on the modified interest curve values by the particular activity-specific modifier value (A) and greater impact on the modified interest curve values by the original interest curve values (I).
For example, if no activities were identified at a particular moment in the video content, the value of the interest curve at that particular moment is not modified (the original value (I) is used). If an activity was identified at a particular moment in the video content with 100% accuracy, the original value of the interest curve has no impact on the modified interest curve value at that particular moment. Instead, the value of the modified interest curve at that particular moment is the activity-specific modifier value (A).
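The boundary behavior described above follows directly from the interpolation formula, as a short sketch shows (the input values are illustrative only):

```python
def modify_interest_value(i, max_c, a):
    """I' = (1 - maxC) * I + maxC * A, per the formula above."""
    return (1 - max_c) * i + max_c * a

print(modify_interest_value(0.4, 0.0, 2.0))  # 0.4  -- no activity identified; original value kept
print(modify_interest_value(0.4, 1.0, 2.0))  # 2.0  -- 100% accuracy; modifier value used outright
print(modify_interest_value(0.4, 0.6, 2.0))  # 1.36 -- interpolation between the two
```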
The activity-specific modifier value (A) may be calculated by combining a raw fine-grained modifier with the normalized entropy of the sub-activity distribution. One plausible form, reconstructed from the definitions and the description below (the expressions for Araw and Hnorm follow directly from the prose; the exact final combination is not fully recoverable here, and the neutral value of 1, meaning no boost/suppression, is an assumption chosen to match the stated behavior of the squared normalized entropy):

Araw=Σx p(x)·boost(x); Hnorm=−(Σx p(x)·log p(x))/log(Nclasses); A=(1−Hnorm²)·Araw+Hnorm²

where A=activity-specific modifier value, Araw=raw fine-grained activity-specific modifier value, Hnorm=normalized entropy of the sub-activity distribution, Nclasses=number of identified sub-activities, p(x)=predicted probability of sub-activity x, boost(x)=multiplier applied to the predicted probability of sub-activity x, and the variable x ranges over each sub-activity in the predicted distribution.
The activity-specific modifier value may be calculated based on the fine-grained sub-activity distribution and/or other information. Identification of the sub-activities (prediction accuracy/confidence score) may be represented using a vector (prediction vector). The entries of the prediction vector may add up to one (the probabilities of the identified sub-activities add up to 1). Particular sub-activities may be assigned a boost value (greater than one) or a suppression value (less than one), which may be represented using a vector (boost/suppression vector). The boost/suppression values may be calculated empirically to provide certain boost value(s) for desired/interesting sub-activities (e.g., 2.0 for standing on a surfboard) and certain suppression value(s) for undesired/uninteresting sub-activities (e.g., 0.1 for paddling).
The activity-specific modifier value may be calculated by (1) taking a dot product of the prediction vector for the sub-activities and the boost/suppression vector, which results in a raw fine-grained activity-specific modifier value (Araw), and (2) modifying the raw fine-grained activity-specific modifier value by taking into account the entropy of the distribution (Hnorm). The entropy of the distribution (Hnorm) may modify the raw fine-grained activity-specific modifier value based on the degree of unimodality of the identified sub-activities' probability distribution. The entropy of the distribution (Hnorm) may be normalized (e.g., between zero and one) such that the minimum entropy value is zero (when there is a single predicted sub-activity) and the maximum entropy value is one (when the distribution of predicted sub-activities is flat). Using the entropy value provides for modification of the interest curve value that reduces the impact of the boost/suppression associated with the particular sub-activities when the entropy is high (when the fine-grained classifier result does not have high confidence). Using the square of the normalized entropy to calculate the activity-specific modifier value (A) may decrease the effect of the raw fine-grained activity-specific modifier value (Araw) on the activity-specific modifier value (A) when the predicted sub-activity distribution is flatter.
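For illustration, the calculation may be sketched as follows, using the surfing example values from this disclosure (2.0 for standing on a surfboard, 0.1 for paddling); the remaining numbers, and the final combination of Araw and Hnorm, are assumptions as noted above:

```python
import math

# Hypothetical fine-grained output for a surfing activity (sums to 1).
prediction = {"standing": 0.7, "paddling": 0.2, "trick": 0.1}
# Boost (>1) / suppression (<1) values; 2.0 and 0.1 come from the disclosure,
# 3.0 for "trick" is illustrative only.
boost = {"standing": 2.0, "paddling": 0.1, "trick": 3.0}

# (1) Raw fine-grained modifier: dot product of prediction and boost vectors.
a_raw = sum(prediction[x] * boost[x] for x in prediction)  # 1.72

# (2) Normalized entropy of the sub-activity distribution: 0 for a single
# predicted sub-activity (which would need h_norm = 0 set directly, since
# log(1) = 0), 1 for a flat distribution; a lower value means a higher
# degree of unimodality.
n_classes = len(prediction)
h_norm = -sum(p * math.log(p) for p in prediction.values() if p > 0) / math.log(n_classes)

# Attenuate the raw modifier by the squared normalized entropy; the neutral
# value of 1 is an assumed reconstruction (see above).
a = (1 - h_norm ** 2) * a_raw + h_norm ** 2
```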
In some implementations, the metric component 110 may be configured to determine one or more emotion metric modifiers for the interest curve at one or more moments in the progress length based on the emotion(s) captured within the video content and/or other information. For example, particular emotions may be assigned a boost value (greater than one) or a suppression value (less than one), which may be represented using a vector (boost/suppression vector). The boost/suppression values may be calculated empirically to provide certain boost value(s) for desired/interesting emotions and certain suppression value(s) for undesired/uninteresting emotions. For example, the boost/suppression values for example emotions may include: 1.25 for anger; 1.25 for disgust; 3.0 for fear; 3.0 for happy; 0.5 for sad; 3.0 for surprise; 1.25 for neutral; and/or other values. The emotion metric modifier(s) (E) may be equal to or determined based on the boost/suppression values.
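A minimal sketch of determining E from these values follows; weighting by the predicted emotion probabilities is an assumed design, since the disclosure states only that E may be equal to or determined based on the boost/suppression values:

```python
# Example boost/suppression values from this disclosure.
EMOTION_BOOST = {"anger": 1.25, "disgust": 1.25, "fear": 3.0,
                 "happy": 3.0, "sad": 0.5, "surprise": 3.0, "neutral": 1.25}

def emotion_modifier(emotion_probs):
    # Probability-weighted combination of the boost/suppression values
    # (an assumption; E may also simply equal the value for the single
    # identified emotion).
    return sum(EMOTION_BOOST[e] * p for e, p in emotion_probs.items())

# e.g., emotion_modifier({"happy": 0.8, "neutral": 0.2}) -> 2.65
```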
In some implementations, the emotion metric modifier(s) may be determined further based on the accuracy of the emotion identification (emotion accuracy) and the probability distribution of the emotion identification. For example, a particular type of emotion may be associated with sub-emotions. Coarse and fine-grained classifiers (as discussed above with respect to activities and sub-activities) may be used to determine emotion metric(s) based on one or more of identification, accuracy, probability distribution, unimodality, and/or other information.
In some implementations, the metric component 110 may be configured to determine an orientation metric modifier for the interest curve at one or more moments in the progress length based on the orientation of the image sensor(s) that captured the video content and/or other information. The orientation metric modifier may reflect whether the image sensor(s) were correctly oriented at the time of video content capture. The orientation metric modifier may modify (e.g., suppress) values of an interest curve based on a determination that the image sensor(s) that captured the video content were incorrectly oriented at the time of video content capture. For example, the input for the orientation metric modifier calculation may include a probability distribution [x_0, x_90, x_180] over the orientation classes at each moment. In some implementations, x_0 and x_180 may be added together if it is difficult to differentiate between the two orientations. The normalized exponential moving average of the values (e.g., x_0+x_180) may be used as a multiplier (O) for the values of the interest curve.
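A minimal sketch of the orientation multiplier follows (the smoothing factor alpha is a hypothetical parameter; the per-moment distributions are assumed to sum to one, so x_0 + x_180 already lies in [0, 1]):

```python
def orientation_multiplier(orientation_probs, alpha=0.3):
    """Compute O per moment from [x_0, x_90, x_180] distributions.

    x_0 and x_180 are summed where the two orientations are hard to
    differentiate (x_90 corresponds to an incorrect orientation), then
    smoothed with an exponential moving average.
    """
    o_values, ema = [], None
    for x_0, x_90, x_180 in orientation_probs:
        correct = x_0 + x_180
        ema = correct if ema is None else alpha * correct + (1 - alpha) * ema
        o_values.append(ema)
    return o_values
```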
The interest curve component 112 may be configured to modify the interest curve at one or more moments based on one or more metric modifiers and/or other information. Modifying the interest curve may include increasing values and/or decreasing values of the interest curve corresponding to different moments within the video content. For example, the interest curve may be modified to (1) include new peak(s), (2) include new dip(s), (3) increase value(s) of existing peak(s), (4) increase value(s) of existing dip(s), (5) decrease value(s) of existing peak(s), (6) decrease value(s) of existing dip(s), and/or other changes.
The interest curve component 112 may modify the interest curve based on the activity metric modifier and/or other information: I′=(1−maxC)·I+maxC·A, where I′=modified interest curve value; I=original interest curve value (e.g., base measure of interest); maxC=prediction accuracy of the most likely activity from the coarse-grained classifier (e.g., 0.60 for 60% accuracy in prediction of a surfing activity); and A=activity-specific modifier value.
For example,
In some implementations, the interest curve component 112 may modify the interest curve further based on the emotion metric modifier (e.g., multiplying I′ by the emotion metric multiplier (E)). In some implementations, the interest curve component 112 may modify the interest curve further based on the orientation metric modifier (e.g., multiplying I′ by the orientation metric multiplier (O)). For example, modification of the interest curve further based on the emotion metric modifier and the orientation metric modifier may be given by: I″=O·E·I′. Other combinations of activity metric modifier, emotion metric modifier, orientation metric modifier, and/or other information are contemplated to modify the interest curve.
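Putting the modifiers together, the combined modification may be sketched as follows (one possible combination, per the formulas above; other combinations are contemplated):

```python
def modified_interest(i, max_c, a, e=1.0, o=1.0):
    # I'  = (1 - maxC) * I + maxC * A   (activity metric modifier)
    # I'' = O * E * I'                  (orientation and emotion multipliers)
    i_prime = (1 - max_c) * i + max_c * a
    return o * e * i_prime
```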
In some implementations, changes between the original interest curve and the modified interest curve may be used for video content analysis/editing/sharing. The changes between the original interest curve and the modified interest curve may be used to identify portions within the video content for automatic/suggested analysis, editing, and/or sharing. For example, referring to
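A minimal sketch of such suggestion logic follows; the threshold and the use of a simple per-moment difference are assumptions, as the disclosure does not specify how the change between the curves is converted into suggested portions:

```python
def suggested_moments(original, modified, threshold=0.5):
    """Return indices of moments whose interest rose by more than threshold."""
    return [t for t, (i, i2) in enumerate(zip(original, modified))
            if (i2 - i) > threshold]
```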
In some implementations, video content may include one or more of spherical video content, virtual reality content, and/or other video content. Spherical video content and/or virtual reality content may define visual content viewable from one or more points of view as a function of progress through the spherical/virtual reality video content.
Spherical video content may refer to a video capture of multiple views from a single location. Spherical video content may include a full spherical video capture (360 degrees of capture) or a partial spherical video capture (less than 360 degrees of capture). Spherical video content may be captured through the use of one or more cameras/image sensors to capture images/videos from a location. The captured images/videos may be stitched together to form the spherical video content.
Virtual reality content may refer to content that may be consumed via a virtual reality experience. Virtual reality content may associate different directions within the virtual reality content with different viewing directions, and a user may view a particular direction within the virtual reality content by looking in a particular direction. For example, a user may use a virtual reality headset to change the user's direction of view. The user's direction of view may correspond to a particular direction of view within the virtual reality content. For example, a forward-looking direction of view for a user may correspond to a forward direction of view within the virtual reality content.
Spherical video content and/or virtual reality content may have been captured at one or more locations. For example, spherical video content and/or virtual reality content may have been captured from a stationary position (e.g., a seat in a stadium). Spherical video content and/or virtual reality content may have been captured from a moving position (e.g., a moving bike). Spherical video content and/or virtual reality content may include video capture from a path taken by the capturing device(s) in the moving position. For example, spherical video content and/or virtual reality content may include video capture from a person walking around in a music festival.
While the description herein may be directed to video content, one or more other implementations of the system/method described herein may be configured for other types of media content. Other types of media content may include one or more of audio content (e.g., music, podcasts, audio books, and/or other audio content), multimedia presentations, images, slideshows, visual content (one or more images and/or videos), and/or other media content.
Implementations of the disclosure may be made in hardware, firmware, software, or any suitable combination thereof. Aspects of the disclosure may be implemented as instructions stored on a machine-readable medium, which may be read and executed by one or more processors. A machine-readable medium may include any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computing device). For example, a tangible computer readable storage medium may include read only memory, random access memory, magnetic disk storage media, optical storage media, flash memory devices, and others, and a machine-readable transmission media may include forms of propagated signals, such as carrier waves, infrared signals, digital signals, and others. Firmware, software, routines, or instructions may be described herein in terms of specific exemplary aspects and implementations of the disclosure, and performing certain actions.
Although the processor 11 and the electronic storage 12 are shown to be connected to the interface 13 in
Although the processor 11 is shown in
It should be appreciated that although computer components are illustrated in
While the computer program components are described herein as being implemented via processor 11 through machine readable instructions 100, this is merely for ease of reference and is not meant to be limiting. In some implementations, one or more functions of computer program components described herein may be implemented via hardware (e.g., dedicated chip, field-programmable gate array) rather than software. One or more functions of computer program components described herein may be software-implemented, hardware-implemented, or software and hardware-implemented.
The description of the functionality provided by the different computer program components described herein is for illustrative purposes, and is not intended to be limiting, as any of computer program components may provide more or less functionality than is described. For example, one or more of computer program components may be eliminated, and some or all of its functionality may be provided by other computer program components. As another example, the processor 11 may be configured to execute one or more additional computer program components that may perform some or all of the functionality attributed to one or more of computer program components described herein.
In some implementations, some or all of the functionalities attributed herein to the system 10 may be provided by external resources not included in the system 10. External resources may include hosts/sources of information, computing, and/or processing and/or other providers of information, computing, and/or processing outside of the system 10.
The electronic storage media of the electronic storage 12 may be provided integrally (i.e., substantially non-removable) with one or more components of the system 10 and/or removable storage that is connectable to one or more components of the system 10 via, for example, a port (e.g., a USB port, a Firewire port, etc.) or a drive (e.g., a disk drive, etc.). The electronic storage 12 may include one or more of optically readable storage media (e.g., optical disks, etc.), magnetically readable storage media (e.g., magnetic tape, magnetic hard drive, floppy drive, etc.), electrical charge-based storage media (e.g., EPROM, EEPROM, RAM, etc.), solid-state storage media (e.g., flash drive, etc.), and/or other electronically readable storage media. The electronic storage 12 may be a separate component within the system 10, or the electronic storage 12 may be provided integrally with one or more other components of the system 10 (e.g., the processor 11). Although the electronic storage 12 is shown in
In some implementations, method 200 may be implemented in one or more processing devices (e.g., a digital processor, an analog processor, a digital circuit designed to process information, a central processing unit, a graphics processing unit, a microcontroller, an analog circuit designed to process information, a state machine, and/or other mechanisms for electronically processing information). The one or more processing devices may include one or more devices executing some or all of the operations of method 200 in response to instructions stored electronically on one or more electronic storage media. The one or more processing devices may include one or more devices configured through hardware, firmware, and/or software to be specifically designed for execution of one or more of the operations of method 200.
Referring to
At operation 202, the activity captured within the video content may be identified. In some implementations, operation 202 may be performed by a processor component the same as or similar to the activity component 104 (Shown in
At operation 203, the sub-activity captured within the video content may be identified. In some implementations, operation 203 may be performed by a processor component the same as or similar to the activity component 104 (Shown in
At operation 204, an activity metric modifier for the interest curve at the one or more moments in the progress length may be determined based on the identification of the activity and the identification of the sub-activity. In some implementations, operation 204 may be performed by a processor component the same as or similar to the metric component 110 (Shown in
At operation 205, the interest curve may be modified at the one or more moments based on the activity metric modifier. In some implementations, operation 205 may be performed by a processor component the same as or similar to the interest curve component 112 (Shown in
Although the system(s) and/or method(s) of this disclosure have been described in detail for the purpose of illustration based on what is currently considered to be the most practical and preferred implementations, it is to be understood that such detail is solely for that purpose and that the disclosure is not limited to the disclosed implementations, but, on the contrary, is intended to cover modifications and equivalent arrangements that are within the spirit and scope of the appended claims. For example, it is to be understood that the present disclosure contemplates that, to the extent possible, one or more features of any implementation can be combined with one or more features of any other implementation.