Claims
- 1. A method for performing content-based temporal segmentation of video sequences comprising the steps of:
     (a) transmitting the video sequence to a processor;
     (b) detecting a plurality of shot boundary locations, wherein a frame color histogram difference and a frame pixel difference are computed for the video sequence, and a clustering operation is used to generate two distinct classes of shot boundary and non-shot boundary locations;
     (c) detecting within the video sequence uniform intensity temporal segments;
     (d) detecting within the video sequence fade in and fade out temporal segments by analyzing temporal frame properties, starting from an endpoint of the identified uniform intensity temporal segments;
     (e) detecting within the video sequence dissolve temporal segments;
     (f) refining and combining individual temporal segment locations to produce a single list of the individual temporal segment locations contained in the video sequence, including checking against detected uniform intensity segment locations, fade segment locations, and dissolve segment locations; and
     (g) outputting the single list of individual temporal segment locations within the video sequence including shot boundary locations, fade-in and fade-out segment locations, uniform intensity segment locations, and dissolve segment locations.
- 2. A method for performing content-based temporal segmentation of video sequences comprising the steps of:
     (a) transmitting the video sequence to a processor;
     (b) detecting a plurality of shot boundary locations, wherein a frame color histogram difference and a frame pixel difference are computed for the video sequence, and a clustering operation is used to generate two distinct classes of shot boundary and non-shot boundary locations;
     (c) identifying within the video sequence a plurality of groups of uniform frames by analyzing color properties of temporal frames;
     (d) detecting the content of a plurality of adjacent uniform frames, and refining the plurality of groups of uniform frames to detect a plurality of uniform intensity temporal segments;
     (e) analyzing the color properties of temporal frames starting from an endpoint of the uniform intensity temporal segments to detect fade-in and fade-out temporal segments;
     (f) detecting within the video sequence dissolve temporal segments;
     (g) refining and combining individual temporal segment locations to produce a single list of the individual temporal segment locations contained in the video sequence, including:
          (g1) removing from a list of shot boundary locations any frame that has been detected as a shot boundary and that also has been flagged as part of a uniform intensity segment, a fade segment, or a dissolve segment;
          (g2) dropping adjacent shot boundaries that are closer than a predefined number of frames;
          (g3) eliminating detected spurious shot boundaries resulting from sudden increases in frame luminance characteristics; and
     (h) outputting the single list of individual temporal segment locations within the video sequence including the detected shot boundary locations, fade-in and fade-out temporal segment locations, uniform intensity segment locations, and dissolve segment locations.
- 3. The method of claim 2, wherein the step of identifying the plurality of groups of uniform frames includes analyzing temporal frame color component histogram variance.
- 4. The method of claim 2, wherein the step of analyzing the color properties of temporal frames for detection of fade-in and fade-out temporal segments includes analyzing a difference in color histogram variance between adjacent temporal frames.
- 5. The method of claim 1, wherein the step of identifying the uniform intensity temporal segments includes analyzing temporal frame color component histogram variance.
- 6. The method of claim 1, wherein the step of analyzing the temporal frame properties for detection of fade-in and fade-out temporal segments includes analyzing a difference in color histogram variance between adjacent temporal frames.
- 7. A computer storage medium having instructions stored therein for causing a computer to perform the method of claim 1.
- 8. A computer storage medium having instructions stored therein for causing a computer to perform the method of claim 2.
- 9. A computer storage medium having instructions stored therein for causing a computer to perform the method of claim 3.
- 10. A computer storage medium having instructions stored therein for causing a computer to perform the method of claim 4.
- 11. A computer storage medium having instructions stored therein for causing a computer to perform the method of claim 5.
- 12. A computer storage medium having instructions stored therein for causing a computer to perform the method of claim 6.
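For illustration only, step (b) of claims 1 and 2 and sub-steps (g1) and (g2) of claim 2 can be sketched as follows. This is a minimal sketch under stated assumptions, not the patented implementation. The claims do not specify the histogram binning, the distance measures, or the clustering algorithm, so the grayscale 16-bin histogram (standing in for the claimed color histogram), the L1 and mean-absolute differences, the two-centroid k-means, and every function name here are hypothetical choices. For (g2) the sketch assumes the earlier of two close boundaries is the one kept.

```python
from typing import List, Sequence, Set, Tuple

Frame = Sequence[int]  # assumption: one frame = flat list of 8-bit intensities


def histogram(frame: Frame, bins: int = 16) -> List[float]:
    """Normalized intensity histogram (grayscale stand-in for a color histogram)."""
    counts = [0] * bins
    for p in frame:
        counts[min(p * bins // 256, bins - 1)] += 1
    return [c / len(frame) for c in counts]


def frame_differences(frames: Sequence[Frame]) -> List[Tuple[float, float]]:
    """(histogram difference, pixel difference) for each adjacent frame pair."""
    feats = []
    for prev, cur in zip(frames, frames[1:]):
        hist_diff = sum(abs(a - b) for a, b in zip(histogram(prev), histogram(cur)))
        pixel_diff = sum(abs(a - b) for a, b in zip(prev, cur)) / (255 * len(cur))
        feats.append((hist_diff, pixel_diff))
    return feats


def sq_dist(p: Tuple[float, float], q: Tuple[float, float]) -> float:
    """Squared Euclidean distance between two 2-D feature points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def two_class_kmeans(points: List[Tuple[float, float]], iters: int = 20) -> List[int]:
    """Label each point 0 (non-boundary) or 1 (boundary) with 2-means clustering."""
    # Seed the centroids with the smallest and largest difference pairs.
    centroids = [min(points, key=sum), max(points, key=sum)]
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [
            0 if sq_dist(p, centroids[0]) <= sq_dist(p, centroids[1]) else 1
            for p in points
        ]
        for k in (0, 1):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centroids[k] = (
                    sum(m[0] for m in members) / len(members),
                    sum(m[1] for m in members) / len(members),
                )
    return labels


def detect_shot_boundaries(frames: Sequence[Frame]) -> List[int]:
    """Indices of frames that start a new shot (step (b))."""
    feats = frame_differences(frames)
    if not feats:
        return []
    labels = two_class_kmeans(feats)
    return [i + 1 for i, lab in enumerate(labels) if lab == 1]


def refine_boundaries(boundaries: Sequence[int], flagged: Set[int], min_gap: int) -> List[int]:
    """Sub-steps (g1) and (g2): drop flagged frames, then boundaries that are too close."""
    kept: List[int] = []
    for b in sorted(boundaries):
        if b in flagged:  # (g1) frame belongs to a uniform, fade, or dissolve segment
            continue
        if kept and b - kept[-1] < min_gap:  # (g2) assumed policy: keep the earlier one
            continue
        kept.append(b)
    return kept


# Five dark frames followed by five bright frames: one hard cut at frame 5.
frames = [[10] * 4] * 5 + [[200] * 4] * 5
cuts = detect_shot_boundaries(frames)  # [5]
```

Because the clustering always splits the difference pairs into two groups whenever they vary, a sequence with no true cut would still yield a "boundary" class in this sketch; refinement steps such as (g1) to (g3) are what the claims rely on to remove such spurious detections.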
CROSS REFERENCE TO RELATED APPLICATIONS
This is a divisional of application Ser. No. 08/902,545, filed Jul. 29, 1997, now U.S. Pat. No. 6,195,458, by Warnick et al., entitled A METHOD FOR CONTENT-BASED TEMPORAL SEGMENTATION OF VIDEO.
US Referenced Citations (4)
| Number | Name | Date | Kind |
| --- | --- | --- | --- |
| 5778108 | Coleman, Jr. | Jul 1998 | A |
| 5959697 | Coleman, Jr. | Sep 1999 | A |
| 6195458 | Warnick et al. | Feb 2001 | B1 |
| 6351556 | Loui et al. | Feb 2002 | B1 |
Non-Patent Literature Citations (1)
Zhang et al., Automatic partitioning of full-motion video, Multimedia Systems 1:10-28, Apr. 10, 1993.