The present invention relates to descriptions of audio-visual material.
Digital audiovisual material is becoming increasingly available to users through digital TV broadcast, digital video cameras, digital video discs, and personal computer based access to multimedia on the Internet or other network. In addition, persistent large-volume storage and non-linear access to audiovisual content is becoming available in consumer devices. Consequently, there is a need for rapid navigation and searching capabilities to enable users to efficiently discover and consume the contents of audiovisual material.
The extensive proliferation of audio-visual material available to users has the potential to overwhelm the consumer and lead to frustration at the inability to search and view content in an efficient manner. Viewing summaries of the content allows the consumer to skip irrelevant content and view the desired content quickly and easily. Further, multiple different summaries, if available, may provide the user with alternative views of a particular program that the user could choose from depending on personal preferences or usage conditions.
Limited summary selection capabilities are appearing more frequently in current technologies, such as the digital video disk (DVD). DVD movies normally provide “scene selections” or “chapter selections” that have a visual array of thumbnails and textual titles associated with each scene. This permits the user to click on the thumbnail of the desired scene, jump to that scene, and start playback. Playback typically continues until the end of the movie, unless the user makes another selection. While somewhat limited, these features provide the capability to index for the purpose of jumping to an arbitrary position and continue playback from that position.
Referring to
To define these interactions, a set of description schemes containing data describing the content of the material may be used. User preferences 12 may be used in several different areas to maximize both the user's enjoyment and the system functionality. The preferences describing the topics and subject matter of interest to the user is used in both searching for and navigating the audiovisual programs 14. These two sets of data, the user preferences 12 and program descriptions 14, are correlated in the filtering and search engine 16 to identify the preferred programs.
The programs identified by the filtering and search engine 16 are then forwarded to a browsing module 18 along with the user's browsing preferences. Another output of the filtering and search engine 16 are preferred programs that the user has designated for storage. These are stored in the storage module 20. The programs selected by the user with the browsing module 18 are then sent to a display 22. The user may utilize multimedia title descriptions of preferred programs to navigate among the programs that the user wants to consume. Once a program is selected, a summary description of that particular program is correlated with user's browsing preferences to offer the user a preferred summary.
The display 22 receives the programs and displays them in accordance with the user's device preferences as to the operation of the display. User's device preferences may include, for example, device settings such as volume setting that may vary with the genre of the program that is being consumed. The display and user's interaction with the display, such as stopping a program before its end and consuming certain types of programs with certain device settings, also provides information in a manner analogous to a feedback loop to update and log the usage history 24. The usage history 24 may be mapped against the preferences by a mapping module 26. This information is then used in conjunction with user inputs by the user preference module 12.
These user preferences may be useful in many contexts, not just an audiovisual presentation system. User preferences and usage history may be transmitted to the provider of audiovisual programs 14 to receive selected programming or directly receive program segments that are preferred by the user. In the latter case, user preferences may be correlated with summary descriptions at the provider side to select and directly deliver summarized audiovisual programs to the user. The preferences may also be transferred to a “smart card” 28 or similar, portable storage and ultimately transferred to another system by the user.
However, it is noted that a framework for the description of the individual description schemes at the user, program, or device level are needed. As illustrated in
The present inventors came to the realization that the previously existing searching description scheme (segment description scheme in MPEG-7) and the navigation description scheme (summary description scheme in MPEG-7), as described in MPEG-7, are inconsistent with one another if it is desirable to navigate portions of a video and simultaneously obtain information regarding the content of those portions. In particular, MPEG-7 does not provide sufficient syntax structures to physically or logically link the segments identified by the navigation description scheme with the video identified by the searching description scheme. Referring to
The description scheme structure within MPEG-7 permits a hierarchical nesting structure for segment descriptions of the video and descriptions for groups of segments of the video. The permitted hierarchical nesting structure is very flexible and permits nearly any desirable interrelationship to be defined. Referring to
To overcome the non-deterministic nature of the hierarchical structure the preferred system imposes at least one or more of the following restrictions. A segment group may reference either other segments or other segment groups, but not both, as illustrated in
With the set of permissible interconnections limited a set of rules is useful in order to permit the user to view the available content in an organized manner or otherwise select a set of segments for presentation in a particular order. In this manner the playback, navigation, and presentation order may be unambiguously defined. Therefore different systems will interpret the segmentation data the same. One type of organizational technique is to define segments type “alternativeGroups”, which may not contain segments and shall only contain subgroups. The user may select one of the groups from the set of groups at the same hierarchical level originating from the same parent group. Referring to
Another type of organizational technique is to define a set of segments of type “tableOfContents”, which presents the groups and segments defined therein in an ordered manner such that the hierarchical order may be observed. For example, a portion of the ordered groups shown in
Portions of the hierarchical structure of segment groups may be designated as “alternativeGroups” and other portions of the hierarchical structure may be designated as “tableOfContents”. Preferably, the two different designations of the hierarchical structure are non-overlapping, but may be overlapping, if desired. These designations are preferably not directly associated with segments.
Existing video summarization systems provide segmentation data for each video and permit the selective viewing of each video according to the segmentation data. While beneficial, the present inventors determined that facilitating the grouping of segments from a plurality of different programs to be viewed within a single presentation defined by a single description scheme is beneficial and not previously possible. A “virtual program” consisting of segments from a plurality of different programs may be dynamically constructed and presented, without the need for physically creating the program on a persistent storage medium. Thus the description scheme syntax may facilitate the identification of the “virtual program” where the relevant segments may be located with multiple different segments of multiple different programs being identified within a single description scheme syntax, as illustrated in
Principally, existing description schemes for audiovisual content permit linking to external content, such as a web site of a news article on a relevant topic or other material of related interest to the content. However, it was determined that more focused external content may be selected for the user if the content was associated with segments of the video, as opposed to the entire video. In this manner, a single video may include multiple links to external content, each link being associated with a different portion of the video.
Another issue that arises with respect to selecting segments within multiple different video streams is the different techniques that may be used to indicate “time”. For example, the frame rate is not always the same, with movie film typically being 24 frames per second and television being 30 frames per second (each frame consisting of two fields). In addition, the time base for MPEG-1, DVD's, MPEG-2, VCR, Movies, Internet based streaming media, etc., are not the same. Accordingly, the modified description scheme includes a time base indicator of the time base associated with the particular segment, as illustrated in
The following may be used to describe the descriptive properties of segments:
The terms and definitions may be as follows:
The following element and complex type may be used to define a segment.
The terms and definitions may be as follows:
The following element and complex types may be used to define segment grouping.
The names and definitions may be as follows:
The allowed types may be defined as follows:
Various validity constraints may be imposed on the proposed description scheme to ensure that (i) it fits the data model of
These validity constraints reduce the complexity of the resulting descriptions by limiting the degree of nesting in the hierarchy. The navigation order of segments or segment groups is determined by the order of references to the segments in a segment group.
The entities may be defined as follows:
The entity relationships may be defined, as follows.
The following element and complex type define a structure for holding segmentation-related metadata.
The names and definitions may be as follows:
The terms and expressions employed in the foregoing specification are used therein as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding equivalents of the features shown and described or portions thereof, it being recognized that the scope of the invention is defined and limited only by the claims that follow.
This is a continuation of U.S. patent application Ser. Nos. 10/058,869 filed Jan. 28, 2002, now abandoned which claims the benefit of 60/269,786 filed Feb. 18, 2001 for SEGMENTATION METADATA FOR AUDIO-VISUAL CONTENT.
Number | Name | Date | Kind |
---|---|---|---|
4183056 | Evans et al. | Jan 1980 | A |
4253108 | Engel | Feb 1981 | A |
4298884 | Reneau | Nov 1981 | A |
4321635 | Tsuyuguchi | Mar 1982 | A |
4520404 | von Kohorn | May 1985 | A |
4729044 | Kiesel | Mar 1988 | A |
4937685 | Barker et al. | Jun 1990 | A |
5027400 | Baji et al. | Jun 1991 | A |
5101364 | Davenport et al. | Mar 1992 | A |
5109482 | Bohrman | Apr 1992 | A |
5148154 | MacKay et al. | Sep 1992 | A |
5200825 | Perine | Apr 1993 | A |
D348251 | Hendricks | Jun 1994 | S |
5333091 | Iggulden et al. | Jul 1994 | A |
5339393 | Duffy et al. | Aug 1994 | A |
D354059 | Hendricks | Jan 1995 | S |
5424770 | Schmelzer et al. | Jun 1995 | A |
5434678 | Abecassis | Jul 1995 | A |
5452016 | Ohara et al. | Sep 1995 | A |
D368263 | Hendricks | Mar 1996 | S |
5521841 | Arman et al. | May 1996 | A |
5559549 | Hendricks et al. | Sep 1996 | A |
5589945 | Abecassis | Dec 1996 | A |
5600364 | Hendricks et al. | Feb 1997 | A |
5600573 | Hendricks et al. | Feb 1997 | A |
5610653 | Abecassis | Mar 1997 | A |
5634849 | Abecassis | Jun 1997 | A |
5635982 | Zhang et al. | Jun 1997 | A |
D381991 | Hendricks | Aug 1997 | S |
5654769 | Ohara et al. | Aug 1997 | A |
5659350 | Hendricks et al. | Aug 1997 | A |
5664046 | Abecassis | Sep 1997 | A |
5664227 | Mauldin et al. | Sep 1997 | A |
5675752 | Scott et al. | Oct 1997 | A |
5682195 | Hendricks et al. | Oct 1997 | A |
5684918 | Abecassis | Nov 1997 | A |
5696869 | Abecassis | Dec 1997 | A |
5710884 | Dedrick | Jan 1998 | A |
5717814 | Abecassis | Feb 1998 | A |
5724472 | Abecassis | Mar 1998 | A |
5734853 | Hendricks et al. | Mar 1998 | A |
5761881 | Wall | Jun 1998 | A |
5774357 | Hoffberg et al. | Jun 1998 | A |
5778108 | Coleman, Jr. | Jul 1998 | A |
5797001 | Augenbraun et al. | Aug 1998 | A |
5798785 | Hendricks et al. | Aug 1998 | A |
5805733 | Wang et al. | Sep 1998 | A |
5821945 | Yeo et al. | Oct 1998 | A |
D402310 | Hendricks | Dec 1998 | S |
5861881 | Freeman et al. | Jan 1999 | A |
5867386 | Hoffberg et al. | Feb 1999 | A |
5875107 | Nagai et al. | Feb 1999 | A |
5875108 | Hoffberg et al. | Feb 1999 | A |
5892536 | Logan et al. | Apr 1999 | A |
5900867 | Schindler et al. | May 1999 | A |
5901246 | Hoffberg et al. | May 1999 | A |
5903454 | Hoffberg et al. | May 1999 | A |
5913013 | Abecassis | Jun 1999 | A |
5920300 | Yamazaki et al. | Jul 1999 | A |
5920477 | Hoffberg | Jul 1999 | A |
5923365 | Tamir et al. | Jul 1999 | A |
5926624 | Katz et al. | Jul 1999 | A |
5933811 | Angles et al. | Aug 1999 | A |
5956026 | Ratakonda | Sep 1999 | A |
5958006 | Eggleston et al. | Sep 1999 | A |
5959681 | Cho | Sep 1999 | A |
5959697 | Coleman, Jr. | Sep 1999 | A |
5969755 | Courtney | Oct 1999 | A |
5973683 | Cragun et al. | Oct 1999 | A |
5986690 | Hendricks | Nov 1999 | A |
5986692 | Logan et al. | Nov 1999 | A |
5987211 | Abecassis | Nov 1999 | A |
5990927 | Hendricks et al. | Nov 1999 | A |
5990980 | Golin | Nov 1999 | A |
5995095 | Ratakonda | Nov 1999 | A |
6002211 | Abecassis | Dec 1999 | A |
6002833 | Abecassis | Dec 1999 | A |
6011895 | Abecassis | Jan 2000 | A |
6014183 | Hoang | Jan 2000 | A |
6038367 | Abecassis | Mar 2000 | A |
6052554 | Hendricks et al. | Apr 2000 | A |
6055018 | Swan | Apr 2000 | A |
6067401 | Abecassis | May 2000 | A |
6072934 | Abecassis | Jun 2000 | A |
6081750 | Hoffberg et al. | Jun 2000 | A |
6088455 | Logan et al. | Jul 2000 | A |
6091886 | Abecassis | Jul 2000 | A |
RE36801 | Logan et al. | Aug 2000 | E |
6100941 | Dimitrova et al. | Aug 2000 | A |
6141041 | Carlbom et al. | Oct 2000 | A |
6141060 | Honey et al. | Oct 2000 | A |
6144375 | Jain et al. | Nov 2000 | A |
6151444 | Abecassis | Nov 2000 | A |
D435561 | Pettigrew et al. | Dec 2000 | S |
6160989 | Hendricks et al. | Dec 2000 | A |
6161142 | Wolfe et al. | Dec 2000 | A |
6169542 | Hooks et al. | Jan 2001 | B1 |
6181335 | Hendricks et al. | Jan 2001 | B1 |
6195497 | Nagasaki et al. | Feb 2001 | B1 |
6201536 | Hendricks et al. | Mar 2001 | B1 |
6208805 | Abecassis | Mar 2001 | B1 |
6215526 | Barton et al. | Apr 2001 | B1 |
6216129 | Eldering | Apr 2001 | B1 |
6219837 | Yeo et al. | Apr 2001 | B1 |
6230501 | Bailey, Sr. et al. | May 2001 | B1 |
6233389 | Barton et al. | May 2001 | B1 |
6236395 | Sezan et al. | May 2001 | B1 |
6252544 | Hoffberg | Jun 2001 | B1 |
6269216 | Abecassis | Jul 2001 | B1 |
6275268 | Ellis et al. | Aug 2001 | B1 |
6289165 | Abecassis | Sep 2001 | B1 |
6304665 | Cavallaro et al. | Oct 2001 | B1 |
6304715 | Abecassis | Oct 2001 | B1 |
6342904 | Vasudevan et al. | Jan 2002 | B1 |
6363160 | Bradski et al. | Mar 2002 | B1 |
6418168 | Narita | Jul 2002 | B1 |
6490320 | Vetro et al. | Dec 2002 | B1 |
6549643 | Toklu et al. | Apr 2003 | B1 |
6556767 | Okayama et al. | Apr 2003 | B2 |
6597859 | Leinhart et al. | Jul 2003 | B1 |
6665423 | Mehrotra et al. | Dec 2003 | B1 |
6675158 | Rising et al. | Jan 2004 | B1 |
6678635 | Tovinkere et al. | Jan 2004 | B2 |
6691126 | Syeda-Mahmood | Feb 2004 | B1 |
6724933 | Lin et al. | Apr 2004 | B1 |
6741655 | Chang et al. | May 2004 | B1 |
6774917 | Foote et al. | Aug 2004 | B1 |
6829781 | Bhagavath et al. | Dec 2004 | B1 |
6931595 | Pan et al. | Aug 2005 | B2 |
6970510 | Wee et al. | Nov 2005 | B1 |
6981129 | Boggs et al. | Dec 2005 | B1 |
6993245 | Harville | Jan 2006 | B1 |
20020013943 | Haberman et al. | Jan 2002 | A1 |
20020018594 | Xu et al. | Feb 2002 | A1 |
20020069218 | Sull et al. | Jun 2002 | A1 |
20020080162 | Pan et al. | Jun 2002 | A1 |
20020083473 | Agnihotri et al. | Jun 2002 | A1 |
20020108112 | Wallace et al. | Aug 2002 | A1 |
20020120929 | Schwalb et al. | Aug 2002 | A1 |
20020141619 | Standridge et al. | Oct 2002 | A1 |
20020184220 | Teraguchi et al. | Dec 2002 | A1 |
20020194589 | Cristofalo et al. | Dec 2002 | A1 |
20030001880 | Holtz et al. | Jan 2003 | A1 |
20030026592 | Kawahara et al. | Feb 2003 | A1 |
20030081937 | Li | May 2003 | A1 |
20030177503 | Sull et al. | Sep 2003 | A1 |
20040017389 | Pan et al. | Jan 2004 | A1 |
20040088289 | Xu et al. | May 2004 | A1 |
20040125124 | Kim et al. | Jul 2004 | A1 |
20040125877 | Chang et al. | Jul 2004 | A1 |
20040227768 | Bates et al. | Nov 2004 | A1 |
Number | Date | Country |
---|---|---|
11-052267 | Feb 1999 | JP |
11-261908 | Sep 1999 | JP |
2000-013755 | Jan 2000 | JP |
WO9965237 | Dec 1999 | WO |
Entry |
---|
Qian et al., “Description Schemes for Consumer Video Applications,” ISO/IEC JTC1/SC29/WG11-MPEG-7 Proposal, Feb. 1999. |
Qian et al., Description Schemes for Consumer Video Applications, ISO/IEC JTC1/SC29/WG11-MPEG-7 Proposal, Feb. 1999. |
John S. Boreczky and Lynn D. Wilcox. “A Hidden Markov Model Framework for Video Segmentation Using Audio and Image Features,” FX Palo Alto Laboratory, Palo Alto, CA 94304 USA, 4 pages, Date Unknown. |
Michael G. Christel, Alexander G. Hauptmann, Adrienne S. Warmack, Scott A. Crosby, “Adjustable Filmstrips and Skims as Abstractions for a Digital Video Library,” Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213 USA, 7 pages, Date Unknown. |
Peng Xu, Shih-Fu Chang, “Algorithms and System for High-Level Structure Analysis and Event Detection in Soccer Video,” Cloumbia University, ADVENT—Technical Report #111, Jun. 2001, 11 pages. |
Dennis Yow, Boon-Lock Yeo, Minerva Yeung and Bede Liu, “Analysis and Presentation of Soccer Highlights from Digital Video,” Department of Electrical Engineering, Princeton University, Princeton, NJ 08544, to Appear in the Proceedings, Second Asian Conference on Computer Vision (ACCV '95), 5 pages. |
Jonathan D. Courtney, “Automatic Video Indexing via Object Motion Analysis,” Pattern Recognition, vol. 30, No. 4, pp. 607-625, 1997. |
Drew D. Saur, Yap-Peng Tan, Sanjeev R. Kulkarni, and Peter J. Ramadge, “Automated Analysis and Annotation of Basketball Video,” SPIE vol. 3022, 1997, pp. 176-187. |
Hao Pan, Baoxin Li, and M. Ibrahim Sezan, “Automatic Detection of Replay Segments in Broadcast Sports Programs by Detection of Logos in Scene Transitions,” IEEE 2002, pp. IV-3385-IV-3388. |
Yihong Gong, Lim Teck Sin, Chua Hock Chuan, Hongjiang Zhang, Masao Sakauchiff,“Automatic Parsing of TV Soccer Programs,” IEEE 1995, pp. 167-174. |
Yong Rui, Anoop Gupta, and Alex Acero, “Automatically Extracting Highlights for TV Baseball Programs,” ACM Multimedia 2000, pp. 105-115. |
Nuno Vasconcelos and Andrew Lippman, “Bayesian Modeling of Video Editing and Structure: Semantic Features for Video Summarization and Browsing,” IEEE 1998, 5 pages. |
Padhraic Smyth, “Belief Networks, Hidden Markov Models, and Markov Random Fields: a Unifying View,” To appear in Pattern Recognition Letters, 1998, pp. 1-11. |
T. Lambrou, P. Kudumakis, R. Speller, M. Sandler, A. Linney, “Classification of Audio Signals Using Statistical Features on Time and Wavelet Transform Domains,” IEEE 1998, pp. 3621-3624. |
Rainer Lienhart, Comparison of Automatic Shot Boundary Detection Algorithms, Part of the IS&T/SPIE conference on Storage and Retrieval for Image and Video Databases VII, San Jose, California, Jan. 1999, SPIE vol. 3656, pp. 290-301. |
John Canny, “A Computational approach to Edge Detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-8, No. 6, Nov. 1986, pp. 679-698. |
Richard Qian, Niels Haering, and Ibrahim Sezan, “A computational approach to Semantic Event Detection,” IEEE 1999, pp. 200-206. |
F. Arman, R. Depommier, A. Hsu, and M-Y. Chiu, “Content-based Browsing of Video Sequences,” To appear in the Proceedings of ACM International Conference on Multimedia '94, Oct. 15-20, San Francisco, California, USA, 7 pages. |
Hongjiang Zhang, Stephen W. Smoliar and Jian Hua Wu, “Content-Based Video Browsing Tools,” SPIE vol. 2417, 1995, pp. 389-398. |
Stephen W. Smoliar and Hongjiang Zhang, “Content-Based Video Indexing and Retrieval,” IEEE 1994, pp. 62-72. |
Stefan Eickeler and Stefan Muller, “Content-Based Video Indexing of TV Broadcast News Using Hidden Markov Models,” Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Phoenix, AZ, 1999, 4 pages. |
Zhu Liu and Qian Huang, “Detecting News Reporting Using Audio/visual Information,” IEEE 1999, pp. 324-328. |
Y. Kawai, et al. “Detection of Replay Scenes in Broadcasted Sports Video by Focusing on Digital Video Effects,” IEICE (D-II), vol. J84-D-II, No. 2, pp. 432-435, Feb. 2001 (in Japanese). |
Vikrant Kobla, Daniel Dementhon, and David Doermann, Detection of Slow-Motion Replay Sequences for Identifying Sports Videos, Laboratory for Language and Media Processing, University of Maryland, College Park, MD 20742-3275, 1999, pp. 135-140. |
H. Pan, P. Van Beek, M.I. Sezan, “Detection of Slow-Motion Replay Segments in Sports Video for Highlights Generation,” Proceedings of IEEE International conference on Acoustics, Speech, and Signal Processing, Salt Lake City, UT, 2001, 4 pages. |
Minerva Yeung, Boon-Lock Yeo and Bede Liu, “Extracting Story Units from Long Programs for Video Browsing and Navigation,” Proceedings of Multimedia '96, 1996 IEEE, pp. 296-305. |
Boon-Lock Yeo and Bede Liu, “On the Extraction of DC Sequence from MPEG Compressed Video,” IEEE 1995, pp. 260-263. |
Frank R. Kschischang, Brendan J. Frey, and Hans-Andrea Loeliger, “Factor Graphs and the Sum-Product Algorithm,” IEEE Transactions on Information Theory, vol. 47, No. 2, Feb. 2001, pp. 498-519. |
Wayne Wolf, “Hidden Markov Model Parsing of Video Programs,” Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 2609-2611. |
John S. Boreczky and Lynn D. Wilcox, “A Hidden Markov Model Framework for Video Segmentation Using Audio and Image Features,” Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Seattle, WA, 1998, 4 pages. |
Bilge Gunsel, Yue Fu and A Murat Tekalp, “Hierarchical Temporal Video Segmentation and Content Characterization,” SPIE vol. 3229, pp. 46-56. |
M.R. Naphade et al., “A High-Performance Shot Boundary Detection Algorithm Using Multiple Cues,” Proceedings of IEEE International Conference on Image Processing, Chicago, IL 1998, pp. 884-887. |
Vikrant Kobla, Daniel Dementhon, and David Doermann, “Identifying sports videos using replay, text, and camera motion features,” Laboratory for Language and Media Processing, University of Maryland, college Park, MD 20742-3275, USA, 12 pages. |
B. B. Chaudhuri, N. Sarkar, and P. Kundu, “Improved fractal geometry based texture segmentation technique,” IEE Proceedings-E, vol. 140, No. 5, Sep. 1993, pp. 233-241. |
Toshio Kawashima et al., “Indexing of Baseball Telecast for Content-based Video Retrieval,” 1998 IEEE, pp. 871-874. |
S. E. Levinson, L. R. Rabiner, and M. M. Sondhi, “An Introduction to the Application of the Theory of Probabilistic Functions of a Markov Process to Automatic Speech Recognition,” the Bell System Technical Journal, vol. 62, No. 4, Apr. 1983, pp. 1035-1074. |
Dulce Ponceleon, et al., “Key to Effective Video Retrieval: Effective Cataloging and Browsing,” 1998 ACM Multimedia, pp. 99-107. |
Baoxin Li and M. Ibrahim Sezan, “Event Detection and Summarization in Sports Video,” Sharp Laboratories of America, 5750 NW Pacific Rim blvd., Camas, Washington 98607, 5 pages. |
Noboru Babaguchi, et al., “Linking Live and Replay Scenes in Broadcasted Sports Video,” ACM Multimedia 2000, pp. 205-208. |
Giridharan Iyengar and Andrew Lippman, “Models for automatic classification of video sequences,” SPIE vol. 3312 1997, pp. 216-227. |
Nevenka Dimitrova and Forouzan Golshani, “Motion Recovery for Video Content Classification,” ACM Transactions on Information Systems, vol. 13, No. 4, Oct. 1995, pp. 408-439. |
Yao Wang, Zhu Liu, and Jin-Cheng Huang, “Multimedia Content Analysis,” IEEE Signal Processing Magazine, Nov. 2000, pp. 12-35. |
Mark T. Maybury and Andrew E. Merlino, “Multimedia Summaries of Broadcast News,” IEEE 1997, pp. 442-449. |
Shin'Ichi Satoh and Takeo Kanade, “Name-It: Association of Face and Name in Video,” School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, Dec. 20, 1996. |
Stuart J. Golin, “New metric to detect wipes and other gradual transitions in video,” Part of the IS&T/SPIE Conference on Visual communications and Image Processing '99, Jan. 1999, SPIE vol. 3653, 6 pages. |
Ullas Gargi, Rangachar Kasturi, and Susan H. Strayer, Performance Characterization of Video-Shot-Change Detection Methods, IEEE Transactions on Circuits and Systems for Video Technology, vol. 10, No. 1, Feb. 2000, pp. 1-13. |
Michael T. Chan, Rockwell Science Center, 1049 Camino Dos Rios, Thousand Oaks, CA 91360 and You Zhang and Thomas S. Huang Department of Electrical and Computer Engineering Coordinated Science Laboratory, and Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, “Real-Time Lip Tracking and Bimodal Continuous Speech Recognition,” 6 pages, Date Unknown. |
Boon-Lock Yeo and Minerva M. Yeung, “Retrieving and Visualizing Video,” Communications of the ACM, Dec. 1997/vol. 40, No. 2, pp. 43-52. |
H. B. Lu, Y. J. Zhang, and Y. R. Yao, “Robust Gradual Scene Change Detection,” Proceedings of IEEE International Conference on Image Processing, Kobe, Japan, 1999. |
Richard J. Qian, M. Ibrahim Sezan, and Kristine E. Matthews, “A Robust Real-Time Face Tracking Algorithm,” IEEE 1998, pp. 131-135. |
Lexing Xie, “Segmentation and Event Detection in Soccer Audio,” EE 6820 Project, Soccer Audio, Columbia University, May 15, 2001, 9 pages. |
Riccardo Leonardi and Pierangelo Migliorati, “Semantic Indexing of Multimedia Documents,” IEEE 2002, Apr.-Jun. 2002, pp. 44-51. |
R. W. Picard, “A society of models for video and image libraries,” IBM Systems Journal, vol. 35, Nos. 3&4, 1996, pp. 292-312. |
Alberto Del Bimbo, Enrico Vicario and Daniele Zingoni, “A Spatial Logic for Symbolic Description of Image Contents,” Journal of Visual Languages and Computing (1994) 5, pp. 267-286. |
B. S. Manjunath and W. Y. Ma, “Texture Features for Browsing and Retrieval of Image Data,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 18, No. 8, Aug. 1996, pp. 837-842. |
Richard W. Conners and Charles A. Harlow, “A theoretical comparison of Texture Algorithms,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-2, No. 3, May 1980, pp. 204-222. |
Noboru Babaguchi, “Towards Abstracting Sports Video by Highlights,” IEEE 2000, pp. 1519-1522. |
Stephen S. Intille, “Tracking Using a Local Closed-World Assumption: Tracking in the Football Domain,” M.I.T. Media Lab Perceptual computing Group Technical Report No. 296, pp. 1-62. |
Lawrence R. Rabiner, “A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition,” Proceedings of the IEEE, vol. 77, No. 2, Feb. 1989, pp. 257-286. |
Richard O. Duda and Peter E. Hart, “Use of the Hough Transformation to Detect Lines and Curves in Pictures,” Communications of the ACM, Jan. 1972, vol. 15, No. 1, pp. 11-15. |
Rainer Lienhart, Silvia Pfeiffer, and Wolfgang Effelsberg, “Video Abstracting,” Communications of the ACM, Dec. 1997/ vol. 40, No. 12, pp. 55-62. |
Michael A. Smith and Takeo Kanade, “Video Skimming for Quick Browsing based on Audio and Image Characterization,” School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, Jul. 30, 1995, 22 pages. |
Daniel Dementhon, Vikrant Kobla and David Doermann, “Video Summarization by Curve Simplification,” ACM Multimedia 1998, pp. 211-218. |
Chung-Lin Huang and Chih-Yu Chang, “Video Summarization using Hidden Markov Model,” IEEE 2001, pp. 473-477. |
Ken Masumitsu and Tomio Echigo, “Video Summarization Using Reinforcement Learning in Eigenspace,” IBM Research, Tokyo Research Laboratory, 1623-14, Shimotsuruma, Yamato-shi, Kanagawa, Japan, 4 pages, Date Unknown. |
Minerva M. Yeung and Boon-Lock Yeo, “Video visualization for compact Presentation and Fast Browsing of Pictorial Content,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 7, No. 5, Oct. 1997, pp. 771-785. |
Stephen S. Intille and Aaron F. Bobick, “Visual Tracking Using Closed-Worlds,” MIT Media Laboratory Perceptual computing Section Technical Report No. 294, Nov. 1994, pp. 1-18. |
Sunghoon Choi, Yongduek Seo, Hyunwoo Kim, and Ki-Sang Hong, “Where are the ball and players?: Soccer Game Analysis with color-based Tracking and Image Mosaick,” Dept. of EE, Pohang University of Science and Technology, San 31 Hyoja Dong, Pohang, 790-784, Republic of Korea, pp. 1-15, Date Unknown. |
www.pvi.com, at least one year prior to filing. |
MPEG-7 Multimedia Description Schemes WD (Version 3.0), ISO/IEC JTC 1/SC 29/WG 11N3411, May 2000, Geneva. |
“DDL Working Draft 3.0,” ISO/IEC JTC1/SC29/WG11 N3391, MPEG00/May 2000 (Geneva). |
“MPEG-7 Multimedia Description Schemes XM (Version 3.0),” ISO/IEC JTC 1/SC 29/WG 11/N3410, May 2000, Geneva. |
“MPEG-7 Visual part of eXperimentation Model Version 6.0,” ISO/IEC JTC1/SC29/WG11/N3398, Geneva, Jun. 2000. |
“Visual Working Draft 3.0,” ISO/IEC JTC1/SC29/WG11/N3399, Jun. 2000, Geneva. |
Alan E. Bell, “The Dynamic Digital Disk,” IEEE Spectrum, Oct. 1999, pp. 28-35. |
MPEG-7 Decsription Schemes (V0.5), ISO/IEC JTC1/SC29/WG11/N2844, MPEG 99, Jul. 1999, Vancouver. |
“MPEG-7 Media/Meta DSs upgrade (V0.2),” ISO/IEC JTC1/SC29/WG11/MXXXX, MPEG 99, Oct. 1999, Melbourne. |
ISO/IEC/JTC1/SC 29N 3966; Mar. 12, 2001, pp. 1-508. |
ISO/IEC/JTC1/SC 29N 3705; Nov. 17, 2001, pp. 1-542. |
XML Schema, Part 1, Structures, W3C Working Draft, May 6, 1999, pp. 1-37. |
XML Schema, Part 2: Datatypes; World Wide Web Consortium Working Draft, May 6, 1999, pp. 1-37. |
Millar, et. al.: A Schema for TV-Anytime Segmentation Metadata, AN 195 Contribution from My TV:ES/TN.XTV.KM002, Issue KM0021.C; NDS Systems Division, 2000; pp. 1-27. |
Millar, et. al.,: A Schema for TV-Anytime Segmentation Metadata, AN 195r1 My TV Project; ES/TN.XTV.KM002, Issue KM0021.C; NDS Systems Division, 2000, pp. 1-28. |
Number | Date | Country | |
---|---|---|---|
20050154763 A1 | Jul 2005 | US |
Number | Date | Country | |
---|---|---|---|
60269786 | Feb 2001 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10058869 | Jan 2002 | US |
Child | 10867981 | US |