The present invention generally relates to TV recording systems, and more particularly, to ways to generate and present thumbnails derived from recorded TV programs.
With the convergence of home entertainment technologies, there are a growing number of devices that store many different forms of content, such as music, movies, pictures, TV programs, videos, games, and so forth. Devices like digital video recorders (DVRs), game consoles, and entertainment-configured computers (e.g., computers that run the Windows® XP Media Center operating system from Microsoft Corporation) enable users to record, manage, and play back many different forms of content. Even less fully featured devices, such as set-top boxes, can be designed to record multiple types of content.
As such devices are configured to store more content and offer more functionality, the ability to present the various forms of recorded content in a cohesive, understandable, and user-friendly manner continues to be a challenge. This is particularly true for recorded TV programs. Traditionally, home entertainment devices have simply provided a title or other text identifier of recorded TV programs. It would be useful if other techniques for identifying and navigating recorded TV programs were developed.
Thumbnail images representative of recorded TV programs are generated and presented to aid a user in browsing the recorded TV programs. In one implementation, a temporary thumbnail image is generated when a TV program first starts recording. The temporary thumbnail is used to quickly populate any user interface (UI) screens that reference the TV program, even as the program is recording. Once the TV program has reached a threshold amount of recording (e.g., a prescribed duration of recording or completion of the recording), a permanent thumbnail image is generated and associated with the TV program. The permanent thumbnail is then presented in any subsequent UI screens that reference the TV program, replacing the temporary thumbnail. The user can navigate among the thumbnail images presented in the various UI screens and select a particular recorded TV program by choosing the corresponding thumbnail image. According to another implementation, display of the thumbnail images within the UI screens may be further controlled by preference settings, such as parental controls.
The detailed description is provided with reference to the accompanying figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The use of the same reference numbers in different figures indicates similar or identical items.
The following disclosure describes techniques for generating and presenting thumbnail images that are representative of recorded TV programs. The thumbnail images, or simply “thumbnails”, are created from actual video content of the TV programs and hence, are visually descriptive of the recorded programs. The thumbnail images are presented in a user interface to assist a viewer in browsing among the recorded TV programs and selecting a particular recorded TV program.
The following discussion is directed generally to entertainment systems, and particularly to television-based systems, such as broadcast TV systems, interactive TV systems, cable TV systems, and Web-enabled TV systems. While aspects of the described systems and methods can be implemented in any number of television-enabled entertainment systems, and within any number and types of client devices, the systems and methods are described in the context of the following exemplary system.
The client device 102 receives and stores the media content 108 distributed by the content providers 106. In particular, the client device 102 is configured to receive and record TV programs broadcast or otherwise transmitted by the content providers 106. Examples of TV programs include news, sitcoms, comedies, TV movies, infomercials, talk shows, sporting events, and so on. The client device 102 can be implemented in many ways, including as a TV-enabled computer, a computer-based media server, a set-top box, a satellite receiver, a TV recorder with a hard disk, a digital video recorder (DVR), a game console, an information appliance, and so forth.
The client device 102 receives the media content 108 via various transmission media 110, such as satellite transmission, radio frequency transmission, cable transmission, and/or via any number of other transmission media, such as a file transfer protocol over a network (e.g., Internet or Intranet) and/or data packet communication. The client device 102 includes one or more media content inputs 112, which may include tuners that can be tuned to various frequencies or channels to receive television signals and/or Internet Protocol (IP) inputs over which streams of media content are received via an IP-based network.
The client device 102 also includes one or more processors 114 which process various instructions to control operation of client device 102, to execute applications stored on the client device, and to communicate with other electronic and computing devices. The processors 114 may further include a content processor to receive, process, and decode media content and program data. The client device 102 is also equipped with an audio/video output 116 that provides audio and video data to the display 104, or to other devices that process and/or display, or otherwise render, the audio and video data. Video and audio signals can be communicated from the client device 102 to the display 104 via an RF (radio frequency) link, S-video link, composite video link, component video link, analog audio connection, or other similar communication links.
The client device 102 is equipped with different types of memory components, including both volatile and non-volatile memory. In this example, the client device 102 has a recording media 120 and a cache 122. The recording media 120 may be implemented in many ways using various non-volatile storage media, such as hard disk drives, RAID systems, recordable and/or rewritable discs, and so forth. Cache 122 can be implemented, for example, as random access memory (RAM) for faster access during data processing in client device 102. Although not shown, the client device may further include one or more data memory components as well as a program memory to store applications.
One or more application programs can be stored in program memory and executed by the processor(s) 114. Representative applications shown in
The DVR and playback application 136 records media content received from the content providers 106 in the recording media 120. The recorded media content 140 includes TV programs that a viewer has recorded to watch at a later time. The DVR and playback application 136 also facilitates playback of the recorded media content 140 on the display 104.
The UI application 132 allows a user to browse and select content recorded on the client device 102. The UI application 132 supports interactive and graphical UI screens that identify the media content 140 stored in the recording media 120 and offer options for handling the media content 140 in some manner. For example, the UI screens might enable navigation to various recorded content (e.g., audio, still images, video, TV programs, etc.), list recently recorded content, or provide detailed information on specific content. One exemplary UI screen 142 is depicted on the display 104. This UI screen 142 shows the most recently recorded TV programs.
The EPG application 134 generates a program guide for presentation on the display 104. The program guide includes a schedule indicating when particular content will be broadcast for viewing and on which channel the content will be broadcast. The EPG application 134 enables a viewer to navigate through the program guide and locate broadcast programs, recorded programs, video on-demand programs and movies, interactive game selections, and other media access information or content of interest to the viewer. EPG data 144 is downloaded from the content providers 106 and stored in recording media 120, where it is accessed by the EPG application 134 to populate the program guide.
The thumbnail generator 130 creates thumbnail images 150 representative of the recorded TV programs and stores the thumbnail images 150 in the cache 122. The thumbnail images, or just “thumbnails”, are derived from the actual video content of the TV programs and are used by the UI application 132 and/or the EPG application 134 to visually represent the recorded TV programs in the UI screens. By storing the thumbnails 150 in the cache 122, the thumbnails 150 are available for immediate retrieval to populate the appropriate UI screens. The thumbnails 150 may alternatively, or additionally, be stored in other memory, such as the recording media 120.
In
According to one implementation, the thumbnail generator 130 is configured to create two thumbnails as the TV program is being recorded. A temporary thumbnail is derived when the TV program first begins recording. In one implementation, the temporary thumbnail is derived from a video frame extracted from a beginning portion of the TV program. For instance, the thumbnail generator 130 selects the first non-black video frame in the TV program from which to derive the temporary thumbnail. In this manner, the thumbnail image is generated within seconds and is available for immediate display in a UI screen as soon as the TV program begins recording.
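The selection of the first non-black frame can be illustrated with a minimal sketch. It assumes, purely for illustration, that each decoded frame is available as a flat sequence of 8-bit luma samples and that a frame counts as black when its mean luma is at or below a small threshold. The names `Frame`, `mean_luma`, `first_non_black_frame`, and `black_threshold` are hypothetical and do not come from the described system.

```python
from typing import Iterable, Optional, Sequence

# Hypothetical frame representation: a flat sequence of 8-bit luma samples.
Frame = Sequence[int]


def mean_luma(frame: Frame) -> float:
    """Average brightness of a frame; an empty frame is treated as black."""
    return sum(frame) / len(frame) if frame else 0.0


def first_non_black_frame(
    frames: Iterable[Frame], black_threshold: float = 16.0
) -> Optional[Frame]:
    """Return the first frame whose mean luma exceeds the threshold.

    The threshold value is an assumption chosen for illustration only.
    Returns None when every frame seen so far is black.
    """
    for frame in frames:
        if mean_luma(frame) > black_threshold:
            return frame
    return None


# Usage: the first two frames are (nearly) black, so the third is chosen.
recorded = [[0, 0, 0, 0], [10, 12, 8, 9], [120, 130, 90, 100], [200, 200, 200, 200]]
temporary_source = first_non_black_frame(recorded)
```

Because the scan stops at the first qualifying frame, only the beginning portion of the recording needs to be examined, which is consistent with the temporary thumbnail being available within seconds.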
A permanent thumbnail is subsequently generated when more of the TV program has been recorded. The permanent thumbnail is generated using video detection techniques which rely on analysis of a larger portion of the TV program. Such analysis might include, for example, avoidance of a black frame, avoidance of any frames associated with a commercial, avoidance of padding, scene analysis, examination of program metadata, and so forth. One example of a suitable technique for isolating and deriving thumbnail candidates from a recorded video program is disclosed in U.S. Pat. No. 7,986,372, entitled “Systems and Methods for Smart Media Content Thumbnail Extraction”, which was filed Aug. 2, 2004, and is assigned to Microsoft Corporation. The video detection techniques may be performed on the program after a predetermined amount is recorded (e.g., 15 minutes, 30 minutes, etc.) or after the entire program is recorded.
The thumbnail created from the TV program can be static or dynamic. A static thumbnail contains a still image generated from the video content. A dynamic thumbnail consists of multiple images generated from the TV program, which are then played in a continuous loop so that the thumbnail appears animated.
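As a rough sketch of the dynamic case, the continuous loop can be modeled as an endless iterator over the images generated from the program. The function name `dynamic_thumbnail` and the file names are hypothetical; a real UI would pace the images with a timer, which is omitted here.

```python
from itertools import cycle, islice
from typing import Iterator, Sequence


def dynamic_thumbnail(images: Sequence[str]) -> Iterator[str]:
    """Yield the thumbnail images over and over, in order, without end."""
    if not images:
        raise ValueError("a dynamic thumbnail needs at least one image")
    return cycle(images)


# Usage: take the first seven images that would be shown for a three-image loop.
shown = list(islice(dynamic_thumbnail(["a.png", "b.png", "c.png"]), 7))
```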
The temporary and permanent thumbnails 150 are stored in the cache 122 in association with corresponding TV programs so that when the UI application 132 and/or the EPG application 134 display information about a particular TV program, the corresponding thumbnail is retrieved from the cache 122 and presented in a UI screen. One way to associate the thumbnails with the TV programs is via a data structure stored in memory, such as program or data memory.
Multiple entries 220(1)-220(N) are shown in the data structure 200. Each entry contains information pertaining to a TV program. Here, entry 220(1) is for a TV program entitled “CSI: Miami”, which has an identifier of “31456” and a content rating of “TV 14”. This TV program is associated with a temporary thumbnail identified as “00032.TMP”. Entry 220(1) is an example of a TV program that has just started recording, and hence has a temporary thumbnail associated therewith.
Another entry 220(N) is for a TV program titled “Rugrats”, which has an identifier of “63290” and a content rating of “All”. This TV program is associated with a permanent thumbnail identified as “04520.PMT”. Entry 220(N) is an example of a TV program that has been recording a sufficient amount of time (or has been completely recorded), and hence has a permanent thumbnail associated therewith.
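One possible in-memory form of such entries is sketched below. The class name `ProgramEntry`, its attribute names, and the lookup dictionary are assumptions made for illustration; the example values are the two entries described above, together with the show-thumbnail flag of field 210. Treating the ".TMP" and ".PMT" suffixes as markers of the thumbnail kind is likewise an assumption drawn from the example identifiers.

```python
from dataclasses import dataclass


@dataclass
class ProgramEntry:
    """One entry of the data structure; attribute names are hypothetical."""

    program_id: str
    title: str
    rating: str
    thumbnail: str               # identifier of the associated thumbnail
    show_thumbnail: bool = True  # models the show thumbnail field 210

    @property
    def has_permanent_thumbnail(self) -> bool:
        # Assumption: the suffix distinguishes temporary from permanent.
        return self.thumbnail.upper().endswith(".PMT")


entries = [
    ProgramEntry("31456", "CSI: Miami", "TV 14", "00032.TMP", show_thumbnail=False),
    ProgramEntry("63290", "Rugrats", "All", "04520.PMT", show_thumbnail=True),
]

# Index by program identifier so a UI screen can find an entry's thumbnail.
entries_by_id = {entry.program_id: entry for entry in entries}
```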
An aspect of the described system is that the thumbnail images generated from the TV programs may or may not be displayed based on preferences established by the viewer. For instance, suppose a parent has established parental controls that prevent display of programs with a TV14 or higher rating at certain times of operation, or on certain displays viewed by younger viewers in the home. In this case, the parent would most likely prefer not to have a thumbnail image of the corresponding content displayed in various UI menus or listings. Accordingly, the show thumbnail field 210 dictates when a thumbnail should be displayed based on the rating provided in the rating field 208. In this example, if the rating is TV14 or higher, as is the case for the TV program “CSI: Miami” at entry 220(1), the entry contains a “No” data item in the show thumbnail field 210 to indicate that the associated thumbnail should not be shown. The thumbnail image can be replaced with a blank image, a logo of the TV rating, an icon, or other form of generic image. If the rating is less than TV14, as is the case for the TV program “Rugrats” at entry 220(N), a “Yes” data item is in the show thumbnail field 210 to indicate that the associated thumbnail can be shown.
In another implementation, the determination of whether to display a thumbnail based on viewer preferences may be made without maintaining a flag designator in show thumbnail field 210 for each associated recorded program (thereby eliminating the use of field 210 in data structure 200). In this alternate implementation, preferences are maintained in a separate memory location and the determination is made by dynamically comparing the rating of the recorded show against the current preferences to decide whether to depict the thumbnail at the time the thumbnail is to be displayed. In this manner, if the preferences are changed at any time, the change is made quickly in this single memory location rather than updating every flag designator in the show thumbnail field 210 for all recorded programs.
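The dynamic comparison can be sketched as follows. The rating order is an assumption: it places the "All" rating used in the example entries below the categories of the U.S. TV Parental Guidelines (TV-Y, TV-Y7, TV-G, TV-PG, TV-14, TV-MA). The names `RATING_ORDER`, `rating_rank`, `ViewerPreferences`, and `should_show_thumbnail` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed ordering, least to most restrictive; "ALL" is placed lowest.
RATING_ORDER = ["ALL", "TVY", "TVY7", "TVG", "TVPG", "TV14", "TVMA"]


def rating_rank(rating: str) -> int:
    """Map spellings such as 'TV 14', 'TV-14', or 'TV14' to one rank."""
    key = rating.upper().replace(" ", "").replace("-", "")
    return RATING_ORDER.index(key)  # raises ValueError for unknown ratings


@dataclass
class ViewerPreferences:
    """Single location holding the current preference (hypothetical)."""

    hide_at_or_above: Optional[str] = None  # None means no restriction


def should_show_thumbnail(program_rating: str, prefs: ViewerPreferences) -> bool:
    """Decide at display time, with no per-program flag stored."""
    if prefs.hide_at_or_above is None:
        return True
    return rating_rank(program_rating) < rating_rank(prefs.hide_at_or_above)


# Usage: changing the one preference object affects every later decision.
prefs = ViewerPreferences(hide_at_or_above="TV14")
hidden_now = not should_show_thumbnail("TV 14", prefs)
prefs.hide_at_or_above = None
shown_later = should_show_thumbnail("TV 14", prefs)
```

Because the decision is recomputed on each display, no stored entry has to be rewritten when the preference changes.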
For discussion purposes, the process 700 is described with reference to the system 100 described above with respect to
At block 702, the client device 102 begins recording a TV program. During this initial recording, the thumbnail generator 130 derives a temporary thumbnail image representative of the TV program (block 704). This temporary thumbnail is quickly created, for example, from the beginning portion of the TV program. The derivation involves at least minimum logic to ensure that the thumbnail generator 130 avoids extracting a black frame. Thus, the thumbnail generator 130 might be configured to extract the first non-black frame from the TV program and derive a thumbnail image from that frame.
At block 706, the temporary thumbnail image is stored in the cache 122. At block 708, the temporary thumbnail is displayed as part of any UI screens that identify recorded TV programs, such as the example UI screens 300, 400, 500, and 600 (
Accordingly, decision block 710 determines whether a threshold amount of the TV program has been recorded. This threshold amount may be measured in number of frames, duration, or other ways. For instance, the client device 102 may be configured to begin generating a permanent thumbnail after 1000 frames have been recorded, or after 15 minutes of the program have been recorded, or alternatively after the entire program has been recorded. If a sufficient portion of the TV program has not yet been recorded (i.e., the “No” branch from block 710), the client device 102 continues to use the temporary thumbnail in the cache 122.
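The decision at block 710 can be sketched as a simple predicate over the recording progress. The class and field names below are hypothetical; the example limits (1000 frames, 15 minutes, or the entire program) are the ones mentioned above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RecordingProgress:
    frames_recorded: int
    seconds_recorded: float
    complete: bool = False


@dataclass
class Threshold:
    """Configured trigger for the permanent thumbnail (names are assumptions)."""

    min_frames: Optional[int] = None
    min_seconds: Optional[float] = None
    require_complete: bool = False


def threshold_reached(progress: RecordingProgress, threshold: Threshold) -> bool:
    """Return True on the 'Yes' branch, False on the 'No' branch."""
    if progress.complete:
        return True  # a finished recording always qualifies
    if threshold.require_complete:
        return False
    if threshold.min_frames is not None and progress.frames_recorded >= threshold.min_frames:
        return True
    if threshold.min_seconds is not None and progress.seconds_recorded >= threshold.min_seconds:
        return True
    return False


# Usage: 15 minutes (900 seconds) configured, 10 minutes recorded so far.
keep_temporary = not threshold_reached(
    RecordingProgress(frames_recorded=18000, seconds_recorded=600.0),
    Threshold(min_seconds=900.0),
)
```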
Alternatively, if a sufficient portion of the TV program has been recorded (i.e., the “Yes” branch from block 710), the thumbnail generator 130 generates a permanent thumbnail representative of the recorded TV program (block 712). The thumbnail generator 130 uses video detection techniques to generate the image. It analyzes the frames of the recorded program, relying on significant logic to avoid black frames and frames from commercials, to ignore pre-padding and post-padding, to consider head shots, and to use program metadata (e.g., category/genre). Once the frames are analyzed, a representative video frame is selected and the thumbnail image is derived from that selected frame.
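A highly simplified stand-in for this analysis is sketched below. It is not the technique of the referenced patent: it assumes that an earlier analysis pass has already annotated each frame with whether it lies in a commercial or in padding and whether it contains a head shot, and it then filters and ranks the candidates with an arbitrary scoring rule. The names `FrameInfo` and `select_representative_frame`, the black threshold, and the preference for mid-range brightness are all assumptions for illustration; program metadata is not modeled.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class FrameInfo:
    """Per-frame annotations assumed to come from an earlier analysis pass."""

    index: int
    mean_luma: float      # 0 (black) to 255 (white)
    in_commercial: bool
    in_padding: bool      # pre-padding or post-padding around the program
    has_head_shot: bool


def select_representative_frame(
    frames: Iterable[FrameInfo], black_threshold: float = 16.0
) -> Optional[FrameInfo]:
    """Drop unusable frames, then prefer head shots with mid-range brightness."""
    candidates = [
        f
        for f in frames
        if f.mean_luma > black_threshold and not f.in_commercial and not f.in_padding
    ]
    if not candidates:
        return None
    # Illustrative score: head shots first, then closeness to mid-gray (128).
    return max(candidates, key=lambda f: (f.has_head_shot, -abs(f.mean_luma - 128.0)))


# Usage: frame 0 is black padding and frame 1 is a commercial, so both are skipped.
analyzed = [
    FrameInfo(0, 2.0, False, True, False),
    FrameInfo(1, 120.0, True, False, True),
    FrameInfo(2, 60.0, False, False, False),
    FrameInfo(3, 200.0, False, False, True),
    FrameInfo(4, 130.0, False, False, True),
]
chosen = select_representative_frame(analyzed)
```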
At block 714, the permanent thumbnail image is stored in the cache 122. The permanent thumbnail can be stored over the temporary thumbnail, thereby deleting the temporary thumbnail from the cache, or alternatively, stored in addition to the temporary thumbnail. In any event, at block 716, the permanent thumbnail is displayed as part of any UI screens that identify recorded TV programs, such as the example UI screens 300, 400, 500, and 600 (
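Both storage options for block 714 can be sketched with a small cache class. The class `ThumbnailCache`, its method names, and the `keep_temporary` parameter are hypothetical; image data is represented as raw bytes for illustration.

```python
from typing import Dict, Optional


class ThumbnailCache:
    """Toy model of a thumbnail cache keyed by program identifier."""

    def __init__(self) -> None:
        self._temporary: Dict[str, bytes] = {}
        self._permanent: Dict[str, bytes] = {}

    def store_temporary(self, program_id: str, image: bytes) -> None:
        self._temporary[program_id] = image

    def store_permanent(
        self, program_id: str, image: bytes, keep_temporary: bool = False
    ) -> None:
        """Store the permanent image; by default drop the temporary one."""
        self._permanent[program_id] = image
        if not keep_temporary:
            self._temporary.pop(program_id, None)

    def has_temporary(self, program_id: str) -> bool:
        return program_id in self._temporary

    def lookup(self, program_id: str) -> Optional[bytes]:
        """Prefer the permanent thumbnail; fall back to the temporary one."""
        if program_id in self._permanent:
            return self._permanent[program_id]
        return self._temporary.get(program_id)


# Usage: the temporary image is served until the permanent one is stored.
cache = ThumbnailCache()
cache.store_temporary("31456", b"temp")
before = cache.lookup("31456")
cache.store_permanent("31456", b"perm")
after = cache.lookup("31456")
```

In either option the lookup returns the permanent thumbnail once it exists, so UI screens need no knowledge of which option was chosen.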
Although the invention has been described in language specific to structural features and/or methodological acts, it is to be understood that the invention defined in the appended claims is not necessarily limited to the specific features or acts described. Rather, the specific features and acts are disclosed as exemplary forms of implementing the claimed invention.
Number | Name | Date | Kind |
---|---|---|---|
5333091 | Iggulden et al. | Jul 1994 | A |
5442633 | Perkins et al. | Aug 1995 | A |
5497430 | Sadovnik et al. | Mar 1996 | A |
5530963 | Moore et al. | Jun 1996 | A |
5625877 | Dunn et al. | Apr 1997 | A |
5642294 | Taniguchi et al. | Jun 1997 | A |
5659685 | Williams et al. | Aug 1997 | A |
5710560 | Cohn | Jan 1998 | A |
5745190 | Ioka | Apr 1998 | A |
5751378 | Chen et al. | May 1998 | A |
5774593 | Zick et al. | Jun 1998 | A |
5778137 | Nielsen et al. | Jul 1998 | A |
5801765 | Gotoh et al. | Sep 1998 | A |
5821945 | Yeo et al. | Oct 1998 | A |
5835163 | Liou et al. | Nov 1998 | A |
5884056 | Steele | Mar 1999 | A |
5900919 | Chen et al. | May 1999 | A |
5901245 | Warnick et al. | May 1999 | A |
5911008 | Niikura et al. | Jun 1999 | A |
5920360 | Coleman, Jr. | Jul 1999 | A |
5952993 | Matsuda et al. | Sep 1999 | A |
5956026 | Ratakonda | Sep 1999 | A |
5959697 | Coleman, Jr. | Sep 1999 | A |
5966126 | Szabo | Oct 1999 | A |
5983273 | White et al. | Nov 1999 | A |
5990980 | Golin | Nov 1999 | A |
5995095 | Ratakonda | Nov 1999 | A |
6020901 | Lavelle et al. | Feb 2000 | A |
6047085 | Sato et al. | Apr 2000 | A |
6072496 | Guenter et al. | Jun 2000 | A |
6100941 | Dimitrova et al. | Aug 2000 | A |
6166735 | Dom et al. | Dec 2000 | A |
6168273 | Dupraz et al. | Jan 2001 | B1 |
6182133 | Horvitz | Jan 2001 | B1 |
6232974 | Horvitz et al. | May 2001 | B1 |
6236395 | Sezan et al. | May 2001 | B1 |
6282317 | Luo et al. | Aug 2001 | B1 |
6292589 | Chow et al. | Sep 2001 | B1 |
6298145 | Zhang et al. | Oct 2001 | B1 |
6307550 | Chen et al. | Oct 2001 | B1 |
6332163 | Bowman-Amuah | Dec 2001 | B1 |
6353824 | Boguraev et al. | Mar 2002 | B1 |
6389168 | Altunbasak et al. | May 2002 | B2 |
6404925 | Foote et al. | Jun 2002 | B1 |
6408128 | Abecassis | Jun 2002 | B1 |
6421675 | Ryan et al. | Jul 2002 | B1 |
6449251 | Awadallah et al. | Sep 2002 | B1 |
6462754 | Chakraborty et al. | Oct 2002 | B1 |
6466702 | Atkins et al. | Oct 2002 | B1 |
6473778 | Gibbon | Oct 2002 | B1 |
6516090 | Lennon et al. | Feb 2003 | B1 |
6581096 | Cottrille et al. | Jun 2003 | B1 |
6616700 | Thum et al. | Sep 2003 | B1 |
6622134 | Sorkin | Sep 2003 | B1 |
6631403 | Deutsch et al. | Oct 2003 | B1 |
6643643 | Lee et al. | Nov 2003 | B1 |
6643665 | Kimbell et al. | Nov 2003 | B2 |
6658059 | Iu et al. | Dec 2003 | B1 |
6661468 | Alten et al. | Dec 2003 | B2 |
6670963 | Osberger | Dec 2003 | B2 |
6691106 | Sathyanarayan | Feb 2004 | B1 |
6711587 | Dufaux | Mar 2004 | B1 |
6714909 | Gibbon et al. | Mar 2004 | B1 |
6721454 | Qian et al. | Apr 2004 | B1 |
6773778 | Onozawa et al. | Aug 2004 | B2 |
6792144 | Yan et al. | Sep 2004 | B1 |
6807361 | Girgensohn et al. | Oct 2004 | B1 |
6870956 | Qi et al. | Mar 2005 | B2 |
6934415 | Stentiford | Aug 2005 | B2 |
6938268 | Hodge | Aug 2005 | B1 |
6956573 | Bergen et al. | Oct 2005 | B1 |
7006091 | Masera et al. | Feb 2006 | B2 |
7055166 | Logan et al. | May 2006 | B1 |
7062705 | Kirkwood et al. | Jun 2006 | B1 |
7065707 | Chen et al. | Jun 2006 | B2 |
7069310 | Bartholomew | Jun 2006 | B1 |
7072984 | Polonsky et al. | Jul 2006 | B1 |
7095907 | Berkner et al. | Aug 2006 | B1 |
7116716 | Ma et al. | Oct 2006 | B2 |
7248294 | Slatter | Jul 2007 | B2 |
7296168 | Edwards | Nov 2007 | B2 |
7325199 | Reid | Jan 2008 | B1 |
7356464 | Stella et al. | Apr 2008 | B2 |
7546544 | Weber et al. | Jun 2009 | B1 |
7603626 | Williams et al. | Oct 2009 | B2 |
7639882 | Itakura | Dec 2009 | B2 |
7826709 | Moriya et al. | Nov 2010 | B2 |
20010023450 | Chu | Sep 2001 | A1 |
20010047355 | Anwar | Nov 2001 | A1 |
20020037159 | Goto et al. | Mar 2002 | A1 |
20020067376 | Martin et al. | Jun 2002 | A1 |
20020069218 | Sull et al. | Jun 2002 | A1 |
20020073218 | Aspromonte et al. | Jun 2002 | A1 |
20020100052 | Daniels | Jul 2002 | A1 |
20020116533 | Holliman et al. | Aug 2002 | A1 |
20020154833 | Koch et al. | Oct 2002 | A1 |
20020157116 | Jasinschi | Oct 2002 | A1 |
20020166123 | Schrader et al. | Nov 2002 | A1 |
20020191861 | Cheatle | Dec 2002 | A1 |
20030033347 | Bolle et al. | Feb 2003 | A1 |
20030068087 | Wu et al. | Apr 2003 | A1 |
20030115607 | Morioka et al. | Jun 2003 | A1 |
20030123850 | Jun et al. | Jul 2003 | A1 |
20030152363 | Jeannin et al. | Aug 2003 | A1 |
20030195863 | Marsh | Oct 2003 | A1 |
20030206656 | Schwartz et al. | Nov 2003 | A1 |
20030210886 | Li et al. | Nov 2003 | A1 |
20030237053 | Chen et al. | Dec 2003 | A1 |
20040001106 | Deutscher et al. | Jan 2004 | A1 |
20040039755 | Kunze | Feb 2004 | A1 |
20040039810 | Donescu et al. | Feb 2004 | A1 |
20040040041 | Crawford | Feb 2004 | A1 |
20040068481 | Seshadri et al. | Apr 2004 | A1 |
20040071083 | Li et al. | Apr 2004 | A1 |
20040078357 | LaChapelle et al. | Apr 2004 | A1 |
20040078382 | Mercer et al. | Apr 2004 | A1 |
20040078383 | Mercer et al. | Apr 2004 | A1 |
20040085341 | Hua et al. | May 2004 | A1 |
20040086046 | Ma et al. | May 2004 | A1 |
20040088726 | Ma et al. | May 2004 | A1 |
20040125877 | Chang et al. | Jul 2004 | A1 |
20040128317 | Sull et al. | Jul 2004 | A1 |
20040165784 | Xie et al. | Aug 2004 | A1 |
20040184776 | Inoue et al. | Sep 2004 | A1 |
20040236863 | Shen et al. | Nov 2004 | A1 |
20040267952 | He et al. | Dec 2004 | A1 |
20050069225 | Schneider et al. | Mar 2005 | A1 |
20050071328 | Lawrence | Mar 2005 | A1 |
20050084232 | Herberger et al. | Apr 2005 | A1 |
20050114434 | Yang et al. | May 2005 | A1 |
20050114784 | Spring et al. | May 2005 | A1 |
20050162556 | Desai | Jul 2005 | A1 |
20050169312 | Cakareski et al. | Aug 2005 | A1 |
20050175001 | Becker Hof et al. | Aug 2005 | A1 |
20050192987 | Marsh | Sep 2005 | A1 |
20050207442 | Zoest et al. | Sep 2005 | A1 |
20050257151 | Wu | Nov 2005 | A1 |
20050262439 | Cameron | Nov 2005 | A1 |
20060023748 | Chandhok et al. | Feb 2006 | A1 |
20060026528 | Paulsen et al. | Feb 2006 | A1 |
20060036759 | Shen et al. | Feb 2006 | A1 |
20060074754 | Toyohara et al. | Apr 2006 | A1 |
20060107209 | Chen et al. | May 2006 | A1 |
20060123053 | Scannell | Jun 2006 | A1 |
20060179054 | Levi | Aug 2006 | A1 |
20060190435 | Heidloff et al. | Aug 2006 | A1 |
20060190615 | Panwar et al. | Aug 2006 | A1 |
20060200442 | Parikh | Sep 2006 | A1 |
20060239644 | Barbieri | Oct 2006 | A1 |
20070027754 | Collins et al. | Feb 2007 | A1 |
20070060099 | Ramer et al. | Mar 2007 | A1 |
20070101387 | Hua et al. | May 2007 | A1 |
20070201558 | Xu et al. | Aug 2007 | A1 |
20080065751 | Hampson | Mar 2008 | A1 |
Number | Date | Country |
---|---|---|
0597450 | May 1994 | EP |
1 168 840 | Jan 2002 | EP |
1 123 915 | Jun 2002 | EP |
1478181 | Nov 2004 | EP |
2 356 080 | May 2001 | GB |
2000516435 | Dec 2000 | JP |
2001184802 | Jul 2001 | JP |
2002125199 | Apr 2002 | JP |
2002223412 | Aug 2002 | JP |
2002238027 | Aug 2002 | JP |
2004527019 | Sep 2004 | JP |
20020009089 | Feb 2002 | KR |
20030069635 | Aug 2003 | KR |
20030070480 | Aug 2003 | KR |
20040042449 | May 2004 | KR |
100511785 | Aug 2005 | KR |
20050087654 | Aug 2005 | KR |
WO9855943 | Dec 1998 | WO |
WO0028467 | May 2000 | WO |
Entry |
---|
“Visual Attention Based Image Browsing on Mobile Devices” 4 pages, Fan et al. |
“Probability Measures of Fuzzy Events” Journal of Mathematical Analysis and Applications, 23, 1968, pp. 421-427, Zadeh, L.A. |
“Deploying Visual Attention: The Guided Search Model” AI and the Eye Chapter 4, 1990, pp. 79-103, Wolfe et al. |
“Modeling Visual Attention Via Selective Tuning Artificial Intelligence” v78, 1995, pp. 507-545, Tsotsos et al. |
“Computational Architectures for Attention” The Attentive Brain Chapter 9, pp. 163-186, Nieber et al. |
“Focus on Agere System's Orinoco PC Card” International Journal of Network Management, Jan. 2002, pp. 187-193, Held, Gilbert. |
“Peer Group Filtering and Perceptual Color Image Quantization” Proceedings of IEEE International Symposium on Circuits and Systems, v4, 1999, pp. 21-24, Deng, et al. |
“A Feature-Based Algorithm for Detecting and Classifying Scene Breaks” Computer Science Department, Cornell University, 16 pages, Zabih, Ramin. |
“On the Detection and Recognition of Television Commercials” University of Mannheim, 17 pages, Lienhart et al. |
“Logo Detection in Digital Video” Math 100, Mar. 6, 2001 http://toonarchive.com/logo-detection/ 9 pages, Hargrove, Thomas. |
htpp://www.fatalfx.com/nomad/ “NOMAD (No More Advertising)” Feb. 17, 2003, 4 pages. |
“Automatic TV Advertisement Detection from MPEG Bitstream” Centre for Digital Video Processing/Research Institute for Network & Communication Eng., Dublin City University, 12 pages, Sadlier, David. |
“AudiCom: A Video Analysis System for Auditing Commercial Broadcasts” Computer Vision Center, Departament d'Informatica Univerisitat Autonoma de Barcelona, 5 pages, Sanchez, Juan Maria. |
“A Robust Audio Classification and Segmentation Method” Microsoft Research, China, 9 pages, Lu, et al. |
“Content Analysis for Audio Classification and Segmentation” IEEE, Transactions on Speech and Audio Processing, v10, No. 7, Oct. 2002, pp. 504-516, Lu, et al. |
“Content-Based Audio Segmentation Using Support Vector Machines” IEEE, 2001, pp. 956-959, Lu, et al. |
“Wireless Landscape—Need for Seamless Connectivity” Wireless Personal Communications, 2002, pp. 275-283, Jha, Uma S. |
“Pesky Home Networks Trouble Cable Behemoths” IEEE Spectrum, Apr. 2002, pp. 60-61, Cherry, Steven. |
“Threshold Selection Using a Minimal Histogram Entropy Difference” Optical Engineering, v36, No. 7, Jul. 1997, pp. 1976-1981, Sahoo, et al. |
“Attentive Mechanisms for Dynamic and Static Scene Analysis” Optical Engineering v34, No. 8, Aug. 1995, pp. 2428-2434, Milanese, et al. |
“An Effective Region-Based Image Retrieval Framework” From ACM Multimedia 2002, 28 pages, Jing, et al. |
“Statistical Learning of Multi-View Face Detection” Proceedings of ECCV 2002, 25 pages, Li, et al. |
“Visit: A Neural Model of Covert Visual Attention” Advances in Neural Information Processing Systems, v4, 1991, pp. 420-427, Ahmed, S. |
“Expectation-Based Selective Attention for Visual Monitoring and Control of a Robot Vehicle” Robotics and Autonomous System v22, No. 3-4, Dec. 1997, 26 pages, Baluja et al. |
“A User Attention Model for Video Summarization” Proceedings of ICIP, 2002, Ma, et al. |
“A Visual Attention Model for Adapting Images on Small Displays” ACM Multimedia Systems Journal, Chen, et al. |
“Video Shot Cut Detection Using Adaptive Tresholding” University of Surrey, 2000, pp. 1-10, Yusoff, et al. |
“Real-time Scene Change Detection on Compressed Multimedia Bitstream Based on Statistical Sequential Analysis” IEEE, 2000, pp. 1141-1144, Lelescu, et al. |
“Post Shot Boundary Detection Technique: Flashlight Scene Determination” University of Western Australia, 1999, pp. 447-450, Heng, et al. |
“An MPEG-1 Shot Boundary Detector Using XIL Colour Histograms” Dublin City University, 1998, pp. 1-7, O'Toole. |
“Robust scene-change detection in MPEG compressed domain” Electrical & Computer Engineering, 1998, pp. 95-99, Gamaz et al. |
“Dissolve Detection in MPEG Compressed Video” IEEE, 1997, pp. 1692-1696, Gu, et al. |
“Rapid Scene Analysis on Compressed Video” IEEE, 1995, pp. 533-544, Yeo, et al. |
“Function-based Object Model Towards Website Adaptation”, 2001, Proc. of the 10th International WWW Conference pp. 1-21, Chen, Jinlin et al. |
“The JPEG2000 Still Image Coding System: An Overview”, IEEE Transactions on Consumer Electronics, v46, No. 4 pp. 1103-1127 Nov. 2000, Christopoulos, et al. |
“Perception-based Image Transcoding for Universal Multimedia Access”, School of Electrical Engineering Korea University, Seoul, Korea, 2001 IEEE, pp. 475-478, Lee, et al. |
“A Model of Motion Attention for Video Skimming” Microsoft Research, Asia, 4 pages, Yu-Fei Ma & Hong Jiang Zhang. |
A Comparison of Feature Combination Strategies for Saliency-based Visual Attention Systems: Proc of SPIE Human Vision and Electronic Imaging IV (HVEI '99), San Jose, CA, v3644 Jan. 1999, pp. 473-482, ITTI et al. |
“A Model of Saliency-based Visual Attention for Rapid Scene Analysis”, IEEE Trans. on Pattern Analysis and Machine Intelligence, 1998, 5 pages, Itti et al. |
Video Skimming and Characterization through the Combination of Image and Language Understanding Techniques, Proc. of Computer Vision and Pattern Recognition, 1997 IEEE, pp. 775-781, Smith & Kinkade. |
“A New Perceived Motion Based Shot Content Representation”, Microsoft Research, China, 4 pages, Yu-Fei Ma & Hong Jiang Zhang. |
“Video Scene Extraction by Force Competition” IEEE International Conference on Multimedia and Expo (ICME 011) Waseda University, Tokyo, Japan, Aug. 2001, 4 pages, Lin, et al. |
“Evaluation of Automation Shot Boundary Detection on a Large Video Test Suite” School of Computer Applications & School of Electronic Engineering, Dublin City University, Glasnevin Dublin Ireland, Challenge of Image Retrieval Newcastle 1999, pp. 1-12, O'Toole et al. |
“Video-Summarization Using Descriptors of Motion Activity: A Motion Activity Based Approach to Key-Frame Extraction From Video-Shots” Journal of Electronic Imaging, Oct. 2001, v10 No. 4 pp. 909-916, Divakaran, et al. |
“Computational Modeling of Visual Attention” Nature Reviews/Neuroscience, v2, Mar. 2001, pp. 1-11, ITTI, et al. |
Bennett, et al., “Proscenium: A Framework for Spatio-Temporal Video Editing”, MM'03, Nov. 2-3, 2003, Berkeley, California, 8 pages. |
Fan, et al., “Visual Attention Based Image Browsing on Mobile Devices”, 4 pages. |
Foote, et al., “Creating Music Videos using Automatic Media Analysis”, 8 pages. |
Girgensogn, et al., “A Semi-automatic Approach to Home Video Editing”, UIST 2000, San Diego, CA, 9 pages. |
Girgensogn, et al., “Home Video Editing Made Easy—Balancing Automation and User Control”, 8 pages. |
Graham, et al., “The Video Paper Multimedia Playback System”, Nov. 2-8, 2003, 2 pages. |
Hua, et al., “Automatically Convering Photographic Series into Video”, MM '04, Oct. 10-16, 2004, New York, NY, 8 pages. |
Hua, et al., “AVE—Automated Home Video Editing”, 2003, ACM, 8 pages. |
Hua, et al., “Video Booklet”, 4 pages. |
Yip, et al., “The Automatic Video Editor”, Nov. 2-8, 2003, Berkeley, California, 2 pages. |
Zhang et al., “A Hierarchical Organization Scheme for Video Data” Pattern Recognition, Elsevier, Kidlington, GB, vol. 35 No. 11, Nov. 2002, pp. 2381-2387. |
“Advanced Digital Video Storage and On-line Retrieval System”, Advisor Project, Jun. 30, 2002, retrieved from the Internet: http://advisor.matrasi-tls.fr/DUP—workshop—sheet.pdf. |
Berlin et al., “Indexing for Resue of TV News Shots” Pattern Recognition, Elsevier, Kidlington, GB, vol. 35, No. 3, Mar. 2002, pp. 581-591. |
Lee et al., “Automatic Video Parsing Using Shot Boundary Detection and Camera Operation Analysis” Pattern Recognition, Elsevier, Kidlington, GB, vol. 34, No. 3, Mar. 2001, pp. 711-719. |
Adams, et al., “Seeded Region Growing”, IEEE, vol. 16, No. 6, 1994, pp. 641-648. |
Alexander, et al., “Informedia: News-on-Demand Multimedia Information Acquisition and Retrieval”, 1997, pp. 1-21. |
DeMenthon, et al., “Video Summarization by Curve Simplification”, ACM, 1998, pp. 211-218. |
Gong, et al., “Creating Motion Video Summaries with Partial Audio-Visual Alignment”, IEEE, 2002, pp. 285-288. |
Goncalves, “Towards a Learning Model for Feature Integration in Attention Control”, retrieved on Jul. 19, 2006 at <<http://ieeexplore.ieee.org/search/srchabstract.jsp?arnumber=10135533&isnumber=21824&pun>>, International Conference on Multisensor Fusion and Integration for Intelligent Systems, Aug. 20-22, 2001, pp. 311-316. |
Osberger, et al., “An Automatic Image Quality Assessment Technique Incorporating Higher Level Perceptual Factors”, retrieved on Jul. 19, 2005, at <<http://ieeexplore.ieee.org/search/srchabstract,jsp?arnumber727227&isnumber==15680&punu>>, International Conference on Image Processing, vol. 3, Oct. 4-7, 1998, pp. 414-418. |
Beg, “User Feedback Based Enhancement in Web Search Quality”, Elsevier Inc., 2004, pp. 20. |
Deng, et al., “Peer Group Filtering and Perceptual Color Image Quantization”, IEEE, 1999, pp. 8. |
Kelly, et al., “Implicit Feedback for Inferring User Preference: A Bibliography”, SIGIR Forum, vol. 37, No. 2, 2003, pp. 18-28. |
“NOMAD (No More Advertising)”, http://www.fatalfx.com/nomad/, Feb. 17, 2003, 4 pages. |
Pirolli, “Exploring Browser Design Trade-offs Using a Dynamical Model of Optimal Information Foraging”, Proc. SIGCHI Conference on Human Factors in Computing Systems CHI, 1998, pp. 33-40. |
Smith, et al., “Scalable Multimedia Delivery for Pervasive Computing”, ACM, 1999, pp. 131-140. |
Sugiyama, et al., “Adaptive Web Search Based on User's Implicit Preference”, DEWS, 2004, pp. 8. |
Rauschenbach, et al., “Demand-driven image transmission with levels of detail and regions of interest”, Computers & Graphics, 1999, pp. 857-866. |
Rauschenbach, et al., “Flexible Embedded Image Communication using Levels of Detail and Regions of Interest”, Proc. IMC'98, 1998, pp. 1-10. |
Notice of Rejection from Japanese Patent Application No. 2003-375,014, mailed on Sep. 10, 2008, 10 pgs. |
Vetro, et al., “Object-Based Transcoding for Adaptable Video Content Delivery”, Transactions on Circuits and Systems for Video Technology, IEEE, vol. 11, No. 3, Mar. 2001, pp. 387-401. |
Li, “PeerStreaming: A Practical P2P Streaming System”, Microsoft, Sep. 2004. |
Zhuang et al., “Adaptive Key Frame Extraction Using Unsupervised Clustering”, Oct. 1998 in IEEE International Conference on Image Processing, vol. 1, pp. 866-870. |
Wolf, “Key Frame Selection by Motion Analysis”, Conf. Proceedings Acoustics, Speech, and Signal Processing, 1996 IEEE International Conference, May 1996, vol. 2, pp. 1228-1231. |
Ex parte Douglas K. Brubacher and Luciano Passuello, Board of Patent Appeals and Interferences, Jan. 22, 2009, 14 pages. |
Padmanabhan et al., “Resilient Peer to Peer Streaming”, <<http://eprints.kfupm.edu.sa/62381.pdf>>, Mar. 2003, 17 pages. |
Uchihashi et al., “Video Manga: Generating Semantically Meaningful Video Summaries”, In Proceedings of the Seventh ACM International Conference on Multimedia, Orlando, Florida, Oct. 30-Nov. 5, 1999, ACM, pp. 383-392. |
Jiang et al., “GnuStream: A P2P Media Streaming System Prototype”, 2003 International Conference on Multimedia and Expo, vol. 2 (ICME '03), pp. 325-328. |
Wang, “Design and Realization of User Model of Personalized Information Retrieval”, Master's Dissertation, Dalian University of Science and Technology, Dec. 15, 2004, pp. 10-75. |
Final Office Action for U.S. Appl. No. 11/278,487, mailed on Jun. 16, 2011, Yu-Fei Ma, “Generating a Motion Attention Model”. |
The Korean Office Action mailed Nov. 22, 2012 for Korean patent application No. 10-2008-7015218, a counterpart foreign application of US patent No. 7,599,918, 4 pages. |
The Korean Office Action mailed Feb. 14, 2013 for Korean patent application No. 10-2008-7007991, a counterpart foreign application of US patent No. 8,180,826, 10 pages. |
The Japanese Office Action mailed Feb. 23, 2012 for Japanese patent application No. 2005-224585, a counterpart foreign application of US patent No. 7,986,372, 2 pages. |
The Extended European Search Report mailed Feb. 1, 2012 for European patent application No. 06849037.4, 10 pages. |
The Japanese Office Action mailed Sep. 1, 2011 for Japanese patent application No. 2005-224585, a counterpart foreign application of US patent No. 7,986,372, 2 pages. |
Office Action for U.S. Appl. No. 11/263,718, mailed on Sep. 8, 2011, Xian-Sheng Hua, “Template-Based Multimedia Authoring and Sharing”, 7 pgs. |
Non-Final Office Action for U.S. Appl. No. 11/279,793, mailed on Sep. 15, 2011, Xian-Sheng Hua, “Media Sharing and Authoring on the Web”, 21 pages. |
Shen, et al., “Implicit User Modeling for Personalized Search”, Proceedings of the 14th ACM International Conference on Information and Knowledge Management, CIKM, Oct. 31, 2005, pp. 824-831, retrieved from the internet at http://dl.acm.org/citation.cfm?doid=1099554.1099747 on Jan. 19, 2012. |
The Korean Office Action mailed Aug. 27, 2013 for Korean patent application No. 10-2008-7007991, a counterpart foreign application of US patent No. 8,180,826, 13 pages. |
The Korean Office Action mailed Mar. 17, 2014 for Korean patent application No. 10-2008-7007991, a counterpart foreign application of U.S. Appl. No. 11/279,793, 11 pages. |
Dufaux, “Key frame selection to represent a video”, Proc. 2000 Int. Con. on Image Processing, vol. 2, Sep. 2000, pp. 275-278. |
Number | Date | Country
---|---|---
20060107289 A1 | May 2006 | US