The present invention relates to a system and method for selective recording of various types of information.
The explosion of information now available relating to a wide variety of topics ranging from business to entertainment has driven the development of technology capable of providing this information to various selected groups and individuals. Once the information is conveyed from a source to a destination, the information is often categorized and stored either by individuals or by service organizations for subsequent retrieval of selected information.
Regardless of the particular form of received information, or the character of the storage media, it is often desirable to identify information of interest and extract that information from the incoming data stream for subsequent storage. This procedure optimizes the use of the storage media while also increasing the efficiency of the user in assimilating the information. Thus, the user interested in ascertaining knowledge relative to a particular area is exposed to the most relevant information from a wide variety of sources without being inundated with extraneous facts, figures, and advertisements. Similarly, a user may select a particular audio or video program for recording from continuously broadcast information so that valuable time is not wasted viewing or listening to unwanted information.
Many people no longer “tune-in” to a single broadcast from beginning to end. Instead, people often interleave their viewing or listening with “channel surfing” and “ad zapping” or “filtering” of information. Filtering is commonly called “ad zapping” but is used herein to refer to elimination or avoidance of advertisements or other repetitive information, such as repeated news footage or the like. Surfing, as used herein, refers to searching over time and/or available information channels for “interesting” information.
A number of prior art systems have recognized the problem of eliminating extraneous information prior to recording for subsequent retrieval and review. Many of these systems are limited to use with video information (often in combination with audio information), such as a television broadcast, and are not easily adaptable to other forms of information.
Many of the systems designed for use with audio/video (AV) information rely on prior marking or coding of the information for subsequent identification by a recording device. For example, a television broadcast may contain information relative to the program characteristics and content such as the starting and ending times, the type or content of the program, a program guide summary, a classification, a rating, or the like. This additional information may be included when the signal is originally broadcast, but is typically added by an annotation service. The signal is received and decoded and may be used to control any of a number of devices. For example, closed captioning information may be decoded to provide a textual transcript of the audio information, or the decoding receiver may limit what type of programs are viewed. The signal may also be utilized to automatically record particular programs on preselected channels via control of a video cassette recorder (VCR) and VCR recording tape cassette. Annotation services have been relied upon to facilitate prior art systems which provide surfing functions.
Reliance upon an annotation service requires the viewer to subscribe to the service and forces the viewer to explicitly select a menu item which ultimately corresponds to an approximate time/channel slot. This is a continuing expense for the viewer and may not support all of the channels which the viewer receives. In addition, the interesting material, such as a sports broadcast during the nightly news, may not be considered a “programming unit” by the annotation service so it would not be given an identifier which could be used by such systems. Furthermore, the interesting information may be broadcast on other channels or time slots and would not be identified.
Other prior art systems and methods which are designed primarily to eliminate recording of commercials (zapping) during a program broadcast rely on a change in amplitude of the video and/or audio signal components to control the recording of the information. The recording device monitors the incoming signal while recording and detects commercials by the characteristic video fade and/or audio fade between a program segment and a commercial. When a commercial is detected, the recording is paused so that the unwanted information is not recorded. Some prior art systems pause for a predetermined period of time (e.g. 15 or 30 seconds) after detecting a fade (or eliminate material between two fades separated by a standard advertisement interval) to avoid recording the advertisement. These systems rely on non-intrinsic characteristics of the broadcast signal which may lead to incorrect blanks in the recording of the desired program material. For example, an artistic fade in the middle of a program broadcast may be mistaken for a fade prior to an advertisement, resulting in an undesirable 30 second blank in the recorded material. In addition, these approaches are limited to the elimination of advertisements and would be difficult to adapt to elimination of other “uninteresting” information from the incoming information stream, such as repetitive news footage. Furthermore, as such systems become more prevalent, broadcasters may reduce or eliminate the fades altogether to defeat operation of these systems so that advertisements reach their intended audience.
Another approach, found in prior art systems designed to provide automatic recording of a particular television program of interest, requires the user to enter an index code into a VCR which is decoded to determine the broadcast channel, program starting time, and program duration. Since these systems do not monitor the actual broadcast signal being received, once recording begins, it continues for the duration specified regardless of the particular content of the signal. These systems are subject to recording errors resulting from minor modifications to the programming schedule of the television stations. Such minor schedule changes often occur to accommodate important news broadcasts or the like.
One alternative approach used to eliminate advertisements includes adding control information to the recording so that the advertisements are skipped when the recording is replayed. This avoids lost information due to an erroneous indication of an advertisement but requires the user to wait while the recording is fast-forwarded through the advertisement. Since this approach also relies on audio/video fades, it could easily be defeated by broadcasters when the economic incentive becomes great enough. Another approach offered by at least one television manufacturer provides a “real-time” zapping function which allows the viewer to scan other channels for 30 seconds before automatically returning to the original channel, so as to avoid a standard length advertisement.
Prior art AV editing systems have utilized the highly constrained structure of various news programs to segment a broadcast based on recognizing predetermined logos and faces of newscasters which are broadcast repeatedly. After receiving and recording a newscast or other structured program, these systems attempt to identify the various segments by separating the individual frames into regions and searching for the previously identified logos or faces. The segments are then classified and archived for use in future programs.
It is thus an object of the present invention to provide a system and method for selectively recording information based on intrinsic characteristics of the received information signal.
It is a further object of the present invention to provide a system and method which avoids recording information which a user has previously identified as being uninteresting.
Another object of the present invention is to provide a system and method for automatically locating and recording information of interest broadcast on any one of a number of available broadcast channels.
Yet another object of the present invention is to provide a system and method which automatically avoids recording repetitive information.
Still another object of the present invention is to provide a system and method which utilize sequential pattern matching to identify previously marked information and to control a recording device based on the outcome of the pattern matching.
A still further object of the present invention is to provide a data structure particularly suited for use in a system and method for selective recording of information which facilitates efficient, unique, and reliable pattern matching of information.
Another object of the present invention is to provide a system and method for selective recording of information which utilizes sequential pattern matching of information based on a summary statistical characterization of the information.
In carrying out the above objects and other objects and features of the present invention, a system is provided which includes a persistent memory for memorized information segments (marked by the user or by repetition) and control logic for computing and matching statistical vectors based on the received information. The control logic implements sequential pattern matching using the statistical vectors to identify memorized information within the incoming information stream. The control logic is linked to a recording device to selectively record information based on the result of the pattern matching.
A data structure is also provided for use with the present invention and with other similar systems designed to uniquely and reliably identify information of interest (or of no interest) for recording (or omission from a recording) using sequential pattern matching. The data structure is organized around statistical vectors, each characterizing an information “frame.” An information frame is a short cohesive amount of data. For example, for video, this could be a single image field. The data structure includes an indexable table for each element of the statistical vector. Each table provides a list of the information frame “identifiers” associated with a given value or range of values of that element of the statistical vector. Each information frame identifier includes a statistical vector representing an information frame, a pointer to the previously stored frame, and a pointer to the subsequently stored frame. The sequence of information frames which are joined by previous/subsequent frame pointers will be referred to as an information segment. Depending on the mode of operation, the data structure may also include the time length to record following the memorized information segment or the last date/occurrence count of the memorized information segment.
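By way of illustration only, the organization described above may be sketched as follows. The sketch is not the disclosed implementation: the class names, the field names, and the use of fixed-width value buckets as the "range of values" for each table are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class FrameIdentifier:
    """One memorized information frame (all field names are illustrative)."""
    frame_id: int
    vector: Tuple[float, ...]        # statistical vector for the frame
    prev_id: Optional[int] = None    # previously stored frame, None at segment start
    next_id: Optional[int] = None    # subsequently stored frame, None at segment end


@dataclass
class SegmentStore:
    """Frame identifiers plus one indexable table per vector element."""
    n_elements: int
    bucket_width: float = 1.0        # assumed quantization of element values
    frames: Dict[int, FrameIdentifier] = field(default_factory=dict)
    tables: List[Dict[int, List[int]]] = field(default_factory=list)

    def __post_init__(self) -> None:
        # One table per element of the statistical vector.
        self.tables = [dict() for _ in range(self.n_elements)]

    def _bucket(self, value: float) -> int:
        return int(value // self.bucket_width)

    def memorize_segment(self, vectors: List[Tuple[float, ...]]) -> List[int]:
        """Store consecutive frames and join them with previous/subsequent pointers."""
        ids: List[int] = []
        for vec in vectors:
            fid = len(self.frames)   # simple id scheme; assumes no deletions
            ident = FrameIdentifier(fid, tuple(vec))
            if ids:
                ident.prev_id = ids[-1]
                self.frames[ids[-1]].next_id = fid
            self.frames[fid] = ident
            # Each table lists the identifiers associated with a value range.
            for table, value in zip(self.tables, vec):
                table.setdefault(self._bucket(value), []).append(fid)
            ids.append(fid)
        return ids


store = SegmentStore(n_elements=2)
segment = store.memorize_segment([(1.2, 5.0), (1.7, 9.3), (4.0, 9.9)])
print(segment)                 # [0, 1, 2]
print(store.tables[0][1])      # [0, 1]
```

The chain of identifiers returned by `memorize_segment` corresponds to one information segment; mode-dependent fields such as a record length or an occurrence count could be added to `FrameIdentifier` in the same manner.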
A method is also provided for use with the system disclosed and other similar systems. The method includes controlling a recording device based on sequential pattern matching of information. The method includes marking information of interest, computing a sequence of statistical vectors representing the marked information, and storing the statistical vectors and sequence information to allow future pattern matching. The method also includes identifying memorized information segments within an incoming information stream utilizing sequential pattern matching against the memorized sequence of statistical vectors, and controlling a recording device based on the result of this pattern matching.
There are a number of advantages accruing to the present invention. For example, the system and method of the present invention use summary statistics, which capture intrinsic signal information, to detect repetitive material or information of interest, rather than using unintentional signal content, such as audio and/or video fades or annotations. As a result, the present invention does not require an annotation service to provide broadly distributed annotations of the broadcast information. Nor does it rely on unintentional, and avoidable, artifacts of composited media.
Another advantage of the present invention over those systems and methods found in the prior art is that the statistical characterization of the information utilized by the present invention will reduce or eliminate the possibility of false detections resulting in incorrect blanking in the recording of desired information. Since the present invention does not rely on amplitude changes of the received information, its function will not be impaired by incorrect blanking caused by unanticipated changes in the received signal during the course of a program broadcast.
Also unlike prior art systems and methods, the present invention is capable of recording information of interest from any of a large number of received channels without knowing a priori on which channel or at what time the information of interest will be broadcast. Many prior art systems require the user to enter the broadcast time and channel, or a code representing that information, which is translated by the recording device. In contrast, the present invention does not require the user to locate such broadcast information, but only to provide a representative sample of the stable (repeated from installment to installment) introductory information and the length of time to record once the introductory information is located.
Another advantage of the present invention is its applicability to a variety of media sources. The present invention is not limited to audio and/or video information, or broadcast information in general, but may be utilized for sequential pattern matching of virtually any time-based information signal. For example, the present invention may be used in sequential pattern matching of seismographic information, pressure information, or object profile information (such as used in radar and the like) to name a few.
The above objects and advantages in addition to other unenumerated objects, features, and advantages of the present invention will be readily appreciated by one of ordinary skill in the art from the following detailed description of the best mode for carrying out the invention when taken in connection with the accompanying drawings.
FIGS. 2a and 2b depict data structures for use with a selective information recording system and method according to the present invention;
FIGS. 3a and 3b are flow charts illustrating information selection for user-marked surfing and zapping operation modes according to the present invention;
It should be recognized that the flowcharts presented in
It should also be recognized that, while the flowcharts presented in
Referring now to
The present invention includes two main operating modes which share the same computational engine. The surfing function is used to locate previously marked and memorized information segments which activate a recording device to record for a designated period following the sequence. These information segments may be marked by user-marking or by assisted-marking. The zapping function is used to avoid recording previously encountered information segments. These information segments may be marked by the user, marked through assisted marking, or automatically determined by the system. The user-marking modes rely on user input to determine both the beginning and end of the information segments. The assisted-marking modes rely on a small amount of user input and assist the user in identifying the beginning and end of the information segment. The automatic-marking mode (for zapping only) characterizes and stores all information received while continuously comparing incoming information with the previously memorized (stored) information to identify repetitive sequences. Repetitive information is reinforced while all other information is eventually deleted from memory through a “selective forgetting” operation as described in detail below.
The choice between user-marking, assisted-marking and (for zapping only) automatic-marking may be a design decision where, in any single device, only one of the two classes is used for surfing and only one of the three classes is used for zapping. However, as is obvious from their descriptions, more than one class of marking could be supported for either zapping or surfing, if that extra flexibility is desired.
Returning now to
While in the automatically-marked zapping mode, block 13 determines if the current incoming frame is an unmatched frame which should be memorized by block 12. Sequential pattern matching of received information to memorized information is then performed at block 14 of
When a successful match is indicated at block 16, a recording device is appropriately controlled as represented by block 18. Depending on the mode of operation which was selected at block 10 when the matched information segment was memorized, a successful match of the marked information serves a different purpose. In the zapping modes, the incoming information which matches the memorized information is not recorded. However, in the surfing modes, the matched incoming information initiates the recording process which then continues for a designated time period thereafter.
In modes of operation other than the automatically-marked zapping mode, unmatched information is quickly discarded or “forgotten” and is not memorized. In the automatically-marked zapping mode, however, all incoming information is memorized for comparison to future incoming information. Repetitive information (that which is successfully matched) is then automatically zapped and is not recorded. A successful match also reinforces the previously memorized information. Eventually, unmatched (and therefore unreinforced) information which was previously memorized will be discarded through a “selective forgetting” process as required by available memory space.
A graphical illustration of data structures particularly suited for use with the selective information recording system according to the present invention is shown in
As indicated earlier, a frame of media as referred to herein means a short time interval of the media which remains unchanged by its surrounding material. For example, a standard frame in video is a 1/30th-second image along with the coincident 1/30th second of audio. A “frame” of sound is more difficult to define since there are no natural “synchronization” marks as are present with conventional video signals. A reasonable audio frame may be defined by a 20-millisecond section with the assumption that the beginning and end frames (which may be contaminated by other material) can simply be ignored. For digital information, a frame of information may be determined by the particular communication protocol used in transmission and reception. The operation of the present invention does not require any particular, well-defined frame length. However, the statistics chosen to characterize the information should be particularly robust to different framing offsets for audio-only sources.
As illustrated in
The original intervals are formed when an information frame is memorized by adding and subtracting an error margin to the computed statistical value for each statistical element 34 in statistical vector 36. The error margin, and thus the span of the original interval, depends upon the level of noise expected in the received signal (which may depend on the type of signal, i.e. analog or digital). An appropriately selected error margin increases the reliability of the pattern matching process. An analog signal may require a wider interval for satisfactory operation than the interval necessary for the same quality of operation when receiving a digital signal. If the error margin is too small for the quality or type of the received information signal, information which should match previously identified material will not be detected. If the error margin is significantly larger than necessary, the system will identify information segments which have similar statistical representations but which are not intended to be matched. This results in an inefficient matching process and may result in erroneous matches.
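The effect of the error margin may be illustrated by the following minimal sketch. The function names, the sample vectors, and the margin values are hypothetical and chosen only to show the three cases discussed above: a margin that is too small, one that is adequate, and one that is larger than necessary.

```python
from typing import List, Tuple

Interval = Tuple[float, float]


def make_intervals(vector: List[float], margin: float) -> List[Interval]:
    """Form one closed interval per element: value - margin to value + margin."""
    return [(v - margin, v + margin) for v in vector]


def matches(intervals: List[Interval], vector: List[float]) -> bool:
    """A candidate frame matches only if every element falls inside its interval."""
    return len(intervals) == len(vector) and all(
        lo <= v <= hi for (lo, hi), v in zip(intervals, vector)
    )


memorized = [10.0, -3.0, 7.5]
noisy = [10.4, -3.3, 7.1]        # same frame with assumed reception noise
unrelated = [12.0, -1.0, 9.0]    # a different frame with broadly similar statistics

narrow = make_intervals(memorized, margin=0.2)    # too small for this noise level
wide = make_intervals(memorized, margin=0.5)      # adequate for this noise level
too_wide = make_intervals(memorized, margin=2.5)  # larger than necessary

print(matches(narrow, noisy))        # False: a true repeat is missed
print(matches(wide, noisy))          # True
print(matches(too_wide, unrelated))  # True: an erroneous match
```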
The particular statistics utilized to characterize the received information should be chosen based upon the type of information and the type of signal received. In a preferred embodiment of the present invention, the received signal is a compressed audio and video signal which conforms to the Joint Photographic Experts Group (JPEG) compression strategy, which is well known in the art. The set of statistics utilized in this case may be as simple and readily available as the spatially-aliased, bandpass thumbnail sketch of the received information frame. For example, each statistical element 34 may be obtained from the low-frequency coefficients in two-space (not including the zero frequency or DC coefficient) of an eight by eight block (of pixels) discrete cosine transform (DCT) used in JPEG or MPEG (Moving Picture Experts Group) compression along with the compressed audio signal corresponding to the video frame period. As is known in the art, the DCT is only one of the many available fast unitary transforms which may be used. The block DCT transform is particularly suited to this application since it compresses most of the entropy of the information into a small number of coefficients which may then be utilized as a unique summary characterization of the information segment. Some examples of other transforms which may be used include the wavelet, the block-Hadamard, and the block-Karhunen-Loeve transforms.
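As a hedged illustration of how low-frequency block DCT coefficients might serve as statistical elements, the sketch below computes a direct (unoptimized) orthonormal two-dimensional DCT-II of one eight by eight block and keeps a few AC coefficients. The function names, the number of coefficients retained, and the simple low-to-high frequency ordering (which is not the exact JPEG zigzag scan) are assumptions; in practice the coefficients would be read from the compressed stream rather than recomputed.

```python
import math
from typing import List

N = 8  # block size used by JPEG/MPEG-style block DCTs


def dct_2d(block: List[List[float]]) -> List[List[float]]:
    """Orthonormal 2-D DCT-II of an N x N block (direct, unoptimized form)."""
    def alpha(k: int) -> float:
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            total = 0.0
            for x in range(N):
                for y in range(N):
                    total += (block[x][y]
                              * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                              * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[u][v] = alpha(u) * alpha(v) * total
    return out


def low_frequency_stats(block: List[List[float]], count: int = 5) -> List[float]:
    """Return the first `count` AC coefficients by increasing u + v, skipping DC."""
    coeffs = dct_2d(block)
    order = sorted(
        ((u, v) for u in range(N) for v in range(N) if (u, v) != (0, 0)),
        key=lambda p: (p[0] + p[1], p[0]),
    )
    return [coeffs[u][v] for u, v in order[:count]]


# A block whose brightness varies only along one axis (a horizontal ramp).
ramp = [[float(y) for y in range(N)] for _ in range(N)]
stats = low_frequency_stats(ramp)
print([round(s, 3) for s in stats])
```

For the ramp block, only coefficients with `u == 0` carry energy, so the elements taken from positions with `u > 0` are numerically zero; a frame-level statistical vector could be formed by combining such elements over several blocks.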
Alternatively, simpler statistics may be utilized depending on the information signal, such as row and column averages. Of course, more than one type of statistic could be used to determine one or more of the statistical elements 34 in statistical vector 36 so as to uniquely characterize the received information. The sensitivity of the selected statistic to noise in the received signal may also influence selection of the interval size. While the error intervals, such as interval 40, are preferably centered about the value of a corresponding statistical element 34, other offsets may be utilized without departing from the spirit or scope of the present invention. Similarly, the intervals may include only a single value.
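The simpler row and column average statistic mentioned above may be sketched as follows. The function name and the representation of a frame as a list of pixel rows are assumptions made for the example.

```python
from typing import List


def row_column_averages(frame: List[List[float]]) -> List[float]:
    """Statistical vector made of every row average followed by every column average."""
    rows = [sum(r) / len(r) for r in frame]
    cols = [sum(c) / len(frame) for c in zip(*frame)]
    return rows + cols


frame = [[1.0, 2.0, 3.0],
         [4.0, 5.0, 6.0]]
print(row_column_averages(frame))   # [2.0, 5.0, 2.5, 3.5, 4.5]
```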
With continuing reference to
As illustrated in
Some fields of each frame identifier are not used in all the available operating modes. Thus, a system of diminished functionality could be produced using a data structure without those corresponding fields. For example, the program length field 58 is used only by the surfing functions to determine the length of time to record after a successful match has been indicated. The last occurrence 60 and frequency fields 62 are used only by the automatically-marked zapping mode of operation. Thus, an appropriate data structure is created including only the necessary fields as indicated by the current operating mode.
Referring now to
As illustrated at step 84 of
Otherwise, the next frame of information is obtained as indicated by step 94 and the process continues with step 88. Steps 88 through 94 are repeated until all the frames of the marked information segment are recorded both as a sequence of consecutive frame identifiers and as nodes in the indexable table. After the input process has been completed, control passes to the appropriate process for the current operational mode as indicated by step 96. The various operational modes are illustrated and described in detail with reference to
b provides a detailed illustration of a process for creating a frame identifier. The process begins at step 90a by allocating space for the data structure. The statistical vector characterizing the associated received information frame is recorded within the data structure at step 90b. The minimum length for a successful match is stored at step 90c and the adjacent frame pointers (such as previous pointer 52 and subsequent pointer 54) are initialized at step 90d. This may include entering a null value for the first or last frame in a particular sequence as discussed with reference to
Preferably, the user marking and memorizing processes illustrated in
The system utilized to implement the user marking and memorizing process should have sufficient memory to buffer the frame statistics until the processing, or memorizing, is completed, i.e. until the appropriate intervals and sub-intervals are created and the statistics are properly located within the indexable table. A preferred system also includes sufficient memory to accommodate a large number of frame identifiers since a frame identifier must be retained for each frame in the marked sequence throughout the operational life of the system, or until they are deleted by the user. The memory should also preferably be sized to accommodate a search tree for each element of the vector of frame statistics throughout the operational life of the system.
Referring now to
A frame of information is obtained from the incoming information channel being recorded at step 100. A statistical vector is calculated at step 102 using the same statistical calculations which were utilized during the user marking and memorizing process of
In a preferred embodiment, indexing into each binary tree is completed by finding all the leaves of the tree having an interval which contains the value computed for the current statistical element. For the first element and the first binary tree, the identification numbers which have been indexed in this manner are placed into a temporary cache. For each of the remaining elements and binary trees, the identification numbers in the temporary cache must be found among the indexed leaves, otherwise they are deleted from the cache. Step 106 determines when the temporary cache becomes empty, which indicates that the current statistical vector does not match a previously marked statistical vector and the indexing process terminates. Once the indexing process terminates, a data counter, which indicates the location of the current frame within the incoming information stream, is incremented and the program control returns to step 100.
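The cache-narrowing procedure just described may be sketched as below. For brevity, each binary tree is replaced by a flat list of leaves that is scanned linearly; this substitution, together with the names `Leaf` and `index_frame`, is an assumption of the sketch and not the disclosed tree structure.

```python
from typing import List, Set, Tuple

Leaf = Tuple[float, float, int]   # (interval low, interval high, frame id)


def index_frame(tables: List[List[Leaf]], vector: List[float]) -> Set[int]:
    """Return the ids whose intervals contain every element of `vector`."""
    cache: Set[int] = set()
    for element, (table, value) in enumerate(zip(tables, vector)):
        # Leaves of this element's table whose interval contains the value.
        hits = {fid for lo, hi, fid in table if lo <= value <= hi}
        # The first element fills the cache; later elements can only remove ids.
        cache = hits if element == 0 else cache & hits
        if not cache:
            break   # empty cache: no memorized frame matches, stop indexing
    return cache


tables = [
    [(0.0, 2.0, 1), (1.0, 3.0, 2), (5.0, 6.0, 3)],        # table for element 0
    [(10.0, 12.0, 1), (20.0, 22.0, 2), (10.0, 11.0, 3)],  # table for element 1
]
print(index_frame(tables, [1.5, 11.0]))   # {1}
print(index_frame(tables, [5.5, 21.0]))   # set()
```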
At step 108 of
As indicated by step 116, if the next memorized frame statistics match the statistics computed in step 112, then step 118 determines if the end of the memorized sequence has been reached as indicated by a null value in the subsequent frame field 54 of the frame identifier. If the end of sequence is not indicated, control continues by incrementing the data counter to select and process the next frame within the incoming information stream at step 110. Otherwise, if step 116 determines the current frame does not match the subsequent frame of the memorized sequence, or step 118 determines that the memorized sequence has ended, control continues at step 120.
The length of the matched sequence (i.e. number of frames) is compared with the minimum match length of the selected memorized sequence at step 120 to determine if the match is long enough to indicate success. In the user-marked zapping mode, the determination as to whether the matched sequences were “long enough” depends on both the minimum match length indicated by the user in step 82, and on the constraints imposed by the computational latency. For example, if the latency of steps 102 and 104 is longer than a frame period and the latency in the loop of steps 110 through 118 is less than a frame period, the “long enough” constraint includes the need to make up for the excess latency of steps 102 and 104 and to thereby “catch up” to the current frame.
If the match is long enough as determined at step 120, step 122 controls the recording device to zap the marked information from the recording. This may be completed by returning to the starting location which was stored at step 108, such that the intervening information frames will be deleted or subsequently rerecorded and therefore lost; by recording a control code to skip the marked information upon playback; or by emptying a delay buffer, inserted between the incoming information stream and the recording device, and pausing the recording until the buffer is refilled. If the result of step 120 is negative, the recording device continues uninterrupted recording and the process is repeated with step 100 to search for a matching sequence.
The computational capabilities of a system implementing the user-marked zapping function should provide a latency for computation of the frame statistics and the comparisons which is less than a single frame period. The latency of indexing into the memorized tables can be greater than a single frame period as long as its computational throughput is greater than the frame rate, and the latency delay is compensated for by the simple frame statistics computation/comparison loop (i.e. steps 110 through 118) in less than the time length of the marked sequence. An implementing system should also have sufficient memory to buffer the current frame statistics until such time as processing can be completed. The system should also have a cache for frame identifier numbers to provide for indexing across the separate memorized tables.
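The comparison loop of steps 110 through 118 and the length test of step 120 may be sketched as follows. The dictionaries standing in for the subsequent frame pointers and stored vectors, the `close` comparison function, and the function names are hypothetical; latency compensation is not modeled.

```python
from typing import Callable, Dict, List, Optional, Tuple

Vector = Tuple[float, ...]


def matched_length(start_id: int,
                   next_of: Dict[int, Optional[int]],
                   vector_of: Dict[int, Vector],
                   incoming: List[Vector],
                   close: Callable[[Vector, Vector], bool]) -> int:
    """Count consecutive incoming frames that follow the memorized sequence.

    incoming[0] is assumed to have already been matched to start_id by indexing.
    """
    length = 1
    current = next_of[start_id]
    for vec in incoming[1:]:
        # Stop at the end of the memorized sequence (null pointer) or on a mismatch.
        if current is None or not close(vector_of[current], vec):
            break
        length += 1
        current = next_of[current]
    return length


def should_zap(length: int, min_match_length: int) -> bool:
    """Step 120 analogue: the match must be at least the minimum match length."""
    return length >= min_match_length


next_of = {0: 1, 1: 2, 2: None}
vector_of = {0: (1.0,), 1: (2.0,), 2: (3.0,)}
close = lambda a, b: all(abs(x - y) <= 0.1 for x, y in zip(a, b))

stream = [(1.0,), (2.05,), (3.0,), (9.0,)]
n = matched_length(0, next_of, vector_of, stream, close)
print(n, should_zap(n, min_match_length=3))   # 3 True
```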
Step 140 of
With continuing reference to
Step 162 controls the recording device to begin recording information and continue for the predetermined time period specified by the user during the information marking process. In one embodiment, the search for other memorized information is suspended for the period during the recording. In another embodiment, if more than one sequence has been marked for recording and a preference is indicated among the marked sequences, the system continues to monitor available information channels while recording to locate information having a higher preference code. If such information is located, the current recording can be terminated so the higher preference material may be recorded.
The computational and memory requirements for a preferred system implementing the user-marked surfing function are directly proportional to the computational and memory requirements for the user-marked zapping function. The proportionality constant varies between unity and the number of available information channels, depending on the need to scan multiple channels in a single frame period. This requirement, in turn, depends upon the length of the marked information and the percentage match required to indicate a successful match.
Referring now to
Step 170 of
Step 180 is performed only periodically to selectively forget information so as to reduce the size of the memorized tables. Otherwise, the size of the memorized tables would grow without bound the longer the automatically-marked zapping mode was active. To avoid difficulties associated with using all the available memory, and increasingly difficult indexing of information, the size of the memorized tables is thus periodically reduced.
The memorized tables are reduced in size by traversing the memorized tables, examining each frame identifier to determine the last time it was received and the frequency of recognition. The frame identifiers which are too old or too infrequently encountered are then “forgotten”. The process of “forgetting” a frame identifier includes removing all pointers to that identifier (in their predecessor and successor frames and in the memorization tables) and allowing the memory occupied by that frame identifier to be reallocated by the system. The control logic then returns to step 170 and the process continues.
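A minimal sketch of such a table reduction is given below. The thresholds `max_age` and `min_count`, the time units, and the dictionary-based tables are assumptions; a practical policy would presumably also exempt recently memorized frames from the frequency test so that they survive at least one expected repetition period.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Frame:
    prev_id: Optional[int]
    next_id: Optional[int]
    last_seen: float   # time of last occurrence (assumed unit: seconds)
    count: int         # number of times the frame has been recognized


def forget(frames: Dict[int, Frame], tables: List[Dict[int, List[int]]],
           now: float, max_age: float, min_count: int) -> List[int]:
    """Remove frames that are too old or too infrequent; return their ids."""
    doomed = [fid for fid, f in frames.items()
              if now - f.last_seen > max_age or f.count < min_count]
    for fid in doomed:
        f = frames.pop(fid)   # memory becomes available for reallocation
        # Remove pointers held by the predecessor and successor frames.
        if f.prev_id in frames:
            frames[f.prev_id].next_id = None
        if f.next_id in frames:
            frames[f.next_id].prev_id = None
        # Remove pointers held by the memorization tables.
        for table in tables:
            for bucket in table.values():
                if fid in bucket:
                    bucket.remove(fid)
    return doomed


frames = {
    0: Frame(None, 1, last_seen=100.0, count=5),
    1: Frame(0, 2, last_seen=100.0, count=5),
    2: Frame(1, None, last_seen=10.0, count=5),      # too old
    3: Frame(None, None, last_seen=100.0, count=1),  # too infrequent
}
tables = [{0: [0, 1, 2, 3]}]
print(forget(frames, tables, now=110.0, max_age=50.0, min_count=2))   # [2, 3]
```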
With continuing reference to
With continuing reference to
Step 202 of
The computational requirements of a preferred system implementing the automatically-marked zapping function are the same as those described for the user-marked zapping function with the additional requirement that the throughput of creating and memorizing a new frame identifier must be greater than the frame rate. The computational requirements of step 180 to selectively forget frame identifiers will not be constrained as long as this step is only occasionally necessary. In that case, the reduction can be completed when there is no incoming information frames (i.e. the recording device is off or the automatically-marked zapping function is disabled). If necessary, the reduction process can occur in the middle of a zapping session, by temporarily suspending the zapping function. During that time, repeated video would not be zapped from the recording and none of the incoming information frames would be memorized.
The memory requirements of a preferred system implementing the automatically-marked zapping function of the current invention are similar to those described to implement the user-marked zapping function. However, the memorizing process is continuous and newly memorized frames are preferably retained for at least the minimum expected repetition period before they are forgotten during a memorized table reduction. For example, a user who wishes to avoid recording advertisements during video recording may activate the automatically-marked zapping function during regularly recorded programs. If the program material is about two hours per day and the advertisements that are shown during those programs are repeated daily, then the automatically-marked zapping function would have to have enough memory to retain the frame identifiers and the indexing tables to accommodate just over two hours of video information. This would allow the advertisements to be seen more than once before their statistical vectors and frame identifiers would be removed due to memory constraints.
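The sizing argument in the example above can be worked through numerically. The following is a rough sketch only: the frame rate, the number of bytes per frame identifier, and the indexing overhead are assumed values chosen for illustration and are not taken from this description.

```python
def frames_to_retain(hours_per_day: float, frames_per_second: float) -> int:
    """Number of frames received during one repetition period."""
    return round(hours_per_day * 3600 * frames_per_second)


def table_bytes(frames: int, bytes_per_identifier: int,
                index_overhead: float) -> int:
    """Approximate memory for the frame identifiers plus indexing tables.

    index_overhead is the assumed fraction of extra memory used by the
    indexing tables relative to the identifiers themselves.
    """
    return round(frames * bytes_per_identifier * (1 + index_overhead))


# Assumed values: 30 frames per second, 64 bytes per identifier,
# 25% indexing overhead. Two hours of program material per day.
frames = frames_to_retain(2, 30)
estimate = table_bytes(frames, 64, 0.25)
```

With these assumed figures, two hours of material corresponds to 216,000 frames, so the tables must hold slightly more than that many identifiers for a daily advertisement to be seen a second time before it is forgotten.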
Referring now to
At step 220 of
Step 226 determines if a user input has been received indicating the desire to mark information. If no user input is received, a statistical vector is computed on the next frame of information and stored in the circular buffer by repeating steps 220 through 224. Once the circular buffer is full, the statistical vector of each subsequent frame replaces that of the oldest (in time) received frame in the buffer. Once user input is received, step 228 marks the position within the circular buffer of the indicated starting frame.
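The buffering and marking behavior described above can be illustrated with a small circular buffer. This is a sketch under assumptions: the class name `StatsRing`, its method names, and the representation of a statistical vector as a plain list are hypothetical and are not part of this description.

```python
from typing import List, Optional


class StatsRing:
    """Fixed-size circular buffer of per-frame statistical vectors (a sketch)."""

    def __init__(self, capacity: int) -> None:
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.slots: List[Optional[list]] = [None] * capacity
        self.write = 0                    # slot the next vector will occupy
        self.count = 0                    # number of valid vectors held
        self.mark: Optional[int] = None   # slot of the user-indicated frame

    def push(self, vector: list) -> None:
        """Store a vector; once full, this overwrites the oldest entry."""
        self.slots[self.write] = vector
        self.write = (self.write + 1) % len(self.slots)
        self.count = min(self.count + 1, len(self.slots))

    def mark_latest(self) -> int:
        """Record the slot of the most recent frame when user input arrives."""
        if self.count == 0:
            raise IndexError("no frames buffered yet")
        self.mark = (self.write - 1) % len(self.slots)
        return self.mark

    def oldest_to_newest(self) -> List[list]:
        """Return the buffered vectors in time order."""
        n = len(self.slots)
        start = (self.write - self.count) % n
        return [self.slots[(start + i) % n] for i in range(self.count)]
```

Keeping a write index and a count, rather than shifting entries, means each incoming frame costs a constant amount of work, which matters when the buffer must keep pace with the frame rate.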
At step 230, the user inputs the minimum number of frames to indicate a successful match. This information is used to allow a successful match even if the very beginning of the sequence is not observed and also to determine the minimum amount of information to be collected and memorized for later marking. The latter use is necessary for this mode since the user does not indicate a starting and ending point, but provides only a single indication to mark the sequence. If surfing mode is indicated at step 232, the program length is entered at step 234 and the control logic continues with step 236. Otherwise, step 234 is skipped and the process continues with step 236.
Step 236 moves the circular buffer pointer backward (as measured by time) through the circular buffer a length equal to the recall length. Thus, the oldest statistical vectors can be memorized before they are overwritten by vectors characterizing subsequently received information frames. This also allows information received before the user-marked starting frame (saved at step 228) to be inserted into the memorized tables by steps 238 and 240. Subsequent frames continue to have their statistical vectors calculated and stored in the circular buffer while step 238 memorizes the statistics until step 240 detects the end of the sequence. Unlike the user-marked input process illustrated in
Referring now to
Extension of the memorized segments occurs when a terminal frame (the first or last frame) of memorized information is matched against a similar frame in the incoming information. The memorized segment is extended either forward or backward in time (if the last or first frame was matched, respectively) by memorizing additional sequential frames from the incoming information stream beginning with the frame adjacent to the matched terminal frame. Extension of the memorized segment is stopped when the chunking process (step 290) finds its segment boundaries.
Thus, steps 260 through 266 compute statistical vectors based on the incoming information stream and compare them to previously marked and memorized frames as previously described with reference to the user-marked zapping function (
Step 270 is illustrated with a dashed box since it is only performed when a recent successful match (one that is long enough) ended at the end of a memorized segment. This is indicated by a positive value for the extension counter which may be set by step 286 which is explained below. Thus, if the extension counter is zero, step 270 simply returns control to step 260. When the extension counter is positive, step 270 extends the memorized sequence forward in time by creating and memorizing frame identifiers using the current statistics determined at step 262. The frame is then added to the end of the memorized sequence that was last matched using an extension pointer (from step 286 explained below) to retrieve the last frame in that sequence. The extension pointer is then updated to point to the frame identifier that was just created and is now at the end of the matched memorized sequence. The extension counter is then decremented.
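The interplay of the extension counter and extension pointer can be sketched as follows. This is an illustrative model only: the `Node` and `Extender` classes and their method names are hypothetical, and a singly linked list stands in for the memorized sequence.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    """Hypothetical memorized frame identifier holding its statistics."""
    stats: tuple
    next: Optional["Node"] = None


@dataclass
class Extender:
    pointer: Optional[Node] = None  # last frame of the matched sequence
    counter: int = 0                # frames still allowed to be appended

    def arm(self, last_frame: Node, max_extension: int) -> None:
        """Counterpart of step 286: enable forward extension."""
        self.pointer = last_frame
        self.counter = max_extension

    def step(self, stats: tuple) -> bool:
        """Counterpart of step 270; returns True if a frame was appended."""
        if self.counter <= 0 or self.pointer is None:
            return False  # counter is zero: nothing to do
        new = Node(stats)
        self.pointer.next = new  # attach to the end of the sequence
        self.pointer = new       # pointer now refers to the new last frame
        self.counter -= 1
        return True
```

Because the counter starts at a maximum extension length and is decremented on every appended frame, the extension is bounded even if no other stopping condition is reached.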
With continuing reference to
In contrast to step 270, step 274 attempts to extend the memorized sequence backward in time. Step 274 attempts this extension only if the matching memorized frame occurs at the start of a memorized sequence, the matching frame is not a boundary frame, and the circular buffer of statistics includes valid time-contiguous statistics from the incoming information stream. When these conditions are met, the memorized sequence is extended backward in time by creating and memorizing a sequence of frame identifiers from the statistics buffered by step 268 and attaching them to the start of the previously memorized sequence. The point of attachment of the added sequence is cached so that the effects may be reversed by step 294 if necessary. Without this ability, incorrect short-sequence matches would add unrelated information onto the front of the memorized sequence. Steps 276 through 284 proceed in a manner similar to that illustrated and described with reference to steps 110 through 118 of
If the end of a memorized sequence is detected at step 284, then step 286 determines whether it is appropriate to extend the memorized video forward in time (as explained with reference to step 270). If the end of the matched memorized sequence is not a boundary frame (as defined below under step 290), the memorized information should be extended forward in time. In this case, step 286 sets the extension pointer to the currently matched, memorized frame identifier, and the extension counter to the maximum extension length. This allows step 270 to extend the trailer if the other conditions for extension are met as explained above.
If the matched sequence is long enough as determined in step 288, step 290 updates the “chunking” of the memorized information. This step updates the sequencing information between the memorized frames that were matched and the memorized frames which were not matched so that the sequences contained in the memorized tables only cover pieces (chunks) of information which always occur together. The starting point of the matched sequence will form a “segment boundary” provided there was incoming information (which failed to match) immediately preceding that starting frame. In this case, the pointers which cross this segment boundary are nulled which indicates that the corresponding frame identifiers on either side of the segment boundary are “boundary frames”. Similarly, the ending point of the matched sequence is updated. Thus, if memorized information follows the matched information, the pointers across this segment boundary are nulled and the frame identifiers on either side of the segment become boundary frames as well. Once the memorized sequence is divided into chunks (if necessary), only the chunk that contains the user-marked frame is retained. Other chunks are forgotten by removing all pointers to the frames in those chunks and making the memory space available to the system. In addition to assuring that the memorized tables contain only pieces of information which always occur together, this process also stops the growth outward at the boundary frames during automatic leader/trailer extension explained below.
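The chunking update described above can be sketched with a doubly linked sequence of frames. This is a minimal illustration under assumptions: the `Frame` class, the helper names, and the `unmatched_input_preceded` flag are hypothetical, and the forgetting of discarded chunks is left to garbage collection rather than performed explicitly.

```python
from typing import List, Optional


class Frame:
    """Hypothetical memorized frame with predecessor and successor pointers."""

    def __init__(self, name: str, user_marked: bool = False) -> None:
        self.name = name
        self.user_marked = user_marked
        self.prev: Optional["Frame"] = None
        self.next: Optional["Frame"] = None


def link(frames: List[Frame]) -> None:
    """Chain the frames into one memorized sequence."""
    for a, b in zip(frames, frames[1:]):
        a.next, b.prev = b, a


def cut_before(frame: Frame) -> None:
    """Null the pointers crossing the segment boundary just before `frame`."""
    if frame.prev is not None:
        frame.prev.next = None
        frame.prev = None


def chunk_of(frame: Frame) -> List[Frame]:
    """Collect the chunk containing `frame`, in sequence order."""
    head = frame
    while head.prev is not None:
        head = head.prev
    out = []
    node: Optional[Frame] = head
    while node is not None:
        out.append(node)
        node = node.next
    return out


def rechunk(first_matched: Frame, last_matched: Frame, marked: Frame,
            unmatched_input_preceded: bool) -> List[Frame]:
    """Split at the match boundaries and keep the user-marked chunk."""
    if unmatched_input_preceded:
        cut_before(first_matched)       # start of match becomes a boundary
    if last_matched.next is not None:
        cut_before(last_matched.next)   # memorized frames follow the match
    return chunk_of(marked)
```

After the cut, the frames on either side of each nulled pointer are the boundary frames, which is what later prevents leader or trailer extension from growing past them.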
Step 292 controls the recording device to zap the information from the recording as explained with reference to step 202 of
Referring now to
Steps 302 through 310 continue buffering statistical vectors of incoming information frames until a frame matches a previously marked frame as determined by step 308. To avoid confounding of information segments, step 310 preferably utilizes a different buffer for each information channel. Of course, other techniques for associating an information frame with the information channel which it came from are possible. As with step 270 of
Referring now to
A persistent (non-volatile) memory 342 receives and stores incoming information and is in communication with control logic 340. Persistent memory 342 may include a significant amount of homogeneous or heterogeneous memory such as DRAM, RAM, EEPROM or the like. The preferred size and preferred computational capabilities of persistent memory 342 and control logic 340, respectively, have been described with reference to their associated functions illustrated in
Thus, it is understood that, while the forms of the invention herein shown and described include the best mode contemplated for carrying out the invention, they are not intended to illustrate all possible forms thereof. It will also be understood that the words used are descriptive rather than limiting, and that various changes may be made without departing from the spirit and scope of the invention disclosed.
This application is a divisional of U.S. patent application Ser. No. 10/888,800 filed Jul. 8, 2004 now U.S. Pat. No. 7,849,475; which is a continuation of U.S. patent application Ser. No. 08/771,947 filed Dec. 23, 1996 (now U.S. Pat. No. 6,782,186); which is a continuation of U.S. patent application Ser. No. 08/399,482 filed Mar. 7, 1995 now abandoned; all of which are hereby incorporated herein by reference in their entireties.
Number | Date | Country |
---|---|---|
20110055861 A1 | Mar 2011 | US |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 10888800 | Jul 2004 | US |
Child | 12944142 | | US |
Relation | Number | Date | Country |
---|---|---|---|
Parent | 08771947 | Dec 1996 | US |
Child | 10888800 | | US |
Parent | 08399482 | Mar 1995 | US |
Child | 08771947 | | US |