Claims
- 1. For use in a video signal processor, a system for locating boundaries of video programs and commercials comprising:
an audio classifier controller capable of receiving at least one audio category of at least one segment of an audio signal, and capable of determining at least one rate of change of said at least one audio category, and capable of locating at least one of said boundaries by comparing said at least one rate of change of said at least one audio category with a threshold value.
- 2. The system as claimed in claim 1 wherein said audio classifier controller comprises a classification algorithm that is capable of classifying audio signals to obtain at least one audio category for each segment of said audio signal.
- 3. The system as claimed in claim 2 wherein said classification algorithm is capable of classifying audio signals into audio categories of silence, music, noise and speech.
- 4. The system as claimed in claim 3 wherein said audio classifier controller comprises a speaker identifier comprising a speech database that contains voice identification information of persons whose voices have been identified, and wherein said classification algorithm is capable of accessing said speech database of said speaker identifier and classifying speech audio signals of persons whose voices are in said speech database of said speaker identifier as audio categories.
- 5. The system as claimed in claim 4 wherein said speaker identifier comprises an unknown speaker database that contains voice information of persons whose voices have not been identified,
wherein said classification algorithm is capable of accessing said unknown speaker database and determining the number of unknown speakers who speak within a given period of time, and wherein said classification algorithm is capable of updating said speech database in said speaker identifier to add voice identification information for newly identified speakers.
- 6. The system as claimed in claim 1 wherein said audio classifier controller comprises a category change detector capable of receiving audio categories of segments of said audio signal, and capable of determining when an audio category of said audio signal changes, and capable of determining the identities of said audio categories before and after said change of audio category.
- 7. The system as claimed in claim 6 wherein said category change detector is capable of detecting audio subcategories of segments of said audio signal, and is capable of determining when an audio subcategory of said audio signal changes, and is capable of determining the identities of said audio subcategories before and after said change of audio subcategory.
- 8. The system as claimed in claim 6 wherein said audio classifier controller comprises a category change rate detector capable of receiving information from said category change detector concerning audio category changes, and capable of calculating the rates at which said audio category changes occur.
- 9. The system as claimed in claim 8 wherein said category change rate detector is capable of determining an overall change rate using information from the change rate of each category, the audio cut rate, the total length of time of each category, and the ratio of the change rate of each category to the total length of time of the category within a given period of time.
- 10. The system as claimed in claim 8 wherein said audio classifier controller comprises a boundary detector capable of receiving information from said category change rate detector concerning audio category rate changes, and capable of classifying at least one feature concerning at least one audio category rate change using a multifeature classifier to locate at least one boundary of a video program segment.
- 11. The system as claimed in claim 10 wherein said boundary detector is capable of assigning a weighting factor to each change in each category, said weighting factor comprising a number that represents the relative importance assigned to the category change in assessing the likelihood of locating a boundary at a point where a particular change in category occurs.
- 12. The system as claimed in claim 10 wherein said boundary detector is capable of receiving information from said category change rate detector concerning an overall change rate determined by using information from the change rate of each category, the audio cut rate, the total length of time of each category, and the ratio of the change rate of each category to the total length of time of the category within a given period of time, said boundary detector capable of classifying at least one feature concerning at least one overall change rate using a multifeature classifier to locate at least one boundary of a video program segment.
- 13. A video signal processor capable of locating boundaries of video programs and commercials comprising:
an audio classifier controller capable of receiving at least one audio category of at least one segment of an audio signal, and capable of determining at least one rate of change of said at least one audio category, and capable of locating at least one of said boundaries by comparing said at least one rate of change of said at least one audio category with a threshold value.
- 14. The video signal processor as claimed in claim 13 wherein said video signal processor comprises one of:
a television receiver, a video recorder, a device for receiving streaming video data signals, and a computerized personal multimedia retrieval system.
- 15. An audio signal processor capable of locating boundaries of audio programs and commercials comprising:
an audio classifier controller capable of receiving at least one audio category of at least one segment of an audio signal, and capable of determining at least one rate of change of said at least one audio category, and capable of locating at least one of said boundaries by comparing said at least one rate of change of said at least one audio category with a threshold value.
- 16. The audio signal processor as claimed in claim 15 wherein said audio signal processor comprises one of:
a radio receiver, an audio recorder, a device for receiving a source of streaming audio data signals, and a computerized personal audio multimedia retrieval system.
- 20. The method as claimed in claim 19, further comprising the steps of:
accessing a speech database in a speaker identifier within said audio classifier controller that contains voice identification information of persons who have been identified; and classifying speech audio signals of persons whose voices are in said speech database as audio categories.
- 21. The method as claimed in claim 20, further comprising the steps of:
accessing an unknown speaker database in said speaker identifier that contains voice information of persons who have not been identified; determining the number of unknown speakers who speak within a given period of time; and updating said speech database in said speaker identifier to add voice identification information for newly identified speakers.
- 22. The method as claimed in claim 17, further comprising the steps of:
receiving audio categories of said audio signal in a category change detector of said audio classifier controller; determining in said category change detector when an audio category of said audio signal changes; and determining in said category change detector the identities of said audio categories before and after said change of audio category.
- 23. The method as claimed in claim 22, further comprising the steps of:
receiving audio subcategories of said audio signal in a category change detector of said audio classifier controller; determining in said category change detector when an audio subcategory of said audio signal changes; and determining in said category change detector the identities of said audio subcategories before and after said change of audio subcategory.
- 24. The method as claimed in claim 22, further comprising the steps of:
receiving in a category change rate detector information from said category change detector concerning audio category changes; and calculating the rates at which said audio category changes occur.
- 25. The method as claimed in claim 24, further comprising the steps of:
determining in said category change rate detector an overall change rate using information from the change rate of each category, the audio cut rate, the total length of time of each category, and the ratio of the change rate of each category to the total length of time of the category within a given period of time; and classifying at least one feature concerning at least one overall change rate using a multifeature classifier to locate at least one boundary of a video segment.
- 26. The method as claimed in claim 24, further comprising the steps of:
receiving information in a boundary detector of said audio classifier controller from said category change rate detector concerning audio category rate changes; and classifying at least one feature concerning at least one audio category rate change using a multifeature classifier to locate at least one boundary of a video program segment containing said audio signal.
- 27. The method as claimed in claim 26, further comprising the step of:
assigning a weighting factor to each change in each category, wherein said weighting factor comprises a number that represents the relative importance assigned to the category change in assessing the likelihood of locating a boundary at a point where a particular change in category occurs.
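For illustration only, the chain recited in the claims above (a category change detector, a category change rate detector, and a comparison of the change rate with a threshold value) might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the names `Change`, `detect_changes`, `change_rate`, and `locate_boundaries`, the fixed non-overlapping windows, the default threshold of 0.3 changes per second, and the rule that a boundary lies where the windowed rate crosses the threshold are all hypothetical choices made for the example. The weighting factors and the multifeature classifier are not modeled.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Change:
    """One audio category change (hypothetical record type)."""
    time: float   # time in seconds at which the category changes
    before: str   # identity of the category before the change
    after: str    # identity of the category after the change


def detect_changes(categories: List[str], segment_len: float) -> List[Change]:
    """Report each point where consecutive segments have different categories."""
    changes = []
    for i in range(1, len(categories)):
        if categories[i] != categories[i - 1]:
            changes.append(Change(i * segment_len, categories[i - 1], categories[i]))
    return changes


def change_rate(changes: List[Change], start: float, end: float) -> float:
    """Category changes per second within the half-open window [start, end)."""
    count = sum(1 for c in changes if start <= c.time < end)
    return count / (end - start)


def locate_boundaries(categories: List[str], segment_len: float = 1.0,
                      window: float = 10.0, threshold: float = 0.3) -> List[float]:
    """Return window start times where the change rate crosses the threshold.

    Assumption: a boundary is placed wherever the 'rate exceeds threshold'
    flag differs between two adjacent windows.
    """
    changes = detect_changes(categories, segment_len)
    total = len(categories) * segment_len
    flags = []
    t = 0.0
    while t < total:
        end = min(t + window, total)
        flags.append((t, change_rate(changes, t, end) > threshold))
        t += window
    return [flags[i][0] for i in range(1, len(flags))
            if flags[i][1] != flags[i - 1][1]]


# Hypothetical one-second segments: steady speech, a burst of rapid
# category changes, then steady speech again.
segments = (["speech"] * 20
            + ["music", "speech", "noise", "music", "silence",
               "speech", "music", "noise", "speech", "music"]
            + ["speech"] * 20)
boundaries = locate_boundaries(segments)
```

In this made-up input the category changes only inside the middle ten seconds, so the windowed rate exceeds the threshold only there and the sketch reports one boundary on each side of that stretch.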
CROSS-REFERENCE TO RELATED PATENT AND APPLICATION
[0001] The present invention is related to the inventions disclosed in U.S. Pat. No. 6,100,941 issued Aug. 8, 2000, entitled “APPARATUS AND METHOD FOR LOCATING A COMMERCIAL DISPOSED WITHIN A VIDEO DATA STREAM” and in U.S. Pat. application Ser. No. 09/006,657 filed Jan. 13, 1998, entitled “MULTIMEDIA COMPUTER SYSTEM WITH STORY SEGMENTATION CAPABILITY AND OPERATING PROGRAM THEREFOR INCLUDING FINITE AUTOMATON VIDEO PARSER.” This patent and this patent application are commonly assigned to the assignee of the present invention. The disclosures of this patent and patent application are hereby incorporated herein by reference for all purposes as if fully set forth herein.
Continuation in Parts (1)
| | Number | Date | Country |
|---|---|---|---|
| Parent | 09006657 | Jan 1998 | US |
| Child | 09746077 | Dec 2000 | US |