Claims
- 1. A method for monitoring audio content in a video broadcast, said method comprising the steps of:
receiving an audio datastream from the video broadcast; generating audio identifying information for audio content from the audio datastream; and determining whether the audio identifying information generated for the received audio content matches audio identifying information in an audio content database.
- 2. The method according to claim 1, wherein the audio identifying information is an audio feature signature that is based on audio content.
- 3. The method according to claim 2, wherein the determining step includes the sub-step of comparing the audio feature signature generated for the received audio content with the audio feature signatures stored in the audio content database.
- 4. The method according to claim 1, further comprising the steps of:
generating audio identifying information for predetermined audio content; and storing the audio identifying information for the predetermined audio content in the audio content database.
- 5. A method for charging a royalty for usage of copyrighted audio content in a video broadcast, said method comprising the steps of:
receiving an audio datastream from the video broadcast; generating audio identifying information for audio content from the audio datastream; determining whether the audio identifying information generated for the received audio content matches audio identifying information in a copyrighted audio content database; and if the audio identifying information generated for the received audio content matches audio identifying information in the copyrighted audio content database, determining a duration of the audio content in the video broadcast and charging a royalty based on the duration.
- 6. The method according to claim 1, further comprising the step of:
if the audio identifying information generated for the received audio content matches audio identifying information in the audio content database, recording information on a match between the received audio content and the audio content database.
- 7. The method according to claim 6, further comprising the steps of:
compiling cue sheet entries from the recorded match information, the cue sheet entries including identification of at least one piece of audio content in the video broadcast, a start time of each piece of audio content, and a duration of each piece of audio content; and charging royalties based on the cue sheet entries.
- 8. The method according to claim 6, further comprising the steps of:
compiling cue sheet entries from the recorded match information, the cue sheet entries including identification of each piece of audio content in the video broadcast, a start time of each piece of audio content, and a duration of each piece of audio content; and paying royalties based on the cue sheet entries.
- 9. The method according to claim 6, further comprising the step of:
compiling a cue sheet containing cue sheet entries from the recorded match information, the cue sheet entries including identification of at least one piece of audio content in the video broadcast, a start time of each piece of audio content, and a duration of each piece of audio content; and charging a fee for the cue sheet.
- 10. The method according to claim 1, wherein the generating step includes the sub-steps of:
obtaining an audio signal characterized by a time dependent power spectrum; analyzing the spectrum to obtain a plurality of time dependent frequency components; detecting a plurality of events in the plurality of time dependent frequency components; and producing the audio identifying information for the audio content from the audio datastream based on the plurality of events.
- 11. The method according to claim 10, wherein the sub-step of analyzing the spectrum includes:
sampling the audio signal to obtain a plurality of audio signal samples; taking a plurality of subsets from the plurality of audio signal samples; and performing a Fourier transform on each of the plurality of subsets to obtain a set of Fourier frequency components.
- 12. The method according to claim 10, wherein the sub-step of detecting a plurality of events includes:
taking a first running average with a first averaging period of each of a first subset of the plurality of time dependent frequency components to obtain a first sequence of average powers at a set of successive times for each of the plurality of time dependent frequency components; taking a second running average with a second averaging period that is different from the first averaging period of each of the subset of the plurality of time dependent frequency components to obtain a second sequence of average powers at the set of successive times for each of the plurality of time dependent frequency components; and identifying a plurality of average crossing events at a plurality of event times at which the first running average crosses the second running average.
- 13. The method according to claim 1, wherein the generating step includes the sub-steps of:
performing a Fourier transformation of the audio signals of the portion into a time series of audio power dissipated over a first plurality of frequencies; grouping the frequencies into a smaller second plurality of bands that each include a range of neighboring frequencies; detecting power dissipation events in each of the bands; and grouping together the power dissipation events from mutually adjacent bands at a selected moment so as to form the identifying feature.
- 14. A computer-readable medium encoded with a program for monitoring audio content in a video broadcast, said program containing instructions for performing the steps of:
receiving an audio datastream from the video broadcast; generating audio identifying information for audio content from the audio datastream; and determining whether the audio identifying information generated for the received audio content matches audio identifying information in an audio content database.
- 15. The computer-readable medium according to claim 14, wherein the audio identifying information is an audio feature signature that is based on audio content.
- 16. The computer-readable medium according to claim 15, wherein the determining step includes the sub-step of comparing the audio feature signature generated for the received audio content with the audio feature signatures stored in the audio content database.
- 17. The computer-readable medium according to claim 14, wherein said program further contains instructions for performing the steps of:
generating audio identifying information for predetermined audio content; and storing the audio identifying information for the predetermined audio content in the audio content database.
- 18. The computer-readable medium according to claim 14, wherein said program further contains instructions for performing the step of:
if the audio identifying information generated for the received audio content matches audio identifying information in the audio content database, determining a duration of the audio content in the video broadcast and charging a royalty based on the duration.
- 19. The computer-readable medium according to claim 14, wherein said program further contains instructions for performing the steps of:
if the audio identifying information generated for the received audio content matches audio identifying information in the audio content database, recording information on a match between the received audio content and the audio content database; compiling cue sheet entries from the recorded match information, the cue sheet entries including identification of at least one piece of audio content in the video broadcast, a start time of each piece of audio content, and a duration of each piece of audio content; and charging royalties based on the cue sheet entries.
- 20. The computer-readable medium according to claim 14, wherein said program further contains instructions for performing the steps of:
if the audio identifying information generated for the received audio content matches audio identifying information in the audio content database, recording information on a match between the received audio content and the audio content database; compiling cue sheet entries from the recorded match information, the cue sheet entries including identification of each piece of audio content in the video broadcast, a start time of each piece of audio content, and a duration of each piece of audio content; and paying royalties based on the cue sheet entries.
- 21. The computer-readable medium according to claim 14, wherein the generating step includes the sub-steps of:
obtaining an audio signal characterized by a time dependent power spectrum; analyzing the spectrum to obtain a plurality of time dependent frequency components; detecting a plurality of events in the plurality of time dependent frequency components; and producing the audio identifying information for the audio content from the audio datastream based on the plurality of events.
- 22. The computer-readable medium according to claim 21, wherein the sub-step of analyzing the spectrum includes:
sampling the audio signal to obtain a plurality of audio signal samples; taking a plurality of subsets from the plurality of audio signal samples; and performing a Fourier transform on each of the plurality of subsets to obtain a set of Fourier frequency components.
- 23. The computer-readable medium according to claim 21, wherein the sub-step of detecting a plurality of events includes:
taking a first running average with a first averaging period of each of a first subset of the plurality of time dependent frequency components to obtain a first sequence of average powers at a set of successive times for each of the plurality of time dependent frequency components; taking a second running average with a second averaging period that is different from the first averaging period of each of the subset of the plurality of time dependent frequency components to obtain a second sequence of average powers at the set of successive times for each of the plurality of time dependent frequency components; and identifying a plurality of average crossing events at a plurality of event times at which the first running average crosses the second running average.
- 24. The computer-readable medium according to claim 14, wherein the generating step includes the sub-steps of:
performing a Fourier transformation of the audio signals of the portion into a time series of audio power dissipated over a first plurality of frequencies; grouping the frequencies into a smaller second plurality of bands that each include a range of neighboring frequencies; detecting power dissipation events in each of the bands; and grouping together the power dissipation events from mutually adjacent bands at a selected moment so as to form the identifying feature.
- 25. A system for monitoring audio content in a video broadcast, said system comprising:
a receiver for receiving an audio datastream from the video broadcast; an identifying information generator for generating audio identifying information for audio content from the audio datastream; and a match detector for determining whether the audio identifying information generated for the received audio content matches audio identifying information in an audio content database.
- 26. The system according to claim 25, wherein the audio identifying information is an audio feature signature that is based on audio content.
- 27. The system according to claim 25, wherein the audio content database stores audio identifying information for predetermined audio content.
- 28. The system according to claim 25, further comprising:
an invoicer for determining a duration of the audio content in the video broadcast and charging a royalty based on the duration, if the audio identifying information generated for the received audio content matches audio identifying information in the audio content database.
- 29. The system according to claim 25, further comprising:
an information collector for recording information on a match between the received audio content and the audio content database, if the audio identifying information generated for the received audio content matches audio identifying information in the audio content database; a cue sheet generator for compiling cue sheet entries from the recorded match information, the cue sheet entries including identification of at least one piece of audio content in the video broadcast, a start time of each piece of audio content, and a duration of each piece of audio content; and an invoicer for charging royalties based on the cue sheet entries.
- 30. The system according to claim 25, further comprising:
an information collector for recording information on a match between the received audio content and the audio content database, if the audio identifying information generated for the received audio content matches audio identifying information in the audio content database; a cue sheet generator for compiling cue sheet entries from the recorded match information, the cue sheet entries including identification of each piece of audio content in the video broadcast, a start time of each piece of audio content, and a duration of each piece of audio content; and a royalty calculator for calculating royalties to be paid based on the cue sheet entries.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a continuation-in-part of prior U.S. application Ser. No. 09/803,298, filed Mar. 9, 2001, now ______, which is based upon and claims priority from prior U.S. Provisional Application No. 60/245,799, filed Nov. 3, 2000. The entire disclosures of application Ser. No. 09/803,298 and Provisional Application No. 60/245,799 are herein incorporated by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60245799 |
Nov 2000 |
US |
Continuation in Parts (1)
|
Number |
Date |
Country |
Parent |
09803298 |
Mar 2001 |
US |
Child |
09895822 |
Jun 2001 |
US |