Claims
- 1. A method for assessing quality of metadata associated with media on a communications network, said metadata comprising at least one keyword, said method comprising the steps of:
separating said metadata associated with said media into at least one keyword; comparing each keyword with at least one valid keyword in a set of valid keywords for determining a degree of similarity between said metadata and said set of valid metadata; and calculating a score in accordance with said degree of similarity for indicating said quality of said metadata.
- 2. A method in accordance with claim 1, further comprising the step of qualifying said metadata as valid metadata if said score is equal to or greater than a predetermined threshold, wherein said valid metadata comprises valid keywords.
- 3. A method in accordance with claim 1, wherein said step of separating said metadata comprises replacing each connecting character positioned adjacent to at least one keyword with a space.
- 4. A method in accordance with claim 3, wherein connecting characters comprise at least one of a period (“.”), an underscore (“_”), a backslash (“\”), a forward slash (“/”), a comma (“,”), an asterisk (“*”), and a hyphen (“-”).
- 5. A method in accordance with claim 1, wherein said step of calculating said score comprises summing a total number of matches between each valid keyword and each keyword.
- 6. A method in accordance with claim 1, wherein said step of calculating said score comprises:
assigning a weight to each of said valid keywords; for each valid keyword, summing a number of matches with said keywords for providing a keyword summation; multiplying said weight of each keyword by its respective keyword summation to provide a keyword product; and accumulating keyword products in accordance with one of summing all of said keyword products and averaging all of said keyword products for providing said score.
- 7. A method in accordance with claim 1, said metadata comprising a plurality of uniform resource indicators (URIs), each URI comprising at least one keyword; said method further comprising the steps of:
calculating a score for each URI; and qualifying said URI as a valid URI if a respective score is equal to or greater than a predetermined threshold, wherein said valid URI comprises valid keywords.
- 8. A method in accordance with claim 1, wherein said metadata comprise elements related to at least one of content of the media, intellectual property rights associated with the media, and instantiation of the media.
- 9. A method in accordance with claim 1, wherein said media comprises at least one of multimedia and streaming media.
- 10. A method in accordance with claim 1, wherein said communications network is a computer network.
- 11. A computer system for assessing quality of metadata associated with media on a computer network, said metadata comprising at least one keyword, said computer system comprising at least one computer, each of said at least one computer being communicatively coupled to all of said at least one computer, wherein each of said at least one computer includes at least one program stored therein for allowing communication between each and every of said at least one computer, each of said at least one program operating in conjunction with one another to cause said at least one computer to perform the steps of:
separating said metadata associated with said media into at least one keyword; comparing each keyword with at least one valid keyword in a set of valid keywords for determining a degree of similarity between said metadata and said set of valid metadata; and calculating a score in accordance with said degree of similarity for indicating said quality of said metadata.
- 12. A computer system in accordance with claim 11, wherein said step of calculating comprises:
assigning a weight to each of said valid keywords; for each valid keyword, summing a number of matches with said keywords for providing a keyword summation; multiplying said weight of each keyword by its respective keyword summation to provide a keyword product; and accumulating keyword products in accordance with one of summing all of said keyword products and averaging all of said keyword products for providing said score.
- 13. A program readable medium having embodied thereon a program for causing a processor to assess a quality of metadata associated with media on a communications network, said metadata comprising at least one keyword, said program readable medium comprising:
means for causing said processor to separate said metadata associated with said media into at least one keyword; means for causing said processor to compare each keyword with at least one valid keyword in a set of valid keywords for determining a degree of similarity between said metadata and said set of valid metadata; and means for causing said processor to calculate a score in accordance with said degree of similarity for indicating said quality of said metadata.
- 14. A program readable medium in accordance with claim 13, wherein said means for causing said processor to calculate said score comprises:
means for causing said processor to assign a weight to each of said valid keywords; for each valid keyword, means for causing said processor to sum a number of matches with said keywords for providing a keyword summation; means for causing said processor to multiply said weight of each keyword by its respective keyword summation to provide a keyword product; and means for causing said processor to accumulate keyword products in accordance with one of summing all of said keyword products and averaging all of said keyword products for providing said score.
- 15. A data signal embodied in a carrier wave comprising:
a separate metadata code segment for separating metadata associated with media on a communications network into at least one keyword, wherein said metadata comprises at least one keyword; a compare keyword code segment for comparing each keyword with at least one valid keyword in a set of valid keywords for determining a degree of similarity between said metadata and said set of valid metadata; and a calculate score code segment for calculating a score in accordance with said degree of similarity for indicating a quality of said metadata.
- 16. A data signal in accordance with claim 15, further comprising a qualify metadata code segment for qualifying said metadata as valid metadata if said score is equal to or greater than a predetermined threshold, wherein said valid metadata comprises valid keywords.
- 17. A data signal in accordance with claim 15, wherein said metadata is separated by replacing each connecting character positioned adjacent to at least one keyword with a space.
- 18. A data signal in accordance with claim 17, wherein connecting characters comprise at least one of a period (“.”), an underscore (“_”), a backslash (“\”), a forward slash (“/”), a comma (“,”),an asterisk (“*”) and a hyphen (“-”)
- 19. A data signal in accordance with claim 15, wherein said calculate score code segment comprises a sum all matches code segment for summing a total number of matches between each valid keyword and each keyword.
- 20. A data signal in accordance with claim 15, wherein said calculate score code segment comprises:
an assign weight code segment for assigning a weight to each of said valid keywords; for each valid keyword, a sum valid keyword matches code segment for summing a number of matches with said keywords for providing a keyword summation; a multiply code segment for multiplying said weight of each keyword by its respective keyword summation to provide a keyword product; and an accumulate code segment for accumulating keyword products in accordance with one of summing all of said keyword products and averaging all of said keyword products for providing said score.
- 21. A data signal in accordance with claim 15, said metadata comprising a plurality of uniform resource indicators (URIs), each URI comprising at least one keyword, said data signal further comprising:
a calculate URI score code segment for calculating a score for each URI; a qualify URI code segment for qualifying said URI as valid URI if a respective score is equal to or greater than a predetermined threshold, wherein said valid URI comprises valid keywords.
- 22. A data signal in accordance with claim 15, wherein said metadata comprise elements related to at least one of content of the media, intellectual property rights associated with the media, and instantiation of the media.
a calculate URI score code segment for calculating a score for each URI; a qualify URI code segment for qualifying said URI as valid URI if a respective score is equal to or greater than a predetermined threshold, wherein said valid URI comprises valid keywords.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority from U.S. provisional application No. 60/252,273, filed on Nov. 21, 2000, which is herein incorporated by reference in its entirety. This application is related to the following applications, each being filed on the same date as the instant application: application number ——————, entitled “Internet Streaming Media Workflow Architecture,” attorney docket number D4961-00014; application number ——————, entitled “Interpretive Stream Metadata Extraction,” attorney docket number D4961-00015; application number ——————, entitled “Metadata Quality Improvement,” attorney docket number D4961-00016; application number ——————, entitled “Full Text Relevancy Ranking,” attorney docket number D4961-00017; application number ——————, entitled “Grouping Multimedia And Streaming Media Search Results,” attorney docket number D4961-00018; and application number ——————,entitle “Internet Crawl Seeding,” attorney docket number D4961-00020.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60252273 |
Nov 2000 |
US |