Apparatus, system and method for extracting structure of song lyrics using repeated pattern thereof

Abstract
An apparatus, system, and method for extracting the structure of song lyrics using a repeated pattern thereof are provided. The apparatus includes a lyric extractor extracting lyric information from metadata related to an audio file, a character string information extractor extracting an interlude section and a repeated character string based on the extracted lyric information, a paragraph extractor extracting a paragraph based on the repeated character string and then a set of paragraphs having the same repeated pattern among the extracted paragraphs, and a lyric structure generator arranging an interlude section, a character string, and a paragraph related to the audio file in a tree structure.
Description

BRIEF DESCRIPTION OF THE DRAWINGS

The above and/or other features and advantages of the present invention will become more apparent by describing in detail preferred exemplary embodiments thereof with reference to the attached drawings in which:



FIG. 1 is an internal block diagram of an apparatus for extracting the structure of song lyrics using a repeated pattern thereof according to an exemplary embodiment of the present invention;



FIGS. 2A-2C illustrate an example of preprocessing extracted lyric information using a repeated pattern through a preprocessor in an apparatus for extracting the structure of song lyrics using a repeated pattern thereof according to a modified exemplary embodiment of the present invention;



FIG. 3 illustrates an example of extracting a keyword for an interlude section from lyric information in an interlude section extractor of an apparatus for extracting the structure of song lyrics using a repeated pattern thereof according to a modified exemplary embodiment of the present invention;



FIGS. 4A-4D illustrate an example of calculating a distance between two character strings and extracting a repeated character string in a character string repetition number extractor of an apparatus for extracting the structure of song lyrics using a repeated pattern thereof according to a modified exemplary embodiment of the present invention;



FIGS. 5A-5C illustrate an example of extracting a paragraph in a first paragraph extractor of an apparatus for extracting the structure of song lyrics using a repeated pattern thereof according to a modified exemplary embodiment of the present invention;



FIG. 6 illustrates an example of extracting an extended paragraph (EP) in a second paragraph extractor of an apparatus for extracting the structure of song lyrics using a repeated pattern thereof according to a modified exemplary embodiment of the present invention;



FIG. 7 illustrates an example of arranging predetermined lyric information in a tree structure in an apparatus for extracting the structure of song lyrics using a repeated pattern thereof according to a modified exemplary embodiment of the present invention;



FIG. 8 is a flowchart illustrating a method for extracting the structure of song lyrics using a repeated pattern thereof according to an exemplary embodiment of the present invention; and



FIG. 9 is an internal block diagram of a system for extracting the structure of song lyrics using a repeated pattern thereof according to an exemplary embodiment of the present invention.
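
FIGS. 5A-5C concern the first paragraph extractor, which claim 5 describes as extracting a set of consecutive character strings having the same number of repetitions as a paragraph. The following is a minimal sketch of that grouping step only, not the patented implementation. It assumes the repetition count of each lyric line has already been computed and that every run of equal counts (including non-repeated lines with a count of 1) forms its own group; the function name `extract_paragraphs` and its signature are hypothetical.

```python
from itertools import groupby


def extract_paragraphs(lines, counts):
    """Group consecutive lyric lines that share the same repetition count.

    lines  -- lyric lines in song order
    counts -- repetition count for each line (assumed precomputed)
    Returns a list of (count, [lines]) tuples, one per consecutive run.
    """
    if len(lines) != len(counts):
        raise ValueError("lines and counts must have the same length")
    paragraphs = []
    # groupby only merges adjacent items, which matches the
    # "consecutive character strings" wording of claim 5.
    for count, run in groupby(zip(lines, counts), key=lambda pair: pair[1]):
        paragraphs.append((count, [line for line, _ in run]))
    return paragraphs
```

For example, lines with counts 2, 2, 1, 1, 2, 2 would yield three candidate paragraphs: two runs of repeated lines separated by one run of non-repeated lines.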


Claims
  • 1. An apparatus for extracting the structure of song lyrics using a repeated pattern of the song lyrics, the apparatus comprising: a lyric extractor for extracting lyric information from metadata related to an audio file; a character string information extractor for extracting an interlude section and a repeated character string based on the extracted lyric information; a paragraph extractor for extracting a paragraph based on the repeated character string and then a set of paragraphs having a same repeated pattern among the extracted paragraphs; and a lyric structure generator for arranging interlude sections, character strings, and paragraphs related to the audio file in a tree structure.
  • 2. The apparatus of claim 1, wherein the character string information extractor comprises: an interlude section extractor for extracting a keyword for an interlude section from a character string contained in the extracted lyric information; and a character string repetition number extractor for calculating a distance between two character strings in the extracted lyric information and extracting the number of repetitions of character strings.
  • 3. The apparatus of claim 2, wherein the distance between two character strings is calculated using sim(S_I, S_J) = A_{I,J} / max(I, J), where A_{I,J} is the number of the same characters within each character string and max(I, J) is the greater number of characters within the character strings S_I and S_J.
  • 4. The apparatus of claim 3, wherein a degree of similarity between characters in the two character strings is calculated using a(i, j) = max(a(i, j−1), a(i−1, j), a(i−1, j−1) + δ(s_i, s_j)), where i and j denote characters within the two character strings and δ(s_i, s_j) indicates the sameness between characters being compared.
  • 5. The apparatus of claim 1, wherein the paragraph extractor comprises: a first paragraph extractor for extracting a set of consecutive character strings having the same number of repetitions among the repeated character strings as a paragraph; and a second paragraph extractor for extracting a set of paragraphs having the same repeated pattern among the paragraphs extracted by the first paragraph extractor.
  • 6. The apparatus of claim 1, further comprising: a preprocessor for deleting supplementary information contained in the extracted lyric information; and a controller for extracting a thematic portion of the audio file based on the created tree structure.
  • 7. A method for extracting the structure of song lyrics using a repeated pattern of the song lyrics, the method comprising: extracting lyric information from metadata related to an audio file; extracting an interlude section and a repeated character string based on the extracted lyric information; extracting a paragraph based on the repeated character string; extracting a set of paragraphs having the same repeated pattern among the extracted paragraphs; and arranging interlude sections, character strings, and paragraphs related to the audio file in a tree structure.
  • 8. The method of claim 7, wherein a distance between two character strings is calculated using sim(S_I, S_J) = A_{I,J} / max(I, J), where A_{I,J} is the number of the same characters within each character string and max(I, J) is the greater number of characters within the character strings S_I and S_J.
  • 9. The method of claim 8, wherein a degree of similarity between characters in the two character strings is calculated using a(i, j) = max(a(i, j−1), a(i−1, j), a(i−1, j−1) + δ(s_i, s_j)), where i and j denote characters within the two character strings and δ(s_i, s_j) indicates the sameness between characters being compared.
  • 10. The method of claim 7, further comprising performing preprocessing to delete supplementary information contained in the extracted lyric information.
  • 11. The method of claim 7, further comprising extracting a thematic portion of the audio file based on the tree structure.
  • 12. A system for extracting the structure of song lyrics using a repeated pattern of the song lyrics, the system comprising: a key input unit for receiving predetermined key input signals generated by a user's selection; an audio file management unit for managing an operation related to an audio file when the key input signals indicate a request for the audio file; and a lyric structure generator for analyzing the song lyrics related to the audio file, extracting a repeated character string and a paragraph, and arranging information about the song lyrics in a tree structure.
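
The recurrence recited in claims 4 and 9 has the form of a longest-common-subsequence computation, and claims 3 and 8 normalize its result by the length of the longer string. The following is a minimal sketch under stated assumptions, not the claimed implementation: δ(s_i, s_j) is taken to be 1 when the two characters are equal and 0 otherwise, the boundary values a(0, j) and a(i, 0) are taken to be 0, and A_{I,J} is taken to equal a(I, J). The function names and the 0.8 threshold used to decide that two lines count as repetitions of each other are hypothetical.

```python
def lcs_length(s: str, t: str) -> int:
    """Fill the table a(i, j) from claims 4 and 9 and return a(I, J)."""
    rows, cols = len(s), len(t)
    # a[0][j] and a[i][0] stay 0 (assumed boundary condition).
    a = [[0] * (cols + 1) for _ in range(rows + 1)]
    for i in range(1, rows + 1):
        for j in range(1, cols + 1):
            # Assumed delta: 1 if the compared characters are the same, else 0.
            delta = 1 if s[i - 1] == t[j - 1] else 0
            a[i][j] = max(a[i][j - 1], a[i - 1][j], a[i - 1][j - 1] + delta)
    return a[rows][cols]


def sim(s: str, t: str) -> float:
    """sim(S_I, S_J) = A_{I,J} / max(I, J), with A_{I,J} assumed to be a(I, J)."""
    longest = max(len(s), len(t))
    if longest == 0:
        return 1.0  # assumption: two empty strings are treated as identical
    return lcs_length(s, t) / longest


def repetition_counts(lines, threshold=0.8):
    """Count, for each line, how many lines (itself included) are similar to it.

    The threshold is a hypothetical cutoff; the source does not state one.
    """
    return [
        sum(1 for other in lines if sim(line, other) >= threshold)
        for line in lines
    ]
```

With these assumptions, `sim("abcde", "ace")` evaluates to 3/5, since the longest common subsequence has three characters and the longer string has five. Normalizing by the longer length keeps the score in the range 0 to 1 and penalizes a short line that merely appears inside a long one.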
Priority Claims (1)
Number Date Country Kind
10-2006-0013269 Feb 2006 KR national