The disclosure of Japanese Patent Application No. 2007-339372, filed on Dec. 28, 2007, is incorporated herein by reference.
The illustrative embodiments relate to a music displaying apparatus and a computer-readable storage medium storing a music displaying program for displaying a music piece to a user, and more particularly, to a music displaying apparatus and a computer-readable storage medium storing a music displaying program for analyzing user's singing voice, thereby displaying a music piece.
Karaoke apparatuses, which have a function of analyzing singing of a singing person to report a result in addition to a function of playing a karaoke music piece, have been put to practical use. For example, a karaoke apparatus is disclosed, which analyzes formant of a singing voice of the singing person and displays a portrait of a professional singer having a voice similar to that of the singing person (e.g. Japanese Laid-Open Patent Publication No. 2000-56785). The karaoke apparatus includes a database in which formant data of voices of a plurality of professional singers is stored in advance. Formant data obtained by analyzing the singing voice of the singing person is collated with the formant data stored in the database, and a portrait of a professional singer having a high similarity is displayed. Further, the karaoke apparatus is capable of displaying a list of music pieces of the professional singer.
However, the above karaoke apparatus disclosed in Japanese Laid-Open Patent Publication No. 2000-56785 has the following problem. The karaoke apparatus merely determines whether or not the voice of the singing person (the formant data) is similar to the voices of the professional singers, which are stored in the database, and does not take into consideration a characteristic (a way) of the singing of the singing person. In other words, only a portrait of a professional singer having a voice similar to that of the singing person, and a list of music pieces of the professional singer are shown, and the shown music pieces are not necessarily easy or suitable for the singing person to sing. For example, the karaoke apparatus cannot show a music piece of a genre at which the singing person is good. Therefore, a feature of the illustrative embodiments is to provide a music displaying apparatus and a computer-readable storage medium storing a music displaying program for analyzing a singing characteristic of the singing person, thereby displaying a music piece and a genre which are suitable for the singing person to sing.
The illustrative embodiments may have the following exemplary features. It is noted that reference numerals and supplementary explanations in parentheses are merely provided to facilitate the understanding of the illustrative embodiments in relation to certain illustrative embodiments.
A first illustrative embodiment may have a music displaying apparatus comprising voice data obtaining means (21), singing characteristic analysis means (21), music piece related information storage means (24), comparison parameter storage means (24), comparison means (21), selection means (21), and displaying means (12, 21). The voice data obtaining means is means for obtaining voice data concerning singing of a user. The voice data obtaining means is means for obtaining voice data concerning singing of a user. The singing characteristic analysis means is means for analyzing the voice data to calculate a plurality of singing characteristic parameters which indicate a characteristic of the singing of the user. The music piece related information storage means is means for storing music piece related information concerning a music piece. The comparison parameter storage means for storing a plurality of comparison parameters, which is to be compared with the plurality of singing characteristic parameters, so as to be associated with the music piece related information. The comparison means for comparing the plurality of singing characteristic parameters with the plurality of comparison parameters to calculate a similarity between the plurality of singing characteristic parameters and the plurality of comparison parameters. The selection means is means for selecting at least one piece of the music piece related information which is associated with a comparison parameter which has a high similarity with the singing characteristic parameter. The displaying means is means for displaying information based on the music piece related information selected by the selection means.
According to an exemplary feature of the first illustrative embodiment, it is possible to show to the user information based on the music piece related information, which takes into consideration the characteristic of the singing of the user, for example, information concerning a karaoke music piece suitable for the user to sing, and a music genre suitable for the user to sing.
In another exemplary feature of the first illustrative embodiment, the music piece related information storage means stores, as the music piece related information, music piece data for reproducing at least the music piece. The comparison parameter storage means stores, as the comparison parameter, a parameter which indicates a musical characteristic of the music piece so as to be associated with the music piece data. The selection means selects at least one piece of the music piece data which is associated with the comparison parameter which has the high similarity with the singing characteristic parameter. The displaying means shows information of the music piece based on the music piece data selected by the selection means.
According to an exemplary feature of the first illustrative embodiment, information of a music piece, such as a karaoke music piece suitable for the user to sing, and the like, can be shown.
In an exemplary feature of the first illustrative embodiment, the comparison parameter storage means further stores, as the comparison parameter, a parameter which indicates a musical characteristic of the music genre. The music piece related information storage means further stores, as the music piece related information, genre data which indicates a music genre. The music displaying apparatus further comprises music piece genre similarity data storage means (24), and voice genre similarity calculation means (21). The music piece genre similarity data storage means is means for storing music piece genre similarity data which indicates a similarity between the music piece and the music genre. The voice genre similarity calculation means is means for calculating a similarity between the singing characteristic parameter and the music genre. The selection means selects the music piece data based on the similarity calculated by the voice genre similarity calculation means and the music piece genre similarity data stored by the music piece genre similarity data storage means.
In another exemplary feature of the first illustrative embodiment, the music piece data includes musical score data for indicating musical instruments used for playing the music piece, a tempo of the music piece, and a key of the music piece. The music displaying apparatus further comprises music piece genre similarity calculation means for calculating a similarity between the music piece and the music genre based on the musical instruments, the tempo and the key which are included in the musical score data.
According to an exemplary feature of the first illustrative embodiment, a music piece such as a karaoke music piece, and the like can be shown while a music genre suitable for the characteristic of the singing of the user is taken into consideration.
In an exemplary feature of the first illustrative embodiment, each of the plurality of singing characteristic parameters and the plurality of comparison parameters includes a value obtained by evaluating one of accuracy of pitch concerning the singing of the user, variation in pitch, a periodical input of voice, and a singing range.
According to an exemplary feature of the first illustrative embodiment, the similarity can be calculated more accurately.
In an exemplary feature of the first illustrative embodiment, the music piece data includes musical score data which indicates musical instruments used for the music piece, a tempo of the music piece, a key of the music piece, a plurality of musical notes which constitute the music piece. The singing characteristic analysis means includes voice volume/pitch data calculation means for calculating, from the voice data, voice volume value data which indicates a voice volume value, and pitch data which indicates a pitch. The singing characteristic analysis means compares at least one of the voice volume value data and the pitch data with the musical score data to calculate the singing characteristic parameter.
According to an exemplary feature of the first illustrative embodiment, since the singing voice is analyzed based on a musical score, the voice volume, and the pitch, the characteristic of the singing can be calculated more accurately.
In an exemplary feature of the first illustrative embodiment, the singing characteristic analysis means calculates the singing characteristic parameter based on an output value of a frequency component for a predetermined period from the voice volume value data.
In an exemplary feature of the first illustrative embodiment, the singing characteristic analysis means calculates the singing characteristic parameter based on a difference between a start timing of each musical note of a melody part of a musical score indicated by the musical score data and an input timing of voice based on the voice volume value data.
In an exemplary feature of the first illustrative embodiment, the singing characteristic analysis means calculates the singing characteristic parameter based on a difference between a pitch of a musical note of a musical score indicated by the musical score data and a pitch based on the pitch data.
In an exemplary feature of the first illustrative embodiment, the singing characteristic analysis means calculates the singing characteristic parameter based on an amount of change in pitch for each time unit in the pitch data.
In an exemplary feature of the first illustrative embodiment, the singing characteristic analysis means calculates, from the voice volume value data and the pitch data, the singing characteristic parameter based on a pitch having a maximum voice volume value among voices, a pitch of each of which is maintained for a predetermined time period or more.
In an exemplary feature of the first illustrative embodiment, the singing characteristic analysis means calculates a quantity of high frequency components included in the voice of the user from the voice data, and calculates the singing characteristic parameter based on a calculated result.
According to an exemplary feature of the first illustrative embodiment, it is possible to calculate the singing characteristic parameter which more accurately captures the characteristic of the singing of the user.
In another exemplary feature of the first illustrative embodiment, the music piece related information storage means stores, as the music piece related information, genre data which indicates at least a music genre. The comparison parameter storage means stores, as the comparison parameter, a parameter which indicates a musical characteristic of the music genre so as to be associated with the music genre. The selection means selects the music genre which is associated with the comparison parameter which has the high similarity with the singing characteristic parameter. The displaying means shows a name of the music genre as information based on the music piece related information.
According to an exemplary feature of the first illustrative embodiment, a music genre suitable for the characteristic of the singing of the user can be shown.
In an exemplary feature of the first illustrative embodiment, the music piece data includes musical score data for indicating musical instruments used for playing the music piece, a tempo of the music piece, and a key of the music piece. The music displaying apparatus further comprises music piece parameter calculation means for calculating, from the musical score data, the comparison parameter for each music piece. The comparison parameter storage means stores the comparison parameter calculated by the music piece parameter calculation means.
In an exemplary feature of the first illustrative embodiment, the music piece parameter calculation means calculates, from the musical score data, the comparison parameter based on a difference in pitch between two adjacent musical notes, a position of a musical note within a beat, and a total time of musical notes having lengths equal to or larger than a predetermined threshold value.
According to an exemplary feature of the first illustrative embodiment, even in the case where the user composes a music piece or where a music piece is newly obtained by downloading it from a predetermined server, the self composed music piece or the downloaded music piece is analyzed, thereby producing and storing a comparison parameter. Thus, it is possible to show whether or not even the self-composed music piece or the downloaded music piece is suitable for the characteristic of the singing of the user.
A second illustrative embodiment may have a computer-readable storage medium storing a music displaying program which causes a computer of a music displaying apparatus, which shows a music piece to a user, to function as: voice data obtaining means (S44); singing characteristic analysis means (S45); music piece related information storage means (S65); comparison parameter storage means (S47, S48); comparison means (S49), selection means (S49); and displaying means (S51). The voice data obtaining means is means for obtaining voice data concerning singing of the user. The singing characteristic analysis means is means for analyzing the voice data to calculate a plurality of singing characteristic parameters which indicate a characteristic of the singing of the user. The music piece related information storage means is means for storing music piece related information concerning a music piece. The comparison parameter storage means is means for storing a plurality of comparison parameters, which is to be compared with the plurality of singing characteristic parameters, so as to be associated with the music piece related information. The comparison means is means for comparing the plurality of singing characteristic parameters with the plurality of comparison parameters to calculate a similarity between the plurality of singing characteristic parameters and the plurality of comparison parameters. The selection means is means for selecting at least one piece of the music piece related information which is associated with a comparison parameter which has a high similarity with the singing characteristic parameter. The displaying means is means for displaying information based on the music piece related information selected by the selection means.
The second illustrative embodiment may have the same advantageous effects as those of the first illustrative embodiment.
In an exemplary feature of the second illustrative embodiment, the music piece related information storage means stores, as the music piece related information, music piece data for reproducing at least the music piece. The comparison parameter storage means stores, as the comparison parameter, a parameter which indicates a musical characteristic of the music piece so as to be associated with the music piece data. The selection means selects at least one piece of the music piece data which is associated with the comparison parameter which has the high similarity with the singing characteristic parameter. The displaying means shows information of the music piece based on the music piece data selected by the selection means.
According to an exemplary feature of the second illustrative embodiment, the same advantageous effects as those of the second aspect are obtained.
In an exemplary feature of the second illustrative embodiment, the music piece related information storage means further stores, as the music piece related information, genre data which indicates a music genre. The comparison parameter storage means further stores, as the comparison parameter, a parameter which indicates a musical characteristic of the music genre. The music displaying program further causes the computer of the music displaying apparatus to function as music piece genre similarity data storage means (S63), and voice genre similarity calculation means (S66). The music piece genre similarity data storage means is means for storing music piece genre similarity data which indicates a similarity between the music piece and the music genre. The voice genre similarity calculation means is means for calculating a similarity between the singing characteristic parameter and the music genre. The selection means selects the music piece data based on the similarity calculated by the voice genre similarity calculation means and the music piece genre similarity data stored by the music piece genre similarity data storage means.
According to an exemplary feature of the second illustrative embodiment, the same advantageous effects as those of the third aspect are obtained.
In an exemplary feature of the second illustrative embodiment, the music piece data includes musical score data for indicating musical instruments used for playing the music piece, a tempo of the music piece, and a key of the music piece. The music displaying program further causes the computer of the music displaying apparatus to function as music piece genre similarity calculation means (S4) for calculating a similarity between the music piece and the music genre based on the musical instruments, the tempo and the key which are included in the musical score data.
According to an exemplary feature of the second illustrative embodiment, the same advantageous effects as those of the fourth aspect are obtained.
In an exemplary feature of the second illustrative embodiment, each of the plurality of singing characteristic parameters and the plurality of comparison parameters includes a value obtained by evaluating one of accuracy of pitch concerning the singing of the user, variation in pitch, a periodical input of voice, and a singing range.
According to an exemplary feature of the second illustrative embodiment, the same advantageous effects as those of the fifth aspect are obtained.
In an exemplary feature of the second illustrative embodiment, the music piece data includes musical score data which indicates musical instruments used for the music piece, a tempo of the music piece, a key of the music piece, a plurality of musical notes which constitute the music piece. The singing characteristic analysis means includes voice volume/pitch data calculation means for calculating, from the voice data, voice volume value data which indicates a voice volume value, and pitch data which indicates a pitch. The singing characteristic analysis means compares at least one of the voice volume value data and the pitch data with the musical score data to calculate the singing characteristic parameter.
According to an exemplary feature of the second illustrative embodiment, the same advantageous effects as those of the sixth aspect are obtained.
In an exemplary feature of the second illustrative embodiment, the singing characteristic analysis means calculates the singing characteristic parameter based on an output value of a frequency component for a predetermined period from the voice volume value data.
In an exemplary feature of the second illustrative embodiment, the singing characteristic analysis means calculates the singing characteristic parameter based on a difference between a start timing of each musical note of a melody part of a musical score indicated by the musical score data and an input timing of voice based on the voice volume value data.
In an exemplary feature of the second illustrative embodiment, the singing characteristic analysis means calculates the singing characteristic parameter based on a difference between a pitch of a musical note of a musical score indicated by the musical score data and a pitch based on the pitch data.
In an exemplary feature of the second illustrative embodiment, the singing characteristic analysis means calculates the singing characteristic parameter based on an amount of change in pitch for each time unit in the pitch data.
In an exemplary feature of the second illustrative embodiment, the singing characteristic analysis means calculates, from the voice volume value data and the pitch data, the singing characteristic parameter based on a pitch having a maximum voice volume value among voices, a pitch of each of which is maintained for a predetermined time period or more.
In an exemplary feature of the second illustrative embodiment, the singing characteristic analysis means calculates a quantity of high-frequency components included in the voice of the user from the voice data, and calculates the singing characteristic parameter based on a calculated result.
In an exemplary feature of the second illustrative embodiment, the music piece related information storage means stores, as the music piece related information, genre data which indicates at least a music genre. The comparison parameter storage means stores, as the comparison parameter, a parameter which indicates a musical characteristic of the music genre so as to be associated with the music genre. The selection means selects the music genre which is associated with the comparison parameter which has the high similarity with the singing characteristic parameter. The displaying means shows a name of the music genre as information based on the music piece related information.
In an exemplary feature of the second illustrative embodiment, the music piece data includes musical score data for indicating musical instruments used for playing the music piece, a tempo of the music piece, and a key of the music piece. The music displaying program further causes the computer of the music displaying apparatus to function as music piece parameter calculation means (S3) for calculating, from the musical score data, the comparison parameter for each music piece. The comparison parameter storage means stores the comparison parameter calculated by the music piece parameter calculation means.
In an exemplary feature of the second illustrative embodiment, the music piece parameter calculation means calculates, from the musical score data, the comparison parameter based on a difference in pitch between two adjacent musical notes, a position of a musical note within a beat, and a total time of musical notes having lengths equal to or larger than a predetermined threshold value.
According to the second illustrative embodiment, a music piece and a music genre, which are suitable for a singing characteristic of the singing person, can be shown.
These and other features and advantages may be better and more completely understood by referring to the following detailed description of the drawing, of which:
The upper housing 13a is formed with sound release holes 18a and 18b for releasing sound from a later-described pair of loudspeakers (30a and 30b in
The upper housing 13a and the lower housing 13b are connected to each other by a hinge section so as to be opened or closed, and the hinge section is formed with a microphone hole 33.
The lower housing 13b is provided with, as input devices, a cross switch 14a, a start switch 14b, a select switch 14c, an A button 14d, a B button 14e, an X button 14f, and a Y button 14g. In addition, a touch panel 15 is provided on a screen of the second LCD 12 as another input device. The lower housing 13b is further provided with a power switch 19, and insertion openings for storing a memory card 17 and a stick 16.
The touch panel 15 is of a resistive film type. However, the touch panel 15 may be of any other type. The touch panel 15 can be operated by a finger as well as the stick 16. In the illustrative embodiment, the touch panel 15 having a resolution of 256 dots×192 dots (detection accuracy) as same as the second LCD 12 is used. However, resolutions of the touch panel 15 and the second LCD 12 do not necessarily be the same.
The memory card 17 is a storage medium storing a game program, and inserted through the insertion opening provided at the lower housing 13b in a removable manner.
With reference to
In
To the first GPU 26 is connected a first VRAM (Video RAM) 28, and to the second GPU 27 is connected a second VRAM 29. In accordance with an instruction from the CPU core 21, the first GPU generates a first game image based on the image data which is stored in the RAM 24 for generating a game image, and writes images into the first VRAM 28. The second GPU 27 also follows an instruction from the CPU core 21 to generate a second game image, and writes images into the second VRAM 29. The first VRAM 28 and the second VRAM 29 are connected to the LCD controller 31.
The LCD controller 31 includes a register 32. The register 32 stores a value of either 0 or 1 in accordance with an instruction from the CPU core 21. When the value of the register 32 is 0, the LCD controller 31 outputs to the first LCD 11 the first game image which has been written into the VRAM 28, and outputs to the second LCD 12 the second game image which has been written into the second VRAM 29. When the value of the register 32 is 1, the first game image which has been written into the first VRAM 28 is outputted to the LCD 12, and the second game image which has been written into the second VRAM 29 is outputted to the first LCD 11.
The wireless communication section 35 has a function of transmitting or receiving data used in a game process, and other data to or from a wireless communication section of another game apparatus.
It will be appreciated that other devices provided with a press-type touch panel that are supported by a housing may be used. Other devices may include, for example, a hand-held game apparatus, a controller of a stationery game apparatus, and a PDA (Personal Digital Assistant). Further, an input device in which a display is not provided under a touch panel may be utilized.
With reference to
First, the karaoke game is started up, and a menu of “karaoke” is selected from an initial menu (not shown) to display a karaoke menu screen as shown in
More specifically, when the player selects the “diagnosis” from the menus in
Then, the singing voice parameter and a music piece parameter stored in advance in the memory card 17 (which is read in the RAM 24 when the game processing is executed) are compared with each other. Here, the music piece parameter is generated in advance by analyzing music piece data. The music piece parameter indicates not only a characteristic of a music piece but also which singing voice parameter of a singing voice the music piece is suitable for. Thus, as a tendency of a value of the singing voice parameter is more similar to that of the music piece parameter, the music piece is determined to be more suitable for the singing voice. Such a similarity is determined, and a music piece suitable for the singing voice (a singing way, a characteristic of singing) of the player is searched for. In the illustrative embodiment, Pearson's product-moment correlation coefficient is used for determining a similarity. The search result is displayed as a “recommended music piece”. Further, in the illustrative embodiment, a music genre suitable for the singing way of the player (a recommended genre) is also displayed. As a result, when the player finishes singing the music piece, for example, phrases, “A genre suitable for you is OOOO. A recommended music piece is ΔΔΔΔ” are displayed.
As described above, in the game of the illustrative embodiment, the player sings during the “diagnosis”, and the processing of displaying a music piece and a music genre, which are suitable for the singing voice of the player, is executed.
The following will describe the outline of the above music displaying processing.
In the illustrative embodiment, the memory card 17, which stores contents corresponding to music piece data (D2), music piece analysis data (D3), and a music piece genre correlation list (D4) in
More specifically, in the music piece analysis (P2), musical score data in the music piece data (D2) is inputted for performing later-described analysis processing. As an analysis result, the music piece analysis data (D3) and the music piece genre correlation list (D4) are outputted. In the music piece analysis data is stored a music piece parameter which indicates a musical interval sense, a rhythm, a vibrato, and the like of an analyzed music piece. In the music piece genre correlation list is stored music piece genre correlation data which indicates a similarity between a music piece and a genre. For example, for a music piece, 80 points and 50 points are stored for a genre of “rock” and a genre of “pop”, respectively. This data will be described in detail later.
In addition, a genre master (D1) is produced in advance by a game developer, or the like, and stored in the memory card 17. The genre master is defined so as to associate a genre of a music piece used in the illustrative embodiment with a characteristic of a singing voice suitable for the genre.
The following will describe the outline of the music displaying processing which is executed when the player selects the “diagnosis” from the above menus in
Next, the singing voice analysis data (D5) and the genre master (D1) are inputted, and singing voice genre correlation analysis (P3) is performed for analyzing which music genre is suitable for a singing voice of a singing person. In this analysis, a correlation value between the inputted singing voice and a genre (a value indicating a degree of similarity) is calculated. Then, singing voice genre correlation data, which is a result of this analysis, is stored as a singing voice genre correlation list (D6).
Subsequently, singing voice music piece correlation analysis (P4) is performed. In this analysis, the music piece analysis data (D3), the music piece genre correlation list (D4), the singing voice analysis data (D5), and the singing voice genre correlation list (D6) are inputted. Then, based on these data and lists, correlation values between the singing voice of the player and music pieces stored in the game apparatus 10 are calculated. Only correlation values which are equal to or larger than a predetermined value are extracted from the calculated values to produce a nominated music piece list (D7).
Next, music piece selection processing (P5) using the nominated music piece list as an input is performed. In this processing, a music piece is selected randomly as a recommended music piece from the nominated music piece list. The selected music piece is shown as a recommended music piece to the player.
Further, type diagnosis (P6) using the singing voice genre correlation list (D6) as an input is performed. In this diagnosis, a genre having the highest correlation value is selected from the singing voice genre correlation data, and its genre name is outputted. The genre name is displayed as a result of the type diagnosis together with the recommended music piece.
As described above, in the illustrative embodiment, the musical score data is analyzed for producing data (a music piece parameter) which indicates a characteristic of a music piece. Also, the singing voice of the player is analyzed for producing data (a singing voice parameter) which indicates a characteristic of a singing way of the player.
The following will describe various data used in the illustrative embodiment. The above singing voice parameter and the music piece parameter, which are analysis results of voice and a music piece in the music displaying processing of the illustrative embodiment, will be now described. The singing voice parameter is obtained by dividing a characteristic of the singing voice into a plurality of items and quantifying each item. In the illustrative embodiment, 10 parameters shown in the table in
In
A groove 502 is a parameter obtained by evaluating whether or not an accent (a voice volume equal to or larger than a predetermined volume) occurs for each period of a half note. For example, in the case where a voice is represented by a waveform as shown in
An accent 503 is a parameter obtained, similarly as the groove 502, by observing and evaluating how frequently a voice volume (a wave of the voice volume) is changed. Different from the groove 502, the observation is performed for each period of two bars.
A strength 504 is a parameter obtained, similarly as the groove 502, by observing and evaluating how frequently a voice volume (a wave of the voice volume) is changed. Different from the groove 502, the observation is performed for each period of an eighth note.
A musical interval sense 505 is a parameter obtained by evaluating whether or not the player sings with correct pitch with respect to each musical note of a melody part of a musical score. As a number of musical notes, with respect to which the player sings with correct pitch, increases, a value of the musical interval sense 505 becomes large.
A rhythm 506 is a parameter obtained by evaluating whether or not the player sings in a rhythm which matches a timing of each musical note of a musical score. When the player sings correctly at a start timing of each musical note, a value of the rhythm 506 becomes large. In other words, as a voice volume equal to or larger than a predetermined value is inputted at a start timing of a musical timing, the value of the rhythm 506 becomes large.
A vibrato 507 is a parameter obtained by evaluating how frequently a vibrato occurs during singing. As a total time, for which a vibrato occurs until singing of a music piece is finished, is longer, a value of the vibrato 507 becomes large.
A roll (kobushi which is a Japanese term) 508 is a parameter obtained by evaluating how frequently a roll occurs during singing. When a voice changes from a low pitch to a correct pitch within a constant time period from the beginning of singing (from a start timing of a musical note), a value of the roll 508 becomes large.
A singing range 509 is a parameter obtained by evaluating a pitch which the player is best at. In other words, the singing range 509 is a parameter obtained by evaluating a pitch of a voice. As a pitch with which the player sings with the greatest voice volume is higher, a value of the singing range 509 becomes large. The pitch with which the player sings with the greatest voice volume is used because it is considered that the player can output a loud voice with a pitch which the player is good at.
A voice quality 510 is a parameter obtained by evaluating a brightness of a voice (whether or not the voice is a carrying voice or an inward voice). The parameter is calculated from data of a voice spectrum. When a voice has more high-frequency components, a value of the voice quantity 10 becomes large.
The following will describe the music piece parameter. The music piece parameter is a parameter obtained by analyzing the musical score data, and quantifying each item which indicates a characteristic of a music piece. The music piece parameter is to be compared with the singing voice parameter for each item. The music piece parameter implies that “this music piece is suitable for a person with a singing voice having such a singing voice parameter”. In the illustrative embodiment, 5 parameters shown in the table in
In
A rhythm 602 is a parameter obtained by evaluating a rhythm of a music piece and ease of singing the music piece.
A vibrato 603 is a parameter obtained by evaluating ease of putting vibratos in a music piece.
A roll 604 is a parameter obtained by evaluating ease of putting rolls in a music piece.
A voice quality 605 is a parameter obtained by evaluating which voice quality of a person a music piece is suitable for.
The above parameters are calculated from the voice of the player and the musical score data of the music piece. In the illustrative embodiment, processing is performed so that as a similarity between the singing voice parameter and the music piece parameter is higher, the music piece may be determined to be more suitable for the singing voice of the player, and shown as a recommended music piece.
The following will describe data which is stored in the RAM 24 when the game processing is executed.
In the program storage area 241 is stored a game program executed by the CPU core 21. The game program includes a main processing program 242, a singing voice analysis program 243, a recommended music piece search program 244, a type diagnosis program 245, and the like.
The main processing program 242 is a program corresponding to processing of a later-described flow chart in
In the data storage area 246 are stored data such as a genre master 247, music piece data 248, music piece analysis data 249, a music piece genre correlation list 250, sound data 251.
The genre master 247 is data corresponding to the genre master D1 shown in
Referring back to
Referring back to
Referring back to
Referring back to
In the work area 252 various data is stored which is used temporarily in the game processing. More specifically, work area 252 stores the singing voice analysis data 253, a singing voice genre correlation list 254, an intermediate nominee list 255, a nominated music piece list 256, a recommended music piece 257, a type diagnosis result 258, and the like.
The singing voice analysis data 253 is data produced as a result of executing analysis processing for the singing voice of the player. The singing voice analysis data 253 corresponds to the singing voice analysis data D5 in
The singing voice genre correlation list 254 is data corresponding to the singing voice genre correlation list D6 in
The intermediate nominee list 255 is data used during processing for searching for music pieces, which may be nominated as a recommended music piece to be shown to the player.
The nominated music piece list 256 is data concerning music pieces nominated for a recommended music piece to be shown to the player. The nominated music piece list 256 is produced by extracting, from the intermediate nominee list 255, data having correlation values 2552 equal to or larger than a predetermined value.
The recommended music piece 257 stores a music piece number of a “recommended music piece” which is a result of later-described recommended music piece search processing.
The type diagnosis result 258 stores a music genre name which is a result of later-described type diagnosis processing.
With reference to
Next, at a step S2, data of a musical instrument, a tempo, and musical notes of a melody part are obtained from the read musical score data 2483.
Next, at a step S3, processing is executed for analyzing data obtained from the above musical score data 2483 to calculate an evaluation value of each item of the music piece parameter shown in
Concerning an evaluation value of the musical interval sense 601, processing is executed for evaluating a change in musical intervals, which occurs in a musical score, to calculate the evaluation value. More specifically, the following processing is executed.
A difficulty value is set to a musical interval between any two adjacent musical notes. For example, in the case where a musical interval between two adjacent musical notes is large, it is difficult to change pitch during singing as indicated by a musical score, and thus a high difficulty value is set thereto.
Next, an occurrence probability of each musical interval in the melody part is calculated. Then, an occurrence difficulty value is calculated for each musical interval by using the following equation:
occurrence difficulty value=occurrence probability×difficulty value of musical interval.
Next, the occurrence difficulty value of each musical interval is totaled to calculate a total difficulty value. Then, an evaluation value is calculated by using the following equation:
evaluation value=total difficulty value×α.
Here, α is a predetermined coefficient (it is the same below). The evaluation value is stored as an evaluation value of the musical interval sense 601.
Concerning an evaluation value of the rhythm 602, the following processing is executed to calculate the evaluation value. One beat (a length of a quarter note) is equally divided into twelve parts, and a difficulty value is set to each position or each of the twelve parts within the beat.
Next, an occurrence probability of a musical note of the melody part at each position within the beat is calculated. In addition, for each position within the beat, a value (a within-beat difficulty value) is calculated by multiplying the occurrence probability by the difficulty value which is set to the position within the beat. Further, the calculated within-beat difficulty values are totalized to calculate a within-beat difficulty total value. Then, an evaluation value is calculated by using the following equation:
evaluation value=within-beat difficulty total value×α.
The evaluation value is stored as an evaluation value of the rhythm 602.
An evaluation value of the vibrato 603 is calculated as follows. Sound production times of musical notes of the melody part, which have time lengths equal to or longer than 0.55 seconds, are totalized to calculate a sound production time total value. The musical note having the time length equal to or longer than 0.55 seconds is considered to be suitable for a vibrato, and an evaluation value of the vibrato 603 is calculated by using the following equation:
evaluation value=sound production time total value×α.
The evaluation value is stored as an evaluation value of the vibrato 603.
The following processing is executed to calculate an evaluation value of the roll 604. Similarly as the musical interval sense, a unit which sets a semitone as 1 is used, and a value (a musical interval value) is set to a musical interval between any two adjacent musical notes. A higher numerical value is set to a larger musical interval.
Next, an occurrence probability of each musical interval in the melody part is calculated. For each musical interval, a musical interval occurrence value is calculated by using the following equation:
musical interval occurrence value=occurrence probability×musical interval value of each musical interval.
Next, the calculated musical interval occurrence value of each musical interval is totalized to calculate a total musical interval occurrence value. An evaluation value is calculated by using the following equation:
evaluation value=total musical interval occurrence value×α.
Further, an average of this evaluation value and the evaluation value of the vibrato 603 is calculated, and the calculated average value is stored as an evaluation value of the roll 604.
Next, an evaluation value of the voice quality 605 is calculated as follows. A value corresponding to a voice quality (a voice quality value) is set for each musical instrument used for a music piece.
Next, based on the above voice quality values, the voice quality value for each musical instrument used for the music piece is totaled to calculate a total voice quality value. Then, an evaluation value is calculated by using the following equation:
evaluation value=total voice quality value×α.
The evaluation value is stored as an evaluation value of the voice quality 605.
The above analysis processing is executed to calculate the music piece parameter for a music piece. The music piece parameter is additionally outputted to the music piece analysis data 249 so as to be associated with the music piece which is an analyzed object.
Referring back to
Next, at a step S5, whether or not all of music pieces have been analyzed is determined. When there are music pieces which have not been analyzed yet (NO at step S5), step S1 is returned to, and a music piece parameter for the next music piece is calculated. On the other hand, when analysis of all of the music pieces has been finished (YES at step S5), the music piece analysis processing is terminated.
The following will describe production of the aforementioned music piece genre correlation list 250.
At step S11, a musical instrument tendency value is calculated. The musical instrument tendency value is used for estimating, from a type of a musical instrument used for a music piece, which genre the music piece is suitable for. In other words, the musical instrument tendency value is for taking into consideration a musical instrument which is frequently used for each genre.
In calculating the musical instrument tendency value, a tendency value, which indicates how frequently a musical instrument is used for each genre, is set for each of musical instruments used for music pieces in the illustrative embodiment.
Based on setting of such a tendency value and a type of a musical instrument used for a music piece which is a processed object, a musical instrument tendency value is calculated for each genre.
Referring back to
In calculating the tempo tendency value, a tendency value, which indicates how frequently a tempo is used for each genre, is set as shown in
Based on setting of such a tendency value and a tempo used for a music piece which is a processed object, a tempo tendency value is calculated for each genre.
Referring back to
In calculating the major/minor key tendency value, a tendency value, which indicates how frequently the minor key and the major key are used for each genre, is set as shown in
Based on setting of such a tendency value and a type of a key used for a music piece which is a processed object, a major/minor key tendency value is calculated for each genre.
Referring back to
The music piece analysis data 249 and the music piece genre correlation list 250, which are produced through the above processing, are stored together with the game program and the like in the memory card 17. When the player plays the game, the music piece analysis data 249 and the music piece genre correlation list 250 are read in the RAM 24, and used for processing as described below.
With reference to
At step S21, processing of displaying the menu shown in
Next, at step S22, a selection operation from the player is accepted. When the selection operation from the player is accepted, whether or not “training” is selected is determined at step S23.
As a result of the determination at step S23, when “training” is selected (YES at the step S23), the CPU core 21 executes karaoke processing for reproducing a karaoke music piece at step S27. It is noted that in the illustrative embodiment, since the karaoke processing is not directly relevant to the illustrative embodiments, the description thereof will be omitted.
On the other hand, as the result of the determination at step S23, when “training” is not selected (NO at the step S23), whether or not “diagnosis” is selected is determined at step S24. As a result, when “diagnosis” is selected (YES at step S24), later-described singing voice analysis processing is executed at step S26. On the other hand, when “diagnosis” is not selected (NO at step S24), whether or not “return” is selected is determined at step S25. As a result, when “return” is not selected (NO at step S25), step S21 is returned to, and the processing is repeated. When “return” is selected (YES at the step S25), the karaoke game processing of the illustrative embodiment is terminated.
The following will describe the singing voice analysis processing.
As shown in
When a music piece is selected by the player, musical score data 2483 of the selected music piece is read at the subsequent step S42.
Next, at step S43, processing of reproducing the music piece is executed based on the read musical score data 2483. At the subsequent step S44, processing of obtaining voice data (namely, a singing voice of the player) is executed. Analog-digital conversion is performed on a voice inputted to the microphone 36 thereby to produce input voice data. It is noted that in the illustrative embodiment, a sampling frequency for a voice is 4 kHz (4000 samples per second). In other words, a voice inputted for one second is divided into 4000 pieces, and quantified. Then, fast Fourier transformation is performed on the input voice data thereby to produce frequency-domain data. Based on this data, voice volume value data and pitch data of the singing voice of the player are produced. The voice volume value data is obtained by calculating an average of values obtained by squaring each value of closest 256 samples for each frame. The pitch data is obtained by detecting a pitch based on a frequency, and indicated by a numerical value (e.g. a value of 0 to 127) for each pitch.
Next, at step S45, analysis processing is executed. In this processing, the voice volume value data and the pitch data are analyzed to produce the singing voice analysis data 253. Each singing voice parameter 2532 of the singing voice analysis data 253 is calculated by executing the following processing.
With respect to “voice volume”, the following processing is executed. A constant voice volume value is set at 100 points (namely, a reference value), and a score is calculated for each frame. An average of scores from the start of a music piece to the end thereof is calculated, and stored as the “voice volume”.
Next, concerning “groove”, processing for analyzing whether or not an accent (a voice volume equal to or larger than a constant volume) occurs for each period of a half note is executed. More specifically, using the Goertzel algorithm, a frequency component for a period of a half note is observed with respect to the voice volume data of each frame. Then, a result value of the observation is multiplied by a predetermined constant number to calculate the “groove” in the range between 0 and 100 points.
Next, concerning “accent”, processing similar to the “groove” processing is executed to calculate the “accent”. However, different from the “groove”, a frequency component is observed for each period of two bars.
Next, concerning “strength”, processing similar to the “groove” is executed to calculate the “strength”. However, different from the “groove”, a frequency component is observed for each period of an eighth note.
Next, concerning “musical interval sense”, the following ratio is calculated and stored. In other words, among frames in which portions including lyrics are played, a ratio of frames, in each of which a pitch of the singing voice of the player (calculated from the above pitch data) is within a semitone higher or lower from a pitch indicated by a musical note, is calculated to obtain the “musical interval sense”.
Next, concerning “rhythm”, the following ratio is calculated and stored. Specifically, a ratio of a number of musical notes with lyrics, with respect to each of which a start timing of singing is within a constant time from a timing indicated by the musical note, and with respect to each of which a pitch of the singing voice of the player at a frame at the start timing of singing is within a semitone higher or lower from a pitch indicated by the musical note, to a number of all musical notes is calculated.
Next, “vibrato” is obtained by checking a number of times (a time) which a vibrato is put. The number of times a variation in a sound occurs for one second is checked, and a processing burden is increased if checking is performed for the whole frequencies. Thus, in the illustrative embodiment, components in three frequencies, 3 Hz, 4.5 Hz, and 6.5 Hz are checked. This is because it is generally considered to recognize (hear) that a vibrato is put if variation in a sound in the range between 3 Hz and 6.5 Hz is maintained for a certain time. Thus, the checking is performed for an upper limit (6.5 Hz), a lower limit (3 Hz), and an, intermediate value (4.5 Hz) in the above range, and hence becomes efficient. More specifically, the following processing is executed. Using the Goertzel algorithm, components of the inputted voice of the player in 3 Hz, 4.5 Hz, and 6.5 Hz are checked. The number of frames in which maximum values of the three frequency components exceed a constant threshold value is multiplied by the predetermined coefficient α, and the calculated value is stored as the “vibrato”.
Next, concerning “roll”, the following processing is executed. A frame, in which a pitch of the singing voice of the player is raised from a pitch in the last frame, is detected during a period from a position of each musical note to a time when the pitch of the singing voice of the player reaches a correct pitch (a pitch indicated by the musical note). As an evaluation score concerning the frame, points are added in accordance with a raised amount of the pitch. Then, the evaluation scores for the entire music piece are totalized to calculate a total score. Further, a value obtained by multiplying the total score by the predetermined coefficient α is stored as the “roll”.
Next, concerning “singing range”, for a diatonic scale, an average of voice volume values, with which a pitch of a singing voice is maintained for a certain time period or more, a time is calculated from the start of playing a music piece. Then, a value, which is obtained by multiplying by 4 a pitch (0 to 25) having the maximum value among values obtained by adding to the average values for one octave higher and lower from a central pitch in accordance with Gaussian distribution, is regarded as the “singing range”.
Next, concerning “voice quality”, the following processing is executed. Spectrum data as shown in
Referring back to
Next, at step S47, whether or not reproduction of the music piece has been finished is determined. When the reproduction of the music piece has not been terminated (NO at step S47), step S43 is returned to, and the processing is repeated.
On the other hand, when the reproduction of the music piece has been finished (YES at step S47), the singing voice genre correlation list 254 is produced based on the singing voice analysis data 253 and the genre master 247 at step S48. In other words, a correlation value between each singing voice parameter of the singing voice analysis data 253 and each singing voice parameter definition 2472 of the genre master 247 is calculated. In the illustrative embodiment, the correlation value is calculated by using a Pearson's product-moment correlation coefficient. The correlation coefficient is an index which indicates correlation (a degree of similarity) between two random variables, and ranges from −1 to 1. When a correlation coefficient is close to 1, two random variables have positive correlation, and a similarity therebetween is high. When a correlation coefficient is close to −1, two random variables have negative correlation, and a similarity therebetween is low. More specifically, where a data row, (x,y)={(xi,yi)}, including two pairs of numerical values is given, a correlation coefficient is obtained as follows.
It is noted that in the above equation 1,
By using the above equation 1, a correlation value with a singing voice is calculated for each genre. Based on the calculated result, the singing voice genre correlation list 254 is produced as shown in
Next, at step S49, type diagnosis processing is executed.
Referring back to
Next, at step S62, the singing voice analysis data 253 is read. In addition, at step S63, the singing voice genre correlation list 254 is read. In other words, all of the parameters concerning the singing voice (namely, an analysis result of the singing voice) are read.
Next, at step S64, the music piece parameter for one music piece is read from the music piece analysis data 249. In addition, at step S65, data corresponding to the music piece read at step S64 is read from the music piece genre correlation list 250. In other words, all of the parameters concerning the music piece (namely, an analysis result of the music piece) are read.
Next, at step S66, a correlation value between the singing voice of the player and the read music piece by using the above Pearson's product-moment correlation coefficient. More specifically, the values of the singing voice parameter (see
Next, at step S67, whether or not the correlation value calculated at step S66 is equal to or larger than a predetermined value is determined. As a result, concerning a music piece having a correlation value equal to or larger than the predetermined value (YES at the step S67), a music piece number of the music piece and the calculated correlation value are additionally stored in the nominated music piece list 256 at step S68.
Next, at step S69, whether or not the correlation values of all of the music pieces have been calculated is determined. As a result, when the calculation of the correlation values of all of the music pieces has not been finished yet (NO at the step 69), step S64 is returned to, and the processing is repeated for music pieces, the correlation values of which have not been calculated yet.
On the other hand, as the result of the determination at step S69, when the correlation values of all of the music pieces have been calculated (YES at step S69), a music piece is randomly selected from the nominated music piece list 256 at step S70. At step S71, a music piece number of the selected music piece is stored as the recommended music piece 257. It is noted that a music piece may not be randomly selected from the nominees, but a music piece having the highest correlation value may be selected therefrom. Then, the recommended music piece search processing is terminated.
Referring back to
As described above, in the illustrative embodiment, the singing voice of the player is analyzed to calculate and produce data which indicates a characteristic of the singing voice. Then, processing of calculating a similarity between data obtained by analyzing a characteristic of a music piece from the musical score data and data obtained by analyzing the characteristic of the singing voice is executed, thereby searching for and displaying a music piece suitable for the player (a singing person). This enhances the enjoyment of the karaoke game. Also, a music piece, which is easy to sing, is shown to a player who is bad at karaoke, and it is possible to provide a chance for enjoying karaoke. Further, it is possible to make a player, who has been avoiding karaoke, enjoy the karaoke game pleasantly. Therefore, it is possible to provide a karaoke game which a wide range of players can enjoy. In addition, a music genre suitable for the singing voice of the player can be shown. Thus, it is easy for the player to select a music piece suitable for his or her singing voice, and the like by making selection of a karaoke music piece focusing on the shown genre, and the enjoyment of the karaoke game is enhanced.
It has been described that the music piece analysis processing is executed prior to game play by the player (prior to shipment of the memory card 17 which is a game product). However, the illustrative embodiments are not limited thereto, and the music piece analysis processing may be executed during the game processing. For example, the game program is programmed so as to add the music piece data 248 by downloading it from a predetermined server. When a music piece is additionally stored in the game apparatus 10 by downloading it, the music piece analysis processing may be executed. Thus, the added music piece can be analyzed to produce analysis data, and a range of selection of a music piece suitable for the player can be widened. Alternatively, the game program may be programmed so that the player can compose a music piece. The music piece analysis processing may be executed with respect to the music piece composed by the player to update the music piece analysis data and the music piece genre correlation list. This enhances the enjoyment of the karaoke game.
The method of the recommended music piece search processing executed at step S50 is merely an example, and the illustrative embodiments are not limited thereto. Any method of the recommended music piece search processing may be used as long as a similarity is calculated from the music piece parameter and the singing voice parameter. For example, the following method of the recommended music piece search processing may be used.
Next, at step S92, the singing voice analysis data 253 is read. At the subsequent step S93, the music piece genre correlation list 250 is read. Further, at step S94, the singing voice genre correlation list 254 is read.
Next, at step S95, the music piece parameter for one music piece is read from the music piece analysis data 249.
Next, at step S96, a correlation value between the singing voice of the player (namely, the singing voice analysis data 253) and the music piece of the read music piece parameter is calculated by using the Pearson's product-moment correlation coefficient.
Next, at step S97, whether or not the correlation value calculated at step S96 is equal to or larger than a predetermined value is determined. As a result, concerning a music piece having a correlation value equal to or larger than the predetermined value (YES at step S97), a music piece number of the music piece and the calculated correlation value are additionally stored in the intermediate nominee list 255 at step S98.
Next, at step S99, whether or not the correlation values of all of the music pieces have been calculated is determined. As a result, when the calculation of the correlation values of all of the music pieces has not been finished yet (NO at step S99), step S95 is returned to, and the processing is repeated for music pieces, the correlation values of which have not been calculated yet.
On the other hand, as the result of the determination at step S99, when the correlation values of all of the music pieces have been calculated (YES at step S99), it means that the intermediate nominee list 255 including, for example, contents as shown in
Next, at step S101, the music piece genre correlation list 250 is referred to, and a music piece number of a music piece, in which the “suitable genre” has a correlation value equal to or larger than the predetermined value, is extracted from the intermediate nominee list 255. The music piece number is additionally stored in the nominated music piece list 256. For example, it is assumed that contents are obtained as shown in
Instead of the above methods of the recommended music piece search processing, the following method may be used. For example, a correlation value between the singing voice analysis data 253 and the music piece analysis data 249 is calculated. Next, for the contents in the singing voice genre correlation list 254, weight values are set in ascending order of the correlation values. Also, for the contents in the music piece genre correlation list 250, weight values are set in ascending order of the correlation values. Then, the correlation value between the singing voice analysis data 253 and the music piece analysis data 249 is adjusted by multiplying it by the weight value. Based on the adjusted correlation value, a recommended music piece may be selected. As described above, any method of the recommended music piece search processing may be used as long as a similarity is calculated from the music piece parameter and the singing voice parameter.
Items which are objects to be analyzed for a music piece and a singing voice, namely, the music piece parameter and the singing voice parameter are not limited to the aforementioned contents. As long as the parameter indicates each of characteristics of a music piece and a singing voice and a correlation value is calculated therefrom, any parameter may be used.
While the illustrative embodiments have been described in detail, the foregoing description and all exemplary features are not to be limited by the disclosure. It is understood that numerous other modifications and variations can be devised and that the invention is intended to be defined by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
2007-339372 | Dec 2007 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
4771671 | Hoff, Jr. | Sep 1988 | A |
7488886 | Kemp | Feb 2009 | B2 |
7605322 | Nakamura | Oct 2009 | B2 |
20070131094 | Kemp | Jun 2007 | A1 |
Number | Date | Country |
---|---|---|
A-2000-056785 | Feb 2000 | JP |
Number | Date | Country | |
---|---|---|---|
20090165633 A1 | Jul 2009 | US |