The present invention relates generally to systems and methods of organizing and managing multiple music items and, more particularly, to systems and methods for creating music playlists.
In recent years multimedia content has become increasingly available to the general public. High-performance home computers, the ever-increasing availability of broadband data-networks, efficient data compression methods, and high capacity storage media have all played a role in the increasing access to, and demand for, multimedia content. More importantly for purposes of the instant invention, the increase in demand for, and usage of, audio works has been especially noticeable.
A user's audio collection is typically stored as individual files on the hard disc of a personal computer or another data storage medium. The collection is typically administered by a special purpose program and stored in a database that has been created for that purpose. Each song typically has some minimal amount of descriptive metadata associated with it (e.g., song title, performer, style, etc.) which may or may not be complete. Such a computerized database allows a user to quickly locate and initiate the playback of a particular song. However, a database-based approach to managing a large number of music items is most useful to the user who has intimate knowledge of his or her entire music collection and is able to identify with some particularity the song or songs that are desired to be played. In most cases, standard database text search logic will be used to locate audio content (e.g., by title, album, etc.), but that approach is difficult to fine tune and the results returned tend to be either over or under inclusive.
This weakness of the database search function has led to the development of the now well-known playlist concept—i.e., a playlist is a subset of an audio database that conventionally contains a hand picked collection of music items that the user wishes to listen to sequentially, randomly, etc.
However, building playlists from a database that contains large numbers of songs is a cumbersome process. For example, the user must find and select an individual music item, select a playlist, signal that the found item(s) be included in the playlist, find and select the next item, etc.
In view of the difficulties inherent in using playlists to manage a large number of songs in a database, it is no surprise that a number of aids have been developed to streamline the process. For example, shortcuts have been developed that allow the user to insert complete directories into a selected playlist. Additionally, playlist generation routines have been devised that allow the user to specify a genre, artist, year, album, etc., and have all of the audio items in the database that match the specified criteria assembled into a playlist.
Additionally, software developers have provided increased functionality in the database management software that resides on the user's computer. The current state-of-the-art provides PC-based software packages that allow a user to create and store playlist files until they are needed. This approach however is not typically fully automated since the user must select the individual songs that comprise the playlist or specify the criteria for a database search and then store the playlist for later reproduction.
Another disadvantage of the prior art approach is that a playlist, once generated, is a static construct that represents the user's preferences or the search results as they existed at one moment in time. If a user were, for example, to want to modify a playlist, the search that created it would need to be manually entered again and/or the user would need to manually search for songs to add to the playlist. When a new song is added to an audio database, it will not be included in a playlist that would logically accommodate it unless the user remembers to add that song manually or unless a new search is conducted. Finally, none of these searches will uncover a song that has been misclassified as to genre, artist, etc., nor will a conventional search uncover songs that are sonically similar to a song that is a user favorite.
All of these approaches are user initiated processes and the task of generating a playlist is ultimately left to the user. Playlists generated via the conventional approach are, thus, either the product of a long and cumbersome manual process or they are products of automated algorithms that choose songs based on the associated audio metadata (e.g., genre, artist, year, album, etc.) Additionally all of these approaches provide the user with a static playlist, i.e., the user is provided with a playlist file that does not change during the listening process or anytime later unless it is manually updated.
An additional problem that is encountered when managing a very large audio database is that only an exceptional user will be able to remember all the individual music items stored therein. This is especially understandable in view of the fact that some of the files might have been added years earlier. Thus, amassing a large database of audio content may ultimately be unsatisfying to the user who cannot locate items that are known to be in the database or even remember specific audio content.
Thus, technological advances have allowed a user to amass an arbitrarily large number of audio files but then do not provide the tools necessary to enjoy them fully. When the user wants to create a playlist, he or she must remember each individual track or artist—an almost impossible undertaking when the user is using a database that has been accumulated over many years. Even automatic playlist generation routines only provide a partial solution to this problem because these routines only scratch the surface of a huge music database.
In recent years further advancements in playlist creation and audio data management have been made. For example, audio content may be analyzed when added to a database to determine its musical characteristic/features. These extracted musical features can then be used to generate playlists containing items that have similar calculated features. Such approaches allow the user to influence the playlist generation process on a somewhat intuitive level.
Such an approach, however, also has its drawbacks. For example, playlist/generation process is like a black box algorithm which provides the user with a new similarity playlist each time the algorithm is initiated, but leaves the user without an explanation as to why a particular music item was included. A user who is new to similarity broad playlist generation might find such an approach novel at first, ultimately the disadvantages of this approach will become apparent.
The user is even more limited in his self-initiated choices when creating a new playlist. A typical prior art approach will provide the user with music items that are musically similar to a selected one, however if the user does not like one, some or even all of the music items, the playlist generation process will have to be initiated again. Typically, the user will be limited to the selection of a single music item from which the playlist will be generated and multiple starter songs might need to be selected sequentially before a satisfactory playlist is obtained. With such an approach, the user can either accept the next music item or initiate the playlist generation process again and again. This process, although providing a fresh look at the music collection of a user, is unwieldy to use and is rapidly losing favor among end users.
Thus, what is needed is a system and method that provides a user with a quick and dynamic method of playlist generation wherein the algorithm that is used creates a playlist that is based on the musical characteristics of the music items, rather than being based on song metadata. This method should also provide advanced selection criteria for the user, which allows a user to weight the selection criteria associated with a currently playing musical work and thereby signal to the playlist generation algorithm that he or she prefers (or not) other music items containing (or not) a particular musical feature. Furthermore, it would be preferred that this playlist generation algorithm would be able to generate a playlist in a short amount of time without initiating an extensive re-analysis of the music items stored in the database.
Heretofore, as is well known in the media editing industry, there has been a need for an invention to address and solve the above-described problems. Accordingly, it should now be recognized, as was recognized by the present inventors, that there exists, and has existed for some time, a very real need for a system and method that would address and solve the above-described problems.
Before proceeding to a description of the present invention, however, it should be noted and remembered that the description of the invention which follows, together with the accompanying drawings, should not be construed as limiting the invention to the examples (or preferred embodiments) shown and described. This is so because those skilled in the art to which the invention pertains will be able to devise other forms of the invention within the ambit of the appended claims.
There is provided herein a system and method for efficient generation and management of similarity playlists from an audio database. The instant invention will allow users of such a database to initiate and shape the playlist generation process to automatically provide playlists that are based on the musical features of the files stored in the database. This will result in playlists that contain musically similar items. Furthermore, the instant invention will allow the user to select and weight the musical criteria that are used to form the playlist, thereby adapting it accordingly. In the preferred embodiment, the user will initiate the playlist generation process by selecting a music file. A similarity playlist will be automatically constructed based on the musical features of the selected work. The user will then be allowed to review the playlist and shape it as part of a parametrization step. During this step, a preference or dislike for each musical feature can potentially be indicated and the playlist generation process will then be re-initiated. The resulting new playlist will be built according to the indicated preferences of a user. Once the user is satisfied with a current similarity playlist, he or she will be given the opportunity to implement a further filtering step which will dynamically adapt the current playlist even further according to his or her preferences. The parametrization step, as well as the filtering step, will both preferably provide immediate results to the user without a new analysis of the music items stored in a database. Note that the filtering and parametrization steps can be performed in any order. One or the other or both might be performed in a given user session.
The methods taught herein will utilize a song model for each individual music item that will be stored in a song model database. The music item might be stored either with the song model or in a separate database. The song model database and the stored song model data will then be used in the similarity playlist generation process. The song model will be generated preferably on a personal computer as part of a preliminary analysis that is performed on each music item in a digital music collection. The analysis step is an integral part of the operation of the instant invention and will preferably be carried out by digital music management software that is resident on or accessible by a personal computer. The analysis step will preferably operate on the entire music database and will preferably generate a number of different attribute values for each music item contained therein. The attribute values will be stored as an audio signature for each music item. This audio signature will then be used in a model generation step resulting in a song model. The song model obtained and stored at this step will then be used as the starting point in the similarity playlist generation process.
The system and method taught herein will preferably be designed to automatically generate a similarity playlist from the music items stored in an audio database. It is generally directed to allowing users to experience the content of their music collection in a completely new way.
In brief, the instant invention will provide the user with a method of automatically generating similarity playlists from an audio database, where the playlists are created based on the musical characteristics or attributes of a selected base music item. The process will preferably be completely automatic and will utilize previously computed attribute values. Furthermore, in some preferred variations, the instant method will operate dynamically, i.e., the process will be performed repeatedly on a series of changing base music items based on previously indicated user personal preference values using this approach. The instant invention will be able to quickly generate an associated similarity playlist without high resource demands and additional analysis steps prior to generating the new similarity playlist.
The preferred method will be implemented by initially executing an analysis on each music item in the audio database. Preferably, this analysis will be carried out at the time each music item is inserted into the database. The result of this analysis is an audio signature that will be used to create a song model that describes the unique features of a music item. The steps involved with generating this song model will preferably be invisible to the user. The user will only be required to select an initial music item—the query music item that will serve as the origin from which the playlist will be formed.
To adapt the similarity playlist to the user's preferences, the preferred system will provide one of the following mechanisms or a combination of them. The user will be able to indicate that certain musical characteristics, so-called classification parameters of the query music item are of greater importance than others. For instance, if the query music item has a slow tempo, it might be important to the user that the music items returned by the instant method should also preferably have a tempo similar to that of the query music item. Note that in the preferred embodiment, this will not be an on/off setting, but a level setting which causes the internal rules engine to place a higher weight on the similarity of a particular musical classification parameter or parameters.
Another preferred embodiment will allow the user to select a song from the similarity list and indicate that he or she likes or dislikes this recommendation in that particular context. The system will then modify the internal rules engine and generate another similarity list according to that preference. In another preferred embodiment, the user will be able to set filters by defining criteria which a song has to meet in order to be even considered as a part of the similarity list. For instance, if the user sets a filter to exclude slow tempo, songs that do have a slow tempo will not be considered at all.
In another preferred embodiment the method will utilize an internal policy wherein all the musical features of a currently selected music item have a pre-defined default weighting which represents the importance of each feature for general musical similarity. By signaling the personal preference for a particular musical feature, a classification parameter e.g., if the user increases the user control for the tempo of music items, the instant invention will adjust its rules by raising the importance of that classification parameter. This will then lead to a new similarity playlist that contains more items with a tempo that is close to that of the currently selected music item. Preferably, the selection and weighting of a particular musical feature, classification parameter of a current music item will be used in a relative manner by the instant invention. Note that a user-initiated increase in the importance of the “tempo” matching parameter does not mean that the user desires music items with a higher tempo in his or her playlist. Such a selection preferably indicates that the user prefers more music items in the playlist that have a similar tempo as that of the current music item. The user will preferably be able to accept or reject the proposed similarity playlist or, provisionally accept it and listen to some or all of the included songs, before deciding whether or not to keep it.
According to a preferred embodiment, the instant invention preferably begins with the activation of the digital media management program. This program will preferably be stored on a personal computer that has access to the digital audio content of the user. A first aspect of the instant invention will preferably be carried out on a user's personal computer and will utilize the user's music database to generate a song model for each music item stored in that database. The user will preferably be presented with a graphical user interface that provides access to the functionality of a media management program, e.g. sorting, storing, and archiving digital music files.
The media management program will preferably allow the user to import new music items into an existing media collection. When a new music item is imported, the media management program will preferably automatically assign an ID to each imported music item—the ID being intended to provide a better localization of the music items within the database. Thus, in the preferred embodiment, each music item in the database will have a unique ID.
In the next preferred step, the instant invention will perform an analysis of the imported digital music items, the purpose of the analysis being to determine their musical characteristics. This analysis and the steps associated therewith provide the foundation for the similarity playlist generation and adaptation process. The analysis will preferably begin by generating an audio signature of a base music item. This audio signature is preferably primarily a data structure that contains low level, mid level and high level features extracted from a music item. The higher in the feature hierarchy of low, mid and high a feature is, the more likely it will be associated with some directly perceptible characteristic of the music. For example, the low level features tend to represent characteristics of the source file. These features will preferably be obtained by band separating the audio content and by saving a value per band at predetermined time intervals, for example every 10 ms or 30 ms. Additionally this process might be repeated throughout the music item. Thus, in the preferred embodiment, the instant invention creates a vector for every unit of time. The low level features might be selected from the well known MPEG-7 standard, wherein there are seventeen temporal and spectral parameters specified that can be used to describe audio. These parameters are divided into the following groups:
In a next preferred step the instant invention will further calculate the mid level features. The mid level features will preferably be derived from the low level features. For example LogLoudness is a low level parameter which measures the loudness progression of audio material. An example of a mid-level parameter related or derived from the LogLoudness parameter would be a parameter that is based on statistical calculations of the LogLoudness. The mid-level parameters also do not have an intuitive meaning to a user.
The high level parameters will also preferably be derived from previously generated parameters preferably the mid-level features. The high level parameters, however, will tend to be meaningful to a user. For example a high level feature might be the distribution of the notes or the rhythmical course over the whole music item. The totality of the low level, mid level and high level features that describe a song will be referred to as the audio signature. The audio signature will tend to contain most of the temporal information of the music item and, therefore, the audio signature file itself may be quite large.
To reduce the size of the audio signature a number of further calculations will preferably be initiated. These calculations will preferably use the audio signature data as starting point. One calculation or set of sequential calculations is called adaptation, where the measured values are temporally summed. That is, in a preferred embodiment, the feature values that have been calculated for multiple bands are combined into a single representative value by, for example, calculating their mean, variance, median, mode, or some other single value. As a result, the instant invention generates holistic models of the songs in the database which are then used in the formation of similarity playlists.
The second preferred calculation is the aspect classification. Aspects are attributes of the audio content that describe its musical characteristics. These aspects are divided into those that are easily comprehensible by a novice user and features that describe the deeper musical characteristics of the audio content. Aspects that are more accessible and comprehensible to the user might include, for example, Instrument Density, Solo Instrument, Mood, Music Color, Percussiveness, Singing (vocals), Style, and Tempo. To each aspect a number of different classes might be connected. For example, the aspect Instrument Density might take the values of sparse and full. Mood might have classes such as aggressive, calm, danceable, happy, and melancholic. In the preferred embodiment, the aspect classification will produce a histogram which illustrates, per temporal unit, to which class the currently analyzed music item belongs to. The histogram will preferably show, over the running time of the music item, that, for example, 60% of the content item contains vocals and 40% does not.
The classification step will preferably assign probabilities within each time intervals of the songs, each probability representing the likelihood that the song belongs to an associated song class. Preferably, the song classes will be defined in terms of calculated (or assigned) musical characteristics which have been obtained via an artificial intelligence methodology.
The result of the aspect classification step will preferably be a collection of values that represent the probability that the song belongs to each of several different classes. The aspect profile and the transformed values (from the adaptation step) will then preferably be stored in a song model which is a compacted version of the data calculated from the song itself, e.g., it might contain histograms and other summary statistics.
This analysis will preferably be carried out for every item stored in the audio database and the song model will then preferably be stored together with the IDs of the audio items in a separate song model database or together with the music item in the audio content database.
In a next preferred step the user will select an initial music item—the base music item. The instant invention will then determine if a song model corresponding to the selected music item has been stored previously. If not (e.g., if this is the initial run of the program or the selected music item is new), the above described analysis steps will be implemented. However, in the preferred embodiment the selected music item and the corresponding song model data will already be a part of both databases—the song model database and the content database.
As a next preferred step the instant invention will generate an initial similarity playlist and present it to the user. The instant invention will preferably access the song model database and the song model data. The song model data of the currently selected music item will be used to calculate a distance from the selected song to the other songs in the database. Although it is preferable that a Euclidean (or other) distance be calculated, it is also possible that a playlist might be assembled based on song ontology, wherein the individual features of each song are considered in the generation process and any individual weighting of such features is incorporated in the process of selecting new music items. Music items whose song model data is similar to or is lying within an acceptable range of the characterizing values of the base music item will preferably be selected and inserted into the initial similarity playlist.
Preferably, the music items that are determined to be in the similarity playlist will be sorted according to a percentage of similarity to the base music item. The generation step, relying as it preferably does on precomputed values, will be able to quickly return a playlist from even a very large database. Additionally, the instant invention will preferably provide the user with a graphical user interface that contains the information about the base music item, and the contents of the current similarity playlist together with a value that illustrates the degree of similarity between each playlist item and the base music item.
In a next preferred step, i.e.,—the parameterization step, the user will signal to the instant invention which features/classification parameters of the currently selected music item he or she prefers or dislikes by setting a personal preference value for each classification parameter. These preference values and the classification parameter values will be used to further refine the playlist by generating a target selection model containing the user preference value, the classification parameters and the classification parameter values. The user will preferably be able to indicate this to the instant invention by interacting with the graphical user interface. The musical feature section will preferably allow the user to communicate his desire to, for example, include more items that have approximately the same or similar tempo as the base music item. The user might also elect to include more (or fewer) items with female singing, if the base music item contains female vocals. The modified personal preferences will preferably be used to produce a new parameterized similarity playlist that contains more items of the sort preferred by the user. The classification step will, in an embodiment be initiated with the results preferably being displayed dynamically as the user modifies the selection from criteria.
The actual search for similar items will preferably proceed as follows. The instant invention will begin by reading the personal preference values which will preferably take the form of a user modifiable representation of the classification parameters of the song model that has been calculated from the initial music item. Next, a search query will preferably be created and used to retrieve song models from the song model database. A distance measure (or degree or similarity, etc.) will preferably be calculated from each song model in the database to the current song model. The user will be able to control the search preferably through the use of a specialized graphical user interface designed for that purpose. The initial similarity playlist will preferably be created using all of the musical features of the base music item as starting point, with the features all given equal weight. Subsequent searches will preferably use the weighting scheme or preferences specified by the user.
As a next preferred step the instant invention will preferably create a new parameterized similarity playlist and present it to the user. Preferably, the items in this list will all be, probabilistically speaking, similar to the base music item according to the preferences specified by the user. The distance measurement will preferably be determined automatically from the song model as has been described previously.
In addition to the parameterization step, the user will also preferably be able to refine the similarity playlist by initiating an optional filtering step. The filtering process will allow the user to refine the similarity playlist even further. The user will preferably be provided with a graphical user interface that allows specification of user preferences regarding both the metadata associated with the base music item and regarding the song model data extracted from the same item. For example, the user will preferably be able to indicate that he or she prefers only songs from the artist that recorded the base music item, or only songs that exhibit a happy mood, or only songs from a particular release year, etc. In general, the filtering step allows the user to further refine the latest similarity playlist produced by the instant invention. The filtering step will preferably not initiate a new classification computation but instead it will use the existing calculation results.
The user will preferably be given some additional degree of control over the similarity playlist generated according to the instant method. For example, the user might be allowed to specify the length of the similarity playlist and whether or not the similarity playlist is to be updated/renewed as new music items are added to the music content database. The user will preferably be allowed to change the musical feature, parameter preferences used in the similarity playlist generation at any time. Furthermore, the instant invention will preferably automatically update the selectable musical features displayed in the graphical user interface depending on the currently selected music item.
It should be clear that an approach such as this would be a tremendous aid to the user who wants to enhance enjoyment of the items stored in an audio database by automatically generating similarity playlists that contain music that matches the feature preferences of a base music item and of the user. Additionally, since the music items will preferably be analyzed only once (e.g., at the time they are added to the audio database), the instant invention can be implemented with even very modest computing power. Thus, similarity playlist generation, parameterization, and filtering can potentially be implemented in virtually any computing device, either with the song model database stored locally or connectible through a network connection for example via W-LAN.
The foregoing has outlined in broad terms the more important features of the invention disclosed herein so that the detailed description that follows may be more clearly understood, and so that the contribution of the instant inventors to the art may be better appreciated. The instant invention is not limited in its application to the details of the construction and to the arrangements of the components set forth in the following description or illustrated in the drawings. Rather the invention is capable of other embodiments and of being practiced and carried out in various other ways not specifically enumerated herein. Additionally, the disclosure that follows is intended to apply to all alternatives, modifications and equivalents as may be included within the spirit and the scope of the invention as defined by the appended claims. Further, it should be understood that the phraseology and terminology employed herein are for the purpose of description and should not be regarded as limiting, unless the specification specifically so limits the invention.
Other objects and advantages of the invention will become apparent upon reading the following detailed description and upon reference to the drawings in which:
Referring now to the drawings, wherein the reference numerals indicate the same parts throughout the several views, there is provided a system and method for efficient maintenance of a large number of music items, via the creation of similarity playlists. The instant invention preferably utilizes a playlist generation method that improves on the prior art methods by utilizing the musical characteristics of the music items. The instant invention will preferably enable the user to generate a similarity playlist without requiring excessive computing power. Furthermore, the instant invention will preferably allow the user to parameterize the content of a similarity playlist by providing preferences (or dislikes) regarding individual musical features of the music item. The parameterized playlist will preferably be generated in real time and the instant invention will furthermore allow the user to perform this parameterization step repeatedly. The instant invention will continually provide playlists as the user modifies the search criteria. The user will also preferably be able to initiate a filtering step to further refine a parameterized or initial similarity playlist.
As is generally indicated in
Additionally, it is possible that an external camera 110 of some sort will be utilized with and will preferably be connectible to the computer so that video and/or graphic information can be transferred to and from the computer. Preferably the camera 110 will be a digital video camera, although that is not a requirement, as it is contemplated that the user might wish to utilize still images from a digital still camera in the creation of his or her multimedia work. Further, given the modern trend toward incorporation of cameras into other electronic components (e.g., in handheld computers, telephones, laptops, etc.) those of ordinary skill in the art will recognize that the camera might be integrated into the computer or some other electronic device and, thus, might not be a traditional single-purposes video or still camera. Although the camera will preferably be digital in nature, any sort of camera might be used, provided that the proper interfacing hardware between it and the computer is utilized. Additionally, a microphone 130 might be utilized so that the user can add voice-over narration to a multimedia work and a CD or DVD reader/burner 120 could be useful for storing user data/content on optical disk.
Additionally, a portable music device 140 such as an MP3 player or other portable device capable of playing digital music will preferably be connected or connectible to the computer to allow the user to exchange digital music items between the computer and the portable device. In some instances, the portable music device 140 will have an integrated loudspeaker through which digital music items may be performed for a listener. In other instances, the device 140 will likely have at least an electronic connection which is designed to receive a connector for a pair of headphones (e.g., a headphone jack), or an external or internal wireless module (e.g., Bluetooth or WiFi), or a stereo or other connector that is suitable for electronic communication with an external amplifier and/or speaker, etc.
Turning next to
After the instant invention has extracted the low level features and created the vector of data values, an audio signature 305 will preferably be generated. To generate this signature, the instant invention will preferably calculate mid level features, which are typically derived from the low level features. For example, LogLoudness is a low level feature which measures the loudness progression of audio material. An example of a mid-level feature that might be derived from LogLoudness would be a feature that is obtained by applying a statistical calculation to the series (e.g., a mean, running average, etc.). The mid-level features will tend to have more relevance musically than the low level features.
In a next step, the instant invention will preferably generate high-level features which will also be derived from previously generated or extracted data. For example, the mid-level features (and/or low-level features) might be used to form the high-level feature set. The low-level features, the mid-level features and the high-level features will preferably be chosen to describe in some sense a music item in mathematical terms, the totality of which will be referred to as the audio signature hereinafter. Since the audio signature will preferably contain much of the temporal information about the music item (e.g., the low level features), the audio signature will tend to be rather large.
In order to reduce the size of the audio signature, it is preferred that this data be compressed in such a way as to keep as much information about the song as possible. After the signature is composed, the instant invention will initiate a model generation step 310. This step uses the data in the audio signature to generate a song model 315. The song model generation will be described in more detail in connection with
The song model database 220 will preferably contain all of the song models 315 for each item in the audio database. The song model database and the audio content database are preferably related by an ID value which is unique to each music item and its song model.
In some preferred embodiments the instant invention will begin with a query/base music item 335, where the base music item is a song that has been selected by the user and provides a starting point for the assembly of a similarity playlist. For purposes of specificity in the text that follows, it will be assumed that the query/base music item will already have been analyzed and its song model is already stored in the song model database.
The instant invention will next preferably access the song model database 325 and locate and read the song model that corresponds to the query song. Next, the instant invention will preferably calculate a distance measurement 340 using the song model data from the query music item and each of the other music items in the database. Since the song model data will preferably have been previously compressed, the distance measurement will likely be calculated very quickly to produce result list 345. The result list 345 represents a listing of each item in the audio database together with a numerical or other value that describes the degree to which that item is similar to the query item. Next, the instant invention will preferably perform an aggregation step 350 which might involve sorting the results list based on similarity, with the songs most likely to be similar being listed first. The instant invention will then preferably select a predetermined number of music items from that sorted list that are most similar to the query song. These selected items are the initial similarity playlist 355.
Turning next to
Turning now to
Each interaction of the user with the GUI will preferably result in a new classification 400 being initiated. Each new classification will preferably generate a new similarity list.
In another embodiment the instant invention will utilize a weighted similarity in the computation of the playlist. Initially, it is preferred that the weights (which correspond to musical features) be equal initially. Then, the user will be given the option of varying the weights to shape the resulting playlist. By signaling the preference for a particular musical feature (e.g., the user might elect to increase the weight associated with the “tempo” classification parameter), songs in the database that have a classification parameter value close to that of the query music item will be given higher priority in the formation of the playlist. Note that an increase in the importance of “tempo” does not mean that the user desires music items with a higher (faster) tempo, but rather that such a selection signals to the instant invention that the user prefers more music items with a similar tempo as that of the query item in the playlist.
It should be noted that the similarity playlist is a listing of the associated music items and not necessarily a compilation of the associated music files. The listing will preferably provide information concerning where the music items are stored.
Turning next to
In some preferred embodiments the probabilities will be calculated using statistical discriminate analysis (continuous and/or discrete), wherein linear combinations of features (e.g., tempo, center frequency, instruments present, vocals present (“YES”/“NO”), etc.) are used to separate multivariate data points into different groups. In other instances, centroid-based distance calculations could be used to determine the distance (and associated probability) between the characteristics measured inside each window and the centroids of the different groups. Those of ordinary skill in the art will readily be able to devise alternative measures.
However they might be calculated, the probabilities so obtained represent the likelihood that a given music work belongs to a particular classification (or grouping, etc.) of songs (e.g., rock, jazz, etc.). The probability will preferably be calculated by comparing the data values that have been extracted from a particular music item with corresponding data items that are characteristic of each group, which might be predetermined by the instant inventors (or others) or calculated previously (or on the fly) from songs in each classification.
The adaptation step 720 is the second process that might be applied to the audio signature. In this step, the data values will then be transformed into summary statistical quantities such as variance, mean, median, etc., which may or may not be normalized. As a result, the adaptation step provides a holistic model 730 for each music file. The aspect profile 710 and the holistic model 730 of a music item will then preferably be stored in the song model 315 of that particular music item, the song model being a storage container for the data describing the compressed music item. The song model will then be stored in the song model database 220 with an associated unique ID.
Turning next to
The modified weights for each classification parameter and the classification parameter value form a target selection model that will then be used in a recomputation of the distances of each music item from the selected music item, thereby revising the playlist (step 825). The user will preferably be able to repeat the above-described process any number of times. Additionally, the user may choose to select a different base music item during this process and have an alternative playlist be determined.
Turning now to
In the next preferred step, the instant invention will preferably access the song model associated with the currently selected music item and extract some or all of its musical features, the classification parameters and the associated data values. In the next preferred step, the instant invention will compare the data values for each of the classification parameters with a pre-defined threshold value 920, a goal of this step being to present the user with the musical features that are most relevant to the music item.
As a next preferred step the instant invention will select those classification parameters that exceed the threshold value 930. In the next preferred step the instant invention will integrate the parameters that exceed the threshold into the graphical user interface to be potentially modified by the user during the filtering step 940. Additionally, the instant invention also will preferably extract metadata from the music item and presents that metadata in the graphical user interface as well.
Turning next to
Given a curve like that in
Coming next to
In particular example of
In another section of the graphical user interface, the user will preferably be presented with the similarity playlist 1140. The similarity playlist displays the music items that have properties that most closely match the preferences selection of the user. The target selection model for this example (i.e. the model against which music items in the database will be compared in forming the similarity value) is comprised of the five classification parameters (“TEMPO”, “SINGING”, “FEMALE VOICE”, etc.) and their associated parameter values in the base music item 1110 (e.g., TEMPO=100 BPM, SINGING=“YES”, FEMALE VOICE=“NO”, etc.) as weighted by the preferences of the user as measured by the positions of the sliders 1134,
In this figure, the user-shaped playlist 1142 is displayed, with each song in the playlist 1142 being accompanied by a percentage value (0%-100%) that represents the calculated similarity 1144 between that song and the base song or, more generally, the similarity between each song and the current selection criteria as weighted by the user's preferences. The similarity playlist 1142 will preferably be generated and updated on the fly as the user manipulates the controls in the parameter section 1130. If the user is satisfied with the current playlist and wants to store it for a later time he or she will be able to do so, preferably by activating the save control 1150.
Finally,
By selecting the filter control 1160 the user will be able to filter the songs in the parameterized similarity playlist as has been described previously. Additionally, by activating the “back” button 1170 the user will be able to go back one step to select another base music item. It is also preferred that the user will be given the option to select a music file from the similarity playlist to be the new base music item, thereby providing a starting point for the generation of a new playlist.
Of course, many modifications and extensions could be made to the instant invention by those of ordinary skill in the art. For example, in one preferred embodiment the user will be able to access an online database of digital music items and have a playlist generated from those items based on the user's selected song. Such an approach could further be enhanced by allowing the user to sample the content of the resulting similarity playlist and, additionally, to purchase content that he or she likes but does not own. Additionally the concept of parametrization and filtering of the similarity playlist might be implemented within an online environment, wherein both the song model database and audio content database are stored online and the user accesses these databases via a network connection.
Note that the term “database” should be broadly construed herein to include traditional hierarchical, relational, and flat databases as well as databases that consist of multiple files and/or that are resident on one or more computers.
Further, although in the preferred embodiment the ID that is assigned to each musical item in the database will be unique and numerical (e.g., a sequential count, a hashed value based on the song name, genre, etc.) those of ordinary skill in the art will recognize that there are many alternative ways to implement such an ID assignment process.
Thus, the present invention is well adapted to carry out the objects and attain the ends and advantages mentioned above as well as those inherent therein. While the inventive device has been described and illustrated herein by reference to certain preferred embodiments in relation to the drawings attached thereto, various changes and further modifications, apart from those shown or suggested herein, may be made therein by those of ordinary skill in the art, without departing from the spirit of the inventive concept the scope of which is to be determined by the following claims.