The invention relates to a method for generating data objects for execution in a sequence. In an embodiment, the invention relates to a generative music method.
Previous attempts at generative music software have generally fallen into two categories: those whose output is overwhelmingly random, because they do not apply the rules and constraints to the random output that are necessary to produce the kind of structured music that ‘makes sense’ to the listener's ear; and those that use machine-learning to build up a database of the likely patterns and progressions in a certain style of music, in order to imitate that style.
An invention is set out in the claims.
Embodiments of the invention will now be described, by way of example, with reference to the drawings, of which:
A method for providing a plurality of outputs at respective beats is provided. The method combines a random number generator with a set of rules and algorithmic processes that are applied to the random output to generate in real time or as a batch a coherent sequence of data objects executable in sequence to provide an output at beats corresponding to the sequence position.
Throughout the following, the term “array” may be used interchangeably with “matrix”, “vector”, or any other sequence of values or continuous function. Whenever the use of a random number generator and a probability array is described, it is referring to a process in which a random number is generated and used to select an entry in a probability array, as described in more detail below.
In overview, referring to flow diagram 100 data objects are generated and stored in a list for sequential execution. The data objects may represent audio data, for example. In particular, the audio data may represent musical notes. Thus the list may be referred to as a list of notes. When a data object is executed, musical note data is appended to a list or file, which may be a list of musical note data scheduled for playback through an output, or a data file used for storage and future playback. This data file may be a text file, a MIDI data file, an audio data file, or any other type of data file. The list or file of musical note data may be played directly on an output device or sent to a remote output device on which it can be played. For example musical note data may be streamed over the internet to a remote device, and the remote device may use the musical note data to assemble an audio file or other playback mechanism and play the resulting music.
The method executes the data objects in the list sequentially, with each data object ascribed an instant in time on which it should be played. Because the list of notes is appended to a data file, or played, sequentially, the position of a note in the list determines the instant in time ascribed to that note on which it will be played without requiring the time to be explicitly specified, as this is determined by the ordering. The method 100 therefore schedules a data object for execution by placing the object at a particular position in the list, corresponding to an instant on which the note represented by the data object is to be played. In musical terms, each position in the list represents a beat.
With reference to
The number of beats that require data objects to be generated is determined at step 102. This may be all the beats in the total duration of the music, which we will call batch generation. Alternatively, it may be any number of beats that does not represent all the beats in the total duration of the music. If this is the case, the method 100 may be run during playback of the data objects, whenever there are only a small number of generated data objects yet to be played, so that data objects are only generated just before they are to be played; we will call this realtime generation. Alternatively, the method 100 may be repeatedly run during or before playback of the data objects, in order to prepare as quickly as possible the first data objects to be played and be able to play them while more data objects are being generated; we will call this continuous generation. In another embodiment, it may be determined at step 102 that no number of beats should be set and that data objects should continue to be generated indefinitely until either a random number generator coupled with a probability array determines that the music should end or an input such as a user input or another input described in more detail below signals that the music should end.
The number of beats that require data objects to be generated may be set by the user. Alternatively, it can be set using a random number generator coupled with a probability array, or according to the total duration of the music. The total duration of the music, similarly, is determined using a random number generator coupled with a probability array. Alternatively, the total duration may be set by a user or by some other input, described below.
The list of data objects may comprise a number of lists each representing a different part, for example a melody part, a bass part, a harmony part or other type of part. Within each list of data objects, it is not necessary that every instant be assigned a data object; a data object may last for more than one instant, in which case there will be no data objects after the data object at the further instants in the list that the data object covers. A null value such as a rest may indicate that no output is to be generated for one or more instants. At step 106, new data objects are generated for each part that requires more notes. The exact make-up of parts may vary with each running of the method 100, and may also vary within a single running of the method 100. An example of a list of data objects 500 comprising a number of parts is shown in
Step 104 of the method 100 may be repeated as described above and may be referred to as the generator loop. The generator loop until data objects have been generated for all the beats the method 100 is generating data objects for.
During each cycle of the generator loop, new data objects are generated for the first beat that requires data objects and hasn't yet had data objects generated.
A data object's position in the list of data objects is used to ascribe an instant on which that data object should be played. When a data object is generated, it is ascribed an instant for playback by adding the data object to an ordered list at a position corresponding to the beat number on which the data object should be played. The list of data objects in method 100 maintains the current beat number.
Alternatively, the data objects could each store a scheduled beat number corresponding to the beat on which they are scheduled to be executed. The data objects could then be stored in an unordered list. On execution of the list, step 106 of the method 100 searches through the list and executes the data objects in order of increasing beat number.
Before any data objects are generated in list of data objects, a setup routine may be run. Empty lists for storing the data objects of each part, different types of data object (for example notes, chords and phrases), and probability arrays used in the generation of data objects are initialised. These items may be initialised in a variety of different ways, for example by using a random number generator coupled with probability arrays, by importing values from a stored settings file, by taking values from a user's input or some other input described below, or by a combination of the three methods. Similar methods are used to select musical factors such as key, tempo, time signature and the number and composition of parts. This setup routine is run before the first time method 100 is run, but it need not be run for further calls of method 100 within the same musical piece, since these calls can use the same lists and probability arrays that were set up at the start of the musical piece.
The generation of data objects will now be described in more detail. Whenever a data object is generated, various checks are performed against data objects scheduled for simultaneous instants in time or on adjacent or nearby beats in the same part as the data object being generated or in other parts. Each check increases or decreases the probability, drawn from the probability arrays initialised during the setup routine described above, of a specific value being selected for a particular variable of the data object being generated. Once all checks have been run, the values of the various variables of the data object being generated are selected using a random number generator coupled with the probability arrays that have arisen from the checks.
The checks used in the generation of data objects include consonance checks, which will be described in more detail below. In addition to consonance checks, other checks may be performed whenever a data object is generated. For example, the method may check that the pitch of the data object being generated falls within a particular range. Additionally or alternatively, the method may check that an interval between the pitch of the data object being generated and the pitch of a preceding data object falls within a pre-defined set of allowed intervals.
Other factors are considered when generating data objects through the use of probability arrays determining intervals and chord progressions that are likely to follow previous melodic and harmonic movement, probability arrays determining durations that are likely to follow previous rhythmic movement, probability arrays determining whether phrases and sections should come to an end, and probability arrays determining what happens if it is decided that a phrase or section will come to an end. Probability arrays will be described in more detail below.
Data objects may be generated as free data objects. A free data object may be generated independently of other data objects, or it may be generated taking surrounding data objects into consideration. It may be generated by using a random number generator combined with a probability array initialised during the set-up routine described above, or by using the checks described above, or by using a combination of these two methods.
A number of data objects may be generated as part of a pattern or group based on a certain sequence of data objects that has already been generated. The certain sequence of data objects can be a sequence of data objects generated earlier on in the music that has been stored in the list of notes. By generating patterns of data objects based on previous sequences of data objects, musically coherent sequences of data objects can be generated.
The pattern of data objects may be a repeat of the certain sequence of data objects, in which case the certain sequence of data objects is copied to a position in the list corresponding to the beat on which the certain sequence should be repeated. Alternatively, a transformation may be applied to the certain sequence to generate the pattern. Any suitable transformation may be used, for example a transposition (a shift in the pitch of the notes), an inversion (an inversion of the intervals between the notes), a retrograde (a reversal of the order of the notes), a rhythmic alteration or other transformations and processes.
To ensure both that sensible patterns are constructed and that there is a great deal of variation in the way patterns are constructed, the following mechanism is employed. With reference to the method 600 of
Once the initial note of focus has been chosen, a transformation is selected at step 604. The transformation may be any of the transformations mentioned above, or any other suitable transformation. The first note of the pattern is then generated using all, some or none of the elements of the note of focus, at step 606. The number and types of elements of the note of focus that are re-used for the new data object is decided by a random number generator and probability array. In order to start generating the second note of the pattern, a note adjacent to the note of focus (the next or the previous note depending on whether the selected transformation involves maintaining or reversing the order of the original data objects, respectively) is selected from the list of notes and becomes the note of focus at step 608, so that there is a new note of focus on which the second note of the pattern can be based.
The selected transformation is then applied to the note of focus at step 610, generating a new data object. The various data object-generation checks described above are applied to this data object at step 612. If the application of these checks renders a data object impossible, or if a random number generator and probability array dictates that this course should otherwise be taken, any number of the elements or variables of the new data object may be altered at step 614. The variables contained within data objects will be described in more detail below. The data object is then added to the list of data objects at step 616, and the sequence may come to an end at step 618, as determined by a random number generator and probability array. If the sequence does not come to an end, the note of focus again shifts either forwards or backwards (according to the type of transformation) in the list of notes at step 620, and the method 600 returns to step 610 to generate another note of the pattern.
In an alternative embodiment for generating a pattern of data objects, with reference to the method 200 of
Once the sequence of data objects on which the pattern is to be based has been chosen, the pattern is generated by applying a transformation to the sequence at step 206. The various data object-generation checks described above are then applied to each data object in the pattern at step 208. These checks are repeated at various different shifts in pitch of the entire pattern. At step 210 a particular pitch shift is selected based on the outcome of the checks. The selected shift of pitch is applied to each of the data objects in the pattern and then the data objects are added to the list of data objects at step 212.
In each of the above pattern generation methods, as each new data object is generated, consonance checks and other checks are performed. If the new data object does not satisfy these checks, one or more of the parameters (or elements or variables) of the new data object may be varied until the new data object does satisfy these checks. Alternatively, the pattern may come to an end. A variation or an end to the pattern may also occur if the checks are satisfied but a random number generator and probability array determines that one of these options should be taken.
When either a free data object or a pattern is being generated in a given part, the other parts may be in one of three states. Firstly, a second part may already have had data objects generated for the beats for which data objects are being generated in the first part, in which case the data objects in the second part can be used for consonance checks and other checks during the generation of data objects in the first part. Secondly, a second part may have no data objects yet generated for the beats in question, in which case the data objects can be generated in the first part without recourse to this second part. Or thirdly, a second part may be having notes generated alongside the generation of the data objects in the first part, in which case the generation of data objects alternates between the parts, and for each data object that is generated one of the first two states described above is found in the other generating part.
Whether the method generates a free data object or a pattern of data objects, which certain sequence of data objects the pattern is based on, and whether the pattern constitutes a repeat or some other transformation of the certain sequence is decided in real time as the data objects are generated using a random number generator coupled with probability arrays.
In order to generate music that has a sense of direction, the method 1100 of
The overall probability of the most likely sequence of data objects that would fill the remaining gap between a data object being considered and the target data object may be calculated in the following manner. An empty list of lists of data objects is created at step 1106. The probabilities of the various possible data objects for the first beat of the gap are calculated at step 1108 using the usual data object generation methods involving probability arrays, as described above. Each of these possible data objects is entered into the list of lists of data objects as a new list of data objects at step 1110. The list of data objects with the greatest overall probability is selected at step 1112, and this list of data objects is used to continue. Steps 1108, 1110 and 1112 are repeated until the most probable sequence of data objects that could fill the gap has been calculated; at every stage, the list of data objects with the greatest overall probability is selected at step 1112, the probabilities of the various possible data objects that can follow it are calculated at step 1108, and a new list of data objects is created for every possible ensuing data object by adding it to the list currently being used, with all these lists being added to the list of lists of data objects, at step 1110.
When the most probable sequence of data objects that could fill the gap has been calculated, the list of data objects with the greatest overall probability is selected at step 1112.
In an alternative embodiment, the overall probability of the most likely sequence of data objects that would fill the remaining gap between a data object being considered and a target data object may be substituted for an approximation of this value. This approximation may be reached by limiting the number of possible data objects considered at every stage to a certain number of the most probable data objects. Alternatively, it may be reached by using some heuristic, such as the number of semitones between the musical pitches of the two data objects or the number of steps around the circle of fifths between the musical pitches of the two data objects. Such an approximation leads to faster data object generation, which is useful when trying to maximise speed in realtime generation and other forms of generation.
The method can structure the music into phrases by generating data objects to form a cadence at the end of each phrase. The degree to which the music is structured into phrases depends on how often the method generates a cadence sequence, which is determined during the setup routine described above and can be altered during the method 100 through the use of a random number generator and a probability array. Different rules and probability arrays determine the generation of data objects for a cadence and govern the beat on which the final notes of a phrase will be played, the pitches those final notes will take, and other factors.
The beats on which each phrase begins and ends are stored so that the entire phrase, or a section of a phrase, can be recovered and repeated or used in the generation of a pattern of data objects as described above.
The use of probability arrays described herein allows certain progressions in the music to be more likely than others, while leaving enough of a random element so that the music is different every time the method is run. For example, a probability array can represent the likelihood of various different note durations for the next data object given the note duration of the preceding data object.
Such a probability array is shown in
To give an example, a probability array may represent four different outcomes: A, B, C and D. If each of the four outcomes is equally likely, the probability array may be represented as (0.25, 0.5, 0.75, 1). To select an outcome, a random number between 0 and 1 is generated. The random number is compared with each value in the probability array, from left to right in this example, and the outcome corresponding to the first value that is greater than the random number is selected. For example, if the random number generated is 0.85, outcome D is selected. A probability array therefore represents a weighting associated with various outcomes.
To determine the duration of the note of the data object being generated, a row of the probability array is selected based on the duration of the note of the preceding data object. For example, if the duration of the preceding note was 2, the fourth row is selected. The selected row of the probability array represents the various likelihoods of the different possible durations for the next note. A particular duration is selected for the data object being generated by coupling the selected row with a random number generator as described above.
Probability arrays can be imported from a configuration file or they can be built during the generation loop at step 104 of method 100. A method 300 will now be described, in relation to
The same process can be used in many different contexts herein and wherever the use of probability arrays is mentioned. For example, probability arrays can be used to select the pitch of the next note based on a pitch, duration or other parameter of a preceding note; to select the transformation to be applied to the next pattern based on a transformation applied to a previous pattern; to determine whether to instigate a cadence based on the number of beats since the last cadence; to determine the types of parts used and the relationship between them (that is, for example, which parts are checked for consonance against which other parts); and to make many more determinations.
Each data object represents a note in the music. Each data object contains variables corresponding to the pitch, duration and volume of the note, as well as information about on which instrument the note should be played, and other factors. A data object may represent, inter alia, a single note, a collection of several notes (a chord), a pitched or unpitched percussive sound, a duration having no pitch (a rest), a mode or scale of notes representing the harmony on that beat, or some other pre-recorded sound.
Once a data object has been generated, it is stored in an expanding list of data objects. The list of data objects stores the data objects in an order corresponding to the beats on which the data objects will be played. Each data object remains in the list of data objects until all data objects have been generated, so that it can be used to generate patterns and other sequences of data objects as described above. Once the list of data objects is complete, each data object may be executed. This may involve immediately converting the list to audio and playing it back on an output device, or storing it as text, audio or other data for future playback or for combination with some other media, for instance a video or a game. Once the data objects have been executed, the list of notes may be cleared.
At any point during or after the generation of data objects, the list of data objects can be regenerated from a certain point in the list of data objects onwards. Alternatively, a specific section of the list of data objects can be regenerated. This presents a technical challenge, since the state of the generation parameters as they were when the data objects in the list at the point to be regenerated were originally generated must be recreated. This is achieved by storing the state of the generation parameters on every beat in the list of data objects, so that that state can be returned to in order to regenerate data objects. This method enables users to edit certain data objects or sections of the list of data objects after the generation of data objects is complete.
The consonance checks mentioned above will now be described in more detail. Consonance is, essentially, the degree to which notes fit together, given their fundamental frequencies and harmonics. The more two notes sound like they go together, in general, the greater the consonance between them. This principle applies both to data objects scheduled for simultaneous instants in time and to data objects scheduled for sequential instants in time. Generating data objects with a high level of consonance presents a technical challenge. To achieve a high level of consonance, a number of intervals are defined. For each interval, a probability is defined that represents the likelihood of two simultaneous data objects having that interval appearing in the music. The intervals and their respective consonance probabilities may be either stored in a settings file that is read during the setup routine described above, generated using a random number generator and probability arrays, or generated using a combination of these two methods.
When a data object is generated, its pitch is chosen according, inter alia, to how consonant it is with the pitches of other data objects that are scheduled for the same instant in time as the data object being generated, or that are scheduled for an instant in time that is soon before the data object being generated and whose output will be ongoing at the scheduled instant in time of the data object being generated. The consonance of a potential pitch for a data object being generated is checked by calculating an interval between the potential pitch and the pitch of one or more other data objects that are scheduled at simultaneous instants in time or whose output will be ongoing at the scheduled instant in time of the data object being generated, and selecting the potential pitch for the data object being generated according to the outcome of a random number generator coupled with the defined consonance probability for that interval.
Other rules may also be applied: for example, a dissonant note (that is, a note that is not consonant) may require an adjacent consonant note to follow it; or a consonant note may be allowed to extend past the end of a note in another part, but a dissonant note may not be allowed to do so. A consonance check may also involve checking surrounding data objects, and evaluating potential data objects that could follow the data object being generated.
Instead of scheduling data objects to be played on a particular beat by arranging the data objects in a list, the method can allow for notes to be scheduled between beats by scheduling the data objects to be played at a specific time. In this embodiment, data objects are ascribed playback instants at scheduled times instead of on a particular beat. In an alternative embodiment, data objects can be ascribed playback instants both on particular beats and at specific times that fall between beats (by being scheduled at times relative to beats; for example, a data object may be ascribed a playback instant a certain number of milliseconds after a certain beat).
The execution of a data object may cause a sound from a library of pre-recorded sounds to be played. Alternatively, when a data object is executed the waveform of the sound may be generated by the application and then played. In an embodiment, instead of or in addition to playing a sound, audio data representing the sound may be appended to an audio file when a data object is executed. The audio file may then be retrieved and played back or downloaded at a later time.
Similarly, non-audio data may be appended to a file when a data object is executed, and thus stored for later retrieval, execution and playback, so that a user may recall a particular piece of music generated by the method; the stored data can be used by the method to create new data objects to be executed (or, if the data objects themselves are stored, they can simply be executed themselves).
The various parameters used in the setup routine and when generating data objects may be modified in response to feedback received from the user. The feedback may be received through buttons on the user interface, deduced from the types of music the user chooses to keep listening to and the types of music the user chooses to skip, deduced from the types of music the user chooses to listen to at various times of day or in various situations, or received by any other suitable method.
The various parameters used in the setup routine and when generating data objects may also be modified in response to various other inputs. For example, the user may be able to choose a style of music, the tempo, etc. through a user interface. The method may also adjust those parameters in response to data received from a sensor of a device, for example an accelerometer or a microphone, or in response to other factors, such as information obtained from the internet about the current weather, location or time of day, or information about any media the music is intended to accompany, such as the duration of a video.
Data received from the various sources identified above can be used to affect both overall settings, such as genre and tempo, and the probability arrays used in the generation of data objects that determine the course the music takes. For example, if a user is running while listening to music, the method can detect the regular motion that arises from the running and adjust the tempo of the music to align with the period of that motion. Similarly, data received indicating that it is raining can be used to adjust the probability arrays such that the music is more likely to tend towards a minor key.
A method 700 by which data received from various sources can be used to affect the generation of data objects will now be described in more detail in relation to
At step 704 the sync points are used to generate the structure of the music. The specific instants ascribed to the sync points are used to determine possible numbers of beats, speeds, time signatures, and other factors that can be selected for the data objects between the sync points. The total duration of the music, as well as probability arrays as described above, may be used in conjunction with the sync points to determine this structure. For instance, if two sync points are placed 3.4 seconds apart from each other, a probability array may dictate that 64 beats must fall between those sync points, which in turn would dictate that the speed of the music must be 0.053125 seconds per beat.
Then, during the generation of data objects, when data objects are generated in the time region of a sync point, that sync point is used at step 706 to change the generation parameters, for instance by altering the speed of the music or the instruments being used.
In an alternative embodiment, a sync point can be set during the generation of data objects, either for some future instant in time or for the instant that has been reached in the generation. In this case, an input may immediately have an effect on the generation parameters, meaning that the music responds to external inputs as it is generated or played.
A schematic example of a user interface is shown in
The settings screen 900 is shown in
A settings choice screen 1000 is shown in
A schematic example of an alternative user interface is shown in
The sync points screen 1300 is shown in
The method may be performed on a device, and the sounds generated by the method may be stored or output on that device. Alternatively, the method may be performed on a server, and the resulting data streamed to a remote device for storage or playback. These data may be in the form of data objects, in which case the data objects are converted to audio output on the device using the audio output playing method described above; alternatively, the data may be in the form of audio data, which can be output on the device with no conversion necessary.
The method may be performed in order to generate sounds intended to accompany a video. In this case, data received from inputs may affect the parameters used to generate data objects, with the effect that the music may respond to these inputs. This input may be taken from the video automatically, such as the duration of the video or data from visual or audio analysis performed on the video; alternatively, it may be input by a user, such as the sync points described above.
The method may be performed in order to generate sounds intended to accompany events unfolding in realtime, such as a video game, a live stream or a user's day-to-day actions. In this case, too, data received from inputs may affect the parameters used to generate data objects, with the effect that the music may respond to these inputs. This input may be taken from the events automatically, such as a change of level in a video game or some form of code trigger; alternatively, it may be input by a user, such as a user pressing a button to indicate that the music should decrease its speed.
It will be appreciated that the approaches described above can be performed using any appropriate algorithms and in relation to any appropriate device. The steps can be performed in software, firmware or hardware and can be implemented for example in the form of a downloadable application interacting with audio and display hardware components on a device such as a computer, laptop, tablet or telephone. Such a device 400 is shown schematically in
Number | Date | Country | Kind |
---|---|---|---|
1219521.0 | Oct 2012 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2013/052831 | 10/30/2013 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2014/068309 | 5/8/2014 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6313390 | Adriaans et al. | Nov 2001 | B1 |
7777123 | Hennig et al. | Aug 2010 | B2 |
20070074620 | Kay | Apr 2007 | A1 |
20110010321 | Pachet et al. | Jan 2011 | A1 |
20150255052 | Rex | Sep 2015 | A1 |
Number | Date | Country |
---|---|---|
2 903 803 | Jan 2008 | FR |
Entry |
---|
The International Bureau of WIPO, “Search Report” in application No. PCT/GB2013/052831, dated Feb. 24, 2014, 12 pages. |
Redeout, E., “Sibelius 1.0,” Keyboard, Us, Miller Freeman Publications, San Francisco, CA; vol. 25, No. 4, Apr. 1999; pp. 108-110,112,114, XP000823548; ISSN: 0730-0158. |
Inernational Searching Authority, “Written Opinion” in application No. PCT/GB2013/052831, dated Feb. 24, 2014, 7 pages. |
Number | Date | Country | |
---|---|---|---|
20150255052 A1 | Sep 2015 | US |