A portion of the disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in a published Patent and Trademark Office patent file or record, but otherwise reserves all copyrights whatsoever.
The invention relates to methods for creating variations of sequences of symbols, and more particularly, to methods for creating a variation of a piece of music or another symbol sequence whereby the variation differs from the original sequence but nevertheless retains some features of the original sequence.
The present invention is an improvement upon the computer-aided techniques of my prior U.S. Pat. No. 5,606,144, and earlier MIT doctoral thesis Musical Variations from a Chaotic Mapping, and papers “Musical Variations from a Chaotic Mapping” (Chaos, 1996), “A chaotic mapping for music and image variations” (Proc. Fourth Int'l. Chaos Conference, 1998), and “Creating Musical Variation” (Science, 2008). The invention is a method for extending the variation capabilities of musical compositions, including both printed and recorded versions of the same. It should be noted that the term “variation” is used herein to refer to any process by which some features of a work (such as a musical piece) change while other features remain the same, and the term “variation technique” is used herein to refer to any modification and/or compositional procedure.
The present invention is mainly described in this paper in terms of its application to musical symbols or notes, following the illustrations of my said prior patent. However, it should be noted that the present invention is not limited to variations of music, and that embodiments of the present invention are generically applicable to all types of symbols, characters, images, and such like.
In my prior patent, a chaotic mapping technique is provided for generating, for example, musical variations of a given work. This technique, based on the sensitivity of chaotic trajectories to initial conditions, produces changes in the pitch sequence of a piece. The mapping in embodiments of my prior patent utilize two chaotic trajectories from the Lorenz equations—a system comprising three nonlinear first order differential equations
dx/dt=σ(y−x) (1)
dy/dt=rx−y−xz (2)
dz/dt=xy−bz, (3)
where σ=10, b=8/3, and r=28 (E. N. Lorenz, “Deterministic nonperiodic flow,” J. Atmos. Sci. 20, 130-141 (1963)). Other embodiments use other chaotic systems.
In the original chaotic mapping of my prior patent, the x-components {xi} of a reference chaotic trajectory are paired with a sequence of musical pitches {pi}. Each pi is then marked on the x axis at the point designated by its xi. In this way, the x-axis becomes a pitch axis configured according to the notes of the source composition. Then, a second chaotic trajectory, whose initial condition differs from the first, is launched. Its x-components trigger pitches on the pitch axis that vary in sequence from the source work, thus creating a variation. The possible variations are virtually infinite in number, and can be very close to the source, diverge substantially, or achieve degrees of variability in between these two extremes.
The Lorenz equations arise in applications ranging from lasers to private communications, and have also served as generators of “chaotic” music, where a chaotic system is allowed to free-run and its output converted into a series of notes, rhythms, and other musical attributes in order to create a piece from scratch. However, these approaches did not generate variations on an already completed piece.
The original chaotic mapping of my said U.S. Pat. No. 5,606,144 operates on the note list of a source piece, e.g., one provided by a MIDI file score, and treats each successive pitch, or simultaneous sounding of pitches (chord), as a parsed event. This represents the default parsing of the MIDI file score.
However, despite its ability to produce variations which can be delightful, appealing to musicians and non-musicians alike, the original chaotic mapping described in my previous patent has limitations. For instance, it requires that no note occur in the variation that was not already present in the input source file. While this can create a ‘pitch space’ link between the variation and the input source file, i.e., the variation is composed of pitches found in the input file, a richer variation technique would allow pitches outside the pitch space of the input file, if desired. Furthermore, my original chaotic mapping technique does not offer choices to apply different variation techniques within one piece.
Also, my original chaotic mapping approach applies only to a discrete-time signal representation of a source piece, e.g., a MIDI file score, prose, poetry, image, etc., but not to a continuous-time signal representation of a work, e.g., an audio recording, video, film, spoken word art, video games, and so on.
What is needed, therefore, is an improved chaotic mapping method which can introduce notes and/or symbols into a variation which are outside the pitch space of the input file, which can combine and offer choices between both new and time-honored variation and/or compositional procedures, and can generate variations not only of a discrete-time signal representation but also of a continuous-time signal representation of an existing work.
The present invention is a method for transforming an ordered input into a variant ordered output, whereby the variant output differs from but maintains a recognizable similarity to the input. Embodiments of the method include the following steps:
In embodiments, the hole-selecting algorithm includes application of a chaotic mathematical function. In some of these embodiments the chaotic mathematical function is a fourth-order Runge-Kutta implementation of a set of Lorenz equations, and in some of these embodiments the Lorenz equations are:
dx/dt=σ(y−x); (4)
dy/dt=rx−y−xz; and (5)
dz/dt=xy−bz, (6)
where σ=10, b=8/3, and r=28.
In some embodiments, numeric values from the two sets of values having the same index can be compared, and the original element can be replaced if the numeric value from the second group is larger (in some embodiments) or smaller (in other embodiments) than the numeric value from the first group.
In various of these embodiments, the chaotic mathematical function is used to generate a first set of numeric values and a second set of numeric values, where the number of values in each set equals the number of original elements. Elements of the first and second sets of numeric values are then paired according to a pairing algorithm, and the paired elements are compared to each other so as to determine if the corresponding original element is a hole element.
Still other embodiments include for each value of j from 1 to imax finding the value k of the index of the smallest member of the first group of values which is greater than the “j” member of the second group of values, and determining that Nj is a “hole element” if j≠k.
Some embodiments of the present invention are applicable to inputs that are ordered sets of discrete items, such as MIDI files or other lists of notes, note durations, and note onset times, while other embodiments are applicable to inputs which are continuously varying functions, such as recordings of music. Some embodiments include parsing a continuous recording of sound into discrete notes, note durations, and time intervals between notes.
In various embodiments an input which is a continuous function is divided into a sequential series of original elements which are segments of the input. The segments can be determined by simply dividing the input into segments of equal length. In other embodiments, starting points, and in some embodiments also ending points, are selected by a user by automated detection of time signature beats, boundaries between musical phrases, or repetitive structures.
The substitution algorithm can include scanning input segments in a forward or backward direction until another segment is located that is substantially equal in length to the hole segment. The scanning can begin at the beginning or end of the input, or at a segment that is symmetrically located with respect to the hole segment on opposite sides of a symmetry element.
A given substitution element can be restricted to replacing only one hole element. A variation procedure can be applied to a selected element. If the input is a musical input, a substitution element can be derived from music that is part of the input, distinct from the input but part of a musical composition to which the input belongs, or part of a musical composition to which the input does not belong.
A substitution element can be derived from a musical seed source or other seed source which is distinct from the input and includes an ordered set of elements. An element of the seed source can be selected which immediately follows an element which is at least similar to the element immediately preceding the hole element. A chaotic mathematical function can be applied to a seed source in order to select and/or modify a substitution element. The substitution element can be derived from a plurality of elements of the seed source, and/or an algorithm can be applied to the seed source such as scanning, stenciling, interval shift and/or insert, reversal, inversion (dynamic), repetitive beat, and/or phrase structure.
For musical inputs and/or seed sources, providing a substitution and/or modification element can include reversing pitch order, pitch duration, or pitch onset time, inverting a note (for example about a closest preceding note belonging to an element which is not a hole element), moving a note by a specified interval, and/or adding at least one note having a pitch which is offset by a specified musical interval from a pitch included in the hole element.
In some embodiments providing a substitution and/or modifying element includes application of a pitch or timing mask element to the hole element and applying at least one of “and,” “inclusive or,” and “exclusive or” logic between the mask and the hole element. The mask element can be another of the original elements or an outside element.
Separate tracks from a musical recording (e.g. voice, accompanying instruments, and percussion) can be treated as separate input sources for variation according to the present invention and then recombined to create a “remixed” variant. The tracks can be separately varied and then combined, and/or one track can be used as a seed source for another track. In embodiments, a website or other forum is provided for composers, artists, DJ's, and such like to post the separate tracks from their recordings, so that “fans” can remix them and create individualized and differentiated works.
Music variations created using the present invention can be uploaded to websites such as social media sites, and can also be combined with graphical works such as photographs or videos. For example, a user can take a “selfie” or other photograph, or a short video such as a “vine,” using a hand-held or wearable computing device, and then associate the graphical work with a novel variation of a well-known composition before sharing/sending the combination to one or more recipients by email, tweet, or SMS message. In embodiments, the website can be programmed to generate and play a new variation of the composition each time the website is visited. In other embodiments, the present invention can be implemented in a website, allowing a user for example to make a recording or upload a song, create a variation according to the present invention, and share/send the variation to one or more friends or other recipients in combination with a message.
In embodiments, a composer or choreographer can create short variations of a musical input using the present invention that will accompany the pictures in a storyboard presentation for a project such as an ad campaign, a Broadway show, or a new movie concept, so as to win approval and commitment to the project before investing the time and effort required to create a full accompanying score. In other embodiments, short musical variations created using the present invention can be included with a PowerPoint or similar presentation, so as to add an audible component to a presentation of a concept such as a business proposal, artistic concept, or project plan. In some of these embodiments, the present invention is embedded within the software used to generate the presentation, to facilitate the creation by the presenter of unique auditory elements.
The present invention can enhance social interactions in other ways. For example, a greeting card can include a small chip that is accessible via the web or via a computing device, so that the user can store thereupon a unique and personal composition created using the present invention, to be played when the card is opened. The user can further include a recording of his or her own voice or of a synthesized voice, or another acoustic or electronic input, to further personalize the message. For example, a Valentine's Day card could include a custom variant of a romantic song where a recording of the sender's voice speaking the recipient's name is used as a seed source. Or the sender could record himself/herself sing the Happy Birthday song and use the recording as a seed source to vary a well-known composition (e.g. theme from Star Wars) to be played by a birthday card.
Websites can host “chains” of compositions where individuals create and post successive variations of a starting composition. In other words, a first user can create a first variant of a source input composition, a second user can create a second variant that is a variant of the first variant, and so forth. A visitor to the website can then enjoy listening to a succession of variations that may begin as small changes to the input composition, and then evolve to variations where the input is hardly recognizable.
In embodiments, the present invention is implemented in a small electronic chip that can be included in a larger item, such as a hand-held device (smart phone, iPod, iPad, tablet, etc), an MP3 player, a greeting card, or wearable technology such as smart watches, smart bands, or wearable computers. In some of these embodiments, the item is thereby enabled to play variations of music stored therein. For example, the chip can be instructed to create a new variation of a song every time it is played on an MP3 player. In other embodiments, the chip is included in a greeting card, so that the sender can create a unique composition to be played when the card is opened without resort to a website or separate computer. In some of these embodiments, the chip can be programmed to play a different variation each time the card is opened.
The present invention is a method practiced by a computing device for transforming an input musical composition into a variant musical composition, whereby the variant composition differs from but maintains a recognizable similarity to the input composition. The method includes parsing by the computing device of the input composition into an ordered sequence of original elements {Ni}, the original elements being sequentially indexed by successive integer values of an index i from 1 to a maximum value imax, for each value of i, the computing device using a hole selecting algorithm to determine if Ni is a candidate for substitution, herein referred to as a “hole element,” and using a substitution algorithm, for at least one hole element the computing device providing a substitution element and replacing the hole element with the substitution element, the resulting ordered set of original and substituted elements constituting the variant composition.
Embodiments further include applying the method separately to at least two tracks of the input musical composition so as to create variants thereof, and recombining the variants to create a remixed musical composition.
In some embodiments, a plurality of tracks from an input musical composition are posted on a website, and the website is configured to enable visitors to the website to access subsets of the tracks and to create therefrom a remixed musical composition.
In other embodiments, the input musical composition includes a subset of a plurality of tracks included in a musical recording, and the substitution element is an element Ms derived from at least one seed track of the musical recording, wherein the at least one seed track is not included in the input musical composition, the substitution creating a remixed musical composition.
In various embodiments the input musical composition includes a concatenation of at least two subsets of a plurality of tracks included in a musical recording. And in certain embodiments the input musical composition includes a concatenation of at least two subsets of a plurality of tracks included in a plurality of musical recordings.
Selected embodiments further comprise combining the variant composition with a graphical element. In some of these embodiments, the graphical element is a graphical image. In other of these embodiments, the graphical element is a video or a part of a video. In still other of these embodiments, the graphical element is a film or a part of a film. In yet other of these embodiments, the graphical element is a video game or a part of a video game.
In certain other of these embodiments, the graphical element is a presentation slide element or presentation slide deck. Some other of these embodiments further include forwarding the combined variant composition and graphical element to at least one recipient. In various of these embodiments, the graphical element is an element of a storyboard that describes a proposal for a musically accompanied graphical work. And in some of these embodiments, the musically accompanied graphical work is a musically accompanied video.
In other of these embodiments, the graphical element is a slide presentation created by a presentation software application. And in some of these embodiments the presentation software application is configured to perform the steps of parsing, using the selection algorithm, and using the substitution algorithm.
In embodiments, the input musical composition is associated with a website, and the website is configured, each time a user visits a designated page of the website, to repeat the steps of using the hole selecting algorithm and using the substitution algorithm to create a new variant of the input musical composition, and play the new variant to the user.
In various embodiments, the input musical composition is associated with software, the software being configured, each time a user opens it, to repeat the steps of using the hole selecting algorithm and using the substitution algorithm to create a new variant of the input musical composition, and play the new variant to the user.
Certain embodiments further include storing the variant composition in a digital device included in a greeting card, the digital device being configured to play the variant composition when the greeting card is opened.
In selected embodiments, a digital device is configured to perform the steps of parsing, using the hole selecting algorithm, and using the substitution algorithm. In some of these embodiments, the digital device is included in a greeting card, a toy, an MP3 player, a cellular telephone, or a hand-held or wearable electronic device. In other of these embodiments, the digital device is configured to repeat the steps of using the hole selecting algorithm and using the substitution algorithm each time a musical composition is accessed, so that a new variant composition is played each time the composition is accessed.
In embodiments, the method is practiced by an application running in hardware or software on a hand-held or wearable electronic device. In certain embodiments, the method is practiced by a computing device that is accessible via a network to a hand-held or wearable electronic device.
In various embodiments, the method is practiced by a computing module included in a hand-held or wearable electronic device. In some of these embodiments, the hand-held or wearable electronic device is configurable to play a new variant of the input musical composition each time the input musical composition is selected. In other of these embodiments, the computing module included in a hand-held or wearable electronic device is configurable to play a new variant of the input musical composition each time the input musical composition is selected. In various of these embodiments, a plurality of tracks from an input musical composition are accessible to a hand-held or wearable electronic device, and the hand-held or wearable electronic device is configured to enable the user to access subsets of the tracks and to create therefrom a remixed musical composition. And in certain of these embodiments the input musical composition includes a subset of a plurality of tracks included in a musical recording, and the substitution element is an element Ms derived from at least one seed track of the musical recording, wherein the at least one seed track is not included in the input musical composition, the substitution creating a remixed musical composition.
In embodiments, the method is practiced via a website or computing device. In some embodiments, the method is practiced by a video game so as to play a new variant of a musical theme each time specified criteria are met during play of the game.
In other embodiments, the steps of using the hole selecting algorithm and using the substitution algorithm are repeated in a chain sequence, whereby the input musical composition for each repetition is a variant composition created by a previous repetition. In some of these embodiments, the steps of using the hole selecting algorithm and using the substitution algorithm are performed by a website, the website being configured to store the variant compositions and make them available to visitors of the website. And in other of these embodiments, the steps of using the hole selecting algorithm and using the substitution algorithm are performed by a hand-held or wearable electronic device, the device being configured to send the variant composition to another person to keep the chain of variations going from recipient to recipient.
And in various embodiments, the steps of using the hole selecting algorithm and using the substitution algorithm results in a variant composition that is so different from the original composition as to be deemed a new composition in its own right.
The features and advantages described herein are not all-inclusive and, in particular, many additional features and advantages will be apparent to one of ordinary skill in the art in view of the drawings, specification, and claims. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and not to limit the scope of the inventive subject matter.
With reference to
parsing the input 100 into an ordered sequence of original elements {Ni}, the original elements being sequentially indexed by successive integer values of an index i from 1 to a maximum value imax;
Methods of various embodiments vary according to whether the input is a set of discrete elements, such as a MIDI file, or a continuous input, such as a recording of music.
Embodiments Applied to Discrete Inputs Such as MIDI File Scores
Block [1] represents a discrete-time signal, e.g., a MIDI file representation of a musical piece. It serves as the input to Block [2], denoting at least one parsing scheme such as the Discrete-time Signal Parser of
In Block [5], the Improved Chaotic Mapping of
Block [6] denotes a variation of the source MIDI file of Block [1], consisting of changes to the event sequence parsed in Block [2], e.g., modifications to the original pitch/rhythmic sequence that result from introducing pitches and/or rhythmic patterns inside or outside the event space of the source piece.
The variation of Block [6] becomes the input to a MIDI sequencer (Block [7]) capable of playing the variation and outputting it as an audio signal to a Mixer (Block [8]). The Amplifier of Block [9] receives the audio signal of the variation, operates on it, and sends it through the Speakers of Block [10] so that the pitch and/or rhythmic variation of the source piece can be heard.
Embodiments Applied to Continuous Inputs Such as Audio Recordings
An Audio Parser in Block [2], such as the ‘Start-times only’ Parser of
The Improved Chaotic Mapping of
The Audio Variation of Block [6] consists of changes to the event sequence of the source audio recording, as determined by the Improved Chaotic Mapping in conjunction with any designated variation procedure(s) of Block [4]. The Audio Variation of Block [6] passes through the Mixer, Amplifier, and Speaker systems of Blocks [7], [8], and [9] so that it can be heard.
The benefits of the Extended Span and Improved Chaotic Mapping algorithms of the present invention become apparent in comparison with the Original Chaotic Mapping algorithm of U.S. Pat. No. 5,606,144.
Next, a new trajectory is started at an IC differing from the reference (
The Extended-Span Chaotic Mapping Method
When using the original chaotic mapping, the initial conditions (ICs) of the new trajectory must be chosen fairly close to those of the reference solution in order to create variations that ‘make sense’ stylistically, at least on an intuitive level. To allow more flexibility with the ICs, an improvement of the original chaotic mapping can be employed, in accordance with the present invention, which musically accommodates a greater departure in initial conditions for the second chaotic trajectory, compared to the first. Thus, for any given variation, ICs can be chosen for the first and second chaotic trajectories that differ from each other by a much greater margin, yet the variation will still maintain a reliable resemblance to the source piece. Such a result is not possible with the original chaotic mapping. This improvement of the original chaotic mapping is referred to herein as the “Extended-span Chaotic Mapping,” because it extends the range of IC values that will produce the same pitch in the variation that occurred in the source piece. Thus, the Extended-span Chaotic Mapping is a kind of ‘foolproof’ chaotic mapping, because no matter how radically divergent the initial conditions, this mapping ensures an audible musical connection between the source composition and the variation than is otherwise possible with the original chaotic mapping.
To apply the Extended-span Chaotic Mapping, as earlier described, a chaotic trajectory with initial condition (IC) of (1, 1, 1) is simulated using a fourth-order Runge-Kutta implementation of the Lorenz equations. Let x1,j denote the j'th x-value of this first chaotic trajectory.
In
where g(j) is assigned the value of the index j of the least x1,j that is greater than or equal to x2,j; thus pg(j) represents some pitch pj from the original Bach pitch sequence {pj} whenever x2,j≦x1,j. But if x2,j>x1,j, then the original pitch pj is retained in the variation.
In the example given here, since x2,j>x1,j for j=1, . . . , 7, 14, 15, 16, the pitches of the variation match those of the source composition. But if the first condition of the Extended-span Chaotic Mapping applies, i.e., x2,j≦x1,j, the variation pitches may or may not replicate the source pitches, as can be seen in the variation pitches at locations j=8, . . . , 13. To determine pg(8), find the least x1,j that is greater than or equal to x2,8, where x2,8=16.84: x2,8=16.84≦x1,8, where x1,8=19.46. Furthermore, x1,8 is the least x1,j≧x2,8. Therefore, assign the index of x1,8(j=8) to g(8) so that g(8)=8→pg(8)=p8. From part F, p8=E5. Therefore, pg(8)=E5=p′8: this same pitch occurs in the variation exactly where it appeared in the source, i.e., at location j=8.
To determine pg(9), find those x1,j≧=8.78. Take the smallest one: x1,170=10.19. Its index j=170 is now assigned to g(9); thus pg(9)=p170, and the 170th pitch B3 of the source Bach now appears as the ninth pitch of the variation. Proceeding in like manner, pg(10) of the variation will yield E4 (p2 of the original Bach), pg(11) will give G4 (p132 of the original Bach), pg(12) will return E4 (p126 of the original Bach), and pg(13) will give A4 (p30 of the original Bach). Recall that pitches 14-16 match those of the source, as given above. In this way, the Extended-span Chaotic Mapping constructs the pitches p′j of the variation, shown in
The Improved Chaotic Mapping Method
Notice further that parts B, C, and D of
Similarly, for j=9, first deduce the value of x2,9=15.24 from part D; find the least x1,i≧15.24 which is x1,9=15.26; take the value of the index i of x1,9 which is 9 and assign it to g(9)→g(9)=9. Applying the ‘hole-generating’ function to x2,1 and x2,9 yields blackened holes since j=g(j) in both cases. On the other hand, open holes denote those j for which j≠g(j). For example, for j=4, apply the function w(x2,j)=g(j) to determine the value of g(4): first deduce the value of x2,4=3.68 from part D; find the least x1,i≧3.68 which is x1,5=6.40; take the value of the index i of x1,5 which is 5 and assign it to g(4)→g(4)=5. Since j≠g(j), i.e., since 4≠g(4), an open hole results for location 4 in the Hole Template. Likewise, for Hole Template locations 5-7 and 16, it can be verified that j≠g(j), and so open holes arise in these locations as well.
Blackened holes indicate no change from the source Bach, and open holes indicate changes will occur in those locations. The notes that will change—Nc,4, Nc,5, Nc,6, Nc,7, and Nc,16—are indicated above the open holes. Each Nc,j denotes the pitch number of the jth pitch of the source piece for which j≠g(j). The unchanging notes of the Bach and the prior event pitches Ep,3 and Ep,15, which will serve as the reference pitches for the Dynamic Inversion process explained in parts F and G, are marked above the blackened holes. For example, prior pitch event Ep,3 represents the pitch immediately prior to four ‘open holes’ in the variation; here Ep,3 corresponds to the third blackened hole=G4. This prior pitch event Ep,3=G4 functions as the reference pitch for the Dynamic Inversion of those Bach input file pitches 4-7, i.e., Nc,4, Nc,5, Nc,6, and Nc,7, which will all be inverted about the G4. Thus, the Nc,j's will be inverted about the pitch occupying the previous blackened hole—here, G4. In the Dynamic Inversion procedure, reference pitches are defined by, and change according to the Hole Template of the Improved Chaotic Mapping. Thus, each differs from any other reference pitch, e.g., Ep,3≠Ep,15 so that the reference pitches are dynamic rather than fixed or static. By contrast, in past and current commercial computer practice, inversions are taken about a reference pitch that is fixed, e.g., middle C (C4) or a user-supplied reference pitch.
The process of Dynamic Inversion is set up in
Since j=g(j) for the first three events of the variation, as already determined by the Hole Template of part E, the original Bach pitches will occur in locations 1-3 of the variation. The same is true for the 8th-15th pitches of the variation. However, since j≠g(j) for j=4, . . . , 7, 16, the variation will change from the source according to the expression for the Improved Chaotic Mapping working in conjunction with said Dynamic Inversion procedure:
where g(j) is assigned the value of the index i of the least x1,i for which x2,j≦x1,i. Here, Dynamic Inversion is used to calculate each new pitch PN of the variation using the expression N=−(Nc,j−Ep,j-1)MOD12+Wp,j-1, where Nc,j denotes the pitch number of the jth pitch of the source piece for which j≠g(j), and Ep,j-1 is defined in terms of Nc,j as the pitch number of the (j−1) pitch of the original pitch sequence that occurs before one or more consecutive Nc,j's. As stated earlier, the prior event Ep,j-1 serves as the dynamic reference pitch about which the inversion of one or more consecutive Nc,j's will take place.
To find p′4, the fourth pitch of the variation, calculate PN where N=−(Nc,4−Ep,3)MOD12+Ep,3=−(72−67)+67=62→P62=D4=p′4, the 4th pitch of the variation. Pitches p′5, p′6, and p′7 result from a similar procedure. The mod 12 option did not apply in determining p′4. Though it was not necessary here, it can be invoked to preclude inversions that move up/down from any reference pitch by more than an octave. To see how the G#4 is assigned to p′16, again calculate PN where N=−(Nc,16−Ep,15)MOD12+Ep,15=−(76−72)+72=68→P68=G#4=p′16.
Applying the Improved Chaotic Mapping in conjunction with Dynamic Inversion yields the variation sequence displayed in
where E′j represents any event of the variation, ei=g(j) denotes any pitch, chord, phrase, beat, note rhythmic value, note-group, and/or any other musical event from the source work that will appear unchanged in the variation as a result of the condition j=g(j), and Ej represents any musical event in the variation produced by the Improved Chaotic Mapping in conjunction with a designated variation procedure whenever j≠g(j). The term g(j) is assigned the value of the index i of the least x1,i for which x2,j≦x1,i.
In Block [1], the first chaotic trajectory {x1,i, y1,i, z1,i}, indexed on i, with initial conditions (x1,1, y1,1, z1,1) is launched. A second chaotic trajectory {x2,j, y2,j, z2,j}, indexed on j, with initial conditions (x2,1, y2,1, z2,1) is simulated in Block [2]. The hole-generating function w(x2,j)=g(j) of Block [3] takes each x-value of the second chaotic trajectory (and/or y-z-values of same) and determines g(j), where g(j) is assigned the value of the index i of the least x1,i such that x2,j≦x1,i. The hole-generating function creates the Hole Template of Block [4] according to whether or not j=g(j). If so, a blackened hole appears at the appropriate j; if not, an open hole occurs.
Block [4] shows a hypothetical example of a Hole Template (resulting from Block [3]), where M represents any given integer. Here, M is the hypothetical value of the leftmost j in the plotting. Suppose that applying the hole-generating function, w(x2,j)=g(j), to x2,j results in j=g(j), and equivalently M=g(M) for the j=M location in the drawing. Thus, for j=M, a blackened hole results. Suppose further that the same process applied to the next x-value x2,j, where j=M+1, also results in j=g(j)→M+1=g(M+1). Then for j=M+1, a blackened hole appears. As stated earlier, blackened holes denote those j for which j=g(j). Here, blackened holes result from applying the hole-generating function to the two x-values x2,M and x2,M+1, as well as to the x-values x2,M+5 and x2,M+6. But now suppose that for x2,j where j=M+2, the hole-generating function returns a value for g(j) that does not equal j. Thus an open hole occurs for j=M+2. Ditto for j=M+3 and j=M+4.
Block [5A] supplies the event sequence {ei} of a source work—which can include MIDI events, audio events, or both—to the Improved Chaotic Mapping of Block [5B]. Note that if the event list consists of more than one musical attribute (e.g., MIDI pitch, onset time, and velocity), each can be varied separately, or together, by applying this exemplary algorithm to one or more axes in 3-space, or to n axes in n-space, e.g., where n axes result from harnessing additional chaotic systems.
The Improved Chaotic Mapping is applied in Block [5B] whenever j=g(j), i.e., at the blackened holes. This results in events ei=g(j) occurring in the same spot in the variation as in the source piece. Thus for this hypothetical example, original events eM, eM+1, eM+5, eM+6 fill the blackened holes E′j=M, E′j=M+1, E′j=M+5, and E′j=M+6 in the variation shown in Block [7].
Block [6A] provides a designated variation procedure which will work in tandem with the Improved Chaotic Mapping of Block [6B] to generate new events Ej whenever j≠g(j), i.e., at the open holes. Thus, for this hypothetical example, new events EM+2, EM+3, and EM+4 fill the open holes and are equivalent to E′j=M+2, E′j=M+3, and E′j=M+4, in the variation shown in Block [7].
Block [7] contains the variation's event sequence {E′j} which comprises the sum of Blocks [5B]+[6B]. The contents of the blackened holes remain unchanged from the source event sequence. The open holes are filled with new events Ej supplied by the Improved Chaotic Mapping in conjunction with a designated variation procedure(s). The variation is produced by merging the contents of the blackened and open holes to give the variation's final event sequence {E′j}={ . . . , eg(M), eg(M+1), EM+2, EM+3, EM+4, eg(M+5), eg(Mα6), . . . }.
In the embodiments discussed above, the hole-generating function of the Improved Chaotic Mapping produces a Hole Template where the number of open vs. blackened holes can vary depending on ICs for chaotic trajectories 1 and 2. In other embodiments, other schemes such as probabilistic methods can produce a Hole Template capable of receiving new events via, for instance, Dynamic Inversion. One inherent advantage of the Improved Chaotic Mapping over a probabilistic scheme lies in the fact that the Improved Chaotic Mapping has several built-in ‘controls’ or ‘sliders’ that determine the amount of variability—all arising from a natural mechanism for variability present in chaotic systems, i.e., the sensitive dependence of chaotic trajectories to initial conditions. Thus, the degree to which the initial conditions chosen for the second chaotic trajectory differ from those assigned to the first trajectory will directly affect the amount of variability present in a variation.
The Bach Prelude in C (from the Well-tempered Clavier, Book I) presents an ideal demonstration piece for the Improved Chaotic Mapping. The first two phrases—measures (mm.) 1-11, representative of the entire Prelude—are shown in
The last pitch event of the source Bach Prelude is a 5-note C major chord. Several methods exist for inverting a chord using Dynamic Inversion: a chord could be inverted about its midpoint or about a prior pitch event; or the chord's midpoint could be inverted about a prior pitch event such that its pitches are transposed with respect to the new midpoint, or the chord's midpoint could be inverted about a prior pitch event and its pitches then inverted about the new midpoint, and so on.
It should be noted that the Improved Chaotic Mapping method can invoke two designated variation procedures simultaneously, one for j=g(j) and another for j≠g(j):
where Ej,a represents an event given by designated variation procedure “a” (denoted by subscript “a”) whenever j=g(j), and Ej,b represents an event given by designated variation procedure “b” (denoted by subscript “b”) whenever j≠g(j).
Parsers for Discrete-Time Signals, Such as MIDI .mid Files
A discrete-time parser is necessary to separate a discrete-time signal, such as a MIDI file score, into a sequence of discrete events so that a variation of the signal can be generated, for example using the Improved Chaotic Mapping method. The “default” parsing of
As a way of offering more flexibility for generating the event list, the Discrete-time Signal (e.g., MIDI file) Parser of
To illustrate how a Timing Template can work in various embodiments in conjunction with the parsed events of
If the events of the variation, however, are stitched together without the Timing Template, i.e., one right after another in the order Event 1, Event 7, Event 3, then Event 7 would be placed immediately following Event 1, at the Start time of the next event following Event 1 in the original score, i.e., at 4|3|000, the Start time of Event 2. In the variation, Event 3 would occur immediately following Event 7. Where Event 3 starts would be determined by taking the difference between the Start time of the next event following Event 7 in the score of the source work, i.e., the Start time of Event 8=21|3|000, and the time of the last note D4 (in the left hand of the piano part) of Event 7=21|2|000. The difference is 21|3|000−21|2|000=1 quarter beat or 480 ticks. This difference is then added to the timing of the last note/chord event (Event 7) in the variation 6|2|000+480 ticks=6|3|000. Thus Event 3 would be placed at 6|3|000 following Event 7, and the Start times of the first three events of the variation would be: 1|3|000 (Event 1), 4|3|000 (Event 7), and 6|3|000 (Event 3).
Parsers for Continuous-Time Signals, Such as Audio .wav Files:
A continuous-time parser is needed to separate a continuous-time signal, e.g, an audio recording, into a sequence of parsed audio events so that an Audio Variation can be generated. Continuous-time signal parsers of the present invention include the Toggle Switch Parser, the ‘Start-times only’ Parser, and the Duration-specific Automatic Parser.
The Toggle Switch Parser
First, a switch key that can toggle—that is, alternate between two states—must be designated on a computer (or on another capable system). The toggle key triggers one of two states: ‘Start time’ S and ‘Finish time’ F, whenever struck. The Start and Finish times of each parse are used to separate an audio recording into a discrete sequence of parsed audio events.
Upon activating the toggle key in Block [1], the Toggle Switch Parser reads the system clock, records the system clock time as the ‘reference clock time’ Rt that will serve as the reference time from which all subsequent times are calculated, and initializes the Start time S1 to 0 seconds and plays the audio file (mp3, .wav, .pcm, .acm, etc.) so that the user can, in real time, initiate the Finish and Start times of each desired parsed event. The Toggle Switch Parser also ascertains whether or not there exists a delay between the input signal, generated by striking the toggle key, and the timing of that signal vis à vis the system clock. A counter is initialized: i=0.
To delineate the Finish time F1 of the first parsed event (S1, F1), the user will again hit the toggle key, after which the parser reads and records the current system clock time, calculates the elapsed time between the reference clock time Rt and the current clock time, minus any delay associated with the toggle key, and records the Finish time F1+i.
To delineate any subsequent parsed events, the user continues to strike the toggle key to record Start S and Finish F times until the last recorded Finish time is greater than or equal to the total time TTOT of the recorded musical piece. These Start S and Finish F times are then applied to the recorded audio file to separate it into discrete audio events upon which the Improved Chaotic Mapping, or any other modification scheme, can operate.
In embodiments, after the Improved Chaotic Mapping method has generated a variation, each event of the variation Ej′ is placed, in sequence, in a Timing Template provided by the Start times of the parsed audio events as delineated in the source piece. That is, each Ej′ is placed at the precise time that the corresponding original event ei=j occurred in the source work. In this way, the Start times of the parsed events of the source piece provide a Timing Template which determines where each of the new events Ej′ comprising the variation will be placed, in sequence. With the Timing Template approach, the variation will be the same duration as the source piece; however, some events of the variation may ‘bleed’ into succeeding events, or silence may occur between two adjoining events, due to one being longer or shorter than the duration allocated by the Timing Template, respectively.
Alternatively, the events {Ej′} of the variation may follow one another back-to-back sequentially, in the order proscribed by the Improved Chaotic Mapping. Here, as with the Timing Template approach, the events of the variation can be stitched together by using audio editing techniques. For instance, logarithmic cross fading can be used to allow the new event to enter into and emerge from a seam over a 60 ms time span, while simultaneously fading out the previous event, also over the same 60 ms time span.
One advantage of the Toggle Switch Parser lies in the ability of the user to selectively discard parts of any audio recording by controlling the Finish time Fi. The considered placement of each Finish time can determine which parts of an audio file are eliminated, as desired, or which parts might be used for Infused Variation, another method discussed in more detail below. It is important to note, however, that the Toggle Switch Parser will always result in data loss.
‘Start-Times Only’ Parser
The ‘Start-times only’ Parser of
Duration-Specific Automatic Parser
It is also possible to more fully automate the ‘Start-times only’ Parser algorithm of
In general, the length of the second phrase (or repetitive beat) will constitute the duration for all the parsed audio events, because the source recording may begin with an upbeat or even a free-form introduction; it may have a glitch at the start (end) of the recording, have a Coda, Cadenza or extemporaneous material at the beginning and/or end.
Instead of simply using a fixed time duration, embodiments of the Duration-specific Automatic Parser can take advantage of state-of-the-art beat detection technology to parse an audio file so that each parse has a specified number of beats.
One application of the Duration-specific Automatic Parser takes advantage of the plethora of 16- and 8-measure phrases in both classical and popular music, many of which can be neatly subdivided into 4-measure groups, which in turn often subdivide into 2-measure units. This specific application is referred to herein as the “4+4 method,” so-named because it relies on the prevalence of 4-measure groups to comprise the usual 8- and 16-measure phrases common to many songs, i.e., 4-measure group+4-measure group=8-measure phrase.
Variations of Audio Recordings Via the Improved Chaotic Mapping
The Improved Chaotic Mapping method can be applied to continuous-time signal inputs as well as discrete-time signal inputs. Variation methods which can be applied to continuous-time signal inputs include the Interval Method and the Scanning Method.
The Interval Method for Making Audio Variations
The Interval Method, in accordance with the Hole Template of the Improved Chaotic Mapping, shifts an event up or down by a given musical interval (such as a minor second) and/or adds notes above or below a given pitch to form a chord. The interval can be either fixed or changeable in accordance with the musical context. Commercially available pitch shifters and detectors can be used to move (transpose) selected events, as well as identify the pitches of a given musical event in order to provide musical context. Such context could help determine other note(s) and/or intervals to be added, e.g., the major, minor, and perfect intervals associated with Western music.
The same timings that constitute the vocal parse can then be applied to the piano part of
The vocal line of the Audio Variation of
The Interval Method described above has certain advantages. For example, song variations can occur which preserve the original sequence of the words in the lyrics. Also, the melodic and harmonic language of recorded songs can change because selected audio events for j≠g(j) can be shifted by a desired interval and/or acquire additional notes. Just as Dynamic Inversion allows multiple reference pitches, the Interval Method can also allow different intervallic shifts and provide them according to the musical context of surrounding events.
The Scanning Method for Making Audio Variations
Whenever the Improved Chaotic Mapping returns an ‘open hole’ (i.e., when j≠g(j)), the Improved Chaotic Mapping can employ the Scanning Method as a designated variation procedure to fill the open hole with an event Ej, according to any of six Scanning Options. Each option can occur with or without ‘banning’. If ‘banning’ is in place, then any substitution event can only be used once in the Audio Variation. If banning is not invoked, then there is no limit to the number of times a substitution event can be used in the Audio Variation.
Three of these options are explained below with respect to
Look Forward Option (LF):
For j≠g(j), the invention will ascertain the duration of the original event ei=j and then scan the sequential list of original events {ei}, starting with the very first event e1, and find the first available ‘same duration’ event, within a tolerance of τ seconds. This becomes the ‘open hole filler’ event Ej that passes the “length test”, i.e., it has the same duration as the original event, within a tolerance τ seconds. It is possible—especially for open holes near the beginning of the input and especially without banning—that the first event that passes the length test will be the original event ei=Ej, with i=j.
Using
Now suppose that banning is in effect for the previous example and that the next open hole occurs for j=4 (i.e., 4≠g(4) for the fourth event of the variation). Since Event 1 has already been used to fill the first hole, the LF method will fill the fourth hole of the variation with Event 4 (e4), which is the first 8-measure (plus upbeat) event it finds, starting from the beginning of the original event sequence {ei}, that has not already served as an ‘open hole filler’—even though substitution event E4 is the same as the original event e4.
Best Match Option:
For j≠g(j), the invention will ascertain the duration of the original event ei=j and then scan the sequential list of all of the original events {ei} in order to find the event that best passes the length test, provided it is not the original event ei=j, and if banning is included provided it has not already been used as a substitution event. The best duration match becomes the ‘open hole filler’ event Ej. If none is available, within a tolerance τ, then the Best Match option can retain the original, or it can vary the original event using any variation technique.
Using
In embodiments the Scanning Method includes an amplitude test, whereby the algorithm looks for a seed event that not only passes the length test but also passes an amplitude test, within a tolerance va. To find a suitable seed event, in some embodiments the algorithm determines the average amplitude of the last part, e.g., the last 100 ms, of the preceding event of the variation, the average amplitude of the initial portion, e.g., the first 100 ms, of any seed event that passes the length test, and selects the seed event whose average amplitude best matches the tail end (e.g., the last 100 ms) of the preceding event of the variation. If no seed event passes the length and amplitude tests, then the algorithm can apply the Interval Method and shift the input file event by a given interval (e.g., a minor 2nd), and/or add a given interval to the input file event, and/or apply any other modification.
Embodiments of the present invention include other Scanning Options, such as Look Backward, where scanned events are found by scanning from the end of the source piece back towards the beginning, Modified Look Backward and Modified Look Forward, where the scan avoids using the original event as a substitution event, and Cater-corner, where scanned events are found in the half of the source piece opposite to where the j≠g(j) event occurs.
Finally, in some embodiments the Scanning Method can scan events outside the event space of the source, for example by scanning a totally different piece or a section of the same piece or composition which is not included in the input. The result will be a variation containing parses from both event spaces, i.e., an Infused Variation, which is discussed below in more detail.
Combining Variation Techniques to Create More Sophisticated Variations of Recordings
To generate the upper vocal line shown in the score representation of the Audio Variation of
where g(j) is assigned the value of the index j of the least x1,j that is greater than or equal to x2,j. Thus eg(j) represents some event ej from the original event sequence {ej} whenever x2,j≦x1,j. But if x2,j>x1,j, then the original event ej is retained in the variation.
The Extended-span Chaotic Mapping employed the following parameters: ICs (1, 1, 1) and (0.5, 1, 1) for the first and second trajectories, respectively, with the mapping applied to the x-values of the two trajectories. The Start time of each parsed audio event of the original event sequence {ej} then served as a starting time placeholder for each ej′ produced by the Extended-span Chaotic Mapping (the Timing Template approach described earlier). Thus, event e1′ is placed in the Audio Variation at the same Start time the original event e1 occurred in the source Aguilera recording; event e2′ is placed in the Audio Variation at the same Start time the original event e2 appeared in the recording of “Beautiful”, and so on.
The upper line of the illustrative score representation of the audio result is marked “ESCM”, indicating that it was generated using the Extended-span Chaotic Mapping. The sequence of events for the upper line of the Audio Variation comprises, in order, the following audio events,
{1 2 3 10 5 10 7 10 9 3 11 1 13 9 15 9 17 20 8 23 24 23 24 25 26},
where changes from the original event sequence {ej} are shown above in bolded numbers. These changes are reflected in
It should be noted that there are silences in the ESCM-generated part of the variation due to some ej′ having shorter durations than the placeholder events ej of the source recording. There are also instances where events are longer in duration than the placeholder events.
Since it is possible to run two (or more) different methods simultaneously on any given recording, the Improved Chaotic Mapping with the Scanning Method (Look Forward+banning option) as a designated variation procedure was also applied to the source recording, shown as the lower line and marked “ICM(LF+b)” in the illustrative score representation of the Audio Variation of
In the resulting Audio Variation, Events 1-14 of the lower line remain unchanged from the source events because in each case, the LF+b option looked for the first unbanned event that passed the length test. For each Event 1-14, that first unbanned event turned out to be the original event. For the fifteenth open hole event of the LF+b variation, both Events 10 and 13 satisfied the length test, but since Event 10 had already been used as the tenth ‘open hole filler’ (and was therefore banned from further use), Event 13 became the fifteenth event of the variation.
The Improved Chaotic Mapping with the LF+b Scanning Option generated the following sequence of events {Ej′} for the lower line of the score representation of the Audio Variation:
{1 2 3 4 5 6 7 8 9 10 11 12 13 14 13 16 15 18 19 20 22 19 24 25 26},
where changes from the original event sequence {ei} are shown above in bolded numbers. These changes are marked by boxed, boldface, capital letters in the “LF+b” line of
The upper and lower lines in the illustrative score representation of the ESCM and ICM(LF+b) Audio Variations combine to give the Audio Variation of the Aguilera recording of
Rhythmic and Rhythmic-Pitch Variations Via the Improved Chaotic Mapping
The Improved Chaotic Mapping in tandem with the above designated variation procedures can make rhythmic, as well as rhythmic-pitch variations of MIDI file scores, by introducing new rhythmic events, and/or new pitch/chord events, that are not present in the source. These new events can be generated by algorithms such as the Rhythmic-reversal Method, the Stenciling Method, and Infused Variation.
Rhythmic-Reversal Method
The Rhythmic-reversal Method encompasses six algorithms: (a) the Basic Rhythmic-reversal Algorithm, (b) the Rhythmic-reversal+Pitch-reversal Algorithm, (c) the Rhythmic-reversal+Pitch-Dynamic Inversion Algorithm, (d) the Rhythmic-reversal+Pitch-inversion Algorithm, (e) the Rhythmic-reversal+Pitch-reversal-Dynamic Inversion Algorithm, and (f) the Rhythmic-reversal+Pitch-reversal-inversion Algorithm.
The Improved Chaotic Mapping next operates on the sequence of N-beat chunk events. Whenever j≠g(j), signifying an open hole ‘receptor’ event in the Hole Template of the Improved Chaotic Mapping, the Basic Rhythmic-reversal Algorithm reverses the rhythms of the receptor N-beat chunk ei=j. But the pitches of the receptor N-beat chunk remain the same. For instance, if j≠g(j) for the first beat of the Gershwin, then the rhythmic pattern of two sixteenths followed by an eighth would be reversed to yield: eighth, sixteenth, sixteenth for that first changed beat. The notes would read C#5=eighth, D5=sixteenth, F4=sixteenth as shown in
There are different ways in various embodiments to reverse the rhythm of any given N-beat chunk. One reversal method of the present invention that can be applied to m. 3 of the Gershwin Prelude (a 2-beat chunk) is to “read back” the rhythmic values from right to left: eighth, eighth rest, sixteenth, sixteenth rest, eighth. The note rhythmic values would be assigned, in order, to the pitches present in m. 3 so that m. 3 of the variation would comprise: eighth=Bb1, eighth rest, sixteenth=the chord Bb2-F3-G3-Bb3-D4-G4, sixteenth rest, eighth=the chord Bb2-F3-G3-Bb3-D4-G4, as shown in
Another reversal method of the present invention applicable to MIDI files is to count the number of ticks separating the onset time of each pitch/chord from the onset time of the pitch/chord that follows it, here referred to as the “count back” approach. The number of ticks is found by ‘looking or counting back’ from a pitch/chord to the pitch/chord that immediately follows it and counting the ticks between them. The MIDI protocol allows 480 ticks per quarter note. Thus, in counting back from the last chord of m. 3 (in the source Gershwin of
Rhythmic reversal can be combined with pitch reversal, so that both the rhythms of the receptor N-beat chunk ei=j, and the pitches of the receptor N-beat chunk are reversed. Suppose that j≠g(j) for a receptor event composed of a 2-beat chunk comprising m. 3 of the Gershwin, then both the rhythm of the measure and the order of the pitches change so that the variation results in: 3|1|000=Bb2-F3-G3-Bb3-D4-G4, 3|1|240=Bb2-F3-G3-Bb3-D4-G4, and 3|2|120=Bb1, assuming a rhythmic reversal based on the “count back” approach, as shown in
In other embodiments, Rhythmic reversal is combined with a pitch Dynamic Inversion algorithm to reverse the rhythm and invert the pitches of the receptor N-beat chunk about a dynamic reference pitch, e.g., the last pitch/chord of the prior chunk.
As an example, suppose that the Gershwin is parsed into 2-beat chunks and the Improved Chaotic Mapping designates the 2-beat chunk comprising measure 2 as an open hole event, i.e., 2≠g(2). Then the rhythm of Event 2 must reverse, as given by the first measure of
As an example of traditional inversion with a fixed reference pitch, again suppose that the Gershwin is parsed into 2-beat chunks and the Improved Chaotic Mapping designates the first 2-beat chunk (m. 1) as an open hole event, i.e., 1≠g(1). Then, in accordance with the Rhythmic-reversal+Pitch-inversion Algorithm, the rhythm of Event 1 (the first 2-beat chunk) must reverse, as given by the first measure of
It is possible in embodiments to take the combination of Rhythmic reversal and Pitch-Dynamic Inversion one step further by using the results of the last measure of
Of course, any embodiment of the Rhythmic-reversal Method can be applied to a series of input events or “chunks” where each chunk comprises a different number of beats, as can be created for example using the Discrete-time Signal Parser of
The Stenciling Method
In embodiments, the Stenciling Method is applicable to an input which has been parsed into events delineated by individual beats. The Improved Chaotic Mapping then operates on the sequence of N-beat chunks. Whenever j≠g(j), signifying an open hole ‘receptor’ event in the Hole Template of the Improved Chaotic Mapping, the N-beat chunk serving as the receptor chunk will be ‘stenciled’ by an appropriate rhythmic pattern selected or generated by human or machine. The Stenciling Method will stamp the note rhythms of the receptor N-beat chunk with the note rhythms of the ‘stenciling’ chunk.
As an example, suppose in making the variation, the Improved Chaotic Mapping returns an open hole for the thirteenth beat of the Gershwin, the first beat of measure 7, shown in
Assuming the source Gershwin Prelude is parsed into a sequence of 1-beat chunks, the Stenciling Method in embodiments takes the MIDI onset times contained in beat 5, i.e., 3|1|000, 3|1|360, and overlays this rhythm like a stencil on receptor beat 13. The MIDI times for the pitched events in beat 13 are Bb1-C#6=7|1|000, D6=7|1|120, F6=7|1|240, F3-Bb3-D4-G4=7|1|360. Those pitches in the receptor chunk (here, beat 13) that are coincident with, or ‘directly touch’, the note values in the stenciling chunk (beat 5) are assigned the stenciling rhythm, i.e., Bb1-C#6=7|1|000 and F3-Bb3-D4-G4=7|1|360.
The Basic Stenciling Algorithm allows only those pitches in beat 13 that are stamped by the note rhythmic values of the stenciling beat 5 (Bb1-C#6=7|1|000 and F3-Bb3-D4-G4=7|1|360) to occur in the variation; the other pitches in beat 13 are discarded (everything between the downbeat and the last 16th of beat 13 disappear in the variation). Clearly, this will change the rhythmic and pitch structure of the piece. Different rhythms can get stamped onto receptor N-beat chunks, while some essence of the pitch and harmonic vocabulary remains. This can be highly desirable as a variation technique, because it achieves a kind of ‘reduction’ or ‘condensation’ of musical material. Yet even if the stenciling chunk contains more pitch/chord events, and hence more note rhythmic values, than the receptor chunk, the rhythm of the receptor chunk can never include more note rhythmic values than it originally possessed.
The Expanded Stenciling Algorithm first applies the Basic Stenciling Algorithm. Both algorithms are treated in
As an example where the stencil contains fewer notes than the receptor, suppose in the Gershwin Prelude beat 5 stencils beat 21. Both beats 5 and 21 are shown in
When applied to MIDI files, the Basic Stenciling Method lays a stenciling chunk on a receptor chunk. Any event overlap showing coincident onset times constitutes a direct hit. Thus the result of beat 5 stamping beat 21 consists of only the chord Bb1-F4 (the result of coincident 3|1|000 and 11|1|000) and the chord E3-F3 (the result of coincident 3|1|360 and 11|1|360).
Now consider what can happen if the stenciling rhythm contains more note values than the receptor beat. Suppose beat 21, the first beat of m. 11 (
But the Expanded Stenciling Algorithm goes further. It imprints the stenciling rhythm of three more 32nd notes plus an eighth note on the receptor chunk. In general, the Expanded Stenciling Algorithm must imprint all the note rhythms present in the stenciling chunk upon the receptor chunk. Since the three extra stenciling 32nds occur in the first half of the stenciling beat 21 and since Bb1 is the only note that sounds in the first half of the receptor beat 5, the algorithm assigns the Bb1 to each of the three 32nd note rhythms, since it is the nearest pitch in the first half of the beat (the “nearest pitch” approach). The first half of beat 5 as it will be heard in the variation is shown in
To implement the stenciling eighth note rhythm located at 11|1|240 (in the second half of the treble part of beat 21) in
As mentioned earlier, the Bb2-F3-G3-Bb3-D4-G4 chord found on the fourth sixteenth of beat 5, by virtue of its being coincident with the last 16th of the stenciling chunk, is retained as is. In the variation, then, this chord will be heard as the last sixteenth of beat 5 in the variation. The final result, shown in
In various embodiments, the Expanded Stenciling Algorithm can be combined with any of several stenciling options which redress the repeated note problem of the “nearest pitch” approach. These options are illustrated in
Option C is the same as Option B, except that any transplanted pitches/chords are inverted about a reference pitch such as middle C.
Option E is similar to Option D, except that transplanted pitches/chords are inverted about the last pitch/chord of the most recent unchanged N-beat chunk of the variation (
Options G, H, and I are all similar to Option F. In Option G, the transplant is not only reversed but also inverted about a fixed reference pitch, e.g., middle C. The reversed-inverted transplant in
If any transplant of Options C-E and G-I contains one or more chords, the chord or chords can be inverted by first inverting the midpoint of the chord and then inverting the chord's pitches about the new midpoint.
As with the Infused Variation Methods described below, the Stenciling Method can stamp one piece with the rhythmic characteristics of another in order to infuse the first piece with rhythmic ideas from the second, or from any other source. The Stenciling Method can be used with the Improved Chaotic Mapping, or with any other method capable of generating receptor and stenciling events, such as a probabilistic scheme.
So far the Stenciling Method has been applied to a sequence of N-beat chunks, where each parse contains the same number of beats as every other parse. In embodiments, the Stenciling Method can be applied to a sequence of n-beat chunks, where n is variable. Thus the number of beats comprising each parse can be customized according to some desirable outcome.
Infused Variation Methods
Embodiments of the present invention enable a user to make Infused Variations by infusing a source piece not only with pitch/chord events from other pieces, but also with rhythmic events from one or more separate works. In embodiments, each time j≠g(j) in the variation process, the invention can supply user- or machine-generated seed event(s) from a work outside the event space of the input file, i.e., outside the original event sequence {ei}. Such an event of the seed source can be selected based on various strategies, e.g., selection of the seed event based on a comparison of what preceded the seed event in the seed source, and what preceded the ‘open hole’ event in the input file.
Moreover, any seed element or sequence of seed elements can be varied using any combination of the variation techniques described above. For example, any seed can be stenciled by the open hole receptor event of the source piece so that the pitch/chord events of the seeds themselves are rhythmically varied, or the pitch/chord events of the input file can vary by infusing or stamping them with the rhythms of a seed file.
It is also possible to use a seed ‘stencil’ to infuse the input file. For example, for N-chunk parsing, the entire input file is overlaid by the entire seed file. Next, the Improved Chaotic Mapping, operating on the input file, produces the open and blackened holes forming the Hole Template. (The Hole Template could also be produced by a probabilistic scheme.) Any open hole N-beat chunk of the input file is stenciled by the N-beat chunk of the seed file that overlays it. Those pitch/chords of the receptor chunk that are not coincident with any pitch/chords of the seed chunk can then be retained, varied, or thrown away. (Or they could be replaced by any transplanted notes from the stencil, according to Options B-I of
In embodiments, Infused Variations can occur even without the Hole Template of the Improved Chaotic Mapping. The process in various embodiments involves overlaying two files, noting all coincident events, and replacing one by the other. Those non-coincident events can either be retained, eliminated, or varied further by the Expanded Stenciling Algorithm, for example using any of the Stenciling Options A-I of
Infused Variation of continuous inputs such as audio files can also be accomplished via the Scanning Method, which can select and implant audio seed events into an input audio file. The first step is to select an input piece and a seed piece and parse both. Whenever j≠g(j), signifying an open hole ‘receptor’ event in the Hole Template produced by the Improved Chaotic Mapping operating on the input file, the seed piece is scanned, according to any of the Scanning Options described above, to find a seed event that can fill the open hole of the input file.
Events from different pieces can be joined using any of various audio tools, such as beat matching technology whereby tempo adjustments are made to the input and/or seed pieces to ensure tempo compatibility; peak normalization of the input and seed files so that they both have the same relative dynamics (i.e., the signal-to-noise ratio remains unchanged from one file to the next); crossfading, including adjustment of the duration of the crossfading and/or placement of the cut point (i.e., the point where one event abuts another); masking techniques such as adding material over a cut point, e.g., inserting looping material or percussive material; equalization exchanges (switching EQs) over the cut point whereby one EQ band is faded out while another is faded in, e.g., turning the treble EQ of one event to half volume while turning the treble EQ of the following event to full volume; eliminating frequencies past a certain threshold, e.g., over the cut point; eliminating (or diminishing) the bass of one event and eliminating (or diminishing) the treble of the following event while employing some crossfading to smooth over the transition; as well as other tools known in the art.
Application of Discrete-Input Variation Methods to Continuous Inputs
In embodiments, the discrete-input variation methods described herein can be applied to continuous inputs by converting the continuous input to a discrete input. State-of-the-art audio technology such as audio-to-MIDI converters exist or can be implemented, as for instance described by Carabias-Orti, et al. (J. J. Carabias-Orti, P. Vera-Candeas, F. J. Cañadas-Quesada, and N. Ruiz-Reyes. “Music Scene-Adaptive Harmonic Dictionary for Unsupervised Note-Event Detection,” IEEE Transactions on Audio, Speech, and Language Processing 18 (3), 473-486 (2010)).
To produce the “Paparazzi” variation of
The block diagram of
The Audio Vocal Track [1] enters an Audio-to-MIDI converter [2], which identifies the individual pitches and their onset times in the mp3 vocal track recording. The pitch sequence {pi} and onset time sequence {ni} are provided to [3]. If no pitch detector is available, the mp3 vocal track pitch sequence {pi} can be produced by hand, either by using an existing score and/or by ear.
Used in conjunction with one or more designated variation procedures [4], the Improved Chaotic Mapping [5] and/or another modification scheme operates on the pitch sequence {pi} of the source song and produces the pitch sequence of the variation {p′j} [6].
The MidiTime-to-AudioTime converter [7] converts the sequence of MIDI onset times {ni} to audio equivalent times {ti=j}. Two indices are initialized to i=1 and j=1 [8]. Since some of the p′j correspond to changed pitches in the source audio vocal track while other p′j denote unchanged pitches from the source audio vocal track, the pi and p′j are compared [9]. If pi=p′j, for i=j, then the algorithm does nothing to the Audio Vocal Track [1], but if pi≠p′j, a state-of-the-art pitch detector locates the pitch pi occurring at audio time ti=j, and a pitch shifter is then employed to pitch shift pi to p′j in the Audio Vocal Track including any additional equalization adjustments to the shifted pitch [10]. Equalization adjustment may be necessary to eliminate any unwanted background noise that was pitch shifted as well. The indices are incremented [11] and the process continues, until the end-of-file has been reached.
When the end-of-file is reached [12], an Audio Vocal Track exists with a pitch sequence that differs from the original [13]. State-of-the-art audio technology is used [14] to smoothly stitch (meld) each changed pitch to the previous and succeeding pitches in the modified Audio Vocal Track [13]. The audio edits can include short crossfades, changes in the durations of neighboring notes, adjustments to pitch intonation and vibrato, and adjustments to front and back ends of pitch events. After the stitching process, the algorithm produces the completed variation of the source piece [15], which is sent to the mixer [16], amplifier [17], and speakers [18] so that it can be heard.
Given state-of-the-art audio technology—including but not limited to Audio-to-MIDI conversion, MidiTime-to-AudioTime conversion, as well as operators capable of identifying, accessing, moving, elongating/shortening, removing, shifting, and detecting duration of individual note events in an audio file—and the block diagram of
These results enabling Pitch, Rhythmic-pitch, and Infused Variations can further extend to other forms of continuous-time signals, e.g., film, video, spoken word art, video games, prose, poetry, image, and so on, as can all other variation techniques discussed herein, such as the Improved Chaotic Mapping, the Extended-span Chaotic Mapping, Dynamic Inversion, the Interval Method, Scanning Method, Rhythmic-reversal Method, and the Stenciling Method. All of the above can be adjusted to suit non-Western styles as well.
With reference to
With reference to
Music variations created using the present invention can be uploaded to websites, including social media sites and digital music services such as Pandora and Spotify. In embodiments, a website can be programmed to generate and play a new variation of a composition each time the website is visited. Similarly, a mobile app can implement the present invention to change specific songs on a playlist, e.g., to rejuvenate a play list or create playlists where the original songs are varied so that they can change from one hearing to the next.
Music variations can also be combined with graphical works such as photographs or videos. For example, with reference to
With reference to
The present invention can enhance social interactions in other ways. For example, with reference to
In other embodiments of the present invention, an e-card hosting website enables a sender to produce a unique and personal composition 2502 created using the present invention, to be played when the card is opened. The sender can further include a recording of his or her own voice, or of a synthesized voice, or of another acoustic or electronic input, to further personalize the message. For synthesized voices, the invention enables the user to vary the synthesized voice track, thereby changing the text, pitch or speed of the voice, etc.
With reference to
With reference to
Video games typically feature sound tracks that accompany the actions of a hero, heroine, or the user. For example, theme music is often associated with actions of the hero or even with the user. With reference to
In embodiments, the present invention is implemented in a small electronic chip that can be included in a larger item, such as a hand-held device (smart phone, iPod, iPad, tablet, etc), an MP3 player, a greeting card, or wearable technology such as a smart watch, smartband, or wearable computer. In some of these embodiments, the item is thereby enabled to play variations of music stored therein. For example, the chip can be instructed to create a new variant of a composition every time it is played on an MP3 player. With reference to
The foregoing description of the embodiments of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Many modifications and variations are possible in light of this disclosure. It is intended that the scope of the invention be limited not by this detailed description, but rather by the claims appended hereto.
This application is a continuation in part of U.S. patent application Ser. No. 13/192,380, filed on Jul. 27, 2011. Application Ser. No. 13/192,380 claims the benefit of U.S. Provisional Application No. 61/367,897, filed Jul. 27, 2010, and U.S. Provisional Application No. 61/512,186, filed Jul. 27, 2011. All of these applications are herein incorporated by reference in their entirety for all purposes.
Number | Name | Date | Kind |
---|---|---|---|
5003860 | Minamitaka | Apr 1991 | A |
5281754 | Farrett et al. | Jan 1994 | A |
5331112 | Sato et al. | Jul 1994 | A |
5371854 | Kramer | Dec 1994 | A |
5418323 | Kohonen | May 1995 | A |
5606144 | Dabby | Feb 1997 | A |
6028262 | Minamitaka | Feb 2000 | A |
6137045 | Short et al. | Oct 2000 | A |
6177624 | Takahashi et al. | Jan 2001 | B1 |
7034217 | Pachet | Apr 2006 | B2 |
7135635 | Childs et al. | Nov 2006 | B2 |
7193148 | Cremer et al. | Mar 2007 | B2 |
7498504 | Bourgeois | Mar 2009 | B2 |
7560636 | Wolfram | Jul 2009 | B2 |
7629528 | Childs, Jr. et al. | Dec 2009 | B2 |
7840608 | Kurzweil et al. | Nov 2010 | B2 |
8035022 | Wolfram | Oct 2011 | B2 |
Entry |
---|
Lorenz, E.N., Deterministic nonperiodic flow, J. Atmos. Sci., 1963, vol. 20, pp. 130-141. |
Dabby, D.S., Musical Variations from a Chaotic Mapping, 1995 MIT doctoral thesis, 162 pgs. |
Dabby, D.S., Musical Variations from a Chaotic Mapping, Chaos, 1996, 13 pgs. |
Dabby, D.S. Dabby, A chaotic mapping for music and image variations, Proc. Fourth Int'l. Chaos Conference, 1998, 12 pgs. |
Dabby, D.S. Dabby, Creating Musical Variation, Science, 2008, 2 pgs. |
Carabias-Orti, J. J. et al., Music Scene-Adaptive Harmonic Dictionary for Unsupervised Note-Event Detection, IEEE Transactions on Audio, Speech, and Language Processing 18 (3), 2010, pp. 473-486. |
Number | Date | Country | |
---|---|---|---|
61367897 | Jul 2010 | US | |
61512186 | Jul 2011 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13192380 | Jul 2011 | US |
Child | 14264612 | US |