The present disclosure relates generally to audio samples and more particularly to selecting audio samples in response to musical stimuli.
Digital audio workstations (DAWs) can provide users with the ability to record, edit, and play back digital audio. For instance, many DAWs include a sampling functionality wherein a user can create a musical composition by arranging audio samples using a graphical user interface (GUI) and/or MIDI controller (e.g., a keyboard). Audio samples can simulate the sound of a real musical instrument, and thus playing back an arrangement of such audio samples can simulate a live musical performance.
In some situations, the playback of audio samples may fail to accurately simulate the experience of listening to a real musical instrument. For instance, playing a note on a musical instrument with a decaying sound pattern, such as a cymbal, piano, guitar, and the like, can result in the instrument having a certain amount of excitation. Due to this excitation, playing a subsequent note can produce a different sound pattern with greater excitation as compared to the excitation produced by the initial note played when the instrument was “at rest.” The playback of audio samples corresponding to such instruments may not accurately reflect differences in the excitation state.
As another example, when the same note is played repetitively on a real musical instrument, the resulting sound pattern will include some variation in audio characteristics for each repeated note, such as timbral and tonal differences. The repeated playback of an audio sample to simulate repetitive notes may sound artificial to a listener due to the lack of variation in such audio characteristics.
Certain embodiments of the invention are directed to selecting audio samples in response to musical stimuli.
Certain embodiments are described that provide for selecting an audio sample based on the excitation state of an instrument. In some embodiments, a musical stimulus can be received by a computing device. The musical stimulus may correspond to a musical instrument that produces a decaying audio pattern. A current excitation level associated with previously received stimuli can be calculated. An audio sample corresponding to the received musical stimulus can be selected, the audio sample being selected using the current excitation level associated with the previously received musical stimuli. The selected audio sample can be played back.
In some embodiments, the selected audio sample can be one of a plurality of audio samples corresponding to the received musical stimulus, the plurality of audio samples corresponding to different excitation levels. In some embodiments, a velocity level of the received musical stimulus can be determined, and the plurality of audio samples (including the selected sample) may correspond to the determined velocity level.
In some embodiments, the current excitation level associated with the previously received stimuli can be compared with threshold levels assigned to the plurality of audio samples. In such embodiments, selecting the audio sample can include determining that the current excitation level exceeds a threshold level assigned to the selected audio sample.
In some embodiments, calculating the current excitation level may include determining individual excitation levels associated with the previously received stimuli, and summing the individual excitation levels to generate the current excitation level. In some embodiments, the individual excitation levels can be determined based upon the individual volume levels of the previously received musical stimuli.
In some embodiments, the received musical stimulus can be a first musical stimulus and the selected audio sample a first audio sample. The first musical stimulus can be identified as a previously received musical stimulus, and a second musical stimulus can be received. A current excitation level associated with the previously received musical stimuli including the first musical stimulus can be calculated. A second audio sample corresponding to the received second musical stimulus can be selected, the second audio sample being selected using the current excitation level associated with the previously received musical stimuli including the first musical stimulus. The selected second audio sample can be played back. In some embodiments, the second audio sample can correspond to a different excitation level than the first audio sample. In some embodiments, playback of the first audio sample can be ended when the playback of the second audio sample is initiated.
Certain embodiments are further described that provide for selecting audio samples having different velocity levels in response to repeated musical stimuli. In some embodiments, a first instance of a musical stimulus having a first velocity can be received by a computing device. A first audio sample corresponding to the first velocity level of the received musical stimulus can be played back. The first audio sample can be one of a plurality of audio samples that correspond to different velocity levels of the musical stimulus. A second instance of the musical stimulus having the first velocity level can be received. A second audio sample corresponding to a second velocity level of the received musical stimulus can be selected from the plurality of audio samples. The second audio sample can be played back, and may include different audio characteristics than the first audio sample. In some embodiments, the different audio characteristics can include different tonal characteristics.
In some embodiments, the first velocity level can correspond to a first volume level and the second velocity level can correspond to a second volume level. In such embodiments, playing back the second audio sample can include modifying the second volume level. In some embodiments, modifying the second volume level can include scaling the second volume level in accordance with the first volume level. In some embodiments, the first and second velocity levels can be adjacent velocity levels.
In some embodiments, it can be determined that the first and second instances of the musical stimulus are received consecutively. In some embodiments, a time interval between the first and second instances of the musical stimulus can be measured and compared to a threshold time interval. The measured time interval can be determined to be within the threshold time interval.
In the following description, for the purposes of explanation, specific details are set forth in order to provide a thorough understanding of embodiments of the invention. However, it will be apparent that various embodiments may be practiced without these specific details.
Certain embodiments of the invention are directed to selecting audio samples in response to musical stimuli. For instance, certain embodiments are described that provide for selecting an audio sample based on the excitation state of an instrument. As a non-liming example, audio samples corresponding to an instrument (e.g., a cymbal) can be recorded at various excitation levels. For instance, samples can be recorded of a cymbal being hit at rest, a cymbal being hit following two previous hits, a cymbal being hit following four previous hits, etc. Such samples can be recorded and stored for multiple velocity levels of the simulated instrument. When a sequence of musical stimuli is received in the context of a DAW arrangement and/or from a MIDI controller, the corresponding excitation state of the instrument can be calculated. For instance, if a musical stimulus for an instrument (e.g., a cymbal hit) is received when one or more previously received stimuli for the instrument (e.g., previous cymbal hits) are currently being played back, the individual excitation levels of the previously received stimuli can be summed. In some embodiments, the sum of the current excitation levels can be approximated by summing the individual volume levels of the previously received stimuli. Using the calculated excitation level and the intensity (e.g., the velocity level) of the instant musical stimulus, an audio sample can be selected for the instant musical stimulus that reflects the current excitation state of the simulated instrument.
In embodiments of the invention, audio samples selected based on the excitation state of an instrument may correspond to any simulated instrument capable of excitation. For instance, exemplary instruments can include a drum kit including various pieces or components (e.g., a ride cymbal, crash cymbal, hi-hat, bass drum, one or more toms, snare, etc.), other percussion instruments (e.g., a gong, bell, etc.), a stringed instrument (e.g., a guitar, bass, piano, etc.), or any other suitable instrument capable of excitation.
By storing audio samples of varying excitation levels, calculating the current excitation state of a simulated instrument, and selecting an audio sample accordingly, the excitation behavior of a real instrument can be reproduced. Thus, a more natural sounding and realistic simulation of a live performance using a real musical instrument can be provided.
Certain embodiments are further described that provide for selecting audio samples having different velocity levels in response to repeated musical stimuli. As a non-limiting example, audios samples corresponding to an instrument (e.g., a guitar) playing a particular note can be recorded at various “velocity” levels (e.g., the speed or force with which a note has been struck). When a musical stimulus is received (e.g., a guitar note with a particular velocity) in the context of a DAW arrangement and/or from a MIDI controller, an audio sample corresponding to the velocity level of the guitar can be selected and played back. If the same musical stimulus is received again (e.g., a repetition of the guitar note with the same velocity), in some embodiments, an audio sample corresponding to a different velocity level (e.g., the next higher or lower velocity) can be selected and played back.
Since audio samples corresponding to different velocity levels may be associated with different output volume levels, in some embodiments, the output volume level of the second audio sample can be scaled up or down to “match” the volume level of the first audio sample. In embodiments of the invention, audio samples selected in response to repeated musical stimuli may correspond to any suitable musical instrument.
Audio samples corresponding to different velocity levels may include differences in tone and/or timbre. Thus, by selecting audio samples for playback with different velocity levels, variations can be introduced into an arrangement or performance in which repeated notes are played. Such variation may provide for a more natural sounding and realistic simulation of a live performance using a real musical instrument.
It should be appreciated that system 100 depicted in
UI subsystem 102 may provide an interface that allows a user to interact with system 100. UI subsystem 102 may output information to the user. For instance, UI subsystem 102 may include a display device such as a monitor or a screen, and an audio output device such as a speaker. UI subsystem 102 may also enable the user to provide inputs to system 100. In some embodiments, UI subsystem 102 may include a touch-sensitive interface (i.e. a touchscreen) that can both display information to a user and also receive inputs from the user. For instance, in some embodiments, UI subsystem 102 can receive touch input from a user. Such touch input may correspond to one or more gestures, such as a drag, swipe, pinch, flick, single-tap, double-tap, rotation, multi-touch gesture, and/or the like. In some embodiments, UI subsystem 102 may include one or more input devices that allow a user to provide inputs to system 100 such as, without limitation, a mouse, a pointer, a keyboard, or other input device. In certain embodiments, UI subsystem 102 may further include a microphone (e.g., an integrated microphone or an external microphone communicatively coupled to system 100) and voice recognition circuitry configured to facilitate audio-to-text translation and to translate audio input provided by the user into commands that cause system 100 to perform various functions. In some embodiments, UI subsystem 102 may further include eye gaze circuitry configured to translate eye gaze input provided by the user into commands that cause system 100 to perform various functions.
Memory subsystem 106 may be configured to store data and instructions used by some embodiments of the invention. In some embodiments, memory subsystem 106 may include volatile memory such as random access memory or RAM (sometimes referred to as system memory). Instructions or code or programs that are executed by one or more processors of system 100 may be stored in the RAM. Memory subsystem 106 may also include non-volatile memory such as one or more storage disks or devices, flash memory, or other non-volatile memory devices. In some embodiments, memory subsystem 106 can store arrangement data 108, sample selection parameters 110, and audio samples 112.
Audio samples 112 stored in memory subsystem 106 can correspond to one or more simulated musical instruments. In some embodiments, one or more of audio samples 112 can be a digital recording of a real instrument being played live. Audio samples 112 can be in any suitable audio format. For instance, in embodiments of the invention, one or more of audio samples 112 can be in an uncompressed format (e.g., AIFF, WAV, AU, etc.), lossless compression format (e.g., M4A, MPEG-4 SLS, WMA Lossless, etc.), lossy compression format (e.g., MP3, AAC, WMA lossy, etc.), or any other suitable audio format.
Arrangement data 108 stored in memory subsystem 106 can describe arrangements including one or more of audio samples 112. For instance, in some embodiments, a user can create a musical arrangement by arranging a plurality of audio samples 112 within various tracks or channels using a graphical user interface (GUI) associated with a DAW executed by system 100. Arrangement data 108 can identify which of audio samples 112 are included in an arrangement. In some embodiments, arrangement data 118 can further identify the tracks and temporal positions (e.g., zones) to which audio samples have been assigned within the arrangement, relationships between audio samples (e.g., groupings of drum kit components), effects applied to audio samples in the arrangement (e.g., reverb, chorus, compression, distortion, filtering, etc.), and other parameters of audio samples include in the arrangement, such as velocity, volume level, pitch, octave, and the like.
In some embodiments, system 100 may include an interface (not shown) to communicate with an external controller (e.g., a MIDI controller). For instance, such a controller can be used to trigger the playback of one or more of audio samples 112 and/or to arrange one or more of audio samples 112 in an arrangement. In such embodiments, the arrangement and/or a record of the triggered audio samples can be stored in arrangement data 108.
Sample selection parameters 110 can include various parameters used to select one or more of audio sample 112 for playback. For instance, in the case of musical stimuli corresponding to a simulated instrument capable of excitation, in some embodiments, sample selection parameters 110 can include one or more threshold values used to select an audio sample corresponding to a particular excitation for playback. In some further embodiments, in the case of repeated musical stimuli, sample selection parameters 110 can include one or more rules regarding the selection of audio samples corresponding to varying velocity levels, threshold values used to determine whether velocity variations are to be introduced, and other parameters used for audio sample selection.
In some embodiments, system 100 may be part of a computing device. For instance, the computing device can be a desktop computer or a mobile computing device such as a laptop computer, tablet computer, smart phone, media player, and the like. In some embodiments, memory subsystem 106 may be part of the computing device. In some other embodiments, all or part of memory subsystem 106 may be part of one or more remote server computers (e.g., web-based servers accessible via the Internet or other network).
In some embodiments, UI subsystem 102, playback subsystem 104, memory subsystem 106, sample selection subsystem 114, and excitation determination subsystem 116, working in cooperation, may be responsible for selecting an audio sample for playback based on the excitation state of an instrument. For instance, input provided by a user can be received at playback subsystem 104 from UI subsystem 102. In some embodiments, the input may correspond to an instruction to playback an arrangement of audio samples.
Upon receipt of the input, playback subsystem 104 can begin playing back the arrangement of audio samples in accordance with arrangement data 108 stored in memory subsystem 106. During the playback, musical stimuli can be received (e.g., within the arrangement and/or from an external controller) that corresponds to a simulated instrument capable of excitation. For instance, the particular arrangement stored in arrangement data 108 may include a drum track including a number of cymbal notes. When an audio sample corresponding to a cymbal note is reached during playback of the arrangement, sample selection subsystem 114, working in cooperation with excitation determination subsystem 116, can select an audio sample having the appropriate excitation for playback. For instance, excitation subsystem 116 can calculate the current excitation level of the simulated instrument (e.g., the cymbal) which can be used to identify the appropriate sample for playback.
In some embodiments, to determine the current excitation state or the instrument, excitation determination subsystem 116 can calculate the current excitation level associated with previously received musical stimuli corresponding to the instrument. For instance, if there are one or more currently playing audio samples that correspond to previously received stimuli (e.g., previous cymbal notes), excitation determination subsystem 116 can sum the individual excitation levels of each stimuli. In some embodiments, the sum of the individual excitation levels can be approximated by summing the individual volume levels of the currently playing audio samples.
Sample selection subsystem 114 can use the current excitation level calculated by excitation determination subsystem 116 in combination with selection parameters stored in sample selection parameters 110 to select the appropriate audio sample for playback. For instance, in some embodiments, sample selection subsystem 114 can compare the current excitation level of the instrument with threshold values stored in sample selection parameters 110. The threshold values can correspond to audio samples having different excitation levels as stored in audio samples 112. In some embodiments, sample selection parameters 110 may include a distinct set of threshold values for audios samples corresponding to one or more velocity levels of the instrument. Thus, in such embodiments, sample selection subsystem 114 can retrieve the threshold values that specifically correspond to the velocity level of the received musical stimulus (e.g., the velocity of the cymbal note) from sample selection parameters 110. In some embodiments, sample selection subsystem 114 can analyze the threshold values to determine which, if any, of the threshold values are exceeded by (or, in some embodiments, equal to) the calculated excitation level of the instrument. In some embodiments, sample selection subsystem can select an audio sample that corresponds to the highest threshold level that is exceeded (or met) by the current excitation level of the instrument.
Sample selection subsystem 114 can retrieve the selected audio sample from audio samples 112 stored in memory subsystem 106. Playback subsystem 104 can then utilize an audio output device (e.g., a speaker) of UI subsystem 102 to playback the selected audio sample corresponding to the simulated excitation level.
In some embodiments, UI subsystem 102, playback subsystem 104, memory subsystem 106, sample selection subsystem 114, and volume level matching subsystem 118, working in cooperation, may be responsible for selecting audio samples corresponding to different velocity levels in response to repeated musical stimuli. For instance, input provided by a user can be received at playback subsystem 104 from UI subsystem 102. In some embodiments, the input may correspond to an instruction to playback an arrangement of audio samples.
Upon receipt of the input, playback subsystem 104 can begin playing back the arrangement of audio samples in accordance with arrangement data 108 stored in memory subsystem 106. During the playback, repeated stimuli corresponding to the same instrument can be received (e.g., within the arrangement and/or from an external controller). For instance, the particular arrangement stored in arrangement data 108 may include repeated guitar notes having the same velocity level. When an initial stimulus (e.g., the first instance of the guitar note) is received, in some embodiments, sample selection subsystem 114 can retrieve an audio sample associated with the particular velocity level from audio samples 112 for playback. When a subsequent stimulus (e.g., the second instance of the guitar note) having the particular velocity level is received, in some embodiments, sample selection subsystem 114 can retrieve an audio sample associated with a different velocity level. For instance, sample selection subsystem 114 can select an audio sample associated with a higher or lower velocity level than that of the received stimuli.
In some embodiments, sample selection parameters 110 can include rules used by sample selection subsystem 114 to select the appropriate audio sample for playback. For instance, in some embodiments, sample selection parameters 110 may include threshold time intervals that determine whether an audio sample associated with a different velocity level is to be selected. In such embodiments, the time interval between the first and second instances of the musical stimulus can be compared to the threshold time intervals. If the time interval between the stimuli exceeds (or, in some embodiments, meets) a threshold time interval, in some embodiments, sample selection subsystem 114 can instead select the same audio sample for playback that was selected in response to the initial stimulus. If an audio sample associated with a different velocity is to be selected (e.g., if the threshold time interval is not exceeded), sample selection parameters 110 can further include rules governing whether an audio sample associated with a higher or lower velocity level is to be selected, how much higher or lower the velocity level of the sample will be, in addition to which audio samples to select in response to a 3rd instance, 4th instance, etc., of the musical stimulus.
If an audio sample associated with a different velocity level than that of the initial stimulus is selected, the volume level of the audio sample may be higher or lower than that of the initial audio sample. In some embodiments, volume level matching subsystem 118 can “scale” the volume of the subsequent audio sample to more closely “match” the volume level of the initial sample. For instance, if an audio sample associated with a higher velocity is selected in response to the subsequent stimulus, volume level matching subsystem 118 can reduce the volume of the audio sample for playback. Similarly, if an audio sample associated with a lower velocity is selected in response to the subsequent stimulus, volume level matching subsystem 118 can increase the volume level of the audio sample for playback to more closely match the volume level of the initial audio sample. In some embodiments, volume level matching can be accomplished by increasing or reducing the overall volume level of the selected audio sample until its peak level (e.g., the point on the audio waveform with the highest amplitude) is approximately equal to that of the initial audio sample. In some embodiments, a reduction or increase in volume level can be uniform across the audio sample. In some other embodiments, changes in volume level can be applied differently across the audio sample (e.g., by applying different scaling parameters to the “attack” and “tail” portions of the waveform).
In some embodiments, after modification of the volume level by volume level matching subsystem 118, playback subsystem 104 can utilize an audio output device (e.g., a speaker) of UI subsystem 102 to playback the selected audio sample associated with the different velocity level.
In the examples described above in regards to system 100 shown in
System 100 depicted in
Network 1406 may include one or more communication networks, which can be the Internet, a local area network (LAN), a wide area network (WAN), a wireless or wired network, an Intranet, a private network, a public network, a switched network, or any other suitable communication network. Network 1406 may include many interconnected systems and communication links including but not restricted to hardwire links, optical links, satellite or other wireless communications links, wave propagation links, or any other ways for communication of information. Various communication protocols may be used to facilitate communication of information via network 1406, including but not restricted to TCP/IP, HTTP protocols, extensible markup language (XML), wireless application protocol (WAP), protocols under development by industry standard organizations, vendor-specific protocols, customized protocols, and others.
In the configuration depicted in
In some embodiments, during playback, repeated stimuli corresponding to the same instrument can be received (e.g., in an arrangement and/or from an external controller). As described above with respect to system 100 shown in
In the configuration depicted in
It should be appreciated that various different distributed system configurations are possible, which may be different from distributed system 1400 depicted in
As described herein, certain embodiments of the invention are directed to selecting an audio sample based on the excitation state of an instrument and, in some embodiments, the audio samples may correspond to a simulated instrument that is capable of excitation. In some embodiments, such instruments may produce a sound pattern (e.g., a waveform) that decays over time. For instance, such a sound pattern can include an initial “attack” portion having high amplitude sound levels followed by a “tail” portion having sound levels that decay over time. In some embodiments, the excitation caused by playing multiple notes of an instrument may correspond to a superposition of the decaying waveforms associated with the individual notes.
As further depicted in
In some embodiments, audio samples 200 may correspond to a particular velocity level of the instrument. In some embodiments, a distinct set of audio samples corresponding to different excitation levels can be recorded for one or more velocity levels of an instrument. In some embodiments, one or more of audio samples 200 may each be associated with a threshold level. As described in further detail below, such threshold levels can be used to determine which of audio samples 200 best simulates the current excitation state of the corresponding instrument. In some embodiments, threshold levels can be stored separately from audio samples 200, such as in sample selection parameters 110 in system 100 shown in
In
In
Upon determining the current excitation level, the computing device can select an appropriate audio sample that reflects the excitation state of the instrument. For instance, as depicted in
In
In some embodiments, to select the appropriate audio sample for playback, the calculated excitation level (e.g., the summed volume level) of previously received stimuli can be compared to threshold levels. In the example shown in
In
The summed volume levels can be compared to threshold levels assigned to audio samples corresponding to the simulated instrument (e.g., the ride cymbal). In some embodiments, the computing device can select an audio sample for playback with the highest threshold level that is met or exceeded by the calculated excitation level (e.g., the summed volume levels of audio samples 200(a), 200(b)). In the example shown in
As illustrated in
In
In response to musical stimulus 606, the computing device can calculate the current excitation level of the previously received ride cymbal notes that are currently being played. In
The volume level of audio sample 200(b) can be compared to threshold levels assigned to audio samples corresponding to the simulated instrument (e.g., the ride cymbal). In this example, the computing device may determine that the threshold level assigned to audio sample 200(b) is the highest threshold met or exceeded by the current excitation level. Thus, the current excitation level may be insufficient to trigger audio sample 200(c) in this example since the spacing between musical stimuli 404 and 406 has allowed playback of audio sample 200(a) to end. By again selecting audio sample 200(b), the computing device can simulate the excitation of the ride cymbal caused by musical stimulus 606 being received when the ride cymbal excitation caused by musical stimulus 604 but not musical stimulus 602. Upon selection, the computing device may playback audio sample 200(b).
In
In response to musical stimulus 706, the computing device can determine that the current excitation level (e.g., the current volume level) of previously received stimuli is zero. Based on the calculation, the computing device can select the audio sample with an excitation level corresponding to the ride cymbal being at rest. For instance, as depicted in
In
In response to musical stimulus 804, the computing device can determine that the current excitation level of the previously received stimuli (e.g., musical stimuli 802) is zero. Based on the calculation, the computing device can select the audio sample with an excitation level corresponding to a ride cymbal being played when the cymbal is at rest. Thus, as depicted in
In some embodiments, as described above, a set of audio samples may correspond to a particular velocity level, and sets of audio samples can be recorded for a plurality of velocity levels of an instrument. Moreover, in some embodiments, threshold levels assigned to a set of audio samples corresponding to a particular velocity level may be different than the threshold levels assigned to a set of audio samples corresponding to a different velocity level. Thus, in response to a musical stimulus, the computing device may determine the velocity level of the stimulus to identify the appropriate set of threshold values to analyze. If a stimulus corresponding to a low velocity level is followed by a stimulus corresponding to a high velocity level, the excitation caused by the initial stimulus may be small or even insignificant in comparison to the excitation caused by the subsequent stimulus. Thus, when the current excitation caused by the low velocity stimulus are compared to threshold values corresponding to the high velocity stimulus, a high velocity audio sample corresponding to the instrument being at rest may be selected. Similarly, if a stimulus corresponding to a high velocity level is followed by a stimulus corresponding to a low velocity level, the excitation caused by the subsequent stimulus may be small or insignificant in comparison to the excitation caused by the initial stimulus. When the current excitation caused by the high velocity stimulus is compared to threshold values corresponding to the low velocity stimulus, a low velocity audio sample corresponding to the instrument being at a high level of excitation may be selected. However, the selected audio sample may be associated with the same or a lower excitation level than that played back in response to the initial high velocity stimulus.
In some embodiments, a simulated instrument can include different regions or components that are associated with independent excitation levels. For instance, an actual cymbal can include a “bell” portion and an “outer” portion that produce different sound patterns when played, and that generate excitation energy independently. The bell portion of a cymbal can have a high level of excitation energy when the outer portion has a low or negligible level of excitation energy, and vice versa. In some embodiments, in response to stimuli corresponding to playing notes on such instruments, the excitation of different regions or components can be calculated independently and appropriate audio samples selected accordingly.
In some embodiments, audio samples can be played back in a simultaneous format. For instance, a musical stimulus can be received during playback of an audio sample corresponding to a particular excitation. In some embodiments, when an audio sample is selected and played back for the instant stimulus, the playback of the previous audio sample may continue. Thus, the audio samples corresponding to the first and second stimuli can be played back simultaneously using different tracks or channels. Similarly, if a third stimulus is received, a selected audio sample can be played back simultaneous with the decaying first and second audio samples. In some embodiments, a threshold number of simultaneous audio samples can be played back. For instance, in some embodiments, if a fourth stimulus is received, a selected audio sample may be played back simultaneous with the decaying second and third audio samples, but playback of the first audio sample can be terminated.
As illustrated in
At step 904, a current excitation level associated with previously received musical stimuli can be calculated. For instance, the individual excitation levels associated with previously received musical stimuli corresponding to the same instrument can be determined and summed to generate the current excitation level. In some embodiments, the individual excitation levels can be determined based upon the individual volume levels of the previously received stimuli as currently being played back.
At step 906, an audio sample corresponding to the received music stimulus can be selected, the audio sample being selected using the calculated current excitation level associated with the previously received musical stimuli. For instance, in some embodiments, the current excitation level associated with the previously received stimuli can be compared with threshold levels assigned to a plurality of audio samples corresponding to the received musical stimulus, the plurality of audio samples corresponding to different excitation levels. In such embodiments, at step 906, selecting the audio sample can include determining that the current excitation level exceeds a threshold level assigned to the selected audio sample. In some embodiments, a velocity level of the received musical stimulus can be determined. In such embodiments, the plurality of audio samples, including the selected sample, may correspond to the determined velocity level. At step 908, the selected audio sample can be played back.
In some embodiments, upon initiating playback, the first musical stimulus can be identified as a previously received musical stimulus, and a second musical stimulus can be received. The current excitation level associated with the previously received musical stimuli including the first musical stimulus can be calculated. An audio sample corresponding to the received second musical stimulus can be selected, the second audio sample being selected using the current excitation level associated with the previously received musical stimulus including the first musical stimulus. The selected second audio sample can be played back. In some embodiments, the second audio sample can correspond to a different excitation level than the first audio sample. In some embodiments, the playback of the first audio sample can continue when the second audio sample is played back. In some embodiments, playback of the first audio sample can end when the playback of the second audio sample is initiated.
By storing audio samples of varying excitation levels, calculating the current excitation behavior of a simulated instrument, and selecting an audio sample accordingly, the excitation state of a real instrument can be reproduced. Thus, a more natural sounding and realistic simulation of a live performance using a real musical instrument can be provided.
As described herein, certain embodiments are further described that provide for selecting audio sample having different velocity levels in response to repeated musical stimuli. In some embodiments, audio samples corresponding to an instrument playing a particular note may be recorded at various “velocity” levels (e.g., the speed or force with which a note has been struck). For instance, in some embodiments, a MIDI format supports 127 different velocity levels (e.g., 1 to 127). In various embodiments, the recorded audio samples may correspond to any suitable instrument. For instance, audio samples having different velocity levels that are selected in response to repeated musical stimuli may correspond to instruments capable of excitation (as described herein), instruments capable of a small amount of excitation or incapable of excitation (e.g., a wood block, closed hi-hat, etc.), or any other suitable musical instrument. In various embodiments, the audio samples can be in any suitable audio format such as uncompressed formats, lossless compression formats, lossy compression formats, or any other suitable audio format.
In
In
In response to the first instance of the musical stimulus, the computing device can select and playback an audio sample that corresponds to the same note and velocity as the received stimulus. For instance, as illustrated by playback map 1006, in response to musical stimulus 1002(a), the computing device can select and playback an audio sample that corresponds to a guitar note (C#) with a velocity level 1006(a) of 99.
In some embodiments, in response to the second instance of the musical stimulus, the computing device can select an audio sample that corresponds to a different velocity level. For instance, as illustrated by playback map 1006, in response to musical stimulus 1002(b), the computing device can select an audio sample that corresponds to the same guitar note (C#) but with a different velocity level 1006(b), i.e. a higher velocity level of 100. In some embodiments, in response to the second instance of the musical stimulus 1002(b), an audio sample corresponding to the same note but with a lower velocity level 1008(b) can be selected. For instance, as illustrated by playback map 1008 shown in
In the example depicted in
In some embodiments, upon selecting the audio sample corresponding to a different velocity level, the computing device can playback the audio sample using an audio output device such as a speaker. In some embodiments, prior to playing back the selected sample, a volume level matching can be performed on the audio sample corresponding to the higher or lower velocity level. For instance, audio samples corresponding to different velocity levels may be associated with different output volume levels since an increase in velocity level (e.g., an increase in the speed or force with which a note has been struck) may generally result in an increase in output volume level. In some embodiments, the computing device can “scale” the volume level of the audio sample selected in response to the second instance of the musical stimulus to more closely match the volume level of the audio sample selected in response to the first instance of the musical stimulus. As an example, referring back to playback map 1006 shown in
In various embodiments, output volume levels can be increased or decreased to more closely match the output volume level of the audio sample played back in response to the first instance of the musical stimulus in a number of different ways. For instance, the overall volume level of the selected audio sample can be increased or decreased until its peak volume level (e.g., the point on the audio waveform with the highest amplitude) is equal or approximately equal to the peak volume level of the audio sample played back in response to the first instance of the musical stimulus. In some embodiments, an increase or decrease in volume level can be applied uniformly across the length of an audio sample. In some other embodiments, volume modifications can be applied in a non-uniform manner. For instance, in the case of an audio sample having a decaying sound pattern, in some embodiments, the volume level of the “attack” portion of the sample can be modified differently than the decaying “tail” portion of the sample. In various embodiments, output volume levels of an audio samples can be increased and/or decreased using any suitable modification parameters.
By performing volume level matching as described above, in some embodiments, the output volume level of the audio sample can more closely match that of the initial audio sample corresponding to the original velocity level while retaining the variation in audio characteristics (e.g., tonal, timbral, and/or other differences) associated with different velocities.
In
As described above in the context of musical stimuli 1002(a) and 1002(b) shown in
In
In some other embodiments, in response to the third instance of the musical stimulus 1102(c), the computing device can select an audio sample corresponding to a velocity level that creates an “alternating” pattern of velocity levels. For instance, as illustrated in playback map 1108, the computing device can select an audio sample that corresponds to a velocity level 1108(c) that is lower than that of the received stimuli, i.e. a velocity level of 98. Since the audio samples selected in response to musical stimuli 1102(a)-(c) correspond to alternating velocity levels of 99, 100, and 98 in this example, an alternating pattern of velocity levels can be created. In such embodiments, audio samples selected in response to subsequent instances of the musical stimulus can be chosen in accordance with the alternating pattern (e.g., 99, 100, 98, 99, 100, 98). In various embodiments, any suitable pattern of velocity levels can be created in response to repeated musical stimuli. Moreover, as described above, a volume level matching can be performed prior to playing back an audio sample corresponding to a velocity level lower or higher than that of the received musical stimuli.
In
In response to the first instance of the musical stimulus, the computing device can select an audio sample corresponding to the velocity level of the musical stimuli. As shown in playback map 1206, in response to musical stimulus 1202(a), the computing device can select and playback an audio sample that corresponds to the same guitar note (e.g., C#) and the same velocity level 1206(a), i.e. a velocity level of 99.
In some embodiments, the time interval between the first and second instances of a musical stimulus can be considered in determining whether to select an audio sample corresponding to different velocity level in response to a second instance of the musical stimulus. Thus, in response to the second instance of the musical stimulus 1202(b), the computing device can measure or otherwise determine the time interval between the first and second stimuli (e.g., four beats). The time interval can then be compared to a threshold time interval. In some embodiments, if the measured time interval is greater than (or equal to) the threshold time interval, the computing device may not select an audio sample corresponding to a different velocity level to playback in response to the second instance of the stimulus. For instance, as shown in playback map 1206, in response to musical stimulus 1202(b) being positioned four beats apart from musical stimulus 1202(a), the computing device may select and playback an audio sample that corresponds to the same guitar note (e.g., C#) and the same velocity level 1206(b), i.e. a velocity level of 99, as that associated with the received stimuli. Thus, the same audio sample can be played back in response to both musical stimuli 1202(a) and 1202(b) when the time interval between the stimuli meets or exceeds a threshold time interval.
In some embodiments, the computing device may further determine whether repeated instances of a musical stimulus are consecutive. For instance, in an arrangement including musical stimuli that correspond to other instruments (or components of the same instrument), and that are positioned between repeated instances of the musical stimulus, in some embodiments, the computing device may not select an audio sample corresponding to a different velocity level to playback in response to the second instance of the stimulus. Further, in some embodiments, if two or more musical stimuli are received that correspond to different velocity levels, the computing device may select audio samples corresponding to velocity levels that are the same as that of the received musical stimuli.
As illustrated in
At step 1304, a first audio sample corresponding to the first velocity level of the received musical stimulus can be played back. In some embodiments, the first audio sample can be one or a plurality of audio samples that correspond to different velocity levels of the musical stimulus.
At step 1306, a second instance of the musical stimulus having the first velocity level can be received and, at step 1308, a second audio sample corresponding to a second velocity level of the received musical stimulus can be selected from the plurality of audio samples. In some embodiments, the first and second velocity levels can be adjacent velocity levels. In some embodiments, a time interval between the first and second instances of the musical stimulus can be measured and compared to a threshold time interval. In such embodiments, the measured time interval can be determined to be within the threshold time interval prior to selecting the second audio sample corresponding to the second velocity level. In some embodiments, the first and second instances of the musical stimulus can be determined to have been received consecutively prior selecting the second audio sample corresponding to the second velocity level.
At step 1310, the second audio sample can be played back. In some embodiments, the second audio sample may include different audio characteristics than the first audio sample. For instance, the first and second audio samples may have different tonal, timbral, and/or other characteristics. In some embodiments, the first velocity level can correspond to a first volume level and the second velocity level can correspond to a second volume level. In such embodiments, playing back the second audio sample can include modifying the second volume level. In some embodiments, modifying the second volume level can include scaling the second volume level in accordance with the first volume level.
Audio samples corresponding to different velocity levels may include differences in tone, timbre, or other audio characteristics. Thus, by selecting audio samples for playback with different velocity levels, variations can be introduced into an arrangement or performance in which repeated notes are played. Such variation may provide for a more natural sounding and realistic simulation of a live performance using a real musical instrument.
As described above, system 100 illustrated in
Bus subsystem 1504 can provide a mechanism for allowing the various components and subsystems of computer system 1500 communicate with each other as intended. Although bus subsystem 1504 is shown schematically as a single bus, alternative embodiments of the bus subsystem may utilize multiple busses.
Processor 1502, which can be implemented as one or more integrated circuits (e.g., a conventional microprocessor or microcontroller), controls the operation of computer system 1500. One or more processors 1502 may be provided. These processors may include single core or multicore processors. In various embodiments, processor 1502 can execute a variety of programs in response to program code and can maintain multiple concurrently executing programs or processes. At any given time, some or all of the program code to be executed can be resident in processor(s) 1502 and/or in storage subsystem 1506. Through suitable programming, processor(s) 1502 can provide various functionalities described above.
Network interface subsystem 1516 provides an interface to other computer systems and networks. Network interface subsystem 1516 serves as an interface for receiving data from and transmitting data to other systems from computer system 1500. For example, network interface subsystem 1516 may enable computer system 1500 to connect to one or more devices via the Internet. In some embodiments network interface 1516 can include radio frequency (RF) transceiver components for accessing wireless voice and/or data networks (e.g., using cellular telephone technology, advanced data network technology such as 3G, 4G or EDGE, WiFi (IEEE 802.11 family standards, or other mobile communication technologies, or any combination thereof), GPS receiver components, and/or other components. In some embodiments network interface 1516 can provide wired network connectivity (e.g., Ethernet) in addition to or instead of a wireless interface.
User interface input devices 1512 may include a keyboard, pointing devices such as a mouse or trackball, a touchpad or touch screen incorporated into a display, a scroll wheel, a click wheel, a dial, a button, a switch, a keypad, audio input devices such as voice recognition systems, microphones, eye gaze systems, and other types of input devices. In general, use of the term “input device” is intended to include all possible types of devices and mechanisms for inputting information to computer system 1500. For example, in an iPhone®, user input devices 1512 may include one or more buttons provided by the iPhone® and a touchscreen which may display a software keyboard, and the like.
User interface output devices 1514 may include a display subsystem, indicator lights, or non-visual displays such as audio output devices, etc. The display subsystem may be a cathode ray tube (CRT), a flat-panel device such as a liquid crystal display (LCD), a projection device, a touch screen, and the like. In general, use of the term “output device” is intended to include all possible types of devices and mechanisms for outputting information from computer system 1500. For example, a software keyboard may be displayed using a flat-panel screen.
Storage subsystem 1506 provides a computer-readable storage medium for storing the basic programming and data constructs that provide the functionality of some embodiments. Storage subsystem 1506 can be implemented, e.g., using disk, flash memory, or any other storage media in any combination, and can include volatile and/or non-volatile storage as desired. Software (programs, code modules, instructions) that when executed by a processor provide the functionality described above may be stored in storage subsystem 1506. These software modules or instructions may be executed by processor(s) 1502. Storage subsystem 1506 may also provide a repository for storing data used in accordance with the present invention. Storage subsystem 1506 may include memory subsystem 1508 and file/disk storage subsystem 1510.
Memory subsystem 1508 may include a number of memories including a main random access memory (RAM) 1518 for storage of instructions and data during program execution and a read only memory (ROM) 1520 in which fixed instructions are stored. File storage subsystem 1510 may provide persistent (non-volatile) memory storage for program and data files, and may include a hard disk drive, a floppy disk drive along with associated removable media, a Compact Disk Read Only Memory (CD-ROM) drive, an optical drive, removable media cartridges, and other like memory storage media.
Computer system 1500 can be of various types including a personal computer, a portable device (e.g., an iPhone®, an iPad®, and the like), a workstation, a network computer, a mainframe, a kiosk, a server or any other data processing system. Due to the ever-changing nature of computers and networks, the description of computer system 1500 depicted in
Embodiments can be implemented in digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them. Embodiments of the subject matter described in this specification can be implemented as one or more computer program products, i.e., one or more modules of computer program instructions encoded on a non-transitory computer-readable medium for execution by, or to control the operation of, data processing apparatus.
Various embodiments described above can be realized using any combination of dedicated components and/or programmable processors and/or other programmable devices. The various embodiments may be implemented only in hardware, or only in software, or using combinations thereof. The various processes described herein can be implemented on the same processor or different processors in any combination. Accordingly, where components or modules are described as being configured to perform certain operations, such configuration can be accomplished, e.g., by designing electronic circuits to perform the operation, by programming programmable electronic circuits (such as microprocessors) to perform the operation, or any combination thereof. Processes can communicate using a variety of techniques including but not limited to conventional techniques for interprocess communication, and different pairs of processes may use different techniques, or the same pair of processes may use different techniques at different times. Further, while the embodiments described above may make reference to specific hardware and software components, those skilled in the art will appreciate that different combinations of hardware and/or software components may also be used and that particular operations described as being implemented in hardware might also be implemented in software or vice versa.
The various embodiments are not restricted to operation within certain specific data processing environments, but are free to operate within a plurality of data processing environments. Additionally, although embodiments have been described using a particular series of transactions, this is not intended to be limiting.
Thus, although specific invention embodiments have been described, these are not intended to be limiting. Various modifications and equivalents are within the scope of the following claims.
This application claims the benefit of U.S. Provisional Application No. 61/845,780, filed Jul. 12, 2013, entitled “Selecting Audio Samples in Response to Musical Stimuli,” the disclosure of which is incorporated by reference herein in its entirety. This application is also related to commonly-owned co-pending U.S. application Ser. No. 13/965,913, filed of even date herewith, entitled “Selecting Audio Samples Based on Excitation State,” the disclosure of which is incorporated by reference herein in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
5018430 | Iinima et al. | May 1991 | A |
5319151 | Shiba et al. | Jun 1994 | A |
5471008 | Fujita et al. | Nov 1995 | A |
5726371 | Shiba et al. | Mar 1998 | A |
5862063 | Thome et al. | Jan 1999 | A |
6040516 | Masuda | Mar 2000 | A |
6162983 | Takahashi | Dec 2000 | A |
6191350 | Okulov et al. | Feb 2001 | B1 |
7342166 | Kay | Mar 2008 | B2 |
7826911 | Bennett | Nov 2010 | B1 |
8178773 | Riopelle | May 2012 | B2 |
8404958 | Shrem | Mar 2013 | B2 |
8901406 | Buskies et al. | Dec 2014 | B1 |
20120186419 | Roderick | Jul 2012 | A1 |
20130087037 | Dreher | Apr 2013 | A1 |
20150082973 | Little et al. | Mar 2015 | A1 |
Entry |
---|
Notice of Allowance for U.S. Appl. No. 13/965,913, mailed on Oct. 8, 2014, 8 pages. |
Non-Final Office Action for U.S. Appl. No. 14/530,130, mailed on Aug. 14, 2015, 6 pages. |
Number | Date | Country | |
---|---|---|---|
20150013531 A1 | Jan 2015 | US |
Number | Date | Country | |
---|---|---|---|
61845780 | Jul 2013 | US |