The exemplary and non-limiting embodiments described herein relate generally to audio technologies and, more particularly, to methods of controlling separate audio objects in a mixed sound production and apparatuses for such methods.
Various technologies allow end users to separate audio tracks or objects after formal productions of music or other audio has been mixed. Such technologies include DOLBY ATMOS, blind signal separation (BSS), and MPEG Spatial Audio Object Coding (SAOC). Music producers generally want to limit an end user's ability to apply these technologies in efforts to maintain the integrity of the originally-produced music.
The following summary is merely intended to be exemplary. The summary is not intended to limit the scope of the claims.
In one exemplary aspect, a method comprises: providing an audio file comprising two or more discrete tracks; separating the two or more discrete tracks; setting a limit on an amount at least one of the two or more discrete tracks may be altered; and outputting the separated and limited discrete tracks as a file for use by an end user.
In accordance with another exemplary aspect, a method comprises: receiving an audio file comprising two or more discrete tracks; analyzing the two or more discrete tracks to determine a set limit on possible remixing for at least one of the two or more discrete tracks; editing at least one of the two or more discrete tracks; and remixing the two or more discrete tracks based on the determined set limit on possible remixing for the at least one of the two or more discrete tracks.
In accordance with another exemplary aspect, an apparatus comprises: at least one processor; and at least one non-transitory memory including computer program code configured to, with the at least one processor, cause the apparatus at least to: receive an audio file comprising two or more discrete tracks; separate the two or more discrete tracks from each other; analyze the two or more discrete tracks to determine a set limit on possible remixing for at least one of the two or more discrete tracks; and allow a user to remix the two or more discrete tracks based on the determined set limit on possible remixing for the at least one of the two or more discrete tracks.
The foregoing aspects and other features are explained in the following description, taken in connection with the accompanying drawings, wherein:
Referring to the Figures, exemplary embodiments of methods of allowing an end user to alter, within defined limits, a mix of instrument and voice tracks are disclosed herein. Also disclosed herein are exemplary embodiments of apparatuses for carrying out such methods. Although some technologies (e.g., CD, MP3, and the like) do not allow the end user to alter a mix of tracks, more recent technologies do make such allowances. Such altering may be via the use of technology formats such as DOLBY ATMOS, blind signal separation (BSS), and MPEG Spatial Audio Object Coding (SAOC). For example, DOLBY ATMOS allows a completely free editing of instrument and voice tracks. BSS, however, provides a slightly less free editing ability due to little or no user-available information about the source signals or mixing processes used therefor. Editing using BSS may also result in products that suffer from separation artefacts. Similarly, in using MPEG SAOC, free editing is allowed, but audio quality may suffer due to changes in the object separation that are too drastic relative to the source track.
In any editing technology, music recordings may be recognized using any suitable method for the automatic identification of sound recordings. Some music files may be separated into instrument and voice tracks using any suitable algorithm (e.g., DUET BSS algorithm). Additionally, the instrument in each track may be recognized by any suitable automatic musical instrument recognition algorithm. A dominant instrument in a song can also be recognized by analyzing the music for a dominant tag.
In using any of the foregoing technologies to separate audio tracks, the end user may have a tendency to excessively modify music or other audio to their liking. For example, a guitar player may want to better hear the guitar track in a song in order to learn the notes or chords, a vocalist may want to understand lyrics, a person viewing a movie may want to hear the explosions louder or to filter out background noise in order to hear what a character is saying, or a person may want to simply hear only a voice track without hearing music, a laugh track, or other background sound in an effort to keep the overall volume at a minimum.
However, audio- or music producers generally do not want to allow end users complete control for one or more reasons. In particular, for reasons related to copyright, artists and producers want to limit the ability of an end user to completely single out instruments into separate tracks because such separated tracks may be used for unauthorized projects. Drastic changes to a song can also make that song sound bad, and when such versions of that song are widely disseminated (e.g., over the Internet), the original artist may incur negative reviews and/or bad publicity. Furthermore, it is well-noted that removing a laugh track from a comedy show often eliminates or reduces any comic effect.
Accordingly, both artists and producers generally dislike having their work products excessively modified by end users. To address issues of excessive modification of produced audio, music artists and music producers may desire to curtail the ability of an end user to modify a song (or other work product) by setting limits (either automatically or manually) regarding the extent to which the end user can remix audio files (for example, combine, pan, add reverberation, add autotuning ability, alter levels (e.g., levels of amplification relative to other levels), and the like). Such limits may include separating audio sources based on their location (e.g., direction of sound propagation) and playback of the separated audio sources. Other limits may be imposed by manipulation of playback options based on metadata associated with the audio. The exemplary embodiments disclosed herein are directed to imposing limits on the separation of instrument tracks (which is different from separation based on direction of sound propagation) as well as limiting the separation quality in playback.
The exemplary embodiments disclosed herein are directed to methods and tools for music producers to limit an end user's manipulation (using a user interface (UI)) of an audio mix of a song after instrument tracks have been separated. The manipulation may have either automatic limits or limits that can be set by the music producers. The exemplary embodiments described herein take into account the quality of the separation system when deciding the limits. In the alternative, a music producer can take into account the quality of the separated instrument tracks. Furthermore, the end user may be able to exceed the limits and listen to the result, but they may be prevented from saving or publishing (where license is given) remixes that exceed the limits. In addition, the exemplary embodiments disclosed herein may be used to enable inexperienced end users to remix their own recorded content with the help of BSS while limiting the risk of compromising the quality of the recorded content due to over remixing.
Referring to
Method 100 is initiated with a music production step 110, a first separation step 120, and a limit setting step 130. In the music production step 110, a music producer uses discrete tracks of instruments and vocals (e.g., one instrument per track, one vocal per track, etc.) to make a preferred mix of a song. The music producer may then output and/or save the preferred mix in a particular format (e.g., mono, stereo, 5.1, or the like). The limits are stored as metadata alongside the actual song audio. In some embodiments, the song audio is stored in MPEG SAOC format, and the metadata is added to the free metadata field in the SAOC format. In some other embodiments, the song audio is stored in MPEG SAOC format and the metadata is added to an MP4 container. In some other embodiments, the song audio is stored as AAC (advanced audio coding), and the metadata is added to the free metadata field in the AAC format. With MPEG SAOC the tracks are stored within the format, and modifying the stored tracks is easy.
With AAC the tracks of the preferred mix may need to be separated into discrete tracks before modification, as indicated in the first separation step 120. An audio object separation system is used both in the music production end and at the user end. The separation system may need to be the same so that the metadata that the music producers decide on is applicable also at the user end. With some formats, the preferred mix may be separated into instrument tracks using BSS. If SAOC is used, however, the separation is built in. In any case, however, although the tracks are considered to be discrete, the separation thereof may not be perfect due to limitations in the production (e.g., particularly with regard to AAC and SAOC).
In the limit setting step 130, the music producer listens to the separated tracks with different remixes and determines allowed limits of user remixing. In this way, the music producer ensures that the determined limits are suitable for the used separation quality. Naturally, the producers can also consider aspects other than separation quality when deciding the limits. Such aspects may include artistic views and the like. The producers may also set limits such as, for example, “Vocal track cannot be attenuated” (so the limit may be zero, i.e., no alteration of the track is allowed), “Background singing track must always be at least 3 dB less than lead vocal track,” and the like. In some exemplary embodiments, the limits may be determined using a computer program.
After the music production step 110, the first separation step 120, and/or the limit setting step 130, the separated/limited audio and metadata are outputted as an output file 140. In various exemplary embodiments, the output file 140 may be a finished production of an audio file by the music producer for the end user and may comprise instrument and/or vocal tracks.
If desired, the output file 140 may be encrypted so that only an approved version of a player program may decrypt it. In this way it can be made so that the end user cannot use other player programs to remix the songs excessively. In other words, a digital rights management (DRM) scheme may estimate the degree of change (as defined by the allowed limits for editing or remixing) before allowing saving, copying, or sharing of the content. The output file 140 is then stored, streamed, and/or sent to the player program in a sending step 150.
When the player program receives the output file 140 from the sending step 150, the player program checks for the existence of metadata and allows the end user to remix the song accordingly. As shown in a second separation step 160, the audio file of the preferred mix may be separated into instrument and/or voice tracks using BSS. If, on the other hand, the separation in the second separation step 160 is via SAOC, the separation is built in. In embodiments in which DOLBY ATMOS is used in the second separation step 160, at least some of the tracks may already be discrete because DOLBY ATMOS generally maintains the separation of tracks through the sending step 150. As shown in a remix step 170, the metadata is received, an analysis of the metadata is made to ascertain limits as to how much the end user may remix the tracks, and the end user listens to the audio and may remix it to the extent allowable using a UI. In doing so, the end user may not exceed the limits set in the metadata by the producers in the limit setting step 130.
In some embodiments, however, the end user may be permitted to exceed the set limits in a limit exceeding step 180. In the limit exceeding step 180, the end user may exceed the set limits temporarily, but the result may not be stored. In some other embodiments, the end user may be allowed to exceed the set limits only when the end user is constantly manipulating a control (e.g., holding a finger on a control button).
Referring to
Method 200 is initiated with a music production step 210, a first separation step 220, a quality estimation step 225, and a limit setting step 230. In the music production step 210, original instrument tracks are available from the music production. A music producer makes a preferred mix of a song and outputs and/or saves it in a particular format (e.g., mono, stereo, 5.1, or the like).
As shown in the first separation step 220, the preferred mix is separated into instrument tracks. The separated instrument tracks may be slightly different from the original instrument tracks because the separation may not be perfect. The separation can be done using BSS for traditional music files and SAOC for SAOC files.
In the quality estimation step 225, the quality of the separation is estimated using different criteria. In some embodiments only the file format is considered in the quality evaluation. SAOC files are considered to have high quality, whereas BSS files are considered to have lower quality. Different metrics of the files may also be calculated, for example, the system may calculate cross correlation between the original instrument tracks and the separated tracks. Higher correlation generally corresponds to higher quality.
In the limit setting step 230, limits may be set for how the end user can remix the files. Typically, higher quality files can be edited more, and lower quality files less. The limits may be written into metadata.
In addition to separation quality related limits, there may be other limits. The player program may analyze different characteristics of the tracks of the song. The analysis may include recognizing instruments using automatic instrument recognition algorithms, recognizing the dominant instrument of the song, recognizing the tempo of the song, recognizing frequency- or time-domain characteristics such as impulses, zero-crossing rate, spectral tilt, overall spectrum, transient or sinusoidal nature of the song, and the like. The analysis can be performed for the whole song or only for parts of the song. The end user may remix the song using a UI. The player program software sets limits for end user remixing based on the analysis. The limits may be, for example, “The dominant instrument cannot be attenuated,” “Vocal tracks cannot be attenuated,” “Songs with tempo>180 bpm cannot be sped up,” and the like. Also, limits can be used in combination with music analysis (e.g., “The dominant instrument cannot be attenuated more than 2 dB,” “Vocal tracks cannot be attenuated,” and the like). These limits may also be written into the metadata.
The separated/estimated/limited audio and metadata are then outputted as an output file 240 as instrument tracks. The output file 240 is then stored, streamed, and/or sent to the end user in a sending step 250.
When the player program receives the output file 240 from the sending step 250, the player program receives a music song file with metadata that describes the limits for editing. The player program separates the song into audio object tracks in a second separation step 260. In the second separation step 260, the audio file may be separated using BSS or SAOC (with the separation built in).
As shown in a remix step 270, the end user may remix the song using a UI. The end user may not remix, the song so that the remix would exceed the limits set in the metadata. The limits may be, for example, “An instrument may be set louder or quieter but not more than described in the metadata.” For each track the metadata may contain limits, e.g., in Decibels. Such limits may be, for example, “The level of Track 3 can by modified by the user from −3 dB to +3 dB” if the separation quality is low, and “The level of Track 3 can by modified by the user from −12 dB to +12 dB” if the separation quality is high.
In some embodiments of method 200 (as with method 100), the end user may exceed the set limits in a limit exceeding step 280. In the limit exceeding step 280, the end user may exceed the set limits temporarily, but the result may not be stored. In some other embodiments, the end user may be allowed to exceed the set limits only when the end user is constantly manipulating a control (e.g., holding a finger on a control button).
Referring to
Method 300 comprises a music production step 310 in which one or more original instrument tracks are available from the music production. A music producer makes a preferred mix of a song and outputs and/or saves it in a particular format (e.g., mono, stereo, 5.1, or the like). After the music production step 310, the preferred mix is outputted as an output file 340 of instrument tracks and stored, streamed, and/or sent to the end user in a sending step 350.
In a separation step 360, a player program receives a music song file with no metadata. The player program separates the preferred mix into instrument tracks. The separation can be done using BSS for traditional music files and SAOC for SAOC files.
Quality of the separation may then be estimated in an end user quality estimation step 365. The end user quality estimation step 365 may employ different criteria. In some embodiments only the file format is considered in the quality evaluation. SAOC files are considered to have high quality, whereas BSS files are considered to have lower quality. Different metrics of the files may also be calculated, for example, the system may calculate cross correlation between the separated tracks. If the tracks have low correlation, then the separation may be considered to have a high success rating. However, if the tracks have high correlation, then the separation may be considered to have a lower success rating.
As shown in a limit setting step 367, limits are set with regard to how the end user may remix the files. In such a step, higher quality files can be edited more, and lower quality files can be edited less. Such limits may be, for example, “The level of Track 3 can by modified by the user from −3 dB to +3 dB” if the separation quality is low, and “The level of Track 3 can by modified by the user from −12 dB to +12 dB” if the separation quality is high.
In addition to separation quality related limits there may be other limits. For example, the player program may analyze different characteristics of the tracks of the song. The analysis may include recognizing instruments using automatic instrument recognition algorithms, recognizing the dominant instrument of the song, recognizing the tempo of the song, recognizing frequency- or time-domain characteristics such as impulses, zero-crossing rate, spectral tilt, overall spectrum, transient or sinusoidal nature of the song, and the like. The analysis can be performed for the whole song or only for parts of the song. The user may remix the song using a UI. The player program software sets limits for user remixing based on the analysis. The limits may be, for example, “The dominant instrument cannot be attenuated,” “Vocal tracks cannot be attenuated,” “Songs with tempo>180 bpm cannot be sped up,” and the like. Also, limits can be used in combination with music analysis (e.g., “The dominant instrument cannot be attenuated more than 2 dB,” “Vocal tracks cannot be attenuated,” and the like).
As shown in a remix step 370, the end user may remix the song using a UI. The end user may not remix the song so that the remix would exceed the limits set.
The end user may exceed the set limits in a limit exceeding step 380. In the limit exceeding step 380, the end user may exceed the set limits temporarily, but the result may not be stored. In some other embodiments, the end′ user may be allowed to exceed the set limits only when the end user is constantly manipulating a control (e.g., holding a finger on a control button).
Referring to
Referring to
Referring to
Particularly in embodiments in which the UI 400 is graphical, the music files 460 may include visual files where visual objects of the visual files have been mapped to instruments and/or voices, i.e., music objects in the music files 460. The end user may then manipulate the visual objects and at the same time automatically manipulate the music objects. Manipulation of the music objects includes, but is not limited to, selecting a band member visual object to increase their track volume, and the like. As shown in
Referring to
Other aspects of the UI 400 may include displaying different pictures on the display 420 at different parts of songs when players change or are silent. Additionally, a record label associated with the song may provide several example mixes with associated pictures where band member visual objects match the mix. Such a system may automatically choose an end user-preferred mix based on either user settings or the end user's previous habits. For example, if the end user always mixes the bass player louder, a mix may be selected where the bass player is loudest of all the players. The end user may be supplied with a visual confirmation of the mix because the system shows him or her an image where the bass player visual object is larger than others.
Remixing by the end user on the UI 400 may also be channel-based so that the end user amplifies/attenuates an instrument differently in different channels in multichannel audio. In this way, the end user may control the perceived spatial location of the instrument. The differing amount of amplification may also be limited in metadata or by automatic analysis, e.g., higher quality files associated with the spatial location can be changed more than lower quality files.
In any embodiment of the UI 400, the end user may exceed the limits set by the music producers. In the UI 400, this may be visible, for example, on a touch display device so that the control moves slower after the limit has been exceeded and the end user has to swipe many times to move the control. The control may be a slider as shown in
In embodiments of the UI 400 in which the end user may exceed the limits set by the producers, the exceeded limits may be noted on the display 420 by being made visible, for example, on a touch display device so that the remix exceeds limits only if the user holds his finger on the slider and keeps it above the slider maximum value. When the user releases his finger the slider returns to the maximum value.
In addition to using the exemplary embodiments described herein for music files, the embodiments described herein are applicable for audio broadcasts, teleconferencing, movie audio, audio books, and the like. The exemplary embodiments described herein may also be used with an end user's own recorded content because allowing end users to remix their own content is a beneficial feature (but traditional remix with BSS may require some expertise, which may cause the end-result to sound less than desirable, thereby possibly causing end users to stop using it). However, with the exemplary embodiments described herein, BSS can be used by inexperienced end users with less of a risk of them detracting from the overall quality of their own content.
With regard to the embodiments described herein: limits for remixing may be set individually for each instrument; artists can set instrument-wise limits even if audio is sent using backwards-compatible audio formats such as AAC or MP3 using audio object separation and metadata; in some embodiments the limits for remixing may be set automatically; in some embodiments automatic instrument-wise limits may be used even without metadata; and an intuitive way for end users to create personal music mixes is realized.
Referring now to all of the Figures described herein, any of the foregoing exemplary embodiments may be implemented in software, hardware, application logic, or a combination of software, hardware, and application logic. The software, application logic, and/or hardware may reside in the UI 400 to remix audio. If desired, all or part of the software, application logic, and/or hardware may reside at any other suitable location. In an example embodiment, the application logic, software, or an instruction set is maintained on any one of various computer-readable media. A “computer-readable medium” may be any media or means that can contain, store, communicate, propagate, or transport instructions for use by or in connection with an instruction execution system, apparatus, or device, such as a computer. A computer-readable medium may comprise a computer-readable storage medium that may be any media or means that can contain or store the instructions for use by or in connection with an instruction execution system, apparatus, or device, such as a computer.
In one exemplary embodiment, a method comprises: providing an audio file comprising two or more discrete tracks; separating the two or more discrete tracks; setting a limit on an amount at least one of the two or more discrete tracks may be altered; and outputting the separated and limited discrete tracks as a file for use by an end user.
In the method, setting a limit on an amount each of the two more discrete tracks may be altered may comprise storing a limit as metadata. Storing a limit as metadata may comprise manually setting the limit in the metadata. Manually setting the limit in the metadata may comprise manually determining upper and lower limits of acceptable sound. Storing a limit as metadata may comprise automatically setting the limit in the metadata based on a configuration of the two or more discrete tracks in the audio file. Automatically setting the limit in the metadata may comprise determining a sound quality based on a correlation between the two or more discrete tracks of the audio file and the two or more discrete tracks after separation. The method may further comprise encrypting the outputted file. Storing a limit as metadata may comprise storing audio in a spatial audio object coding format. Storing a limit as metadata may comprise storing audio in a spatial audio object coding format and storing the metadata in an MP4 container.
In another exemplary embodiment, a method comprises: receiving an audio file comprising two or more discrete tracks; analyzing the two or more discrete tracks to determine a set limit on possible remixing for at least one of the two or more discrete tracks; editing at least one of the two or more discrete tracks; and remixing the two or more discrete tracks based on the determined set limit on possible remixing for the at least one of the two or more discrete tracks.
The method may further comprise separating the two or more discrete tracks from each other before analyzing. Separating the two or more discrete tracks from each other may be based on a blind signal separation format or a spatial audio object coding format. The method may further comprise exceeding the determined set limit on possible remixing. The received audio file may include no metadata. Analyzing the two or more discrete tracks may comprise calculating a correlation between the two or more separated discrete tracks and setting a limit on an extent of possible remixing based on the calculated correlation.
In another exemplary embodiment, an apparatus comprises: at least one processor; and at least one non-transitory memory including computer program code configured to, with the at least one processor, cause the apparatus at least to: receive an audio file comprising two or more discrete tracks; separate the two or more discrete tracks from each other; analyze the two or more discrete tracks to determine a set limit on possible remixing for at least one of the two or more discrete tracks; and allow a user to remix the two or more discrete tracks based on the determined set limit on possible remixing for the at least one of the two or more discrete tracks.
The apparatus may be further caused to separate the two or more discrete tracks from each other based on a blind signal separation format. The apparatus may be further caused to separate the two or more discrete tracks from each other based on a spatial audio object coding format. The apparatus may be further caused to allow the user to exceed the determined set limit on possible remixing. Analyzing the two or more discrete tracks may comprise further causing the apparatus to calculate a correlation between the two or more separated discrete tracks and to set a limit on an extent of possible remixing based on the calculated correlation.
It should be understood that the foregoing description is only illustrative. Various alternatives and modifications can be devised by those skilled in the art. For example, features recited in the various dependent claims could be combined with each other in any suitable combination(s). In addition, features from different embodiments described above could be selectively combined into a new embodiment. Accordingly, the description is intended to embrace all such alternatives, modifications, and variances which fall within the scope of the appended claims.
Number | Name | Date | Kind |
---|---|---|---|
3797037 | Kolpek | Mar 1974 | A |
5960152 | Sawabe | Sep 1999 | A |
6490359 | Gibson | Dec 2002 | B1 |
6560577 | Gilbert | May 2003 | B1 |
7157638 | Sitrick | Jan 2007 | B1 |
7158456 | Han | Jan 2007 | B2 |
7450705 | Cohen | Nov 2008 | B1 |
7605320 | Le | Oct 2009 | B2 |
7689394 | Furem | Mar 2010 | B2 |
7912566 | Lee | Mar 2011 | B2 |
7935881 | Aimi | May 2011 | B2 |
7960638 | Miyajima | Jun 2011 | B2 |
8019449 | Barzegar | Sep 2011 | B2 |
8085269 | Classen | Dec 2011 | B1 |
8170884 | Vaudrey et al. | May 2012 | B2 |
8173883 | Willacy et al. | May 2012 | B2 |
8290769 | Taub et al. | Oct 2012 | B2 |
8315396 | Schreiner et al. | Nov 2012 | B2 |
8541676 | Waldman | Sep 2013 | B1 |
8939056 | Neal, III | Jan 2015 | B1 |
9024166 | Ramos | May 2015 | B2 |
9076264 | Gillespie et al. | Jul 2015 | B1 |
9293127 | Serletic | Mar 2016 | B2 |
9294862 | Kim | Mar 2016 | B2 |
9310959 | Serletic, II | Apr 2016 | B2 |
9319821 | Arrasvuori et al. | Apr 2016 | B2 |
9330720 | Lee | May 2016 | B2 |
10002596 | Vilermo | Jun 2018 | B2 |
20010015917 | Heo | Aug 2001 | A1 |
20020024889 | Taira | Feb 2002 | A1 |
20020045962 | Kobayashi | Apr 2002 | A1 |
20020051407 | Griner | May 2002 | A1 |
20030023421 | Finn | Jan 2003 | A1 |
20030086341 | Wells et al. | May 2003 | A1 |
20030091204 | Gibson | May 2003 | A1 |
20030165328 | Grecia | Sep 2003 | A1 |
20040111171 | Jang | Jun 2004 | A1 |
20050013583 | Itoh | Jan 2005 | A1 |
20050268323 | Yamaguchi | Dec 2005 | A1 |
20060028951 | Tozun | Feb 2006 | A1 |
20060075422 | Choi | Apr 2006 | A1 |
20060129842 | Herberger et al. | Jun 2006 | A1 |
20060209652 | Toms | Sep 2006 | A1 |
20070133811 | Hashimoto | Jun 2007 | A1 |
20070198551 | Barnes et al. | Aug 2007 | A1 |
20090012779 | Ikeda | Jan 2009 | A1 |
20090052290 | Tanaka | Feb 2009 | A1 |
20090106429 | Siegal | Apr 2009 | A1 |
20090147961 | Lee | Jun 2009 | A1 |
20090235300 | Hayashi | Sep 2009 | A1 |
20090306973 | Hiekata | Dec 2009 | A1 |
20100076577 | Lee | Mar 2010 | A1 |
20100280638 | Matsuda | Nov 2010 | A1 |
20100303262 | Okabayashi | Dec 2010 | A1 |
20110046759 | Kim | Feb 2011 | A1 |
20110111850 | Beerhorst | May 2011 | A1 |
20120101608 | Jang | Apr 2012 | A1 |
20120232681 | Mundy | Sep 2012 | A1 |
20120277894 | Lee | Nov 2012 | A1 |
20130138232 | Kim | May 2013 | A1 |
20130170651 | Lee | Jul 2013 | A1 |
20140041511 | Kaipainen | Feb 2014 | A1 |
20140053711 | Serletic, II | Feb 2014 | A1 |
20140115468 | Guerrero | Apr 2014 | A1 |
20140140536 | Serletic, II et al. | May 2014 | A1 |
20140248036 | Saari et al. | Sep 2014 | A1 |
20140254831 | Patton | Sep 2014 | A1 |
20140288685 | Haefeli | Sep 2014 | A1 |
20140348342 | Laaksonen et al. | Nov 2014 | A1 |
20140355789 | Bohrarper et al. | Dec 2014 | A1 |
20150016641 | Ugur et al. | Jan 2015 | A1 |
20150066481 | Terrell | Mar 2015 | A1 |
20150139426 | Tammi et al. | May 2015 | A1 |
20150242180 | Boulanger-Lewandowski | Aug 2015 | A1 |
20150243325 | Pacurariu | Aug 2015 | A1 |
20150317655 | Myers | Nov 2015 | A1 |
20150348524 | Skillings | Dec 2015 | A1 |
20150380014 | Le Magoarou | Dec 2015 | A1 |
20160062730 | Kwon | Mar 2016 | A1 |
20160157039 | Disch | Jun 2016 | A1 |
20160267914 | Hu | Sep 2016 | A1 |
20170115956 | Zalon | Apr 2017 | A1 |
20180005614 | Vilermo | Jan 2018 | A1 |
20180083587 | Tammi | Mar 2018 | A1 |
20180096705 | Vilermo | Apr 2018 | A1 |
Number | Date | Country |
---|---|---|
2533654 | Jun 2016 | GB |
WO-2007040791 | Apr 2007 | WO |
WO-2011087460 | Jul 2011 | WO |
WO-2013010188 | Jan 2013 | WO |
WO-2015154159 | Oct 2015 | WO |
Entry |
---|
Sylvain, Marchand, et al., “Dream: a Novel System for Joint Source Separation and Multitrack Coding”, Audio Engineering Society, http://www.aes.org/e-lib/browse.cfm?elib=16479; Oct. 25, 2012, 1 pg. |
Lund, Henrik Hautop, et al., “Remixing playware”, 23rd IEEE International Symposium on Robot and Human Interactive Communication, http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=6926229>; Aug. 25-29, 2014, 2 pgs. |
Sturmel, Nicolas, et al., “Linear Mixing Models for Active Listening of Music Productions in Realistic Studio Conditions”, Audio Engineering Society, http://www.aes.org/e-lib/browse.cfm?elib=16232; Apr. 26, 2012, 1 pg. |
“GarageBand for iPad”, Apple Inc., http://help.apple.com/garageband/ipad/1.0.1/#chs3928318c; 2001, 6 pgs. |
“MPEG Spatial Audio Object Coding (SAOC)”, http://mpeg.chiariglione.org/standards/mpeg-d/spatial-audio-object-coding; Oct. 2016, 8 pgs. |
“Upgrade Your Audio Experience”, Dolby Atmos Audio Technology, http://www.dolby.com/us/en/brands/dolby-atmos.html; Oct. 2016; 4 pgs. |
Richard, Scott, “The DUET blind Surce Separation Algorithm”, 2007, 25 pgs. |
Eronen, Antti, “Automatic Musical Instrument Recognition”, Master of Science Thesis, Tampere University of Technology, Apr. 11, 2001, 74 pgs. |
Falch, Cornelia et al., “Spatial Audio Object Coding With Enhanced Audio Object Separation”, Proc. of the 13th Int. Conference on Digital Audio Effects (DAFx-10, Sep. 6-10, 2010. |
Engdegard, Jonas, et al., “Spatial Audio Object Coding (SAOC)—The Upcoming MPEG Standard on Parametric Object Based Audio Coding”, Audio Engineering Society Convention 124, Audio Engineering Society, 2008, 15 pgs. |
Terentiev, Leonid, et al., “SAOC for Gaming—The Upcoming MPEG Standard on Parametric Object Based Audio Coding”, Audio Engineering Society Conference, 35th International Conference: Audio for Games, Audio Engineering Society, 2009, 7 pgs. |
Phon-Amnuaisuk Somnuk “Handling a Dynamic Mixture of Sources in Blind Source Separation Tasks”. Proceedings of the 2013 Conference on Technologies and Applications of Artificial Intelligence (TAAI), Dec. 6-8, 2013, Taipei, Taiwan. Piscataway, NJ, USA: IEEE, Dec. 6, 2013, p. 211-216, ISBN 978-1-4799-2528-2, [retrieved on Jan. 25, 2018], <DOI: 10.1109/TAAI.2013.50>, XP 032584659 section III. |
“Blind Signal Separation” Wikipedia article [online], Aug. 26, 2016, [retrieved on Jan. 25, 2018]. Retrieved from https://en.wikipedia.org/w/index.php?title=Blind_signal_separation&oldid=736311635 section “Approaches”. |
Number | Date | Country | |
---|---|---|---|
20180096705 A1 | Apr 2018 | US |