The present invention relates to an audio device, a program, and a control method.
Various techniques have recently been considered for audio devices and applications related to DAW (Digital Audio Workstation) and DTM (Desk Top Music). For example, Non-Patent Literature 1 discloses an application that makes it possible to execute ex-post sound separation on a computer.
However, an actual use of the above-described technique has still not been sufficiently proposed yet.
Therefore, an object of the invention is to provide, regarding an audio device that performs control related to a separation process, an audio device, a program, and a control method that make it possible to achieve high operability and flexibility.
The following describes preferred exemplary embodiments of the invention in detail with reference to the accompanying drawings. It is to be noted that, in this description and the accompanying drawings, components that have substantially the same functional configuration are indicated by the same reference signs, and thus redundant description thereof is omitted.
As illustrated in
The display 110 is a display device that displays a screen including various pieces of information in the audio device 100 and an operator for receiving a user operation. Various types of display devices are employable as the display 110, examples of which include a liquid crystal display device and an organic EL display device.
The operation unit 120 includes an input device such as a keyboard or a mouse.
The controlling module 130 is implemented in the audio device 100 by a processor such as a communication interface or a CPU (Central Processing Unit) and a memory serving as a work area, and controls an operation of the audio device 100. The controlling module 130 includes a display controlling module 131, an operation acquiring module 132, a separating module 133, a selecting module 134, and an outputting module 135 that are implemented by a processor operating in accordance with a program stored in a memory or received via a communication interface.
The display controlling module 131 causes the display 110 to display a screen including information of audio data of a music piece related to the audio device 100, and a GUI (Graphical User Interface) for receiving the user operation related to a setting and control with respect to the audio data.
The operation acquiring module 132 acquires information indicating the user operation based on the user operation via the operation unit 120. Thereafter, the operation acquiring module 132 supplies the information indicating the user operation to each of the components in the controlling module 130.
The separating module 133 executes a separation process of separating the audio data of the music piece into multiple parts that are included in the music piece. The multiple parts into which the audio data is to be separated each correspond to the corresponding one of sounds for respective parts of the music piece to be played based on the audio data. Here, each of the parts corresponds to, for example, a sound of a vocal or a musical instrument.
In addition to the above-described example, the part may be further subdivided and separated for each kind of musical instrument, for example, the drum part may be subdivided and separated into a bass drum part, a snare drum part, and a hi-hat part. In this case, a direct separation process may be executed, from the audio data of the music piece of a separation target into the bass drum part, the snare drum part, and the hi-hat part, or a stepwise separation process may be executed in which the drum part is subjected to music separation to thereby be separated stepwise into the bass drum part, the snare drum part, and the hi-hat part. In both cases, the parts of the music piece include a part obtained by subjecting each of the parts included in the music piece to the music separation, for example, one portion of a part that configures a part of a musical instrument and that is obtained by further subdividing each of the parts included in the music piece. Various known techniques are available for such a separation process of performing separation into multiple parts, and thus detailed explanation thereof is omitted.
It is to be noted that the audio device 100 may include a communication interface, and may be configured to execute the separation process by an external audio device, a computer, etc., or may be configured to acquire, via the unillustrated communication interface, the audio data that has been subjected to the separation process by the external audio device, the computer, etc., and to store the audio data in the storage 140. In this case, the audio device 100 does not include the device (the separating module 133) that executes the separation process, and the external audio device and the computer each operate as the device (the separating module 133) that executes the separation process.
The selecting module 134 selects at least one part included in the above-described multiple parts based on information indicating the user operation. Thereafter, the selecting module 134 supplies information indicating a selection result to each component included in the controlling module 130.
The outputting module 135 outputs information to the storage 140 and an outside of the audio device 100. A specific process performed by the outputting module 135 will be described later.
The storage 140 is configured to store the audio data of the music piece with use of, for example an HDD (Hard Disk Drive) and a flash memory. The storage 140 stores the audio data of multiple music pieces in a predetermined format such as a MP3 format. The audio data includes, in addition to audio information, information of, for example, BPM, an artwork, a title, an artist name, an album name, a key, a DJ play count, and a genre of the music piece as tag information. The audio data to be stored in the storage 140 is associated with a time stamp that is information of a play position. Further, the storage 140 also operates as a device for holding a result of the separation process. The holding of the result of the separation process will be described later.
It is to be noted that the audio device 100 may include a communication interface, and may be configured to acquire the audio data stored in an external storage device, a computer, etc., via the unillustrated communication interface and store the audio data in the storage 140. In this case, the audio device 100 does not include the storage 140, and the external storage device operates as the storage 140.
A description is given of a display screen to be displayed on the display 110 when the user performs control related to the separation process on the above-described audio device 100.
For control before the separation process, the display controlling module 131 of the audio device 100 causes the display 110 to display labels representing candidates for parts into which the audio data of the music piece is separable in the separation process that has not been executed, and the operation acquiring module 132 acquires information indicating the user operation of selecting at least a portion of the labels displayed on the display 110.
As illustrated in
It is to be noted that the texts representing the kinds of the parts are the labels identifying the candidates for the parts into which the audio data of the music piece is separable, and the operation acquiring module 132 acquires the presence or absence of a check in each of the checkboxes of the respective parts as the information indicating the user operation.
The user performs an operation via the operation unit 120 based on the information displayed on the display 110, which makes it possible to select a part that the user wants to separate by the separation process before the separation process.
The user performs the operation via the operation unit 120 based on the information displayed on the display 110, which makes it possible to, before the separation process, divide the parts into parts that the user wants to separate by the separation process and other parts.
As described above, the display controlling module 131 causes the display 110 to display the labels representing the candidates for the parts into which the audio data of the music piece is separable in the separation process that has not been executed, and the operation acquiring module 132 acquires information indicating the user operation of selecting at least a portion of the labels displayed on the display 110.
The selecting module 134 selects at least one part from the candidates for the parts into which the audio data of the music piece is separable in the separation process based on the information indicating the user operation acquired by the operation acquiring module 132. Thereafter, the separating module 133 executes the separation process for the part selected by the selecting module 134.
As described above, when the audio device 100 does not include the separating module 133 and the external audio device or the computer, for example, executes the separation process, the outputting module 135 outputs the information indicating the part selected by the selecting module 134 to the external audio device or the computer that executes the separation process.
Next, operations of the respective components during the control before the separation process will be described.
In the example illustrated in
If the operation acquiring module 132 determines that the user operation of selecting the part has been performed (Step S102: YES), the display controlling module 131 updates display contents of the display 110 (Step S103).
The audio device 100 repeats Step S102 and Step S103 until the selection of the part is completed (Step S104: YES).
As described above, the display controlling module 131 of the audio device 100 causes the display 110 to display the labels representing the candidates for the parts into which the audio data of the music piece is separable in the separation process, and the operation acquiring module 132 acquires the information indicating the user operation of selecting at least a portion of the labels. Thereafter, based on the acquired user operation, the selecting module 134 selects at least one part, and the separating module 133 executes the separation process for the selected part. Conventionally, all parts have been subjects of the separation process, for example, the piano sound part has been a subject of the separation process for the audio data in which no piano sound part exists. However, according to the audio device 100 of the exemplary embodiment, it is possible to improve flexibility of a setting of the separation process. It is therefore possible to select a part based on a characteristic of the audio data to be subjected to the separation process, or to select a part based on demand of the user.
For control after the separation process, the display controlling module 131 of the audio device 100 causes the display 110 to display labels representing parts whose audio data of the music piece has been separated in the separation process executed by the separating module 133.
As illustrated in
In the separation process, when there are a part whose audio data of the music piece has been separated and a part whose audio data of the music piece has not been separated, the display controlling module 131 causes the display 110 to display a label representing the part whose audio data of the music piece has been separated and a label representing the part whose audio data of the music piece has not been separated in different expressions. For example, when the piano sound part has not been separated, as illustrated in a display area A7 of
It is possible to determine whether it is the part whose audio data of the music piece has been separated or the part whose audio data of the music piece has not been separated based on whether or not the result obtained by the separation performed by the separating module 133 includes the audio data for each part.
As described above, it is possible to check the result of the separation process by causing the display 110 to display the label representing the part whose audio data of the music piece has been separated in the separation process; however, it is also possible to edit the above-described label on the audio device 100.
The display controlling module 131 of the audio device 100 causes the display 110 to display a screen for receiving a user operation of inputting a label for any of the labels displayed as the results of the separation process, and the operation acquiring module 132 acquires information indicating the user operation of selecting or inputting the label.
As illustrated in
The user selects any part by performing an operation on the list via the operation unit 120, and inputs a label of the selected part by performing an operation on the text box via the operation unit 120. For example, if the piano sound part has been separated in the separation process, but it has turned out to be a guitar sound part according to a sensibility evaluation of the user, in other words, if the guitar sound part has erroneously been separated as the piano sound part, it is possible for the user to correct the label of the piano sound part to the guitar sound part.
It is possible for the user to select a correct label from the selection box displayed on the display 110 by performing the operation via the operation unit 120.
It is to be noted that the respective displays described by referring to FIGS. 6A, 6B, 7A, and 7B may be configured to set the presence or absence of the display by user. For example, in the examples illustrated
Further,
Next, operations of the respective components during the control after the separation process will be described.
In the example illustrated in
If the operation acquiring module 132 determines that the user operation of selecting the part has been performed (Step S202: YES), the operation acquiring module 132 determines whether the user operation of inputting the label via the operation unit 120 has been performed (Step S203).
If the operation acquiring module 132 determines that the user operation of inputting the label has been performed (Step S203: YES), the display controlling module 131 updates the display contents of the display 110 (Step S204).
The audio device 100 repeats the process from Step S202 to Step S204 until the inputting of the label is completed (Step S205: YES). When the inputting of the label is completed (Step S205: YES), the outputting module 135 outputs the inputted label to the storage 140 together with information identifying a part associated with the label (Step S206).
As described above, the display controlling module 131 of the audio device 100 causes the display 110 to display the labels representing the parts whose audio data of the music piece has been separated in the separation process that has been executed, and the operation acquiring module 132 acquires the information indicating the user operation of selecting and inputting the label. Thereafter, based on the acquired information indicating the user operation, the selecting module 134 selects at least one part, and outputs the inputted label to the storage 140 together with the information identifying the part associated with the label. Accordingly, it is possible to edit the result of the separation process based on the user operation. It is therefore possible to bring the result of the separation process closer to a realistic situation, and to improve accuracy of the separation process.
The result of the label inputting described in (2) may be applied to other control in the audio device 100.
For example, when a parameter setting of a filter or an effect is to be performed for each part that has been separated by the separation process, the label of each part upon setting the parameter may be dynamically changed based on the result of the label inputting described in (2).
Further, for example, when various types of editing of the music piece based on the part that has been separated by the separation process are to be performed, the label of each part upon editing may be dynamically changed based on the result of the label inputting described in (2). Such control may also be useful, for example, in performing editing such as muting any part or replacing any part with a different instrument part.
Further, upon performing another process, the label representing the part whose audio data of the music piece has been separated and the label representing the part whose audio data of the music piece has not been separated may be displayed in different expressions.
According to an exemplary embodiment of the invention described above, the labels representing the multiple parts are displayed on the display, the information indicating the user operation of selecting at least a portion of the labels is acquired, and at least one part included in the multiple parts is selected based on the information indicating the user operation. It is therefore possible to achieve high operability and flexibility by performing an easy-to-understand display for the user on the audio device that performs the control related to the separation process.
Further, according to an exemplary embodiment of the invention, the multiple parts include the candidates for the parts into which the audio data of the music piece is separable in the separation process that has not been executed, and the information indicating at least one part that has been selected is outputted to the device that executes the separation process. It is therefore possible to improve the flexibility of the setting of the separation process.
Further, according to an exemplary embodiment of the invention, the multiple parts include the parts whose audio data of the music piece has been separated in the separation process that has been executed. In addition, the information indicating the user operation of inputting the label representing at least a portion of the multiple parts is further acquired, and the inputted label is outputted, together with the information identifying the part associated with the label, to the device that holds the result of the separation process. This makes it possible to edit the result of the separation process based on the user operation.
Further, according to an exemplary embodiment of the invention, the part whose audio data of the music piece has been separated in the separation process that has been executed (i.e., a first part) and the part whose audio data of the music piece has not been separated in the separation process (i.e., a second part) are included, and the labels representing the respective parts are displayed in different expressions. It is therefore possible to perform an easy-to-understand display for the user.
It is to be noted that the separation process described in the above-described exemplary embodiment may be executed in units of music piece or may be executed for a portion of the music piece. For example, a known music analysis process may be performed, and the separation process may be performed for each segment based on development of the music piece. In this case, for the control before the separation process described in (1), the setting of the separation process may be performed for each segment. Further, for the control after the separation process described in (2), the editing of the result of the separation process may be performed for each segment. Such configurations make it possible to perform more detailed control related to the separation process.
Further, the display screens described in the above-described exemplary embodiment are each an example, and the respective elements, designs, arrangements, etc. thereof are not limited to the exemplary embodiment. For example, the label identifying the part may be an illustration rather than the text, or may be a combination of the illustration and the text. Further, the respective elements, designs, arrangements, etc. of the display screens may be settable by the user.
Further, in the above-described exemplary embodiment, the controlling module 130 of the audio device 100 performs the above-described process by reading a program from an unillustrated storage and an unillustrated recording medium and executing the program. However, the invention is not limited thereto, and for example, the controlling module 130 may acquire a program from a device on a network and execute the program. Examples of the recording medium include a disk-type recording medium, an HDD, and a semiconductor memory.
Further, the audio device having the above-described functions is not limited to the examples of the above-described exemplary embodiment, and may be, for example, a mixer or a DJ controller having a mixer function. Still further, the invention may be applied not only to a DJ device and a DJ application, but also to a music application, a streaming service using the Internet, for example. Moreover, the invention may also be applicable to an electronic musical instrument, as well as an audio device such as DAW (Digital Audio Workstation) or DTM (Desk Top Music).
Preferred exemplary embodiments of the invention have been described above in detail with reference to the accompanying drawings, but the invention is not limited to such exemplary embodiments. It is apparent that a person having ordinary skill in the art of the invention can arrive at various alterations and modifications within the scope of the technical idea defined by the appended claims, and it is understood that such alterations and modifications naturally fall within the technical scope of the invention.
100: audio device, 110: display, 120: operation unit, 130: controlling module, 131: display controlling module, 132: operation acquiring module, 133: separating module, 134: selecting module, 135: outputting module, 140: storage
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2021/020240 | 5/27/2021 | WO |