The present invention relates to a processing device, a processing method and a program.
Recently, a digital twin has been widespread. In the digital twin, devices and facilities are constructed on a virtual space, and simulation is performed using the digital information. The digital twin enables design improvement, failure prediction, and the like.
On the other hand, each individual can also be formed as a digital twin of a human in a virtual space using an avatar that is active in the virtual space.
Furthermore, there is a method for modeling various emotions or speaker styles in speech synthesis using HMM (Non Patent Literature 1).
Non Patent Literature 1: Junichi YAMAGISHI, Koji ONISHI, Takashi MASUKO, and Takao KOBAYASHI, “Acoustic Modeling of Speaking Styles and Emotional Expressions in HMM-Based Speech Synthesis”, IEICE TRANS. INF. & SYST., VOL. E88-D, NO. 3 March 2005
In the digital twin, expressing emotions and actions, not by arranging a plurality of the same avatars, but by a plurality of avatars having respective personalities as in people in the real world has been attempted.
In order to impart a personality to an avatar, there is a method for expressing individual emotions and movements on a virtual space after storing facial expressions or actions of a user on a server in advance. However, a human does not always show the same reaction to an external stimulus, and the personality of the user may not be sufficiently exhibited in behaviors of the user reproduced on the virtual space.
Therefore, technology for reproducing natural behaviors of a user in a target such as an avatar has been expected by varying unconscious behaviors such as appearance of a specific habit and breathing according to a conscious behavior of the user and the like.
The present invention has been made in view of the above circumstances, and an object of the present invention is to provide technology for reproducing natural behaviors of a user in a target.
A processing device of an aspect of the present invention includes an acquisition unit that acquires an instruction to specify a conscious behavior of a target that reproduces a behavior of a user, a determination unit that determines a parameter for reproducing an unconscious behavior corresponding to the conscious behavior specified by the instruction from unconscious parameter data in which an identifier of a conscious behavior of the user, an identifier of an unconscious behavior in the conscious behavior, and an index for specifying a parameter for reproducing the unconscious behavior are associated with each other, and an output unit that outputs a determined parameter to a drive unit of the target, in which a parameter for reproducing the unconscious behavior varies depending on the conscious behavior.
A processing method of an aspect of the present invention includes a step in which a computer acquires an instruction to specify a conscious behavior of a target that reproduces a behavior of a user, a step in which the computer determines a parameter for reproducing an unconscious behavior corresponding to the conscious behavior specified by the instruction from unconscious parameter data in which an identifier of a conscious behavior of the user, an identifier of an unconscious behavior in the conscious behavior, and an index for specifying a parameter for reproducing the unconscious behavior are associated with each other, and a step in which the computer outputs a determined parameter to a drive unit of the target, in which a parameter for reproducing the unconscious behavior varies depending on the conscious behavior.
An aspect of the present invention is a program for causing a computer to function as the above processing device.
According to the present invention, technology for reproducing natural behaviors of a user in a target can be provided.
Hereinafter, an embodiment of the present invention will be described with reference to the drawings. In the drawings, the same portions are denoted by the same reference signs, and a description thereof is omitted.
(Processing System)
A processing system 5 according to the embodiment of the present invention will be described with reference to
The target T reproduces behaviors of a user. The target T is driven in accordance with an instruction from a drive unit TD formed by using a computer.
The target T is, for example, a robot that is active in the real world, an avatar that is active in a virtual space, or the like. The target T may be formed by imitating a user himself/herself, or may be formed by imitating a character other than the user, an object other than a human, or the like. The object other than a human may be a creature, or may be any object such as a rock, a tree, a cloud, or a celestial body. In the target T, behaviors of a user may be reproduced by the entire individual of a robot or a human, or behaviors of a user may be reproduced by a part of a robot or a human such as an arm, a face, or a head. Furthermore, the target T may be formed by using a part of the individual such as only a face portion or only an upper body. The robot may be formed from any member such as metal or a member imitating skin. The avatar is controlled by the drive unit TD so as to be active in the virtual space.
Note that, in an example illustrated in
The processing device 1 according to the embodiment of the present invention adds unconscious behaviors reflecting the personality of a user to a conscious behavior reproduced in the target T, thereby reproducing natural behaviors of the user in the target T. At this time, the processing device 1 varies the unconscious behaviors depending on the conscious behavior reproduced by the target T, the situation of the target T, and the like, thereby reproducing more natural behaviors.
In the embodiment of the present invention, behaviors of the target T will be described being distinguished into conscious behaviors and unconscious behaviors. The conscious behaviors are behaviors that a user consciously performs based on user's own decision. The conscious behaviors are specified in advance as behaviors that the target T is caused to perform in instruction data N. The conscious behaviors are, for example, smiling, utterance of “hello”, and “bowing” when making a greeting. On the other hand, the unconscious behaviors are behaviors that a user performs independently of user's own decision. The unconscious behaviors are behaviors added by the processing device 1 when reproducing the behaviors specified in the instruction data N. The unconscious behaviors are physiological movements of breathing, blinking, and the like, an unintentional habit, and the like. Note that the unconscious behaviors of a habit, breathing, blinking, and the like may be specified by the instruction data N. In this case, the processing device 1 adds unconscious behaviors that do not conflict with behaviors specified by the instruction data N.
In the embodiment of the present invention, the target T reproduces behaviors of a user according to an instruction of the drive unit TD, and conscious behaviors by the target T in that case are behaviors estimated to be performed consciously by the user based on user's own decision. The unconscious behaviors by the target T are behaviors estimated to be unconsciously performed when a user consciously behaves based on user's own decision.
When conscious behaviors to be reproduced in the target T are specified, the processing device 1 according to the embodiment of the present invention also reproduces unconscious behaviors reflecting the personality of a user in the target T. The unconscious behaviors are controlled by the processing device 1 so as to vary according to the personality of an individual user. Furthermore, the unconscious behaviors, and the conscious behaviors reproduced by the target T vary depending on the situation of the target T and the like. When the target T reproduces a conscious behavior, the target T also reproduces unconscious behaviors reflecting the personality of a user during the behavior, so that the target T can be caused to reproduce natural behaviors of the user.
(Processing Device)
The processing device 1 acquires instruction data N illustrated in
The instruction data N specifies conscious behaviors of the target T that reproduces behaviors of a user.
An example of the instruction data N will be described with reference to
In
In the instruction data N illustrated in
In the embodiment of the present invention, the situation set in the instruction data N is one of conditions that cause unconscious behaviors. The unconscious behaviors reproduced in the target T may be determined in consideration of a situation in addition to a conscious behavior specified by the instruction data N. For example, unconscious behaviors may be determined from an external situation such as a scene, or the unconscious behaviors may be determined from an internal situation of the target T such as a state. The unconscious behaviors may be determined from a complex situation of external and external situations. Furthermore, the unconscious behaviors may be determined from an internal situation specified from an external situation. For example, an internal situation of “tension increases” is specified from an external situation of “there is an audience listening with no response such as nodding but staring”, and an unconscious behavior of “sweating” is determined from the internal situation of “tension increases”.
A description will be described regarding processing in which the processing device 1 generates motion instruction data M to which unconscious behaviors reflecting the personality of a user are added from the instruction data N in which conscious behaviors of the target T are specified
The processing device 1 includes unconscious parameter data 11, motion data 12, an acquisition unit 21, a determination unit 22, and an output unit 23. The unconscious parameter data 11 and the motion data 12 are data stored in a storage device such as a memory 902 or a storage 903. The acquisition unit 21, the determination unit 22, and the output unit 23 are processing units implemented in a CPU 901.
The unconscious parameter data 11 is data that associates identifiers of conscious behaviors of a user, identifiers of unconscious behaviors in the conscious behavior, and indexes for specifying parameters for reproducing the unconscious behaviors with each other. An identifier of a conscious behavior is data specifying a conscious behavior reproduced by the target T. An identifier of an unconscious behavior is data specifying an unconscious behavior reproduced by the target T. In an example of the unconscious parameter data 11 illustrated in
A parameter for reproducing an unconscious behavior specifies any one or more of a speed, a frequency, and a pattern in the unconscious behavior. For example, for an unconscious behavior “breathing”, data for specifying a breathing speed, a breathing frequency, a breathing pattern, or the like in a predetermined conscious behavior and situation is set as a parameter of the unconscious parameter data 11. The breathing pattern is a pattern of repeating “inhaling” and “exhaling” that varies depending on a conscious behavior of a user or the like, such as a pattern of repeating “inhaling” and “exhaling” and a pattern of repeating “inhaling” twice and then repeating “exhaling” twice.
In the example of the unconscious parameter data 11 illustrated in
Note that,
As illustrated in
As illustrated in
Note that, in the unconscious parameter data 11 illustrated in
In a case where the parameters are only determined from the data set in which the specific action (“action: bowing”) is set, a breathing frequency is 1.1 times higher than the default and a blinking frequency is 0.8 times higher than the default. Furthermore, in a case where the parameters are determined from the two data sets in which the specific action (“action: bowing”) and any setting (“action: -”) are set, the breathing frequency is 1.1*1.2 times higher than the default and the blinking frequency is 0.8*0.8 times higher than the default. In the unconscious parameter data 11, a relation between a conscious behavior and change amounts of parameters may be appropriately set.
Furthermore, although the unconscious parameter data 11 illustrated in
The unconscious parameter data 11 is required to be referred to in order to specify parameters of unconscious behaviors relative to a conscious behavior, and a method for setting the parameters of the unconscious behaviors and a method for calculating values thereof are not specified.
As illustrated in
In an example illustrated in
Note that the unconscious parameter data 11 and the motion data 12 are formed so as to reflect unique behaviors of a user that the target T is caused to reproduce. For example, the unconscious parameter data 11 and the motion data 12 may be provided for each user. Furthermore, default data used for general purposes and data for each user that specifies a difference from the default may be provided.
The acquisition unit 21 acquires the instruction data N that has been described with reference to
The determination unit 22 determines, from the unconscious parameter data 11, parameters for reproducing unconscious behaviors corresponding to conscious behaviors specified by the instruction data N. Here, the parameters for reproducing unconscious behaviors are controlled so as to vary depending on the conscious behaviors.
For the conscious behaviors specified by the instruction data N, the determination unit 22 refers to the unconscious parameter data 11 and acquires unconscious behaviors added to the conscious behaviors and the change amounts of the parameters for reproducing the unconscious behaviors. The determination unit 22 determines the parameters for reproducing the unconscious behaviors by reflecting the change amounts acquired from the unconscious parameter data 11 in default parameters defined in the motion data 12.
In a case where the acquisition unit 21 acquires the situation of the target T, the determination unit 22 may determine parameters for reproducing unconscious behaviors corresponding to the acquired situation of the target T.
In the sequence number #1 of the instruction data N illustrated in
The determination unit 22 adds, to the conscious behavior of the sequence number #1 of the instruction data N, two unconscious behaviors of breathing “chest rises and falls at intervals of (10/1.2) seconds” and blinking “upper eyelid and lower eyelid contact with each other at intervals of (5/0.8) seconds”. Similarly, the determination unit 22 determines parameters for reproducing unconscious behaviors for a facial expression “smiling” and utterance “-” that are other conscious behaviors of the sequence number #1 of the instruction data N.
In a sequence number #2 of the instruction data N illustrated in
Note that the method for determining parameters of unconscious behaviors described herein is an example, and the method is not limited thereto. Unconscious behaviors and the parameters for causing the target T to reproduce the unconscious behaviors may be set according to the conscious behavior, the situation, and the like specified in the instruction data N. Similarly, the determination unit 22 determines parameters for reproducing unconscious behaviors for a facial expression “smiling” and an action “bowing” that are other conscious behaviors of the sequence number #2 of the instruction data N.
The output unit 23 outputs parameters determined by the determination unit 22 to the drive unit TD of the target T. The output unit 23 outputs, for example, the motion instruction data M illustrated in
The motion instruction data M, for each sequence number of the instruction data N, associates identifiers of behaviors to be reproduced by the target T in the sequence with specific movements of the behaviors. For example, in a sequence number #1, unconscious behaviors of breathing, blinking, or the like are added in addition to a conscious behavior of a facial expression “smiling”. Furthermore, specific movements of the unconscious behaviors are calculated from the personality of a user, the conscious behavior, and the situation of the target T. In a sequence number #2, unconscious behaviors of breathing and blinking are added in addition to a habit of utterance of “uh” after a conscious behavior of utterance of “hello”. The “uh” after the utterance of “hello” is added as an unconscious behavior of the user.
A processing method by the processing device 1 will be described with reference to
In step S1, the processing device 1 acquires instruction data N in which conscious behaviors and situations are specified. Processing from step S2 to step S3 is repeated for each of the conscious behaviors specified by the instruction data N.
In step S2, the processing device 1 determines whether there is a setting for a conscious behavior to be processed in the unconscious parameter data 11. For example, in the unconscious parameter data 11, whether there is a specific behavior specified as the conscious behavior or “-” specified as any behavior in the instruction data N. In a case where there is no setting for the conscious behavior, since there is no unconscious behavior to be added by the processing device 1, processing of step S2 is performed for the next conscious behavior.
In step S2, in a case where there is a setting for the conscious behavior to be processed in the unconscious parameter data 11, the processing device 1 determines parameters for reproducing unconscious behaviors in the target T from the unconscious parameter data 11 and motion data 12 in step S3.
In a case where processing from step S2 to step S3 ends for each of the conscious behaviors specified by the instruction data N, in step S4, the processing device 1 reflects the parameters determined in step S3 in respective behaviors and generates motion instruction data M. The motion instruction data M generated here is data in which unconscious behaviors reflecting the personality of a user are added to the conscious behaviors specified by the instruction data N.
In step S5, the processing device 1 outputs the motion instruction data M generated in step S4 to the drive unit TD of the target T. Since the target T can be driven in accordance with the motion instruction data M, natural behaviors reflecting the personality of the user can be performed.
The processing device 1 can generate motion data 12 to which unconscious behaviors reflecting the personality of the user are added according to conscious actions, situations, and the like. As a result, unique and natural behaviors reflecting the personality of the user can be reproduced in the target T.
The processing device 1 of the present embodiment described above is, for example, a general-purpose computer system including a central processing unit (CPU, processor) 901, the memory 902, the storage 903 (hard disk drive (HDD), solid state drive (SSD)), a communication device 904, an input device 905, and an output device 906. In the computer system, by the CPU 901 performing a predetermined program loaded on the memory 902, each function of the processing device 1 is implemented.
Note that the processing device 1 may be implemented by one computer, or may be implemented by a plurality of computers. Note that the processing device 1 may be a virtual machine that is implemented in a computer.
The program for the processing device 1 can be stored in a computer-readable recording medium such as an HDD, an SSD, a universal serial bus (USB) memory, a compact disc (CD), or a digital versatile disc (DVD), or can be distributed via a network.
Note that the present invention is not limited to the above embodiment, and various modifications can be made within the scope of the gist of the present invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2020/039944 | 10/23/2020 | WO |