This application claims priority to Japanese Patent Application No. 2023-018087 filed on Feb. 9, 2023, incorporated herein by reference in its entirety.
The present disclosure relates to a power assist system and a storage medium.
In recent years, the development of a power assist device that assists the action of a wearer has been advanced. For example, a related technology is disclosed in Japanese Patent No. 6947613.
Japanese Patent No. 6947613 discloses a power assist device using an assist-as-needed (AAN) control technique in which the assist for the wearer is small when a following error between a trajectory of the wearer's body and a target trajectory is small and in which the assist for the wearer is larger as the following error is larger.
The AAN control disclosed in Japanese Patent No. 6947613 is a control to follow the target trajectory designated in advance, and therefore a wearer cannot freely manipulate a robot according to the wearer's intention. In order to enable the wearer to freely manipulate the robot according to the wearer's intention, it is necessary to measure an interference force that acts between the wearer who indicates the intention and the robot. However, generally, in the wearable power assist system, the measurement of the interference force by a sensor is difficult except in a case where the wearer himself/herself can exactly operate the signals of a force sensor.
The present disclosure has been made in view of the above circumstance, and has an object to provide a power assist system and a storage medium that allow a power assist according to the intention of a user that wears a robot for which the measurement by a force sensor is difficult, by accurately estimating the interference force that is given from the user to the robot.
A power assist system according to the present disclosure includes: a robot that is worn by a user, the robot including a drive source that assists action of the user; an interference force estimation unit that estimates an interference force given from the user to the robot; an admittance model that generates a target velocity in a virtual object having a predetermined dynamic characteristic, the target velocity corresponding to the interference force estimated by the interference force estimation unit; and a control unit that drives the drive source by a control torque that allows the velocity of the robot to follow the target velocity, in which the interference force estimation unit includes: a disturbance observer that detects a disturbance based on the momentum of the robot; and an estimation unit that estimates the interference force depending on the disturbance detected by the disturbance observer, using an ESN learning model for which learning about the interference force depending on the disturbance is performed while weights in a reservoir layer are stochastically assigned. The power assist system accurately estimates the interference force that is given from a user that wears the robot, to the robot. Thereby, the power assist system can realize a high-accuracy admittance control, and therefore can perform a power assist according to user's intention.
A storage medium storing a control program according to the present disclosure is a storage medium storing a control program causing a computer to execute an assist process by a power assist system including a robot that is worn by a user, the robot including a drive source that assists action of the user, the control program causing the computer to execute: a process of estimating an interference force given from the user to the robot; a process of generating a target velocity in a virtual object having a predetermined dynamic characteristic, the target velocity corresponding to the interference force; and a process of driving the drive source by a control torque that allows the velocity of the robot to follow the target velocity, the control program causing the computer to, in the process of estimating the interference force, execute: a process of detecting a disturbance based on a momentum of the robot by a disturbance observer; and a process of estimating the interference force depending on the disturbance detected by the disturbance observer, using an ESN learning model for which learning about the interference force depending on the disturbance is performed while weights in a reservoir layer are stochastically assigned. The control program accurately estimates the interference force that is given from a user that wears the robot, to the robot. Thereby, the control program can realize a high-accuracy admittance control, and therefore can perform a power assist according to user's intention.
The present disclosure can provide a power assist system and a storage medium that allow a power assist according to the intention of a user that wears a robot for which the measurement by a force sensor is difficult, by accurately estimating the interference force that is given from the user to the robot.
Features, advantages, and technical and industrial significance of exemplary embodiments of the disclosure will be described below with reference to the accompanying drawings, in which like signs denote like elements, and wherein:
The present disclosure will be described below with embodiments of the disclosure. The disclosure according to the claims is not limited to the embodiments described below. Further, all configurations described in the embodiments are not necessary as means for solving the problem. For clear explanation, in the following descriptions and drawings, omissions and simplifications are performed when appropriate. In the drawings, identical elements are denoted by identical reference characters, and repetitive descriptions are omitted as necessary.
As shown in
The robot 11 is worn by a user U1 that needs action assist, and assists the motion of the user U1. For example, the robot 11 includes support members that support sites of the lower limbs of the user U1 and rotation members that are provided so as to correspond to joints of the lower limbs of the user U1. Each rotation member connects two support members of the support members in a rotatable manner, and includes an actuator (drive source) that drives and rotates the two support members, as exemplified by a servomotor, and an angle sensor such as a rotary encoder. The robot 11 is not limited to the attachment to the lower limbs of the user U1, and may be attached to the upper limbs of the user U1 or the upper limbs and lower limbs of the user U1.
The interference force estimation unit 12 estimates an interference force that is a force given from the user U1 that wears the robot 11 to the robot 11. Accordingly, it can be said that the interference force indicates the intention about the action of the user U1. Specifically, the interference force estimation unit 12 includes an MBO 121, an ESN 122, and an estimation unit 123.
The MBO 121 is a kind of disturbance observer, and outputs a variable S corresponding to the interference force, based on the momentum of the robot 11. MBO is an abbreviation for Momentum Based Observer. The ESN 122 is an ESN learning model that performs learning about an interference force ⋅τext depending on the variable S while weights in a reservoir layer in a reservoir computing learning are stochastically assigned. ESN is an abbreviation for Echo State Network. The estimation unit 123 estimates an interference force τext(t) depending on the variable S output from the MBO 121, using the learning model of the ESN 122.
In the embodiment, an example in which the MBO that is a kind of disturbance observer is used is described, but without being limited to this, a kind of disturbance observer other than the MBO may be used. Further, in the embodiment, an example in which the ESN that is a kind of reservoir computing technique is used is described, without being limited to this, a kind of reservoir computing technique other than the ESN may be used.
The admittance model 13 generates a target velocity Vd in a virtual object having a predetermined dynamic characteristic, and the target velocity Vd corresponds to the interference force estimated by the interference force estimation unit 12. The arithmetic unit 14 calculates an actual velocity q of the robot 11, from detection results of angle sensors and others attached to the robot 11. The subtracter 15 outputs a velocity error (following error) r that is the difference between the target velocity Vd and the actual velocity q of the robot 11.
The NN base adaptive control unit 16 is an adaptive control system that is configured using a neural network. Specifically, the NN base adaptive control unit 16 assures a uniform ultimate boundedness in which the velocity error r={dot over (q)}−Vd(t) is within a certain range even when a physical parameter in the dynamics between the user U1 and the robot 11 is not known, by an adaptive law (adaptive term) in the neural network.
Here, when q, {dot over (q)}, {umlaut over (q)}∈Rn are respectively a joint angle, angular velocity and angular acceleration in a human-machine system in which the user U1 and the robot 11 are integrated and that is configured by n joints, a motion equation for the system is expressed as the following Expression (1).
Further, τext, to be precise, is described as an interference torque that acts between the user U1 and the robot 11, in accordance with the definition in mechanics, but is descried as the interference force, hereinafter, as long as misunderstanding is not given, because there is only a relation of linear transformation using a Jacobian matrix or the like, between force and torque.
Further, τ indicates control torque.
The MBO 121 estimates the interference force τext, using the generalized momentum of the robot 11, as described below.
First, when the generalized momentum is p=M(q){dot over (q)}, the MBO 121 is designed such that the following Expression (2), Expression (3) and Expression (4) are satisfied.
K0∈Rn+n indicates an observer gain. The observer gain is a positive definite matrix.
Thereby, the following Expression (5) is satisfied.
Accordingly, in the case of Td=0, S→τext is assured. The MBO 121 can sense the contact between the user U1 and the robot 11 with a high sensitivity. In the construction of the MBO 121 that is the disturbance observer, an accurate mathematical model for the user U1 and the robot 11 is necessary. However, in the power assist system in which the robot is worn, an accurate modeling is impossible, and there is always a modeling error. The system is strongly influenced by the modeling error, so that a large estimation error is generated. For compensating the estimation error that is generated by the MBO, the interference force estimation unit 12 employs a hybrid learning technique that is one reservoir computing technique ant in which an ESN model is combined with the MBO.
Next, a learning model that is included in the estimation unit 123 and that is called an ESN will be described.
Win∈RNx×Nu indicates an input connection weight.
Wr∈RNx×Nx indicates a recurrent connection weight.
Wout∈Rn×(1+Nu×Nx) indicates an output connection weight.
In the case where the state of the reservoir is x(t)∈RNx, when {tilde over (x)}1∈Rp and {tilde over (x)}2∈RNx-p are satisfied and x [{tilde over (x)}1T{tilde over (x)}2T]T is satisfied, respective time evolutions of {tilde over (x)}1 and {tilde over (x)}2 are expressed as the following Expression (6) and Expression (7).
Here, x|ba indicates a vector resulting from cutting a vector x from an a-line to a b-line. That is, the dimension of {tilde over (x)}1 is p, and the dimension of {tilde over (x)}2 is Nx−P. In Expression (6) and Expression (7), p is a parameter that gives a mixing ratio (p/Nx) between a non-linear dynamics and a linear dynamics.
Wb∈RNx indicates a bias term.
Further, when α∈[0,1] is a leak rate, the following Expression (8) is satisfied.
As the leak rate α is higher, the contribution rate of new input information is lower, and as the leak rate α is lower, the contribution rate of the new input information is higher.
Furthermore, when X(t)=[1uT(t){tilde over (X)}T(t)]T is defined, the output ŷ(t) is calculated by the following Expression (9). T at a right shoulder of a vector or matrix indicates the transposition of the vector or matrix.
In the reservoir computing learning, only the learning of the output connection weight is performed. Other connection weights are initialized by random numbers, and are fixed. In the learning of the output connection weight Wout a least squared method is generally used, but in the embodiment, the learning is performed as described below, using teaching signals (u(t), y(t)) and an iterative least squares method.
P(t)∈R(1+Nu+Nx)×(1+Nu+Nx) indicates an update matrix, and β indicates an arbitrary positive constant. I is an identity matrix of (1+Nu+Nx)×(1+Nu+Nx).
When the ultimate point of W_ in the iterative learning is W_(tend), the output connection weight for calculating the output ŷ(t) from the input signal u(t) is Wout=WT_(tend).
In an example in
In the case of each of the learning mode and the estimation mode, the input signal u(t) to the ESN 122 is u(t)=[ST(t)qT(t){dot over (q)}T(t)τT(t)]T. Further, the output y(t) of the teaching signal in the learning mode is y(t)=text(t). Moreover, in the estimation mode, the output ŷ(t) of the ESN 122 gives an estimated value ŷ(t)={circumflex over (τ)}ext(t) of the interference force τext.
On the other hand, as shown in
In this way, the power assist system 1 according to the embodiment can accurately estimate the interference force that is given from the user that wears the robot, to the robot, by using the combination of the MBO 121 and the ESN 122 instead of using the force sensor. Thereby, the power assist system 1 according to the embodiment can realize a high-accuracy admittance control, and therefore can perform a power assist according to user's intention.
The present disclosure is not limited to the above embodiment, and can be appropriately modified without departing from the spirit.
In the embodiment, the example in which the series estimation method in which the output of the MBO 121 is input to the ESN 122 is employed in the interference force estimation unit 12 has been described, but the present disclosure is not limited to this. In the interference force estimation unit 12, a parallel estimation method in which the MBO 121 and the ESN 122 are used in parallel may be employed. The description will be briefly made below with use of
The power assist system 1 in which the parallel estimation method shown in
Furthermore, in the present disclosure, some or all of assist processes by the power assist system 1 can be realized by causing a central processing unit (CPU) to execute computer programs.
The above programs include commands (or software codes) for causing a computer to perform one or more functions described in the embodiments when the above programs are read by the computer. The programs may be stored in a non-transitory computer-readable medium or a tangible storage medium. Although being not limited, examples of the non-transitory computer-readable medium or the tangible storage medium include memory technologies such as a random-access memory (RAM), a read-only memory (ROM), a flash memory, and a solid-state drive (SSD), optical disc storages such as a CD-ROM, a digital versatile disc (DVD), and a Blu-ray® disc, and magnetic storage devices such as a magnetic cassette, a magnetic tape, and a magnetic disc storage. The programs may be sent through a transitory computer-readable medium or a communication medium. Although being not limited, examples of the transitory computer-readable medium or the communication medium include an electric propagation signal, an optical propagation signal, an acoustic propagation signal, or other types of propagation signals.
Some or all of the above embodiments can be described as the following supplement, but are not limited to the following supplement.
Number | Date | Country | Kind |
---|---|---|---|
2023-018087 | Feb 2023 | JP | national |