1. Field
Embodiments presented herein provide techniques for simplifying models of robots, and, in particular, for systematically deriving simplified models of humanoid robots.
2. Description of the Related Art
In humanoid robot control, simplified dynamics models are often used to represent the robot, as it is difficult to design controllers that control full dynamics models having many degrees of freedom (DOF). Typically, simplified models have fewer DOF than full models and are linearized in order to apply linear control theory. Examples of simplified models include the one-joint inverted pendulum model, the two-joint inverted pendulum model, the cart-table model, the inverted pendulum with reaction wheel, the double inverted pendulum, and the linear biped model. Conventionally, controller developers formulate these models manually based on their intuition. This approach is difficult to generalize, and it is not always clear how to determine parameters of a simplified model, or if the simplified model captures the essential dynamics of the full model. Further, model-specific programs, each of which can be employed with particular model(s), may be required to compute the state of the simplified models. For example, a one-joint inverted pendulum model may use the center-of-mass (COM) of the entire body and thus require a different program than a two joint inverted pendulum model that uses separate COMB for the upper and lower body. In addition, joints of simplified models may not correspond to physical joints, so converting between input torques of simplified models and joint torques of full models may not be straightforward, especially since no systematic approach exists for performing such a conversion.
One embodiment of the invention includes a method for simplifying a robot model. This method may generally include performing a singular value decomposition of an inertial term of the robot model and determining singular values and corresponding singular vectors to keep in an inertial term of a first simplified model by matching a kinetic energy of the robot model and a kinetic energy of the first simplified model.
In a particular embodiment, this method may further include linearizing the robot model around a first nominal state and may further include determining a gravitational forces term and a velocity-dependent forces term of the first simplified model by computing joint torques at sample poses around the first nominal state and solving for the gravitational forces term and the velocity-dependent forces term. In yet a further embodiment, this method may also include controlling the first simplified model using a first controller and determining joint torques and expected contact forces for the robot model from an input and reference trajectory of the first simplified model by optimizing a cost function.
Other embodiments include a computer-readable medium that includes instructions that enable a processing unit to implement one or more aspects of the disclosed methods as well as a system configured to implement one or more aspects of the disclosed methods.
So that the manner in which the above recited aspects are attained and can be understood in detail, a more particular description of aspects of the invention, briefly summarized above, may be had by reference to the appended drawings.
It is to be noted, however, that the appended drawings illustrate only typical aspects of this invention and are therefore not to be considered limiting of its scope, for the invention may admit to other equally effective aspects.
Embodiments disclosed herein provide techniques for systematically determining simplified models of humanoid robots. As used herein, a model includes one or more equations with mass and dynamic properties of a robot, having joint torques as inputs and motion of the robot as output. Techniques disclosed herein permit a model having degrees of freedom (DOF) to be simplified to a model having a user-specified number of DOF k, where k: is less than A simplification application linearizes the robot model around a nominal state and performs a singular value decomposition of an inertial term of the model, selecting singular values and corresponding singular vectors to be kept in an inertial term of a simplified model by matching a kinetic energy of the original model to a kinetic energy of the simplified model. In one embodiment, the inertial term may be an inverse inertial matrix of a system constrained by contact constraints, and the smallest k nonzero singular values and their corresponding singular vectors may be kept. Further, a gravitational forces term and a velocity-dependent forces term may be determined by computing active joint torques at sample poses around the nominal pose and solving for the gravitational forces term and the velocity-dependent forces term. A mapping from the simplified model to the original model may be determined using, e.g., numerical optimization. The simplified model may then be controlled using a controller which, based on the current state of the simplified model, computes input(s) to the model needed to achieve a given control objective (e.g., returning to the nominal pose). In one embodiment, the controller may be an infinite-horizon linear quadratic regulator, and an observer may be used to estimate the state of the simplified model based on measurements from the robot. In addition, joint torques of the original model may be sent to joint controllers of the robot being modeled to cause the robot to move. Note, although discussed primarily with respect to humanoid robots, techniques disclosed herein may be applied to other types of robots (e.g., other legged robots) as well.
The following description references aspects of the disclosure. However, it should be understood that the disclosure is not limited to specific described aspects. Instead, any combination of the following features and elements, whether related to different aspects or not, is contemplated to implement and practice the disclosure. Furthermore, although aspects of the disclosure may achieve advantages over other possible solutions and over the prior art, whether or not a particular advantage is achieved by a given aspect is not limiting of the disclosure. Thus, the following aspects, features, and advantages are merely illustrative and are not considered elements or limitations of the appended claims except where explicitly recited in a claim(s). Likewise, reference to “the disclosure” shall not be construed as a generalization of any inventive subject matter disclosed herein and shall not be considered to be an element or limitation of the appended claims except where explicitly recited in a claim(s).
Aspects of the present disclosure may be embodied as a system, method or computer program product. Accordingly, aspects of the present disclosure may take the form of an entirely hardware aspect, an entirely software aspect (including firmware, resident software, micro-code, etc.) or an aspect combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects of the present disclosure may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus or device.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality and operation of possible implementations of systems, methods and computer program products according to various aspects of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. Each block of the block diagrams and flowchart illustrations, and combinations of blocks in the block diagrams and flowchart illustrations can be implemented by special-purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
Where one or more links of the humanoid robot 100 are in contact with the environment, enforcing m independent constraints, and the robot 100 has n DOF, 6 of which correspond to the translation and rotation of the base body and are thus not actuated, the full dynamics of the robot 100 may be described by the following equation:
M(θ)θ+c(θ, θ)+g(θ)=STT+fCT(θ)fC, (1)
where
θ ∈ RTh: generalized coordinates
τ C Rn−6: active joint torques
fC ∈ RM: contact forces
M(θ) ∈ Rm×n: joint-space inertia matrix
c(θ, θ) ∈ RM: centrifugal and Coriolis forces
g(θ) ∈ RM: gravitational force
IC(θ) ∈ Rm×n:contact Jacobian matrix
and ST ∈ Rm×(n−6) is the matrix that converts active joint torques to generalized forces and typically has the form
where 0, and 1, represent the zero and identity matrices of the size indicated by the subscript. The contact constraint may be represented as
I
C
θ+f
Cθ=0m, (3)
Here, the relationship between generalized acceleration:6 and active joint torques τ may be obtained by: 1) solving equation (1) for {umlaut over (θ)} and plugging in the result into equation (3); 2) solving the resulting equation for fC; 3) plugging fC back into equation (1). Doing so results in
{umlaut over (θ)}=Φ(θ)ST ε+φ(θ, θ) (4)
where
Φ(θ)=M−1−M−1fCT(fCM−1fCT)−1fCM−1
φ(θ, θ)=−M−1fCt(fCM−1fCT)−1fCθ−Φ(c+g).
Equation (4) describes the relationship between active joint torques and joint accelerations (including the base body), assuming that the contact constraints are satisfied. Note that while M is positive definite, Φ is only positive semi-definite due to the contact constraints, and therefore Φ−1 cannot be computed. This property corresponds to the fact that a {umlaut over (θ)} that violates the contact constraints cannot be generated by any generalized force.
Panels B-C show steps in a process for simplifying the model of equation (4), according to an embodiment. The objective of simplification is to approximate the dynamics of the humanoid robot with n degrees of freedom (DOF), and possibly subject to contact constraints, by a k (<<n)-DOF linear system
{umlaut over (M)}{umlaut over (q)}+{umlaut over (C)}{dot over (q)}+{umlaut over (C)}q=u (5)
where q ∈ Rk is the generalized coordinate of the simplified model, u ∈ Rk is the input force to the simplified model, {umlaut over (M)} ∈ Rk×k is a symmetric, positive-definite inertia matrix, and {umlaut over (C)} ∈ Rk×k and {umlaut over (G)} ∈Rk×k are constant matrices. Note, the value of k may be entered by a user or, alternatively, automatically determined. For the user may set a threshold for singular values, discussed in greater detail below, and the simplification application may automatically determine singular values that satisfy this condition. As another example, the simplification application may be configured to select singular values with large gaps between them.
Because {umlaut over (M)} is positive definite, equation (5) may be written as
{umlaut over (q)}=Φu−Φ{umlaut over (C)}q−Φ{umlaut over (C)}{dot over (q)} (6)
where Φ−M−1. The state-space differential equation model may then be derived as
{dot over (x)}=Ax+Ru (7)
by defining
To determine a simplified model of form in equation (7) for a given full dynamics model, a simplification application may first linearize the non-linear equation (4) for the full model around a given nominal state (θT, θT)T=(θ0T, θnT)T, as the ultimate goal is to derive a linear model. Then, the simplification application may determine the {circumflex over (Φ)} matrix by performing singular value decomposition of the inverse inertial matrix of the constrained system Φ0 and selecting particular singular values and vectors to keep. Illustratively, panel B shows components of a singular vector that corresponds to a smallest non-zero singular value (i.e., the largest inertia) for a standing nominal pose. As discussed in greater detail below, the smallest k non-zero singular values and their corresponding singular vectors may be kept for a given nominal pose so as to minimize the difference in kinetic energy between the original model and the simplified model.
As shown, singular vector elements of the smallest non-zero singular value are represented by solid and dashed bars starting from corresponding joints. For each rotational joint and the base body, a solid bar parallel to an axis of a rotational degree of freedom is depicted with length proportional to the magnitude of the corresponding singular vector element. If an element has negative value, its corresponding bar points in a direction opposite the axis. For example, bar 130 shows that a singular vector element corresponding to hip rotation has a large positive value, indicating a rotation of the hip that causes the robot to lean forward in the standing nominal pose. Dotted bars represent singular vector elements corresponding to linear degrees of freedom of the base body of the robot. For example, dotted bar 140 shows that a singular vector element corresponding to hip translation has a large positive value, indicating a translation forward in the standing nominal pose. In selecting singular values, the simplification application may attempt to minimize the difference between a kinetic energy of the simplified model and a kinetic energy of the full model, as discussed in greater detail below.
As shown in panel C, the simplification application may determine the Ĝ and Ĉ matrices by computing active joint torques z at random poses around the nominal pose, thereby obtaining linear models of the gravitational forces term Ĝ and the velocity-dependent forces term Ĉ. In one embodiment, a sampling-based numerical approach may be used. As discussed in greater detail below, the Ĝ and Ĉ matrices may be determined by computing joint torques via inverse kinematics at sample poses around the nominal pose, then solving for Ĝ and Ĉ. The state-space model of equation (7) may then be obtained by plugging in the Ĝ, Ĉ, and Φ0 matrices.
Panel D shows an inverted pendulum model 140, which is similar to a simplified model that uses the first singular value discussed above. Illustratively, the inverted pendulum model 140 includes one joint 150 on the ground, and has center of mass 160 above the joint 150. As discussed, the singular vector for a smallest non-zero singular value corresponds to leaning forward, which is similar in motion to that of the inverted pendulum model 140. Experience has shown that the standing nominal pose has other non-zero singular values that correspond to leaning left (similar to a single-inverted pendulum in the coronal plane), bending the upper body to the left (similar to the second joint of the two-joint inverted pendulum model in the sagittal plane), twisting the body to the right (not similar to any traditional simplified model), bending the upper body forward (similar to the second joint of the two-joint inverted pendulum model in the coronal plane), swinging both arms inwards (similar to the changing the inertia around the center of mass (COM), which is modeled in the Reaction Mass Pendulum model), swinging both arms backwards, and swinging both arms to the right (the last two of which are similar to shifting the COM without changing the contact point, which is modeled in the cart-table model). In general, k smallest non-zero singular values and corresponding singular vectors may be kept to obtain a simplified model having k DOF. Note, the motions corresponding to the remaining singular values and vectors may be lost in such a simplified model.
As discussed in greater detail below, a mapping from the simplified model to the original model may be determined using, e.g., numerical optimization. The simplified model may then be controlled to perform a motion (e.g., returning to the nominal pose) using a controller, such as an infinite-horizon linear quadratic regulator. In addition, joint torques may be sent to joint controllers of the robot being modeled to cause the robot to move.
θn=Φ0STτ0+φ0 (8)
where Φ0=Φ0(θ0) and φ0=φ(θ0, θn). When the pose and velocity change by small amounts δθ and δθ, respectively, the equation of motion becomes
δ{umlaut over (θ)}=Φ(θ0δθ)STτ+φ(θ0+δθ,δθ). (9)
Defining τ=τ0+δτ, using equation (8), and omitting second order terms of small changes gives
where ΦTt=δ(Φ0Stτu)/δθ, φpj=δφ/δθ, and ΦKJ=δφ/δθ.
At step 220, the simplification application performs a singular value decomposition (SVD) of the inverse inertial matrix for constrained system. Because Φ0 is symmetric and positive semi-definite, its SVD results in
Φ0=UΣUT (11)
where U ∈ RM×n is an orthogonal matrix containing singular vectors and Σ ∈ Rm×n is a diagonal matrix whose diagonal elements σt(t=1,2, . . . , n) are the singular values of Φ0 and are sorted in the descending order (σ1≧σ2≧ . . . ≧σn≧0).
Φ0 may be approximated by selecting: non-zero diagonal elements of Σ and corresponding columns of U such that
Φ0+Φ0=020T (12)
where Ũ∈ Rm×n and Σ ∈ Rk×k. Plugging equation (12) into equation (10) yields
δθ=020TSTδτ+Pδθ+Aδθ (13)
Because 0T0−1k×k , left-multiplying both sides of equation (13) by 0T gives
0τδθ=Σ0τSTδτ+0TPδθ+0TAδθ (14)
Note, a mapping from the generalized coordinates of the simplified model to those of the full model may be formulated based on the power that inputs to the full model and the simplified model do. Let a mapping from the full model configuration to the simplified model configuration be defined by
δθ=Ûq (15)
A possible inverse mapping may then be
q=0Tδθ. (16)
The input torque mapping may be based on the power that inputs to the full model and the simplified model do. The power that the inputs to the full model do is
δθTSττ=ĈTUTSτ0τ. (17)
This power can be matched to the power of the simplified model by mapping the inputs by
u=UτSτδε. (18)
Plugging into equation (14) gives
{umlaut over (q)}=Σu+U
TΓδθ+UτAδθ (19)
Note, the right-hand side of equation 19 has the same form as equation (6), with
{circumflex over (M)}
−1={umlaut over (Φ)}=Σ (20)
{umlaut over (G)}=−{umlaut over (Σ)}
−1
ΓU (21)
{umlaut over (C)}=−Σ
−1Λ
U (22)
because {circumflex over (Σ)} is a diagonal matrix with positive elements.
At step 230, the simplification application selects singular values and corresponding singular vectors from the SVD by minimizing a difference in kinetic energy between the full and the simplified models, and determines the inertial term {circumflex over (Φ)} in equation (6). A commonly-used technique for dimensionality reduction keeps larger singular values of and corresponding singular vectors of U. However, this technique may not preserve the essential properties of a full dynamics model.
The dynamics of a physical system is often characterized by kinetic energy. Given that Φn is the inverse inertia matrix of the constrained system and is singular, let
0=UEUT (23)
where Ē is a diagonal matrix whose diagonal elements are nonzero singular values of Φ0 and Ū is the matrix composed of the singular vectors corresponding to the nonzero singular values. The inverse of
0
−1
=ŪΣ
−1
Ū
T (24)
which is essentially the inertia matrix of the constrained system, as the robot cannot move in the directions of the singular vectors of the zero singular values. The kinetic energy is therefore
On the other hand, based on equations (12), (16), and (20), the kinetic energy of the simplified model is
In order to match the kinetic energies and ‘t, the simplification application may attempt to make
At step 240, the simplification application determines the gravitational forces and velocity-dependent forces e matrices in equation (6) by computing active joint torques r at poses around a nominal pose. In one embodiment, the simplification application may determine Ĝ by randomly sampling a number of poses around the nominal pose θ0 and computing the joint torques for realizing θ=00 and θ=00 at each sample via inverse kinematics. Let the difference between θ0 and the t-th sample pose be Δθt. Using inverse kinematics, the simplification application may modify to Δθlt so that the pose θ0+Δθlt satisfies the contact constraints. The simplification application may then determine joint torques T, required to produce zero accelerations. Using the mapping equations (16) and (18), the joint torques r may be converted to those of the simplified model and plugged into equation (5), giving
ĜÛTΔθtt=0TSTτt. (27)
Based on equation (27), the simplification application may collect Δθtt and τi values from a number of random sample poses around the nominal pose θu and solve a linear equation in the elements of Ĝ to obtain a that best fits the samples.
Similarly or the ĉ matrix, the simplification application may sample Δθ as well as Δθ from a number of random poses around the nominfal pose θu. The simplification application may then determine accelerations that satisfy the kinematic constraints of equation (3), as opposed to the zero accelerations for determining {tilde over (G)}, and compute the corresponding joint torques. Because the {tilde over (G)} matrix and the d matrix are known from earlier steps, a linear equation may be formed in the elements of {tilde over (C)}. The simplification application may solve such a linear equation to determine the elements of.
Once Ē, Ĝ, and Ĉ are known, the linear state-space model of equation (7) may be obtained by using these matrices in equation (7).
At step 330, a controller controls the simplified model. In one embodiment, infinite-horizon linear quadratic regulators (LQRs) are used for controlling the simplified models. An LQR determines the input by state feedback u−−Kx, where K is a constant gain matrix. K is determined such that the closed loop system is asymptotically stable and a cost function
I=∫
0
x(xTQx+uTRU)dt (28)
is minimized, where Q is a positive semi-definite matrix and R is a positive definite matrix.
A full-state observer may be used by the simplified model to estimate the state of the simplified model based on the measurements from a robot. Joint angle measurements give δθ in equation (16), which then gives the joint angles of the simplified model q. The observer may take q as a measurement and estimate the state q and {circumflex over (q)} of the simplified model. In one embodiment, the full-state observer may first compute the difference of the simplified model's pose from the nominal pose for the controller, and then multiply this difference by 0T. The observer's gain may be computed by pole assignment. Note, the state estimation does not require contact force measurement.
At step 340, the simplification application determines a mapping from the simplified model back to the full model. The input mapping of equation (18), namely u=0TSτδτ, gives a unique mapping from the full model to the simplified model, but the reverse mapping is not unique. As a result, other factors may be considered to determine joint torques for the full model.
In one embodiment, the controller designed for the simplified model may compute desired input u to to the simplified model, and an optimization problem may be formulated for computing the joint torque τ and expected contact forces fc of the full model, taking into account reference trajectories for the joints because the controller for the simplified model may not consider individual joints. An example cost function for the optimization problem may be
Where the first and second terms address the desired input u″ and reference trajectories, respectively, and the last two terms are damping terms with constant, positive-definite weight matrices WT and WC for the joint torque and contact forces, respectively. In one embodiment, these weight matrices may be user-specified. Za may be defined as
Z
u=(un−0Tτ)TWu(un−UTτ) (30)
where Wu is a constant weight matrix that may also be user-specified. To determine joint trajectories, joint acceleration may first be determined by
θn=θref+kd(θref−0)+ky(θref−0) (31)
and defining Za as
Z
a=(θn−θ)TWa(θn−θ) (32)
where Wa is a constant weight matrix. The final optimization problem is then to find joint torque τ and expected contact forces fc that minimize the cost function of equation (29), subject to the equation of motion (1). Optionally, constraints on joint torques may be considered to enforce joint torque limitations. fc may also be constrained such that it satisfies the constraints on vertical forces, center of pressure, and friction.
At step 350, the controller sends commands indicating the joint torques, mapped from the simplified model to the full-dynamics model according to the mapping determined at step 340, to a joint controller capable of configuring the articulated link positions of a humanoid robot by applying the joint torques, thereby causing the robot to move according to the modeled motion. In one embodiment, multiple models and controllers with different nominal poses and contact constraints may be used to generate complex motions, such as stepping. The conditions for switching between controllers may be manually (or automatically) defined according to, e.g., the number of contacts of current and next models. For example, if the next model has fewer contact links than the current model, the switch may take place when the vertical contact for a foot to be lifted is under a given force measurement for a given number of frames. Similarly, if the next model has more contact links, a switch may take place when the vertical contact force at the foot touching down is above a given force measurement threshold for a given number of frames, in order to confirm that the new contact is established. Note, the controller may not achieve the contact force condition for switching to another controller. In one embodiment, this problem is solved by increasing the elements of Wc in equation (29) that correspond to the foot to be lifted when the vertical contact force for the foot is lower than a given threshold. Similarly, if the contact force at a foot needs to be larger in order to establish a contact, the weights corresponding to the other foot may be increased when the vertical contact force rises above a given threshold.
The CPU 410 retrieves and executes programming instructions stored in the memory 460. Similarly, the CPU 410 stores and retrieves application data residing in the memory 460. The interconnect 615 facilitates transmission, such as of programming instructions and application data, between the CPU 410, IO device interface 440, storage 420, network interface 430, and memory 460. CPU 410 is included to be representative of a single CPU, multiple CPUs, a single CPU having multiple processing cores, and the like. And the memory 460 is generally included to be representative of a random access memory. The storage 420 may be a disk drive storage device. Although shown as a single unit, the storage 420 may be a combination of fixed andor removable storage devices, such as fixed disc drives, floppy disc drives, tape drives, removable memory cards or optical storage, network attached storage (NAS), or a storage area-network (SAN). Further, system 400 is included to be representative of a physical computing system as well as virtual machine instances hosted on a set of underlying physical computing systems. Further still, although shown as a single computing system, one of ordinary skill in the art will recognized that the components of the system 400 shown in
As shown, the memory 460 includes an operating system 461 and applications 462-465. Illustratively, the operating system may include Microsoft's Windows®. The applications 462-465 include a simplification application 462, which is configured to simplify a full-dynamics model 421 into a simplified model 422 that represents the essential properties of the full dynamics. In one embodiment, the simplification application 462 may be configured to linearize the full model 421 around a nominal state, perform SVD on an inverse inertia matrix of a system constrained by contact constraints and select singular values and corresponding singular vectors to be kept in the simplified model 422, and determine a gravitational forces term and a velocity-dependent forces term 422 by sampling torques at a number of poses around the nominal state, as described in detail above with respect to
The applications 462-465 further include a robot control application 465, which may be configured to send signals to a robot indicating the joint torques to exert. That is, the robot control application may convert calculated joint torques to instructions that are sent to the robot, thereby causing joints of the robot to move according to those instructions.
Advantageously, techniques disclosed herein permit full dynamics models of robots to be simplified. Conventionally, the controller developer must decide which mechanical model represents essential properties of dynamics of a robot, and mapping the state and input between models often requires model-specific code. By contrast, techniques described herein may be completely automated once contact constraints, nominal pose, and number of generalized coordinates of the simplified model are specified. The mapping between the states and inputs of the full and simplified models may also be obtained automatically in the same way for any simplified models obtained by techniques disclosed herein. Such a mapping has a simple linear relationship which does not require model-specific programs. As a result, simplified models for different poses and contact constraints may easily be derived, and switches among them may be made using the same program to, e.g., generate complex motions that sequentially combine multiple models and controllers.
While the foregoing is directed to aspects of the present invention, other and further aspects of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.