The present invention generally relates to the field of controlling motion of a system, and more specifically, to controlling motion of a system using human motion descriptors.
In recent years, there has been a growing interest in using captured human motion data as examples to simplify the process of programming or learning complex robot motions. See, e.g., A. Nakazawa, S. Nakaoka, K. Ikeuchi, and K. Yoko, “Imitating human dance motions through motion structure analysis”, Intl. Conference on Intelligent Robots and Systems (IROS), pages 2539-2544, Lausanne, Switzerland (2002). Captured human motion has been used to develop algorithms for ‘learning from demonstration’, a form of learning whereby a robot learns a task by watching the task being performed by a human. See, e.g., S. Schaal, “Learning from demonstration”, In M. C. Mozer, M. Jordan, and T. Petsche, editors, Advances in Neural Information Processing Systems, chapter 9, pages 1040-1046. MIT Press (1997). One goal of ‘learning from demonstration’ has been to replace the time-consuming manual programming of a robot by an automatic programming process, solely driven by showing the robot the task by an expert teacher. Captured human motion has also been used in computer animation to ‘retarget’ motion of one articulated figure to another figure with a similar structure. See, e.g., S. Tak, O. Song, and H. Ko, “Motion balance filtering”, Comput. Graph. Forum, (Eurograhics 2000), 19(3):437-446, (2000). See also, e.g., S. Tak and H. Ko, “A physically-based motion retargeting filter”, ACM Trans. on Graphics, 24(1):98-117 (2005).
The majority of existing approaches to control a robot from human demonstration use three steps. First, human motion is recorded using a marker based optical system, a magnetic system, or a wearable mechanical device resembling an exoskeleton. The recorded motion is then used as an input to a constrained non-linear optimization procedure to generate a re-designed joint level robot motion. See, e.g., A. Ude, C. G. Atkeson, and M. Riley, “Programming full-body movements for humanoid robots by observation”, Robotics and Autonomous Systems, 47:93-108 (2004). Constraints are also imposed to enforce the robot's kinematic, dynamic, and balance constraints. See, e.g., Y. Tamiya M. Inaba S. Kagami, F. Kanehiro and H. Inoue, “Autobalancer: An online dynamic balance compensation scheme for humanoid robots”, In Int. Workshop Alg. Found. Robot, (WAFR), Lausanne, Switzerland, (2000). Given sufficient computation time, a retargetted motion can be optimized off-line to satisfy a prescribed performance measure subject to the constraints. The retargetted joint motion is then used as the desired joint command input to the robot's independent joint controller. In the absence of external disturbances, the generated robot motion is kinematically and dynamically admissible by the robot and can be executed by the robot's control scheme during run-time.
Such a process has limitations. First, the complexity of sensing and instrumentation used to record human motion is time consuming and often limiting for use outside the laboratory. Moreover, the off-line re-targeting solution based on optimization methods is generally not suitable for online or reactive control of robots in dynamic scenes or in the presence of external disturbances. These limitations restrict the applicability of the approach to laboratory settings with static environments. An effective system should be interactive and therefore computationally fast, should work in static and dynamic environments, and should not rely on special instrumentation for the human demonstrator nor the environment.
Methods and systems provide for reconstructing, retargeting, tracking and estimating motion of an object from observation. The observation may include low dimensional task descriptors. Movement of members of a target system, which may be an articulated system such as a robot, may be controlled by tracking motion of an observed object. Constraints on movement of the members of the articulated system are enforced. Commands for controlling movement of the members of the articulated system are generated in response to the tracking motion and enforcing the constraints.
In other aspects, the methods and systems provide for reconstructing, retargeting, tracking and estimating motion of an object from received task descriptors. In other aspects, a balance control command is generated in response to the commands for controlling movement. In other aspects, the tracking may include first or second order closed loop inverse kinematics, or inverse dynamics, and may include regularization of the inverse Jacobian matrix.
Task descriptors may be used to define motion of a source system, such as a human, and to define motion of the robot. Task variables are assigned to portions of source system. Motion of the portions of the human represented by the task variables is detected. Commands for the articulated system are generated to replicate motion of the source system. The commands for the replication motion are adjusted in response to physical limitations of the articulated system for performing motions.
The features and advantages described in the specification are not all inclusive and, in particular, many additional features and advantages will be apparent to one of ordinary skill in the art in view of the drawings, specification, and claims. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter.
A preferred embodiment of the present invention is now described with reference to the figures where like reference numbers indicate identical or functionally similar elements. Also in the figures, the left most digits of each reference number corresponds to the figure in which the reference number is first used.
Reference in the specification to “one embodiment” or to “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiments is included in at least one embodiment of the invention. The appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment.
Some portions of the detailed description that follows are presented in terms of algorithms and symbolic representations of operations on data bits within a computer memory. These algorithmic descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. An algorithm is here, and generally, conceived to be a self-consistent sequence of steps (instructions) leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical, magnetic or optical signals capable of being stored, transferred, combined, compared and otherwise manipulated. It is convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like. Furthermore, it is also convenient at times, to refer to certain arrangements of steps requiring physical manipulations of physical quantities as systems or code devices, without loss of generality.
However, all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the following discussion, it is appreciated that throughout the description, discussions utilizing terms such as “processing” or “computing” or “calculating” or “determining” or “displaying” or “determining” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system memories or registers or other such information storage, transmission or display devices.
Certain aspects of the present invention include process steps and instructions described herein in the form of an algorithm. It should be noted that the process steps and instructions of the present invention could be embodied in software, firmware or hardware, and when embodied in software, could be downloaded to reside on and be operated from different platforms used by a variety of operating systems.
The present invention also relates to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, application specific integrated circuits (ASICs), or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus. Furthermore, the computers referred to in the specification may include a single processor or may be architectures employing multiple processor designs for increased computing capability.
The algorithms and displays presented herein are not inherently related to any particular computer or other apparatus. Various general-purpose systems may also be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the required method steps. The required structure for a variety of these systems will appear from the description below. In addition, the present invention is not described with reference to any particular programming language. It will be appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any references below to specific languages are provided for disclosure of enablement and best mode of the present invention.
In addition, the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter. Accordingly, the disclosure of the present invention is intended to be illustrative, but not limiting, of the scope of the invention, which is set forth in the claims.
Methods and systems of the present invention may use a unified, task oriented dynamic control for re-targeting. A vision module may produce a set of human motion descriptors, represented at the Cartesian (or task) level. The retargeting module receives normalized human motion descriptors and computes the desired robot joint command torque based on a hierarchical task oriented dynamic control framework. A tracking controller commands joint torques using various control methods, including decoupling control, independent joint control, gravity compensation control, or other combinations. The tracking controller converts a trajectory from Cartesian space trajectories to joint space trajectories using an internal servo loop or an external servo loop.
The tracking controller may process joint limit constraints and singularities using weighting matrices by a regularized Jacobian inverse solution. A retargeting controller allows the representation of the large number of mechanical degrees of freedom involved in the execution of movement tasks by motion primitives which can in principle be lower in dimension than the total number of degrees of freedom.
The motion controller system 100 may be used, for example, for retargeting, human pose estimation, tracking and estimation, or joint torque estimation in biomechanics.
Retargetting
The motion controller system 100 captures human motion data as examples to simplify the process of programming or learning complex robot motions. (See, e.g., A. Nakazawa, S. Nakaoka, K. Ikeuchi, and K. Yokoi, “Imitating human dance motions through motion structure analysis,” Intl. Conference on Intelligent Robots and Systems (IROS), pages 2539-2544, Lausanne, Switzerland, 2002). The motion controller system 100 may use algorithms for ‘learning from demonstration’, a form of learning whereby a robot learns a task by watching the task being performed by a human. (See, e.g., S. Schaal, “Learning from demonstration,” M. C. Mozer, M. Jordan, and T. Petsche, editors, Advances in Neural Information Processing Systems, chapter 9, pages 1040-1046. MIT Press, 1997. The motion controller system 100 may use ‘learning from demonstration’ to replace the time-consuming manual programming of a robot by an automatic programming process, solely driven by showing the robot the task by an expert teacher. The motion controller system 100 may use captured human motion in computer animation to ‘retarget’ motion of one articulated figure to another figure with a similar structure. (See, e.g., S. Tak and H. Ko, “Motion balance filtering,” Comput. Graph. Forum. (Eurographics 2000), 19(3):437-446, 2000; See, e.g., also S. Tak and H. Ko, “A physically-based motion retargetting filter”, ACM Trans. On Graphics, 24(1):98-117, 2005.
Joint Torque Estimation in Biomechanics
Studies involving computational analysis of the neuromuscular system are typically concerned with either the synthesis or analysis of human motion. (See, e.g., S. Delp and P. Loan, “A computational framework for simulating and analyzing human and animal movement,” IEEE Computing in Science and Engineering, 2(5):46-55, 2000; See, e.g., also D. Thelen, F. C. Anderson, and S. Delp, “Generating dynamic simulations of movement using computed muscle control,” Journal of Biomechanics, 36:321-328, 2003. The synthesis problem, often referred to as forward dynamics analysis, provides the motion of the biomechanical system as a consequence of the applied forces and given initial conditions. From a system point of view, let S represent the system equations, describing the equation of motion of processed by the motion controller system 100. Because S may not be known precisely, an estimate (or model) of S may be represented by S, which describes the forward dynamics equations. By using the solution to the forward dynamics problem, the motion controller system 100 may simulate and predict the body segment's motion. (See, e.g. F. C. Anderson and M. G. Pandy, “Dynamic optimization of human walking,” Journal of Biomechanical Engineering, 123:381-390, 2001). Numerical computation of movements produced by forward dynamics simulations have many applications. For example, in an embodiment involving surgical alterations, simulations may predict the anticipated consequences of the surgery on the person's movement pattern. (See, e.g., S. Piazza and S. Delp, “Three-dimensional dynamic simulation of total knew replacement motion during a step-up task,” Journal of Biomechanical Engineering, 123:589-606, 2002.)
The analysis, or inverse dynamics problem, can be viewed as the inverse of the synthesis problem. This technique provides insights into the net summation of all torques, and all muscle activity at each joint. The inverse dynamics model is a transformation of quantities obtained from a set of inputs derived from measurements to the estimated joint loads. A full kinematic description obtained from motion capture of marker positions is sufficient to obtain an exact or approximate inverse solution; however, motion capture is often combined with output from other sensors, including force plates, in order to improve the precision of the estimated joint loads. Since the optimal representation of the dynamic equations of motion will differ depending on the available sensors, inverse dynamics is in general considered a multi-modal sensing problem. (See, e.g., B. Dariush, H. Hemami, and M. Parnianpour, “Multi-modal analysis of human movement from external measurements”, Journal of Dynamic Systems, Measurement, and Control, 123(2):272-278, 2002.)
Regardless of the sensing modes, inverse dynamics problems may be limited. First, the inverse dynamics equations are functions of linear and angular accelerations of body segments, using the calculations of higher order derivatives of experimental data contaminated by noise—a notoriously error prone operation (see J. Cullum, “Numerical differentiation and regularization,” SIAM J. Numer. Anal., 8(2):254-265, 1971.) Second, the applicability of inverse dynamics may be limited to the “analysis” problem. In other words, the solution may not directly answer the “what if” type questions (or the “synthesis” problem) typically encountered in clinical applications and addressed by forward dynamics simulations.
In one embodiment, the motion controller system 100 uses a task oriented resolved acceleration control scheme to control a framework for analysis and synthesis of human motion, whereby the estimation of internal forces and moments has been formulated as a task space tracking control problem. The system 100 tracks task variables which can represent, for example, the marker positions. Constraints can be imposed to avoid joint limits, muscular torque limits, and self collision avoidance, and the like.
The Source System 102 and Target System 104
In one embodiment, the target system 104 is a physical system or a model of a physical system, and the source system 102 is an abstraction of a physical system or model. The source system 102 may use one or more desired motion primitives, expressed in Cartesian (or task) space that are provided to the target system 104. These motion primitives are referred to as the desired task descriptors, obtained, for example, from either observations (measurements) or generated synthetically. The motion controller system 100 may obtain the desired motion primitives for the target system 104 using knowledge of the kinematic or dynamic structure of the source system 102.
In one embodiment, the source system 102 and the target system 104 may be a “Master-Slave” system where a master system drives the operation of a slave system. In this embodiment, the source system 102 (master) delivers a set of motion primitives that drives the target system 104 (slave). In one embodiment, the source motion is that which is extracted from observations of human motion. For the sake of illustration, without loss of generality, the source system 102 represents a human model and the source motion represents human motion primitives or “task descriptors” which are typically observed or inferred from measurements. Alternatively, the task descriptors may be synthetically generated or obtained from theoretical calculations. The target system 104 may be any generic articulated model, such as a human model or a humanoid robot.
In one embodiment, the dimension of the space to describe all observed task descriptors is different than the total number of degrees of freedom used to define the source system 102. In one embodiment, task descriptors are characterized by a vector space that occupies a smaller dimension that the total number of degrees of freedom specifying the source system 102.
Partially Observable/Controllable Tasks
An observed task descriptor from the source system 102 has a temporal correspondence between successive observations. In other words, there is a correspondence between the position and/or orientation of a task descriptor at successive time instances when the task descriptor is observable. Furthermore, spatial correspondence is assumed between an observed “Source” task descriptor and its associated “Target” task descriptor. For every observation of a given “Source” task descriptor, a corresponding “Target” task descriptor of the same dimension and analogous degrees of freedom may be defined. The “Source” and “Target” task descriptors need not be represented by six parameters required to specify the position of the task and the orientation of the task frame.
The scenarios for which task descriptors have fewer than six parameters are listed below.
Next, a task space control framework is described to generate motion for all degrees of freedom in the target system 104 from a set of desired motion primitives which are observed from measurements, synthesized, or computed from the current configuration of the target system 104. The tracking control system 302 generates motion results in a set of computed task descriptors which track the desired task descriptors, e.g., minimize the Cartesian tracking error. The balance control system 306 controls the resulting motion for balance and not move the target system 104 toward instability. The constraint system 304 provides commands to prevent the target system 104 from violating the physical limits, such as joint limits, velocity limits, torque limits, and also ensures the system 304 avoids obstacles, self collisions, and computational problems arising from singularities. The three control systems 302, 304 and 306 may present a large number of conflicting tasks which may be resolved through a hierarchical task management strategy. The precision of lower-priority (or lower level of importance) factors may be sacrificed at the expense of higher priority (or higher level of importance) factors as described below.
The tracking control system 302 includes a position/orientation error system 410 to generate an error in response to the observed task descriptors, the constraint task descriptors, the balancing task descriptors, and computed task descriptors from a forward kinematics system 412 of the tracking control system 302. The forward kinematics system 412 generates computed task descriptors in response to the joint variables q. A control system 414 generates a control signal in response to the position/orientation error from the position/orientation error system 410, the constraint task descriptors from the constraints system 304, and the joint variable q.
The constraints system 304 includes a collision avoidance system 422, a singularity robustness system 424, a joint velocity limits system 426, a joint limits system 428 and a torque limits system 430. The motion controller systems 300 and 400 are described in more detail below.
Kinematic Structure
In one embodiment, two different articulated body models are used—a human model as the source system 102 and a humanoid robot model as the target system 104. These two models are not necessarily equivalent: that is, they may have different dimensions, physical parameters, and degrees of freedom. A set of desired human motion primitives is associated with the human model. These motion primitives are derived by post-processing the observed motion primitives detected by a vision system (not shown). The post-processing may involve low pass filtering, interpolation, and re-scaling (or normaling) link dimensions from the human to the humanoid. The desired motion primitives are expressed in Cartesian (or task) space, and may be referred to as task descriptors. The dimension of the space used to describe all observed task descriptors need not be equal to the total number of degrees of freedom used to define the humanoid model.
A motion controller system 300 is described for one embodiment that includes a kinematic structure. In one embodiment, the target system 104 represents general articulated mechanism with a tree structure.
c(i)={j|ρ(j)=i} (1)
Link and Joint Geometry
The transformation of a spatial vector quantity from the coordinate system {A} to frame {B} is given by
where rab is the 3×1 vector from the origin of frame {A} to the origin of frame {B}, with components referred to frame {A}. With the vector rab=[rab1 rab2 rab3]T the 3×3 skew symmetric matrix {tilde over (r)}ab, is defined as:
The tilde symbol ({tilde over ( )}) is used to simplify notation of vector cross products. With this notation, cross products of arbitrary vectors a and b can be expressed as
a×b=ãb (4)
It follows that the transformation of a spatial vector quantity from the coordinate system of the parent link ρ(i) to link i is given by
where r is the 3×1 vector from the origin of frame {ρ(i)} to the origin of frame {i}, with components referred to frame {ρ(i)}. The link-to-link transformation described by Equation 5 includes a constant part, which describes the relative positions of two joints fixed to a single link, and a variable part, which describes the relative positions of two links connected by a joint. The variable part is a function of the appropriate joint variables, or the joint degrees of freedom, and is recomputed as the relative positions of the two links changes. The two transformations, described as link transforms and joint transforms, are illustrated in
iXρ(i)=iXJ
where Xj=iXJi represents the variable joint transform, and XL=JiXp(i) represents the constant link transform.
Jacobians
In one embodiment, the target system 104 operates a task in the full six dimensional task space (n≧m, m=6). The position and orientation of the task descriptor is described by the vector op, and the rotation matrix oR, respectively. The notation of a leading superscript describes the frame that a quantity is referred to. For simplicity, hereafter, the leading superscript is suppressed for any quantity referred to in the base frame. In another embodiment, the target system 104 operates a task that is not completely specified (m<6).
Let the vector q=[q1, . . . , qn]T describe the degrees of freedom which fully characterize the configuration of the target system 104. The set of all possible configurations is called configuration space, or joint space. The mapping between the joint space velocities and task space velocities is obtained by considering the differential kinematics relating the two spaces,
{dot over (x)}=J(q){dot over (q)} (7)
where Jε is the Jacobian of the task.
The vector {dot over (x)} is defined by,
where w and p are vectors corresponding to the angular velocity of the task frame and the linear velocity of the task position, respectively. The Jacobian matrix may be decomposed into its rotational and translational components, denoted by Jo and Jp, respectively, as:
Let ivi be the velocity of link i, referred to frame i. The standard recursive formula for computing link velocities in the link frame of reference is given by:
ivi=iXρ(i)ρ(i)vρ(i)+hi{dot over (q)}i (10)
where hi has dimensions 6×ni and represents the motion subspace for a particular joint, referred to the coordinate frame of body i. (See, e.g., Roy Featherstone and David Orin, “Robot dynamics: Equations and algorithms”, International Conference on Robotics and Automation, San Francisco Calif., (2000). The motion subspace represents the free modes of joint i, and its columns make up a basis for this vector space. For example, the motion subspace of rotation about the z axis of a revolute joint connecting link i and ρ(i) is given by:
hi=[0 0 1 0 0 0]T (11)
Equation 10 can be converted to a summation form:
and where k represents the indices of links in the path from link i to the root. As in Equation 7, Cartesian velocities are related to joint velocities as follows:
ivi=iJi{dot over (q)} (14)
where the Jacobian is expressed in the link frame and can be expressed as,
iJi=[ . . . Jk . . . ] (15)
where,
It follows that the Jacobian matrix for link i is related to the Jacobian matrix of its parent link by the following relationship:
where the Jacobian of the first link is given by 1J1=h1. To obtain the Jacobian referred to the base (or root) frame, the following transformation may be used:
Tracking Control
The tracking control system 302 is next described. In one embodiment, tracking control refers to a control policy that produces a torque command τcmd by which the computed task descriptor kinematics of the robot system 314 tracks the desired task descriptor kinematics, subject to constraints imposed by the target system 104 based on the joint variables, defined by the vector q.
The basis for the solution of task space tracking control algorithms are the differential kinematics relating task variables and joint variables described in equation (7). In one embodiment, the motion controller system 100 uses a second order algorithm for control. Differentiating equation (7) provides:
{umlaut over (x)}=J(q){umlaut over (q)}+{dot over (J)}(q,{dot over (q)}){dot over (q)} (19)
A variety of inverse solution may be obtained from Equation 19, where generally, the solutions based on the second order system in equation 19 may involve kinematic and/or dynamic analysis.
Consider the first order differential kinematics given by Equation 7. In one embodiment, a desired motion of a task descriptor in the full six dimensional space is assigned. The associated differential kinematics of the desired motion is expressed by:
where the desired angular velocity wd can be computed from:
wd=H(Θd){dot over (Θ)}d (21)
and the transformation matrix H depends on the particular set of the desired Euler angles, θd, considered. Equations 7 and 20 may be solved for the joint velocity vector {dot over (q)}, or the joint acceleration vector {umlaut over (q)} according to the following relationships:
{dot over (q)}=J*{dot over (x)}d (22)
{umlaut over (q)}=J*({umlaut over (x)}d−{dot over (J)}{dot over (q)}) (23)
where J* denotes the regularized right pseudo-inverse of J weighted by the positive definite matrix W1 and regularized by the positive definite damping matrix W2,
J*=W1−1JT(JW1−1JT+W2)−1 (24)
The damping matrix may be used if J is an ill-conditioned matrix, the details of which are described below. If W2=0, then Equation 24 is simply the weighted right pseudo-inverse of J. Furthermore, if J is a square non-singular matrix, W1 is the identity matrix, and W2=0, the matrix J* is replaced by the standard matrix inversion J−1.
A position term or position and velocity feedback terms is introduced in equation 22 or equation 23, respectively. The tracking controller system 302 generates resulting first and second order inverse kinematics control laws given by:
{dot over (q)}=J*({dot over (x)}d+Kpe) (25)
{umlaut over (q)}=J*({umlaut over (x)}d−{dot over (J)}{dot over (q)}+Kpe+Kvė) (26)
where Kp and Kv are positive definite gain matrices. In equation 26, Kv may be selected to produce a critically damped response by using the relationship:
Kv2√{square root over (Kp)} (27)
The error vector e=[eo ep]T describes the position error (ep) and the orientation error (e0) between the desired and computed task descriptors. The position error is defined as:
ep=pd−p (28)
where pd and p correspond to the desired and computed task positions, respectively.
The computation of the orientation error may be complex and may be performed using various representations of orientation. A method based on the angle axis error is described.
The desired task orientation frame is usually described by a minimal number of coordinates, typically three Euler angles, described by the vector θd. It should be noted that the desired task orientation may be expressed by parameters other than Euler angles; nevertheless, θd can be calculated if a desired rotation matrix Rd is known. A functional expression of the orientation error in terms of an angle and axis error is given by:
where (nd, sd, ad) and (n, s, a) correspond to the desired and actual unit vector triples of the task frame, respectively. (See, e.g., J. Y. S. Luh, M. W. Walker, and R. P. C. Paul, “Resolved-acceleration control of mechanical manipulators”, IEEE Transactions on Automatic Control, 25:468-474, (1980)). The rotation matrix Rd is given by:
Rd=└ndsdad┘ (30)
R=[n s a] (31)
The position and orientation error (e=[eo ep]T) are defined, as before, by Equations 28 and 29, respectively. The partition of the error rates follows accordingly,
ė=[ėoėp]T (32)
where
ėp={dot over (p)}d−{dot over (p)}
ėo=wd−w (33)
Dynamic Equations of Motion
The dynamic equations of motion for a typical robotic system has the general form,
M(q){umlaut over (q)}+C(q,{dot over (q)})+G(q)+F(q,{dot over (q)})=τ (34)
where M is the n×n joint space inertia matrix, C is the n×1 vector of Coriolis and Centripetal torques, G is the n×1 vector of gravitation torques, F is the n×1 vector of friction terms, and τ is the n×1 vector of joint torques. Equation 34 represents the joint space dynamics. The dynamics equations as well as the control may be expressed in terms task space, or “operational space”. From Equation 19 and Equation 34, the task space (or operational space) dynamics used to describe the motion of a robot with respect to task variables may be derived. See, e.g., O. Khatib, “A unified approach for motion and force control of robot manipulators: The operational space formulation”, IEEE Journal of Robotics and Automation, RA-3(1):43-53, (1987). The operational space dynamic equations have the structure:
M(x){umlaut over (x)}+C(x,{dot over (x)})+G(x)+F(x,{dot over (x)})=T (35)
where M is the n×n operational space inertia matrix, and C, G, F and T are the Centrifugal and Coriolis force vector, the gravity vector, the friction vector, and the force-torque vector acting in operational space. Respectively, in the following sections, the dynamic parameters are used to construct model based control laws.
Trajectory Conversion Outside the Servo Loop
The CLIK trajectory conversion system 608 applies a control law such that task descriptors on the robot system 314 track corresponding normalized task descriptors on the human demonstrator, subject to the physical constraints of the robot system 314. The output of the tracking control structure 610 is the command torque obtained by a control law with the following general structure,
τcmd=ατ*+b+c (36)
The choice for vectors a, b, c and τ* depend on the type of control selected for command. Cartesian control schemes may be performed in one of two ways. The first approach uses a trajectory conversion process outside the servo loop to convert task space trajectories to joint space trajectories. The joint space trajectories are then used as the reference inputs to a joint-based servo scheme. The second approach, as will be described below in conjunction with
The trajectory conversion process is described above. That is, the first and second order inverse kinematics equations followed by numerical integration can be used to convert the desired trajectories from Cartesian space to joint space. This trajectory conversion process yields the reference joint space trajectories qr, {dot over (q)}r and {umlaut over (q)}r, used as the reference inputs to a joint space tracking controller. The control structure for such a scheme is illustrated in
Joint Space Dynamic Decoupling
The control structure 610 utilizes the second order closed loop inverse kinematics in Equation (26) followed by a joint space computed torque control method. The control structure is defined as follows:
a={circumflex over (M)}(q) (37)
b=Ĉ(q,{dot over (q)})+Ĝ(q)+{circumflex over (F)}(q,{dot over (q)}) (38)
c=0 (39)
τ*={umlaut over (q)}r+Gp(qr−q)+Gv({dot over (q)}r−{dot over (q)}) (40)
where the caret notation ^ denotes estimates of the components of the dynamic model. The feedback gain matrices Gp and Gv are positive definite gain matrices which may be designed to produce a critically damped response. The vector {umlaut over (q)}r represents the reference acceleration as a result of the trajectory conversion process described in Equation 26:
{umlaut over (q)}r=J*(qr)({umlaut over (x)}d−{dot over (J)}(qr){dot over (q)}r+Kpe+Kvė) (41)
The vectors qr and {dot over (q)}r are obtained by numerical integration of {umlaut over (q)}r. The control structure 610 is a form of nonlinear model based compensation to dynamically decouple and linearize the closed loop system provided the estimated dynamic model parameters closely match the actual dynamics of the system. Under such circumstances, the control structure 610 provides tracking such that the resulting robot motion descriptors closely track the normalized human motion descriptors to produce human-like behaviors on the robot. However, exact dynamic parameters are sometimes difficult to obtain, particularly when modeling friction and Coriolis effects. In addition, for robots with a large number of degrees of freedom, the calculation of the model dynamics becomes time consuming and may be difficult to achieve at the servo rates of the robot. In such instances, the system 600 may use model based compensation, or purely error driven control schemes as will be described below.
Independent Joint Control
When robot parameters are difficult to compute in real time, or when their exact values are not available, the system 600 may use simple control laws that are strictly error driven and therefore computationally fast. With reference to the general control structure in Equation 32, the independent joint control law is given by,
a=I (42)
b=0 (43)
c={umlaut over (q)}r+Gv({dot over (q)}r−{dot over (q)})+Gp(qr−q)+Gi∫(qr−q)dt (44)
τ*=0 (45)
where I is an n×n identity matrix, and Gv, Gp, and Gi are constant diagonal feedback gain matrices. The term {umlaut over (q)}r, or the integral action term are often set to zero. In such a control scheme, no communication is required between the joint servos; therefore a separate microprocessor may be used to implement Equation 44.
Because each joint controller in the independent-joint control schemes acts independently of the dynamics, they may produce perturbations at neighboring joints. Moreover the feedback gains may be constants and pre-specified. The controllers may not update the feedback gains under varying payloads. This may pose a problem for robots that are highly nonlinear systems whose inertial effects, the coupling between joints, and gravity effects are all either position dependent or position and velocity dependent terms. Furthermore, the inertial loading terms can change drastically at high speeds. Thus, the independent-joint control schemes described above using constant feedback gains to control a highly nonlinear robot may be modified for varying speeds and payloads.
Gravity Compensation
The control structure 610 may include a gravity model in the independent joint control scheme to improve the positioning error. The control becomes:
a=I (46)
b=G(q) (47)
c={umlaut over (q)}r+Gr({dot over (q)}r−{dot over (q)})+Gp(qr−q)+Gi∫(qr−q)dt (48)
τ*=0 (49)
Such a control scheme is an example of partial model-based compensation. The gravity term compensates for the static torques due to gravity while the other terms provide additional damping torques at the joint which are expected to improve the transient behavior of the system. The addition of gravity terms makes the control of each joint depend on the configuration of the entire robot. Therefore, communication between each joint is performed. Gravity compensation control maybe a compromise between the complete model based compensation control and the independent joint control. Gravity compensation may simplify computational efficiency of the independent joint control while providing some dynamic compensation of the gravity terms.
In other embodiments other forms of partial model based compensation are used. In some embodiments, the diagonal elements of the joint space inertia matrix, M(q), are also included in the compensation strategy.
Feed-Forward Compensation Plus PID
In one embodiment, the control structure 610 may provide joint driving torques that include the sum of a feed-forward centralized nonlinear controller to compensate for the dynamics and a decentralized independent joint proportional-integral-derivative (PID) controller. Such a control scheme is given by:
a={circumflex over (M)}(qr) (50)
b=Ĉ(qr,{dot over (q)}r)+Ĝ(qr)+{circumflex over (F)}(qr,{dot over (q)}r) (51)
c=Gv({dot over (q)}r−{dot over (q)})+Gp(qr−q)+Gi∫(qr−q)dt (52)
τ*={umlaut over (q)}r=J*(qr)({umlaut over (x)}d−{dot over (J)}(qr){dot over (q)}r+Kpe+Kvė) (53)
Trajectory Conversion Inside the Servo Loop
a={circumflex over (M)}(q) (54)
b=Ĉ(qr,{dot over (q)}r)+Ĝ(qr)+{circumflex over (F)}(qr,{dot over (q)}r) (55)
c=0 (56)
τ*={umlaut over (q)}r=J*(q)({umlaut over (x)}d−{dot over (J)}(q){dot over (q)}+Kpe+Kvė) (57)
The trajectories from the actual sensed position of the robot system 314 are fed-back to the trajectory converter 708 and the control structure 710. The control structure 710 also provides nonlinear model based compensation to dynamically decouple and linearize the closed loop system.
The control structure 710 may use dynamically decoupled control inside the servo loop with the operational space dynamics given in Equation 35. A model based compensation in this space is given by,
a={circumflex over (M)} (58)
b=Ĉ(x,{dot over (x)})+Ĝ(x)+{circumflex over (F)}(x,{dot over (x)}) (59)
where, c=0 as in Equation 56. Decoupling and control of the task can therefore be achieved using τ*={umlaut over (x)}.
Managing Multiple Tasks
In one embodiment, the motion controller system 100 may perform multiple tasks simultaneously. For example, task descriptors may represent multiple landmark locations on the human structure. Execution of one task descriptor may have a higher importance or higher priority than other task descriptors. For example, suppose the target system 104 is to perform a grasping task. The source system 102 may provide multiple task descriptors, such as the desired hand position, and desired shoulder position. In instances in which the target system cannot satisfy multiple tasks, the motion controller system 100 may assign a higher weight or a higher priority to achieving the desired motion, such as hand position over achieving the desired shoulder position. Two embodiments of the motion controller system 100 for managing multiple tasks are described, namely, 1) weighting strategy and 2) prioritized strategy.
Task Augmentation: Weighting Strategy
The motion controller system 100 may use a weighting strategy for determining the tasks to execute. Suppose there are p number of tasks that are to be executed. Let {dot over (x)}i represent the spatial velocity of the ith task descriptor and Ji the associated Jacobian. Task augmentation refers to the concatenation of the individual spatial velocities {dot over (x)}i into a 6r×1 vector {dot over (x)}, and the concatenation of the associated task Jacobian matrix Ji to form the 6r×n matrix J, such that,
{dot over (x)}=[{dot over (x)}1 . . . {dot over (x)}p]T (60)
J=[J1 . . . Jp]T (61)
Likewise, {dot over (x)}d in the augmented space is the concatenation of the each desired task descriptor spatial velocity. The solution of a tracking control algorithm in the augmented system follows Equations 7 to 33, as described above.
The tracking error rate for each element of a task descriptor can be controlled by the feedback gain matrices. For a first order system in Equation 25, the trajectory tracking error convergence rate depends on the eignevalues of the feedback gain matrix K; the larger the eignevalues, the faster the convergence. In one embodiment, such systems may use a discrete time approximation of the continuous time system; therefore, it is reasonable to predict that an upper bound exists on the eigenvalues; depending on the sampling time. A particular task (or specific directions of particular task) can be more tightly controlled by increasing the eigenvalue of K associated with direction of the particular task.
Prioritizing Tasks
The motion controller system 100 may prioritize tasks for determining the tasks to execute. In one embodiment, a task comprises two subtasks with the order of priority. The first priority subtask is specified using the first task variable, x1ε, and the second subtask by the second task variable, x2ε The two task variables may represent the position of a task descriptor and the orientation of its frame, respectively.
The differential kinematic relationship between the joint velocity {dot over (q)}ε and the Cartesian variable is expressed by:
{dot over (q)}=Ji+(q){dot over (x)}i (62)
where Ji is the Jacobian matrix of the ith task descriptor and J+ is typically defined as right pseudo-inverse if J, given by J+=JT(JJT)−1. An inverse kinematics solution with movement prioritization with the two desired movements is described by:
{dot over (q)}=J1+{dot over (x)}1+N1vn (63)
where vn is an arbitrary vector and N1=(I−J1+J1) projects vn onto the null space of J1. If an exact solution does not exist, Equation 63 represents a least squares solution that minimizes ∥{dot over (x)}1−J1(q){dot over (q)}∥. By substituting Equation 63 into Equation 62 for i=2, an exact solution for vn may be found if it exists. If an exact solution does not exist, a solution for vn may be found that minimizes, ∥{dot over (x)}2−J2(q){dot over (q)}∥, given by:
vn=(J2N1)+({dot over (x)}2−J2J1+{dot over (x)}1) (64)
The above procedure for a prioritized two task problem has been outlined in Y. Nakamura, Advanced Robotics, Redundancy and Optimization, Adisson-Wesley, (1991).
Jiε,{dot over (x)}iε i,i=1, . . . ,k, where k is the number of subtasks
The constraint system 304 initializes 1504 a projection matrix N of the vectors, such as.
NO=I (65)
For each subtask i, the constraint system 304 calculates 1506 a velocity vector, such as:
vi={dot over (x)}i (66)
The constraint system 304 calculates 1508 the Jacobian matrix using the projection matrix, such as:
Ĵi=JiNi-1 (67)
The constraint system 304 estimates 1509 the velocity, such as:
The constraint system 304 calculates 1510 a projection matrix using the Jacobian, such as:
Ni=Ni-1(I−Ĵi+Ĵi) (69)
The constraint system 304 calculates 1512 a joint velocity {dot over (q)}, such as:
where zε is an arbitrary vector.
The constraint system 304 repeats the setting 1506 until the last subtask is prioritized 1514.
In the methodology of
Processing the singularity is described below, or may use a conventional process. See, e.g., Bruno Siciliano and Jean-Jacques E. Slotine, “A general framework for managing multiple tasks in highly redundant robotic systems”, In Proceedings of the IEEE International Conference on Advanced Robotics, pages 1211-1216, Pisa, Italy, June (1991). In another embodiment, the pseudo-inverse Ĵi+ in the calculate 1512 is replaced by a singularity inverse, e.g., the damped least squares inverse described above.
Balance Control
The motion controller system 300 may directly observe or synthesize desired task variables, or may “infer” from a control policy that creates the desired task descriptors to produce a certain behavior. Such desired task descriptors are referred to here as “inferred” task descriptors, because they are typically inferred from a system that monitors the current state of the target system 104 and produces a desired task descriptor. Balance maintenance is an example of a behavior that may be controlled by an inferred balance descriptor.
In one embodiment, the target system 104 represents a human or a humanoid robot. Balance criteria such as the Zero Moment Point (ZMP), or whole body center of mass are used in order to produce a desired balancing descriptor that is a function of q. Commanding the position of the whole body center is one effective task descriptor to partially control balance.
Let pcm represent the position of the whole body center of mass, computed as
where M is the total body mass, and mi and pcmi correspond to the mass and the position of the center of mass of segment i, respectively. As before, the relationship between the center of mass velocity and the joint velocity is established through the Jacobian of the center of mass JCM,
{dot over (p)}CM=JCM{dot over (q)} (72)
Unlike the observed task descriptors which represent the desired motion obtained from measurements, the system 300 may include a separate control system to determine the desired position of the center of mass pd
In addition to the desired center of mass position, other descriptors can be used to control balance. For example, body inclination and feet positions are some examples. If task augmentation is used to manage all task descriptors, the relevant balance descriptor velocities can be concatenated together and incorporated into a single balancing descriptor vector velocity, denoted by {dot over (x)}b. The 1 . . . p observed tasks may be augmented with the balancing tasks in a similar manner as was done in Equations 43 and 44. The augmented system of the system 300 may be represented as:
{dot over (x)}=[{dot over (x)}1 . . . {dot over (x)}p|{dot over (x)}b]T
J=[J1 . . . Jp|Jb]T (73)
where for purpose of illustration only, the observed descriptors and the balancing descriptors are partitioned by a vertical separating line. In one embodiment, the balance descriptor is among the highest priority or the most important task descriptor to satisfy in many applications.
Constraints
In one embodiment, the target system 104 has kinematic and dynamic constraints that are to be satisfied. Constraints to avoid joint limits, self collisions, and collisions with the environment are examples of kinematic constraints. Singular configurations also impose constraints on the allowable regions of the workspace that the target system 104 can pass through without regularization. Moreover, the target system 104 may also have limits on the allowable joint velocities and joint torques. This is especially true if the target system 104 is a robot whereby actuator velocity limits and torque limits are critical. These constraints are sometimes handled in the null-space by specifying the vector vn for a convenient utilization of redundant degrees of mobility. These constraints may also be considered at the highest priority level, in which case they are used as the first task.
If the constraints are handled in the null-space, the motion controller system 300 may specify the vector vn for a convenient utilization of redundant degrees of mobility by constructing an objection function w(q) whose gradient defines the vector vn, as:
where ko>0. Because the solution moves along the direction of the gradient of the objective function, it attempts to locally maximize it compatible to the primary objective (kinematic constraint). The motion controller system 300 may use an objective functions, e.g., for singularity avoidance, joint limit avoidance, and collision avoidance.
Handling Singularities
The motion controller system 100 may include processing for handling singularities, such as the singularity robustness system 324. The singularities may be task singularities due to physical limitations or algorithmic singularities due to mathematics of the motion. In configurations where the Jacobian matrix in Equation 7 is full rank, any velocity {dot over (x)} can be attained. When J becomes rank deficient, the mechanism is said to be at a singular configuration. In the neighborhood of singular configurations, a small change in {dot over (x)} may require a very large change in q. This causes a large error in the task motion, since the joint torques and velocities required to execute such a motion exceed the physical capabilities of the target system 104. The singularity problem becomes an inherent problem in controlling any target system 104 representing a physical model or system. Singularities impose constraints on the feasibility and accuracy of the solution and may be therefore characterized as constraints within control hierarchy of the motion controller system 100.
There are several embodiments for overcoming the inherent singularity problem. In one embodiment, the motion controller system 100 plans a trajectory that is away from singular points by using a potential function that is a measure of the manipulability of a task. In one embodiment, the mechanisms of the target system 104 may be allowed to pass through singular points and their neighborhood through the use of a singularity robust inverse of the Jacobian matrix (see Y. Nakamura and H. Hanafusa, “Inverse kinematic solution with singularity robustness for robot manipulator control”, ASME J. Dyn. Sys. Meas., Contr., 108(3):163-171, (1986)), also known as the damped least squares method (DLS) (see C. W. Wampler, “Manipulator inverse kinematic solutions based on vector formulations and damped least squares methods”, IEEE Trans. Sys., Man, Cyber., 16(1):93-101, (1986)). The transpose of the Jacobian can be used in lieu of matrix inversion, which provides a noticeable computation savings as well as it represents an alternative to the use of a DLS inverse for handling singularities (See L. Sciavicco and B. Siciliano, “A solution algorithm to the inverse kinematic problem for redundant manipulators”, IEEE Journal of Robotics and Automation, 4:403-410, (1988)). The DLS method is described next in more detail.
The most general form of the damped least squares inverse was defined in Equation 24 and repeated below,
J*=W1−1JT(JW1−1JT+W2)−1 (75)
where W2=λ2I is the damping term, λ>0 is the damping factor, and I is the identity matrix. If W1 is the identity matrix, Equation 53 satisfies the optimization condition:
Small values of λ give accurate solutions but low robustness to the occurrence of singular and near-singular configurations. Large values of λ result in low tracking accuracy even when a feasible and accurate solution would be possible. The damping factor establishes the relative weight between the two objectives. Choosing a constant value for λ may turn out to be inadequate for obtaining good performance over the entire “Target” system workspace. There exists methods for adaptively selecting the damping factor based on some measure of closeness to the singularity at the current “Target” system configuration (See S. Buss and J. S. Kim, “Selectively damped least squares for inverse kinematics”, Journal of Graphics Tools 10:3, 37-49, (2005)). (See A. A. Maciejewski and C. A. Klein, “Obstacle avoidance for kinematically redundant manipulators in dynamically varying environments”, International Journal of Robotics Research, 4:109-117, (1985)).
Joint Limit Avoidance
The motion controller system 100 may use several methods to avoid joint limits. One is the Gradient projection method. A performance criterion is defined as a function of joint limits and its gradient is projected onto the null space projection matrix of the Jacobian to obtain the self-motion necessary to optimize the performance criterion. A second method is the Weighted Least-Norm (WLN) solution to avoiding joint limits. The WLN solution was originally proposed by T. F. Chan and R. V. Dubey, “A weighted least-norm solution based scheme for avoiding joint limits for redundant joint manipulators”, IEEE Transactions on Robotics and Automation, 11(2), (1995). A WLN solution is formulated in the context of the Damped Least Squares Jacobian inverse.
Consider once again the weighted damped least squares inverse of J from Equation 24. Suppose W1 is a diagonal matrix used for joint limit avoidance, defined by:
The scalers wi corresponding to the diagonal elements of W2 are defined by:
where H(q) is the performance criterion to avoid joint limits and
is its gradient defined by:
where qi is ith degree of freedom, and qi,min and qi,max are the lower and upper joint limits, respectively. It should be noted that
is equal to zero if the joint is at the middle of its range and goes to infinity at either limit. The second condition in Equation 72 allows the joint to move freely if the joint is moving away from the limit because there is no need to restrict or penalize such motions.
Collision Avoidance
Collision avoidance of a target system 104 with itself or with other obstacles allows the target system 104 to safely execute a motion. The collision avoidance system 422 may handle collision avoidance by defining collision points which are identified at each instant when two point pairs are approaching collision. Each collision point in a point-pair is defined as either “re-active” or “passive”. A re-active collision point may be regarded as a task descriptor for collision avoidance and controlled in the same way that other task descriptors are controlled. A re-active collision point, therefore, represents a collision task descriptor whose desired motion is determined based on a control policy which reacts to avoid collision. A passive collision point is one that is not explicitly controlled for collision avoidance. Passive collision points are either attached to an external system (such as the environment or obstacle), or attached to a different rigid body on the target system 104. In one embodiment, the motion controller system 100 processes a collision point that is attached to an external system as passive. This embodiment may be used, for example, if the collision point is not controllable by an internal control system of the target system 104.
If the collision points in a point-pair are both attached to the target system 104 (self collision), the designation of “re-active” and “passive” collision points is more complex. A practical resolution for the designation may be to consider metrics related to minimizing the energy required to move a collision point away from collision. One possible metric is to consider an inertia weighted Jacobian inverse for the inverse kinematics or inverse dynamics solution, by substituting M=W1 in Equation 24, where M is the joint space inertia matrix. This solution minimizes the kinetic energy required to execute the task; however, the inertia weighted Jacobian inverse solution requires the calculation of the joint space inertia matrix which is computationally expensive and may not be practical for closed loop inverse kinematic solutions. A more computationally feasible and algorithmically simple alternative is to use the relative mass of the rigid body associated with collision point-pairs. The point-pair attached to the larger mass will be designated as the passive collision point while the point pair attached to the smaller mass will be designated as the re-active collision point. Consider, for example, the collisions between the hand and the torso. The torso collision point will be designated as a passive collision point while the hand collision point will be designated as a re-active collision point.
Once the designation of re-active and passive collision points have been established, the computed collision avoidance task descriptor velocity is defined by:
{dot over (x)}c=pp(q)−pr(q) (81)
where pp(q), pr(q) are the passive and reactive collision points. The term computed, as before, implies the quantities are calculated by the forward kinematics system 312 and are a function of q. The system in Equations (50) and (51) may be further augmented with the collision avoidance descriptor velocity and the associated Jacobian, denoted by {dot over (x)}c and Jc, respectively.
{dot over (x)}=[{dot over (x)}1 . . . {dot over (x)}p|{dot over (x)}b|{dot over (x)}c]T (82)
J=[J1 . . . Jp|Jb|Jc]T (83)
The collision avoidance system 422 may determine a control policy that produces a desired collision avoidance task descriptor to: a) monitor the distance d to collision, and b) stop the motion of the collision point in the direction of the collision if the collision distance is less than a threshold d<dthresh. The collision avoidance system 302 may command the desired velocity to be equal and opposite to the current direction of the collision point.
Joint Velocity Limits
There are typically limits on the allowable joint velocities of each degree of freedom in the target system 104, which may be determined by the physical limits of the actuators driving a joint. In robotics, joint velocity constraints may be handled by clamping the velocities when they reach their limit. Alternatively, velocities may be controlled by using a similar approach as the damped least squares Jacobian inverse described above by modulating the damping factor. While these approaches preserve the time to execute the entire motion, they do not preserve the original trajectory profile. The resulting motion trajectory may be significantly altered from the original motion. The joint velocity limits system 426 may limit joint velocities by adaptively modulating the time between two successive samples while keeping the original joint motion profile unaltered.
To simplify notation, the subscript i, previously referred to quantities associated with joint i (i.e. q=qi and {dot over (q)}={dot over (q)}i), is dropped. Let {dot over (q)}k (k=1 . . . N) represent a length N sequence corresponding to a discrete time representation of {dot over (q)}(t). In the discrete implementation of the algorithm, the discrete time sequence at sample k+1 is given by:
tk+1=tk+Δtk (85)
where Δtk is sample time. The time modulation scheme for joint limit avoidance replaces the time sequence in Equation 85 with the following:
t′k+1=t′k+Δtkεk (86)
where εk is a time-modulation factor which can be designed to ensure that the joint velocities remain below their limits. In one embodiment, the factor εk meets the following conditions:
where {dot over (q)}lim is the joint velocity limit associated with a particular degree of freedom. The first condition provides that:
Δq′k=Δqk (87)
where,
Δqk=qk+1−q
Δq′k=q′k+1−q′
A first order finite difference approximation of the time derivative provides:
Δqk={dot over (q)}kΔtk
Δq′k={dot over (q)}′kΔt′k (89)
where Δt′k=Δtkεk. It follows from Equations 86 and 89, and the third condition that:
|{dot over (q)}k|Δtk={dot over (q)}limΔtkεk (90)
where |{dot over (q)}′k={dot over (q)}lim is forced. From Equation 90, εk is solved:
The joint velocity limits system 426 may use a time modulation scheme that ensures the three conditions listed above, which are summarized as follows:
where εk is defined by Equation 91. Note that by definition, εk>1 implies that the joint velocities are equal or above their limits and corrective action is required by modulating (expanding) time. Furthermore, εk<1 implies the joint velocities are below their limits and time modulation is unnecessary.
Note that the example above preserves the original motion profile, but may indeed expand the total time to execute the motion. In order to preserve the motion profile as well as the total execution time, other time modulation embodiments may use a Δtk that is expanded if joint velocities exceed their limits and Δtk is shortened if the joint velocities are below their limits. This may be performed in such a way that the overall execution time remains unchanged.
In another embodiment, time modulations may use the first and second order time derivatives of q that have certain smoothness and continuity characteristics. In this embodiment, the factor εk may be determined accordingly. The concept of time modulation may be applied to limit joint accelerations by considering the second order time derivative of q. In fact, the concept of time modulation may used to limit any physical quantity that can be expressed as a function of time.
Commanding Torque to Position Control Robots
In the systems described above, the retargeting system computes torque and provides a torque control command to the robot system, for example, to drive a motor shaft in the robot. In one embodiment, an actuator in the robot system is a permanent magnet DC motor. Such motors are driven by a controllable voltage or current source. Some robot manufacturers allow the user to indirectly command a desired torque by applying a voltage that is proportional to the armature current. Because motor torque is proportional to armature current, the retargeting system can directly and accurately controls torque.
In some embodiments, a robot may not be commanded by current (and therefore torque), but the robot may include a control scheme, such as some combination of position, velocity, and derivative action to independently control each joint. Such combinations may include proportional plus integral (PI), proportional plus derivative (PD), or proportional plus integral plus derivative (PID). The tracking control system 900 provides a robot command 908, such as position or velocity, to control the robot, which executes the command using an internal joint controller. In one embodiment, the motor control inversion system 904 generates the robot command 908 to cancel the dynamics of the motor and the independent joint controller supplied by the manufacturer or provided by the robot system to the tracking control system 900. The motor control inversion system 904 may include an outer loop control to command torque to robots that are designed to accept only position commands.
The tracking control system 900 may provide robot commands 908 to linear or non-linear robots with varying payloads in response to non constant feedback gains. In one embodiment, the tracking control system 900 uses a dynamic model of the robot. See, e.g., C. An, C. Atkeson, and J. Hollerbach, Model-Based Control of a Robot Manipulator, MIT Press (1988). In one embodiment, the tracking control system 900 uses PD control with gravity compensation. See, e.g., M. Takegaki and S. Arimoto, “A new feedback method for dynamic control of manipulators”, ASME J. of Dynamic Systems, Measurement, and Control, 102:119-125, (1981). In one embodiment, the tracking control system 900 uses non-model based control strategies using soft computing approaches such as fuzzy logic, neural networks, genetic algorithms, and the like.
Motor and Controller Dynamics
In order to simplify analysis, a Laplace transform technique is used to derive the motor and controller dynamics, considering a single joint robot arm. The final results for multi-joint robots are also described using the state space representation.
Consider first the motor dynamics. The most commonly used actuator for robots is the permanent magnet DC-motor. In a Permanent Magnet motor, a coil of wire called the armature is arranged in the magnetic field of a permanent magnet in such a way that it rotates when a current is passed through it. Let ia and E, represent the armature current and voltage, respectively, and La, Ra, and Kb represent, respectively, the armature inductances, armature resistances, and back emf constants. The actuator dynamics for the armature controlled DC motor is described by:
where the motor shaft speed and the angular displacement of the motor shaft are {dot over (q)} and q, respectively. In geared robots, the angular displacement of the load shaft is related to q by the gear ratio. As an illustrative embodiment, the gear ratio is assumed to be unity for simplicity and clarity. The torque developed at the motor shaft is proportional to the armature current,
τ=Ktia (94)
where Kt is the motor-torque proportional constant. Substituting equation 88 into equation 87 and taking the Laplace transform, the torque developed by the motor may be expressed by:
τ(s)=KtHm(s)(E(s)−sKbQ(s)) (95)
Where Hm is defined by:
The torque τ developed at the motor shaft is equal to the sum of the torques dissipated by the motor (τm) and the torque on the robot referred to the motor shaft (τL). The plant, comprising the motor and the robot as seen by the shaft is denoted by R. The input output transfer characteristics of the motor are illustrated in
The system 1000 may be controlled with independent joint position controllers. The controllers servo the motor such that the actual angular displacement of the joint (q) tracks a desired angular displacement (qd). In one embodiment, a PID controller is used to servo the motor. Let {tilde over (q)} represent the error between the desired shaft angle qd and the actual shaft angle q. The transfer function of a PID controller relating the applied voltage to the motor E and the position error {tilde over (q)} is given by:
where Kp, Kl, and Kv, represent the proportional, integral, and derivative feedback gain constants. It follows from equation 77 that the applied voltage E is,
E(s)=Hc(s)(Qd(s)−Q(s)) (98)
Motor and Controller Inversion
To cancel the effect of the controller, the controller inversion block “C−1” with input Ê and output qd produce a unity gain transfer function between E and output Ê. To invert the motor, the MCI block 904 has an input τc, and an output E that when cascaded with blocks C−1 and C produces a unity gain transfer function between τ and τc. The system including cascading the four blocks. M−1, C−1, C, and M, as shown in
In one embodiment, the controller inversion block C−1 has a transfer function:
Qd(s)=Hc−1(s)Ê(s)+Q(s) (100)
where Hc−1(s) is given by,
For a robot with n actuated joints, the time domain equations corresponding to a multi-input, multi-output (MIMO) version of the control inversion equation 94 is given by:
qd=y+q (102)
where qd, y, and q correspond to n dimensional vectors. The solution to equation 100 may be obtained by solving the following state space equations for the output y:
where Kp Kv and KI are diagonal n×n position, velocity, and integral gain matrices, respectively, and Ø and Ī are n×n zero matrix and identity matrix, respectively.
To invert the motor, we follow a similar approach as the controller inversion C−1 may use a similar approach. From Equation 95 and
Ê(s)=Hm−1(s)Kt−1τc(s)+sKbQ(s) (105)
where
Hm−1(s)=sLa+Ra (106)
The MIMO time domain solution for the motor inversion block M−1 is obtained from the following equations,
where the boldface quantities denoted by upper case letters correspond to n×n diagonal matrices and boldface quantities denoted by lower case letters correspond to n×1 vectors. Equations 107 and 108 may be realized using the block diagram in
While particular embodiments and applications of the present invention have been illustrated and described herein, it is to be understood that the invention is not limited to the precise construction and components disclosed herein and that various modifications, changes, and variations may be made in the arrangement, operation, and details of the methods and apparatuses of the present invention without departing from the spirit and scope of the invention as it is defined in the appended claims.
This application claims a benefit of, and priority under 35 USC §119(e) to, U.S. Provisional Patent Application No. 60/796,060, filed Apr. 27, 2006, and titled “Reconstruction, retargetting, tracking, and estimation of human motion of low dimensional task descriptors”, the contents of which are herein incorporated by reference. This application is related to U.S. patent application Ser. No. 11/614,933, filed on Dec. 21, 2006, entitled “Reconstruction, Retargetting, Tracking, And Estimation of Motion of Articulated Systems”, the contents of which are incorporated herein by reference in its entirety. This application is related to U.S. patent application Ser. No. 11/614,930, filed on Dec. 21, 2006, entitled “Reconstruction, Retargetting, Tracking, And Estimation of Pose of Articulated Systems”, the contents of which are incorporated herein by reference in its entirety. This application is related to U.S. patent application Ser. No. 11/541,001, filed Sep. 29, 2006, U. S. Publication No: US 2007-0070738 A1, the subject matter of which is incorporated herein in its entirety. This application claims a benefit of, and priority under 35 USC 119(e) to, U.S. Provisional Patent Application No. 60/910,164, filed Apr. 4, 2007, and titled “Hierarchical Control of Robots From Human Motion Primitives”, the contents of which are herein incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
3301508 | Yamron | Jan 1967 | A |
4831549 | Red et al. | May 1989 | A |
4848525 | Jacot et al. | Jul 1989 | A |
4999553 | Seraji | Mar 1991 | A |
5159249 | Megherbi | Oct 1992 | A |
5276390 | Fisher et al. | Jan 1994 | A |
5293461 | Grudic et al. | Mar 1994 | A |
5341459 | Backes | Aug 1994 | A |
5430643 | Seraji | Jul 1995 | A |
5550953 | Seraji | Aug 1996 | A |
5581666 | Anderson | Dec 1996 | A |
5586224 | Kunii et al. | Dec 1996 | A |
5625577 | Kunii et al. | Apr 1997 | A |
5675720 | Sato et al. | Oct 1997 | A |
5737500 | Seraji et al. | Apr 1998 | A |
5808433 | Tagami et al. | Sep 1998 | A |
6004016 | Spector | Dec 1999 | A |
6016385 | Yee et al. | Jan 2000 | A |
6028608 | Jenkins | Feb 2000 | A |
6236737 | Gregson | May 2001 | B1 |
6311128 | Prum et al. | Oct 2001 | B1 |
6317651 | Gerstenberger et al. | Nov 2001 | B1 |
6331181 | Tierney et al. | Dec 2001 | B1 |
6341246 | Gerstenberger et al. | Jan 2002 | B1 |
6424885 | Niemeyer et al. | Jul 2002 | B1 |
6456901 | Xi et al. | Sep 2002 | B1 |
6493608 | Niemeyer | Dec 2002 | B1 |
6554706 | Kim et al. | Apr 2003 | B2 |
6577925 | Fromherz | Jun 2003 | B1 |
6674877 | Jojic et al. | Jan 2004 | B1 |
6683968 | Pavlović et al. | Jan 2004 | B1 |
6694044 | Pavlović et al. | Feb 2004 | B1 |
6699177 | Laby et al. | Mar 2004 | B1 |
6708142 | Baillot et al. | Mar 2004 | B1 |
6786896 | Madhani et al. | Sep 2004 | B1 |
6853964 | Rockwood et al. | Feb 2005 | B1 |
6888549 | Bregler et al. | May 2005 | B2 |
6985620 | Sawhney et al. | Jan 2006 | B2 |
6995536 | Challoner | Feb 2006 | B2 |
6999601 | Pavlović et al. | Feb 2006 | B2 |
7001413 | Butler | Feb 2006 | B2 |
7106334 | Imagawa et al. | Sep 2006 | B2 |
7274800 | Nefian et al. | Sep 2007 | B2 |
7292151 | Ferguson et al. | Nov 2007 | B2 |
7386366 | Dariush | Jun 2008 | B2 |
7403880 | Maille et al. | Jul 2008 | B2 |
7457733 | Maille et al. | Nov 2008 | B2 |
7469166 | Dariush | Dec 2008 | B2 |
7573477 | Ng-Thow-Hing | Aug 2009 | B2 |
7688016 | Aghili | Mar 2010 | B2 |
7859540 | Dariush | Dec 2010 | B2 |
8170287 | Dariush et al. | May 2012 | B2 |
8467904 | Dariush | Jun 2013 | B2 |
20020173878 | Watanabe et al. | Nov 2002 | A1 |
20030113018 | Nefian et al. | Jun 2003 | A1 |
20030171847 | Cheng et al. | Sep 2003 | A1 |
20040151504 | Triebes et al. | Aug 2004 | A1 |
20040193234 | Butler | Sep 2004 | A1 |
20040193413 | Wilson et al. | Sep 2004 | A1 |
20040267404 | Danko | Dec 2004 | A1 |
20050001842 | Park et al. | Jan 2005 | A1 |
20050107916 | Nagasaka | May 2005 | A1 |
20050177276 | Morel et al. | Aug 2005 | A1 |
20050271279 | Fujimura et al. | Dec 2005 | A1 |
20060074527 | Bhatt et al. | Apr 2006 | A1 |
20060111761 | Butler | May 2006 | A1 |
20060139347 | Choi et al. | Jun 2006 | A1 |
20060139355 | Tak et al. | Jun 2006 | A1 |
20060241809 | Goswami et al. | Oct 2006 | A1 |
20060269145 | Roberts | Nov 2006 | A1 |
20060293790 | Gienger | Dec 2006 | A1 |
20060293792 | Hasegawa et al. | Dec 2006 | A1 |
20070013336 | Nowlin et al. | Jan 2007 | A1 |
20070073365 | Butler | Mar 2007 | A1 |
20070083290 | Nagasaka | Apr 2007 | A1 |
20070140562 | Linderman | Jun 2007 | A1 |
20070162164 | Dariush | Jul 2007 | A1 |
20070233280 | Bacon et al. | Oct 2007 | A1 |
20080019589 | Yoon et al. | Jan 2008 | A1 |
20080181459 | Martin et al. | Jul 2008 | A1 |
20080234864 | Sugiura et al. | Sep 2008 | A1 |
20080317331 | Winn et al. | Dec 2008 | A1 |
20100145521 | Prisco et al. | Jun 2010 | A1 |
Number | Date | Country |
---|---|---|
S 63-106806 | May 1988 | JP |
S 63-300903 | Dec 1988 | JP |
08-108383 | Apr 1996 | JP |
10-097316 | Apr 1998 | JP |
11-33958 | Feb 1999 | JP |
2003-89082 | Mar 2003 | JP |
WO 2007076118 | Jul 2007 | WO |
WO 2007076118 | Jul 2007 | WO |
Entry |
---|
Chiacchio, Pasquale, et al. “Closed-loop inverse kinematics schemes for constrained redundant manipulators with task space augmentation and task priority strategy.” The International Journal of Robotics Research 10.4 (1991): 410-425. |
Choi, Kwang-Jin, and Hyeong-Seok Ko. “On-line motion retargetting.” Computer Graphics and Applications, 1999. Proceedings. Seventh Pacific Conference on. IEEE, 1999. |
Podder, Tarun Kanti, and N. Sarkar. “Motion planning and control of UVMS: a unified dynamics-based approach.” OCEANS 2003. Proceedings. vol. 5. IEEE, 2003. |
Japanese Patent Office, Non-Final Office Action, Japanese Patent Application No. JP 2010-532163, Dec. 18, 2012, seven pages. |
United States Patent and Trademark Office, Final Rejection, U.S. Appl. No. 12/317,369, Aug. 31, 2012, 21 pages. |
Japanese Office Action, Japanese Application No. P2010-531282, Jun. 18, 2012, 6 pages. |
PCT International Search Report and Written Opinion, PCT/US08/87657, Mar. 19, 2009, 8 pages. |
PCT International Search Report and Written Opinion, PCT/US08/81171, Jan. 5, 2009, 10 pages. |
PCT International Search Report and Written Opinion, PCT/US08/81201, Jan. 5, 2009, 9 pages. |
PCT International Search Report and Written Opinion, PCT/US06/49247, Mar. 4, 2008. |
PCT International Search Report and Written Opinion, PCT/US06/49253, May 30, 2008. |
PCT International Search Report and Written Opinion, PCT/US07/67289, Mar. 14, 2008. |
Number | Date | Country | |
---|---|---|---|
20070255454 A1 | Nov 2007 | US |
Number | Date | Country | |
---|---|---|---|
60796060 | Apr 2006 | US |