Robust high-performance control for robotic manipulators

Information

  • Patent Grant
  • 5049796
  • Patent Number
    5,049,796
  • Date Filed
    Wednesday, May 17, 1989
    35 years ago
  • Date Issued
    Tuesday, September 17, 1991
    33 years ago
Abstract
Model-based and performance-based control techniques are combined for an electrical robotic control system. Thus, two distinct and separate design philosophies have been merged into a single control system having a control law formulation including two distinct and separate components, each of which yields a respective signal component that is combined into a total command signal for the system. Those two separate system components include a feedforward controller and a feedback controller. The feedforward controller is model-based and contains any known part of the manipulator dynamics that can be used for on-line control to produce a nominal feedforward component of the system's control signal. The feedback controller is performance-based and consists of a simple adaptive PID controller which generates an adaptive control signal to complement the nominal feedforward signal.
Description

2. Field of the Invention
This invention relates to control systems for controlling robotic manipulators.
3. Description of the Prior Art
The next generation of robotic manipulators will perform high-precision tasks in partially unknown and unstructure environments. These tasks require precise motion control of the manipulator under unknown and varying payloads. These requirements are far beyond the capabilities of present-day industrial robot controllers, and demand robust high-performance manipulator control systems. The need for advanced manipulator control systems to accomplish accurate trajectory tracking has therefore been recognized for some time, and two parallel lines of research have been pursued. The primary outcome of such research is the development of two classes of advanced manipulator control schemes, namely model-based and performance-based techniques.
Model-based techniques, such as the Computed Torque Method by B. R. Markiewicz: Analysis Of The Computed Torque Drive Method And Comparison With Conventional Position Servo For A Computer-Controlled Manipulator, Technical Memorandum 33-601, Jet Propulsion Laboratory, 1973, are based on cancellation of the nonlinear terms in the manipulator dynamic model by the controller. This cancellation is contingent on two assumptions which are not often readily met in practice. First, the values of all parameters appearing in the manipulator dynamic model, such as payload mass and friction coefficients, must be known accurately. Second, the full dynamic model of the manipulator needs to be known and computed on-line in real-time at the servo control rate. Performance-based techniques, such as the direct adaptive control method by S. Dubowsky and D. T. DesForges: The Application Of Model-Referenced Adaptive Control To Robotic Manipulators, ASME Journal of Dynamic Systems, Measurement and Control, Vol. 101, pp. 193-200, 1979, attempt to overcome these limitations by adjusting the controller gains on-line in real-time, based on the tracking performance of the manipulator; and, thus, eliminating the need for the manipulator model. Therefore, the indentification of the manipulator and payload parameters of the complex manipulator dynamic model is not necessary, and hence a fast adaptation can be achieved. Adaptive control methods, however, may become unstable for high adaptation rates and treat the manipulator as a "black-box" by not utilizing any part of the manipulator dynamics in the control law formulation.
During the past few years, several attempts have been made to combine the model-based and performance-based techniques in order to take full advantage of the merits of both techniques and overcome their limitations. For instance, in the approach of J. J. Craig, P. Hsu, and S. S. Sastry: Adaptive Control Of Mechanical Manipulators, Proc. IEEE Intern. Conf. on Robotics and Automation, Vol. 1, pp. 190-195, San Francisco, 1986, and R. H. Middleton and G. C. Goodwin: Adaptive Computed Torque Control For Rigid Link Manipulators, Proc. IEEE Conf. on Decision and Control, Vol. 1, pp. 68-73, Athens, 1986, the manipulator parameters are estimated adaptively first and are then utilized in a dynamic-based control law.
First attention is directed to Oswald U.S. Pat. No. 4,200,827 which discloses a control system for a magnetic head including both velocity and position feedback and feedforward signals representing both velocity and acceleration. See FIG. 1 and column 3, line 17 to column 6, line 35. Also, see Penkar et al. U.S. Pat. No. 4,773,025 and Axelby U.S. Pat. No. 4,663,703.
Next attention is directed to Takahashi U.S. Pat. No. 4,639,652 which discloses a control system for a robot manipulator including adaptive position and velocity feedback gains. See gain adjuster 14 and gains 5 and 6 in FIG. 1. Particular attention should be given to the circuit diagram presented in FIG. 4 of this reference which differs from FIG. 1 only in the use of the transfer function T(s) and operates in accordance with the description beginning at column 4, line 48 through column 6, line 29. This reference is of interest only because of the high speed positioning control. It relies upon a prior art technique commonly known as "Identification", in that test runs permit the gains of its control law to be identified.
Next attention is directed to Shigemasa U.S. Pat. No. 4,719,561 which discloses a control system having robust controller 24 in combination with a PID controller 22. See FIG. 3 and column 5, line 13.
The following references all disclose a control system for a robot manipulator.
______________________________________Browder 4,341,986Hafner 4,546,426Horak 4,547,858Perzley 4,603,284Perreirra et al. 4,763,276______________________________________
The following references are cited as of interest.
______________________________________Littman et al. 3,758,762Hiroi et al. 4,563,735Matsumura et al. 4,670,843Shigemasa 4,679,136______________________________________
In contrast to the Computed Torque Method of the prior art, the invention does not rely on an accurate dynamic model in order to control the manipulator. Furthermore, global asymptotic stability of the control system is assured since the feedback adaptation laws are derived from a Lyapunov analysis, and the feedforward controller is outside the servo control loop.
A new, robust control system using a known part of the manipulator's dynamics in a freeforward control circuit and any unknown dynamics and uncertainties and/or variations in the manipulator/payload parameters is accounted for in an adaptive feedback control loop.
SUMMARY OF THE INVENTION
This invention discloses and claims a novel approach of combining model-based and performance-based control techniques. Two distinct and separate design philosophies have been merged into one novel control system. The invention's control law formulation is comprised of two distinct and separate components, each of which yields a respective signal component that is combined into a total command signal for the system.
These two separate system components include a feedforward controller and a feedback controller. The feedforward controller is model-based and contains any known part of the manipulator dynamics that can be used for on-line control to produce a nominal feedforward component of the system's command signal. The feedback controller is performance-based and consists of a simple adaptive PID controller which generates an adaptive control signal to complement the nominal feedforward signal. The feedback adaptation laws are very simple, allowing a fast servo control loop implementation.





BRIEF SUMMARY OF THE FIGURES OF THE DRAWING
FIG. 1 is a figure depicting a schematic diagram of a typical actuator and link assembly in accordance with the invention;
FIG. 2 is a figure depicting a feedforward/feedback tracking control scheme in accordance with the invention;
FIG. 3 is a figure depicting a two-link planar manipulator in a vertical plane in accordance with the invention;
FIG. 4(i) is a figure depicting the desired [dashed] and actual [solid] trajectories of the joint angle .theta..sub.1 (t) in accordance with the invention;
FIG. 4(ii) is a figure depicting the desired [dashed] and actual [solid] trajectories of the joint angle .theta..sub.2 (t) in accordance with the invention;
FIG. 5(i) is a figure depicting the variation of the tracking--error e.sub.2 (t) in accordance with the invention;
FIG. 5(ii) is a figure depicting the variation of the tracking--error e.sub.1 (t) in accordance with the invention;
FIG. 6(i) is a figure depicting the variation of the control torque T.sub.1 (t) in accordance with the invention;
FIG. 6(ii) is a figure depicting the variation of the control torque T.sub.2 (t) in accordance with the invention;
FIG. 7(i) is a figure depicting the variations of the auxiliary signals f.sub.1 (t) [solid] and f.sub.2 (t) [dashed] in accordance with the invention;
FIG. 7(ii) is a figure depicting the variations of the position gains k.sub.p.sup.1 (t) [solid] and k.sub.p.sup.2 (t) [dashed] in accordance with the invention;
FIG. 7(iii) is a figure depicting the variations of the velocity gains k.sub.v.sup.1 (t) [solid] and k.sub.v.sup.2 (t) [dashed] in accordance with the invention;
FIG. 8 is a figure depicting the functional diagram of the testbed facility in accordance with the invention;
FIG. 9(i) is a figure depicting the desired [dashed] and actual [solid] PUMA waist angles under adaptive controller in accordance with the invention;
FIG. 9(ii) is a figure depicting the waist tracking-error under adaptive controller in accordance with the invention;
FIG. 10(i) is a figure depicting the desired [dashed] and actual [solid] PUMA waist angles under unimation controller in accordance with the invention;
FIG. 10(ii) is a figure depicting the waist tracking-error under unimation controller in accordance with the invention;
FIG. 11(i) is a figure depicting the variation of the auxiliary signal f(t) in accordance with the invention;
FIG. 11(ii) is a figure depicting the variation of the position gain k.sub.p (t) in accordance with the invention;
FIG. 11(iii) is a figure depicting the variation of the velocity gain k.sub.v (t) in accordance with the invention;
FIG. 11(iv) is a figure depicting the variation of the control torque T(t) in accordance with the invention;
FIG. 12(i) is a figure depicting the desired [dashed] and actual [solid] waist angles with arm configuration charge in accordance with the invention; and
FIG. 12(ii) is a figure depicting the waist tracking-error with arm configuration change in accordance with the invention.





DESCRIPTION OF THE PREFERRED EMBODIMENT
1. Summary Of Presentation
The presentation of the invention is structured as follows. In Section 2, the integrated dynamic model of a manipulator and actuator system is derived. The tracking control scheme is described fully in Section 3. In Section 4, the digital control implementation of the scheme is given. The issue of robustness is discussed in Section 5. The control scheme is applied in Section 6 to the model of a two-link arm, and extensive simulation results are given to support the method. In Section 7, the implementation of the proposed control scheme on a PUMA industrial robot is described and experimental results are presented to validate the improved performance of the invention. Section 8 discusses the results and concludes the presentation of the description of the invention.
2. Integrated Dynamic Model of Manipulator-Plus-Actuator System
Most papers on manipulator control neglect the dynamics of joint actuators, and treat the joint torques as the driving signals. In this section, a realistic approach is taken by including the actuator dynamics and modeling the manipulator and actuators as an integrated system.
In many industrial robots such as the Unimation PUMA, the links of the manipulator are driven by electric actuators at the corresponding joints, and the dynamics of the joint actuators must be taken into account. Note that although electric actuators are modeled hereinafter, the results are general since the form of dynamic equations for other types of actuators is essentially the same.
Referring to FIG. 1, each actuator 100 may be considered as comprising a link 101, driven by a gear 110 that meshes with a motor-driven drive gear 125. Many such actuators are generally required for any given robotics application. A single actuator as a general case will be presented in this application for simplicity purposes. It should be understood that several actuators as needed are driven by the command signal as developed by the control system of this invention.
Any typical actuator is basically a DC servomotor with a permanent magnet 130 to provide the motor field and the driving signal is a voltage or a current applied to the armature winding. In FIG. 1, a driving voltage identified simply as V.sub.j is impressed across a pair of input terminals 140. The resistance and inductance shown in the FIG. simply represent the internal parameters typically found in any actuator and such matters are well known in the art and require no further description. Since servomotors are inherently high-speed low-torque devices, the gear assembly 110,125 is often required to mechanically couple the armature shaft 126 to the robot link 110 in order to obtain speed reduction and torque magnification.
Consider now the jth actuator 100 and suppose that the armature is voltage-driven, as shown in FIG. 1. This representation is general because in cases where the armature is current-driven using the current source i(t) with shunt resistance R.sub.c, the driving source can be replaced by the voltage source v(t)=R.sub.c i(t) with series resistance E1 ? ##STR1## Therefore, without loss of generality, we can assume that the driving source of the jth joint motor is always the voltage source v.sub.j (t) with the internal resistance r.sub.j. This source produces the current i.sub.j (t) in the armature circuit; and the electrical equation for the jth actuator can be written as ##EQU1## where R.sub.j and L.sub.j are the resistance and inductance of the jth armature winding, .phi..sub.j (t) is the angular displacement of the jth armature shaft, and the term ##EQU2## is due to the back-emf generated in the armature circuit. Let us now consider the mechanical equation of the actuator. Referring to the armature shaft, the "equivalent" moment of inertia and friction coefficient of the total load are given by K. Ogata: Modern Control Engineering, Prentice Hall Inc., N.J., 1970 ##EQU3## where {J.sub.jm,f.sub.jm } and {J.sub.j,f.sub.jl } are the moments of inertia and friction coefficients of the jth motor shaft and the jth robot link respectively, while N.sub.jm and N.sub.j are the numbers of gear teeth on the motor side and on the link side respectively, and ##EQU4## is the gear ratio. Although it is assumed that there is one gear mesh between the motor and the link, the result can be extended to multi-mesh gear trains in a trivial manner. See K. Ogata: Modern Control Engineering, Prentice Hall Inc., N.J., 1970. Equations (2) and (3) indicate that, as seen by the motor shaft, the link inertia and friction are reduced by a factor of (N.sub.j).sup.2. Now, the torque r.sub.j (t) generated by the jth servomotor is proportional to the armature current i.sub.j (t); that is, r.sub.j (t)=K.sub.aj i.sub.j (t), and will cause rotation of the armature shaft by .phi..sub.j (t). In addition, the armature will exert an "effective" torque T.sub.j (t) on the jth robot link through the gear train. Thus, the mechanical equation for the jth actuator can be expressed as (refer to K. Ogata, supra) ##EQU5## Let us now denote the angular displacement of the jth robot joint by .theta..sub.j (t), where
.theta..sub.j (t)=N.sub.j .phi..sub.j (t) (5)
due to the gear train. Then, (1) and (4) can be written in terms of .theta..sub.j (t) as ##EQU6## Eliminating the armature current i.sub.j (t) between (6) and (7), the dynamic model of the jth actuator can be described by the third order differential equation ##EQU7## Consequently, for an n-jointed robot, the n joint actuators as a whole can be represented by the (3n)th order vector differential equation
A.theta.t +B.theta.(t)+C.theta.(t)+DT(t)+ET(t)=V(t) (9)
where V(t), .theta.(t) and T(t) are n.times.1 vectors and the n.times.n diagonal matrices in (9) are defined by ##EQU8## In a typical DC servomotor, the inductance of the armature winding is in the order of tenths of millihenries, while its resistance is in the order of a few ohms. Refer to J. Y. S. Luh: Conventional Controller Design For Industrial Robots--A Tutorial, IEEE Trans. Systems, Man and Cybernetics, SMC 13(3), pp. 298-316, 1983. Thus, the inductances L.sub.j can safely be neglected (L.sub.j .apprxeq.0) and in this case the actuator model (9) reduces (2n)th order model
G.theta.(t)+C.theta.(t)+ET(t)=V(t) (10)
where ##EQU9## It is seen that the approximation L.sub.j .apprxeq.0 has resulted in A=D=0, hence a decrease in the order of the model from 3n to 2n.
Now that the joint actuators have been modeled, we shall consider the manipulator dynamics. In general, the dynamic model of an n-jointed manipulator which relates the n.times.1 "effective" joint torque vector T(t) to the n.times.1 joint angle vector .theta.(t) can be written as (See J. J. Craig: Robotics--Mechanics and Control, Addison Wesley Publishing Company, Reading, Mass., 1986).
M*(m,.theta.).theta.+N*(m,.theta.,.theta.)=T (11)
where m is the payload mass, M*(m,.theta.) is the symmetric positive-definite n.times.n inertia matrix, N*(m,.theta.,.theta.) is the n.times.1 vector representing the total torque due to Coriolis and centrifugal term, gravity loading term, and frictional term. The elements of M* and N* are highly complex nonlinear functions which depend on the manipulator configuration .theta., the speed of the motion .theta., and the payload mass m. On combining (10) and (11), we obtain the integrated dynamic model of the manipulator-plus-actuator system as
M(m,.theta.).theta.+N(m,.theta.,.theta.)=V (12)
where the terms in (12) are defined as
M(m,.theta.)=G+EM*(m,.theta.); N(m,.theta.,.theta.)=C.theta.+EN*(m,.theta.,.theta.)
Equation (12) represents a (2n)th order coupled nonlinear system with the n.times.1 input vector V(t) of the armature voltages and the n.times.1 output vector .theta.(t) of the joint angles.
Although the manipulator dynamic model (11) and the integrated system model (12) are both of order (2n), it is important to note that the integrated model (12) is a more accurate representation of the system than the manipulator model (11), which does not include the actuator dynamics. This is due to the following considerations:
(i) Electrical Parameters: The main contribution from the electrical part of the joint actuators to the integrated system dynamics is the back-emf term K.sub.b .theta.(t) in (6). This term can have a significant effect on the robot performance when the speed of motion .theta.(t) is high. Note that the back-emf appears as an internal damping term, contributing to the coefficient of .theta.(t) in (12). The other electrical parameter is the armature resistance R which appears in (6) and converts the applied armature voltage v(t) to the current i(t) and in turn to the driving torque T(t).
(ii) Mechanical Parameters. The major contribution from the mechanical part of the joint actuators to the robot performance is due to the gear ratios N.sub.j [<1] of the gear trains coupling the motor shafts to the robot links. As seen from (4), the "effective" driving torque on the jth link is reduced by a factor of N.sub.j as seen by the jth joint actuator. In addition, from (2) and (3), the moments of inertia and friction coefficients of the jth link are also reduced by a factor of (N.sub.j).sup.2 as seen by the motor shaft. This implies that the mechanical parameters of joint motors, namely the motor shaft inertia and friction, can have a significant effect on the overall system performance; particularly in robots with large gear ratios such as PUMA 560 where N.sub.j is typically, 1:100.
3. Tracking Control Scheme
Given the integrated dynamic model of the manipulator-plus-actuator system as
M(m,.theta.).theta.+N(m,.theta.,.theta.)=V (13)
The tracking control problem is to devise a control system which generates the appropriate armature voltages V(t) so as to ensure that the joint angles .theta.(t) follow any specified reference trajectories .theta..sub.r (t) as closely as possible, where .theta..sub.r (t) is an n.times.1 vector.
The intuitive solution to the tracking control problem is to employ the full dynamic model (13) in the control scheme in order to cancel out the nonlinear terms in (13). This approach is commonly known as the Computed Torque Technique (see B. R. Markiewicz, supra), and yields the control law
V=M(m,.theta.)[.theta..sub.r +K.sub.v (.theta..sub.r -.theta.)+K.sub.p (.theta..sub.r -.theta.)]+N(m,.theta.,.theta.) (14)
where K.sub.p and K.sub.v are constant diagonal n.times.n position and velocity feedback gain matrices. This results in the error differential equation
e(t)+K.sub.v e(t)+K.sub.p e(t)=0 (15)
where e(t)=.theta..sub.r (t)-.theta.(t) is the n.times.1 vector of position tracking-errors. When the diagonal elements of K.sub.p and K.sub.v are positive, (15) is stable; implying that e(t).fwdarw.0 or .theta.(t).fwdarw..theta.r(t) as t.fwdarw..infin., i.e. tracking is achieved. In the control law (14), we have implicitly made a few assumptions which are rarely true in practice. The major problem in implementing (14) is that the values of the parameters in the manipulator model (13) are often not known accurately. This is particularly true of the friction term and the payload mass. Another problem in implementation of (14) is that the entire dynamic model (13) of the manipulator must be computed on-line in real time. These computations are quite involved, and the computer expense may make the scheme economically unfeasible.
In an attempt to overcome the afore-mentioned limitations of the Computed Torque Technique, a new tracking control philosophy of the invention is proposed in this section. The underlying concept in this invention is that the full dynamic model is not required in order to achieve trajectory tracking and the lack of knowledge of full dynamics can readily be compensated for by the introduction of "adaptive elements" in the control system. Specifically, the proposed tracking control system, FIG. 2, is composed of two components: the nominal feedforward controller 220 and the adaptive feedback controller 250. In FIG. 2, the block 235 represents the manipulator plus actuators of the type generalized earlier herein. The output signals on leads 240 and 245 are the actual velocity and actual position of the system as sensed in any well known manner. Three separate input terms are depicted at leads 201, 202, and 203. The signals on these input leads represent, respectively, desired position (.phi..sub.r) on lead 201, desired velocity (.phi..sub.r) on lead 202, and desired acceleration (.phi..sub.r) on lead 203. The feedforward controller 220 contains computation elements 205 and 210. Computation elements 205 and 210 take the known information that is available about the manipulator/system and compute, based upon the input signals at leads 201, 202, or 203 the available partial information that is fed to a summing junction 215. The output from summing junction 215 is a signal V.sub.o, which signal is in turn fed into another signal junction 230. The signal V.sub.o from junction 215 is the feedforward component of the total command signal V that is developed by the invention at lead 232 into the manipulator plus actuators 235. Since the feedforward loop 220 is model-based, any known information about the manipulators or the actuator system is input into the control loop. Data on the manipulator dynamics can be used for real-time control at the required sampling rate. Such information can be, for instance, only the gravity loading term or the manipulator full dynamics excluding the payload. The feedforward controller 220 is model-based and it acts on the desired joint trajectory .theta..sub.r (t) to produce the actuators driving voltage V.sub.o (t).
The role of the feedback controller 250 is to generate the corrective actuator voltage V.sub..alpha. (t), based on the tracking-error e(t), that needs to be added to V.sub.o (t) to complement the feedforward controller. The feedback controller 250 is composed of adaptive position and velocity feedback terms and an auxiliary signal f(t). The feedback gains, K.sub.v (t) at element 240 and K.sub.p (t) element 242 are varied in accordance with an adaptation law that is described in greater detail hereinafter.
Suffice it to say at this point that the error terms and the desired terms for position and velocity are adapted to form an adaptation component, V.sub.a, at junction 230 which is then combined with the feedforward component V.sub.o from loop 220 in order to yield the total command signal for the control system of this invention. Thus, the feedback loop 250 employs these stated signals which are updated continuously in real time to cope with the nonlinear nature of the system and uncertainties/variations in the manipulator parameters or payload. The feedforward and feedback controllers 220, 250 are now discussed separately in Sections 3.1 and 3.2.
3.1 Nominal Feedforward Controller
Suppose that some partial knowledge about the manipulator dynamic model (13) is available in the form of "approximations" to {M(m,.theta.),N(m,.theta.,.theta.)} denoted by {M.sub.o (m.sub.o,.theta..sub.r),N.sub.o (m.sub.o,.theta..sub.r,.theta..sub.r }, where m.sub.o is an estimate of m. Note that {M.sub.o,N.sub.o } are functions of the reference trajectory .theta..sub.r (t) instead of the actual trajectory .theta.(t). The information available in M.sub.o and N.sub.o can vary widely depending on the particular situation. For instance, we can have M.sub.o (m.sub.o,.theta..sub.r)=0 and N.sub.o (m.sub.o,.theta..sub.r, .theta..sub.r)=G(m,.theta..sub.r), where only gravity information is available. Likewise, it is possible to have M.sub.o (m.sub.o,.theta..sub.r)=M(0,.theta..sub.r) and N.sub.o (m.sub.o,.theta..sub.r,.theta..sub.r)=N(0,.theta..sub.r,.theta..sub.r), where no information about the payload is available. Furthermore, the matrices M.sub.o and N.sub.o may have either a "centralized" or a "decentralized" structure. In the centralized case, each element of M.sub.o and N.sub.o can be a function of all joint variables. In the "decentralized" case, [M.sub.o ].sub.ii and [N.sub.o ].sub.i are functions only of the ith joint variable and ]M.sub.o ].sub.ij =0 for all i.noteq.j.
The nominal feedforward controller is described by
V.sub.o (t)=M.sub.o (m.sub.o,.theta..sub.r).theta..sub.r (t)+N.sub.o (m.sub.o,.theta..sub.r,.theta..sub.r) (16)
where V.sub.o is the n.times.1 nominal control voltage vector and the controller operates on the reference trajectory .theta..sub.r (t) instead of the actual trajectory .theta.(t). It is important to realize that M.sub.o and N.sub.o are based solely on the information available on the manipulator dynamics which is used in the real-time control system of this invention. The controller matrices {M.sub.o, N.sub.o } can therefore be largely different from the model matrices {M,N} due to lack of complete information, or due to computational constraints. For instance, in some cases we may wish to discard some elements of M and N in order to reduce the on-line computational burden, even if the full knowledge of manipulator dynamics is available.
3.2 Adaptive Feedback Controller
In contrast to the feedforward controller 220, FIG. 2, the feedback controller 250 does not assume any a priori knowledge of the dynamic model or parameter values of the manipulator plus actuators 235. This controller 250 operates solely on the basis of the tracking performance of the manipulator through the tracking-error e(t). The controller 250 is adaptive and its gains are adjusted continuously in real-time by simple adaptation laws to ensure closed-loop stability and desired tracking performance. The on-line adaptation compensates for the changing dynamic characteristics of the manipulator due to variations in its configuration, speed, and payload.
The adaptive feedback controller is described by
V.sub..alpha. (t)=f(t)+K.sub.p (t)e(t)+K.sub.v (t)e(t) (17)
where V.sub..alpha. (t) is the n.times.1 adaptive control voltage vector, e(t)=.theta..sub.r (t)-.theta.(t) is the n.times.1 position tracking-error vector, f(t) is an n.times.1 auxiliary signal generated by the adaptation scheme, and {K.sub.p (t),K.sub.v (t)} are the n.times.n adjustable PD feedback gain matrices. The feedback control law (17) can be either "centralized" or "decentralized." For the centralized case, the controller adaptation laws are obtained as I have discussed in my paper entitled A New Approach to Adaptive Control of Manipulators, ASME Journal of Dynamic Systems, Measurement, and Control, Vol. 109, No. 3, pp. 193-202, 1987. For the decentralized control case, the gains {K.sub.p (t),K.sub.v (t)} are diagonal matrices and their ith diagonal elements are obtained from the adaptation laws (21)-(24) with e(t) replaced by e.sub.i (t). See, for further explanation my paper H. Seraji: Decentralized Adaptive Control of Manipulators: Theory, Simulation, and Experimentation, IEEE Journal of Robotics and Automation, 1988, (to appear). The centralized case yields the controller adaptation laws as
f(t)=.gamma..sub.1 r(t)+.gamma..sub.2 r(t) (18) ##EQU10## where the prime denotes transposition, and r(t) is the n.times.1 vector of "weighted" position-velocity error defined as
r(t)=W.sub.p e(t)+W.sub.v e(t) (21)
In (18)-(21), {.gamma..sub.1,.alpha..sub.1,.beta..sub.1 } are zero or positive proportional adaptation gains, {.gamma..sub.2,.alpha..sub.2,.beta..sub.2 } are positive integral adaptation gains, and W.sub.p =diag.sub.i (w.sub.pi) and w.sub.v =diag.sub.i (w.sub.vi) are constant n.times.n matrices which contain the position and velocity weighting factors for all joints. Integrating (18)-(20) in the time interval [0,t], one obtains ##EQU11## Since the initial values of the reference and actual trajectories are the same, we have e(0)=e(0)=r(0)=0. This yields the Proportional=Integral (P=I) adaptation laws ##EQU12## It is noted that the choice of {W.sub.p,W.sub.v } affects all adaptation rates in (22)-(24) simultaneously; whereas the adaptation rate for each term {f(t),K.sub.p (t),K.sub.v (t)} can be affected individually by the selection of {.gamma..sub.i,.alpha..sub.i,.beta..sub.i } independently. The proportional terms in the adaptation laws (22)-(24) act to increase the rate of convergence of the tracking-error e(t) to zero. The use of P+I adaptation laws also yields increased flexibility in the design, in accordance with the features of the invention, by providing a larger family of adaptation schemes than obtained by the conventional I adaptation laws.
The physical interpretation of the auxiliary signal is obtained by substituting from (21) into (22) to yield ##EQU13## Hence, f(t) can be generated by a PID controller with fixed gains acting on the tracking-error e(t). Thus, the feedback controller (17) can be represented by the PID control law where
K*.sub.p (t)=K.sub.p (t)+.gamma..sub.1 W.sub.p +.gamma..sub.2 W.sub.v ; K*.sub.v (t)=K.sub.v (t)+.gamma..sub.1 W.sub.v ; K*.sub.I =.gamma..sub.2 W.sub.p (27)
It is seen that the feedback controller 250 as defined in accordance with equation (26) is composed of three terms which are effective during the initial, intermediate, and final phases of motion:
(i) The initial auxiliary signal f(0) can be chosen to overcome the stiction (static friction) and compensate for the initial gravity loading. This term improves the responses of the joint angles during the initial phase of motion.
(ii) The adaptive-gain PD term K*.sub.p (t)e(t)+K*.sub.v (t)e(t) is responsible for the tracking performance during gross motion while the manipulator model is highly nonlinear, i.e., the changes of .theta.(t) and .theta.(t) are large. Each gain consists of a fixed part and an adaptive part. The online gain adaptation is necessary in order to compensate for the changing dynamics during the intermediate phase of motion.
(iii) The fixed-gain I term ##EQU14## takes care of the fine motion in the steady-state, while the changes of .theta.(t) and .theta.(t) are small and the manipulator model is approximately linear. Thus the I term contributes during the final phase of motion.
3.3 Total Control System of the Invention
The total control system is obtained by combining the nominal feedforward controller 220 operating in accordance with equation (16) and the adaptive feedback controller 250 operating in accordance with equation (17) as shown in FIG. 2 to yield the control law ##EQU15## where V is the total voltage applied at the actuators. It is important to note that in this control configuration, closed-loop stability is not affected by the feedforward controller.
I shall now discuss two extreme cases:
Case i: UNKNOWN MANIPULATOR MODEL
When no a priori information is available on the manipulator dynamic model, that is M.sub.o =N.sub.o =0, the feedforward controller has no contribution, i.e. v.sub.o (t)=0. In this case, the control system approach reduces to the adaptive feedback control law
V(t)=f(t)+K.sub.p (t)e(t)+K.sub.v (t)e(t) (29)
which can be implemented with a high sampling rate.
Case ii: FULL MANIPULATOR MODEL
When the full dynamic model and accurate parameter values of the manipulator, actuators and payload are available for on-line control, that is M.sub.o =M and N.sub.o =N, the feedforward controller can generate the required actuator voltage V.sub.o (t). In this case, the adaptation process can be switched off and the feedback controller reduces to a fixed-gain PD controller {K.sub.p (0),V.sub..upsilon. v(0)}. The control law is now given by
V(t)=M(m,.theta..sub.r).theta..sub.r +N(m,.theta..sub.r,.theta..sub.r)+K.sub.p (O)e(t)+K.sub.v (0)e(t) (30)
which is the feedforward version of the Computed Torque Technique. See, for example, P. K. Khosla and T. Kanade: Experimental Evaluation of Nonlinear Feedback and Feedforward Control Schemes For Manipulators, Intern. Journ. of Robotics Research, Vol. 7, No. 1, pp. 18-28, 1988.
I have concluded that there is a trade-off between the availability of a full dynamic model and the controller adaptation process. In the range of possible operation, one can go from one extreme of no model knowledge and fully adaptive controller, to the other extreme of full model knowledge and non-adaptive controller.
4. Digital Control Algorithm
In Section 3, it is assumed that the control action is generated and applied to the manipulator in continuous time. In practical implementations, however, manipulators are controlled by means of digital computers in discrete time. In other words, the computer receives the measured data (joint positions .theta.) and transmits the control signal (actuator voltages V) ever T.sub.s seconds, where T.sub.s is the sampling period. It is therefore necessary to reformulate the manipulator control problem in discrete time from the outset. In practice, however, the sampling period T.sub.s is often sufficiently small to allow us to treat the manipulator as a continuous system and discretize the continuous control law to obtain a digital control algorithm. This approach is feasible for the invention, since the on-line computations involved for real-time control are very small; allowing high rate sampling to be implemented.
In order to discretize the control law, let us consider the adaptation laws (18)-(20) for the feedback controller and integrate them in the time interval [(N-1)T.sub.s,NT.sub.s ] to obtain ##EQU16## where N and N-1 denote the sample instants and refer to t=NT.sub.s and t=(N-1)T.sub.s, e(N)=.theta..sub.r (N)-.theta.(N) is the discrete position error, and the integrals are evaluated by the trapezoidal rule. The discrete adaptation laws can be written as
r(N)=W.sub.p e(N)+W.sub.v e(N) (31) ##EQU17## In the above equations, we have assumed that the discrete velocity error e(N) is directly available using a tachometer; otherwise the velocity error must be formed in software as ##EQU18## Equations (31)-(34) consititute the recursive algorithm for updating the feed-back controller.
Let us now evaluate the number of on-line mathematical operations that need to be performed in each sampling period T.sub.s to form the discrete feedback control law
V.sub..alpha. (N)=f(N)+K.sub.p (N)e(N)+K.sub.v (N)e(N) (35)
where e(N) and e(N) are assumed to be available. For a centralized feedback controller, the total numbers of additions and multiplications in forming V.sub..alpha. (N) are equal to 6n.sup.2 +3n and 6n.sup.2 +8n, respectively, where n is the number of manipulator joints. For a decentralized feedback controller, the numbers of operations are reduced to 9n additions and 14n multiplications. The small number of mathematical operations, particularly in the decentralized case, suggests that we can implement a digital servo loop with a high sampling rate, i.e. very small T.sub.s. This is a very important feature in digital control since slow sampling rates degrade the tracking performance of the manipulator, and may even lead to closed-loop instability.
Let us now turn to the discrete feedforward control law
V.sub.o (N)=M.sub.o [m.sub.o,.theta..sub.r (N)].theta..sub.r (N)+N.sub.o [m.sub.o,.theta..sub.r (N)] (36)
where .theta..sub.r (N) and .theta..sub.r (N) are directly available from the trajectory generator. Since the feedforward controller is "outside" the servo loop, it is possible to have a fast servo loop around the feedback controller, and the feedforward voltage V.sub.o (N) is then added at a slower rate. Furthermore, the feedforward control action V.sub.o (N) is computed as a function of the reference trajectory .theta..sub.r (N) only. In applications where the desired path .theta..sub.r (t) is known in advance, the values of the voltage V.sub.o (N) can be computed "off-line" before motion begins and stored in a look-up table in the computer memory. At run time, this precomputed voltage history is then simply read out of the look-up table and used in the control law. Such an approach can be quite inexpensive computationally at run time, while allowing implementation of a high servo rate for feedback control.
The total control law in discrete time is given by ##EQU19## Equations (31)-(37) constitute the digital control algorithm that is implemented for on-line computer control of robotic manipulators.
5. Robust Adaptive Control
The adaptation laws for the feedback controller as described in Section 3 are derived under the ideal conditions where unmodeled dynamics is not present and disturbances do not affect the system. In such idealistic conditions, the rate of change of a typical feedback gain K(t) which acts on the signal s(t) is found to be that set forth below in equation number (38). In expressing this relationship the auxiliary signal f(t), is developed by setting s(t).tbd.1. ##EQU20## where .mu..sub.1 and .mu..sub.2 are scalar adaptation gains. Extensive simulation and experimental studies suggest that too low adaptation gains result in smooth variations in K(t), but poor tracking performance. On the other hand, too high adaptation gains lead to oscillatory and noisy behavior of K(t), but yield perfect trajectory tracking. (Note that for both low and high adaptation gains, the range of control voltage is more or less than the same, since it is primarily dependent on the reference trajectory and manipulators dynamics). This argument suggests that large adaptations gains are necessary to maintain a high speed of controller adaptation in order to ensure rapid convergence of the tracking-error e(t) to zero. In practice, the adaptation gains cannot be selected too large due to a phenomenon known as "fast adaptation instability." When the speed of adaptation, i.e. K(t), is too high, the gain K(t) drifts to large values and excites the unmodeled dynamics (parasitic) of the system, which in turn leads to instability of the control system. High speed of adaptation can be either due to large adaptation gains or fast reference trajectory. Another mechanism for instability can be observed in decentralized adaptive control systems. The interconnections among subsystems can cause local controller parameters to drift to large values and hence excite the parasitic and lead to instability. For instance, a high amplitude or high frequency reference trajectory of one subsystem can destabilize the local adaptive controller of another subsystem by exciting its parasitic through the interconnections.
We conclude that in an adaptive robot control system, unstable behavior can be observed with large adaptation rates or a high degree of interjoint couplings. It is unfortunate that in trying to compensate for the change in the system, the adaptive controller may become sensitive to its own parameters.
I shall now discuss two possible approaches for avoiding the fast adaptation instability:
5.1 Time-Varying Adaptation Gains
From equation (38), it is seen that the speed of adaptation K(t) depends on the magnitude of the adaptation gains (.mu..sub.1,.mu..sub.2) and on the weighted tracking-error r(t). Simulation studies with constant adaptation gain algorithms suggest that high gains lead to faster convergence of the tracking-error to zero. However, in the initial phase of adaptation, the weighted tracking-error r(t) is large (e.g. due to static friction) and too high a value of adaptation gain causes instability problems. As the adaptation process goes on, the term r(t) decreases and at this time the adaptation gain is increased in order to achieve faster convergence. With this motivation, the constant adaptation gains .mu..sub.1 (t) and .mu..sub.2 (t) start with small initial values when the errors are usually large and, as time proceeds, build up to appropriate large final values when the errors are small.
5.2 Robustness VIA .sigma.-Modification
The P+I adaptation laws discussed so far have no provision for rejecting the destabilizing effect of "noise" introduced through unmodeled dynamics or disturbances. The integral term in the adaptation law acts to integrate a quantity related to the noise term squared. The integration of such a non-negative quantity inevitably creates an undesirable drift in the integral term and ultimately deteriorates the adaptive system performance.
Ioannou and Kokotovic (Instability Analysis and Improvement of Robustness of Adaptive Control, Automatica, Vol. 20, No. 5, pp. 583-594, 1984; Robust Redesign of Adaptive Control, IEEE Trans. Aut. Control, Vol. AC-29, No. 3, pp. 202-211, 1984) suggest ".sigma.-modification" to the adaptation law in order to eliminate the drift in the integral term and thus counteract instability. The basic idea is to modify the adaptation law (38) by adding a term -.sigma.K(t) which removes its purely integral action, that is, instead of (38) we use the .sigma.-modified law ##EQU21## where .sigma. is a positive scalar design parameter. The size of .sigma. reflects our lack of knowledge about the unmodeled dynamics and disturbances. In equation (39), the leakage or decay term -.sigma.K(t) acts to dissipate the integral buildup, and eliminate the drift problem which excites the parasitic and leads to instability. The price paid for the attained robustness is that the tracking-error .parallel.r(t).parallel. now converges to a bonded non-zero residual set, and hence perfect trajectory tracking is no longer achieved in theory. The size of this residual set depends on the value of .sigma., but can often be made sufficiently small so that performance degradation is acceptable in practice. The drawback of the .sigma.-modified adaptation law, however, is that in the absence of unmodeled dynamics and disturbances, we can no longer guarantee that lim.sub.t.fwdarw..infin. .parallel.r(t).parallel.=0, unless .sigma.=0.
The Proportional+Integral+Sigma (P+I+.sigma.) adaptation laws for the feedback controller in continuous time are now given by ##EQU22## where {.sigma..sub.1,.sigma..sub.2,.sigma..sub.3 } are positive scalar design parameters. For digital control implementation with sampling period T.sub.s, (40)-(42) yield the recursive adaptation laws ##EQU23##
In conclusion, the use of .sigma.-modification is essential in obtaining sufficient conditions for boundedness in the presence of parasitic. However, in the absence of parasitic, .sigma. causes a tracking-error of 0(.sqroot..sigma.) to remain. Therefore, there is a trade-off between boundedness of all signals in the presence of parasitic and loss of exact convergence of the tracking-error to zero in the absence of parasitic. In other words, we have sacrificed the performance in an idealistic situation in order to achieve robustness in realistic situations which are more likely to occur in practical applications.
6. Simulation Results
The tracking control scheme developed in Section 3 has been applied to a two-link manipulator for illustration of the benefits of the invention.
Consider the planar two-link manipulator in a vertical plane shown in FIG. 3, with the end-effector carrying a payload of mass m. The robot links are assumed to be driven directly (without gears) by two servomotors with negligible dynamics. Hence the arm is "direct drive" and we can treat the joint torques as the driving signals. The dynamic equation of motion which relates the joint torque vector ##EQU24## to the joint angle vector ##EQU25## is given by H. Seraji, A New Approach to Adaptive Control of Manipulators, supra, and H. Seraji, M. Jamshidi, Y. T. Kin, and M. Shahinpoor, Linear Multivariable Control of Two-Link Robots, Journal of Robotic Systems, Vol. 3, No. 4, pp. 349 -365, 1986. =M(.theta.).theta.+N(.theta.,.theta.)+G(.theta.)+H(.theta.)+mJ'(.theta.)[J(.theta.).theta.+J(.theta.,.theta.).theta.+g] (46)
where the above terms are: ##EQU26## In the above expressions, .alpha..sub.1, . . . ,.alpha..sub.5 are constant parameters obtained from the masses (m.sub.1,m.sub.2) and the lengths (l.sub.1,l.sub.2) of the robot links, and (V.sub.1,V.sub.3) and (V.sub.2,V.sub.4) are coefficients of viscous and Coloumb frictions respectively. For the particular robot under study, the numerical values of the link parameters are m.sub.1 =15.91 Kg; m.sub.2 =11.36 Kg; l.sub.1 =l.sub.2 =0.432 m so that they represent links 2 and 3 of the Unimation PUMA 560 robot. This yields the following numerical values for the model parameters (H. Seraji, M. Jamshidi, Y. T. Kim, and M. Shaninpoor, supra)
.alpha..sub.1 =3.82; .alpha..sub.2 =2.12; .alpha..sub.3 =0.71 .alpha..sub.4 =81.82; .alpha..sub.5 =24.06
The friction coefficients are chosen as V.sub.1 =V.sub.3 =1.0 Nt.m/rad.sec.sup.-1 and V.sub.2 =V.sub.4 =0.5 NT.m and the payload mass is initially m=10.0 Kg.
The joint angles .theta..sub.1 (t) and .theta..sub.2 (t) are required to track the cycloidal reference trajectories ##EQU27## so that the robot configuration changes smoothly from the initial posture ##EQU28## to the final posture ##EQU29## in three seconds. The joint angles are controlled by the feedforward and/or feedback tracking control scheme
T.sub.1 (t)=[M.sub.11 (.theta..sub.r).theta..sub.r.spsb.1 +M.sub.12 (.theta..sub.r).theta..sub.r.spsb.2 +G.sub.1 (.theta..sub.r)]+[f.sub.1 (t)+k.sub.p.spsb.1 (t)e.sub.1 (t)+k.sub.v.spsb.1 (t)e.sub.1 (t)]
T.sub.2 (t)=[M.sub.21 (.theta..sub.r).theta..sub.r.spsb.1 +M.sub.22 (.theta..sub.r).theta..sub.r.spsb.2 +G.sub.2 (.theta..sub.r)]+[f.sub.2 (t)+k.sub.p.spsb.2 (t)e.sub.2 (t)+k.sub.v.spsb.2 (t)e.sub.2 (t)](48)
where M and G are defined in (46). It is seen that each tracking control law is composed of feedforward and feedback components. The feedforward component has a centralized structure and is based on the manipulator dynamic model (46). It is a function of the reference trajectory .theta..sub.r (t) and contains the inertial acceleration term M(.theta..sub.r).theta..sub.r and the gravity loading term G(.theta..sub.r) of the arm itself, without the payload (i.e., m=0). The Coriolis and centrifugal term N(.theta.,.theta.), the frictional term H(.theta.), and the payload term are assumed to be unavailable for on-line control and are not incorporated in the feedforward controller. The feedback controller has a decentralized structure and is composed of the auxiliary signal f(t), the position feedback term k.sub.p (t)e(t), and the velocity feedback term k.sub.v (t)e(t) where e(t)=.theta..sub.r (t)-.theta.(t) is the position tracking-error in radians. The feedback terms are updated, as (In this example, the .sigma.-modification was not necessary, and hence .sigma..sub.1 =.sigma..sub.2 =.sigma..sub.3 =0). And it may be shown that: ##EQU30##
r.sub.i (t)=w.sub.pi e.sub.i (t)+w.sub.vi e.sub.i (t)=3000e.sub.i (t)+1500e.sub.i (t) (52)
Note that the initial values of the auxiliary signal and the feedback gains are all chosen arbitrarily as zero. (The numerical values of w.sub.p and w.sub.v in (52) are large since the unit of angle in the control program is "radian." A simple trapezoidal integration rule is used to compute the integrals in the adaptation laws (49)-(51) with dt=1 millisecond.
To evaluate the performance of the proposed control scheme, the nonlinear dynamic model of the manipulator-plus-payload (46) and the tracking control scheme (48) are simulated on a DEC-VAX 11/750 with the sampling period of 1 millisecond. In order to illustrate the effectiveness of the proposed control scheme to compensate for sudden gross variation in the payload mass, the mass is suddenly decreased from m=10.0 Kg to zero at 6=1.5 seconds (i.e. the payload is dropped) while the manipulator is in motion under the control system operating in accordance with equation (48). The results of the computer simulation are shown in FIG. 4(i)-(ii) and indicate that the joint angles .theta..sub.1 (t) and .theta..sub.2 (t) track their corresponding reference trajectories it is also shown that .theta..sub.r.spsb.1 (t) and .theta..sub.r.spsb.2 (t) vary closely throughout the motion, despite the sudden payload variation. FIGS. 5(i)-(ii) and 6(i)-(ii) show the responses of the tracking-errors e.sub.1 (t) and e.sub.2 (t) and the control torques T.sub.1 (t) and T.sub.2 (t), and indicate a sudden jump at t=1.5" due to payload change. To show the feedback adaptation process, time variations of the auxiliary signal F.sub.1 (t), the position gain k.sub.pi (t), and the velocity gain k.sub.vi (t) are shown in FIGS. 7(i)-(iii). It is seen that the feedback terms have adapted rapidly on-line to cope with the sudden payload mass change. The results demonstrate that the invention does not require knowledge of the payload mass m and can adapt itself rapidly to cope with unpredictable gross variations in m and sustain a good tracking performance.
7. Experimental Results
In this section, the tracking control system described in Section 3.2 is applied to a PUMA industrial robot for test and evaluation purposes.
The testbed facility at the JPL Robotics Research Laboratory consists of a Unimation PUMA 560 robot and controller, and a DEC MicroVAX II computer, as shown in the functional diagram of FIG. 8. The MicroVAX II hosts the RCCl (Robot Control "C" Library) software, which was originally developed at Purdue University (V. Hayward and R. Paul, Introduction to RCCL: A Robot Control `C` Library, IEEE Intern. Conf. on Robotics, pp. 293-297, Atlanta, 1984, and subsequently modified and implemented at JPL. During the operation of the arm, a hardware clock constantly interrupts the I/O program resident in the Unimation controller at a preselected sampling period T.sub.s, which can be chosen as 7, 14, 28 or 54 milliseconds. At every interrupt, the I/O program gathers information about the state of the arm (such as joint encoder readings), and interrupts the control program in the MicroVAX II to transmit this data. The I/O program then waits for the control program to issue a new set of control signals, and then dispatches these signals to the appropriate joint motors. Therefore, the MICROVAX II acts as a digital controller for the PUMA arm and the Unimation controller is effectively by-passed and is utilized merely as an I/O device to interface the MicroVAX II to the joint motors.
To test and evaluate the control system described in Section 3, the tracking controller is implemented on the waist joint .theta..sub.1 of the PUMA arm, while the other joints are held steady using the Unimation controller. The waist control law is coded within the RCCL environment on the MicroVAX II computer. It is assumed that the dynamic model and parameter values of the arm are not available, and hence the feedforward controller is eliminated. The control torque for the waist joint at each sampling instant N is obtained from the adaptive PID feedback control law
T(N)=f(N)+k.sub.p (N)e(N)+k.sub.v (N)e(N) (53)
where e(N)=.theta..sub.r.spsb.1 (N) -.theta..sub.1 (N) is the waist position error, ##EQU31## is the waist velocity error formed in the software, and .theta..sub.r.spsb.1 (t) is the reference trajectory for the waist joint. The feedback terms are generated by the following simple recursive adaptation laws (In the experiment, it was not necessary to use .sigma.-modification and hence we set .sigma.=0).
r(N)=30e(N)+20e(N) (54)
f(N)=f(N-1)+0.175 [r(N)+r(N-1)] (55)
k.sub.p (N)=k.sub.p (N-1)+0.35[r(N)e(N)+r(N-1)e(N-1)] (56)
k.sub.v (N)=k.sub.v (N-1)+2.8[r(N)e(N)+r(N-1)e(N-1)] (57)
where the adaptation gains are found after a few trial-and-errors. The sampling period is chosen as the smallest possible value T.sub.s =7 milliseconds (i.e. sampling f.sub.s =144 H.sub.z), since the on-line computations involved in the control law (53) are a few simple arithmetic operations. No information about the PUMA dynamics is used for implementation of the control system, and hence the controller terms are initially zero; i.e. f(0)=k.sub.p (0)=k.sub.v (0)=0.
The PUMA arm is initially at the "zero" position with the upper-arm horizontal and the forearm vertical forming a right-angle configuration. The waist joint angle .theta..sub.1 (t) is commanded to change from the initial position .theta..sub.1 =0 to the goal position ##EQU32## in 2 seconds. The reference trajectory .theta..sub.r.spsb.1 (t) is synthesized by the cycloidal trajectory generator in RCCL as ##EQU33##
While the arm is in motion, the reading of the waist joint encoder at each sampling instant is recorded directly from the arm, converted into degrees and stored in a data file. The values of the auxiliary signal and feedback gains are also recorded at each sampling and kept in the same data file. FIG. 9(i) shows the desired and actual trajectories of the waist joint angle and the tracking-error is shown in FIG. 9(ii). It is seen that the joint angle .theta..sub.1 (t) tracks the reference trajectory .theta..sub.r.spsb.1 (t) very closely, and the peak value of the tracking-error e(t) is 1.40.degree.. The initial lag in the .theta..sub.1 response is due to the large stiction (static friction) present in the waist joint.
FIGS. 10(i)-(ii) show the tracking performance of the waist joint for the same motion using the Unimation controller, which is operating with the sampling period of 1 millisecond f.sub.s =1 KH.sub.z. It is seen that the peak joint tracking-error in FIG. 10(ii) is 5.36.degree., which produces 4 centimeters peak position error at the end-effector. By comparing FIGS. 9(ii) and 10(ii), it is evident that the tracking performance of the adaptive controller is noticeably superior to that of the Unimation controller, despite that fact that the Unimation control loop is 7 times faster than the adaptive control loop. The variations of the auxiliary signal f(t), the feedback gains k.sub.p (t) and k.sub.v (t), and the control torque T(t) are also shown in FIGS. 11(i)-(iv). It is seen that f(t), k.sub.p (t), k.sub.v (t), and T(t) all start from the initial values of zero and change with time.
I shall now discuss a test relating to the tracking performance of the adaptive controller in a different situation. Suppose that the configuration of the arm is changed smoothly from the initial zero posture to the final vertical posture while the waist joint is in motion. This effectively imposes a dynamic inertial load on the waist motor, as well as introducing torque disturbances in the waist control loop due to inter-joint couplings. We now specify a different desired trajectory for the waist angle whereby .theta..sub.1 is commanded to change from 0 to ##EQU34## in three seconds while tracking a cycloidal trajectory. Using the same adaptive control law (53), the actual recording from the waist joint and the desired trajectory are shown in FIG. 12(i) and the tracking-error is plotted in FIG. 12(ii). It is seen that the actual trajectory tracks the new desired trajectory very closely, despite the dynamic loading on the waist joint and the inter-joint coupling disturbances. The experimental results demonstrate that the invention's control system is not sensitive to the arm configuration, torque disturbances, or the desired trajectory.
The following observations are made from further experiments on the PUMA robot:
1. Using the Unimation controller, the tracking-error increases for fast motion under heavy payload. For instance, when the arm is fully extended horizontally carrying a five pound payload and the waist joint is moved by 90.degree. in 1.2 seconds, the peak joint tracking-error is about 9.degree.. When transformed to the end-effector, this gives the peak tip error of 16 centimeters.
2. The rate of sampling, T.sub.s, has a central role in the performance of the proposed control scheme. In the adaptive feedback controller, the sampling rate determines the rate at which the feedback gains and the auxiliary signal are updated. Faster sampling rate (smaller T.sub.s) allows higher adaptation rates to be used, which in turn leads to a better tracking performance. When the sampling rate is slow (large T.sub.s), the tracking performance is degraded, and the use of high adaptation gains may lead to closed-loop instability. For instance, for T.sub.s =14 msec, the adaptation gains equations (54)-(57) must be reduced to maintain stability, and this degrades the tracking performance. In general, when T.sub.s is large, the effects of sampling and discretization are more pronounced and the control system performance is degraded. Therefore, in practical implementation, it is highly desirable to increase the sampling rate as much as possible by optimizing the real-time control program or using a multi-processor concurrent computing system. In the present experimental setup, for any sampling period T.sub.s, about three msec is taken up by the communication between the Micro VAX II and the Unimation controller; hence for T.sub.s =7 msec only four msec is available for control law computations.
3. The feedback adaptation gains in (54)-(57) should not be chosen unnecessarily high, since too high gains can lead to instability while producing negligible improvement in the tracking performance (e.g., an acceptable peak error of 1.degree. may decrease to 0.1.degree.). For instance, for the adaptation gains given in the above experiment, motion of .theta..sub.1 by 90.degree. in 1.2 seconds leads to instability. In this case, it is necessary to decrease the adaptation gains or to introduce .sigma.-modification in order to ensure stability. Therefore, in general, the adaptation gains must be chosen to yield an acceptable tracking performance for the fastest trajectory in the experiment, i.e. "the worst case design".
4. For very slow motions of the waist joint (e.g., average speed of 2.degree./sec), the friction present in the waist joint has a dominating effect and therefore the implemented control scheme has a poor performance. This is due to the fact that in this case, the control torque T.sub.c is comparable in magnitude to the stiction T.sub.s of the joint, and hence the net torque T.sub.c -T.sub.s applied to the joint is not sufficiently large. Therefore, for slow motions, it is necessary to introduce a feedforward controller or a friction compensation in order to counteract the effect of stiction. The situation is improved by increasing the sampling rate.
8. Conclusions
A new and simple robust control system for accurate trajectory tracking of robotic manipulators has been described. The control system takes full advantage or any known part of the manipulator dynamics in the feedforward controller. The adaptive feedback controller then compensates for any unknown dynamics and uncertainties/variations in the manipulator/payload parameters.
From the tracking point of view, it is desirable to set the feedback adaptation rates as high as possible so that the feedback controller can respond rapidly to variations in the manipulator dynamics or sudden changes in the payload. High rate adaptation, however, can cause instability through the excitation of unmodeled dynamics. The stability is counteracted by the addition of decay terms to the integral adaptation laws to yield an adaptive controller which is robust in the presence of unmodeled dynamics and disturbances.
The feedback adaptation laws in Section 3 are derived under the assumption that the robot model is "slowly time-varying" in comparison with the controller terms. In theory, this assumption is necessary in order to derive simple adaptation laws which do not contain any terms from the robot model. In practice, the simulation and experimental studies of Sections 5 and 6 justify the assumption, even under gross abrupt change in the payload. This is due to the robust nature of adaptive control schemes, which is discussed briefly in H. Seraji, A New Approach To Adaptive Control of Manipulators, supra. Nevertheless, further simulations and experiments need to be performed using direct-drive arms in fast motions to test the practical limitations of the simplifying assumption of slow time variation.
Simulation results for a two-link robot and experimental results of a PUMA industrial robot validate the capability of the invention's control system in accurate trajectory tracking with partial or no information on the manipulator dynamics.
Finally, the control features presented herein can readily be extended to the direct control of end-effector position and orientation in Cartesian space. In this formulation, the controllers operate on Cartesian variables and the end-effector control forces are then transformed to joint torques using the Jacobean matrix (H. Seraji, An Approach To Multivariable Control Of Manipulators, ASME Journ. Dynamic Systems, Measurement and Control, Vol. 109, No. 2, pp. 146-154, 1987 and H. Seraji, Direct Adaptive Control Of Manipulators In Cartesian Space, Journal of Robotic Systems, Vol. 4, No. 1, pp. 157-178, 1987.
The above description presents the best mode contemplated in carrying out my invention. My invention is, however, susceptible to modifications and alternate constructions from the embodiments shown in the drawings and described above. Consequently, it is not the intention to limit the invention to the particular embodiments disclosed. On the contrary, the invention is intended and shall cover all modifications, sizes and alternate constructions falling within the spirit and scope of the invention, as expressed in the appended claims when read in light of the description and drawing.
Claims
  • 1. A robotic system that combines model-based and performance-based techniques to control a manipulator by a control signal developed in response to input command terms and formed from nominal and complement control-signal components, said system comprising;
  • a distinct feedforward circuit means, model-based and containing user-accessible inputs for receiving a priori information concerning the manipulator's dynamics, as input into said feedforward circuit means by an operator of said system, said feedforward circuit means for controlling said manipulator by a nominal signal component delivered by the feedforward circuit means to said manipulator;
  • a second feedback circuit means, distinct and separate forms aid feedforward means, performance-based and responding adaptively to actual performance of said controlled manipulator, for emitting a feedback-related signal complement; and
  • signal combining means connected to said feedforward and feedback means for combining said feedback-related signal complement with said nominal signal component from said distinct feedforward circuit means in order to form a combined control signal, which combined control signal fully controls said manipulator's performance.
  • 2. A robotic system in accordance with claim 1 wherein desired manipulator position, velocity, and/or acceleration are individual command input terms for the system, and said distinct feedforward circuit means is model-based and contains any known part of the manipulator's dynamics that can be used for on-line control, and said feedforward circuit means further includes;
  • computation elements that receive said priori information about the manipulators dynamics as input by the operator and responds to such information and to the desired command input terms for computing partial information for on line control over said manipulator by said nominal signal as developed by said feedforward means.
  • 3. A robotic system that combines model-based and performance-based techniques to control a manipulator by a control signal developed in response to input command terms including desired manipulator position, velocity, and/or acceleration as individual command input terms for the system, and wherein said control signal is formed from nominal and complement control-signal components; said system comprising;
  • feedforward circuit means, model-based and containing a priori information concerning the manipulator's dynamics, as input into said feedforward circuit means by an operator of said system, for controlling said manipulator by a nominal signal component delivered by the feedforward circuit means to said manipulator;
  • feedback circuit means, performance-based and responding to actual performance of said controlled manipulator, for emitting a feedback related signal complement which combines with said nominal signal component from said feedforward circuit means in order to form a combined control signal, which combined control signal controls said manipulator's performance;
  • and said feedforward circuit means further includes;
  • computation elements that receive said priori information about the manipulators dynamics as input by said operator and responds to such information and to the desired command input terms for computing partial information for on-line control over said manipulator by said nominal signal as developed by said feedforward means; and
  • an actuator for said manipulator, and wherein the manipulator-plus-actuator has a given configuration, speed of motion, and a payload mass which is expressed as integrated dynamic model defined as;
  • M(m,.theta.).theta.+N(m,.theta.,.theta.)=V
  • wherein the above-noted terms are defined as
  • M(m,.theta.)=G+EM*(m,.theta.); N(m,.theta.,.theta.)=C.theta.+EN*(m,.theta.,.theta.)
  • and m is the payload mass, M*(m,.theta.) is the symmetric positive-definite n.times.n inertia matrix, N*(m,.theta.,.theta.) is the n.times.1 vector representing the total torque due to Coriolis and centrifugal term, gravity loading term, and frictional term, and the elements of M* and N* are highly coalex nonlinear functions which depend upon the configuration .theta., the speed of motion .theta., and, G and C respectively, are the inertia term and damping term for the payload mass m of said manipulator-plus-actuator.
  • 4. A control system in accordance with claim 3 wherein said manipulator has control joints and said system has an adaptation control law, and is further characterized in that;
  • said adaptation control law is decentralized with variable gains {K.sub.p (t),K.sub.v (t)}, being defined as diagonal matrices and the ith diagonal elements of said diagonal matrices being obtained from said adaptation law, with an error feedback voltage e(t) replaced by e.sub.i (t), as follows:
  • f(t)=.gamma..sub.i r(t)+.gamma..sub.2 r(t) ##EQU35## wherein the prime denotes transposition, and r(t) is the n.times.1 vector of "weighted" position-velocity error defined as r(t)=W.sub.p e(t)+W.sub.v e(t), and {.gamma..sub.i,.alpha..sub.1,.beta..sub.1 } are zero or positive proportional adaptation gains, {.gamma..sub.2,.alpha..sub.2,.beta..sub.2 } are positive integral adaptation gains, and W.sub.p =diag.sub.i (W.sub.pi) and W.sub.v +diag.sub.1 (W.sub.vi) are constant n.times.n matrices which contain the position and velocity weighting factors for all said control joints in the manipulator being controlled.
  • 5. A method of robotic control in which an on-line control over a manipulator has been initially established by a model-based control signal, the improvement comprising the steps of
  • complementing said model-based control signal with another signal developed by a feedback circuit means;
  • developing error signals each indicating the difference between the desired and actual manipulator velocity and position, with each error signal subject to an adaptation law which includes variable gain for each error signal in order to develop said complement signal component; and
  • expressing said adaptation law as r(t) which is the n.times.1 vector of "weighted" position-velocity error defined by:
  • r(t)=W.sub.p e(t)+W.sub.v e(t) ##EQU36## and wherein {.gamma..sub.1,.alpha..sub.1,.beta..sub.1 } are zero or positive proportional adaptation gains, {.gamma..sub.2,.alpha..sub.2,.beta..sub.2 } are positive integral adaptation gains, and W.sub.p =diag.sub.i (W.sub.pi) and W.sub.v =diag.sub.i (W.sub.vi) are constant n.times.n matrices which contain the position and velocity weighting factors for all joints of said manipulator.
  • 6. Formulating, in hardware, a control system apparatus which operates in accordance with the method steps of claim 5 and performing the further method steps of;
  • integrating the equations of claim 5 in the time interval [0,], to obtain ##EQU37## noting that the initial values of the reference and actual trajectories are the same, that is
  • e(0)=e(0)=r(0)=0, and
  • establishing a Proportional+ integral (P+I) adaptation law for improved control over said manipulator.
  • 7. The method of claim 6 and comprising the further steps of:
  • affecting all adaptation rates simultaneously; and
  • individually affecting the adaptation rate for each term {f(t),K.sub.p (t),K.sub.v (t)} by selecting {.gamma..sub.1,.alpha..sub.1,.beta..sub.1 } independently for improved control over said manipulator.
  • 8. The method of claim 7 comprising the further step of:
  • increasing the rate of convergence of the tracking-error e(t) to zero.
  • 9. In a controller having a control signal which controls a manipulator's position and velocity in Cartesean space, wherein the manipulator and environment form a system exhibiting nonlinear dynamics and system parameters which may not be fully known to an operator and wherein the controller includes an adaptive feedback servo control loop which senses actual position and velocity of said manipulator being controlled in Cartesian space, the improvement comprising:
  • manipulator driving means responsive to said control signal for driving said manipulator in said environment to achieve, in Cartesean space, a desired position, velocity, and acceleration as indicated by said control signal;
  • a model-based feedforward loop outside of said adaptive servo control loop that allows the operator to provide a nominal signal to the manipulator based upon a limited amount of knowledge concerning the manipulator and/or the system which the operator may elect to input into said system; and
  • adaptive control means in said feedback servo control loop, responsive to said sensed position and velocity, for varying said control signal applied to said manipulator's driving means, which variable control signal compensates in real-time for the system's nonlinearities as said manipulator is driven in Cartesean space to a controlled position and velocity.
  • 10. A control system in accordance with claim 9 and further wherein the system operates in real-time control with a given sampling rate, and said system further comprises;
  • a feedforward computational element connected in said feedforward loop for receiving operator-input values concerning said manipulator and/or system's dynamics in order to develop a nominal signal that is applied to said control signal for said manipulator independently of said adaptive controlling means.
  • 11. In a controller having a control signal which controls a manipulator's position and velocity in Cartesean space, wherein the manipulator and its environment form a system exhibiting nonlinear dynamics and system parameters which may not be fully known to an operator, and wherein the controller includes an adaptive feedback servo control loop which senses actual position and velocity of said manipulator being controlled in Cartesian space, the improvement comprising:
  • manipulator driving means responsive to said control signal for driving said manipulator in said environment to achieve, in Cartesean space, a desired position, velocity, and acceleration as indicated by said control signal;
  • a model-based feedforward loop separate and distinct from said adaptive servo control loop that allows the operator to provide a nominal signal to the manipulator based upon a limited amount of knowledge concerning the manipulator and/or the system which the operator may elect to input into said system;
  • adaptive control means in said feedback servo control loop, responsive to said sensed position and velocity, for varying said control signal applied to said manipulator's driving means, which variable control signal compensates in real-time for the system's nonlinearities as said manipulator is driven in Cartesean space to a controlled position and velocity;
  • and further wherein the system operates in real-time control with a given sampling rate, and said system further comprises;
  • a feedforward computational element connected in said feedforward loop for receiving operator-input values concerning said manipulator and/or system's dynamics in order to develop a nominal signal that is applied to said control signal for said manipulator independently of said adaptive controlling means in said separate and distinct feedback control loop; and
  • said adaptive control means operates in accordance with a control law which may be considered as though it was applied directly to the manipulator's end effector in Cartesian space, and said controller further comprises:
  • means connected in a force feedforward control loop for receiving a feedforward signal representing a desired mathematical term for said manipulator;
  • an adaptive proportional-integral-differential (PID) controller in said feedback control loop; and
  • a plurality of variable gain circuits for implementing a control law characterized as an adaptive feedback controller described by
  • Y.sub..alpha. (t)=f(t)+K.sub.p (t)e(t)+K.sub.v (t)e(t)
  • where V.sub..alpha. (t) is the n.times.1 adaptive control voltage vector, e(t)=.theta..sub.r (t)-.theta.(t) is then n.times.1 position tracking-error vector, f(t) is an n.times.1 auxiliary signal generated by the adaptation scheme, and {K.sub.p (t),K.sub.v (t)} are n.times.n adjustable PD feedback gain matrices.
  • 12. A combined model-based and performance-based robotic control system that generates a combined control signal developed from nominal and complement signal components, comprising;
  • first distinct means, model-based and containing a priori information known by an operator concerning a manipulator's dynamics, connected in a separate and distinct feedforward circuit for developing said nominal signal component; and
  • adaptive control means, also separate and distinct from said first means, performance-based and responding adaptively to actual performance of said controlled manipulator, for combining said complement signal generated by said adaptive control means with said nominal signal as developed by said first means so that the combined signal ultimately has full control over said manipulator.
  • 13. A control system in accordance with claim 12 which exhibits noise in the form of destabilization in the control system, which noise is compensated for by a .sigma.-modified law, and wherein said system is further characterized in that:
  • said noise is compensated for in the feedback means by said .sigma.-modified control law that is expressed by: ##EQU38## where .sigma. is a positive scalar design parameter.
  • 14. A control system in accordance with claim 12 and further wherein the size of .sigma. reflects a lack of knowledge about the unmodeled dynamics and disturbances of the manipulator system, and further wherein said system is characterized in that
  • the leakage or decay term-.sigma.K(t) acts to dissipate an integral buildup, and to eliminate the drift problem which leads to instability, and .parallel.r(t).parallel. converges to a bounded non-zero residual set.
  • 15. A control system in accordance with claim 12 in which said control system includes a Proportional+Integral+Sigma (P+I+.sigma.) adaptation law for said feedback controller, which law operates in continuous time, as given by: ##EQU39## where {.sigma..sub.1, .sigma..sub.2,.sigma..sub.3 } are positive scalar design parameters.
  • 16. A control system in accordance with claim 13 and further characterized by a digital control implementation with sampling period T.sub.s, which yields the recursive adaptation laws ##EQU40##
  • 17. A combined model-based and performance-based robotic control system that generates a combined control signal developed from nominal and complement signal components and which exhibits noise in the form of destabilization in the control system, which noise is compensated for by .sigma.-modified law, and wherein said system is characterized in that:
  • means, model-based and containing a priori information known by an operator concerning a manipulator's dynamic, connected in a feedback circuit for developing said nominal signal component;
  • adaptive control means, performance-based and responding adaptively to actual performance of said controlled manipulator, for combining said complement signal generated by said adaptive control means with said nominal signal so that the combined signal controls said manipulator; and
  • noise compensating means in the feedback means including said .sigma.-modified control law that is expressed by: ##EQU41## where .sigma. is a positive scalar design parameter, .mu.1 and .mu.2 are scalar adaptation gains, r is the weighted tracking error, s' is the transportation of s, where s stands for the signal on which k is acting.
  • 18. A control system in accordance with claim 17 and further wherein the size of .sigma. reflects a lack of knowledge about the unmodeled dynamics and disturbances of the manipulator system, and further wherein said system is characterized in that
  • a leakage or decay term-.sigma.K(t) acts to dissipate an integral buildup, and to eliminate the drift problem which leads to instability, and .parallel.r(t).parallel. converges to a bounded non-zero residual set.
  • 19. A control system in accordance with claim 17 in which said control system includes a
  • Proportional+Integral+Sigma(P+I+.sigma.) adaptation law for said feedback controller, which law operates in continuous time, as given by: ##EQU42## where {.sigma..sub.1,.sigma..sub.2,.sigma..sub.3 } are positive scalar design parameters.
BACKGROUND OF THE INVENTION

1. Origin of the Invention The invention described herein was made in the performance of the work under a NASA Contract and is subject to the provisions of Public Law 96517 (35 USC 202) in which the contractor has elected not to retain title.

US Referenced Citations (24)
Number Name Date Kind
3758762 Littman et al. Sep 1973
4200827 Oswald Apr 1980
4341986 Browder Jul 1982
4347578 Inaba Aug 1982
4362977 Evans et al. Dec 1982
4458321 Whitney et al. Jul 1984
4540923 Kade et al. Sep 1985
4546426 Hafner et al. Oct 1985
4546443 Oguchi et al. Oct 1985
4547858 Horak Oct 1985
4563735 Hiroi et al. Jan 1986
4603284 Perzley Jul 1986
4621332 Sugimoto et al. Nov 1986
4639652 Takahashi et al. Jan 1987
4663703 Axelby et al. May 1987
4670843 Matsumura et al. Jun 1987
4679136 Shigemasa Jul 1987
4719561 Shigemasa Jan 1988
4725942 Osaka Feb 1988
4733149 Culberson Mar 1988
4763276 Perreirra et al. Aug 1988
4773025 Penkar et al. Sep 1988
4792737 Goff et al. Dec 1988
4860215 Seraji Aug 1989