This application is a new US patent application that claims benefit of JP 2010-035160, filed Feb. 19, 2010, the content of JP 2010-035160 being incorporated herein by reference.
This invention relates to a robot having a learning control function and, in particular, to a robot whose operation speed is increased using a sensor mounted on an arm of the robot.
In a robot, the position and the speed of a member driven by a servo motor are controlled normally by a position feedback control, speed feedback control and current feedback control in such a manner that the position and the speed of the driven member coincide with the commanded position and the commanded speed, respectively.
Even in this feedback control of the position, the speed and current, a trajectory error and a position vibration component occur in a high-speed operation of the robot. Also, in the high-speed operation, the difference in dynamic characteristics between the motor and the arm makes it impossible to measure the trajectory error and the position vibration component of the arm directly from a motor encoder. Therefore, to measure the trajectory error and the position vibration component, it is necessary to mount a sensor directly on the arm. As an example of the learning control with a sensor mounted, a learning control using an acceleration sensor has been disclosed (Patent Document 1).
The robot mechanism unit 1 includes an acceleration sensor 10, an arm 11, an arm forward end portion 12 and a motor (not shown). The motor of the robot mechanism unit 1 is supplied with a signal from the normal control unit 4 of the control unit 2 and drives the arm 11. Further, the motor of the robot mechanism unit 1 moves the arm forward end portion 12 to the desired position and carries out a task such as welding. At the arm forward end portion 12, the acceleration sensor 10 is installed and can acquire the spatial position data (yj(k)) of the arm forward end portion 12. The position data (yj(k)) from the acceleration sensor 10 is output to the learning control unit 3 and used for the learning control. In the foregoing description, reference character j designates the number of times trials are made, k the time, and Ns the number of the times the sampling is made in each trial. Character yd(k) designates a position command, (yj(k)) the amount controlled in the preceding control session, and ej(k) the target correction amount calculated from yd(k) and (yj(k)) through a filter. Also, uj(k) designates the learning correction amount of the preceding control session.
The normal control unit 4 includes a position control unit 41, a speed control unit 42, a current control unit 43, an amplifier 44 and a differentiation means 45. The position control unit 41 receives the position command data (yd(k)) input from outside the control unit 2 and the position information of, for example, the motor of the robot mechanism unit 1, while at the same time outputting the desired position information of the arm forward end portion 12 of the robot mechanism unit 1 to the speed control unit 42. The differentiation means 45 receives the motor position information fed back from the robot mechanism unit 1, calculates the motor speed from it, and outputs the motor speed to the speed control unit 42.
The speed control unit 42 calculates the desired motor speed taking the position information from the position control unit 41 and the motor speed information from the differentiation means 45 into consideration, and outputs the desired motor speed to the current control unit 43. The current control unit 43 receives the current value fed back from the amplifier 44, calculates the current to flow in the motor so as to achieve the desired motor speed input from the speed control unit 42, and outputs the resultant current to the amplifier 44. The amplifier 44 calculates the desired power based on the current value from the current control unit 43, and supplies the desired power to the motor (not shown) of the robot mechanism unit 1.
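The cascade described above (position loop feeding a speed loop feeding a current loop) can be sketched as follows. This is only an illustration under simplifying assumptions: each loop is reduced to a proportional gain, and the function name `cascade_step` and the gains `kp_pos`, `kp_vel` and `kp_cur` are hypothetical names that do not appear in the description.

```python
def cascade_step(pos_cmd, motor_pos, motor_speed, motor_current,
                 kp_pos, kp_vel, kp_cur):
    """One update of a position -> speed -> current cascade.

    Assumption: every loop is a pure proportional controller. The
    description does not state the control law used in each unit.
    """
    # Position control unit 41: position error -> desired motor speed
    speed_cmd = kp_pos * (pos_cmd - motor_pos)
    # Speed control unit 42: speed error -> desired motor current
    current_cmd = kp_vel * (speed_cmd - motor_speed)
    # Current control unit 43: current error -> command to amplifier 44
    amp_cmd = kp_cur * (current_cmd - motor_current)
    return speed_cmd, current_cmd, amp_cmd


if __name__ == "__main__":
    print(cascade_step(1.0, 0.5, 0.0, 0.0, 2.0, 3.0, 4.0))
```

In this sketch the motor speed would come from differentiating the fed-back motor position, as the differentiation means 45 does.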
The learning control unit 3 includes a one-trial delay unit W−1 300, a first memory 31, a learning controller L(q) 32, a low-pass filter Q(q) 33, a second memory 34 and a third memory 35. The first memory 31 is supplied with and stores, through a filter, a target correction amount ej(k) based on the position command data (yd(k)) for the arm forward end portion 12 and the position data (yj(k)) measured by the acceleration sensor 10, while at the same time outputting the target correction amount ej(k) to the learning controller L(q) 32. The target correction amount ej(k) corresponds to the trajectory and vibration errors with respect to the desired position of the arm forward end portion 12.
The learning controller L(q) 32, by executing the task program stored therein, calculates the learning correction amount uj+1(k) from the target correction amount ej(k) and the preceding learning correction amount uj(k), and outputs the learning correction amount uj+1(k) to the low-pass filter Q(q) 33. The learning correction amount uj+1(k) input to the low-pass filter Q(q) 33 is output to and stored in the second memory 34 while at the same time being added to the position error data calculated by the position control unit 41 of the normal control unit 4.
Based on the position error data thus corrected, the robot mechanism unit 1 is controlled and the learning control is repeated. In the learning control, this series of processes is repeatedly executed to converge the position error to “0”. After completion of the learning control, the loop for updating the learning correction amount indicated by the dotted line in
Patent Document 1: JP-A-2006-172149
In the conventional learning control, the improvement in the trajectory and vibration errors under a certain condition is considered. However, the problem is that the application range is narrow, and operating convenience is not taken into consideration.
The conventional technique described above as an example of the learning control using a sensor, which represents an application to a machine tool, assumes the use of an acceleration sensor. In the case where the acceleration sensor is mounted on the robot, on the other hand, the problem arises that the trajectory error and the position error, though capable of being extracted in orthogonal coordinates, cannot be calculated on each axis directly from the sensor data.
Also, according to the conventional technique described above, the normal high-pass filter is used to extract the trajectory and vibration errors from the acceleration sensor. In the machine tool, the frequency band for feedback control is as high as several tens of Hz to several hundred Hz, or in other words, the feedback control has a very high performance in this frequency band, and therefore, no serious problem is posed even in the case where the data of not more than 10 Hz cannot be learned to remove the offset data. Thus, the offset is not a great problem. In the industrial robot, on the other hand, the frequency band for feedback control is normally several Hz. In a higher frequency band, the feedforward control is conducted, and the performance is liable to depend on the intermodel error. Therefore, the particular part is corrected by learning control. In the case where a high-pass filter of 1 Hz is used to remove the offset of the data from the acceleration sensor, for example, the phase of the trajectory and the vibration errors of up to about 10 Hz rotates, and therefore, trajectory and vibration error data in the frequency band to be removed are also processed undesirably, thereby posing the problem that learning control performance is deteriorated.
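The phase rotation mentioned above can be made concrete with a small calculation. The sketch below assumes a first-order high-pass filter, which the description does not specify, so the numbers are indicative only. The function name `highpass_phase_deg` is hypothetical.

```python
import math


def highpass_phase_deg(f_hz, cutoff_hz):
    """Phase lead in degrees of a first-order high-pass filter at f_hz.

    Assumption: H(jw) = (jw/wc) / (1 + jw/wc), whose phase is atan(wc/w).
    """
    return math.degrees(math.atan2(cutoff_hz, f_hz))


if __name__ == "__main__":
    # Phase rotation introduced by a 1 Hz high-pass filter
    for f in (1.0, 2.0, 5.0, 10.0):
        print(f"{f:5.1f} Hz: {highpass_phase_deg(f, 1.0):5.1f} deg")
```

Under this assumption the filter still shifts the phase by several degrees at 10 Hz, ten times its cutoff frequency, which is consistent with the concern stated above.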
Another problem is difficulty in adjusting the learning controller. Although various adjustment methods have been proposed, the problems that the number of controllers is high, stability is reduced and the vast amount of matrix calculation is required remain unsolved. Under the circumstances, the adjustment is made by trial and error in most work fields. Also, the fact that the robot system changes in accordance with the posture of the robot increases the difficulty of adjustment by trial and error. At present, an industrial robot having the learning function to increase the speed by adjusting the parameters automatically is still unavailable.
According to one aspect of the invention, there is provided a robot comprising a robot mechanism unit having a sensor on a part with the position thereof to be controlled and a control unit for controlling the operation of the robot mechanism unit, wherein the control unit includes a normal control unit for controlling the operation of the robot mechanism unit and a learning control unit for causing the robot mechanism unit to operate according to a task program and conducting the learning operation to calculate the learning correction amount in order that the position of the part of the robot mechanism unit to be controlled and detected by the sensor is made to approach a target trajectory or position assigned to the normal control unit, and wherein the learning control unit conducts the learning operation in such a manner that by calculating the maximum speed override that can be set in the learning operation and increasing the speed override a plurality of times until reaching the maximum speed override, the learning correction amount is calculated.
According to another aspect of the invention, the learning control unit may calculate the maximum speed override based on the maximum speed and the maximum acceleration allowed for the robot mechanism unit.
According to still another aspect of the invention, the learning control unit may include a high-pass filter for calculating the trajectory and vibration errors of the robot mechanism unit based on the data detected by the sensor.
According to yet another aspect of the invention, the learning control unit desirably calculates the position on each axis containing the trajectory position and vibration errors by inverse kinematics of the data detected by the sensor to those on the three basic axes.
According to a further aspect of the invention, the learning control unit may calculate the position and inclination of the sensor by causing the robot mechanism unit to perform a predetermined operation.
According to a still further aspect of the invention, the learning control unit desirably further includes a memory for holding the learning correction amount.
According to a yet further aspect of the invention, the sensor may be one of a vision sensor, an acceleration sensor, a gyro sensor, an inertia sensor and a distortion sensor.
According to yet another aspect of the invention, the sensor may desirably further include a mounting means or more desirably a magnet replaceably mounted on the robot mechanism unit as the mounting means.
According to this invention, the learning control unit conducts the learning operation by calculating the maximum speed override that can be set in the learning operation, and while increasing the speed override a plurality of times before reaching the maximum speed override, calculates the learning correction amount, thereby making it possible to increase the speed automatically in the learning operation.
These and other features and advantages of the present invention will be better understood by reading the following detailed description taken together with the drawings wherein:
The robot according to the invention is explained below with reference to the drawings. However, it should be noted that the technical scope of this invention is not limited to the embodiments described below and covers the invention described in the appended claims and equivalents thereof.
According to this invention, the speed of the spot operation is increased. First, the configuration of the robot mechanism unit of the robot according to the invention is shown in
Although the acceleration sensor is used in the embodiment of the invention described above, a vision sensor may be used in place of the acceleration sensor. An example in which the vision sensor is used is shown in
After mounting the acceleration sensor 10, the robot mechanism unit is operated as predetermined for calibration to calculate the position and inclination of the sensor 10. The calibration is made according to the steps described below.
First, the inclination of the acceleration sensor is specified. As shown in
Similarly, the operation is performed along Y axis from a given point P0, and by passing through a given point P2, the corresponding acceleration data a2 is acquired. In the process, the acceleration ayγ exceeding the gravitational acceleration (stationary acceleration) can be expressed as ayγ=a2−a0. The standardization is defined as follows.
The vector orthogonal to these two data is given as azγ=ax×ay and can be expressed as follows.
As a result, the matrix Rt for transforming the posture to the world coordinate system from the tool coordinate system is expressed as follows.
$R_t = [\, a_x \;\; a_y \;\; a_z \,]$
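The inclination calibration above can be sketched in a few lines. The sketch assumes that the X-axis data is obtained in the same way as the Y-axis data (a1 acquired while passing a point P1, with axγ=a1−a0) and that the standardization is a division by the Euclidean norm. Neither equation is reproduced in this text, so both are assumptions. The function names are hypothetical.

```python
import math


def _sub(u, v):
    return [a - b for a, b in zip(u, v)]


def _normalize(v):
    n = math.sqrt(sum(c * c for c in v))
    return [c / n for c in v]


def _cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]


def sensor_rotation(a0, a1, a2):
    """Build Rt = [ax ay az] from three acceleration readings.

    a0: reading at rest (gravity only), a1: reading during the X move,
    a2: reading during the Y move. Normalization by the Euclidean norm
    is an assumption.
    """
    ax = _normalize(_sub(a1, a0))   # unit vector along the X motion
    ay = _normalize(_sub(a2, a0))   # unit vector along the Y motion
    az = _cross(ax, ay)             # orthogonal to both
    # ax, ay, az are the columns of Rt
    return [[ax[i], ay[i], az[i]] for i in range(3)]


if __name__ == "__main__":
    for row in sensor_rotation([0, 0, 9.8], [2, 0, 9.8], [0, 3, 9.8]):
        print(row)
```

If the measured ax and ay are not exactly perpendicular, az computed this way is not of unit length. A practical implementation would probably re-orthogonalize the three vectors, which this sketch omits.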
Then, parts J5 and J6 are operated to specify the position of the acceleration sensor. First, as shown in
$(\ddot{\varphi}_{1x}, \ddot{\varphi}_{1y}, \ddot{\varphi}_{1z})$
Then, the sensor displacement Δφ1 is expressed as
$\Delta\varphi_1 = \iint \sqrt{\ddot{\varphi}_{1x}^{2} + \ddot{\varphi}_{1z}^{2}}\; dt\, dt$
In this case, the offset amount Δx in X direction of the world coordinate system is expressed as Δx=Δφ1/Δθ1.
As shown in
$(\ddot{\varphi}_{2x}, \ddot{\varphi}_{2y}, \ddot{\varphi}_{2z})$
Then, the sensor displacement Δφ2 is expressed as follows.
$\Delta\varphi_2 = \iint \sqrt{\ddot{\varphi}_{2y}^{2} + \ddot{\varphi}_{2z}^{2}}\; dt\, dt$
The relation holds that γ=Δφ2/θ2, and the offset amount Δy in Y direction of the world coordinate system is calculated as Δy=γ cos θ2, and the offset amount Δz in Z direction of the world coordinate system as Δz=γ sin θ2.
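The position calibration above reduces to a double time integral of an acceleration magnitude followed by a division by the joint rotation. A minimal numerical sketch follows. It assumes uniformly sampled data, zero initial velocity and displacement, and trapezoidal integration, none of which is stated in the description. The function names are hypothetical.

```python
import math


def double_integrate(samples, dt):
    """Trapezoidal double integral of a uniformly sampled signal.

    Assumption: velocity and displacement are both zero at the start.
    """
    vel = 0.0
    disp = 0.0
    prev = samples[0]
    for a in samples[1:]:
        new_vel = vel + 0.5 * (prev + a) * dt
        disp += 0.5 * (vel + new_vel) * dt
        vel, prev = new_vel, a
    return disp


def sensor_displacement(acc_u, acc_v, dt):
    """Displacement from two acceleration components, following the
    printed expression: double integral of sqrt(u^2 + v^2)."""
    magnitude = [math.hypot(u, v) for u, v in zip(acc_u, acc_v)]
    return double_integrate(magnitude, dt)


if __name__ == "__main__":
    n = 101                      # 1 s of data at dt = 0.01 s
    d_phi1 = sensor_displacement([2.0] * n, [0.0] * n, 0.01)
    d_theta1 = 0.5               # hypothetical joint rotation in rad
    print(d_phi1 / d_theta1)     # offset, as in the relation for the X direction
```

The gravity component would have to be removed from the readings before integration. The sketch takes already-compensated accelerations as input.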
Next, the learning controller is designed. Initially, the frequency response from the input of the learning correction amount for each axis to the position estimated based on the acceleration sensor is measured. Also, the block of the learning control is plotted as shown in
A schematic diagram of the robot according to an embodiment of the invention is shown in
The learning control unit 3 operates the robot mechanism unit 1 according to a task program, and carries out the learning to calculate the learning correction amount in order that the position of the part of the robot mechanism unit 1 to be controlled and detected by the acceleration sensor 10 is made to approach the target position yd(k) assigned to the normal control unit 4. The configuration of other parts than the learning control unit 3 is similar to that of the conventional robot shown in
The linear matrix inequality is the problem of calculating the value $x$ minimizing $c^T x$ ($c \in \mathbb{R}^m$) under the following restraint.
where Fi is a positive semidefinite matrix.
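For orientation, the standard textbook form of such a problem is given below. It is included as an assumption about what the restraint looks like, since the equation itself is not reproduced in this text. In the standard form the matrices $F_i$ are symmetric.

```latex
\min_{x \in \mathbb{R}^m} \; c^{T} x
\quad \text{subject to} \quad
F(x) = F_0 + \sum_{i=1}^{m} x_i F_i \succeq 0
```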
Now, assume that the learning controller is expressed as
L(z)=L0z−N
where $N_0 \in \mathbb{Z}$, and that the relation holds that
x=[γ2 L0 L1 . . . L2N
where
γεR,LkεRN
Then, the condition for guaranteeing the stability and the monotonic decrease of the learning controller is expressed in the frequency domain as shown below.
∥Q(z)(I−L(z)P(z))∥∞=γ<1
In this equation, Q(z) designates a low-pass filter whose cutoff frequency is set to the learning band, L(z) the learning control filter, and P(z) the transfer function from the input of the learning correction amount to the object to be controlled. The smaller the value γ, the higher the performance of the learning controller. The optimization problem of the learning controller is to calculate the learning filter L(z) that gives the minimum value γ in a given learning control frequency band. This equation can be rewritten as follows.
where assuming that the relation $\varphi_k(z) = z^{-N_0+(k-1)}$ holds, the equation expressed as
is obtained.
In this equation, Kk can be expressed by the linearity of αk,j and Vj, where Vj is in the same dimension as Lk, and always zero for other than the element (j,i). For example, Ny=2, Nu=2.
Also,
As a result, this equation is rewritten as
Considering the first and second terms of this equation as $\sum_{i=1}^{m} x_i F_i$ and the third term as $F_0$, and the equation defined as
c=[1 0 0 . . . 0], x=└γ2 β1 β2 . . . β(2N
Then, the equation is expressed as $\sum_{i=1}^{m} x_i F_i + F_0$.
This is equivalent to the restraint of the linear matrix inequality (1), and the minimization problem reduces to the problem of minimizing $c^T x$, i.e. $\gamma^2$. This can also be interpreted as the optimization problem of the learning controller. Thus, the sufficient condition for stability and monotonic decrease is given as
$\sum_{i=1}^{m} x_i F_i + F_0$
By measuring $P(e^{j\Omega_i})$ experimentally and determining the learning band filter Q(z), the learning filter L(z) can be determined automatically.
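The stability condition can be checked numerically for a candidate learning filter once the frequency response has been measured. The sketch below only evaluates γ on a frequency grid for a single-input, single-output case. It does not solve the linear matrix inequality, and it is not the procedure of the description. The function `learning_gamma` and its arguments are hypothetical, and the filter is assumed to be a finite sum of powers of z starting at exponent −N0.

```python
import cmath


def learning_gamma(P, Q, L_coeffs, N0, omegas):
    """Largest value of |Q (1 - L P)| over the measured frequencies.

    P, Q     : complex frequency responses sampled at `omegas`
    L_coeffs : coefficients L_0, L_1, ... of the assumed filter
               L(z) = sum_k L_k * z**(-N0 + k)
    omegas   : normalized frequencies in rad/sample
    """
    gamma = 0.0
    for p, q, w in zip(P, Q, omegas):
        z = cmath.exp(1j * w)
        Lz = sum(c * z ** (-N0 + k) for k, c in enumerate(L_coeffs))
        gamma = max(gamma, abs(q * (1 - Lz * p)))
    return gamma


if __name__ == "__main__":
    omegas = [0.1, 0.5, 1.0]
    P = [0.5, 0.5, 0.5]          # hypothetical flat plant response
    Q = [1.0, 1.0, 1.0]          # learning band covers all three points
    print(learning_gamma(P, Q, [1.0], 0, omegas))
```

A value below 1 on every measured frequency corresponds to the condition γ<1. A solver for the linear matrix inequality would search over the coefficients instead of taking them as given.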
Further, considering the robustness of the learning controller, the feature of the robot is that the system thereof is varied greatly with the posture.
Assume that a given posture is determined as a reference posture, and that Pn(z) is the learning system for the reference posture. Then, an arbitrary posture Pm(z) is expressed as Pm(z)=Pn(z)+ΔPm(z), where ΔPm(z) is the amount of change of the learning system from the reference posture. In this case, the restraint with the learning band filter Q(z) available is expressed as
Considering that the relation xm=└γm2 β1,m β2,m . . . β(2N
$c^T x_m \le 1$
By measuring $P(e^{j\Omega_i})$ experimentally for the number m of postures, the learning controller can be automatically determined as in the preceding case.
Next, the data processing steps for the learning controller are explained. As shown in
After completion of the operation, the position transformer 35 estimates the trajectory/vibration error of the orthogonal coordinate, and by use of the high-pass filter 36 providing a zero-phase high-pass filter, the trajectory/vibration error Δr for other than the offset is extracted. This trajectory/vibration error is added to the position data r of the sensor estimated using FK from the motor position feedback (FB) data thereby to estimate the sensor position in the orthogonal coordinate system of the acceleration sensor 10 including the dynamics of the arm.
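One common way to obtain a zero-phase high-pass filter on recorded data is to run a causal filter forward and then backward over the samples. The sketch below uses a first-order filter for this purpose. The description does not say how its filter 36 is realized, so the filter order, the function names and the parameters are all assumptions.

```python
import math


def _highpass(x, alpha):
    """Causal first-order high-pass, started from a zero output."""
    if not x:
        return []
    y = [0.0]
    for n in range(1, len(x)):
        y.append(alpha * (y[-1] + x[n] - x[n - 1]))
    return y


def zero_phase_highpass(x, cutoff_hz, dt):
    """Forward-backward filtering: the phase shifts of the two passes
    cancel, so the extracted error is not shifted in time."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = rc / (rc + dt)
    forward = _highpass(x, alpha)
    backward = _highpass(forward[::-1], alpha)
    return backward[::-1]


if __name__ == "__main__":
    # A constant offset is removed completely
    print(max(abs(v) for v in zero_phase_highpass([3.0] * 50, 1.0, 0.01)))
```

Because the whole trial is recorded before processing, the backward pass is possible here. It would not be possible in a filter running in real time.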
By inverse transformation of the sensor position estimated from the sensor into the three basic axes, the position on each axis including the arm dynamics is calculated. From this position on each axis including the arm dynamics, the position on each axis not including the arm dynamics, i.e. the motor position, is subtracted thereby to calculate the target correction amount on each axis. In the equation shown below, ψj designates the target correction amount on each axis for the j-th trial, IK the inverse transformation, and θmj the motor position on each axis for the j-th trial.
$\psi_j = IK^{-1}(r + \Delta r) - \theta_{mj}$
By inputting this target correction amount for each axis to the learning controller, the correction amount uj+1(k) for the next trial is calculated. Through the learning controller L(q) 32, the learning correction amount uj for the preceding trial is added from the third memory 35, and the correction amount uj+1 for the next trial is calculated through the low-pass filter Q(q) 33.
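The update described above can be read as u_{j+1} = Q(u_j + L ψ_j). The sketch below illustrates that reading for one axis. The learning filter is simplified to a gain with a time advance and the low-pass filter to a symmetric moving average. Both are stand-ins chosen for brevity, not the filters of the description, and all names are hypothetical.

```python
def learning_update(u_prev, psi, gain, lead, q_halfwidth):
    """Correction for the next trial from the previous correction u_prev
    and the target correction amount psi of the same trial.

    Assumptions: L(q) is `gain` times a shift of `lead` samples, and
    Q(q) is a moving average over 2 * q_halfwidth + 1 samples.
    """
    n = len(u_prev)
    # L(q) applied to psi, then the stored u_j is added
    raw = [u_prev[k] + gain * psi[min(k + lead, n - 1)] for k in range(n)]
    # Q(q): symmetric (zero-phase) smoothing, with shorter windows at the ends
    smoothed = []
    for k in range(n):
        lo = max(0, k - q_halfwidth)
        hi = min(n, k + q_halfwidth + 1)
        smoothed.append(sum(raw[lo:hi]) / (hi - lo))
    return smoothed


if __name__ == "__main__":
    u1 = learning_update([0.0] * 5, [1.0] * 5, gain=0.5, lead=2, q_halfwidth=1)
    print(u1)
```

Calling the function once per trial, with the previous result passed back in as `u_prev`, mirrors the role of the memory that holds the preceding correction amount.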
Next, the steps of increasing the operation speed of the robot mechanism unit in the learning control operation according to the invention are explained with reference to
First, in step S101, the maximum speed override that can be set during the learning is calculated by the learning control unit. The maximum speed override is calculated based on the maximum speed and the maximum acceleration allowed for the robot mechanism unit. In the first place, the robot mechanism unit is operated once, and the maximum speed override that can be learned for each axial motor in terms of the maximum acceleration and the maximum speed is calculated from the data on the first trial.
As in the first step, the maximum speed override that can be learned for each axial motor is calculated from the viewpoint of the maximum acceleration. The equation of motion of the robot is defined as follows.
$\tau = M(\Theta)\ddot{\Theta} + V(\Theta,\dot{\Theta}) + G(\Theta)$
where Θ is the position and speed of the arm.
In this equation, $M(\Theta)$ is the matrix of the inertia term, $V(\Theta,\dot{\Theta})$ the vector of the speed term, and $G(\Theta)$ the vector of the gravity term. A great amount of torque is used mainly for acceleration or deceleration. Assuming that the torque increase due to the increase in speed override is mainly caused by $M(\Theta)\ddot{\Theta}$, the approximate value of the maximum speed override ovr_max1,i is calculated from the viewpoint of acceleration and deceleration.
Assume that the maximum torque for the first trial is $\tau_{\max,i}$, the maximum torque tolerance of the motor is $\tau_{p,i}$, and the torque $M(\Theta)\ddot{\Theta}$ used for acceleration and deceleration is $\tau_{a,i}$. Then, the equation below is obtained.
$\tau_{a,i} = \tau_{\max,i} - (V(\Theta,\dot{\Theta}) + G(\Theta))_i$
In the process, considering that ovr_max1,i is proportional to the square of $\tau_{a,i}$, the relation holds that
where the affix i indicates the i-th axis.
In the manner described above, the maximum override ovr_max1,i is obtained from the viewpoint of the maximum acceleration.
Similarly, the maximum speed override ovr_max2,i is calculated from the viewpoint of the maximum speed. Assuming that the maximum speed for the first trial is ωv,i and the maximum speed tolerance of the motor is ωp,i, the equation shown below is obtained.
As described above, the maximum speed override ovr_max2,i is obtained from the viewpoint of the maximum speed. In addition to the two conditions described, the minimum speed override among the axes constitutes the maximum speed override usable for learning control. Thus, these conditions are collectively expressed as ovr_max. Then, the following formula is obtained.
Assuming that the amount of the speed override increased in one step is Δ, the number n of steps is calculated using Δ as shown below.
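The calculation of the maximum speed override and of the number of steps can be sketched as follows. The formulas themselves are not reproduced in this text, so the sketch rests on assumptions that should be checked against the original equations: the acceleration torque grows with the square of the speed override (time-scaling a trajectory by a factor s scales accelerations by s²), the motor speed grows linearly with it, the override is expressed as a ratio to the first trial, and the number of steps is rounded up. All function and parameter names are hypothetical.

```python
import math


def max_speed_override(tau_max, tau_vg, tau_p, omega_v, omega_p):
    """Smallest per-axis limit, as a ratio to the first-trial speed.

    tau_max : peak torque per axis in the first trial
    tau_vg  : speed-term plus gravity-term torque per axis
    tau_p   : allowed motor torque per axis
    omega_v : peak speed per axis in the first trial
    omega_p : allowed motor speed per axis
    Assumption: all values are positive magnitudes and tau_max > tau_vg.
    """
    limits = []
    for t_max, t_vg, t_p, w_v, w_p in zip(tau_max, tau_vg, tau_p,
                                          omega_v, omega_p):
        tau_a = t_max - t_vg                       # torque spent on acceleration
        ovr1 = math.sqrt((t_p - t_vg) / tau_a)     # acceleration limit (assumed form)
        ovr2 = w_p / w_v                           # speed limit (assumed form)
        limits.append(min(ovr1, ovr2))
    return min(limits)                             # the most restrictive axis decides


def override_steps(ovr_max, ovr_cur, delta):
    """Number of increments of size delta needed to reach ovr_max."""
    return math.ceil((ovr_max - ovr_cur) / delta)


if __name__ == "__main__":
    ovr_max = max_speed_override([10.0, 5.0], [2.0, 1.0], [34.0, 10.0],
                                 [100.0, 50.0], [150.0, 100.0])
    print(ovr_max, override_steps(ovr_max, 1.0, 0.25))
```

Taking the minimum over both conditions and over all axes follows the statement that the minimum speed override among the axes is the one usable for learning control.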
The learning is carried out by increasing the speed override to the maximum speed override in n steps, for example, and the learning correction amount is thus calculated. Specifically, in step S102, the learning control unit repeats the learning several times by increasing the speed override by a predetermined amount, and after the vibration is converged, calculates the learning correction amount in step S103.
Next, in step S104, the learning control unit determines whether the speed override has increased beyond the maximum speed override or not. In the case where the speed override is not greater than the maximum speed override, the learning control unit carries out the learning by increasing the speed override by a predetermined amount in step S102. In the case where the speed override has exceeded the maximum speed override, on the other hand, the learning control unit holds the learning correction amount in an F-ROM or a memory card (MC) in step S105.
In this way, the process for increasing the speed override and the process for carrying out the learning are repeated alternately until the speed override reaches the maximum speed override, thereby increasing the operation speed to a high level. During the actual operation, the learning correction amount is reproduced by accessing the F-ROM or the memory card (MC), as the case may be.
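The alternation of speed increase and learning in steps S102 to S105 can be outlined as a loop. The sketch below is a simplified reading: the convergence check is replaced by a fixed number of trials per step, the override is clamped at the maximum instead of being allowed to exceed it, and `run_trial` is a hypothetical callback standing in for one learning trial on the robot.

```python
def learn_with_increasing_override(ovr_max, delta, run_trial,
                                   trials_per_step=3, ovr_start=1.0):
    """Raise the speed override step by step, learning at each step.

    run_trial(ovr, correction) -> updated correction (hypothetical).
    Returns the final override and the correction to be stored.
    """
    if delta <= 0:
        raise ValueError("delta must be positive")
    ovr = ovr_start
    correction = None
    while True:
        ovr = min(ovr + delta, ovr_max)        # S102: increase the override
        for _ in range(trials_per_step):       # S102/S103: repeat the learning
            correction = run_trial(ovr, correction)
        if ovr >= ovr_max:                     # S104: maximum reached?
            break
    return ovr, correction                     # S105: caller stores the result


if __name__ == "__main__":
    visited = []

    def fake_trial(ovr, correction):
        visited.append(ovr)
        return (correction or 0) + 1

    print(learn_with_increasing_override(1.5, 0.25, fake_trial, trials_per_step=2))
    print(visited)
```

Storing the returned correction in non-volatile memory and reloading it during actual operation corresponds to the use of the F-ROM or the memory card described above.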
In the embodiments of this invention described above, the acceleration sensor is used as the sensor mounted on the robot mechanism unit. Nevertheless, a vision sensor, a gyro sensor, an inertia sensor or a distortion gauge may alternatively be used.
Number | Date | Country | Kind |
---|---|---|---|
2010-035160 | Feb 2010 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
6167328 | Takaoka et al. | Dec 2000 | A |
6522949 | Ikeda et al. | Feb 2003 | B1 |
7205743 | Iwashita et al. | Apr 2007 | B2 |
7996110 | Lipow et al. | Aug 2011 | B2 |
8170719 | Tsusaka et al. | May 2012 | B2 |
8175749 | Tsusaka et al. | May 2012 | B2 |
20050107920 | Ban et al. | May 2005 | A1 |
20050113977 | Nihei et al. | May 2005 | A1 |
20060082340 | Watanabe et al. | Apr 2006 | A1 |
20090125146 | Zhang et al. | May 2009 | A1 |
Number | Date | Country |
---|---|---|
06-289918 | Oct 1994 | JP |
2001022423 | Jan 2001 | JP |
2005-153047 | Jun 2005 | JP |
2005149299 | Jun 2005 | JP |
2006110702 | Apr 2006 | JP |
2006-172149 | Jun 2006 | JP |
2007144623 | Jun 2007 | JP |
Number | Date | Country | |
---|---|---|---|
20110208356 A1 | Aug 2011 | US |