This invention generally relates to control methods and systems in industrial automation systems and, in particular, to non-contact control methods and systems in such industrial automation systems.
It is often necessary to “couple” the linear motion of an assembly line or other linear conveyance to the action of a robot or other automated assembly or automated inspection mechanism which must “follow” the motion of an assembly line as an object proceeds along the assembly line. The purpose of this “coupling” or “following” is to maintain a steady relative position between an object traveling along the assembly line and a mechanism which is tasked with inspecting or somehow affecting that object traveling along the assembly line.
This coupling of motions is often achieved by means of mechanical coupling between a linear or rotational position encoder and a cogwheel used to drive an assembly line. Typically, in the current state of the art, a rotary position encoder is attached to a drive shaft or other rotating shaft which is used to create linear motion along a conveyor. As the drive shaft rotates, the rotation of the drive shaft is measured by the rotational encoder, the rotation of the drive shaft is encoded into a train of quadrature pulses which is transmitted electronically to a robot or other device. The quadrature pulses are used to measure linear position; the rate of quadrature pulses is used to measure linear velocity; and so on. The robot or other mechanism interprets these quadrature pulses to derive the position of an object conveyed by the assembly line.
The following U.S. patent documents are related to the present invention: U.S. Pat. Nos. 5,852,413; 6,124,589; 5,850,021; 7,719,224; 8,493,496; 9,393,695; and 2013/0329012; 2014/0261453; and 2018/0031364.
This technology has many limitations as follows:
Inaccurate—due to the backlash in the drive motors, couplings, and conveyor chain assemblies, inaccuracies of several hundred millimeters are common;
Unwieldy—a conveyor segment hundreds of meters long requires an electrical connection hundreds of meters long between the drive shaft's encoder and the robot or other mechanism which is attempting to follow the line;
Decoupled—by its very nature, any relative motion between the conveyor and an object riding on the conveyor cannot be measured by a rotary encoder fixed to a drive shaft. Experience shows that the inertia of heavy objects on an assembly line may drive a heavy object forwards or the object may even bounce backwards when the conveyor's drive motor stops and starts; and
Not Scalable—there are few drive shafts in any given conveyor assembly, but there may be many assembly stations requiring following the line.
Devices for the calculation of a limited set of range data from an electronic representation of a visible scene are also well known in the prior art. Typically, these devices employ a 2D sensor and one or more beams of radiation configured so that the beams of radiation intersect an object in the field of view of the 2D sensor, and some radiation from those beams is reflected by that object back to the 2D sensor. The mathematics of triangulation is used to calculate the range to the object for those pixels illuminated by the beam(s) of radiation (see, for example, U.S. Pat. Nos. 3,180,205 and 4,373,804). Using terms of the art: a picture element (designated by its horizontal and vertical coordinates within an imaging array) for which range data is known is termed a volume element or ‘voxel.’
Techniques similar to those disclosed in U.S. Pat. Nos. 3,180,205 and 4,373,804 generate a relatively small set of range data. This limitation was overcome by the invention of three-dimensional sensors which produce range data for all, or nearly all, picture elements in their imaging arrays, and hence much more complete range data for objects in their fields of view. See, for example, U.S. Pat. No. 4,195,221, which utilizes time of flight techniques, U.S. Pat. No. 5,081,530 which utilizes scanning beam techniques, or U.S. Pat. No. 6,751,344 which utilizes projected patterns to obtain voxels over an extended field of view. In recent years, the ideas in these early patents have been developed further so that relatively inexpensive consumer-grade 3D sensors are available commercially. For example, a 3D sensor based on the time of flight principle is the DepthSense DS325. A 3D sensor that derives depth from projected structured light is the PrimeSense Carmine. A 3D sensor that utilizes a scanning beam technique is the LMI Gocator.
In view of the above, there is a need for a non-contact, robust method and system that tracks the position of an object as it travels along an assembly line. Such tracking, done properly, can enable assembly and other industrial tasks to be accurately performed on a moving object.
It is the object of at least one embodiment of the present invention to overcome the above-noted shortcomings of the mechanical encoder method by providing a non-contact, ‘virtual’ encoder method and system. The virtual encoder method and system uses 3D sensing technology to make continuous, non-contact measurements of an object as it moves along an assembly line. In one embodiment, position and velocity measurements are used to create a train of quadrature signals which are electrically indistinguishable from the train of quadrature signals issued from a standard position encoder. This train of signals may then be fed to a robot or other apparatus configured to accept input from a mechanical position encoder.
In carrying out the above object and other objects of at least one embodiment of the present invention, a non-contact method of controlling an industrial automation machine is provided. The method includes providing at least one 3D or depth sensor at a vision station located in an industrial environment. Each sensor has a field of view at the vision station to obtain a stream of sensor data representative of a surface shape of an inanimate object moving along or about an axis in the vision station within its field of view. The method also includes tracking the pose of the object within the vision station as a function of time based on each stream of sensor data to obtain at least one stream of poses and generating a command signal for use by the industrial automation machine based on at least one stream of poses to control the machine.
The industrial automation machine may be configured to perform a task on the object.
The command signal may be a trigger signal.
The command signal may be a stream of quadrature signals for each axis.
The machine may be an inspection machine to inspect the object.
The machine may be configured to change or affect the object.
Motion of the object may be a linear motion in the vision station.
The step of tracking may include the steps of: processing each stream of sensor data in real time to obtain at least one stream of estimated poses; processing the at least one stream of estimated poses in real time to obtain kinematic state estimates of the object; and processing the kinematic state estimates in real time to obtain an evolution of the state of the object.
The step of processing the kinematic state estimates may utilize a transient model or a steady state model.
Each of the sensors may be mounted in a fixed position within the vision station.
The method may further include determining a multidimensional offset of the object from a reference pose and generating an offset signal for use by the industrial automation machine based on the offset.
Further in carrying out the above object and other objects of at least one embodiment of the present invention, a non-contact system for controlling an industrial automation machine is provided. The system includes at least one 3D or depth sensor. Each sensor has a field of view at a vision station to obtain a stream of sensor data representative of a surface shape of an inanimate object moving along or about an axis in the vision station within its field of view. The system also includes a tracker to track the pose of the object within the vision station as a function of time based on each stream of sensor data to obtain at least one stream of poses and a signal generator to generate a command signal for use by the industrial automation machine based on the at least one stream of poses to control the machine.
The tracker may include at least one programmed processor: to process each stream of sensor data in real time to obtain at least one stream of estimated poses; to process the at least one stream of estimated poses in real time to obtain kinematic state estimates of the object; and to process the kinematic state estimates in real time to obtain an evolution of the state of the object.
The programmed processor may utilize a transient model or a steady state model to process the kinematic state estimates.
As required, detailed embodiments of the present invention are disclosed herein; however, it is to be understood that the disclosed embodiments are merely exemplary of the invention that may be embodied in various and alternative forms. The figures are not necessarily to scale; some features may be exaggerated or minimized to show details of particular components. Therefore, specific structural and functional details disclosed herein are not to be interpreted as limiting, but merely as a representative basis for teaching one skilled in the art to variously employ the present invention.
“Following the Line”—tracking the pose of an object as it moves along a conveyor
“Target Object”—an object moving along a conveyor.
“Effector”—a Tool which changes a Target Object in some way, e.g. a robot.
“Inspector”—a Tool which measures a Target Object, e.g. a Gap and Flush sensor.
The “Virtual Encoder” method and system of the present invention is a method and apparatus to estimate the 6DOF pose of a Target Object as a function of time, then to communicate this information in some usable fashion. The consumer of the information is a Tool, which may be an Effector or an Inspector. The Virtual Encoder is distinguished from a Mechanical Encoder in that the Virtual Encoder is a non-contact device.
The Virtual Encoder method and apparatus is a superior means for determining the position of objects along a conveyor line, compared with present methods using Mechanical Encoders. The Virtual Encoder enables assembly methods which have heretofore not been practical because of the limitations of present methods.
=(x, y, z, α, β, γ)t—The pose of a rigid body requires 3 positions+3 rotations
{i, i, i, . . . }—the predicted pose, velocity, acceleration, . . . of an object at time ti
{i, i, i, . . . }—The measured pose, velocity, acceleration, . . . of an object at time ti
i={i, i, i, . . . }—The actual dynamic state of an object
i={i, i, i, . . . }—The predicted dynamic state of an object
i={i, i, i, . . . }—The measured dynamic state of an object
*i={*i, *i, *i, . . . }—The commanded dynamic state of an object
(tn)={0, . . . , n}—A (historical) trajectory of an object's pose through time
(tn)={0, . . . , n}—The (historical) trajectory of an object's state through time
(tn)={0, . . . , n, n+1, n+32, . . . }—The (predicted) evolution of an object's state
(0, . . . n)=(tn)—A kinematic estimator estimates the dynamic state of
M((tn))=(tn)—A kinematic model M predicts an object's state's evolution.
The Virtual Encoder method and system typically estimates the pose of a Target Object between ˜½ meter and ˜5 meters in length in motion at speeds less than ˜1000 mm/sec along a linear conveyor.
The Virtual Encoder method and system typically estimates the pose of Target Objects in motion along a one-dimensional axis (the axis of travel of the linear conveyor) to enable Assembly Operations on the Target Object in motion:
(t)=(x(t),y,z,α,β,γ)t
In at least one embodiment, the five steady-state dimensions {y, z, α, β, γ} may be modeled assuming they are fixed in a ‘reference’ pose. That is: the position and orientation of any point on the Target Object is determined by x(t) and knowledge of the original ‘reference’ orientation and position of the object, 0=(x0, y0, z0, α0, β0, γ0)t
A1(t)=({circumflex over (x)}(t),0,0,0,0,0)t+0
In at least one embodiment, the five steady-state dimensions may be modeled by assuming they have undergone a rigid transform, , relative to the ‘reference pose’. One embodiment requires a measurement of this rigid transform.
A2(t)=({circumflex over (x)}(t),0,0,0,0,0)t+·0
The Virtual Encoder method and system is preferably designed for assembly processes requiring position errors less than ˜25 mm: e.g. Paint Application, Badging Inspection, Paint Inspection, Gap and Flush Inspection. This translates to the requirement that
|(t)−(t)|max≤25 mm
Communication of the object position as a function of time should occur by mimicking a quadrature position encoder signal to take advantage of the pre-existing, low-latency, encoder inputs which are supported by current-generation robot controllers.
The Virtual Encoder system and method operate in two modes. In the first mode one assumes the Target Object is in motion along an assembly line while the Tool is immobile. In the second mode one assumes the Target Object is in motion and the Tool is tracking.
‘Trigger’ an operation at a desired position, while the Target Object moves along the assembly line. The Tool is immobile. This case can be thought of as taking a ‘snap shot’ of a Target Object when it reaches a predetermined position. The position (pose) at which an assembly process happens is controlled, but not the velocity.
‘Track’ the object in motion so that the Target Object is at a relative standstill relative to the Tool. This case can be thought of as working to fool the Tool into thinking the object is stationary. The pose and the velocity at which an assembly process happens are controlled.
Design Specification: Block Diagram of
Design Specification: Block Diagram of
One or more Volumetric Sensors (i.e. 3D sensors) gather 3D data (Point Clouds) from Target Objects in motion through an assembly station. These Volumetric Sensors are mounted in fixed positions at distances between 1 and 4 meters from the Target Object(s). A given assembly station may be monitored by 1 to 4 Volumetric or 3D Sensors.
Calculation of objects' positions as a function of time uses CICPEA technology (Continuously Iterated Cloud-Based Pose Estimation Algorithms). See ‘CICPEA’ below.
Quadrature signal emulation occurs as described in ‘Quadrature Signal Generation’ below.
Low-latency and low-jitter signaling (to trigger a Tool and/or to trigger a pose measurement) is important. For example, on a hypothetical assembly line moving at 200 mm/sec, a 50 millisecond trigger delay corresponds to a position error of 10 millimeters.
Volumetric Sensors: Several volumetric sensors are known to the art and available commercially which are capable of producing streams of point clouds indicative of the surface shape of objects in motion within their fields of view. For example: Microsoft Kinect, PrimeSense Carmine, Orbbec Astra, Intel RealSense, etc.
The sensors used to create streams of point clouds for the Virtual Encoder method and system are chosen according to the specific requirements of an application, and may or may not be modified or improved versions of commercially available sensors.
CICPEA Pose Estimation
CICPEA=Continuously Iterated Cloud-Based Pose Estimation Algorithms
Under the assumption that a Target Object is moving ‘slowly’ compared to the rate of point cloud sampling it is reasonable to estimate that as time evolves, object poses change slowly. Hence, a good estimate for the pose of an object at tn+1 is the pose of the object at tn:
Such approximations can improve the accuracy and speed of algorithms which operate on Point Clouds of data to produce Pose Estimates. Massively parallel geometric processors (such as NVIDIA computing hardware) enable Continuously Iterated Cloud-Based Pose Estimation Algorithms=CICPEA technology. CICPEA technology is used for pose estimation by the Virtual Encoder method and system.
A variety of CICPEA algorithms are known to the art for continuous pose estimation, a prominent example being the KinFu algorithm.
Several methods are known to the art for producing kinematic state estimates in one dimension from streams of (potentially noisy) pose estimates. In the context of the Virtual Encoder method and system these estimators are termed Kinematics Estimators.
The Kinematics Estimator, , block of the Virtual Encoder method and system, receives a stream of pose estimates from CICPEA algorithms and produces an estimate of the kinematic state of a Target Object:
(0, . . . , n)=(tn)
The kinematics estimator in use for any particular application is configurable by the Virtual Encoder method and system depending on the needs of that installation. Successful Kinematics Estimators used by the Virtual Encoder method and system include: Kalman Filters, a variety of robust estimators for position and velocity, linear least squares fit for position and velocity, and so on. Other appropriate methods will suggest themselves to persons versed in the art.
Kinematics Model: Given a historical state estimate from a Kinematics Estimator, the task of the Kinematics Model is to predict the evolution of the state of the Target Object. Any number of Kinematics Models will suggest themselves to persons versed in the art, but a favored method for the Virtual Encoder is the following:
For each new state estimate in the sequence {0, . . . , n} extract the position and velocity estimates from the final two (most recent) state estimates {{tilde over (x)}n−1, {dot over ({tilde over (x)})}n−1, {tilde over (x)}n, {dot over ({tilde over (x)})}n};
Calculate the sample frequency λ=tn−tn−1;
Set the steady-state velocity to the most recent velocity estimate ν∞={dot over ({tilde over (x)})}n and set a ‘slow velocity’ threshold νslow to 6σν where σν is the conveyor velocity uncertainty;
If v∞≤νslow calculate a 1st-order (position/velocity) ‘transient’ model: MT=M1T(tn+dt): x(dt)=x+{dot over (x)}dt;
else calculate a 3rd-order (position/velocity/acceleration/jerk) ‘transient’ model:
Calculate the evolution of the state of the Target Object anew as follows:
For t<tn+λ use the transient model to calculate MT(t): x(tn+λ−t);
For t≥tn+λ use the ‘steady state’ model:
MS(t): x(t−(tn+λ))={tilde over (x)}n+ν∞dt;
Transient Models:
If v∞≤νslow the 1st-order (position/velocity) transient model is calculated as follows
Else if ν∞>νslow the 3d-order (position/velocity/acceleration/jerk) transient kinematic model is calculated by solving the following equations
Which yields:
Position Trigger Generator: Given a sequence of trigger positions {q0, . . . , qk} at which an assembly operation for a Target Object should be triggered:
where ϵ is the signal latency for the assembly operation signal transmission.
Quadrature Signal Generator: The task of the Quadrature Signal Generator is to create a stream of output quadrature signals to emulate the position evolution of the Target Object. Each time a new state estimate is available the Quadrature Signal Generator must be reinitialized.
The time-resolution of the quadrature signal generating circuitry ϵg and the time-resolution of the quadrature signal receiving circuitry ϵr should be predetermined. The rate of production of output signals is limited by dt=4*max{ϵg, ϵr}. Pseudo-code for a quadrature signal is as follows:
Quadrature Signal Generation: The signal generation of a Mechanical Rotary Encoder is driven by the rotation of a wheel. The Virtual Encoder emulates a quadrature signal in response to the stream of Target Object pose measurements ‘as-if’ a mechanical encoder was measuring the position of the Target Object.
A quadrature encoder signal is a two-channel binary signal which indicates both the direction and rate of change of an object's position. The rate of change is indicated by the rate at which HIGH-LOW transitions occur. The direction is indicated by the relative phase of the A and B channels as illustrated in
The advantages of the above-noted method and system are numerous including but not limited to:
The pose of an object can be estimated using a sensor capable of measuring range (depth) data. Location of the object relative to the sensor can be determined from one or more range measurements. Orientation of the object can be determined if the sensor provides multiple range measurements for points on the object. Preferably a dense cloud of range measurements is provided by the sensor so that orientation of the object can be determined accurately.
Use of at least one embodiment of the present invention improves accuracy and robustness of position encoding along a conveyor system, and, consequently, enables manufacturing techniques which heretofore have not been possible or have not been not economically feasible. Examples of such practices and techniques are:
In one preferred embodiment, the system includes one or more volumetric or 3D sensors configured to observe an object as it traverses an assembly or inspection station. The point cloud data from these sensors is fed to a computer, which implements algorithms for tracking the 1D motion of an object. The position and velocity estimates from these tracking algorithms may be fed through linear or non-linear filtering means such as Kalman filters, model-predictive algorithms, or other filters known to the art for improving position estimations. The result is translated to a time series of quadrature signals by electronic means. The quadrature signal train is fed to a robot or other device configured to use such a signal train for tracking the linear motion of an object.
While exemplary embodiments are described above, it is not intended that these embodiments describe all possible forms of the invention. Rather, the words used in the specification are words of description rather than limitation, and it is understood that various changes may be made without departing from the spirit and scope of the invention. Additionally, the features of various implementing embodiments may be combined to form further embodiments of the invention.