A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction of the patent disclosure, as it appears in the Patent & Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
The invention relates to the control of robotic manipulation, and in particular to motion compensation in robotic manipulation. The invention further relates to the use of stereo images.
Robotic manipulation is known in a range of fields. Typical systems include a robotic manipulator, such as a robotic arm, which is remotely controlled by a user. For example, the robotic arm may be configured to mirror the actions of the human hand. In that case the human controller may have sensors monitoring the actions of the controller's hand. Those sensors provide signals allowing the robotic arm to be controlled in the same manner. Robotic manipulation is useful in a range of applications, for example in confined spaces or in miniaturized/microscopic applications.
One known application of robotic manipulation is in medical procedures such as surgery. In robotic surgery a robotic arm carries a medical instrument. A camera is mounted on or close to the arm, and the arm is controlled remotely by a medical practitioner who can view the operation via the camera. As a result, keyhole surgery and microsurgery can be achieved with great precision. A problem found particularly in medical procedures, but also in other applications, arises when it is required to operate on a moving object or moving surface such as a beating heart. One known solution in medical procedures is to hold the relevant surface stationary. In the case of heart surgery it is known to stop the heart altogether and rely on other life support means while the operation is taking place. Alternatively, the surface can be stabilized by using additional members to hold it stationary. Both techniques are complex and difficult, and they increase the stress on the patient.
One proposed solution is set out in U.S. Pat. No. 5,971,976, in which a position controller is also included. The medical instrument is mounted on a robotic arm and remotely controlled by a surgeon. The surface of the heart to be operated on is mechanically stabilized, and the stabilizer also includes inertial or other position/movement sensors to detect any residual movement of the surface. A motion controller controls the robotic arm or instrument to track the residual movement of the surface such that the distance between them remains constant and the surgeon effectively operates on a stationary surface. A problem with this system is that the arm and instrument are motion-locked to a specific point or zone on the heart defined by the mechanical stabilizer, with no way of locking them to other areas. As a result, if the surgeon needs to operate on another region of the surface, the residual motion will no longer be compensated and can indeed be amplified if the arm is tracking another region of the surface, bearing in mind the complex surface movement of the heart.
The invention is set out in the appended claims. Because the motion sensor can sense motion of a range of points, the controller can determine the part of the object to be tracked. Eye tracking relative to a stereo image allows the depth of a fixation point to be determined.
Embodiments of the invention will now be described, by way of example, with reference to the drawings of which:
Referring to
As discussed above, it is known to add motion compensation to a system such as this whereby motion sensors on the surface send a movement signal which is tracked by the robotic arm such that the surface and arm are stationary relative to one another. In overview the present invention further incorporates an eye tracking capability at the surgical station 40 identifying which part of the surface the surgeon is fixating on and ensuring that the robotic arm tracks that particular point, the motion of which may vary relative to other points because of the complex motion of the heart's surface. As a result the invention achieves dynamic reference frame locking.
Referring to
Referring now to
An object 80 is operated on by a robotic manipulator designated generally 82. The manipulator 82 includes three robotic arms 84, 86, 88, articulated in any appropriate manner and carrying appropriate operating instruments. Arm 84 and arm 86 each support a camera 90a, 90b, the cameras being sufficiently displaced from one another to provide stereo imaging according to known techniques. Since the relative positions of the three arms are known, the positions of the cameras in 3D space are also known.
In use the system allows motion compensation to be directed to the point on which the surgeon is fixating (i.e. the point the surgeon is looking at at a given moment). Identifying the fixation point can be achieved using known techniques, which will generally be built into an appropriate eye tracking device such as, for example, the product discussed above. In the preferred embodiment the cameras are used to detect the motion of the fixation point and send the information back to the processor for control of the motion of the robotic arm.
In particular, once the fixation point position is identified at a given moment on the image viewed by the human operator, and given that the positions of the stereo cameras 90a and 90b are known, the position of the point on the object 80 can be identified. Alternatively, by determining the respective direction of gaze of each eye, this can be replicated at the stereo camera to focus on the relevant point. The motion of that point is then determined by stereo vision. In particular, referring to
In particular, the computer 50 calculates the position in the image plane of coordinates in the real world (so-called "world coordinates"). This may be done as follows:
A 3D point $M = [x, y, z]^T$ is projected to a 2D image point $m = [x, y]^T$ through a $3 \times 4$ projection matrix $P$, such that $S m = P M$, where $S$ is a non-zero scale factor and, in homogeneous coordinates, $m = [x, y, 1]^T$ and $M = [x, y, z, 1]^T$. In binocular stereo systems, each physical point $M$ in 3D space is projected to $m_1$ and $m_2$ in the two image planes, i.e.:
$$S_1 m_1 = P_1 M$$
$$S_2 m_2 = P_2 M \qquad (1)$$
If we assume that the world coordinate system is associated with the first camera, we have
$$P_1 = [A \mid \mathbf{0}]$$
$$P_2 = [A' R \mid A' t] \qquad (2)$$
where $R$ and $t$ represent the $3 \times 3$ rotation matrix and the $3 \times 1$ translation vector defining the rigid displacement between the two cameras.
The matrices $A$ and $A'$ are the $3 \times 3$ intrinsic parameter matrices of the two cameras. In general, when the two cameras have the same parameter settings, with square pixels (aspect ratio = 1) and the angle $\theta$ between the two image coordinate axes being $\pi/2$, we have:

$$A = A' = \begin{bmatrix} f & 0 & u_0 \\ 0 & f & v_0 \\ 0 & 0 & 1 \end{bmatrix} \qquad (3)$$

where $(u_0, v_0)$ are the coordinates of the image principal point, i.e. the point to which points located at infinity in world coordinates are projected.
Generally, matrix $A$ can have the form

$$A = \begin{bmatrix} f_u & 0 & u_0 \\ 0 & f_v & v_0 \\ 0 & 0 & 1 \end{bmatrix} \qquad (4)$$

where $f_u$ and $f_v$ correspond to the focal distance in pixels along the axes of the image. All parameters of $A$ can be computed through classical calibration methods (e.g. as described in the book by O. Faugeras, "Three-Dimensional Computer Vision: a Geometric Viewpoint", MIT Press, Cambridge, Mass., 1993).
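By way of a non-limiting illustration, the following Python sketch (using the NumPy library) assembles the matrices of Equations (1)-(4) and projects a world point into both image planes. All numeric values are purely illustrative stand-ins; in practice they would come from calibration.

```python
import numpy as np

# Illustrative intrinsic parameters: focal lengths f_u, f_v in pixels
# and principal point (u0, v0), as in Equations (3)-(4).
f_u, f_v, u0, v0 = 800.0, 800.0, 320.0, 240.0
A = np.array([[f_u, 0.0, u0],
              [0.0, f_v, v0],
              [0.0, 0.0, 1.0]])

# World frame attached to the first camera: P1 = [A | 0] (Equation (2)).
P1 = A @ np.hstack([np.eye(3), np.zeros((3, 1))])

# Rigid displacement of the second camera: P2 = [A'R | A't], here with A' = A.
R = np.eye(3)                        # illustrative: no rotation
t = np.array([[0.1], [0.0], [0.0]])  # illustrative: 10 cm baseline
P2 = A @ np.hstack([R, t])

# Project a homogeneous world point M into both images (Equation (1)).
M = np.array([0.0, 0.0, 1.0, 1.0])   # a point 1 m in front of camera 1
m1, m2 = P1 @ M, P2 @ M
m1, m2 = m1 / m1[2], m2 / m2[2]      # divide out the scale factors S1, S2
print(m1[:2], m2[:2])                # pixel coordinates in each image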
Known techniques for determining the depth are for example as follows. Firstly, the apparatus is calibrated for a given user. The user looks at predetermined points on a displayed image and the eye tracking device tracks the eye(s) of the user as they look at each predetermined point. This sets the user's gaze within a reference frame (generally two-dimensional if one image is displayed and three-dimensional if stereo images are displayed). In use, the user's gaze on the image(s) is tracked and thus the gaze of the user within this reference frame is determined. The robotic arms 84, 86 then move the cameras 90a, 90b to focus on the determined fixation point.
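A minimal sketch of such a per-user calibration step is given below, assuming a simple affine mapping from raw eye tracker coordinates to image coordinates fitted by least squares; the function names and the nine-point target pattern are illustrative only, and any appropriate fitting model could be substituted.

```python
import numpy as np

def fit_gaze_calibration(raw_pts, target_pts):
    """Fit an affine map from raw eye tracker readings to image
    coordinates by least squares, using fixations on known targets.
    raw_pts, target_pts: (N, 2) arrays of corresponding points."""
    ones = np.ones((raw_pts.shape[0], 1))
    X = np.hstack([raw_pts, ones])                 # rows [x, y, 1]
    coeffs, *_ = np.linalg.lstsq(X, target_pts, rcond=None)
    return coeffs                                  # (3, 2) affine parameters

def apply_gaze_calibration(coeffs, raw_pt):
    """Map one raw tracker reading into image coordinates."""
    return np.hstack([raw_pt, 1.0]) @ coeffs

# Illustrative use: the user fixates nine known points on the display.
raw = np.random.rand(9, 2)        # stand-in for eye tracker output
targets = raw * 2.0 + 5.0         # stand-in for the true target points
C = fit_gaze_calibration(raw, targets)
print(apply_gaze_calibration(C, raw[0]))   # approximately targets[0]
```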
For instance, consider
Subsequently a user's gaze can be correlated to the part of the image being looked at by the user. For each eye, the coordinates $[x_l, y_l]$ and $[x_r, y_r]$ are known from the respective eye tracker, from which $[x, y, z]^T$ can be calculated from Equations (1)-(4).
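One well-known way of carrying out this calculation is linear (DLT) triangulation, sketched below from the projection matrices of Equations (1)-(2); it is an illustrative example rather than the only possibility.

```python
import numpy as np

def triangulate(m1, m2, P1, P2):
    """Linear (DLT) triangulation: recover [x, y, z] from the left and
    right image points m1 = [xl, yl], m2 = [xr, yr] and the 3x4
    projection matrices P1, P2 of Equations (1)-(2)."""
    D = np.vstack([m1[0] * P1[2] - P1[0],
                   m1[1] * P1[2] - P1[1],
                   m2[0] * P2[2] - P2[0],
                   m2[1] * P2[2] - P2[1]])
    _, _, Vt = np.linalg.svd(D)      # least-squares null vector of D
    M = Vt[-1]
    return M[:3] / M[3]              # dehomogenise to [x, y, z]

# e.g., with the illustrative P1, P2 of the earlier sketch:
# triangulate(np.array([320., 240.]), np.array([400., 240.]), P1, P2)
# recovers approximately [0, 0, 1].
```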
By carrying out this step across time, the motion of the point fixated on by the human operator can be tracked, and the camera and arm can be moved by any appropriate means to maintain a constant distance from the fixation point. This can be done either by monitoring the absolute positions of the two points and keeping their separation constant, or by some form of feedback control such as PID control. Once again the relevant techniques will be well known to the skilled person.
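A minimal sketch of the feedback alternative follows, assuming a textbook PID loop; the gains and set-point used here are illustrative and would in practice be tuned for the particular manipulator.

```python
class PID:
    """Textbook PID loop holding the arm at a constant distance from
    the tracked fixation point."""
    def __init__(self, kp, ki, kd, setpoint):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.setpoint = setpoint       # desired tool-to-surface distance
        self.integral = 0.0
        self.prev_error = None

    def update(self, measured, dt):
        error = self.setpoint - measured
        self.integral += error * dt
        derivative = 0.0 if self.prev_error is None \
            else (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

# Each control cycle: command = pid.update(current_distance, dt), the
# command driving the arm along the camera's optical axis.
pid = PID(kp=2.0, ki=0.1, kd=0.05, setpoint=0.05)   # 5 cm stand-off
print(pid.update(0.06, dt=0.01))
```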
It will be further recognized that the cameras can be focused or directed towards the fixation point determined by eye tracking, simply by providing appropriate direction means on or in relation to the robotic arm. As a result the tracked point can be moved to centre screen if desired.
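For instance, under a pinhole model with the intrinsic parameters of matrix $A$, the pan and tilt angles required to bring the tracked image point onto the principal point (i.e. to centre screen) may be approximated as sketched below; this is an illustrative sketch only, not the only possible direction means.

```python
import math

def pan_tilt_to_centre(u, v, f_u, f_v, u0, v0):
    """Pinhole approximation of the pan and tilt angles (radians) that
    bring image point (u, v) onto the principal point (u0, v0), i.e.
    to centre screen; parameters as in matrix A."""
    pan = math.atan((u - u0) / f_u)
    tilt = math.atan((v - v0) / f_v)
    return pan, tilt

print(pan_tilt_to_centre(400.0, 240.0, 800.0, 800.0, 320.0, 240.0))
```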
In the preferred embodiment the surgical station provides a stereo image via binocular eyepiece 42 to the surgeon, where the required offset left and right images are provided by the respective cameras mounted on the robotic arm.
According to a further aspect of the invention enhanced eye tracking in relation to stereo images is provided. Referring to
An eye tracking device is provided for each individual eyepiece. Each eye tracking device includes light projectors 206 and light detectors 208. In a preferred implementation, the light projectors are IR LEDs and the light detector comprises an IR camera for each eye. An IR filter may be provided in front of the IR camera. The images (indicated in
The angle of gaze of the eye can be derived using known techniques by detecting the reflected light.
In a preferred, known implementation, Purkinje images are formed by light reflected from surfaces in the eye. The first reflection takes place at the anterior surface of the cornea, while the fourth occurs at the posterior surface of the lens of the eye. Both the first and fourth Purkinje images lie in approximately the same plane in the pupil of the eye. Since eye rotation alters the angle of the IR beam from the IR projectors 206 with respect to the optical axis of the eye, while eye translations move both images by the same amount, eye movement can be obtained from the spatial position of, and the distance between, the two Purkinje reflections. This technique is commonly known as the Dual-Purkinje Image (DPI) technique. DPI also allows for the calculation of a user's accommodation of focus, i.e. how far away the user is looking. Another eye tracking technique subtracts the Purkinje reflections from the nasal side of the pupil and the temporal side of the pupil and uses the difference to determine the eye position signal. Any appropriate eye tracking system may in practice be used, for example an ASL model 504 remote eye-tracking system (Applied Science Laboratories, MA, USA).
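A highly simplified sketch of the DPI principle is given below. The gain term stands in for the per-user calibration discussed earlier; the function and its values are illustrative rather than a complete implementation.

```python
import numpy as np

def dpi_gaze_offset(p1_img, p4_img, gain):
    """Dual-Purkinje sketch: eye translation moves the first and fourth
    Purkinje reflections by the same amount, so their difference vector
    responds only to eye rotation. The gain mapping this separation to
    gaze angle comes from per-user calibration (illustrative here)."""
    separation = np.asarray(p4_img, float) - np.asarray(p1_img, float)
    return gain * separation          # approximate gaze offset angles

# Illustrative: reflections 2 px apart horizontally, gain 0.01 rad/px.
print(dpi_gaze_offset([100.0, 80.0], [102.0, 80.0], 0.01))
```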
By tracking the individual motion of each eye and identifying the fixation point F on the left and right images 200a, 200b, not only can the position of the fixation point in the X-Y plane (the plane of the images) be identified, but also its depth into the image, in the Z direction.
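For rectified stereo views, this depth recovery can be illustrated by the standard disparity relation z = f·b/d, sketched below; the parameter values are illustrative stand-ins.

```python
def depth_from_disparity(x_left, x_right, focal_px, baseline_m):
    """Standard rectified-stereo relation z = f * b / d: the horizontal
    disparity d of the fixation point F between the left and right
    images 200a, 200b gives its depth in the Z direction."""
    disparity = x_left - x_right
    return focal_px * baseline_m / disparity

# Illustrative: 80 px disparity, 800 px focal length, 10 cm baseline.
print(depth_from_disparity(400.0, 320.0, 800.0, 0.10))   # -> 1.0 m
```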
Once the eye position signal is determined, the computer 50 uses this signal to determine where, in the reference field, the user is looking and calculates the corresponding position on the subject being viewed. Once this position is determined, the computer signals the robotic manipulator 82 to move the arms 84 and/or 86 which support the cameras 90a and 90b to focus on the part of the subject determined from the eye-tracking device, allowing the motion sensor to track movement of that part and hence lock the frame of reference to it.
Although the invention has been described with reference to eye tracking devices that use reflected light, other forms of eye tracking may be used, e.g. measuring the electric potential of the skin around the eye(s) or applying a special contact lens and tracking its position.
It will be appreciated that the embodiments above and elements thereof can be combined or interchanged as appropriate. Although specific discussion is made of the application of the invention to surgery, it will be recognized that the invention can be equally applied in many other areas where robotic manipulation or stereo imaging is required. Although stereo vision is described, monocular vision can also be applied. Other appropriate means of motion sensing can also be adopted, for instance by casting structured light onto the object and observing changes as the object moves, or by using laser range finding. These examples are not intended to be limiting.
Number | Date | Country | Kind
---|---|---|---
0222265.1 | Sep 2002 | GB | national
This application is a submission under 35 U.S.C. §371 based on prior international application PCT/GB2003/004077, filed 25 Sep. 2003, which claims priority from United Kingdom application 0222265.1, filed 25 Sep. 2002, entitled “Control of Robotic Motion,” the entire contents of which are hereby incorporated by reference as if fully set forth herein.
Filing Document | Filing Date | Country | Kind | 371(c) Date
---|---|---|---|---
PCT/GB03/04077 | 9/25/2003 | WO | | 12/22/2005