This application is the national stage filing of corresponding international application number PCT/SG01/00171, filed Aug. 28, 2001.
The present invention relates to methods and systems for interacting with three-dimensional computer models.
Two main types of three-dimensional computer data models of the real world are in use today. One deals with the real world by sampling it, while the other approximates the real world by approximating it using mathematical functions. The sampling method leads to volumetric models, and uses voxels as the main representation unit. The mathematical method, an example of which is called CAD (Computer Aided Design), creates models which use polygons as primitives. Polygons are good for rendering the external surfaces of a three-dimensional object, but accurately rendering the internal features of the human body, for example, requires so-called “volume rendering” using “voxels”, which are three-dimensional image components.
Specifically, volume rendering is a technique for visualizing three-dimensional arrays of sampled data. Examples of sampled three-dimensional data are medical data from CAT or MRI scanners, seismic data, or any other volumetric information for which geometric surfaces are difficult to generate or unavailable. Volume rendering takes the original sampled data, interpolates between the available data to fill in the gaps, and displays the resulting three-dimensional image to the user on a screen (using perspective or orthogonal projection).
A major issue when interacting with typical images produced by volume rendering is the sheer size of the data sets. More often than not, the data sets sources like CT and MR) need to be processed at the same time. This factor, together with other factors such as the amount of interpolation which needs to be done and the footprint of the rendered volume, can adversely affect the rendering speed. This problem is compounded when the user tries to view the volume at a high magnification, during which the system can slow to a crawl.
By contrast, mathematical methods such as CAD represents three-dimensional objects as mathematical functions, usually polygons and polylines. However, as for volume rendering, the sheer size of the data makes real-time interaction a problem because of the rendering speed required to produce new images in less than 100 milliseconds (10 per second).
Both volume rendering and CAD result in a rendering in a “frame buffer”
With the increasing power of computers, volume rendering and CAD are capable of generating increasingly complex data models. However, the computers' ability to “render” the models is limited by several factors:
One existing technology for displaying three dimensional models (whether created by volume rendering or CAD) is called the Dextroscope, which is used for visualisation by a single individual. A variation of the Dextroscope, for use in presentations to an audience, and even a large audience, is called the DextroBeam. This Dextroscope technology displays a high-resolution stereoscopic virtual image in front of the user.
The software of the Dextroscope uses an algorithm having a main loop in which inputs are read from the user's devices and actions are taken in response. The software creates a “virtual world” which is populated by virtual “objects”. The user controls a set of input devices with his hands, and the Dextroscope operates such that these input devices correspond to virtual “tools”, which can interact with the objects. For example, in the case that one such object is virtual tissue, the tool may correspond to a virtual scalpel which can cut the tissue.
Within the Update stage, the main tasks are:
The tool controlled by the user has four states: “Check”, “StartAction”, “DoAction” and “EndAction”. Callback functions corresponding to the four states are provided for programming the behaviour of the tool.
“Check” is a state in which the tool is passive, and does not act on any object. For a stylus (a three-dimensional-input device with a switch), this corresponds to the “button-not-pressed” state. The tool uses this time to check the position with respect to the objects, for example if is touching an object.
“StartAction” is the transition of the tool from being passive to active, such that it can act on any object. For a stylus, this corresponds to a “button-just-pressed” state. It marks the start of the tool's action, for instance “start drawing”. DoAction is a state in which the tool is kept active. For a stylus, this corresponds to “button-still-pressed” state. It indicates that the tool is still carrying out its action, for instance, “drawing”. EndAction is the transition of the tool from being active to being passive. For a stylus, this corresponds to “button-just-released” state. It marks the end of the tool's action, for instance, “stop drawing”.
A tool is typically modelled such that its tip is located at object co-ordinates (0,0,0), and it is pointing towards the positive z-axis. The size of a tool should be around 10 cm. A tool has a passive shape and an active shape, to provide visual cues as to which states it is in. The passive shape is the shape of the tool when it is passive, and active shape is the shape of the tool when it is active. A tool has default passive and active shape.
A tool acts on objects when it is in their proximity. A tool is said to have picked the objects.
Generally, a tool is said to be “in” an object if its tip is inside a bounding box of the object. Alternatively, the programmers may define an enlarged bounding box which surrounds the object with a selected margin (“allowance”) in each direction, and arrange that the software recognises that a tool is “in” an object if its tip enters the enlarged bounding box. The enlarged bounding box enables easier picking. For example, one can set the allowance to 2 mm (in the world's coordinate system, as opposed to the virtual world), so that the tool will pick an object if it is within 2 mm of the object's proximity. The default allowance is 0.
The present invention seeks to provide a new and useful ways to interact with three dimensional computer generated models in an efficient way.
In general terms, the present invention proposes that a computer system defines an initial correspondence between a three-dimensional computer model and a real world workspace. An editing volume of the workspace is also defined, and a stereoscopic image of the section of the computer model within the editing volume is displayed. Using a first input device, the model can be virtually translated and/or rotated, and the editing volume can be rotated, so as to bring different sections of the model into the editing volume, and thus into the user's view. The user operates a second input device to indicate changes to be made to the model. The first and second input devices can be operated with the user's respective hands.
Thus, the present invention permits a visualisation of, and modification of, the three-dimensional model, in an efficient and natural manner.
Furthermore, since only the portion of the model within the editing volume need be displayed, the processing and display requirements are reduced, in comparison to displaying the entire model. This in turn means that the display can be updated quickly without the computer system requiring excessive computing power.
Specifically, in a first aspect, the present invention proposes a computer system for permitting interaction between a user of the computer system and a three-dimensional computer model, the computer system including:
In a second aspect the invention provides a computer-implemented method for permitting a user to interact with a three-dimensional computer model, the method including:
The second input devices may, for example, be of the form of the known input devices described above, corresponding to virtual “tools”.
The control device may be operated in a selected one of a plurality of modes, and in different modes different motions of the control device correspond to different motions of the model and/or the editing volume. Preferably, in at least one of these modes both the position and the orientation of the model in relation to the workspace are changed based on motions of the control device. As described below, such re-selections of the positioning of the editing volume and the model can be very helpfully used to visualise the model, without unnecessarily consuming computing power.
Preferably, the user is further empowered (e.g. using the first input device again, or by speaking into a microphone to generate commands which are transmitted to the processor) to change the scale of the correspondence between the model and the workspace. One such change in the correspondence is to magnify the model, to display within the editing volume (which remains constant in size) an expanded image of a smaller portion of the model.
Preferably, the modifications to the model based on the second signals are only to the section of the model corresponding to the editing volume. In other words, only the portion of the model which is displayed is affected by the virtual tools.
The three-dimensional computer-generated models may be data representations of any real world three-dimensional objects. For example, they may be representations of objects such as houses or biological cells. Alternatively, the model may be a model of at least part of the subject of a surgical operation, and the step of modifying the model based on the second signals may be performed to simulate the actions on the subject of a surgical device corresponding to the second input device (tool). For example, the virtual tool may be a scalpel cutting virtual tissue of the object.
This document uses the term “position” of an entity (e.g. an object, a tool, a device, the editing volume, etc) to mean its three-dimensional location (e.g. the location of a central point of the entity), and does not include the orientation of the entity. Thus, the “position” of an entity is written as three numbers, (X, Y, Z), with respect to a predefined system of axes X, Y and Z. The “orientation” of an entity is also written using three numbers, (α, β, γ), representing the orientation of the entity with respect to the same axis system. The term “placement” means the combination of the position and the orientation of the object (i.e. it is written as six numbers).
Preferably, the image is “stereoscopic”, a term which is used here to include any display technique which generates two different images, one for each eye, so that the apparent image fused in the brain can be perceived to occupy space. For example, the display techniques in use in the Dextroscope and the Dextrobeam are suitable for use in the present invention.
An embodiment of the invention will now be described, for the sake of example only, with reference to the following figures in which:
a) shows the three-dimensional workspace of a user of a system according to the present invention. This workspace may be a fixed three-dimensional region of real space. It may correspond to (at least part of) the visual field of the user. A computer system is provided to generate a stereoscopic display within this workspace. The computer stores a three-dimensional model (shown as 12 in
As shown in
The editing box 10 is itself displayed on the computer screen as a wire-frame showing the edges of the box. The editing box 10 cuts through the three-dimensional model 12. The editing box 10 preferably has a linear length and height of approximately 15 to 20 cm with an adjustable thickness (which is adjusted to suit the user's needs). The editing box 10 is centered in the three-dimensional workspace, providing a comfortable working area.
The position of the editing box 10 within the workspace is user-defined and can be adjusted with additional controls. However, during normal operation of the invention (as discussed below in relation to
The section of the computer model 12 which is within the editing box 10 is displayed stereoscopically (though this is not shown on
The user's non-dominant hand (e.g. for right-handed individuals, the left hand) holds a control device 14 like a joystick, which is used to control the view of the model given by the editing box 10. It allows six degrees of freedom control: three for position (X, Y, Z) and three for orientation (α, β, γ). Control devices 14 which are suitable for use in this context are widely publicly known.
Depending on the application, the model 12 can be moved together with the editing box 10, or the movement of one can be independent of the other. Table 1 shows three of the possible modes of operation of the control device 14.
As described earlier, the editing box 10 should remain centered in the workspace (i.e. an arbitrary central point of the editing box does not change its position). Therefore, in all cases when the user moves the tool along the X, Y and Z axes, the action pans the model 12 in the corresponding direction to bring different portions of the model 12 into the editing box 10. The editing box 10 itself does not move. When the user rotates the control device 14 (the angles of motion being measured by α, β, and γ), the action can either be translated into rotation of the editing box 10, the model 12, or both together:
We can also envisage a fourth case (case 4) which is shown in
Note that if it is desired to move the editing box perpendicular to face 18 this may be performed by a separate operation, e.g. performed by the user's dominant hand. For example, we may define a border of the editing box and if the user's dominant hand intersects with this border in case 4 the box is moved perpendicular to the face 18. Thus, the border of the editing box works like a holder which enables the editing box to be moved up and down by the user's dominant hand perpendicular to the face 18.
While the user's non-dominant hand manipulates the control device 14, the user's dominant hand holds a stylus (or other input device) which is used to operate on the portion of the three-dimensional computer model within the editing box 10. This second input device may for example be the known tool discussed above, which corresponds to a tool in the virtual word. As in the known system, for example, the tool may interact with any virtual object comprised in the model 12 if the virtual position of the tool coincides with that object (e.g. to within a tolerance defined by a bounding box, as discussed above). Furthermore, the object must generally be within the editing box 10 at this time. In this way, for example, the user is able to simulate operations (e.g. surgical operations) on the three-dimensional model. The type of operation depends on the application in which this embodiment is used. For example, if the embodiment is used in a three-dimensional volumetric contour editor, the stylus or other input device may be used to add the nodes that define the contours.
Note that the control device 14 does not have to be located within or even near the editing box 10. However, preferably a co-ordinate center of the control device 14 defines the centre of rotation of the model or editing box, whichever is presently selected. By contrast, the stylus (or other input devices) which correspond to virtual tools may optionally be selected only to interact with objects within the model 12 if they are positioned to virtually contact those objects (e.g. to within a certain tolerance); that is, these input devices are generally located within or very near to the editing box 10.
As mentioned above, the parts of the model 12 outside the editing volume 10 are normally not displayed. We say that the model 12 is “clipped”. Specifically, each face of the editing box 10 is defined by a plane equation of form: (ax+by+cz=d), where x, y, and z represent the three Cartesian axes, and a, b, c, and d are constants that characterize the plane. The model 12 is tested in turn against each of the planes. Those parts of the model 12 which are found to be outside the editing box 10 are not displayed, which can result in the non-display of either entire objects or parts of them. Clipping can be performed in the computer system of the invention either in hardware or in software with the former being faster.
The computer process runs in the computer system of the present embodiment on the same principles described above in relation to
Specifically, the initialisation of
The “update” step is modified to include an updating of the placement of the editing box 10 and the model 12. The “display” step is modified to include displaying only the portion of the model 12 within the editing box 10. This includes calculation of the clipping plane, and activating the clipping.
The objects are displayed within the editing box 10 by making use of the newly obtained clipping planes. The display takes place up to the boundaries of the editing box 10, a standard technique in computer graphics. This produces cut away views of volume rendered images or polygonal objects.
Changing the placement of the editing box 10 affects the final plane equations which determine which parts of the model 12 are clipped.
The control device 14 may be a joystick (or stylus) having a button. It has 4 states, analogous to the 4 states of the stylus (or input device) described above. The 4 states are as follows:
In each of these states the steps carried out by the process are as follows:
StartAction
DoAction
Specifically, this final step consists of:
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/SG01/00171 | 8/28/2001 | WO | 00 | 2/27/2004 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO03/019423 | 3/6/2003 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5237647 | Roberts et al. | Aug 1993 | A |
5371778 | Yanof et al. | Dec 1994 | A |
5561445 | Miwa et al. | Oct 1996 | A |
5581670 | Bier et al. | Dec 1996 | A |
5583977 | Seidl | Dec 1996 | A |
5798752 | Buxton et al. | Aug 1998 | A |
5923318 | Zhai et al. | Jul 1999 | A |
5929844 | Barnes | Jul 1999 | A |
5949401 | Kazarian | Sep 1999 | A |
6184867 | Kandogan et al. | Feb 2001 | B1 |
6184869 | Harding et al. | Feb 2001 | B1 |
6198471 | Cook | Mar 2001 | B1 |
6333753 | Hinckley | Dec 2001 | B1 |
6380958 | Guendel et al. | Apr 2002 | B1 |
6545663 | Arbter et al. | Apr 2003 | B1 |
6674440 | Kirkland et al. | Jan 2004 | B1 |
20020012013 | Abe et al. | Jan 2002 | A1 |
Number | Date | Country | |
---|---|---|---|
20040263509 A1 | Dec 2004 | US |