This invention relates to the field of computer graphics processing, and more specifically, to a method and system for generating and adjusting detail-in-context presentations in client/server systems.
Modern computer graphics systems, including virtual environment systems, are used for numerous applications such as mapping, navigation, flight training, surveillance, and even playing computer games. In general, these applications are launched by the computer graphics system's operating system upon selection by a user from a menu or other graphical user interface (“GUI”). A GUI is used to convey information to and receive commands from users and generally includes a variety of GUI objects or controls, including icons, toolbars, drop-down menus, text, dialog boxes, buttons, and the like. A user typically interacts with a GUI by using a pointing device (e.g., a mouse) to position a pointer or cursor over an object and “clicking” on the object.
One problem with these computer graphics systems is their inability to effectively display detailed information for selected graphic objects when those objects are in the context of a larger image. A user may require access to detailed information with respect to an object in order to closely examine the object, to interact with the object, or to interface with an external application or network through the object. For example, the detailed information may be a close-up view of the object or a region of a digital map image.
While an application may provide a GUI for a user to access and view detailed information for a selected object in a larger image, in doing so, the relative location of the object in the larger image may be lost to the user. Thus, while the user may have gained access to the detailed information required to interact with the object, the user may lose sight of the context within which that object is positioned in the larger image. This is especially so when the user must interact with the GUI using a computer mouse or keyboard. The interaction may further distract the user from the context in which the detailed information is to be understood. This problem is an example of what is often referred to as the “screen real estate problem”.
A need therefore exists for an improved method and system for generating and adjusting detailed views of selected information within the context of surrounding information presented on the display of a computer graphics system. Accordingly, a solution that addresses, at least in part, the above and other shortcomings is desired.
According to one aspect of the invention, there is provided a method for generating a presentation of a region-of-interest in an original image for display on a display screen of a client coupled over a network to a server, comprising: establishing a lens having a focal region for the region-of-interest at least partially surrounded by a shoulder region; if the lens is in transit between first and second locations for the region-of-interest in the original image, applying the lens to the original image by a first method to generate the presentation at the client; and, if the lens is stationary in the original image, receiving the presentation from the server, the server applying the lens to the original image by a second method to generate the presentation.
In the above method, the first method may require less resources than the second method. The lens may have a shape and the second method may more accurately reflect the shape of the lens in the presentation than the first method. The shoulder region may have a shape and the second method may more accurately reflect the shape of the shoulder region in the presentation than the first method. The second method may include displacing the original image onto the lens to produce a displaced image and projecting the displaced image onto a plane in a direction aligned with a viewpoint for the region-of-interest. The first method may include: creating a focal region image for the focal region by scaling the original image within the focal region by a focal region magnification; creating a shoulder region image for the shoulder region by scaling the original image within the shoulder region by a shoulder region magnification, the shoulder region magnification being less than the focal region magnification; and, overlaying the focal region image and the shoulder region image on the original image. The method may further include receiving a signal indicating the transit between the first and second locations from a graphical user interface (“GUI”) displayed over the lens on the display screen. The method may further include, if the lens is stationary in the original image, sending a signal from the client to the server requesting the presentation. The method may further include, if the lens is stationary in the original image and if the server is unavailable, applying the lens to the original image by the first method to generate the presentation at the client. And, the method may further include displaying the presentation on the display screen.
In accordance with further aspects of the present invention there is provided an apparatus such as a data processing system, a method for adapting this system, as well as articles of manufacture such as a computer readable medium having program instructions recorded thereon for practising the method of the invention.
Further features and advantages of the embodiments of the present invention will become apparent from the following detailed description, taken in combination with the appended drawings, in which:
It will be noted that throughout the appended drawings, like features are identified by like reference numerals.
In the following description, details are set forth to provide an understanding of the invention. In some instances, certain software, circuits, structures and methods have not been described or shown in detail in order not to obscure the invention. The term “data processing system” is used herein to refer to any machine for processing data, including the computer systems and network arrangements described herein. The present invention may be implemented in any computer programming language provided that the operating system of the data processing system provides the facilities that may support the requirements of the present invention. Any limitations presented would be a result of a particular type of operating system or computer programming language and would not be a limitation of the present invention. The present invention may also be implemented in hardware.
The “screen real estate problem” generally arises whenever large amounts of information are to be displayed on a display screen of limited size. Known tools to address this problem include panning and zooming. While these tools are suitable for a large number of visual display applications, they become less effective where sections of the visual information are spatially related, such as in layered maps and three-dimensional representations, for example. In this type of information display, panning and zooming are not as effective as much of the context of the panned or zoomed display may be hidden.
A recent solution to this problem is the application of “detail-in-context” presentation techniques. Detail-in-context is the magnification of a particular region-of-interest (the “focal region” or “detail”) in a data presentation while preserving visibility of the surrounding information (the “context”). This technique has applicability to the display of large surface area media (e.g. digital maps) on computer screens of variable size including graphics workstations, laptop computers, personal digital assistants (“PDAs”), and cell phones.
In the detail-in-context discourse, differentiation is often made between the terms “representation” and “presentation”. A representation is a formal system, or mapping, for specifying raw information or data that is stored in a computer or data processing system. For example, a digital map of a city is a representation of raw data including street names and the relative geographic location of streets and utilities. Such a representation may be displayed visually on a computer screen or printed on paper. On the other hand, a presentation is a spatial organization of a given representation that is appropriate for the task at hand. Thus, a presentation of a representation organizes such things as the point of view and the relative emphasis of different parts or regions of the representation. For example, a digital map of a city may be presented with a region magnified to reveal street names.
In general, a detail-in-context presentation may be considered as a distorted view (or distortion) of a portion of the original representation or image where the distortion is the result of the application of a “lens” like distortion function to the original representation. A detailed review of various detail-in-context presentation techniques such as “Elastic Presentation Space” (“EPS”) (or “Pliable Display Technology” (“PDT”)) may be found in a publication by Marianne S. T. Carpendale, entitled “A Framework for Elastic Presentation Space” (Carpendale, Marianne S. T., A Framework for Elastic Presentation Space (Burnaby, British Columbia: Simon Fraser University, 1999)), and incorporated herein by reference.
In general, detail-in-context data presentations are characterized by magnification of areas of an image where detail is desired, in combination with compression of a restricted range of areas of the remaining information (i.e. the context), the result typically giving the appearance of a lens having been applied to the display surface. Using the techniques described by Carpendale, points in a representation are displaced in three dimensions and a perspective projection is used to display the points on a two-dimensional presentation display. Thus, when a lens is applied to a two-dimensional continuous surface representation, for example, the resulting presentation appears to be three-dimensional. In other words, the lens transformation appears to have stretched the continuous surface in a third dimension. In EPS graphics technology, a two-dimensional visual representation is placed onto a surface; this surface is placed in three-dimensional space; the surface, containing the representation, is viewed through perspective projection; and the surface is manipulated to effect the reorganization of image details. The presentation transformation is separated into two steps: surface manipulation or distortion and perspective projection.
EPS is applicable to multidimensional data and is well suited to implementation on a computer for dynamic detail-in-context display on an electronic display surface such as a monitor. In the case of two dimensional data, EPS is typically characterized by magnification of areas of an image where detail is desired 233, in combination with compression of a restricted range of areas of the remaining information (i.e., the context) 234, the end result typically giving the appearance of a lens 230 having been applied to the display surface. The areas of the lens 230 where compression occurs may be referred to as the “shoulder” 234 of the lens 230. The area of the representation transformed by the lens may be referred to as the “lensed area”. The lensed area thus includes the focal region 233 and the shoulder region 234. Typically, the distorted surface, distortion function, or lens 230 provides a continuous or smooth transition from the base plane 210 through the shoulder region 234 to the focal region 233 as shown in
For example, and referring to
Thus, the data processing system 300 includes computer executable programmed instructions for directing the system 300 to implement the embodiments of the present invention. The programmed instructions may be embodied in one or more hardware or software modules 331 resident in the memory 330 of the data processing system 300. Alternatively, the programmed instructions may be embodied on a computer readable medium (such as a CD disk or floppy disk) which may be used for transporting the programmed instructions to the memory 330 of the data processing system 300. Alternatively, the programmed instructions may be embedded in a computer-readable signal or signal-bearing medium that is uploaded to a network by a vendor or supplier of the programmed instructions, and this signal-bearing medium may be downloaded through an interface (e.g., 350) to the data processing system 300 from the network by end users or potential buyers.
As mentioned, detail-in-context presentations of data using techniques such as pliable surfaces, as described by Carpendale, are useful in presenting large amounts of information on limited-size display surfaces. Detail-in-context views allow magnification of a particular region-of-interest (e.g., the focal region) 233 in a data presentation while preserving visibility of the surrounding information 210. In the following, a GUI 400 is described having lens control elements that can be implemented in software (and/or hardware) and applied to the control of detail-in-context data presentations. The software (and/or hardware) can be loaded into and run by the data processing system 300 of
In general, the GUI 400 has lens control elements that, in combination, provide for the interactive control of the lens 410. The effective control of the characteristics of the lens 410 by a user (i.e., dynamic interaction with a detail-in-context lens) is advantageous. At any given time, one or more of these lens control elements may be made visible to the user on the display surface 340 by appearing as overlay icons on the lens 410. Interaction with each element is performed via the motion of an input or pointing device 310 (e.g., a mouse) with the motion resulting in an appropriate change in the corresponding lens characteristic. As will be described, selection of which lens control element is actively controlled by the motion of the pointing device 310 at any given time is determined by the proximity of the icon representing the pointing device 310 (e.g., cursor) on the display surface 340 to the appropriate component of the lens 410. For example, “dragging” of the pointing device at the periphery of the bounding rectangle of the lens base 412 causes a corresponding change in the size of the lens 410 (i.e., “resizing”). Thus, the GUI 400 provides the user with a visual representation of which lens control element is being adjusted through the display of one or more corresponding icons.
For ease of understanding, the following discussion will be in the context of using a two-dimensional pointing device 310 that is a mouse, but it will be understood that the invention may be practiced with other 2D or 3D (or even greater numbers of dimensions) input devices including a trackball, a keyboard, a position tracking device, an eye tracking device, an input from a navigation device, etc.
A mouse 310 controls the position of a cursor icon 401 that is displayed on the display screen 340. The cursor 401 is moved by moving the mouse 310 over a flat surface, such as the top of a desk, in the desired direction of movement of the cursor 401. Thus, the two-dimensional movement of the mouse 310 on the flat surface translates into a corresponding two-dimensional movement of the cursor 401 on the display screen 340.
A mouse 310 typically has one or more finger actuated control buttons (i.e., mouse buttons). While the mouse buttons can be used for different functions such as selecting a menu option pointed at by the cursor 401, the disclosed invention may use a single mouse button to “select” a lens 410 and to trace the movement of the cursor 401 along a desired path. Specifically, to select a lens 410, the cursor 401 is first located within the extent of the lens 410. In other words, the cursor 401 is “pointed” at the lens 410. Next, the mouse button is depressed and released. That is, the mouse button is “clicked”. Selection is thus a point and click operation. To trace the movement of the cursor 401, the cursor 401 is located at the desired starting location, the mouse button is depressed to signal the computer 320 to activate a lens control element, and the mouse 310 is moved while maintaining the button depressed. After the desired path has been traced, the mouse button is released. This procedure is often referred to as “clicking” and “dragging” (i.e., a click and drag operation). It will be understood that a predetermined key on a keyboard 310 could also be used to activate a mouse click or drag. In the following, the term “clicking” will refer to the depression of a mouse button indicating a selection by the user and the term “dragging” will refer to the subsequent motion of the mouse 310 and cursor 401 without the release of the mouse button.
The GUI 400 may include the following lens control elements: move, pickup, resize base, resize focus, fold, magnify, zoom, and scoop. Each of these lens control elements has at least one lens control icon or alternate cursor icon associated with it. In general, when a lens 410 is selected by a user through a point and click operation, the following lens control icons may be displayed over the lens 410: pickup icon 450, base outline icon 412, base bounding rectangle icon 411, focal region bounding rectangle icon 421, handle icons 481, 482, 491 magnify slide bar icon 440, zoom icon 495, and scoop slide bar icon (not shown). Typically, these icons are displayed simultaneously after selection of the lens 410. In addition, when the cursor 401 is located within the extent of a selected lens 410, an alternate cursor icon 460, 470, 480, 490, 495 may be displayed over the lens 410 to replace the cursor 401 or may be displayed in combination with the cursor 401. These lens control elements, corresponding icons, and their effects on the characteristics of a lens 410 are described below with reference to
In general, when a lens 410 is selected by a point and click operation, bounding rectangle icons 411, 421 are displayed surrounding the base 412 and focal region 420 of the selected lens 410 to indicate that the lens 410 has been selected. With respect to the bounding rectangles 411, 421 one might view them as glass windows enclosing the lens base 412 and focal region 420, respectively. The bounding rectangles 411, 421 include handle icons 481, 482, 491 allowing for direct manipulation of the enclosed base 412 and focal region 420 as will be explained below. Thus, the bounding rectangles 411, 421 not only inform the user that the lens 410 has been selected, but also provide the user with indications as to what manipulation operations might be possible for the selected lens 410 though use of the displayed handles 481, 482, 491. Note that it is well within the scope of the present invention to provide a bounding region having a shape other than generally rectangular. Such a bounding region could be of any of a great number of shapes including oblong, oval, ovoid, conical, cubic, cylindrical, polyhedral, spherical, etc.
Moreover, the cursor 401 provides a visual cue indicating the nature of an available lens control element. As such, the cursor 401 will generally change in form by simply pointing to a different lens control icon 450, 412, 411, 421, 481, 482, 491, 492, 440. For example, when resizing the base 412 of a lens 410 using a corner handle 491, the cursor 401 will change form to a resize icon 490 once it is pointed at (i.e., positioned over) the corner handle 491. The cursor 401 will remain in the form of the resize icon 490 until the cursor 401 has been moved away from the corner handle 491.
Lateral movement of a lens 410 is provided by the move lens control element of the GUI 400. This functionality is accomplished by the user first selecting the lens 410 through a point and click operation. Then, the user points to a point within the lens 410 that is other than a point lying on a lens control icon 450, 412, 411, 421, 481, 482, 491, 492, 440. When the cursor 401 is so located, a move icon 460 is displayed over the lens 410 to replace the cursor 401 or may be displayed in combination with the cursor 401. The move icon 460 not only informs the user that the lens 410 may be moved, but also provides the user with indications as to what movement operations are possible for the selected lens 410. For example, the move icon 460 may include arrowheads indicating up, down, left, and right motion. Next, the lens 410 is moved by a click and drag operation in which the user clicks and drags the lens 410 to the desired position on the screen 340 and then releases the mouse button 310. The lens 410 is locked in its new position until a further pickup and move operation is performed.
Lateral movement of a lens 410 is also provided by the pickup lens control element of the GUI. This functionality is accomplished by the user first selecting the lens 410 through a point and click operation. As mentioned above, when the lens 410 is selected a pickup icon 450 is displayed over the lens 410 near the centre of the lens 410. Typically, the pickup icon 450 will be a crosshairs. In addition, a base outline 412 is displayed over the lens 410 representing the base 412 of the lens 410. The crosshairs 450 and lens outline 412 not only inform the user that the lens has been selected, but also provides the user with an indication as to the pickup operation that is possible for the selected lens 410. Next, the user points at the crosshairs 450 with the cursor 401. Then, the lens outline 412 is moved by a click and drag operation in which the user clicks and drags the crosshairs 450 to the desired position on the screen 340 and then releases the mouse button 310. The full lens 410 is then moved to the new position and is locked there until a further pickup operation is performed. In contrast to the move operation described above, with the pickup operation, it is the outline 412 of the lens 410 that the user repositions rather than the full lens 410.
Resizing of the base 412 (or outline) of a lens 410 is provided by the resize base lens control element of the GUI. After the lens 410 is selected, a bounding rectangle icon 411 is displayed surrounding the base 412. For a rectangular shaped base 412, the bounding rectangle icon 411 may be coextensive with the perimeter of the base 412. The bounding rectangle 411 includes handles 491, 492. These handles 491, 492 can be used to stretch the base 412 taller or shorter, wider or narrower, or proportionally larger or smaller. The corner handles 491 will keep the proportions the same while changing the size. The middle handles (not shown) will make the base 412 taller or shorter, wider or narrower. Resizing the base 412 by the corner handles 491 will keep the base 412 in proportion. Resizing the base 412 by the middle handles will change the proportions of the base 412. That is, the middle handles change the aspect ratio of the base 412 (i.e., the ratio between the height and the width of the bounding rectangle 411 of the base 412). When a user points at a handle 491 with the cursor 401 a resize icon 490 may be displayed over the handle 491 to replace the cursor 401 or may be displayed in combination with the cursor 401. The resize icon 490 not only informs the user that the handle 491 may be selected, but also provides the user with indications as to the resizing operations that are possible with the selected handle. For example, the resize icon 490 for a corner handle 491 may include arrows indicating proportional resizing. The resize icon (not shown) for a middle handle may include arrows indicating width resizing or height resizing. After pointing at the desired handle 491 the user would click and drag the handle 491 until the desired shape and size for the base 412 is reached. Once the desired shape and size are reached, the user would release the mouse button 310. The base 412 of the lens 410 is then locked in its new size and shape until a further base resize operation is performed.
Resizing of the focal region 420 of a lens 410 is provided by the resize focus lens control element of the GUI. After the lens 410 is selected, a bounding rectangle icon 421 is displayed surrounding the focal region 420. For a rectangular shaped focal region 420, the bounding rectangle icon 421 may be coextensive with the perimeter of the focal region 420. The bounding rectangle 421 includes handles 481, 482. These handles 481, 482 can be used to stretch the focal region 420 taller or shorter, wider or narrower, or proportionally larger or smaller. The corner handles 481 will keep the proportions the same while changing the size. The middle handles 482 will make the focal region 420 taller or shorter, wider or narrower. Resizing the focal region 420 by the corner handles 481 will keep the focal region 420 in proportion. Resizing the focal region 420 by the middle handles 482 will change the proportions of the focal region 420. That is, the middle handles 482 change the aspect ratio of the focal region 420 (i.e., the ratio between the height and the width of the bounding rectangle 421 of the focal region 420). When a user points at a handle 481, 482 with the cursor 401 a resize icon 480 may be displayed over the handle 481, 482 to replace the cursor 401 or may be displayed in combination with the cursor 401. The resize icon 480 not only informs the user that a handle 481, 482 may be selected, but also provides the user with indications as to the resizing operations that are possible with the selected handle. For example, the resize icon 480 for a corner handle 481 may include arrows indicating proportional resizing. The resize icon 480 for a middle handle 482 may include arrows indicating width resizing or height resizing. After pointing at the desired handle 481, 482, the user would click and drag the handle 481, 482 until the desired shape and size for the focal region 420 is reached. Once the desired shape and size are reached, the user would release the mouse button 310. The focal region 420 is then locked in its new size and shape until a further focus resize operation is performed.
Folding of the focal region 420 of a lens 410 is provided by the fold control element of the GUI. In general, control of the degree and direction of folding (i.e., skewing of the viewer aligned vector 231 as described by Carpendale) is accomplished by a click and drag operation on a point 471, other than a handle 481, 482, on the bounding rectangle 421 surrounding the focal region 420. The direction of folding is determined by the direction in which the point 471 is dragged. The degree of folding is determined by the magnitude of the translation of the cursor 401 during the drag. In general, the direction and degree of folding corresponds to the relative displacement of the focus 420 with respect to the lens base 410. In other words, and referring to
Magnification of the lens 410 is provided by the magnify lens control element of the GUI. After the lens 410 is selected, the magnify control is presented to the user as a slide bar icon 440 near or adjacent to the lens 410 and typically to one side of the lens 410. Sliding the bar 441 of the slide bar 440 results in a proportional change in the magnification of the lens 410. The slide bar 440 not only informs the user that magnification of the lens 410 may be selected, but also provides the user with an indication as to what level of magnification is possible. The slide bar 440 includes a bar 441 that may be slid up and down, or left and right, to adjust and indicate the level of magnification. To control the level of magnification, the user would click on the bar 441 of the slide bar 440 and drag in the direction of desired magnification level. Once the desired level of magnification is reached, the user would release the mouse button 310. The lens 410 is then locked with the selected magnification until a further magnification operation is performed. In general, the focal region 420 is an area of the lens 410 having constant magnification (i.e., if the focal region is a plane). Again referring to
Zoom functionality is provided by the zoom lens control element of the GUI. Referring to
In particular, after the lens 410 is selected, a bounding rectangle icon 411 is displayed surrounding the base 412 and a bounding rectangle icon 421 is displayed surrounding the focal region 420. Zoom functionality is accomplished by the user first selecting the zoom icon 495 through a point and click operation When a user selects zoom functionality, a zoom cursor icon 496 may be displayed to replace the cursor 401 or may be displayed in combination with the cursor 401. The zoom cursor icon 496 provides the user with indications as to what zoom operations are possible. For example, the zoom cursor icon 496 may include a magnifying glass. By choosing a point within the extent of the focal region, within the extent of the lens, or without the extent of the lens, the user may control the zoom function. To zoom in to the extent of the focal region such that the extent of the focal region fills the display screen 340 (i.e., “zoom to focal region extent”), the user would point and click within the extent of the focal region. To zoom in to the extent of the lens such that the extent of the lens fills the display screen 340 (i.e., “zoom to lens extent”), the user would point and click within the extent of the lens. Or, to zoom in to the presentation area without the extent of the focal region, such that the area without the extent of the focal region is magnified to the same level as the extent of the focal region (i.e., “zoom to scale”), the user would point and click without the extent of the lens. After the point and click operation is complete, the presentation is locked with the selected zoom until a further zoom operation is performed.
Alternatively, rather than choosing a point within the extent of the focal region, within the extent of the lens, or without the extent of the lens to select the zoom function, a zoom function menu with multiple items (not shown) or multiple zoom function icons (not shown) may be used for zoom function selection. The zoom function menu may be presented as a pull-down menu. The zoom function icons may be presented in a toolbar or adjacent to the lens 410 when the lens is selected. Individual zoom function menu items or zoom function icons may be provided for each of the “zoom to focal region extent”, “zoom to lens extent”, and “zoom to scale” functions described above. In this alternative, after the lens 410 is selected, a bounding rectangle icon 411 may be displayed surrounding the base 412 and a bounding rectangle icon 421 may be displayed surrounding the focal region 420. Zoom functionality is accomplished by the user selecting a zoom function from the zoom function menu or via the zoom function icons using a point and click operation. In this way, a zoom function may be selected without considering the position of the cursor 401 within the lens 410.
The concavity or “scoop” of the shoulder region 430 of the lens 410 is provided by the scoop lens control element of the GUI. After the lens 410 is selected, the scoop control is presented to the user as a slide bar icon (not shown) near or adjacent to the lens 410 and typically below the lens 410. Sliding the bar (not shown) of the slide bar results in a proportional change in the concavity or scoop of the shoulder region 430 of the lens 410. The slide bar not only informs the user that the shape of the shoulder region 430 of the lens 410 may be selected, but also provides the user with an indication as to what degree of shaping is possible. The slide bar includes a bar that may be slid left and right, or up and down, to adjust and indicate the degree of scooping. To control the degree of scooping, the user would click on the bar of the slide bar and drag in the direction of desired scooping degree. Once the desired degree of scooping is reached, the user would release the mouse button 310. The lens 410 is then locked with the selected scoop until a further scooping operation is performed.
Advantageously, a user may choose to hide one or more lens control icons 450, 412, 411, 421, 481, 482, 491, 492, 440, 495 shown in
In addition, the GUI 400 maintains a record of control element operations such that the user may restore pre-operation presentations. This record of operations may be accessed by or presented to the user through “Undo” and “Redo” icons 497, 498, through a pull-down operation history menu (not shown), or through a toolbar.
Thus, detail-in-context data viewing techniques allow a user to view multiple levels of detail or resolution on one display 340. The appearance of the data display or presentation is that of one or more virtual lenses showing detail 233 within the context of a larger area view 210. Using multiple lenses in detail-in-context data presentations may be used to compare two regions-of-interest at the same time. Folding enhances this comparison by allowing the user to pull the regions-of-interest closer together. Moreover, using detail-in-context technology, a region-of-interest can be magnified to pixel level resolution, or to any level of detail available from the source information, for in-depth review. The digital images may include graphic images, maps, photographic images, or text documents, and the source information may be in raster, vector, or text form.
For example, in order to view a selected object or region-of-interest in detail, a user can define a lens 410 over the object or region-of-interest using the GUI 400. The lens 410 may be introduced to the original image to form the a presentation through the use of a pull-down menu selection, tool bar icon, etc. Using lens control elements for the GUI 400, such as move, pickup, resize base, resize focus, fold, magnify, zoom, and scoop, as described above, the user adjusts the lens 410 for detailed viewing of the object or region-of-interest. Using the magnify lens control element, for example, the user may magnify the focal region 420 of the lens 410 to pixel quality resolution revealing detailed information pertaining to the selected object or region-of-interest. That is, a base image (i.e., the image outside the extent of the lens) is displayed at a low resolution while a lens image (i.e., the image within the extent of the lens) is displayed at a resolution based on a user selected magnification 440, 441.
In operation, the data processing system 300 employs EPS techniques with an input device 310 and GUI 400 for selecting objects or regions-of-interest for detailed display to a user on a display screen 340. Data representing an original image or representation is received by the CPU 320 of the data processing system 300. Using EPS techniques, the CPU 320 processes the data in accordance with instructions received from the user via an input device 310 and GUI 400 to produce a detail-in-context presentation. The presentation is presented to the user on a display screen 340. It will be understood that the CPU 320 may apply a transformation to the shoulder region 430 surrounding the focal region 420 to affect blending or folding in accordance with EPS techniques. For example, the transformation may map the focal region 420 and/or shoulder region 430 to a predefined lens surface 230, defined by a transformation or distortion function and having a variety of shapes, using EPS techniques. Or, the lens 410 may be simply coextensive with the region-of-interest or focal region 420.
The lens control elements of the GUI 400 are adjusted by the user via an input device 310 to control the characteristics of the lens 410 in the detail-in-context presentation. Using an input device 310 such as a mouse, a user adjusts parameters of the lens 410 using icons and scroll bars of the GUI 400 that are displayed over the lens 410 on the display screen 340. The user may also adjust parameters of the image of the full scene. Signals representing input device 310 movements and selections are transmitted to the CPU 320 of the data processing system 300 where they are translated into instructions for lens control.
Moreover, the lens 410 may be added to the presentation before or after the object or area is selected. That is, the user may first add a lens 410 to a presentation or the user may move a pre-existing lens into place over the selected object or region-of-interest. The lens 410 may be introduced to the original image to form the presentation through the use of a pull-down menu selection, tool bar icon, etc.
Advantageously, by using a detail-in-context lens 410 to select an object or region-of-interest for detailed information gathering, a user can view a large area (i.e., outside the extent of the lens 410) while focusing in on a smaller area (or within the focal region 420 of the lens 410) surrounding the selected object or region-of-interest. This makes it possible for a user to accurately gather detailed information without losing visibility or context of the portion of the original image surrounding the selected object or region-of-interest.
Thus, computer generated detail-in-context lens (or fisheye lens) presentations are a valuable tool for computer users. These presentations provide the ability to view data at multiple scales simultaneously, while preserving context, and maintaining continuity of data.
In order to render or generate such fisheye lens presentations, it is sometimes desirable or necessary to execute optimized or specialized rendering algorithms other than the displacement followed by perspective projection algorithm described above. These algorithms can be useful for overcoming limitations of hardware or software in any particular operating environment. As an example, United States Patent Application Publication No. 2003/0151625 by Shoemaker, which is incorporated herein by reference, discusses a rendering technique using pre-calculated texel coverages for the rendering of lenses. Also, United States Patent Application Publication No. 2003/0151626 by Komar et al., which is incorporated herein by reference, discusses the use of stretch bit-block transfer (“blit”) graphics operations for efficient rendering of pyramid shaped lenses.
While these two patent applications discuss rendering techniques that are useful for situations where performance needs to be optimized, there is another situation where a specialized rendering technique can be useful. This is the situation where not all standard graphics operations are available for a given data processing system. For example, if pixel copying operations are not available, then the technique described by U.S. Patent Application Publication No. 2003/0151625 would not be possible, and if stretch blit operations are not available, then the technique described in U.S. Patent Application Publication No. 2003/0151626 would not be possible.
In the following, a method is described for rendering pyramid shaped fisheye lenses using a minimum of graphics operations. Specifically, only image rendering, image scaling, depth ordering, and image masking capabilities are required. This method is advantageous in environments in which standard graphics operations are not all available. An example of such an environment is a Web browser. While it is possible to run full-featured executables, such as a Java™ Applet or ActiveX™ control (in which a full array of graphics capabilities are available) in a browser, sometimes it is desirable to implement all functionality using basic browser capabilities, such as hypertext markup language (“HTML”) rendering, using the document object model (“DOM”), and basic scripting, such as JavaScript™. Recently, this approach has become particularly popular and has been referred to as asynchronous JavaScript and XML (“AJAX”), where XML refers to the extensible markup language. While the method of the present invention is not limited to this particular environment, this environment is one in which the method may be advantageously used.
At the root of the problem of rendering lenses in an AJAX client (e.g., Web browser) is the fact that rendering operations in such a client are limited. For example, JavaScript™ has almost no capability for rendering. It is used instead for manipulating elements in the DOM. The DOM does provide some capabilities for the visual presentation of data. Accordingly, the relevant client capabilities with respect to the present invention are as follows: images can be placed at a particular location in the browser window; images can be resized; rendering order can be changed; and,
images can be masked (i.e., rectangular regions can be defined for each image where rendering occurs, outside of which no rendering takes place). The lens rendering or generating method of the present invention differs from that of U.S. Patent Application Publication No. 2003/0151626 in that the graphics operations required are different. According to the method of the present invention, one or more of image rendering, image resizing, image ordering, and image masking are the required operations.
In order to construct the presentation 500 of
In
A pyramid fisheye lens may be considered as a rectangular inset lens (e.g., 510) with an added shoulder region of variable magnification that joins the lens focal region (i.e., equivalent to the inset image 520 region of presentation 500 of
The method of the present invention uses a layering technique which stacks multiple renderings or images (i.e., intermediate level images 631, 632, 633, 634) on top of one another in order to render a pyramid lens 610. The method includes several steps (i.e., n steps). Step 1 consists of rendering the contextual image 640. Steps 2 to n−1 consist of rendering the intermediate level images 631, 632, 633, 634, where n is the number of intermediate levels (e.g., n=4 for
Steps 2 to n−1 are similar to the inset image rendering step n. What differs is that with each step from step 2 to step n−1, the region that is masked, in terms of screen coordinates, grows progressively smaller, and the data magnification level increases (and hence the data source may change, if different data sources are being used for different scales or magnification levels). The end result is that all intermediate level images 631, 632, 633, 634 are hidden except for a thin boundary around their respective perimeters or bounds. The effect is similar to a number of picture frames being stacked within one another, with each picture frame showing its picture (or data) at a different scale.
According to one embodiment, the change in region mask size can be varied in order to optimize for either quality or performance. If quality is to be optimized, then the mask can decrease in size to as little as 1 pixel per level 631, 632, 633, 634. This makes the approximation of the shoulder accurate to the level of 1 pixel, the best possible for a typical display screen 340. This will, however, result in possibly a large number of levels n being used, which may result in poor performance. The opposite strategy is to decrease the mask size in steps larger than 1 pixel per level 631, 632, 633, 634. Decreasing the number of steps lowers the quality of the rendering, but requires fewer levels n, hence improving performance.
Regardless of how the change in region masking size per level 631, 632, 633, 634 is chosen, the change in coverage of the level in data space, and hence the magnification of the underlying data, must be chosen appropriately. In this case, “appropriately” means, first, that the levels 631, 632, 633, 634 must vary such that at the lens boundary 612 where the shoulder region 630 meets the contextual image 640 and at the focal bounds 621 where the shoulder region 630 meets the focus image 620, the data (i.e., images 631, 634) in the shoulder region lines up with the adjoining data (i.e., images 640, 620) in the contextual and focal regions, and the magnification levels converge. The parameters defining the magnification and area of the levels 631, 632, 633, 634 may vary through the shoulder region 630. That is, the shoulder function or drop-off function (see above) defining the “shape” of the shoulder 630 may be arbitrary. However, according to one embodiment, the shape of the shoulder function (or distortion function defining the shape of the lens) is continuous providing a smooth transition from the contextual region 640 through the shoulder region 630 to the focal region 620.
According to one embodiment, the GUI 400 of
According to another embodiment, the GUI 550 of
To reiterate, according to one embodiment, there is provided a method for generating a presentation of a region-of-interest in an original image 640 for display on a display screen 340, comprising: establishing a focal region for the region-of-interest at least partially surrounded by a shoulder region (e.g., selected by a user); creating a focal region image 620 for the focal region by scaling the original image within the focal region by a focal region magnification; creating a shoulder region image 631 for the shoulder region by scaling the original image within the shoulder region by a shoulder region magnification, the shoulder region magnification being less than the focal region magnification; and, overlaying the focal region image 620 and the shoulder region image 631 on the original image 640 to thereby generate the presentation.
In the above method, the step of creating the focal region image 620 may further include masking regions of the original image 640 outside the focal region, the step of creating the shoulder region image 631 may further include masking regions of the original image 640 outside the shoulder region, and the step of overlaying may further include masking regions of the original image 640 within the focal and shoulder regions. The shoulder region image 631 may comprise a sequence of shoulder region images 631, 632, 633, 634 to smoothly (e.g., continuously) join the focal region image 620 to the original image 640. Each of the sequence of shoulder region images 631, 632, 633, 634 may have a respective shoulder region magnification that increases from a shoulder region image 631 adjacent to the original image 640 to a shoulder region image 634 adjacent to the focal region image 620. Each of the sequence of shoulder regions images 631, 632, 633, 634 may have a respective size that decreases from a shoulder region image adjacent 631 to the original image 640 to a shoulder region image 634 adjacent to the focal region image 620. The method may further include receiving one or more signals to adjust the focal region through a graphical user interface (“GUI”) 400, 550 having means for adjusting at least one of a size of the focal region, a shape of the focal region, and the focal region magnification. The means for adjusting the size and shape may be at least one handle icon 481, 482 positioned on a perimeter 421, 621 of the focal region and the means for adjusting the focal region magnification may be at least one of a slide bar icon 440, 441, an increase magnification button 553, and a decrease magnification button 554. The shoulder region magnification may be a function of the focal region magnification. The method may further include receiving one or more signals to adjust the shoulder region through a graphical user interface (“GUI”) 440, 550 having means for adjusting at least one of a size of the shoulder region, a shape of the shoulder region, and a shape of the function (e.g., the scoop or shape of the distortion function, shoulder function, or shoulder drop-off function, etc.). The means for adjusting the size and shape may be at least one handle icon 491 positioned on a perimeter 411, 412, 612 of the shoulder region and the means for adjusting the shape of the function may be a slide bar icon. The method may further include receiving one or more signals to adjust at least one of the focal region, the shoulder region, and the original image outside the shoulder region through a graphical user interface (“GUI”) 400 having means for at least one of: increasing the focal region magnification such that the focal region fills the display screen 340; increasing the focal and shoulder region magnifications such that the focal and shoulder regions fill the display screen 340; and, applying the focal region magnification uniformly to the focal region, the shoulder region, and the original image outside the shoulder region. The means may be a respective selectable zoom icon for each of the focal region, the shoulder region, and the original image outside the shoulder region. And, the means may be a respective selectable zoom area in each of the focal region, the shoulder region, and the original image outside the shoulder region.
Thus, there are a number of methods for generating detail-in-context presentations including the following: displacement followed by perspective projection (as described above and in U.S. Pat. No. 6,768,497 to Baar, et al, which is incorporated herein by reference); using pre-calculated texel coverages (as described in United States Patent Application Publication No. 2003/0151625 by Shoemaker); using stretch bit-block transfer (“blit”) operations (as described in United States Patent Application Publication No. 2003/0151626 by Komar et al.); and, using layering (and scaling, masking, etc.) as described above.
However, challenges remain with respect to generating detail-in-context presentations on the Internet and in other client/server applications where limitations on network bandwidth and server capacity may exist. In addition, limitations may exist with respect to the software installation and execution capabilities of client software (e.g., browser software) installed on clients coupled to a server. For example, in the case of an Internet “portal” site which may have thousands of users, the load on the server with respect to its rendering capacity and the impact on network bandwidth from thousands of connected clients may present significant design challenges. In addition, browser capabilities may be severely limited by security rules and other constraints at the client. Furthermore, it is often desirable that clients have no software installed other than JavaScript™ when browsing a given website.
The layering method described above may be considered as a client-side method for generating detail-in-context lens presentations. The software necessary for implementing the method may be client-side software. However, this layering method may be limited due to current browser JavaScript™ capabilities. For example, the lenses generated may be restricted to simple truncated pyramid shapes (or similar shapes) and the quality of rendering of the lens' shoulder region 630 may be restricted by the number of layers 631, 632, 633, 634 used to build the pyramid shape. As described above, improved visual quality of the shoulder region 630 may be achieved by increasing the number of layers in the shoulder region and decreasing the size of each layer.
According to one embodiment of the invention there is provided a method for generating detail-in-context lens presentations in client/server systems (e.g., in performance-constrained online environments). This method can be used in conjunction with the above-described layering method (and potentially with other client-side lens generation methods) to improve the quality of detail-in-context lens presentations and to support the generation of new lens shapes. The method minimizes the demands on servers to perform server-side rendering yet preserves some lens generation functionality at the client in the event that website traffic or network or server limitations make server-side lens generation unavailable.
Now, during periods when a user is actively moving a lens 610 (i.e., a presentation of the lens) across an original image 640 on the display screen 340, the user is less sensitive to the quality of rendering of the shoulder or shoulder region 630 of the lens 610. Hence, the rendering quality may be decreased for the shoulder region 630 during periods of lens movement. According to one embodiment, during periods of lens movement initiated by the user (or otherwise), the operations required to generate a presentation of the lens 610 are performed by the client using, for example, the layering method described above. According to this embodiment, when the user stops moving the lens 610 about the original image 640 (e.g., if the user selects a particular location for the lens 610 in the original image 640, if a predetermined period of time expires, etc.), rendering of a presentation of the lens 610 is performed by the server and the rendered presentation of the lens 610 is then downloaded to the client for display of the client's display screen 340. Advantageously, since the server typically does not have the rendering limitations of the client, this method allows higher quality lens shoulders to be rendered by the server (e.g., by the displacement followed by perspective projection method, by the pre-calculated texel coverages method, by the stretch bit-block transfer method, etc.).
According to one embodiment, the layering method may be performed by the client during periods when the server or network is heavily loaded or is otherwise performing slowly.
According to one embodiment, the server may be used to render lens shapes other than simple truncated pyramids. For example, lenses with rounded shoulders, etc., may be generated by the server. Furthermore, the server can provide additional server-side rendering of new information or blending of new information layers.
According to one embodiment, the rendering or occasional rendering of lenses by the server can also be used to temporarily present content such as advertising to the client browser for presentation to the user on the display screen 340.
According to one embodiment, the higher quality rendering (e.g., by the displacement followed by perspective projection method, by the pre-calculated texel coverages method, by the stretch bit-block transfer method, etc.) may be provided by a separate lens rendering server or proxy server or by a lens rendering module downloaded to the client.
Advantageously, the above embodiments address the problem of a server not being able to keep up with the rendering demands of a large number of client users. In this case, client-side lens generation is maintained and the user is provided with useful detail-in-context presentations, albeit presentations that may have lens images 620, 630 or at least shoulder images 630 that are rendered at a lower quality.
The above described method may be summarized with the aid of a flowchart.
At step 801, the operations 800 start.
At step 802, a lens 610 having a focal region 620 for the region-of-interest at least partially surrounded by a shoulder region 630 is established (e.g., by user selection, etc).
At step 803, if the lens 610 is in transit between first and second locations for the region-of-interest in the original image 640, the lens 610 is applied to the original image 640 by a first method to generate the presentation.
At step 804, if the lens 610 is stationary in the original image 640, the presentation is received from the server, the server having applied the lens 610 to the original image 640 by a second method to generate the presentation.
At step 805, the operations 800 end.
In the above method, the first method may require less resources (e.g., processing power, rendering functionality, etc.) than the second method. The lens 610 may have a shape and the second method may more accurately reflect the shape of the lens in the presentation than the first method. The shoulder region 630 may have a shape and the second method may more accurately reflect the shape of the shoulder region in the presentation than the first method. The second method may include displacing the original image 640 onto the lens 610 to produce a displaced image and projecting the displaced image onto a plane 201 in a direction 231 aligned with a viewpoint 240 for the region-of-interest 233. The first method may include: creating a focal region image for the focal region 620 by scaling the original image 640 within the focal region 620 by a focal region magnification; creating a shoulder region image for the shoulder region 630 by scaling the original image 640 within the shoulder region 630 by a shoulder region magnification, the shoulder region magnification being less than the focal region magnification; and, overlaying the focal region image and the shoulder region image on the original image 640. The method may further include receiving a signal indicating the transit between the first and second locations from a graphical user interface (“GUI”) 400 displayed over the lens 610 on the display screen 340. The method may further include, if the lens 610 is stationary in the original image 640, sending a signal from the system 300 to the server requesting the presentation. The method may further include, if the lens 610 is stationary in the original image 640 and if the server is unavailable, applying the lens 610 to the original image 640 by the first method to generate the presentation within the system 300. And, the method may further include displaying the presentation on the display screen 340.
According to one embodiment, the above method may be implemented by the server rather than, or in addition to, the client.
While this invention is primarily discussed as a method, a person of ordinary skill in the art will understand that the apparatus discussed above with reference to a data processing system 300, may be programmed to enable the practice of the method of the invention. Moreover, an article of manufacture for use with a data processing system 300, such as a pre-recorded storage device or other similar computer readable medium including program instructions recorded thereon, may direct the data processing system 300 to facilitate the practice of the method of the invention. It is understood that such apparatus and articles of manufacture also come within the scope of the invention.
In particular, the sequences of instructions which when executed cause the method described herein to be performed by the data processing system 300 can be contained in a data carrier product according to one embodiment of the invention. This data carrier product can be loaded into and run by the data processing system 300. In addition, the sequences of instructions which when executed cause the method described herein to be performed by the data processing system 300 can be contained in a computer software product according to one embodiment of the invention. This computer software product can be loaded into and run by the data processing system 300. Moreover, the sequences of instructions which when executed cause the method described herein to be performed by the data processing system 300 can be contained in an integrated circuit product (e.g., a hardware module or modules) which may include a coprocessor or memory according to one embodiment of the invention. This integrated circuit product can be installed in the data processing system 300.
The embodiments of the invention described above are intended to be exemplary only. Those skilled in the art will understand that various modifications of detail may be made to these embodiments, all of which come within the scope of the invention.
This application claims priority from U.S. Provisional Patent Application No. 60/727,507, filed Oct. 18, 2005, and incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
60727507 | Oct 2005 | US |