The present invention relates generally to visualization of datasets, and more particularly the invention relates to visualization of a portion of a dataset and visualization of a context of the portion of the dataset.
A variety of types of two dimensional (2D) visualizations or displays are useful for many applications. Maps may provide geographic and directional information. 2D data graphs convey relationships between variables having meaning to technology, business and everyday life. 2D visualizations are used in design of buildings and devices. 2D displays are also used in critical situations, such as the routing of emergency vehicles and in response to disasters. Satellites orbiting the earth provide us with multi-dimensional data that can be displayed as detailed 2D maps of extensive areas. The amount of data included with the maps and other visualizations may be very large and can be expected to increase over time as a consequence of developing sensor, web, storage and computing technologies.
Browsing and inspecting data (e.g., two, three or multi dimensional data) in 2D visual representations, such as maps and graphs, can be challenging especially when the amount of data is large. Inspecting a particular data element may require zooming in to a small portion of a total 2D space covered by the data, then using panning and scrolling to view surrounding information. Understanding a subset of the data, and gaining good insight into its context has been a difficult problem in visualization.
Principles of the invention provide, for example, methods and apparatus for visualizing a dataset. For example, in accordance with one aspect of the invention, a method for visualizing a dataset is provided. The method includes identifying a first portion and at least a second portion of the dataset, forming a summary of the second portion of the dataset, and visualizing, on a display device, the first portion of the dataset and the summary of the second portion of the dataset. The summary is represented by one or more spatial shapes different from a spatial shape representative of the second portion before the formation of the summary. The identification of the first portion and the second portion, the formation of the summary, and the visualization of the first portion and the summary are implemented in accordance with a processor device associated with the display device.
In accordance with another embodiment of the invention, apparatus for visualizing a dataset is provided. The apparatus includes a memory and a processor coupled to the memory. The apparatus is operative or configured to perform the above method.
In accordance with another embodiment of the invention, a system for visualizing a dataset is provided. The system comprises modules for implementing the above method.
In accordance with one more embodiment of the invention, an article of manufacture for visualizing a dataset is provided. The article of manufacture tangibly embodies a computer readable program code which, when executed, causes the computer to carry out the above method for visualizing a dataset.
Aspects of the invention provide, for example, viewing a focused-upon portion of a dataset while also viewing contextual information about other data of the dataset that is outside of the focused-upon portion. Further aspects of the invention provide visual metadata for information beyond a viewing window (e.g., a focused-upon viewing window).
These and other features, objects and advantages of the present invention will become apparent from the following detailed description of illustrative embodiments thereof, which is to be read in connection with the accompanying drawings.
Techniques of the present invention will be described herein in the context of illustrative methods for visualization of two-dimensional data. It is to be appreciated, however, that the techniques of the present invention are not limited to the specific method shown and described herein. Rather, embodiments of the invention are directed broadly to techniques for visualization and display of data, information or knowledge of any dimension. For this reason, numerous modifications can be made to the embodiments shown that are within the scope of the present invention. No limitations with respect to the specific embodiments described herein are intended or should be inferred.
The term summary, as used herein, may refer to a brief, concise or compressed representation of what is being summarized, for example, data that is being summarized. By way of example only, a summary may be a concise representation of all data within a dataset, or may be a concise representation of selected data from the dataset. For example, if a dataset comprises two groups of data, a summary may be a mathematical average of both groups of data together or a mathematical average of only one of the two groups of data. More generally, the term summary may refer to a representation of one or more particular aspects of a dataset or a representation of one or more particular aspects of a visualization of a dataset. For example, a particular aspect of a geographical map may be a location of a land mass. The corresponding summary may be an icon indicating the position of the land mass.
A dataset, as used herein, comprises data associated with a visualization, for example, the visualizations shown in
A visualization is a visual representation of information, data or knowledge. Visualizations include, but are not limited to, images such as, for example, images displayed in accordance with computing or processor devices, cellular phones and gaming devices. Visualizations may be dynamic in that the visualization may be updated periodically or continuously, or visualizations may be static in that the visualization is fixed. Examples of dynamic visualizations may include images associated with processor devices. Examples of static visualizations may include maps and images of information on paper, film or other media. Visualizations and images may be presented on display devices, for example, display devices associated with processor devices, cellular phones and gaming devices; paper; billboards and other public display devices.
Exemplary visualizations or images, according to certain embodiments of the invention comprise an in-view visualization or image and summary strips. The in-view visualization includes visualization (e.g., a view or views) of a map, data, information or knowledge. By way of example only, the in-view visualization may include data or information that is original, for example, not compressed or summarized for representation in the summary strips. The data or information in the in-view visualization may include, for example, data or information that could be direct information, such as a heat map showing population across the United States, or of processed information, such as a heat map of calculated income per capita, or a visualization that highlights only those zip codes with income per-capita above a certain value. The summary strips include visualizations or images of the map, data, information or knowledge that is currently outside of the in-view visualization (i.e., out of view information). The summary strips may include information, data or knowledge that is compressed, summarized or transformed (e.g., spatially, statistically or mathematical transformed). A summary strip may be, for example, a rectangular strip, a circular annulus surrounding a circular view of the data, or other geometric forms depending on the application.
Browsing and inspecting two dimensional (2D) data in visual representations can be challenging when the amount of data is large. Inspecting a particular data element may require zooming in to a small portion of a total 2D space covered by the data. While the data element under inspection may be visible at a deep zoom level, it becomes much harder for the user to see the context of this data element, especially when context involves more than just the area that is close to the element under inspection in Euclidean distance. For example, the user may want to keep the context of other elements that are close to the data element under inspection in just the x-direction or the y-direction, and may be interested in understanding how events far away in x and y influence the data element under analysis.
Techniques for visualizing 2D data include a graphical fish-eye and a zoom-panel. The graphical fish-eye distorts the data and does not work well for large amounts of data. The zoom-panel is a small image panel that only shows the spatial position of the zoomed upon viewpoint relative to the total 2D space. Both the fish-eye and zoom panel may show scaled down (e.g., smaller sized) visualizations of individual data items, for example, de-magnified portions of a map. The scaled down visualizations are otherwise similar to their respective original visualizations, that is, the scaled down visualizations do not characterize statistical, analytical, or summary data about the data outside the main view.
Exemplary features of the invention include techniques for displaying information (e.g., context information) which is out of range (i.e., out of view) of an in-view window. The out of view information for a scrolled 2D image, may be presented, for example, along the boundaries of the visualization in displayed summary strips, so that users can get efficient and quick insight into the information that is currently out of range of the in-view window. Information about features that are out of range to the top, bottom or sides of the in-view window may be displayed in these summary strips, which may be placed, for example, at or outside of the top, bottom and/or sides of the in-view window. As the user scrolls, moving attention to different parts of the representation (i.e., scrolling for different in-view visualizations), the information in the summary strips may be dynamically updated.
Visualizations of three dimensional data may also be presented according to embodiments of the invention. Summary strips may represent out of view portions of data for each dimension. More than three dimensions are also contemplated.
The summary strips may comprise, for example, abstractions, summaries (e.g., summary statistics, summary mathematics, summary spatial representations, or a summary of spatial features) or representations for information or data that is not presented in the in-view window. The summary strips, or the information in the summary strips, may, for example, represent, in the form of icons, glyphs, charts, or other representations, large amounts of data that is not presented in the in-view window. Information in the summary strips may also have a temporal component, for example, in the form of a glyph that flashes, moves, or changes shape or color over time. This is distinctly different from visualizations that simply provide scaled down representations of the information or data that is not presented in the in-view window (e.g., a de-magnified portion of a map). Scaled down representations are not precluded from information contained in the summary strips.
By way of example only, consider a map visualization of islands in the Pacific Ocean some of which are inhabited by turtles. For islands that are not presented in the in-view window, the summary strip could contain icons to represent one or more islands and icons to represent those islands inhabited by turtles. Information contained in the summary strips may be, for example, simpler or less than the original information (e.g., information represented by the summary strips and that is not presented in the in-view window). Metadata about the of-screen data are represented in the summary strip.
By way of example only, embodiments of the invention may be visualizations of maps, processes or data points.
Step 110 of method 100 comprises partitioning a dataset by identifying first and second portions of the dataset. The partitioning of the dataset may, for example, be considered as a partitioning of a visualization of the dataset into a first part of the visualization and a second part of the visualization. The step 110 may comprise, for example, scrolling, magnifying or de-magnifying of data within the dataset, by a user or viewer, to place a portion of the dataset into or out of view (i.e., into or out of the in-view visualization). The portion of the visualization or dataset that is placed in the in-view visualization becomes a first part of the visualization or the first portion of the dataset. The portion of the visualization or dataset that remains out of view (i.e., not within the in-view visualization) becomes a second part of the visualization or the second portion of the dataset. Thus, as a user continues to scroll or zoom in upon different data within the dataset, the first and second parts of the visualization are redefined and the first and second portions of the dataset are redefined. This redefinition of the first and second portions in response to scrolling or zooming may happen in a continuous manner or in a periodic manner. In either case, the step 110 may be repeated one or more times.
After partitioning, the dataset comprises a first portion and a second portion. The first portion and the second portion each may have respective original visualizations, each represented by an original spatial shape (i.e., a first portion original spatial shape and a second portion original spatial shape). An original visualization, as used herein, means a visualization of all of the data in the dataset, or a portion thereof, for example, a visualization before any data of the dataset is summarized or transformed to form summary information displayed in the summary strips or indicators. Thus, an original visualization of the second portion of the dataset comprises a visualization of all data in the second portion of the dataset, and/or a visualization of the second portion of the dataset before data of the second portion of the dataset is summarized or transformed to form summary information. An original spatial shape may be considered, for example, a spatial shape that renders or provides for visualization of all of the data within the associated dataset or associated portion of the dataset (e.g., the second portion). An original spatial shape may comprise additional attributes besides just an outline, physical or geometric shape. The original spatial shape may comprise, for example, attributes of color, texture, pattern, shading, transparency and size. Alphanumeric characters are considered examples of shapes.
For example, the 2D map 200 is a rendered original visualization or image of both a first portion and a second portion of a dataset comprising the map 200, that is, it is a rendering of the complete dataset of the map. In
Step 120 comprises forming a summary (e.g., a visual summary) of the second portion of the dataset. The summary may be formed on or by a processor device (e.g., a processor device associated with a display device for displaying visualizations according to principles of the invention and/or a processor device coupled to a memory). By way of example only, the summary may include an abstraction of the second portion, a statistical summary of the second portion, a mathematical summary of the second portion, and a summary of spatial features of the second portion original spatial shape.
Because the summary of the second portion of the dataset may be intended to represent the second portion in a visually simplified manner (e.g., a visual representation of the summary may be visually simpler or less complex than the original visualization of the second portion) information may be lost in forming the summary. That is, an amount of information contained in the summary may be less that an amount of information contained in the second portion of the dataset.
In an embodiment of the invention, step 120 comprises forming a summary of a portion of the dataset (e.g., the second portion of the partitioned dataset) that is represented by the second part (i.e. an out of view part) of the first visualization when the second part of the first visualization is selectively moved out of view on a display device displaying the first visualization.
In an alternate embodiment of the invention, step 120 comprises forming a summary of a portion of the dataset (e.g., the first portion of the partitioned dataset) that is represented by a first part (i.e., and in-view part) of a visualization.
In another alternate embodiment of the invention, step 120 comprises forming a summary of a first portion of the dataset (e.g., the first portion of the partitioned dataset) that may, for example, be represented by a first part (i.e., the in-view part) of a visualization, and a second portion of the dataset (e.g., the second portion of the partitioned dataset) that may, for example, be represented by a second part (i.e. an out of view part) of the visualization. The summary may represent aspects of the second portion or the second part combined with aspects of the first portion or the first part, or may represent aspects of the second portion or second part and aspects of the first portion or first part.
It is to be appreciated that in an embodiment of the invention, a summary is metadata of the second portion of the dataset and/or metadata of the second part of the visualization.
By way of example only, consider map 200 shown in
Step 130 comprises visualization of the first portion of the dataset and visualization of the summary of the second portion of the dataset. The visualization for example, may be on a display device. The visualization may comprise, for example, an original visualization of the first portion of the dataset. Alternately, the visualization may comprise a new visualization or a transformation of the original visualization. Transformations may include, but are not limited to, shape changes, color changes, mathematical operations, statistical operations, magnifications and demagnifications. The visualization of the first portion of the dataset may comprise, for example, rendering a display of the visualization, for example, on the display device or within a viewing or display window which may be termed an in-view window.
The visualization of the first portion of the dataset may be, for example, comprised within a first visualization of the dataset, the first visualization comprising a first part of the first visualization and at least a second part of the first visualization. According to methods of the invention, the first part is displayed in an in-view window and the second part is an out of view part summarized is a visual summary.
In the visualization of the summary, the summary is represented by one or more spatial shapes, which may be termed summary spatial shapes. For example, in
In an embodiment of the invention, step 130 comprises presenting a second visualization of the dataset on the display device, the second visualization comprising a visual summary of the second part of the first visualization (i.e., an out of view part) in spatial coordination with the first part of the first visualization (i.e. an in-view part). The first and second visualizations are presented on a display device coupled to or associated with a processor device, for example, the processor device associated with forming the visual summary.
Because the summary spatial shapes are typically, although not necessarily, visualized in less area that an un-summarized view (e.g., an original visualization) of the second portion of the dataset or the out of view part of the original first visualization, and because the summary spatial shapes represent a summary of the second part or the out of view part, it may be typical to have summary spatial features that are simpler or less complex than more complete views (i.e., un-summarized views) of the second portion of the dataset or the out of view part of the first visualization. Thus, the visualization of the summary will be different from an un-summarized view of the second portion. That is, in the visualization of the summary, the summary is represented by one or more summary spatial shapes that are different from the second portion original spatial shape. For example, one or more spatial shapes representing the summary may have fewer spatial features that a number of spatial features in the out of view part of the first visualization or of the second portion original spatial shape. A spatial feature may be, for example, a straight line segment, a curved line segment, a bent line segment, or any geometric feature. The indicators 321, 322, 323, 331 and 332 of
The summary spatial shapes may be visualized within an image window, a visualization areas or a display window. By way of example only, consider the summary strips 320 and 330 of
The step 130 may, optionally, further comprise selection, by a user or viewer, of one of the one or more summary spatial shapes (e.g., indicators 321, 322, 323, 331 and 332 of
In an embodiment of the invention, steps 120 and 130 are performed in response to the step 110, the partitioning of the dataset. If the step 110 is repeated, as may occur during scrolling or zooming in upon data of the dataset, step 120 and 130 may be repeated, reflecting a dynamic, on-the-fly, and real-time nature of the method.
In the case where scrolled map 300 is displayed and subsequently scrolled to form scrolled map 400, the left summary strip and the right summary strip are updated to reflect changes in out of view and in-view information.
Other embodiments of the invention comprising datasets instantiated as maps may comprise, for example, information on population, terrain, weather, climate, topology or any other location dependent information that may be visualized on a map. The summary strips and indicators may provide summary information on the population, terrain, weather, climate, topology or any the other location dependent information. These embodiments, as well as the embodiments of
The process flow sheet 500 is divided into horizontal time segments (lines) 510, with time proceeding forward in going from one horizontal time segment to a next horizontal time segment below the one horizontal time segment. The flow sheet is further divided into first, second and third vertical segments 521, 522 and 523, respectively, representing address spaces in which the processes execute. The first, second, and third shaded or patterned rectangle 531, 532 and 533, respectively, represent event types of the processes. A dataset comprises the flow sheet 500 and/or the data contained within or represented by the flow sheet. The dataset can be segmented into a first portion and a second portion.
The in-view visualization 860 may be a focused upon visualization by magnifying the first portion of the dataset, and/or the in-view visualization may be focused upon by scrolling in both the X and Y directions. The summary strips preserve the context of the in-view visualization by providing reminders about (e.g., summary information of) the structure of the data in the area that is currently out of view.
In
Within the summary strip, a mouse over or tool tip may be used to reveal, to users or viewers, associated information of the second portion of the dataset. Raw, aggregate, summary, transformed or compressed information from the second portion of the dataset may be provided or shown in response to the request (e.g., mouse over) by the user or viewer. Statistics or computations (e.g., mathematical operations or transformations) of the second portion of the dataset may be provided. The provided information may be according to not only the second portion of the dataset, but additionally, according to the first portion of the data set. By way of example only, consider
The examples presented in
A feature of the invention is that information in the summary strips may be updated dynamically as the user explores the data, e.g., as the user zooms in or scrolls upon the dataset.
One or more summary strips may be drawn on any side of in-view visualization.
One or more summary strips may be drawn on the left, to the right, to the bottom and/or to the top of the in-view visualization. Other configurations are contemplated, for example, summary data may be visualized in an area placed within or in the interior of an in-view visualization. Summary data may be visualized or shown in shapes other than strips, for example, squares, rectangles, circles or other geometric shapes.
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
Aspects of the present invention are described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
Referring again to
Accordingly, techniques of the invention, for example, as depicted in
One or more embodiments can make use of software running on a general purpose computer or workstation. With reference to
A data processing system suitable for storing and/or executing program code can include at least one processor 902 coupled directly or indirectly to memory elements 904 through a system bus 910. The memory elements can include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution.
Input/output or I/O devices (including but not limited to keyboard 908, display 906, pointing device, and the like) can be coupled to the system either directly (such as via bus 910) or through intervening I/O controllers (omitted for clarity).
Network adapters such as network interface 914 may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modem and Ethernet cards are just a few of the currently available types of network adapters.
As used herein, including the claims, a “server” includes a physical data processing system (for example, system 912 as shown in
It will be appreciated and should be understood that the exemplary embodiments of the invention described above can be implemented in a number of different fashions. Given the teachings of the invention provided herein, one of ordinary skill in the related art will be able to contemplate other implementations of the invention. Indeed, although illustrative embodiments of the present invention have been described herein with reference to the accompanying drawings, it is to be understood that the invention is not limited to those precise embodiments, and that various other changes and modifications may be made by one skilled in the art without departing from the scope or spirit of the invention.