This application claims priority under 35 U.S.C. §119 from Chinese Patent Application No. 201010136979.7 filed Mar. 30, 2010, the entire contents of which are incorporated herein by reference
1. Field of the Invention
The present invention relates to a computer network technology, and more particularly the present invention is related to a network visualization processing method and apparatus.
2. Description of the Related Art
Dynamic Network Visualization is an effective method for time-spatial analysis in several scenarios such as information network, cognitive/social network and communications network. Except for showing the static relationships of the network in each specific time, the dynamic network visualization also reveals the significant time-evolving patterns of both the entity and the relationship within the network. Previous solutions for dynamic network visualization generally fall into two categories. One is to “animate” the network and show the dynamic network as a movie, with elaborate network layouts balancing the stability and aesthetics of the time-evolving network graph.
Therefore, it is currently desirable for a network visualization processing solution which is more automated and easier for a user to understand.
Accordingly, one aspect of the present invention provides a network visualization processing method. The method includes: obtaining main-information-dimension-based topological data of an analytic object within a network; performing visualization processing to the main-information-dimension-based topological data of the analytic object so as to reveal change of a relationship between an analytic node and a neighboring node in the analytic object along the main information dimension.
Another aspect of the present invention provides a network visualization processing apparatus. The apparatus can include: data obtaining module for obtaining main-information-dimension-based topological data of an analytic object within a network; and visualization processing module for performing visualization processing to the main-information-dimension-based topological data of the analytic object so as to reveal change of a relationship between an analytic node and a neighboring node in the analytic object along the main information dimension.
The network visualization processing method and apparatus according to the present invention can reveal dynamic change of a network based on main information dimension in a single view, and provide a better graphical resolution, so as to facilitate a user to perform analysis to the network and to reduce comprehension overhead of the user.
a)-7(b) illustrate diagrams of network visualization presentation according to an embodiment of the present invention.
a)-9(c) illustrate diagrams of topological data extractions according to an embodiment of the present invention.
The present invention will now be described via the preferred embodiments. However, the present invention should not be viewed as being limited thereto.
At step S301, main-information-dimension-based topological data of an analytic object within a network are obtained. The network here can be a social network or a computer/telecommunications network. The analytic object is a network whose center is an analytic node set, wherein the analytic node set includes one or more analytic nodes. The analytic node is a node on which the user attempts to perform analysis in a specific aspect. For example, the user analyzes evolution of the node in which he/she is interested in a dimension. Main information can include time and space of various operations performed by the analytic object within the network, node within an organization, role of the node, and particular content (key words, etc) and any other information in which the user can be interested. For example, communication among a plurality of email users is a social network. When performing visualization processing on emails of one or more users in the social network as analytic nodes, for example, email sending/receiving time can be taken as main information, then the above one or more users are taken as an analytic node set, and the above one or more users and those users having email communications therewith constitute a network whose center is the analytic node set, namely, an analytic object.
With an analytic node set including one user as an example, time dimension-based topological data of a network whose center is the user can be obtained based on email communication historical record of the user. Topological data can simply include an analytic node and information such as the number or time of emails sent or received thereby. Or, topological data can include nodes and edges, wherein the nodes can include the user, i.e., analytic node, and users having email communications with the user, called neighboring nodes; the edges can represent email communications between the analytic node and neighboring nodes, and times and time information of the email communications.
At step S302, visualization processing is performed on main-information-dimension-based topological data of the analytic object, so as to reveal change of the analytic object along the main information dimension. For example, the main information dimension can be presented as a main information dimension graph. It should be understood that the main information dimension can assume various forms. For example, the analytic node can be presented with a main information dimension aware glyph as the main information dimension graph.
As an embodiment of the present invention, in the vertical glyph as illustrated in
As another embodiment of the present invention, in the horizontal glyph as illustrated in
The glyphs as illustrated in
As a further embodiment of the present invention, the spiral glyph in
It should be noted that vertical, horizontal, and spiral time dimension aware glyphs are only examples. In implementation, the time dimension can be presented into any graph capable of displaying time information as required by analysis, for example, in a calendar manner.
Besides, neighboring nodes of the analytic node can be displayed as a neighboring node graph and connected to the above main information dimension graph, wherein a connection location between the neighboring node graph and the main information dimension graph denotes main information of an edge between the analytic node and its neighboring nodes.
As an example,
Edges in the topology can include a time-dependent edge and a time-independent edge. In this example, a time-dependent edge denotes an edge varying with time. For example, it is present in a static topology, while absent in another static topology. A time-independent edge is an edge not varying with time, for example present in all static topologies. As an embodiment of the present invention, a time-dependent edge connected to the analytic node is de-multiplexed based on a time value corresponding to the edge, so as to be connected to a particular part of the main information dimension graph corresponding to the time value. On the other hand, other non-analytic nodes and time-independent edges can retain their shapes and connection types as in a conventional visualization representation.
Optionally, this graph displays characteristics of edges in the network, i.e., characteristics of relationships between the neighboring nodes and the analytic nodes, through connection parts between the neighboring nodes and the main information dimension graph. For example, a narrow edge denotes a unidirectional edge, for example an edge between the analytic node and the node Li BJ Zhang in the graph, while a wide edge denotes a bidirectional edge, for example an edge between the analytic node and the node Nan CN Cao in the graph.
Embodiments of the present invention reveal a plurality of key characteristics of a dynamic ego network, including: grouping information surrounding an analytic node, which is equivalent to the community information the analytic node is participating in the entire time period in the social network scenario; a temporal connection information between the analytic node and one of its neighbors, with which information, temporal patterns within a social relationship in the social network scenario can be found; encoded time information of the analytic node, for example, sending/receiving frequency/volume. In a social network scenario, it can be a social activeness of a user represented by an analytic node, changing over time.
The major issue of the traditional visualization over the combined dynamic network lies in the deficiency of presenting the time-evolving network patterns. In previous visualization methods, time-dependent edges are drawn in parallel, and time information is only illustrated as a label, making it impossible to diagnose the sequential/causal relationships within the dynamic network. Embodiments of the present invention can present a plurality of static topologies in a single view, which can clearly represent change of the network with the main information dimension and is thus convenient for a visualization user to analyze the network state.
An overhead of applying the embodiments of the present invention, also known as additional visual complexity and computation, remains minor. Only the main information glyph presenting the selected analytic node set (normally 1˜2 nodes) will be shown as occupying more screen space, and the edge number is not increased from the combined dynamic network.
What is illustrated above is a network visualization representation method according to the present invention with reference to a simple network. Hereinafter, performing visualization processing to a complex network or improvement on the above topological representation will be illustrated with reference to
At step S802, static topological data of a plurality of static topologies are combined to obtain main-information-dimension-based topological data of an analytic object.
At step S803, visualization processing is performed on main-information-dimension-based topological data of the analytic object, so as to reveal change of the analytic object along the main information dimension. This step can be similar to step S302 of
At step S804, visualization analysis is performed to analyze and diagnose the network. For example, select/de-select instruction for a node from a user or a dimension scaling instruction from the user can be received.
Additionally, alternatively, a loop path can exist among the three steps. For example, a select/de-select instruction for a node from a user will trigger an online dynamic network data processing, and based on the select/de-select instruction, the data extracting step will determine an analytic node in an analytic object, which causes a new visualization of the network thereafter. For another example, after the dimension scaling instruction from the user is received, the combining step will determine the number of a plurality of static topologies for combining based on the dimension scaling instruction, and the visualization processing step will scale a display granularity of the main information dimension based on the dimension scaling instruction.
Hereinafter, exemplary depiction on an embodiment of the present invention will be made only with the example of time as main information. But it should be noted that the embodiment of the present invention is universal to all networks evolving along time dimension. For example, time evolution can be replaced as evolution along geographical locations or traveling path, evolution of role of a node, etc.
In this embodiment, step S801 can include analytic node-based dynamic network extraction. An analytic node is also called ego node. An analytic object is a network with an analytic object as the center, also called an ego-network.
As an embodiment of the present invention, a dynamic network is defined by an underlying graph of the network, which includes network nodes and edges connecting the nodes, while both the nodes and edges vary with time. A dynamic network D is represented by a time-evolving graph G(t), wherein t ∈ [0, T] denotes time, V(t) denotes a vertex set in the graph, while E(t) denotes an edge set in the graph.
This step is based on a user-defined analytic node set, and topological data are obtained through the network extracting step. An analytic node set is the focused node set within a network which receives the highest interest of the user. An analytic node set can include a single node or a plurality of nodes, and then the extracted network topological data are relevant to an analytic object whose center is the analytic node.
As mentioned above, step S801 is an extracting step. In this step, network extraction is performed on a static snapshot of the dynamic network at each specific time frame. Given the static network graph G(t) at time t with a vertex set V(t) and edge set E(t), an ego-network whose center is the node set Ω within N(t) is defined by an ego-graph G (Ω, t) with vertex set V(Ω, t) and edge set E(Ω, t), as illustrated in the following formula:
V(Ω, t)=Ω∪{v|v∈V(t)λ(∃u∈Ωv(v, u) ∈ E(t)λ(u, v) ∈ E(t)}
E(Ω, t)={e=(v1, v2)|e∈E(t)λv1∈V(Ω, t)λv2∈V(Ω, t)},
Namely, a node set V(Ω, t) of the ego-network is composed of nodes having edges with the analytic nodes within given time t, also known as neighboring nodes, and an edge set E(Ω, t) of the ego-network is composed of edges between analytic nodes and between the analytic nodes and neighboring nodes within given time t. The above embodiment can present neighboring nodes having a “one-hop” relationship with analytic nodes, edges between analytic nodes, edges between analytic nodes and neighboring nodes, and edges between neighboring nodes. However, it is only an example, and in specific implementation, it can include one or more of the above, or select other nodes and edges to be displayed as required by analysis.
As an embodiment of the present invention, at step S802, the dynamic ego-networks are combined based on a static ego-network at each time frame. Given a time-evolving static ego-network graph G(Ω, t), whose center is the node set Ω, wherein the node set of time t is denoted by V(Ω, t) and edge set is denoted by E(Ω, t), the combined dynamic ego-network D, represented by its underlying graph G(Ω), is computed by:
wherein,
Additionally, the edge set E(Ω) is composed of two subsets: EI(Ω), which includes time-independent edges represented by e=(v1, v2); ED(Ω), which includes a time-dependent edge presented by e=(v1, v2, t). A time-independent edge is independently determined by a source and a target node connected by the edge, and a plurality of time-dependent edges can exist simultaneously between a pair of nodes, wherein one of the time-dependent edges is for each specific time frame.
In the above embodiment, the dynamic network combination step maintains all edges incident to the node set Ω as time-dependent edges and aggregates other edges not incident to the node set Ω as time-independent edges.
An extension for dynamic network combination is to introduce time dimension combination to time-dependent edges. Given a time dimension mapping from [0, T] to {S1, S2, . . . , Sm}, wherein Si
As an embodiment of the present invention, the combination step can determine neighboring nodes having a predetermined number of edges with the analytic nodes in a plurality of static topologies, for example, to maintain only neighboring nodes having edges exceeding a predetermined number with the analytic nodes.
Step S803 can include visualization composition and presentation. This step substantially creates visualization of dynamic networks and illustrates a combined dynamic network. This step is similar to the above step S302.
Optionally, layout of a visualization graph can be optimized so as to avoid graph overlap, which can also make the visualization graph much clearer so as to provide convenience for the user to analyze the network state. In embodiments of the present invention, because only a selected analytic node is fixed in the graph layout, a layout algorithm can be provided in a sufficient space to generate a visual aesthetic graph layout.
As an embodiment of the present invention, a force-directed algorithm can be referred to lay out a visualization graph of the analytic object. Based on the force-directed algorithm, an objective of laying out a graph is to minimize the graph energy of the final layout. A significant difference of the embodiments of the present invention over a standard force-directed algorithm lies in three aspects: 1) before being inputted to the layout algorithm, each node in the analytic node set is split into a plurality of sub-nodes based on time dimension values; 2) before implementing layout, positions of split sub-nodes are fixed, and the layout algorithm only counts energy of nodes not in the analytic node set; 3) adding a customized layout adjustment stage to avoid potential overlap with nodes in the analytic node set.
As an embodiment of the present invention, the layout algorithm works in three steps: graph preparation; graph layout calculation; and graph layout adjustment.
In the graph preparation step, given a combined dynamic network graph G(Ω) whose center is analytic node set Ω, with a total node set V(Ω) and a total edge set E(Ω); the generated graph for layout is computed as LG(Ω), with a node set LV(Ω) and edge set LE(Ω), according to the following formula:
LV(Ω)=(V(Ω)−Ω)∪ΦV(Ω),
LE(Ω)=EI(Ω)∪ΦE(ED(Ω)),
wherein,
ΦV(Ω)={v(t)|v∈Ωλt∈[0, T]},
ΦE(ED(Ω))={(v1, v(t))|v∈Ωλt∈[0, T]λ(v1, v, t) ∈ ED(Ω)}∪{(v(t), v2)|v∈Ωλt∈[0, T]λ(v, v2, t) ∈ ED(Ω)}
In the above formula, v(t) denotes a split sub-node of the analytic node v in a time frame t.
In a graph layout calculation step, graph layout is calculated over the LG(Ω) through a force-directed algorithm. Generally, the force-directed algorithm works by inserting spring embedders/stress-forces between nodes or by defining energy function for the graph. The final goal of the algorithm is to adjust the node positions to achieve the global minimization of the system energy. According to an embodiment of the present invention, improvement over these kinds of algorithms is to only consider the energy related to the non-analytic nodes (not in the analytic node set) and not to move the position of nodes in the analytic node set during the layout process.
For example, in the well-known Kamada-Kawai layout method, the energy function is defined by:
Laying out the positions of the neighboring node graph includes laying out a position of the neighboring node graph according to the above formula. In this formula, the first item denotes the graph layout aesthetic energy, Xi denotes the horizontal coordinate of the node vi in the graph LG(Ω), Xj denotes the transversal coordinate of the node vj, dij denotes the optimal distance between node vi and node vj, wij denotes the modification coefficient; the second item denotes stable energy, Xi′ denotes the stable point of node vi, and α denotes a stable coefficient for balancing the first item and the second item.
As an embodiment of the present invention, the energy function can be set more accurately through the following formula:
Where wij denotes a modification coefficient, dij denotes the optimal distance between node vi and node vj, and Ω denotes an analytic node set.
It should be noted that the above formula and selection of the modification coefficient are empirical values, which can be adaptively adjusted in practical implementation.
In the above embodiment, the energy introduced by the mutual interaction of nodes in the analytic node set is not considered in the system energy minimization.
In the step of graph layout adjustment, node positions have been adjusted to avoid overlapping. Substantially, the force-directed layout algorithm already resolves the node overlapping issue by imposing the optimal distance and/or spring forces between nodes. However, this is the case for graphs with regular-shape nodes, while in the graph layout according to embodiments of the present invention, the analytic nodes in the analytic node set are displayed by glyphs that occupy irregular screen space. In order to solve this problem, a post-layout adjustment is introduced into an embodiment of the present invention.
With a graph with a vertical glyph as an example, the x-axis coordinate of each non-analytical node is adjusted. Let vi denote one of the non-central node, with position (xi, yi) after the layout. Assume vi is placed between two vertical glyphs with x-axis coordinates of xs and xt, maximal width ws and wt. If there is no glyph in the left of vi, then xs is set to the x-coordinate of left margin of the screen and ws set to 0. Similarly in the case where there is no glyph in the right of vi, xs is set to the x-coordinate of right margin of the screen and wt set to 0. Then the x-axis coordinate of vi is adjusted to:
Through the above embodiment, the graph layout with horizontal glyphs can be adjusted accordingly. For other form of glyph, position adjustment can be performed with reference to the above method.
As an embodiment of the present invention, apart from the general-purpose interactions of network visualization analysis (such as dragging, highlighting and zooming), step S804 further includes several types of customized interactions for dynamic ego network visualization, for example analytic node select/de-select, main information dimension expand/collapse, and dimension scaling of main information dimension, etc. If the analytic task is entity-centric rather than topology-centric, then the visualization analysis step according to embodiments of the present invention will become more useful. Exemplary tasks include role analysis and spammer detection/validation.
During the interaction between analytic node select/de-select, expansion of the circle in space and topology is implemented by selecting the nodes not in the analytic node set.
During the interaction of main information dimension expand/collapse, for example, time dimension node expand/collapse, the expansion operation is to expand the graph at the time dimension. When the additional node is selected for expansion, it will be illustrated by glyph according to the graph type, in replacement of a regular-shape node.
With the increase of the range of the main information dimension such as the increase of time range, the dynamic network visualization processing method will suffer from visual clutter caused by a great number of edges. In order to solve this problem, a dimension scaling interaction, for example time dimension scaling, is introduced in an embodiment of the present invention, which is also called time dimension edge grouping. The user can select to group the time dimension edges by different scales, such as year/month/week/day/hour. For example, when the edges are grouped by year, all the time-dependent edges connecting the same pair of nodes and happening in the same year will run as one single edge after the operation, which allows the user to diagnose the temporal relationships in multiple levels of granularities.
In the network visualization processing apparatus of
As an embodiment of the present invention, the extracting module 1811 is further for extracting at least one piece of the following information: neighboring nodes of an analytic node in the analytic object, wherein the neighboring nodes include nodes having edges with the analytic node in the static topology; and edges between the analytic node and its neighboring nodes.
As an embodiment of the present invention, the combining module 1812 is further for determining a neighboring node having a specific number of edges with the analytic node in a plurality of static topologies.
In the network visualization processing apparatus of
The network visualization processing apparatus of
The data obtaining module 181 is further for determining a length of the main information dimension based on a dimension scaling instruction from a user. The visualization processing module 182 is further for scaling the display granularities of the main information dimension based on the dimension scaling instruction.
As an embodiment of the present invention, main information can include time, space, node within an organization, role of the node, and any other information in which a user can be interested.
The structural block diagram in
Further, the embodiments of the present invention can be implemented in software, hardware or the combination thereof. The hardware part can be implemented by a special logic; the software part can be stored in a memory and executed by a proper instruction execution system such as a microprocessor or a dedicated designed hardware. Those normally skilled in the art can understand that the above method and system can be implemented with a computer-executable instruction and/or in a processor control code, for example, such code is provided on a bearer medium such as a magnetic disk, CD, or DVD-ROM, or a programmable memory such as a read-only memory (firmware) or a data bearer such as an optical or electronic signal bearer. The system and its components in the present embodiment can be implemented by hardware circuitry of a programmable hardware device such as a very large scale integrated circuit or gate array, a semiconductor such as logical chip or transistor, or a field-programmable gate array, or a programmable logical device, or implemented by software executed by various kinds of processors, or implemented by combination of the above hardware circuitry and software.
The network visualization processing method and apparatus according to embodiments of the present invention facilitates temporal, spatial, and social compression to thereby reduce the network complexity. A new visual metaphor is introduced to represent time dimension information within a single network view.
Advantageous effects of the network visualization processing method and apparatus according to the embodiments of the present invention include: compared with the video approach splitting the network in time-dimension and presenting different networks in continuous time, the method and apparatus according to embodiments of the present invention aggregate the network scenario of the entire time period into a single view, so that the user needs not to navigate in the timeline to analyze the dynamic network; compared with the small multiple display dividing the view space spatially to show network of different time simultaneously, the method and apparatus according to embodiments of the present invention perform much better with the entire screen showing the single aggregated network, providing tens of times better resolution than the previous approach.
The network visualization processing method and apparatus according to the embodiments of the present invention actually show a subset of the dynamic network. However, this is compensated by the advanced user interaction, through which the user can navigate the entire network. Furthermore, the user can select to expand/aggregate the specific node/edge along the main information dimension to view more/less main information.
Through the above description of specific embodiments, it can be understood by those skilled in the art that the system, means and method as described above can be implemented by using computer executable instructions and/or being contained in processor control codes, for example, such codes are provided on an carrier medium such as a disk, CD or DVD-ROM, an programmable memory such as a read-only memory (firmware), or a data carrier such as a optical or electronic signal carrier. The means, server and its unit can be implemented by a hardware circuit of a super large-scale integrated circuit or gate array, a semiconductor such as a logic chip, transistor and so on, or a programmable hardware device such as a field programmable gate array, programmable logic device and so on, or by software executed by a variety of processors, or by a combination of above hardware circuit and software.
Although detailed description of the system and method for processing network visualization according to the invention has been made in combination with the above specific embodiments, the present invention is not limited thereto. Various variations, alternations and modifications to the invention can be made by those skilled in the art under teachings of the specification without departing from the scope and principle of the invention. It should be understood that all such variations, alternations and modifications still fall into the protection scope of the invention which is defined by the attached claims.
Number | Date | Country | Kind |
---|---|---|---|
201010136979.7 | Mar 2010 | CN | national |