Characterizing application performance within a network

Description

A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights. The following notice applies to the software and data as described below and in the drawings hereto: Copyright© 2001, Compuware, All Rights Reserved.

FIELD OF THE INVENTION

The present invention relates to the field of network computing, and more particularly, to a method and system for monitoring network, client, and server performance.

BACKGROUND OF THE INVENTION

One method of monitoring network performance includes measuring the processing time on a first node, such as a client, and the processing time on a second node, such as a server. Conventionally, this method was applied where the client and server were on the same local area network (LAN), so that factors such as network delay external to the LAN did not need to be considered. Also, a given node may perform multiple processes concurrently. Conventional methods may determine the time for each such process, and then sum the individual times to yield a total node processing time. The calculated total may thus exceed the actual time required by the node to perform the concurrent processes. Such a method may therefore result in confusion or misinterpretation of results.

Realistic networks generally include multiple LANs and interconnecting equipment and/or communications links. Furthermore, such implementations may involve concurrent processing as discussed. Thus, there is a need for an improved system and method of monitoring network performance whereby network delay is considered, and whereby times allocated to concurrent processes sum to actual node processing time.

SUMMARY OF THE INVENTION

In one embodiment of the present invention, a method for monitoring network performance while executing an application is disclosed. The method monitors a flow having one or more frames within a thread by calculating a node's active time. This includes the amount of time each frame is processed on a sending node in a network, the amount of time each frame is processed on a receiving node in the network, and the amount of time each frame is in transit across the network.

A second embodiment of the invention provides a method of analyzing concurrent processing within nodes. This embodiment builds a resource table that includes a number of resource sets describing initiation and termination times for the various network nodes. A resource timeline that sequentially describes initiations and terminations is then generated from the resource table. A resource matrix is then derived that includes a sequence of processing sets organized according to time intervals. Each processing set includes a unique combination of active resources, or a generic network resource where no physical resource is active. The processing sets represent and allocate time to multiple concurrent processes, so that the allocated times sum to the actual overall task time.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings and in which like reference numerals refer to similar elements and in which:

FIG. 1 illustrates an Application Performance Report in a Chart format.

FIG. 2 illustrates an Application Performance Report in a Details format.

FIG. 3 illustrates an example of Request Preparation and Reply Preparation processing types.

FIG. 4 illustrates one exemplary embodiment.

FIG. 5 illustrates an example of a flow.

FIG. 6 illustrates an example of a multi-tier algorithm.

FIG. 7 illustrates a method of analyzing concurrent processing within nodes.

FIG. 8 illustrates an example of a resource table.

FIG. 9 illustrates an example of a resource chart.

FIG. 10 illustrates an example of a resource timeline.

FIG. 11 illustrates an example of a resource matrix.

DETAILED DESCRIPTION OF THE INVENTION

The present invention includes various operations, which will be described below. These operations may be performed by hardware components or may be embodied in machine-executable instructions, which in turn may be used to cause a general-purpose or special-purpose processor or logic circuits programmed with the instructions to perform the operations. Alternatively, the operations may be performed by a combination of hardware and software.

The present invention may be provided as a computer program product that may include a machine-readable medium having stored thereon instructions which may be used to program a computer (or other electronic devices) to perform a process according to the present invention. The machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs (Compact Disc-Read Only Memories), magneto-optical disks, ROMs (Read Only Memories), RAMs (Random Access Memories), EPROMs (Erasable Programmable Read Only Memories), EEPROMs (Electrically Erasable Programmable Read Only Memories), magnetic or optical cards, flash memory, or other types of media/machine-readable media suitable for storing electronic instructions. Moreover, the present invention may also be downloaded as a computer program product, wherein the program may be transferred from a remote computer (e.g., a server) to a requesting computer (e.g., a client) by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection). Accordingly, herein, a carrier wave shall be regarded as comprising a machine-readable medium.

Introduction

Response Time Analysis (hereinafter RTA) first produces underlying data. This data is then formatted and presented to a user in several reports, including the Application Performance Report (Chart and Details), Node Processing Time Report, and Flows Report. The reports are presented in the form of a Graphical User Interface (GUI).

The user's first exposure to the RTA is by way of the high-level summaries presented in the health report. The summaries enable the user to quickly obtain critical information without needing to process details. Details are made available in the Node Processing Time report and Flows Report.

The following sections give a thorough description of the algorithms and techniques used in the underlying RTA, as well as examples of the GUI.

Application Performance Report

FIG. 1 illustrates an Application Performance Report Chart. The four summary results shown in the first panel of the application performance report are described in the following paragraphs.

Network Busy Time

The Network Busy Time 100 is the total time that one or more meaningful frames are in transit across the network. Network Busy Time is computed and reported for both network directions (from primary to secondary and vice-versa). After the Flow concept has been introduced, the Network Busy Time will be revisited to describe which frames are meaningful. Because only a subset of all frames generated traverse the network, only frames that are exchanged between the two capture points are included in the Network Busy Time. Consequently, Network Busy Time is applicable only with a multi-segment merged or adjusted trace. A trace is a sequence of frames that have flowed between two or more nodes over the capture period (to be discussed below). Traces can be merged and adjusted. The Network Busy Time can be broken down into insertion time and queuing, propagation and processing (QPP) time.
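By way of illustration only, the Network Busy Time for one network direction may be viewed as the length of the union of the in-transit intervals of the meaningful frames. The following Python sketch assumes that each frame is represented by a hypothetical (time_sent, time_received) pair in seconds; the function name and the sample values are assumptions made for this example and are not taken from the disclosure.

```python
def network_busy_time(transits):
    """Total time during which at least one frame is in transit.

    `transits` is a list of (time_sent, time_received) pairs in seconds
    for the meaningful frames of one network direction (assumed layout).
    """
    busy = 0.0
    current_start = current_end = None
    for sent, received in sorted(transits):
        if current_end is None or sent > current_end:
            # Gap: close the previous busy period and open a new one.
            if current_end is not None:
                busy += current_end - current_start
            current_start, current_end = sent, received
        else:
            # Overlap: extend the current busy period.
            current_end = max(current_end, received)
    if current_end is not None:
        busy += current_end - current_start
    return busy


# Hypothetical frames: the first two overlap, the third is separate.
frames = [(0.0, 0.5), (0.25, 0.75), (2.0, 2.5)]
print(network_busy_time(frames))
```

Overlapping frames are counted once, so the result cannot exceed the span of the trace.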

Insertion Time (sec): This is the cumulative time for the frames to be inserted into the network. In a merged or adjusted trace, the insertion time for each frame is computed as AdjustedBytes*8/Bandwidth. The network bandwidth utilization is computed for each direction of the network. Only frames that are known to have traversed the network are included. The term AdjustedBytes is used in the above expression to indicate the bytes that would have traversed the WAN link. The capture environment allows the user to specify whether the frame size should be adjusted to compensate for the different WAN headers. If the user chooses to do so, then AdjustedBytes=Bytes(as captured)−DLCHeader(as captured)+DLCHeader (specified in capture environment).
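The two expressions above may be sketched in Python as follows. This is a minimal illustration, assuming that Bandwidth is expressed in bits per second and that an unspecified WAN header size means no adjustment; the function and parameter names, the 6-byte WAN header, and the 64,000 bit/s link are hypothetical values chosen for the example.

```python
def adjusted_bytes(captured_bytes, captured_dlc_header, wan_dlc_header=None):
    """Bytes that would have traversed the WAN link.

    If no WAN DLC header size is given, the frame size is not adjusted.
    """
    if wan_dlc_header is None:
        return captured_bytes
    return captured_bytes - captured_dlc_header + wan_dlc_header


def insertion_time(captured_bytes, captured_dlc_header, bandwidth_bps,
                   wan_dlc_header=None):
    """Insertion time in seconds: AdjustedBytes * 8 / Bandwidth."""
    size = adjusted_bytes(captured_bytes, captured_dlc_header, wan_dlc_header)
    return size * 8 / bandwidth_bps


# Hypothetical frame: 1514 bytes captured with a 14-byte DLC header,
# adjusted to an assumed 6-byte WAN header on a 64,000 bit/s link.
print(adjusted_bytes(1514, 14, 6))
print(insertion_time(1514, 14, 64_000, wan_dlc_header=6))
```

The cumulative insertion time for a direction would then be the sum of the per-frame values for the frames known to have traversed the network.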

Network Queuing, Propagation & Processing (QPP) Time (sec): This is the time that frames were present in the network due to queuing (in routers), processing and propagation. This represents the portion of the total transit time that is not counted as insertion time. Throughout the description, this term is referred to as QPP time.

Node Active Time

Referring again to FIG. 1, in the Node Active Time area 102, there is one bar for each node that shows the active (processing or sending) duration. Each bar is broken down into two components: (1) node processing time and (2) node sending time. Each is further described below.

Node Processing Time: For each node, the overall task processing time is shown. A task is a user-invoked operation that creates network traffic during execution and causes a screen update or other acknowledgement on the user's machine. Once invoked, a task executes to completion without further user intervention.

For single-thread applications, this is simply the sum of the individual node processing times (described below). If an application can have multiple concurrent server requests outstanding (such as the typical Web browser), then the node is considered processing if it is handling one or more requests. The actual Node Processing Time for any node cannot exceed the overall task duration. However, if not corrected, the sum of the Processing Times for all nodes as calculated can exceed the actual task duration if there are parallel (overlapping) threads. This can lead to confusion or misinterpretation of results.

To eliminate this shortcoming, a method that corrects for overlapping threads is provided according to one embodiment of the invention. The overall correction method is shown in FIG. 7. The method starts 704 and then builds 708 a resource table. FIG. 8 illustrates an exemplary resource table 800 corresponding to a client and two servers. The rows, or resource sets, of the exemplary resource table 800 each include one resource. The resource sets of the exemplary resource table 800 also include times of initiation and termination of the activity of the respective resource in seconds. The resource sets are arranged sequentially according to initiation time. According to this example, the overall task duration is 4.5 seconds.

The exemplary resource table 800 is represented graphically in FIG. 9. In this example, two instances of a Client are active concurrently during the period from 1.7 to 1.9 seconds, according to the corresponding parallel threads. In addition, there are periods of inactivity, e.g., between 2.5 and 2.7 seconds. These characteristics will be taken into consideration in subsequent steps.

Referring back to FIG. 7, a resource timeline is next derived 712 from the resource table 708. FIG. 10 illustrates an exemplary resource timeline 1000 according to the present example. Each row, or event, corresponds to an initiation or termination of the activity of a resource, as indicated in the exemplary resource table 800. Thus, each event of the resource timeline includes one resource, an event time, and a sense. The event times are sequential, and positive and negative senses reflect initiation and termination of activity of the resource, respectively. Thus, the exemplary resource timeline 1000 represents each initiation and termination of activity of each resource during the overall task.
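By way of a non-limiting sketch, the derivation of a resource timeline from a resource table may be expressed as follows in Python. The row layout (resource, initiation time, termination time), the function name build_timeline, the rule that terminations sort before initiations at equal times, and the sample values are assumptions made for illustration; the sample table is hypothetical and does not reproduce FIG. 8.

```python
def build_timeline(resource_table):
    """Turn (resource, start, end) rows into time-ordered events.

    Each event is (time, sense, resource), where sense +1 marks an
    initiation and sense -1 marks a termination of activity.
    """
    events = []
    for resource, start, end in resource_table:
        events.append((start, +1, resource))
        events.append((end, -1, resource))
    # Order by event time; at equal times, terminations (-1) come first
    # (an assumed tie-breaking rule).
    return sorted(events, key=lambda event: (event[0], event[1]))


# Hypothetical resource table (not the table of FIG. 8).
table = [("Client", 0.0, 1.0), ("Server 1", 1.0, 2.0), ("Client", 1.5, 2.5)]
for event in build_timeline(table):
    print(event)
```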

Referring again to FIG. 7, a resource matrix 716 is next derived from the resource timeline. An exemplary resource matrix 1100 according to the present example is shown in FIG. 11. Each row, or processing set, of the exemplary resource matrix 1100 includes a time interval, a set of instances of resources and a set of allocated times. The time interval of each processing set is derived as the time interval spanned by successive events of the exemplary resource timeline 1000. Within each time interval, the number of instances of each resource that is active is indicated. For example, during the interval from 1.6 to 1.7 seconds, two instances of the Client are active. From one processing set to the next, an instance of a particular resource is added if that resource was associated with a positive sense at the beginning of the respective time interval in the exemplary resource timeline 1000. Similarly, a particular resource is deleted if that resource was associated with a negative sense at the beginning of the respective time interval in the resource timeline. For time intervals during which no resource is active, the entry corresponding to Instances of Resources is 1 Network. In other words, such an interval is allocated to a generic network resource.

Next, a set of allocated times is calculated for each processing set. Specifically, the respective time interval is divided by the total number of instances of all resources in the processing set. The result is then multiplied by the number of instances of each resource to give a set of resource allocations; that is, the time interval is allocated on a per-instance basis. For example, within time interval 1110a (corresponding to the interval from 1.7-1.8 seconds), there are four instances active in total. Dividing the interval 0.1 seconds by four gives 0.025. Multiplying this by the number of instances of each resource of the processing set gives 0.05 Client, 0.025 Server 1 and 0.025 Server 2. During time interval 1110b (2.5-2.7 seconds), no physical resources are active, and thus 0.2 Network is allocated. According to step 716, and as shown in exemplary resource matrix 1100, the total of the allocated times equals the duration of the time interval for each processing set.

Summing the allocated times of the exemplary resource matrix 1100 for each resource gives subtotals (in seconds) of 2.1 Client, 0.94167 Server 1, 0.55833 Server 2 and 0.9 Network. Summing these subtotals in turn gives the overall task duration of 4.5 seconds. An analogous process may be applied to other scenarios. Thus, the method of FIG. 7 provides a process for allocating processing times to network resources such that the allocated times sum to 100 percent of the overall task duration.
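The per-instance allocation described above may be sketched in Python as follows. This is an illustrative sketch only, assuming that the resource table is a list of (resource, initiation time, termination time) rows lying within the task; the function name allocate_times and its parameters are hypothetical. The sample call mirrors only the single time interval 1110a described above (two Client instances, one Server 1 and one Server 2 active for 0.1 seconds) and does not reproduce the full table of FIG. 8.

```python
from collections import Counter, defaultdict


def allocate_times(resource_table, task_start, task_end):
    """Allocate every interval of the task to resources, per instance.

    Intervals in which no resource is active are allocated to a generic
    "Network" resource, so the allocations sum to the task duration.
    """
    events = []
    for resource, start, end in resource_table:
        events.append((start, +1, resource))
        events.append((end, -1, resource))
    events.sort(key=lambda event: (event[0], event[1]))

    totals = defaultdict(float)
    active = Counter()  # number of active instances of each resource
    previous = task_start
    # A final sentinel event closes the last interval at the task end.
    for time, sense, resource in events + [(task_end, 0, None)]:
        interval = time - previous
        if interval > 0:
            instances = sum(active.values())
            if instances == 0:
                totals["Network"] += interval
            else:
                share = interval / instances
                for name, count in active.items():
                    totals[name] += share * count
        if resource is not None:
            active[resource] += sense
            if active[resource] == 0:
                del active[resource]
        previous = time
    return dict(totals)


# One 0.1-second interval with four active instances, as in interval 1110a.
rows = [("Client", 1.7, 1.8), ("Client", 1.7, 1.8),
        ("Server 1", 1.7, 1.8), ("Server 2", 1.7, 1.8)]
print(allocate_times(rows, 1.7, 1.8))
```

Because each interval is divided by the total number of active instances, the allocations for any interval add up to the length of that interval, which is the property relied upon above.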

Node Sending Time: For each node, the overall node sending time represents the period during which the node is sending data but not otherwise processing. If a node is in the process of sending a set of frames, then the node is considered to be in the sending state, but only if it is not processing another request at the same time. Sending time is important because it may indicate that the node is still processing in order to prepare the remaining frames, or that some factor other than processing is delaying transmission. The most likely such factor is that the network is heavily utilized and the node cannot send all data in one transaction. Other potential scenarios include insufficient TCP window size, the normal slow-start nature of TCP, inability of the receiving node to remove the data from the TCP buffer quickly enough, or an inefficient TCP implementation at either the sending or receiving node.

Referring again to FIG. 1, the following detailed reports are available by selecting, e.g., by double-clicking, certain areas as follows:

On a Node Processing Time portion of a node bar: Brings up the Node Processing Detail report filtered/highlighted on the specified node.

On a Node Sending Time portion of a node bar: Brings up the Node Sending Detail report filtered/highlighted on messages sent by the specified node.

On either network bar 100: Brings up the Network Utilization and Latency graph.

Details

An example of an Application Performance Report—Details is shown in FIG. 2. The major sections of the detailed report are described in the following sections.

Overall Summary

The overall summary 200 provides information on the task duration, any errors that were detected, the capture environment, and a summary of the conversations, threads and turns. The information in the overall summary can alert the user to errors, or to the fact that other than the desired information was captured. The parameters displayed in this section are as follows.

Parameter/Units: Description

Task Time (seconds): Duration of the task, which should be equivalent to the task “stopwatch time.” If this is not the case, then portions of the time may be missing or may need to be deleted.

Traffic Duration (seconds): Duration within which frames exist. The user can click on the label to open the bounce diagram, which will show all constituent frames and their durations.

Errors: A click on the Errors label opens the Error Report, which is a graphical summary of the number and types of errors and warnings detected in the trace.

Capture Environment: This legend indicates the meaning of the arrows shown in many other places within the report. A capture is a process whereby a traffic collector or agent collects network frames exchanged between nodes in a distributed application and stores the data for off-line analysis. The legend keys are taken from the Capture Environment as specified by the user. The primary capture location is indicated on the left and the secondary location is on the right.

Conv: Indicates Conversations, both for the task and the total, and those that occur between a node on the primary location and another node in the secondary location, as indicated by the “<-->” label. Hereafter, the phrase “conversations that traverse the network” means conversations for which one node is in the primary location and the other is in the secondary location.

Threads: Shows both the total number of threads in the task as well as in the conversations that traverse the network between the primary and secondary capture points. A thread is a sequence of frames exchanged between two nodes that constitutes a single application or protocol action. For example, the retrieval of a graphic from a WWW (World Wide Web) server is a thread.

Turns: The number of turns in the task, and in the conversations that traverse the network between the primary and secondary capture points. <--> Turns specifies the sum of the turns for threads corresponding to one node in the primary location and another node in the secondary location.

Bytes/Turn: The average number of bytes per turn, both as a total for the task and for the conversations that traverse the network.

Frames/Turn (not shown): The average number of frames in each turn, both as a total for the task and for the conversations that traverse the network.

Traffic

Traffic section 202 provides the user with a summary of several traffic measures, for the entire task (“Total” row), over the network (“<-->” row, i.e., in both directions between the primary and secondary capture points), and in each direction across the network, i.e., primary to secondary location (“→”), and secondary to primary location (“←”). The columns in this section are as follows.

Column: Description

Bytes: The sum of the bytes for all frames in each classification. For the network classifications, the byte counts for each frame will be adjusted for the DLC header size if the user has chosen to do so in the capture environment. For <-->, → and ← Bytes, the value is the number of bytes that crossed the point of contention in the network as specified for the task in the Capture Environment. If the user did not choose to adjust frames for DLC header in the capture environment, then Network Bytes = Bytes for each frame. If the user did so choose, then Network Bytes = Bytes − DLCHeader (as captured) + DLCHeader (specified in the capture environment).

% of <--> Bytes: The percentage of <--> bytes that traversed the network in each direction. The two values will add to 100% (subject to rounding).

Frames: This is the total number of frames in the task (top row), and the number of frames that should have crossed the network, whether they were contained in both captures or not. If they were not, this is reported in the two Frames Missing entries to the right of this section. A frame is a collection of bits comprising data and control information (overhead) that is transmitted between nodes in a network.

Avg Frame: The average frame size, in bytes, i.e., Bytes/Frames.

Captured Load (kbps and %): This is the average rate, in kbps and as a percentage of the total bandwidth, of the frames that traversed the network in each direction. The adjusted bytes, as discussed above, are used.

Frames Missing at Source: The number of additional frames from the sending (source) node that should have been in the trace. This can be caused by the capture beginning too late or ending too early, or by the inability of the capture device to capture all of the frames.

Frames Missing at Destination (not shown): The number of additional frames that should have been in the trace from the receiving (destination) node. Such frames could have been lost due to network congestion or failure of a network component, or for the reasons discussed in the preceding entry.
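As an illustration of how the Bytes, % of <--> Bytes, Frames and Avg Frame columns relate to one another, the following Python sketch summarizes a list of frames per network direction. The (direction, network_bytes) frame layout, the direction labels "->" and "<-", the function name, and the sample values are assumptions made for this example.

```python
def traffic_summary(frames):
    """Per-direction byte and frame summary for frames that crossed the network.

    `frames` is a list of (direction, network_bytes) pairs, where direction
    is "->" (primary to secondary) or "<-" (secondary to primary).
    """
    total_bytes = sum(size for _, size in frames)
    summary = {}
    for direction in ("->", "<-"):
        sizes = [size for d, size in frames if d == direction]
        byte_count = sum(sizes)
        summary[direction] = {
            "bytes": byte_count,
            # Share of all bytes that crossed the network, in percent.
            "pct_of_network_bytes": 100 * byte_count / total_bytes if total_bytes else 0.0,
            "frames": len(sizes),
            # Avg Frame = Bytes / Frames.
            "avg_frame": byte_count / len(sizes) if sizes else 0.0,
        }
    return summary


# Hypothetical frames: two sent primary to secondary, one in return.
sample = [("->", 1000), ("->", 500), ("<-", 500)]
for direction, row in traffic_summary(sample).items():
    print(direction, row)
```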

There can be other situations wherein not all of the frames that should have traversed the network were actually captured. For example, one of the captures may have started before or ended after the other and may contain frames that traversed the network. However, since those frames are not represented in the other capture, it is not known if they actually traversed the network. Such frames will be flagged Lost Frame or Dropped Frame, depending on whether they were captured only at the source or destination segment, respectively.

If it is assumed that missing frames really did traverse the network, then their bytes/frames/threads/conversations are included in the <--> metrics. There is no way to know whether a frame that is missing at the destination actually consumed bandwidth at the network contention point before being dropped. Thus, the worst case is assumed, i.e., that network bandwidth was consumed, and the missing frames are included in the <--> metrics.

Network Busy Time

The Network Busy Time section 204 helps the user determine how active the network was during the task, and how the busy time breaks down into insertion time and QPP time. This section is presented for merged tasks and single-trace adjusted tasks. Once again, there are two rows for the network metrics, one for the primary to the secondary location (“→”), and one from the secondary to the primary location (“←”). The columns in this section correspond exactly to the portions of the network bars 100 in the Application Performance Report chart of FIG. 1, and provide the following information.

Column: Description

Insertion Time (sec and % of Task Time): Represents the cumulative time that it took to insert the captured frames into the network at the point of lowest bandwidth specified by the user. If the user chooses to adjust the captured bytes for the DLC headers of the network, the insertion time is based on the network bytes of each frame. The bandwidth utilization in kbps is determined as (Bytes that traversed the network in the specified direction * 8)/(duration of the task in seconds * 1000). The bandwidth utilization in % is (the bandwidth utilization in kbps/capacity of the link in kbps), expressed as a percentage. The capacity of the link is specified by the user in the capture environment.

QPP Time (sec and % of Task Time): The Queuing, Processing and Propagation time.

Total (sec and % of Task Time): The total time that one or more meaningful frames were traversing the network. Meaningful frames include all data frames and TCP acknowledgements that are within the data portion of a message. Meaningful frames do not include TCP acknowledgements that are sent after the last data frame in a message is sent.
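The bandwidth utilization expressions given for the Insertion Time column may be sketched as follows in Python. The function and parameter names and the sample figures are hypothetical and are used only to make the arithmetic concrete.

```python
def bandwidth_utilization(network_bytes, task_seconds, link_kbps):
    """Return (utilization in kbps, utilization in percent) for one direction."""
    kbps = network_bytes * 8 / (task_seconds * 1000)
    percent = 100 * kbps / link_kbps
    return kbps, percent


# Hypothetical: 160,000 bytes crossed the network in one direction during
# a 10-second task over a link with a capacity of 256 kbps.
kbps, percent = bandwidth_utilization(160_000, 10, 256)
print(kbps, percent)
```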

Network Frame Transit Statistics

The network frame transit statistics section 206 comprises two rows, one for each network direction as described above. This section includes only meaningful frames. As such, the statistics do not include TCP acknowledgements that are sent after the last data frame in a message is sent. The columns of this section are as follows.

Column: Description

Transit Time (seconds): A graphical depiction of the minimum, average and maximum frame transit time. The transit time of each frame is the difference between its Time Sent and Time Received.

Transit Time Min (seconds): The transit time of the frame that has the lowest transit time.

Transit Time Avg (seconds): The average (mean) transit time for the meaningful frames.

Transit Time Max (seconds): The transit time of the frame that has the largest transit time.

Transit Time Frame Count: The number of frames included in the transit time statistics, which is equal to the number of meaningful frames that were captured at both sides (or adjusted). Note that this number is equal to or less than the number of frames that traversed the network in the specified direction. It does not include TCP acknowledgements after a message or frames that are missing at the source or destination.

Latency Statistics (min, avg, max, not shown): Statistics on the latency of meaningful frames that traverse the network. As described above, meaningful frames are data frames and the TCP acknowledgements that occur during the data transfer portion of a flow.

Overlap Avg (Frames): The average number of frames that are in transit when there is at least one frame in transit. It is a measure of the application's ability to send more than one frame at a time, and the network conditions requiring the application to do so. In other words, it is the average number of frames sent by the application when the application has sent frames. Higher values mean the application is less susceptible to network latency and bandwidth.

Overlap Max (Frames): The largest number of frames that are in transit in the network at any given time.
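One possible way to compute the transit time and overlap statistics is sketched below in Python. The sketch assumes that each meaningful frame captured at both sides is given as a hypothetical (time_sent, time_received) pair, and it interprets Overlap Avg as a time-weighted average over the periods in which at least one frame is in transit; that interpretation, the function name, and the sample values are assumptions rather than a statement of the disclosed computation.

```python
def transit_statistics(frames):
    """Min/avg/max transit time plus average and maximum overlap.

    `frames` is a non-empty list of (time_sent, time_received) pairs.
    """
    transits = [received - sent for sent, received in frames]

    # Sweep over send (+1) and receive (-1) events; at equal times the
    # receive sorts first, so back-to-back frames do not count as overlap.
    events = sorted([(sent, +1) for sent, _ in frames] +
                    [(received, -1) for _, received in frames])
    in_transit = max_overlap = 0
    busy_time = frame_time = 0.0
    previous = events[0][0]
    for time, delta in events:
        if in_transit > 0:
            busy_time += time - previous
            frame_time += (time - previous) * in_transit
        in_transit += delta
        max_overlap = max(max_overlap, in_transit)
        previous = time
    return {
        "min": min(transits),
        "avg": sum(transits) / len(transits),
        "max": max(transits),
        "count": len(transits),
        "overlap_avg": frame_time / busy_time if busy_time else 0.0,
        "overlap_max": max_overlap,
    }


# Hypothetical frames: the first two are in transit at the same time.
print(transit_statistics([(0.0, 0.5), (0.25, 0.75), (2.0, 2.5)]))
```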

Node Active Time

This is a tabulation 208 of the node bars 102 (Node Processing And Sending times) that were described earlier in the Application Performance Report Chart of FIG. 1.

Node Processing Statistics

Statistics 210 describe the node processing periods, as follows.

Column: Description

Processing Time (seconds): A graphical depiction of the minimum, average and maximum node processing time period.

Processing Min (seconds): The shortest node processing period.

Processing Avg (seconds): The average node processing period.

Processing Max (seconds): The longest node processing period.

Processing Periods: The number of processing periods.

Overlap Avg: The average number of processing periods that occur simultaneously at the node during the times that the node is in at least one processing period. A value of 1.0 means that the node never processed more than one request at a time. This value cannot be smaller than 1.0.

Overlap Max: The largest number of processing periods occurring at any given instant.

Node Processing Detail Report

In addition to the overall summary results presented in the Application Performance Report, the RTA also identifies and reports several sets of details. One of these is the node processing detail (not shown).

Each node processing time is one component in the Overall Node Processing Time. The GUI allows the operator to see the Individual Node Processing Times for all nodes or for one node at a time. Since individual node processing times can overlap, the method 700 described above corrects the individual node processing times so that they sum to the Overall Node Processing Time for a given node. The attributes of each node processing time, as given in the columns in the Node Processing Detail Report are listed or described as follows.

Column: Description

Node Name: The DNS or WINS node name, or a user-provided description of the node.

Node Address: A machine-readable address such as an IP or MAC address.

Errors: The number of errors associated with either the start or the end frame. The user can click on the error report to see the individual errors.

Duration: The time span of the processing time, in seconds.

Start Time: The time at which the node began processing.

Start Frame: The number of the frame captured at the beginning of the processing time. The user can click on this parameter to view the bounce or packet trace as of the start frame.

End Time: The time at which the node stopped processing.

End Frame: The number of the frame captured at the end of the processing time. The user can click on this parameter to view the bounce or packet trace as of the end frame.

Start Frame Description: The description (decoded) of the start frame.

End Frame Description: The description (decoded) of the end frame.

Node Processing Type: One of the types specified in the entries below. For most node processing types, there are corresponding types for client and server. “Client” is assigned when the node is the client in a thread, and vice versa. Each node processing type has an internal code that does not appear in the GUI. The node processing types and their meanings are given in the following entries.

Client Before Thread: This is shown in FIG. 3 at 300, and represents the time period prior to sending of the first data frame by the client. The period extends back to the previous data frame that was received by the node, or to the beginning of the task if there is no such frame.

Client Processing: The period within a thread from receipt of a data frame by the client node to the time that node sends a subsequent request. The data frame can be on the same thread or another thread.

After last frame 308: The time period from the last data frame to the end of the task, which is always assigned to the client node for that task. The client node can be reassigned in the conversation map.

Server Before Thread: This is similar to Client Before Thread, but arises when the first data frame in the thread is sent by the server of the thread. Normally the server does not send the first data frame, since the client normally initiates activity by sending a request. However, if the capture starts in the middle of a thread, or if the client and server are assigned incorrectly, then this processing type results. Note that the Thread Analysis window includes a command to swap the client and server of a thread if they are identified incorrectly.

Server Processing 304: The period within a thread from receipt of the last frame in a request by the server to the time that the first response frame is returned by the server. During this time, the server is assumed to be processing the request. Note that with multi-tier applications (discussed below) there are cases where an upper-tier request may interrupt a server processing time.

Request Preparation 302: This processing type arises in multi-tier applications. It is the period from receipt of a request by a mid-level server (from a lower tier) until the mid-level server begins sending a subsequent request to another server.

Reply Preparation 306: This processing type also arises in multi-tier applications. It is the period from receipt of a reply by a mid-level server (from another server) to initiation of a reply to the requesting node.

Flows Report

A flow is a set of data frames that is sent from one node to another, comprises only frames in a single thread, and spans a time period during which no data frames travel in the opposite direction. A flow includes the TCP acknowledgements that are sent in the opposite direction before the transmission direction reverses.

Meaningful frames, discussed above, comprise all data frames and TCP acknowledgements within a data flow. TCP acknowledgements that occur after the last data frame in a flow are not meaningful frames for the purpose of network busy time computation.
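The flow and meaningful-frame definitions above can be illustrated with a minimal sketch. It assumes all frames belong to one thread and that capture sequence numbers reflect time order; the `Frame` and `Flow` types, the field names, and the sample frame layout are hypothetical and are not taken from the figures.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Frame:
    seq: int       # sequence number in the capture (assumed to follow time order)
    src: str       # sending node
    dst: str       # receiving node
    is_data: bool  # True for a data frame, False for a bare TCP acknowledgement


@dataclass
class Flow:
    sender: str
    receiver: str
    frames: List[Frame] = field(default_factory=list)

    @property
    def last_data_seq(self) -> int:
        return max(f.seq for f in self.frames if f.is_data)

    def meaningful(self) -> List[Frame]:
        # Data frames, plus acknowledgements that occur before the last data frame.
        last = self.last_data_seq
        return [f for f in self.frames if f.is_data or f.seq < last]


def split_into_flows(frames: List[Frame]) -> List[Flow]:
    flows: List[Flow] = []
    for f in sorted(frames, key=lambda fr: fr.seq):
        # A data frame travelling against the current direction opens a new flow.
        starts_new = f.is_data and (not flows or f.src != flows[-1].sender)
        if starts_new:
            flows.append(Flow(sender=f.src, receiver=f.dst))
        if flows:  # acknowledgements seen before any data frame are ignored
            flows[-1].frames.append(f)
    return flows


# Hypothetical capture: a request followed by a reply, with acknowledgements.
frames = [
    Frame(1, "client", "server", True),
    Frame(2, "server", "client", False),
    Frame(3, "client", "server", True),
    Frame(4, "server", "client", False),
    Frame(5, "server", "client", True),
    Frame(6, "server", "client", True),
    Frame(7, "client", "server", False),
    Frame(8, "server", "client", True),
    Frame(9, "client", "server", False),
]
flows = split_into_flows(frames)
```

In this sample, the trailing acknowledgements (frames 4 and 9) stay in their flows but are excluded by `meaningful()`.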

Within the flows report, each flow includes a number of attributes, arranged in columns, as discussed in the following sections.

Column	Description
Errors	The flows report provides a graphical depiction of relevant errors and warnings, as with other reports. A link to the error report is provided. Since a flow comprises one or more frames, the errors associated with any frame in the flow should be included in this summary.
Sending Node	The name and address of the node that sent the data frames in the flow. This node will receive TCP acknowledgements from the corresponding receiving node.
Receiving Node	The name and address of the node that received the data frames in the flow. This node will send TCP acknowledgements to the corresponding sending node.
Data Duration	The period in seconds from the time the sending node sent the first frame in the flow to the time that the receiving node received the last data frame in the flow. This is related to other fields by the relationship Data Duration = (End Data Time − Start Time).
Avg Data Rate	The average data rate in bits/sec during the flow. This information may be important to the user, because flows with a low data rate may demand investigation. For example, the user may need to determine why the data is not being transferred more quickly, particularly if the flow is also longer than most other flows. This is computed as (Data Payload Bytes * 8)/(End Data Time − Start Time).
Bytes	The total number of bytes in all frames in the flow.
Data Payload Bytes	The sum of the payload bytes in all frames in the flow.
Frames	The number of frames in the flow.
Data Frames	The number of data frames in the flow.
First Frame	The sequence number of the first frame in the flow.
Last Data Frame	The sequence number of the last data frame in the flow.
Last Frame	The sequence number of the last frame, either data or acknowledgement. If there are TCP acknowledgement frames after the last data frame, then this will differ from the Last Data Frame.
Start Time	The time that the first data frame was sent.
End Data Time	The time that the last data frame was received.
End Time	The time that the last frame (data or acknowledgement) was received. Note that this is normally not important because trailing TCP acknowledgements do not have an impact on the response time.
Data Direction	This reflects primary-to-secondary direction or vice versa, as indicated with (→ and ←) arrows. For flows that are within a capture location, the caption “within <capture point>” appears, where <capture point> indicates the location of interest.
Network Busy Time	The total time in seconds that one or more frames were in transit during the flow.
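The Data Duration and Avg Data Rate relationships given in the flows report can be sketched as follows. The `FlowTimes` type and its field names are illustrative assumptions, as is the choice to report a rate of zero for a flow with no measurable duration.

```python
from dataclasses import dataclass


@dataclass
class FlowTimes:
    start_time: float        # seconds; time the first data frame was sent
    end_data_time: float     # seconds; time the last data frame was received
    data_payload_bytes: int  # sum of payload bytes over all frames in the flow


def data_duration(flow: FlowTimes) -> float:
    # Data Duration = End Data Time - Start Time
    return flow.end_data_time - flow.start_time


def avg_data_rate(flow: FlowTimes) -> float:
    # Avg Data Rate = (Data Payload Bytes * 8) / (End Data Time - Start Time), in bits/sec
    duration = data_duration(flow)
    if duration <= 0:
        return 0.0  # assumption: avoid dividing by zero for instantaneous flows
    return flow.data_payload_bytes * 8 / duration


example = FlowTimes(start_time=1.0, end_data_time=3.0, data_payload_bytes=1000)
```

For the sample flow, 1000 payload bytes over 2 seconds gives 4000 bits/sec.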

RTA Algorithm Details

Overall Approach

RTA functions first at the thread level. Each thread is assumed to be a single-threaded sequence of request/response exchanges between the client and server. It is the responsibility of the protocol decoder to ensure that this assumption holds. A thread is broken down into five time periods, described as follows.

Column	Description
Client Processing	The time during which the client prepares packets to send over the network.
Client Sending	The time during which flow is sent from client to server.
Server Processing	The time during which the server prepares packets to send over the network.
Server Sending	The time during which flow is sent from server to client.
Processing interrupted by another thread	This period occurs only in the case of multi-tier or overlapping requests at the client.

Exemplary Embodiment—Single Thread

RTA concepts are illustrated according to one embodiment as shown in FIG. 4. FIG. 4 is a bounce diagram for a typical client/server or Web application. The application could be, e.g., a 2-tier SQL application, a web browser/web server, or an application based on ad hoc protocols.

Assume that all frames shown exist within a single thread. In this example, the client sends a 2-frame request to the server, the server processes over a period, and then the server sends a 3-frame reply to the client. The diagram shows the data frames that would be exchanged, as well as exemplary TCP acknowledgements if the application uses TCP/IP. Note that TCP is a very dynamic protocol, and therefore may not send a TCP acknowledgement for every frame. Accordingly, the diagram illustrates only one of many possible variations.

Node Processing and Sending

The Application Performance Report Chart of FIG. 1 would show the processing time for the client as the sum of the two processing times identified in FIG. 4, i.e., one at the beginning and one at the end. The processing time for the server is just the single processing time 304, and the sending time for both nodes is as indicated in FIG. 4.
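As a rough illustration of how a single thread's elapsed time might be divided into processing and sending periods, the sketch below walks the flows of a thread in time order. It assumes a simple two-node thread with no overlap; the function name, the tuple layout, and the sample times are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (sender, start_time, end_data_time); sender is "client" or "server"
FlowSpan = Tuple[str, float, float]


def thread_breakdown(thread_start: float, thread_end: float,
                     flows: List[FlowSpan]) -> Dict[str, float]:
    totals: Dict[str, float] = defaultdict(float)
    cursor = thread_start
    for sender, start, end_data in flows:
        # The gap before a flow is charged to the node that is about to send it.
        totals[f"{sender} processing"] += start - cursor
        # The data duration of the flow is charged to the sender as sending time.
        totals[f"{sender} sending"] += end_data - start
        cursor = end_data
    # Time after the last data frame is assigned to the client.
    totals["client processing"] += thread_end - cursor
    return dict(totals)


# Hypothetical thread: request sent from t=1 to t=2, reply sent from t=5 to t=8.
breakdown = thread_breakdown(0.0, 10.0, [("client", 1.0, 2.0), ("server", 5.0, 8.0)])
```

Here the client processing time is the sum of the period before the request and the period after the reply, and the four totals add up to the thread's elapsed time.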

Flows

In this example there are two flows. The first flow is from the client to the server and comprises frames 1 through 4. The last data frame in this flow is frame 3. The overall flow duration and flow data duration are annotated in FIG. 5. Because frame 4 is a TCP acknowledgement that is sent after the last data frame in the flow (frame 3) is sent, it is not considered a meaningful frame. Thus, the network is not considered busy for the time that frame 4 is in transit.

A similar analysis applies to the second flow in this example. The second flow is sent from the server to the client, and comprises frames 5 through 10. The data duration for this flow spans from the time frame 5 is sent to the time frame 8 is received.

Network Frame in Transit and Latency Statistics

Again referring to FIG. 4, the times when a frame is in transit can be seen. In general, the TCP acknowledgements that are returned after data is completely received have no impact on the overall response time; therefore, they are omitted from the network Frame in Transit measure and Latency statistics. Accordingly, in this example frames 4 and 10 do not impact the user-perceived application response time, and thus they are not included in the network latency measures.
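One way to compute the time during which at least one meaningful frame is in transit is to merge the per-frame transit intervals and sum the merged lengths. The sketch below assumes the trailing acknowledgements have already been filtered out and that send and receive times are known for each frame; the function name and sample values are illustrative.

```python
from typing import List, Optional, Tuple

Interval = Tuple[float, float]  # (time sent, time received) for one meaningful frame


def network_busy_time(transits: List[Interval]) -> float:
    busy = 0.0
    current_start: Optional[float] = None
    current_end: Optional[float] = None
    for sent, received in sorted(transits):
        if current_end is None or sent > current_end:
            # A gap with no frame in transit: close the previous busy period.
            if current_start is not None and current_end is not None:
                busy += current_end - current_start
            current_start, current_end = sent, received
        else:
            # Overlapping frames extend the current busy period.
            current_end = max(current_end, received)
    if current_start is not None and current_end is not None:
        busy += current_end - current_start
    return busy


busy = network_busy_time([(0.0, 1.0), (0.5, 1.5), (3.0, 4.0)])
```

The first two frames overlap and count as one busy period of 1.5 seconds; the third adds 1.0 second.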

A high network-frame-in-transit time can be an indication that the combination of network latency and the application's sequential request/response behavior is affecting the response time. A high network busy time may be caused by insufficient network bandwidth. This may be investigated by consulting the network utilization information in the Performance Report Chart, which breaks down the network frame-in-transit time into components caused by bandwidth and latency. If the bandwidth component dominates, then lack of bandwidth is causing the network to be busy.

Node Processing and Sending Time

Exemplary Embodiment—Multi-Thread

Node processing time analysis becomes more complicated when the application is multi-threaded. In the example above, the sum of the node processing times equals the node active times. When node processing times overlap at a node, only one such time is counted; consequently, the method 700 described above corrects the individual node processing times so that they sum to the overall node processing time.
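The correction can be illustrated with a sketch that follows the resource table, resource timeline, and resource matrix steps recited in the claims. This is one reading of those steps, not a definitive implementation: it assumes each time interval is divided equally among the resource instances active during it, and the function name and sample table are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

ResourceSet = Tuple[str, float, float]  # (resource, initiation time, termination time)


def allocate(resource_table: List[ResourceSet]) -> Dict[str, float]:
    # Resource timeline: one event per initiation (+1) and one per termination (-1).
    timeline = []
    for resource, start, end in resource_table:
        timeline.append((start, +1, resource))
        timeline.append((end, -1, resource))
    timeline.sort(key=lambda event: event[0])

    allocated: Dict[str, float] = defaultdict(float)
    active: Dict[str, int] = defaultdict(int)  # resource -> number of live instances
    previous_time = None
    for time, sense, resource in timeline:
        instances = sum(active.values())
        if previous_time is not None and instances and time > previous_time:
            # Split the interval equally among the instances active during it.
            share = (time - previous_time) / instances
            for name, count in active.items():
                allocated[name] += share * count
        active[resource] += sense
        previous_time = time
    return dict(allocated)


# Hypothetical table: two processing periods that overlap from t=2 to t=4.
times = allocate([("client", 0.0, 4.0), ("server", 2.0, 6.0)])
```

Summing the two raw durations would give 8 seconds, whereas the allocated times sum to 6 seconds, the time actually elapsed.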

The user is shown results for both node processing and sending times. Node processing time unambiguously reflects periods during which the node is processing requests (as a server) or processing a reply prior to sending the next request (as a client). Conversely, node sending time reflects not only node processing, but potentially other activity as well. It may be difficult to determine what contributed to the node sending time without examining the individual flows sent. For example, node sending times could result from factors such as the receiving node's inability to process incoming data quickly enough, insufficient bandwidth in the network, the sending node's inability to make all of the data available quickly enough, or a TCP window size that is too small. Consequently, node sending time might contribute to the total response time. For example, in cases where the network is heavily utilized or has high latency, a high node sending time can be caused by limitations of the network. To explore this, a user can link from the node-sending-time bar to view the flows sent by that node. The user could thereby attempt to identify whether the lapse between frames sent by the node represented actual processing time. Often this determination will require some knowledge of the application.

Node Processing Times

The Node Processing Detail report (not shown) shows the processing times that were detected for each node. The report is arranged in order of descending duration, with the largest processing times at the top. Processing time at a client node begins just prior to sending the first request in a thread, and ends within a thread prior to each succeeding request that the node sends. Processing time at a server node begins when the server receives a request and ends when the server begins sending a reply.

For further understanding of node processing times, a Node Processing Detail report can be opened on a trace. The window can then be split, and the packet trace placed at the bottom. For each node processing time selected, there will be two frames surrounding the processing time. For client processing times, there will be a prior reply (or the beginning of the trace) and the request that the client sends at the end of its processing time. For server processing times, there will be the request (actually the last frame in the request if it is a multi-frame request) and the response (actually the first frame in the response if it is a multi-frame response).

Flows

As discussed previously, there can be many causes for a high node sending time, and the cause of this can be difficult to determine.

Overlapping Threads

Any time two or more threads overlap, more than one node can be processing or sending at a time. Alternatively, a single node could be processing or sending on more than one thread at the same time. These processing and sending times are aggregated by concluding that a node is processing if it is processing on one or more threads, or that a node is sending if it is sending on one or more threads and is not processing.
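The aggregation rule can be sketched with interval arithmetic: the node's processing time is the union of its per-thread processing intervals, and its sending time is the union of its per-thread sending intervals with the processing intervals removed. The helper names and sample intervals below are assumptions made for illustration.

```python
from typing import List, Tuple

Interval = Tuple[float, float]


def union(intervals: List[Interval]) -> List[Interval]:
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def subtract(intervals: List[Interval], holes: List[Interval]) -> List[Interval]:
    # Both inputs are expected to be merged and sorted (as returned by union).
    result: List[Interval] = []
    for start, end in intervals:
        cursor = start
        for hole_start, hole_end in holes:
            if hole_end <= cursor or hole_start >= end:
                continue  # this hole does not touch the remaining part
            if hole_start > cursor:
                result.append((cursor, hole_start))
            cursor = max(cursor, hole_end)
        if cursor < end:
            result.append((cursor, end))
    return result


def aggregate(processing: List[Interval], sending: List[Interval]):
    node_processing = union(processing)  # processing on one or more threads
    node_sending = subtract(union(sending), node_processing)  # sending and not processing
    return node_processing, node_sending


# Hypothetical node active on two threads.
node_processing, node_sending = aggregate(
    processing=[(0.0, 3.0), (2.0, 5.0)],
    sending=[(4.0, 7.0), (8.0, 9.0)],
)
```

In the sample, the sending interval from 4 to 7 overlaps processing until 5, so only the portion from 5 to 7 counts as sending.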

Exemplary Embodiment—Multi-Tier

FIG. 6 describes the handling of multi-tier applications, as follows. Each time a data frame enters a node, the time, the frame sequence number, and the frame's thread are recorded in member variables of the CNode class. When a client of a node sends out the first frame in a request, there will be a processing time. To determine its type, it is determined whether the most recent data frame that arrived at the node was a request frame from another client. If it was, then the type is ‘Request Preparation’; otherwise, the type is ‘Client Before Thread.’
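The classification rule above can be sketched as follows. The `NodeState` class is a hypothetical stand-in for the per-node bookkeeping and is not the CNode class itself; the `is_request` flag is an assumed field that a protocol decoder would have to supply.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Arrival:
    time: float       # arrival time of the data frame
    seq: int          # frame sequence number
    thread: str       # thread the frame belongs to
    is_request: bool  # assumed flag: True if the frame is part of a request


class NodeState:
    """Hypothetical per-node record of the most recent arriving data frame."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.last_arrival: Optional[Arrival] = None

    def on_data_frame_arrival(self, arrival: Arrival) -> None:
        self.last_arrival = arrival

    def classify_first_request_frame(self) -> str:
        # Called when this node sends out the first frame of a request.
        last = self.last_arrival
        if last is not None and last.is_request:
            return "Request Preparation"
        return "Client Before Thread"


mid_tier = NodeState("app-server")
before = mid_tier.classify_first_request_frame()
mid_tier.on_data_frame_arrival(Arrival(time=1.0, seq=12, thread="T1", is_request=True))
after = mid_tier.classify_first_request_frame()
```

With no prior arrival the period is classified as Client Before Thread; once a request from a lower tier has arrived, the same outgoing request is classified as Request Preparation.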

Conclusion

A method and system for performing response time analysis of network performance have been described. Although the present invention has been described with reference to specific exemplary embodiments, it will be evident that various modifications and changes may be made to these embodiments without departing from the spirit and scope of the invention. Accordingly, the above description and drawings are to be regarded in an illustrative rather than a restrictive sense.

Claims

1. A method of characterizing application performance within a network, the method comprising: determining an amount of time each frame of a flow in a thread is processed on a sending node and on a receiving node in the network; determining an amount of time each frame of the flow is in transit on the network; building a resource table, the resource table including a plurality of resource sets, each resource set comprising a resource, an initiation time that indicates the start of activity of the resource and a termination time that indicates the end of activity of the resource; deriving a resource timeline from the resource table, the resource timeline including a plurality of events, each of the events comprising: one of the resources, an event time comprising one of an initiation time and a termination time, and a sense that indicates initiation or expiration of the event; and deriving a resource matrix from the resource timeline, the resource matrix including a plurality of processing sets arranged sequentially, each processing set comprising: an instance of a resource, a time interval during which all of the instances of all of the resources comprising the processing set are active, and an allocated time for each resource within the processing set, the allocated time for each resource being equal to the time interval divided by the number of instances of said each resource within the processing set.
2. The method of claim 1, wherein the sense is positive or negative where the event time is said initiation time or said termination time, respectively.
3. The method of claim 1, further comprising the step of summing the allocated times for all resources within the resource matrix to produce a summed time.
4. The method of claim 1, wherein each of the resources comprises a physical resource or a generic network resource.
5. The method of claim 1, further comprising displaying an allocated time.
6. A computer readable medium storing computer program instructions executable by a processor, the computer program instructions implementing a method of characterizing application performance within a network, the method comprising: determining an amount of time each frame of a flow in a thread is processed on a sending node and on a receiving node in the network; determining an amount of time each frame of the flow is in transit on the network; building a resource table, the resource table including a plurality of resource sets, each resource set comprising a resource, an initiation time that indicates the start of activity of the resource and a termination time that indicates the end of activity of the resource; deriving a resource timeline from the resource table, the resource timeline including a plurality of events, each of the events comprising: one of the resources, an event time comprising one of an initiation time and a termination time, and a sense that indicates initiation or expiration of the event; and deriving a resource matrix from the resource timeline, the resource matrix including a plurality of processing sets arranged sequentially, each processing set comprising: an instance of a resource, a time interval during which all of the instances of all of the resources comprising the processing set are active, and an allocated time for each resource within the processing set, the allocated time for each resource being equal to the time interval divided by the number of instances of said each resource within the processing set.
7. The computer readable medium of claim 6, wherein the sense is positive or negative where the event time is said initiation time or said termination time, respectively.
8. The computer readable medium of claim 6, further comprising computer program instructions for implementing a method of summing the allocated times for all resources within the resource matrix to produce a summed time.
9. The computer readable medium of claim 6, wherein each of the resources comprises a physical resource or a generic network resource.
10. The computer readable medium of claim 6, further comprising computer program instructions implementing a method for displaying an allocated time.
11. A computer-implemented method for determining node processing time relative to total transaction time, the method comprising: deriving a resource timeline from a resource table, the resource timeline including a plurality of events, each of the events comprising: one of the resources, an event time comprising one of an initiation time and a termination time, and a sense that indicates initiation or expiration of the event; deriving a resource matrix from the resource timeline, the resource matrix including a plurality of processing sets arranged sequentially, each processing set comprising: an instance of a resource, a time interval during which all of the instances of all of the resources comprising the processing set are active, and an allocated time for each resource within the processing set, the allocated time for each resource being equal to the time interval divided by the number of instances of said each resource within the processing set; and providing for display through a graphical user interface results of the derived resource matrix.
12. The method of claim 11, wherein the sense is positive or negative where the event time is said initiation time or said termination time, respectively.
13. The method of claim 11, further comprising summing the allocated times for all resources within the resource matrix to produce a summed time.
14. The method of claim 11, wherein each of the resources comprises a physical resource or a generic network resource.
15. The method of claim 11, further comprising displaying an allocated time.
16. A computer readable medium storing computer program instructions executable by a processor, the computer program instructions when executed by a processor cause the processor to: derive a resource timeline from a resource table, the resource timeline including a plurality of events, each of the events comprising: one of the resources, an event time comprising one of an initiation time and a termination time, and a sense that indicates initiation or expiration of the event; derive a resource matrix from the resource timeline, the resource matrix including a plurality of processing sets arranged sequentially, each processing set comprising: an instance of a resource, a time interval during which all of the instances of all of the resources comprising the processing set are active, and an allocated time for each resource within the processing set, the allocated time for each resource being equal to the time interval divided by the number of instances of said each resource within the processing set; and provide for display through a graphical user interface results of the derived resource matrix.
17. The computer readable medium of claim 16, wherein the sense is positive or negative where the event time is said initiation time or said termination time, respectively.
18. The computer readable medium of claim 16, further comprising instructions that cause the processor to sum the allocated times for all resources within the resource matrix to produce a summed time.
19. The computer readable medium of claim 16, wherein each of the resources comprises a physical resource or a generic network resource.
20. The computer readable medium of claim 16, further comprising instructions that cause the processor to display an allocated time.

RELATED APPLICATION

The present application is a continuation-in-part of U.S. patent application Ser. No. 09/800,080, titled “Method of Performing Response Time Analysis of Network Performance,” filed Mar. 5, 2001, now U.S. Pat. No. 7,133,911, the contents of which are incorporated by reference herein.

US Referenced Citations (5)

Number	Name	Date	Kind
6393480	Qin et al.	May 2002	B1
6449643	Hyndman et al.	Sep 2002	B1
7167821	Hardwick et al.	Jan 2007	B2
7369505	Mengerink	May 2008	B2
7417950	Hofmeister et al.	Aug 2008	B2

Related Publications (1)

	Number	Date	Country
	20060168272 A1	Jul 2006	US

Continuation in Parts (1)

	Number	Date	Country
Parent	09800080	Mar 2001	US
Child	11206486		US
