System and method for automated tuning of program execution tracing

Information

  • Patent Grant
  • 7827539
  • Patent Number
    7,827,539
  • Date Filed
    Thursday, June 23, 2005
    20 years ago
  • Date Issued
    Tuesday, November 2, 2010
    15 years ago
Abstract
A tracing system that provides automated tuning of execution tracing by adjusting the collection of trace data is described. In one embodiment, the user sets an initial tracing profile for a tracing program. In addition, the user sets an upper limit for the tracing performance penalty. The auto-tuning system monitors the performance penalty induced by tracing and, when the performance impact is excessive, removes trace points that are causing the most impact on performance. Auto tuning is especially useful for performing software recording in mission-critical and/or time-critical applications, such as servers, real-time applications, etc. The system typically adjusts relatively quickly such that most users do not feel the influence of the tracer.
Description
BACKGROUND

1. Field of the Invention


The invention relates to maintenance and debugging of computer software by execution tracing.


2. Description of the Related Art


Program execution tracing has become a valuable tool for troubleshooting run-time software problems and for remote application support. One of the major problems with this approach is keeping the right balance between completeness of the trace information and the performance penalty of the tracing process. Users of such a tool want to gather enough information to allow them to diagnose the problem. However, tracing of each function call, source line, variable and other run-time information can create an undesirable performance impact on the traced program. Extensive tracing can lead to situations where the traced process is slowed below acceptable levels. In real-life deployment scenarios, it is typically desirable to keep performance penalty within a 5-25% range.


SUMMARY

These and other problems are solved by a tracing system that provides automated tuning of execution tracing by adjusting the collection of trace data. In one embodiment, the tuning system can adjust the collection of trace data without the need for program sources or symbolic information, and hence, makes such automated tuning of program execution tracing possible in production environments.


In one embodiment, the user sets an initial tracing profile for a tracing program (tracer) that allows the user to gather a superset of information relevant for software problem troubleshooting. In addition, the user sets an upper limit for the tracing performance penalty. The auto-tuning system monitors the performance penalty induced by the tracer and, when the performance impact is excessive, removes trace points that are causing the most impact on performance. Using the described auto-tuning mechanism, the application operator can limit the performance penalty when access to the application sources and symbolic information is not available. Auto tuning is especially useful for performing software recording in mission-critical and/or time-critical applications, such as servers, real-time applications, etc. The system typically adjusts itself quickly enough (e.g., often in less than a second) and most users do not feel the influence of the tracer.


The auto-tuning process maintains a desired allowable performance penalty by adjusting the trace process to operate within preset limits. This is achieved by automatic adjustment of the trace points in the traced process. In one embodiment, tracing starts with a relatively maximal set of trace points that involves, for example, a list of functions of the traced process or methods of the debugged class. The auto-tuning module checks the performance penalty caused by tracing and turns off those trace points with the highest performance impact until the performance penalty drops below a specified limit.


In one embodiment, the auto-tuning process adapts to a variable CPU load. If the auto tuner system detects that the performance penalty is lower than the preset threshold, it turns on disabled trace points until the trace performance penalty reaches the threshold value.


In one embodiment, the user can mark certain trace points as obligatory for tracing and the auto-tuning procedure will not turn off these trace points.


In one embodiment, the auto-tuning system maintains a desired balance between trace completeness and trace performance penalty. In one embodiment, the system simplifies the trace point setting process and makes the trace-based troubleshooting process more user-friendly.


In one embodiment, the performance impact is specified in terms of program runtime. In one embodiment, the performance impact is specified in terms of CPU utilization. In one embodiment, the performance impact is specified in terms of system throughput. In one embodiment, the performance impact is specified in terms of an amount of time used by the tracing program. In one embodiment, the performance impact is specified in terms of an amount of time penalty caused by tracing.


One embodiment includes specifying an initial trace profile listing a first list of runtime events to be traced, monitoring a performance penalty incurred by tracing execution of a program using the first list of runtime events, determining which of the runtime events produces a relatively larger impact on the performance penalty to produce a second list of one or more runtime events, and removing the second list of events from the first list of events. In one embodiment, the monitoring, determining, and removing are repeated until the performance penalty falls below a desired level. In one embodiment, trace items are returned to the list of runtime events to be traced until the performance penalty approximates a desired level. In one embodiment, the performance criteria includes time taken by a tracing engine compared to time taken by a program being traced. In one embodiment, the list of runtime events is specified without recourse to compiler-generated debug information from the program. In one embodiment, the list of runtime events is specified without recourse to source code from the program. In one embodiment, the list of runtime events is specified using at least a portion of compiler-generated debug information from the program. In one embodiment, the list of runtime events is specified using at least a portion of the source code from the program.





BRIEF DESCRIPTION OF THE FIGURES


FIG. 1 is a block diagram of an auto-tuning tracing process.



FIG. 2 shows a code recording profile dialog.



FIG. 3 shows an initial tracing view.



FIG. 4 shows a code recording profile properties dialog.



FIG. 5 shows an auto-tuning dialog with a list of excluded functions.



FIG. 6 shows an auto-tuning dialog with a list of excluded functions and a list of included functions.



FIG. 7 shows the tracing view of FIG. 3 after tuning.





DETAILED DESCRIPTION


FIG. 1 is a block diagram of an auto-tuning process 100. The process begins by specifying an initial collection of trace points 102. The initial collection can be specified by the user, set automatically, etc. In addition, two system-level parameters are set. These parameters include a timer time-out value and a penalty threshold 104. The timer time-out is a parameter that defines a time interval for checking the performance penalty. In one embodiment, the timer timeout is set at approximately 1 second. In other embodiments, it can be any number defined by a user or by the system. The penalty threshold is a user-defined parameter that specifies the maximum allowable performance penalty caused by tracing. Application execution tracing 106 traces the execution according to the defined collection of trace points. Execution tracing is described, for example, in U.S. Pat. No. 6,202,199 and U.S. Published Application 20020087949 (the entire contents of both are hereby incorporated by reference), or other tracing tools.


A performance module 108 calculates the tracing performance penalty. At time intervals, when the timer timeout has elapsed, the system adjusts the list of trace points 112 in such a way that the performance penalty stays relatively close to the penalty threshold.


In typical real-world cases, the auto-tuning reduces the performance penalty without substantial loss of significant information because the functions that are called relatively more often (and hence, consume a large recording penalty) are typically low-level functions, such as string manipulators, mouse movement handlers, etc. These functions are usually well debugged and, therefore, are not usually interesting for troubleshooting purposes. Thus, the removal of such functions from the trace list typically does not have a substantial impact on the value of the trace data.


In one embodiment, the following procedure is used for the calculation of the performance penalty.


Calculate an instrumentation time (InstrumentationTime) spent by the tracing engine. The value InstrumentationTime is calculated by summing time differences between entries and exits of the instrumentation engine for a specified interval of time called the TimeInterval.


Calculate the processor time used by the instrumented process (InstrumentedProcessTime). In one embodiment, the value InstrumentedProcessTime is calculated by using a performance counter (such as, for example, PercentProcessorTime on the Windows operating system) for the traced process. In order to take into account fluctuations in the CPU load, the counter values are averaged over the TimeInterval period of time. Time spent by the process is then calculated as InstrumentedProcessTime=TimeInterval*PercentProcessorTime/100.


In one embodiment, an upper level estimate is based on the assumption that the instrumented process uses all of the CPU resources: InstrumentedProcessTime=TimeInterval.


Calculate the CPU time used by the process, without the instrumentation overhead:

ProcessTime=InstrumentedProcessTime−InstrumentationTime.


Calculate performance penalty of the instrumentation:

Penalty=100*InstrumentaionTime/ProcessTime


In one embodiment, the system displays the current recording penalty, and/or the penalty over time, e.g. in a graph, table, display, etc. This can be used to determine the maximum penalty level for the auto-tuning mechanism.


A typical workflow of the tracing system starts with the setting of the tracing or recording profile. A recording profile represents a collection of trace points that are inserted into the traced program. Each trace point is associated with a specific function or a source code line. The recording profile allows the trace user to set what will be recorded during tracing, such as function arguments and return values, local and global variables or calls to library functions, etc. FIG. 2 shows the code recording profile window of a sample application httpsrv.exe representing a simple HTTP server provided by Microsoft on MSDN. The user selects functions to be traced by marking checkboxes of corresponding function names in the left pane 202. In one embodiment, the tracer automatically turns on tracing of all source code functions of the traced process. In many situations this leads to a substantial performance penalty.


The results of the initial tracing are shown in FIG. 3, which shows the traced function call tree in a left pane and the statistics of the traced functions in a lower-right pane. By way of example, FIG. 3 shows are several functions that are called very often in one sample application. The Function CHttpSrvView:GetRunTimeClass 302 is called 28,988 times and the function CHttpSrvView:AssertValid 304 is called 17,379 times.


In order to limit the performance penalty, the user selects the Properties dialog by pressing the Properties button 204 as illustrated in FIG. 2. This action opens the Properties dialog presented in FIG. 4. The user selects an Auto-tuning check box 402 and enters a performance penalty value 404 expressed in percents. Clicking an OK button 406 applies the settings. Pressing an “Advanced” button 408 opens an Auto-tuning dialog 500 shown in FIG. 5.


The Auto-tuning dialog 500 has two panes. A right pane 502 shows the list of functions excluded from tracing by the auto-tuning process. The user can force a desired function to stay in the trace list by selecting the desired function in the right pane and pressing the left arrow button 504. The list of functions that are to be retained in the trace appears in the left window. These functions are a part of the recording profile. The user can move functions back to the removed list using the right arrow button 506. The list of functions deleted by the auto-tuning procedure 502 closely corresponds to the list of the most-frequently executed functions that are displayed in the right lower pane 304 in FIG. 3.



FIG. 6 shows the results of adding a function to the list of functions that are retained in the trace list 602.



FIG. 7 shows the tracing window after applying the auto-tuning procedure. The right-lower pane 702 shows the updated list of most frequently executed functions that were traced using the tuned recording profile.


CPU time usage is only one of several performance characteristics. There are a number of other ways to measure useful performance of an application. For example, in the case of HTTP servers it is accepted practice to measure performance by the number of hits-per-second that the server can serve under stressed conditions. Table 1 below shows the reduction in performance penalty achieved by the auto-tuning process. The sample HTTP server httpsrv.exe was stress-tested in 3 modes: 1) without tracing; 2) complete function-level tracing; and 3) auto-tuned tracing. The results of the experiment presented in the table show that auto-tuning allowed to reduce the performance penalty from 75% to 7%.











TABLE 1







Performance


Mode
Performance (hits/sec)
Penalty %







1. No tracing
68.8
0%


2. Full function-level tracing
17.3
75% 


3. Auto-tuned tracing
63.9
7%









Although various embodiments have been described above, other embodiments will be apparent to one of ordinary skill in the art based on the above disclosure. For example, the above equations and calculations are provided by way of example and not by way of limitation. One of ordinary skill in the art will recognize that however the performance penalty is specified and calculated, users will typically adjust the desired penalty value in order to keep the performance impact of tracing within acceptable levels. One of ordinary skill in the art will recognize that many methods of specifying the allowable performance penalty and/or computing the performance penalty can be used. Thus, the invention is limited only by the claims that follow.

Claims
  • 1. A method for tracing comprising: receiving from a user an initial trace profile listing a first list of runtime events to be traced during execution of a program;monitoring on one or more processors, via an auto-tuning process, a performance penalty incurred as a result of the tracing of said first list of runtime events during the execution of the program;determining, via the auto-tuning process, a subset of said first list of runtime events the tracing of which produces the largest impact on said performance penalty to produce a second list of one or more runtime events to not be traced;removing, via the auto-tuning process, said second list of runtime events from said first list of runtime events;displaying, in response to the removing, a visual display, wherein the visual display comprises a graphical representation of the second list of runtime events;receiving from the user a selection from the visual display, wherein the selection comprises one or more runtime events from the second list of runtime events; andrestoring the one or more runtime events contained in the selection from the second list of runtime events to the first list of runtime events.
  • 2. The method of claim 1, further comprising: repeating said monitoring, determining, and removing until said performance penalty falls below a desired level.
  • 3. The method of claim 1, further comprising: returning one or more items on said second list of runtime events to said first list of runtime events until said performance penalty approximates a desired level.
  • 4. The method of claim 1, wherein said performance penalty comprises time taken by a tracing engine compared to time taken by the program being traced.
  • 5. The method of claim 1, wherein said first list of runtime events is specified without recourse to compiler-generated debug information from said program.
  • 6. The method of claim 1, wherein said first list of runtime events is specified without recourse to source code from said program.
  • 7. The method of claim 1, wherein said first list of runtime events is specified using at least a portion of compiler-generated debug information from said program.
  • 8. The method of claim 1, wherein said first list of runtime events is specified using at least a portion of the source code from said program.
  • 9. A method for automated tuning of program execution tracing, the method comprising: receiving from a user an initial collection of trace points to be included in the program execution tracing;defining a maximum allowable level for a performance penalty of the program execution tracing;monitoring on one or more processors, via an auto-tuning process, the performance penalty of the program execution tracing while operating the program execution tracing in accordance with the trace points;removing, via the auto-tuning process, one or more trace points within the program execution tracing to maintain the monitored performance penalty below the maximum allowable level;displaying a graphical representation of the one or more removed trace points;receiving from the user a selection from the graphical representation, wherein the selection comprises a subset of the one or more removed trace points; andrestoring the selected subset of the one or more removed trace points to the trace points within the program execution tracing.
  • 10. The method of claim 9, further comprising initializing the program execution tracing to include all possible trace points.
  • 11. The method of claim 9, further comprising adapting the monitoring to a variable CPU load.
  • 12. The method of claim 9, further comprising increasing the number of trace points within the program execution tracing while maintaining the performance penalty below the maximum allowable level.
  • 13. The method of claim 9, wherein removing one or more trace points comprises disabling at least one trace point.
  • 14. The method of claim 9, wherein the one or more removed trace points comprise the trace points that consume the highest amount of processor time.
  • 15. The method of claim 9, further comprising defining at least one mandatory trace point that is always enabled.
  • 16. A non-transitory computer readable medium storing computer executable instructions which when executed on a processor perform a method for reducing a performance penalty incurred by a program being traced, the method-comprising: receiving from a user an initial collection of trace points;defining a maximum allowable level for the performance penalty incurred by the program;tracing, on one or more processors, the program in accordance with the initial collection of trace points;assessing, via an auto-tuning process, the performance penalty incurred by the program due to the tracing;adjusting, via the auto-tuning process, the initial collection of trace points to maintain the assessed performance penalty below the maximum allowable level, wherein the adjusting comprises removing one or more trace points;displaying a visual display, wherein the visual display comprises a graphical representation of the one or more removed trace points;receiving from the user a selection from the visual display, wherein the selection comprises one or more of the removed trace points;restoring the one or more selected trace points to the adjusted initial collection of trace points to create a second collection of trace points; andtracing, on the one or more processors, the program in accordance with the second collection of trace points.
  • 17. The computer readable medium of claim 16, further comprising defining a frequency at which the performance penalty is assessed.
  • 18. The computer readable medium of claim 16, wherein the initial collection of trace points includes all possible trace points.
  • 19. The computer readable medium of claim 16, wherein adjusting the initial collection of trace points further comprises increasing the number of trace points while maintaining the performance penalty below the maximum allowable level.
  • 20. The computer readable medium of claim 16, wherein removing one or more trace points comprises disabling at least one trace point.
  • 21. The computer readable medium of claim 16, wherein removing one or more trace points comprises disabling the trace points that consume the highest amount of processor time.
  • 22. The computer readable medium of claim 16, further comprising defining certain trace points as mandatory trace points that cannot be disabled.
  • 23. A method for tracing comprising: specifying an initial trace profile listing a first list of runtime events to be traced during execution of a program;monitoring on one or more processors, via an auto-tuning process, a performance criteria during the execution of the program while tracing said first list of runtime events;determining, via the auto-tuning process, a subset of said first list of runtime events the tracing of which produces the largest impact on said performance criteria to produce a second list of one or more runtime events to not be traced;removing, via the auto-tuning process, said second list of runtime events from said first list of runtime events;receiving, in response to the removing, a user-selected subset of the second list of runtime events; andrestoring the user-selected subset of the second list of runtime events to the first list of runtime events.
  • 24. The method of claim 23, further comprising: repeating said monitoring, determining, and removing until said performance criteria reaches an acceptable level.
  • 25. The method of claim 23, wherein said performance criteria comprises time taken by a tracing engine.
REFERENCE TO RELATED APPLICATION

The present application claims priority benefit of U.S. Provisional Application No. 60/582,761, titled “SYSTEM AND METHOD FOR AUTOMATED TUNING OF PROGRAM EXECUTION TRACING,” filed Jun. 25, 2004, the entire contents of which is hereby incorporated by reference.

US Referenced Citations (80)
Number Name Date Kind
4503495 Boudreau Mar 1985 A
4511960 Boudreau Apr 1985 A
4598364 Gum et al. Jul 1986 A
4782461 Mick et al. Nov 1988 A
4879646 Iwasaki et al. Nov 1989 A
5021949 Morten et al. Jun 1991 A
5121489 Andrews Jun 1992 A
5193180 Hastings Mar 1993 A
5265254 Blasiak et al. Nov 1993 A
5297274 Jackson Mar 1994 A
5335344 Hastings Aug 1994 A
5347649 Alderson Sep 1994 A
5386522 Evans Jan 1995 A
5386565 Tanaka et al. Jan 1995 A
5394544 Motoyama et al. Feb 1995 A
5408650 Arsenault Apr 1995 A
5410685 Banda et al. Apr 1995 A
5421009 Platt May 1995 A
5446876 Levine et al. Aug 1995 A
5450586 Kuzara et al. Sep 1995 A
5465258 Adams Nov 1995 A
5481740 Kodosky Jan 1996 A
5483468 Chen et al. Jan 1996 A
5513317 Borchardt et al. Apr 1996 A
5526485 Brodsky Jun 1996 A
5533192 Hawley et al. Jul 1996 A
5551037 Fowler et al. Aug 1996 A
5574897 Hermsmeier et al. Nov 1996 A
5581697 Gramlich et al. Dec 1996 A
5590354 Klapproth et al. Dec 1996 A
5612898 Huckins Mar 1997 A
5615331 Toorians et al. Mar 1997 A
5632032 Ault et al. May 1997 A
5642478 Chen et al. Jun 1997 A
5657438 Wygodny et al. Aug 1997 A
5732210 Buzbee Mar 1998 A
5740355 Watanabe et al. Apr 1998 A
5745748 Ahmad et al. Apr 1998 A
5771385 Harper Jun 1998 A
5781720 Parker et al. Jul 1998 A
5848274 Hamby et al. Dec 1998 A
5867643 Sutton Feb 1999 A
5870606 Lindsey Feb 1999 A
5896535 Ronstrom Apr 1999 A
5903718 Marik May 1999 A
5928369 Keyser et al. Jul 1999 A
5938778 John, Jr. et al. Aug 1999 A
5940618 Blandy et al. Aug 1999 A
5960198 Roediger et al. Sep 1999 A
5983366 King Nov 1999 A
6003143 Kim et al. Dec 1999 A
6026433 D'Arlach et al. Feb 2000 A
6026438 Piazza et al. Feb 2000 A
6047124 Marsland Apr 2000 A
6065043 Domenikos et al. May 2000 A
6108330 Bhatia et al. Aug 2000 A
6202199 Wygodny et al. Mar 2001 B1
6219826 De Pauw et al. Apr 2001 B1
6237138 Hameluck et al. May 2001 B1
6263456 Boxall et al. Jul 2001 B1
6282701 Wygodny et al. Aug 2001 B1
6321375 Blandy Nov 2001 B1
6360331 Vert et al. Mar 2002 B2
6374369 O'Donnell Apr 2002 B1
6415394 Fruehling et al. Jul 2002 B1
6467052 Kaler et al. Oct 2002 B1
6490696 Wood et al. Dec 2002 B1
6507805 Gordon et al. Jan 2003 B1
6634001 Anderson et al. Oct 2003 B2
6865508 Ueki et al. Mar 2005 B2
7058928 Wygodny et al. Jun 2006 B2
7089536 Ueki et al. Aug 2006 B2
7114150 Dimpsey et al. Sep 2006 B2
7386839 Golender et al. Jun 2008 B1
20020087949 Golender et al. Jul 2002 A1
20030005414 Elliott et al. Jan 2003 A1
20040060043 Frysinger et al. Mar 2004 A1
20060150162 Mongkolsmai et al. Jul 2006 A1
20060242627 Wygodny et al. Oct 2006 A1
20080244534 Golender et al. Oct 2008 A1
Provisional Applications (1)
Number Date Country
60582761 Jun 2004 US