This application claims priority under 35 U.S.C. §119(a) from Chinese Patent Application No. 201010117730.1, filed on Feb. 26, 2010, the contents of which are incorporated herein by reference.
The present invention relates to the field of computers, particularly to the Integrated Development Environment (IDE), and more particularly, to indicating to developers the effect of program modifications on program performance in an IDE.
Software development is usually carried out in an IDE. The traditional software development cycle includes editing source code, building, testing and profiling. Performance problems, such as performance bottlenecks, may be discovered by profiling. To address the discovered performance problems, a developer edits the source code, builds, tests and profiles the program over again.
In the above performance problem solving manner, the performance problems introduced in the editing phase will be discovered in the profiling phase and solved by editing the source code in the next cycle. The problem with this solving process is that it is long, since testing and profiling will generally run for a period of time. Furthermore, this process may result in the performance regression problem, i.e., new performance problems may be introduced when the developer is modifying the source code, and the developer is usually unaware of the performance problems caused by his/her modifications. This is because programs are usually complicated, and it is difficult for the developer to evaluate the overall performance effect of the code modifications. In addition, in a collaborative development, a developer only knows about the modules for which he/she is responsible, and has no idea about the modules for which other developers are responsible; more importantly, development is usually separated from testing and profiling. Even though some IDEs comprise an integrated profiler, the developer still needs to wait for a period of time to get the profiling results, then to check the positions where performance bottlenecks and performance regressions are generated, and to try to solve the problems. Such a process for solving performance problems is obviously inefficient.
In one embodiment of the present invention, a method for indicating program modifications affecting program performance in an Integrated Development Environment comprises detecting modifications that a developer makes to a program. The method further comprises determining a code location where the modifications are made. Additionally, the method comprises acquiring previous profiling data of the program. Furthermore, the method comprises evaluating an effect of the modifications on program performance according to the previous profiling data and the code location where the modifications are made. In addition, the method comprises providing results of the evaluation to the developer.
Other forms of the embodiment of the method described above are in a computer program product.
The foregoing has outlined rather generally the features and technical advantages of one or more embodiments of the present invention in order that the detailed description of the present invention that follows may be better understood. Additional features and advantages of the present invention will be described hereinafter which may form the subject of the claims of the present invention.
A better understanding of the present invention can be obtained when the following detailed description is considered in conjunction with the following drawings, in which:
The present invention comprises a method, computer program product and apparatus for analyzing previous profiling data of a program, and for indicating to developers the performance loss that may be caused by their modifications while they are modifying the program.
The embodiments of the present invention will now be described with reference to the accompanying drawings. In the following description, numerous details are described to enable the present invention to be fully understood. However, it is obvious to those skilled in the art that the realization of the present invention can be without some of these details. In addition, it should be appreciated that the present invention is not limited to the described specific embodiments. In contrast, it is contemplated to implement the present invention by using any combination of the following features and elements, no matter whether they involve different embodiments or not. Therefore, the following aspects, features, embodiments and advantages are illustrative, rather than elements or limitations of the appended claims, unless explicitly stated otherwise in the claims.
Now referring to
Referring to
In step 302, the code locations where the modifications occur are determined. According to an embodiment of the present invention, step 302 includes determining the function where the modified code is located. In one embodiment, step 302 assumes that all the modifications to the program are performed inside existing functions of the program. As for changes outside the existing functions (e.g., adding a new function outside the existing functions), there are callings of the newly added function being added in some existing function, and thus this step will determine the function calling the newly added function, and will not determine the newly added function itself. In programming languages, such as the C programming language, a macro may be defined, and the macro may be invoked by a function. Modifying an existing macro or adding a new macro will also result in performance variation of the function calling the macro. Therefore, determining the function in which the modified code is located also includes determining all the functions invoking a modified macro or a newly added macro.
It should be noted that the term “function” used herein refers to any relatively independent code segment that accomplishes a specific function in the program, including subroutines, methods, procedures, sub-programs, etc. in different programming languages.
In step 303, the previous profiling data of the program is obtained. Profiling data is generated during the profiling phase of the program, and it includes various performance data and performance relevant data in a number of previous program executions, such as performance indexes (e.g., a statistic value of the number of processor clock cycles consumed by the functions during a number of program runnings), performance relevant indexes (e.g., the amount of machine instructions or the amount of machine instruction clock cycles consumed by the functions), and the invocation frequencies of the functions in a number of previous program runnings. The following table illustrates an exemplary structure of the profiling data:
where the Function ID is the identification of a function in the program; performance relevant index refers to the performance relevant factors of the function itself (e.g., the number of static machine instructions, the number of machine instruction clock cycles of the function); performance index 1 . . . performance index n are the actual performance indexes profiled for the corresponding function in a number of previous runnings, such as the total number or average number of the consumed processor clock cycles; and the invocation frequency is the frequency or percentage that the function is invoked during the previous runnings
The above profiling data may be generated directly by the profiling process, or may be extracted from the data generated in the profiling process.
In step 304, the effect of the modifications on the program performance is evaluated according to the previous profiling data and the code location where the modifications occur. According to an embodiment of the present invention, step 304 includes evaluating the expected variation, which is caused by the modifications, of the performance index of the function in which the modifications are located according to the previous profiling data, and evaluating the effect of the modifications on the program performance according to the expected variation of the performance index of the function. Therein, evaluating the expected variation, which is caused by the modifications, of the performance index of the function in which the modifications are located according to the previous profiling data includes: evaluating the variation, which is caused by the modifications, of the performance relevant index of the function in which the modifications are located; and evaluating the expected variation, which is caused by the modifications, of the performance index of the function in which the modifications are located according to the variation of the performance relevant index. And therein, evaluating the effect of the modifications on the program performance according to the expected variation of the performance index of the function includes: evaluating the effect of the modifications on the program performance according to the ratio of the expected variation of the performance index of the function to the sum of the performance indexes of all the functions of the program, as well as the frequency that the function is invoked during the program running
In step 305, the evaluation results are provided to the developer.
Referring to
In step 402, the variation amount of the performance relevant index, PerformanceRelevantIndex_Add, of the function caused by the modified code is evaluated.
In step 403, the following equation (EQ1) is calculated:
where PerformanceIndex and PerformanceRelevantIndex are the performance index and performance relevant index of the function in which the modifications are located that are obtained in step 401, respectively. p denotes the expected variation of the performance index of the function in which the modifications are located, caused by the modifications, as evaluated according to the variation of the performance relevant index of the function in which the modifications are located, caused by the modifications.
In step 404, the variable α is calculated using the following equation (EQ2):
where p is the expected variation of the performance index in which the modifications are located, as evaluated in step 403; β is the invocation frequency of the function as obtained from the profiling data of the program (e.g., the percentage that the function is invoked in a number of program runnings); α denotes the effect of the modifications on the program's overall performance, and it is evaluated according to the expected variation, which is caused by the modifications, of the performance index of the function in which the modifications are located and in consideration of the ratio of the expected variation of the performance index of the function in the sum of performance indexes of all the functions in the program and the invocation frequency of the function.
According to an embodiment of the present invention, in step 305, the value α, as calculated in step 404, may be presented to the developer, to indicate to him/her the effect of the modifications on the program performance. According to another embodiment of the present invention, the value a may also first be compared with a pre-defined threshold, and when the value α is larger than the threshold, it is indicated to the developer that his/her modifications may greatly affect the program performance. It should be noted that the above steps are performed in the phase that the developer is editing the code in an IDE so that the performance influence probably caused by the editing actions of the developer can be timely indicated. As a result, the developer may carry out performance optimization in time, thus reducing performance regression and enhancing the development efficiency.
According to an embodiment of the present invention, the performance relevant index is the number of machine instructions consumed by the function, and the variation amount of the performance relevant index, PerformanceRelevantIndex_Add, is the variation amount of the machine instructions of the function, named as FIA (Function_Instruction_Add) caused by the code modifications.
Referring to
In step 502, a determination is made as to whether the modified code introduces a new function invocation.
If the modified code does not introduce a new function invocation, then, in step 503, the number of machine instructions of this line of code, LineInstruction, is calculated. As known in the art, the number of machine instructions of a code line may be obtained by compiling the modified program code and then calculating from the compiled binary code.
In step 504, the equation FIA=FIA+LineInstruction is calculated, assuming that the initial value of FIA is 0. Upon executing step 504, the modified code in the function is detected line by line in step 501. If the modifications are not simply adding new code lines, but modifying old code lines into new code lines, then the number of machine instructions of the old code lines should be further subtracted from the above expression.
If, however, the modified code introduces a new function invocation, then in step 505, the number of machine instructions introduced by the new function, FunInstruction, is calculated. As known in the art, the number of machine instructions introduced by the new function may be obtained by compiling the modified program code and from the compiled binary code, or may be obtained by accumulating the number of machine instructions of all the instruction lines contained in the new function. For the case where the newly introduced function invocation is an I/O function invocation, since the I/O functions with the same number of instructions may consume substantially different processor clock cycles, it is preferable to use another embodiment illustrated in
In step 506, the equation FIA=FIA+FunInstruction is calculated. Upon executing step 506, the modified code in the function is detected line by line in step 501. If the modifications are not simply adding a new function invocation, but replacing an old function invocation by a new function invocation, then the number of machine instructions of the old function may be further subtracted from the above expression.
In one embodiment, the above process is performed iteratively, until all the modified code lines in the function have been processed, so as to obtain the final machine instruction variation amount FIA of the function caused by the code modification.
According to another embodiment of the present invention, the performance relevant index is the number of machine instruction clock cycles consumed by the function, and the variation amount of the performance relevant index, PerformanceRelevantIndex_Add, is the variation amount of machine instruction clock cycles FCA (Function_Cycle_Add) of the function caused by the code modifications.
Referring to
In step 602, a determination is made as to whether the modified code introduces a new function invocation.
If the modified code does not introduce a new function invocation, then, in step 603, the number of machine instruction clock cycles, LineCycle, of this line of code is calculated. As known in the art, the number of machine instruction clock cycles of a code line may be calculated by the number of machine instructions of the code line and the clock cycles occupied by each machine instruction.
In step 604, the equation FCA=FCA+LineCycle is calculated, assuming that the initial value of FCA is 0. Upon executing step 604, the modified code in the function is detected line by line in step 601. If the modifications are not only simply adding new code lines, but modifying old code lines into new code lines, then the number of machine instruction clock cycles of the old code lines should be further subtracted from the above expression.
If, however, the modified code introduces a new function invocation, then in step 605, the number of machine instruction clock cycles, FunCycle, introduced by the new function is calculated. As known in the art, the number of machine instruction clock cycles introduced by the new function may be obtained by accumulating the number of machine instruction clock cycles of all the instruction lines contained in the new function. For the case where the newly introduced function invocation is an I/O function invocation, since the number of processor clock cycles consumed by I/O functions with the same number of machine instructions may differ greatly, and since I/O functions are usually included in third-party library functions, the number of clock cycles consumed by the I/O functions needs to be recorded in the third-party library functions (which is evaluated by the third-party). Thus, when the I/O function is introduced for the first time due to program modifications, the number of clock cycles may be obtained from the third-party library function as the number of machine instruction clock cycles introduced by this new I/O function. In future profiling, performance data of the I/O function (i.e., the number of clock cycles) may be further collected to replace the number of clock cycles provided by the third party for evaluating the variation amount of the performance relevant index.
In step 606, the equation FCA=FCA+FunCycle is calculated. Upon executing step 606, the modified code in the function is detected line by line in step 601. If the modifications are not simply adding a new function invocation, but modifying an old function invocation into a new function invocation, then the number of machine instruction clock cycles of the old function should be further subtracted from the above expression.
In one embodiment, the above process is performed iteratively, until all the modified code lines in the function are processed, so as to obtain the final machine instruction variation amount FCA of the function caused by the code modifications.
Above is described a method for indicating program modifications affecting program performance in an IDE according to the embodiments of the present invention. It should be pointed out that the above description and illustration are exemplary. In other embodiments of the present invention, the method may have more, less or different steps, and the details of the respective steps and the order among the steps may be different from that is illustrated and described. For example, according to one embodiment of the present invention, in step 304, the invocation frequency of the function in which modifications are located may further be determined; and if the invocation frequency is too small, e.g., smaller than 0.1, then it may be determined that the modifications to this function will not cause significant effect on the program performance, so that the evaluation for this function may be omitted. For further example, while the above description uses the number of processor clock cycles as the performance index of the function, and uses the number of machine instructions and the number of machine instruction clock cycles as the performance relevant index of the function; this is exemplary; and in some other embodiments of the present invention, other performance indexes and performance relevant indexes may be used. Furthermore, the above described method for calculating the performance relevant index of the function and the formula in steps 403 and 404 are exemplary as well, and they may have other variations in other embodiments of the present invention.
Referring again to
Computer system 700 may further include a communications adapter 709 coupled to bus 702. Communications adapter 709 may interconnect bus 702 with an outside network (not shown) thereby allowing computer system 700 to communicate with other similar devices.
I/O devices may also be connected to computer system 700 via a user interface adapter 710 and a display adapter 711. Keyboard 712, mouse 713 and speaker 714 may all be interconnected to bus 702 through user interface adapter 710. Data may be inputted to computer system 700 through any of these devices. A display monitor 715 may be connected to system bus 702 by display adapter 711. In this manner, a user is capable of inputting to computer system 700 through keyboard 712 or mouse 713 and receiving output from computer system 700 via display 715 or speaker 714.
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or computer program product. Accordingly, aspects of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the C programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
Aspects of the present invention are described above with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the present invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to product a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the function/acts specified in the flowchart and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the function/acts specified in the flowchart and/or block diagram block or blocks.
Now referring to
According to an embodiment of the present invention, determining the code location where the modifications occur comprises determining the function in which the modifications are located, and the previous profiling data comprises the performance index of the function in the program in the previous program running.
According to an embodiment of the present invention, the performance influence evaluating module 804 is used for: evaluating the expected variation, which is caused by the modifications, of the performance index of the function in which the modifications are located according to the previous profiling data, and evaluating the effect of the modifications on the program performance according to the expected variation of the performance index of the function.
According to an embodiment of the present invention, evaluating the expected variation, which is caused by the modifications, of the performance index of the function in which the modifications are located according to the previous profiling data comprises: evaluating the variation, which is caused by the modifications, of the performance relevant index of the function in which the modifications are located; and evaluating the expected variation, which is caused by the modifications, of the performance index of the function in which the modifications are located according to the variation of the performance relevant index.
According to an embodiment of the present invention, evaluating the effect of the modifications on the program performance according to the expected variation of the performance index of the function comprises: evaluating the effect of the modifications on the program performance according to the ratio of the expected variation of the performance index of the function to the sum of the performance indexes of all functions of the program and the invocation frequency of the function in the program running
According to an embodiment of the present invention, the performance index is a statistic value of the processor clock cycles consumed by the function in previous program running
According to an embodiment of the present invention, the performance relevant index is the number of machine instructions of the function. According to another embodiment of the present invention, the performance relevant index is the number of machine instruction clock cycles of the function.
Above is described the apparatus for indicating program modifications affecting program performance in an IDE according to the embodiments of the present invention. It should be pointed out that the above description and illustration are exemplary. In other embodiments of the present invention, the apparatus may have more, less or different modules, and the relationships, such as connection and inclusion relationships, among the modules may be different from that is described. The modules discussed above may be inside an IDE as a module of the IDE, or may be a standalone module outside the IDE and work together with IDE.
The following describes the working process of the method of the present invention by way of an example using the following exemplary program:
Assuming the processor clock cycles is selected as the performance index for profiling, and the number of machine instructions is selected as the performance relevant index, the profiling data are shown in the following table:
It is assumed that the developer modifies the above example program as follows:
In the above modified program, lines 15-19 are a newly added function, sort_data ( ), and lines 29-30 are the newly added code. According to the process illustrated in
This means that the newly added 428 machine instructions in main ( ) will consume 5136 processor clock cycles. Then, in step 404, the following is calculated.
That is, the newly added code will result in a performance loss of 39% of the original program. Assuming the predetermined threshold is 10%, since 39%>10%, it is indicated to the developer.
Although the present invention has been illustrated and described with reference to the preferred embodiments, those skilled in the art will understand that various changes both in form and details may be made thereto without departing from the spirit and scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
20100117730.1 | Feb 2010 | CN | national |