This application claims the benefit of Indian Patent Application Serial No. 680/CHE/2014 filed Feb. 13, 2014, which is hereby incorporated by reference in its entirety.
This disclosure relates generally to the technical field of software performance evaluation and, more specifically to a method and/or a system for evaluating performance of a software application through run-time assembly code execution.
Popularity of cloud deployments has brought forth several challenges. One of the challenges being performance of a software application under different operational workloads. Cloud computing environments, for example Software as a Service (SaaS) may have multiple customers using a common service. Each customer should potentially have unique workload characteristics.
An underlying service may use a set of software programs and/or an architecture for a typical deployment and/or a specific workload. Any variance from the unique workload characteristics shall deteriorate the performance of the software application resulting in service level agreement violations. Often performance enhancements may have been brought by making changes to a source code, an architecture and/or a hardware capacity of the system.
There may have several performance evaluation techniques based on a software development lifecycle. The performance evaluation techniques may include code profiling, stress tests and/or capacity management.
Mutation testing has been used extensively to verify correctness of a software functionality and/or a test suite to be used to test the software application. The source code and/or an assembly code may be changed to mutate the source code and/or the assembly code. The mutated code may semantically be different from the original source code. The mutated code when run may expect to throw an exception, and/or show incorrect results at a point where the mutated code may have been introduced.
Absence of the exception and/or the incorrect results may prove that a part of the source code of the software application may be redundant and/or the test suite may be inadequate. Thus, may not cover the mutated part of the source code. In other words, the mutation testing may be considered as a means to verify completeness and/or correctness of the test suite.
Software Implemented Fault-injections (SWIFI) may relate to injecting faults in the source code and/or into a memory address of the software application just before execution. The SWIFI techniques may be used to test either fault tolerance of the system, and/or study failure modes. The SWIFI may operate with the assembly code, where at run-time various instructions and/or data may be manipulated to change randomly and/or in a pre-defined location. Such a manipulation may be a single bit and/or a multi bit flip.
The source code of a software application written in high level languages such as COBOL, C, C++, JAVA may have become prevalent. There may have been various consequences of providing the source code for testing the software application. Intellectual property (IP) risk may have been main concern in organizations. There may have the source code changes during the testing of the software application. Bugs may also be added during the testing of the software application.
Software performance evaluation by run-time assembly code execution is disclosed. In one aspect, a method includes evaluating performance of a software application in a data processing system. A plurality of program code lines of the software application stored in one or more computer databases are analyzed by one or more computing devices. The one or more computing devices access the one or more computer databases through a computer network. One or more equivalent program regions within the plurality of program code lines are identified. One or more markers in the identified one or more equivalent program regions are inserted. The one or more markers are stored in the one or more computer databases. The plurality of program code lines are compiled and assembled by a compiler and an assembler respectively to generate an executable code. The executable code includes a plurality of instructions. Performance metrics of the software application is measured recurrently by manipulating the plurality of instructions based on the one or more equivalent program regions identified by the inserted one or more markers and executing the executable code.
The one or more equivalent program regions may include one or more nested control flow statements and loop statements. The executable code may include at least one of an assembly code and a byte code. The executable code may be executed by subjecting the executable code to one or more workloads stored in the one or more computer databases. The plurality of instructions may be manipulated by changing at least one of a sequence and one or more values of the plurality of instructions based on the one or more equivalent program regions. An optimal sequence of the plurality of instructions in the executable code may be determined. The optimal sequence may be stored in the one or more computer databases.
In another aspect, a system for evaluating performance of a software application is disclosed. The system includes one or more computer databases, associated through a computer network. The system further includes one or more computing devices, associated through the one or more computer databases. The one or more computing devices analyzes a plurality of program code lines of the software application stored in the one or more computer database, identifies one or more equivalent program regions within the plurality of program code lines. The one or more computing devices further inserts one or more markers in the identified one or more equivalent program regions and stores the one or more markers in the one or more computer databases. The plurality of program code lines are compiled and assembled by a compiler and an assembler respectively to generate an executable assembly code. The executable code includes a plurality of instructions. A performance measuring unit associated through the one or more computing devices measures performance metrics of the software application by manipulating the plurality of instructions based on the one or more equivalent program regions identified by the inserted one or more markers and executing the executable code.
In a further aspect a computer program product comprising a non-transitory computer usable medium having a computer readable program code embodied therein for evaluating performance of a software application in a data processing system. The computer program product includes analyzing by one or more computing devices a plurality of program code lines of the software application stored in a one or more computer databases. The one or more computing devices access the one or more computer databases through a computer network. One or more equivalent program regions within the plurality of program code lines are identified. One or more markers in the identified one or more equivalent program regions are inserted. The one or more markers are stored in the one or more computer databases. The plurality of program code lines are compiled and assembled by a compiler and an assembler respectively to generate an executable code. The executable code includes a plurality of instructions. Performance metrics of the software application is measured by manipulating the plurality of instructions based on the one or more equivalent program regions identified by the inserted one or more markers and executing the executable code.
The methods, systems, and apparatuses disclosed herein may be implemented in any means for achieving various aspects, and may be executed in a form of a machine/readable medium embodying a set of instructions that, when executed by a machine, cause the machine to perform any of the operations disclosed herein. Other features will be apparent from the accompanying drawings and from the detailed description that follows.
Example embodiments are illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements and in which:
Other features of the present embodiments will be apparent from the accompanying drawings and from the detailed description that follows.
Software application performance evaluation by run-time assembly code execution method and system is disclosed. In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the various embodiments. It will be evident, however to one skilled in the art that the various embodiments may be practiced without these specific details.
In various embodiments of the present invention, performance of a software application may be evaluated with a set of equivalent program regions. Each equivalent program region may have the same functional and/or semantic behavior but may differ in a performance outcome of the software application. The performance difference emerging from the set of equivalent program regions may form a basis for determining the optimal solution for the software application under a specific workload.
The equivalent program regions may be executed in an automated and intelligent manner. The executions may happen without any changes in a source code of the software application. There are several factors involving access to the source code of the software application. For example, the source code may not be available, the source code reading may not be permissible due to intellectual property issues, changing the source code to perform changes may result in inadvertent bugs to be introduced in the source code. Thus, an assembly level code changes may be performed for determining the performance of the software application.
In other embodiments of the present invention, performance evaluation of the software application may happen by executing the specific workload and measuring of a set of performance metrics. The set of performance metrics being an application response time, an application throughput and/or a system resource utilization. The set of performance metrics may reflect the performance of the software application for the specific operational workload. With changes to the workload and/or an operating environment (hardware and/or software) the performance of the software application may change.
Markers may be defined in the program code lines to mark the identified one or more equivalent program regions for automatic identification during run-time. The markers may provide very exact feedback from known points in the software application. Values and/or properties associated with the markers may create a marker property. The marker property may form run-time conditions for the markers. The markers may be stored in one or more computer databases 108, 110, 112, 114.
The marker property may be for example, a toggle between ‘process per connection’ and/or ‘shared connection’. A polling frequency of the software application may also be set by the marker. In case of polling time being too frequent, resources may be wasted, and in case the polling time being too long the performance may be degraded. To overcome subjectivity of “too frequent” or “too long”, the markers may be inserted objectively and/or quantitatively to set arrival rate correctly. The markers may also be inserted in case statements. The markers may be inserted even in multiple independent filters. Interchanging data of the filters may not impact the program code lines of the software application semantically, but may impact the performance.
In accordance with an example embodiment of the present invention, the program code lines may be in any of a variety of programming languages such as C, C++, COBOL, JAVA, Visual C, PHP, XML and/or Pascal, the marker property may be programming language-independent. Thus, the marker property may be same for two programming languages with different syntax.
The program code lines may be compiled and assembled by a compiler 116 and an assembler 118 respectively to generate an executable code. The executable code may have a plurality of instructions. The executable code may include an assembly code and/or a byte code. The markers may be easily identified in the executable code. The one or more equivalent program regions may be identified in the executable code based on the inserted markers. Performance metrics of the software application may be measured through a performance measuring unit 120 by repeating a process of manipulating the plurality of instructions based on the one or more equivalent program regions identified by the inserted one or more markers and executing the executable code. The performance metrics may be stored in the one or more computer databases 108, 110, 112, 114.
A combination of one or more workloads may be applied at operation 210 and the operations 206, 208 and 210 may be repeated to obtain various performance metrics of the software application. The operations 206, 208 and 210 may be optimized based on various algorithms. For example, the performance metrics of the software application may be measured by only executing the plurality of instructions corresponding to the equivalent program regions with an immediate boundary of the plurality of instructions.
The various performance metrics may be stored in the one or more computer databases 108, 110, 112, 114. At operation 212, an optimal sequence of the plurality of instructions under a specific workload may be determined based on the various performance metrics. An accurate estimation of performance of the software application may be achieved, as the estimation may rely on the execution of the assembly code.
For example, during a testing phase of the software application, the equivalent program regions may be identified in the assembly code based on the markers. The sequence of the instructions may be changed in the assembly code. The software application may be subjected to different workloads. The various performance metrics of the software application shall be measured by testing the software application with the different sequences of instructions in the assembly code under the different workloads. The optimal sequence of the instructions in the executable code may be determined. Thus, best performance under a specific sequence of instructions and a particular workload may be determined.
For example, if a client specifies that the software application shall be used by approximately 5000 users. Out of 5000 users approximately 3000 users shall be updating data in the software application and 2000 users shall be inserting data in the software application. An appropriate sequence of the plurality of instructions in the assembly code may be determined for best performance of the software application based on the various performance metrics.
In case of cloud computing environments, especially Software-as-a-Service, multiple clients may access a software service. Each client shall have a specific workload characteristic and Quality of Service (QoS) requirements. The optimal sequence of the plurality of instructions in the executable code may be determined for each client. Thus, a particular code base for the software application may solve purpose for various clients.
Another example embodiment of the present invention includes analyzing by the one or more computing devices the plurality of program code lines of the software application stored in the one or more computer databases. Further, the one or more equivalent program regions within the plurality of program code lines may be identified. The one or more markers in the identified one or more equivalent program regions may be inserted and stored in the one or more computer databases. Further, the plurality of program code lines may be compiled and assembled respectively to generate the executable code. The executable code may include the plurality of instructions. Further, the performance metrics of the software application may be measured for the specific workload, by manipulating the plurality of instructions based on the one or more equivalent program regions identified by the inserted one or more markers and executing the executable code, while maintaining semantics of the plurality of program code lines same.
In a networked deployment, the machine may operate in the capacity of a server and/or a client machine in server-client network environment, and or as a peer machine in a peer-to-peer (or distributed) network environment. The machine may be a personal-computer (PC), a tablet PC, a set-top box (STB), a Personal Digital Assistant (PDA), a cellular telephone, a web appliance, a network router, switch and or bridge, an embedded system and/or any machine capable of executing a set of instructions (sequential and/or otherwise) that specify actions to be taken by that machine. Further, while only a single machine is illustrated, the term “machine” shall also be taken to include any collection of machines that individually and/or jointly execute a set (or multiple sets) of instructions to perform any one and/or more of the methodologies discussed herein.
The example computer system 600 includes a processor 602 (e.g., a central processing unit (CPU) a graphics processing unit (GPU) and/or both), a main memory 604 and a static memory 606, which communicate with each other via a bus 608. The computer system 600 may further include a video display unit 610 (e.g., a liquid crystal displays (LCD) and/or a cathode ray tube (CRT)). The computer system 600 also includes an alphanumeric input device 612 (e.g., a keyboard), a cursor control device 614 (e.g., a mouse), a disk drive unit 616, a signal generation device 618 (e.g., a speaker) and a network interface device 620.
The disk drive unit 616 includes a machine-readable medium 622 on which is stored one or more sets of instructions 624 (e.g., software) embodying any one or more of the methodologies and/or functions described herein. The instructions 624 may also reside, completely and/or at least partially, within the main memory 604 and/or within the processor 602 during execution thereof by the computer system 600, the main memory 604 and the processor 602 also constituting machine-readable media.
The instructions 624 may further be transmitted and/or received over a network 106 via the network interface device 620. While the machine-readable medium 622 is shown in an example embodiment to be a single medium, the term “machine-readable medium” should be taken to include a single medium and/or multiple media (e.g., a centralized and/or distributed database, and/or associated caches and servers) that store the one or more sets of instructions. The term “machine-readable medium” shall also be taken to include any medium that is capable of storing, encoding and/or carrying a set of instructions for execution by the machine and that cause the machine to perform any one or more of the methodologies of the various embodiments. The term “machine-readable medium” shall accordingly be taken to include, but not be limited to, solid-state memories, optical and magnetic media, and carrier wave signals.
In addition, it will be appreciated that the various operations, processes, and methods disclosed herein may be embodied in a machine-readable medium and/or a machine accessible medium compatible with a data processing system (e.g., a computer system), and may be performed in any order. The modules in the figures are shown as distinct and communicating with only a few specific module and not others. The modules may be merged with each other, may perform overlapping functions, and may communicate with other modules not shown to be connected in the Figures. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.
Number | Date | Country | Kind |
---|---|---|---|
680/CHE/2014 | Feb 2014 | IN | national |
Number | Name | Date | Kind |
---|---|---|---|
5047919 | Sterling | Sep 1991 | A |
5265254 | Blasciak | Nov 1993 | A |
5530964 | Alpert et al. | Jun 1996 | A |
6006033 | Heisch | Dec 1999 | A |
6282707 | Isozaki | Aug 2001 | B1 |
6345384 | Sato | Feb 2002 | B1 |
6922829 | Ward | Jul 2005 | B2 |
6970805 | Bierma et al. | Nov 2005 | B1 |
7020905 | Chiang | Apr 2006 | B2 |
7203936 | Gillies et al. | Apr 2007 | B2 |
7373550 | Brawn et al. | May 2008 | B2 |
7657881 | Nagendra | Feb 2010 | B2 |
7908593 | Arnold | Mar 2011 | B2 |
7954095 | Archer | May 2011 | B2 |
8225284 | Larsen | Jul 2012 | B2 |
20010032332 | Ward | Oct 2001 | A1 |
20020184615 | Sumner | Dec 2002 | A1 |
20050114736 | Larsen | May 2005 | A1 |
20060129997 | Stichnoth | Jun 2006 | A1 |
20060136712 | Nagendra | Jun 2006 | A1 |
20080101232 | Archer | May 2008 | A1 |
20080168433 | Arnold | Jul 2008 | A1 |
Entry |
---|
Anonymous, “Control Flow” Mathworks [online], 2012 [retrieved Sep. 8, 2015], Retreived from Internet: <URL: https://web.archive.org/web/20121104072804/http://www.mathworks.com/help/matlab/learn_matlab/flow-control.html?>, pp. 1-6. |
Anonymous, “assemble”, IEEE 100: The Authoritative Dictionary of IEEE Standards Terms, IEEE, 7th Ed., 2000, pp. 54-55. |
Pettis, K., et al., Profile Guided Code Positioning, Proceedings of the ACM SIGPLAN '90 Conf. on Programming Language Design and Implementation [online], 1990 [retrieved Apr. 24, 2017], Retreived from Internet: <URL: http://perso.ensta-paristech.fr/˜bmonsuez/Cours/B6-4/Articles/papers15.pdf>, pp. 16-27. |
Number | Date | Country | |
---|---|---|---|
20150227448 A1 | Aug 2015 | US |