1. Field of the Invention
This invention relates to program development, and particularly to a method, system, and computer program product for providing real-time developer feedback in an integrated development environment.
2. Description of Background
Currently, new and less experienced software developers are expected to learn, keep up with, and adapt to the accelerated pace of a programming environment. In many instances, programmers are required to self-train with respect to a given project. A common challenge that new programmers face is how to avoid repeating common programming errors without having to wait for peer code reviews and before receiving defect reports from the field.
What is needed, therefore, is a way to provide real-time feedback to program developers during the program writing process for assisting the programmers in learning the job, as well as avoiding common or repeatable errors.
The shortcomings of the prior art are overcome and additional advantages are provided through the provision of a system, method, and computer program product for providing real-time developer feedback. The system includes a computer executing a source code editor; a feedback repository storing a programming language (PL) construct database, a profile database, and a general database; and a lexical analyzer executing on the computer. The analyzer parses code entered by a user into the editor. The constructs are used to search the construct database to determine a PL used in entering the code. The PL is used to search the construct database to determine a construct type. The analyzer searches the profile database for a developer profile of the user for the construct type. In response to a hit resulting from the search of the developer profile, the analyzer determines a frequency of occurrence of the construct type associated with the hit, identifies a cue assigned to the frequency of occurrence for the construct type, and delivers the cue to the computer.
Additional features and advantages are realized through the techniques of the present invention. Other embodiments and aspects of the invention are described in detail herein and are considered a part of the claimed invention. For a better understanding of the invention with advantages and features, refer to the description and to the drawings.
As a result of the summarized invention, technically we have achieved a solution that provides real-time, programmer-specific and general feedback, which is based on historical data that is relevant to the tasks being performed. The historical data is aggregated and organized to provide indicators and suggestions to guide the programmer during development. Rule-based filtering and heuristics define how and when the programmer is notified of a possible infraction, which can dramatically reduce the number of defects originating from code and increase quality and developer knowledge.
The subject matter which is regarded as the invention is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other objects, features, and advantages of the invention are apparent from the following detailed description taken in conjunction with the accompanying drawings in which:
The detailed description explains the preferred embodiments of the invention, together with advantages and features, by way of example with reference to the drawings.
Exemplary embodiments of the invention are directed to a method, system, and computer program product for providing real-time, programmer-specific and general feedback, which is based on historical data that is relevant to the tasks being performed. The historical data is aggregated and organized to provide indicators and suggestions to guide the programmer during development. Rule-based filtering and heuristics define how and when the programmer is notified of a possible infraction, which can dramatically reduce the number of defects originating from code and increase quality and developer knowledge.
An exemplary embodiment of the invention discloses a mechanism for software developers to get real-time feedback on their most common programming mistakes while in the actual process of writing code. This mechanism promotes self-learning (from previous mistakes), and prevents these commonly made coding mistakes from being made in the first place, thereby decreasing the number of defects found during the verification cycle and increasing overall product quality delivery.
There are historical data which can be gathered through the lifetime of a product's development and this data can be used to guide other developers. Some of these data can come from data inputs, such as: orthogonal defect classification results, a defect tracking system, automated code reviews and team code reviews, historical compilation errors, and a source control system, to name a few.
Turning now to the system of
The feedback repository 104 includes data repositories with databases relating to information used in by the IDE and may be implemented using a variety of devices for storing electronic information. It is understood that the feedback repository 104 may be implemented using memory contained in one or more computer devices or that it may be comprised of separate physical devices. The feedback repository 104 may be logically addressable as a consolidated data source across a distributed environment that includes the user system 102. Information stored in the feedback repository 104 may be retrieved and manipulated via the user system 102. In an exemplary embodiment, the feedback repository 104 processes data from various information sources (e.g., sources 106, 108, 110, and 112). These sources of information are used by the lexical analyzer 116 to provide real-time programmer feedback. Orthogonal defect classification (ODC) techniques may be used to identify defect types and categories. The results of these techniques are stored in orthogonal defect classification database 106. A defect tracking system may be employed as part of the programming environment of
A source control system may be employed by the programming environment to identify and track versions of source code applications for a development project. This information is stored in source control system 110. In addition, code reviews may be conducted by members of the programming environment. The code reviews may be automated, e.g., via a tool which checks source code against a set of rules based on the runtime environment or development rules. Alternatively, or in addition thereto, the programming environment may implement team/peer code reviews. This information is stored in code review database 112. These, and other, types of information sources may be utilized in implementing the real-time programmer feedback processes.
The information sources 106, 108, 110, and 112 are processed by the lexical analyzer 116 (in part) to produce databases of information referred to as programming language (PL) constructs database 120, developer profiles database 122, and general database 124. The information sources assist in tracking the defects attributed to a particular developer, which is then used to assess the frequency of occurrence of the defects in proportion to the number of times the construct associated with the defects is used. This process is described further herein.
The PL constructs database 120 stores a comprehensive list of PL constructs (e.g., assignment, condition checking, iteration, counter increase, etc.). There may be separate databases for each programming language used by the IDE of the system of
The developer profile database 122 contains lists which reference the list in PL database 120. These lists represent developer profiles and indicate the mistakes most frequently made by the developer (i.e., developer-specific information). A sample developer profile record 300 is shown and described in
General database 124 stores a comprehensive list of general hints and tips, best practices, standard specification information applicable to each of the programming languages and their corresponding constructs, which are used by the IDE of
Turning now to
At step 202, the developer enters code via the source code editor 114 and user system 102. The lexical analyzer 116 breaks down the source code to examine the kinds of constructs within the line currently being typed at step 204. At step 206, the lexical analyzer 116 identifies the programming language for the construct being typed, accesses the PL database 120 for the programming language, and determines the programming construct type defined for the programming language. The lexical analyzer 116 is configured to track the number of times the developer uses each construct. This information is used in conjunction with the defects tracked (e.g., via databases 106/108) to assess the relative frequency of occurrence. This frequency is stored in the developer's profile in database 122 and is updated as the developer continues to enter code.
Using the construct type, the lexical analyzer 116 accesses the developer profile database 122 and the general database 124 and searches these databases 122 and 124 for the construct type at step 208. The search in the developer profile database 122 (in particular, the profile specific to the developer) is performed to determine whether the developer is writing code in the area where he/she commonly makes mistakes. For example, the developer could be writing a line of code where a variable is being assigned to another variable (e.g., myNewValue=myOldValue), and the profile in database 122 has an entry that identifies that this particular developer has historically made mistakes in assigning variables.
If there is no match at step 210, the lexical analyzer 116 takes no further action at step 212 and the programmer continues to enter code via the source code editor 114. Otherwise, if there is a match at step 210, one or both of two paths may be followed as will now be described. If the hit is in the developer profile database 122 only, then steps 214-220 are performed. If the hit is in the general database 124 only, then step 222 is performed. If the hit is in both databases 122 and 124, then both paths are followed.
If the hit is in the developer profile database 122, the lexical analyzer 116 checks the developer profile in database 122 to determine the frequency of the occurrence of mistake that is identified by the hit at step 214. In an exemplary embodiment, the frequency determines what type of cue will be presented to the developer. For example, if the frequency is relatively high, the line of code being written may be highlighted in red. A medium range frequency may result in the line of code being highlighted in yellow. The frequency level determinations (i.e., high versus medium) may be defined by members of the IDE of
Turning back to step 210, if the hit is in the general database 124, the lexical analyzer 116 retrieves the associated information stored in the general database that relates to the hit, and presents the information to the developer at step 222 and is described below. The process then returns to step 202.
A sample computer screen window 400 illustrating a portion of code 402 entered by the developer via the user system 102 is shown in
From the developer profile:
From the general database 124:
In an alternative embodiment, the data from the developer profiles may be aggregated across the integrated development environment in which the information provided to the developer during code writing is specific to a group of developers whose defect data is aggregated from one or more of the information sources of databases 106, 108, 110, and 112.
The capabilities of the present invention can be implemented in software, firmware, hardware or some combination thereof.
As one example, one or more aspects of the present invention can be included in an article of manufacture (e.g., one or more computer program products) having, for instance, computer usable media. The media has embodied therein, for instance, computer readable program code means for providing and facilitating the capabilities of the present invention. The article of manufacture can be included as a part of a computer system or sold separately.
Additionally, at least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform the capabilities of the present invention can be provided.
The flow diagrams depicted herein are just examples. There may be many variations to these diagrams or the steps (or operations) described therein without departing from the spirit of the invention. For instance, the steps may be performed in a differing order, or steps may be added, deleted or modified. All of these variations are considered a part of the claimed invention.
While the preferred embodiment to the invention has been described, it will be understood that those skilled in the art, both now and in the future, may make various improvements and enhancements which fall within the scope of the claims which follow. These claims should be construed to maintain the proper protection for the invention first described.
Number | Name | Date | Kind |
---|---|---|---|
4751635 | Kret | Jun 1988 | A |
5548718 | Siegel et al. | Aug 1996 | A |
5754737 | Gipson | May 1998 | A |
5778402 | Gipson | Jul 1998 | A |
5960196 | Carrier et al. | Sep 1999 | A |
6016467 | Newsted et al. | Jan 2000 | A |
6026233 | Shulman et al. | Feb 2000 | A |
6305008 | Vaidyanathan et al. | Oct 2001 | B1 |
6314559 | Sollich | Nov 2001 | B1 |
6467081 | Vaidyanathan et al. | Oct 2002 | B2 |
6502233 | Vaidyanathan et al. | Dec 2002 | B1 |
6820075 | Shanahan et al. | Nov 2004 | B2 |
6965990 | Barsness et al. | Nov 2005 | B2 |
7272823 | Ball | Sep 2007 | B2 |
7296264 | Zatloukal et al. | Nov 2007 | B2 |
7313784 | Hawley et al. | Dec 2007 | B2 |
7322023 | Shulman et al. | Jan 2008 | B2 |
7373634 | Hawley et al. | May 2008 | B2 |
7451439 | Nickell et al. | Nov 2008 | B2 |
7464119 | Akram et al. | Dec 2008 | B1 |
20020016953 | Sollich | Feb 2002 | A1 |
20020095657 | Vaidyanathan et al. | Jul 2002 | A1 |
20040003335 | Gertz et al. | Jan 2004 | A1 |
20040040014 | Ball | Feb 2004 | A1 |
20040153995 | Polonovski | Aug 2004 | A1 |
20040199904 | Schmidt | Oct 2004 | A1 |
20040230964 | Waugh et al. | Nov 2004 | A1 |
20050015747 | Zatloukal et al. | Jan 2005 | A1 |
20050114771 | Piehler et al. | May 2005 | A1 |
20050125767 | Hawley et al. | Jun 2005 | A1 |
20050125773 | Hawley et al. | Jun 2005 | A1 |
20050289503 | Clifford | Dec 2005 | A1 |
20060277525 | Najmabadi et al. | Dec 2006 | A1 |
20070168946 | Drissi et al. | Jul 2007 | A1 |
20070226546 | Asthana et al. | Sep 2007 | A1 |
20070250816 | Rose | Oct 2007 | A1 |
20070288910 | Bhat et al. | Dec 2007 | A1 |
20080155508 | Sarkar et al. | Jun 2008 | A1 |
20080184209 | LaFrance-Linden | Jul 2008 | A1 |
20080189688 | Schmidt | Aug 2008 | A1 |
Number | Date | Country |
---|---|---|
WO-2007041242 | Apr 2007 | WO |