1. Field
This invention relates to methods and systems for recommending next components, and particularly to methods and systems for recommending next components based on common usage patterns.
2. Description of Background
A mashup is a web application that combines data from multiple sources into a single integrated tool, thereby creating new and distinct web services. Mashup development systems typically present users with component options that can be used for assembling these mashups. Depending on the source, the component options can be numerous.
At each stage in the development process of these mashups or applications, the developer must know which components to select. Minimal assistance in the selection process is offered by the development system. In one example, the development system provides a set of categories from which components can be selected. Particularly for novice developers, the problem of selecting the “right” next component from these categories can be daunting, especially in cases where there are multiple unrelated sources.
The shortcomings of the prior art are overcome and additional advantages are provided through the provision of a method of recommending a next component. The method includes: identifying one or more candidate software applications based on a first similarity metric, wherein the one or more candidate software applications include one or more reusable software components; identifying one or more candidate software components from the one or more reusable software components based on a second similarity metric; estimating a score for each of the one or more candidate software components based on a composition of the one or more candidate software applications; and generating a recommendation based on the scores of each of the one or more candidate components.
Additional features and advantages are realized through the techniques of the present invention. Other embodiments and aspects of the invention are described in detail herein and are considered a part of the claimed invention. For a better understanding of the invention with advantages and features, refer to the description and to the drawings.
As a result of the summarized invention, a developer can be given a set of specific recommendations that are more likely to be useful. The developer can be confident in the recommendation because it is based on a similarity between the application being assembled and already-built applications.
The subject matter which is regarded as the invention is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other objects, features, and advantages of the invention are apparent from the following detailed description taken in conjunction with the accompanying drawings.
The detailed description explains the preferred embodiments of the invention, together with advantages and features, by way of example with reference to the drawings.
An exemplary embodiment of the present invention provides a system, method and computer program product for collecting information from previously constructed applications, and using that information to recommend next components to developers. The recommendations are based on a set of already assembled components within a partially-built application.
Turning now to
The computer 101 is shown to include a processor 102, memory 104 coupled to a memory controller 106, one or more input and/or output (I/O) devices 108, 110 (or peripherals) that are communicatively coupled via a local input/output controller 112, and a display controller 114 coupled to a display 116. In an exemplary embodiment, the system 100 can further include a network interface 118 for coupling to a network 120. The network 120 transmits and receives data between the computer 101 and external systems. In an exemplary embodiment, a conventional keyboard 122 and mouse 124 can be coupled to the input/output controller 112.
In various embodiments, the memory 104 stores instructions that can be executed by the processor 102. The instructions stored in memory 104 may include one or more separate programs, each of which comprises an ordered listing of executable instructions for implementing logical functions. In the example of
When the computer 101 is in operation, the processor 102 is configured to execute the instructions stored within the memory 104, to communicate data to and from the memory 104, and to generally control operations of the computer 101 pursuant to the instructions. The processor 102 can be any custom made or commercially available processor, a central processing unit (CPU), an auxiliary processor among several processors associated with the computer 101, a semiconductor based microprocessor (in the form of a microchip or chip set), a macroprocessor, or generally any device for executing instructions.
The processor 102 executes the instructions of the component recommendation system 128 of the present disclosure. In various embodiments, the component recommendation system 128 of the present disclosure is stored in the memory 104 (as shown), is executed from a portable storage device (e.g., CD-ROM, Diskette, FlashDrive, etc.) (not shown), and/or is run from a remote location, such as from a central server (not shown).
As shown in
In one example, the partial application 138 and the candidate applications 140a-140n are any composite software-based entities, for example, web pages, or application user interfaces (UIs). In this case, the recommendation 140 can be used by a developer to select an appropriate next component during development. As can be appreciated, the partial application 138 and the candidate applications 140a-140n can be any software, hardware, or service that is defined by one or more reusable sub-components.
Turning now to
The application matcher 130 extracts the set of matching applications 136 by comparing the candidate applications 140a, 140b, 140c, and 140d to the partially constructed application 138 and by filtering out all dissimilar candidate applications (e.g., candidate application 140c). As can be appreciated, a variety of factors can be used to determine similarity/dissimilarity, including, but not limited to, explicit naming/tagging of the applications or components, the presence of identical or similar components, and a pattern of connections between components. As can be appreciated, component based similarity can be based on identity (containing exactly the same set of components that are in the partially constructed application), partial matching (some of the components are in the partially constructed application), inexact matching (matching components that have similar function, but are not identical), and/or any other matching schemes. Based on the similarities of the non-filtered candidate applications with the partially constructed application, the application matcher 130 then identifies similar portions 156, for example, by labeling the components and/or connections of the application as similar.
Turning now to
Turning now to
The accumulator 164 generates the recommendation 144 based on the scores 166. In one example, the accumulator 164 computes a normalized score 168 for each component type in the component set 142 based on the associated scores 166. The normalized score 168 is set equal to the sum of all scores 166 for the similar or the same components divided by the total number of different components. The accumulator 164 then ranks the components based on the normalized score 168 and generates the recommendation 144 based on the ranking. The recommendation 144 includes the predicted next component 170 and the normalized score 168.
Turning now to
In one example, the method may begin at block 300. All the candidate applications are added to the matching application set at block 302. The partially constructed application is then evaluated at block 304. For each component (M) of the partially constructed application at block 304, each candidate application (A) is evaluated to determine whether the candidate application includes the same component (M) or a similar component (M′) at block 308. If the candidate application contains a same or similar instance of the component of the partially constructed application at block 308, then the component of the candidate application is labeled to indicate that it is the same or similar at block 310. Otherwise, if the candidate application does not contain a same or similar instance of the component of the partially constructed application at block 308, the candidate application is removed from the application set at block 312.
The method iterates on all components in the partially constructed application. Once processing of each component is complete at block 304, the method may end at block 314.
Turning now to
In one example, the method may begin at block 400. Each application in the application set is processed at block 402. For each application (A) at block 402, an empty entry (E), in the next component set is created at block 404. Each component in the application is then processed at block 406. For each component (M) in the application at block 406, a check is made to see if the component has been labeled at block 408, as described above. If the component is not labeled at block 408, then the component is a potential “next component” and is added to the corresponding entry of the component set at block 410. The score is estimated and saved to the corresponding entry at block 411. If however, the component is labeled at block 408, the method continues to process the next components at block 406.
Once each component, for each application in the application set has been processed at block 402-411, the recommendation is generated at blocks 412-424. In one example, an empty accumulation set is created at block 412. Each entry (E) in the component set is then processed at block 414. For each entry in the component set at block 414, and for each component (M) in the entry, the corresponding score is added to a cumulative score for that component type at block 417 and a frequency count is incremented at block 418. The component, cumulative score, and the frequency count are then added as an entry to the accumulation set for that component type at block 418.
Once all entries in the next component set have been processed at block 414, the cumulative scores are normalized for each entry in the accumulation set at block 420. Normalization is performed by dividing the cumulative score for an entry by the frequency count for that entry. Once normalized at block 420, the normalized scores can be used as a sort key, to sort or rank the entries in the accumulation set at block 422. Finally, the components in the accumulation set or a subset of the components are presented in sort order (e.g., high to low) as recommendations at block 424. Thereafter, the method may end at block 426.
As described above, the recommendations are based on the structure of previously-constructed applications. As can be appreciated, a variety of other types of information, beyond structure, might be employed to generate the recommendations. Such other information can include, but is not limited to: application-level tagging or metadata, component-level tagging or metadata, runtime behavior (accumulated via logging, for example), developer-imposed structure (such as groupings), etc. As can also be appreciated, knowledge of the component actually selected by the developer can be used by the component recommendation system 128 (
In various embodiments, a selection of a component within the recommendation by the developer can be fed back into the system to improve future recommendations. In one example, the selected component is fed back into the next component extractor 132 (
The capabilities of the present invention can be implemented in software, firmware, hardware or some combination thereof.
As one example, one or more aspects of the present invention can be included in an article of manufacture (e.g., one or more computer program products) having, for instance, computer usable media. The media has embodied therein, for instance, computer readable program code means for providing and facilitating the capabilities of the present invention. The article of manufacture can be included as a part of a computer system or sold separately.
Additionally, at least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform the capabilities of the present invention can be provided.
The flow diagrams depicted herein are just examples. There may be many variations to these diagrams or the steps (or operations) described therein without departing from the spirit of the invention. For instance, the steps may be performed in a differing order, or steps may be added, deleted or modified. All of these variations are considered a part of the claimed invention.
While the preferred embodiment to the invention has been described, it will be understood that those skilled in the art, both now and in the future, may make various improvements and enhancements which fall within the scope of the claims which follow. These claims should be construed to maintain the proper protection for the invention first described.