1. Field of the Invention
The present invention relates to the field of content searching and more particularly to specifying a content search according to multiple different search terms.
2. Description of the Related Art
Content browsing refers to the retrieval and presentation of electronic content in a browser client. Content generally can include electronic documents, messages, audio, audiovisual and video materials and imagery. Most commonly, content can be stored in a server environment and published for access by content consumers over a computer communications network such as the global Internet. Content consumers, in turn, can retrieve content over the network by reference to a network address for the content. Once retrieved, the content can be presented in a browser client including not only conventional visual browsers such as the venerable Web browser, but also in alternative browsers such as those deployed in pervasive devices and those supporting different modes of presentation such as the audible presentation of material.
Given the vast amount of content published for accessibility over the Internet, one cannot enjoy an awareness of a possible content of interest available for access over the Internet. Search engines fill the gap by providing an interface through which end users can instruct searches for documents according to one or more search terms. Basic search engines permit simple term searches where the presence of one or more search terms in content result in the return of a reference to the located content. More sophisticated search engines permit boolean searching and even more sophisticated search engines allow for natural language searching.
Where search terms are provided in a search engine, the results often reflect the relevance of a “hit” based upon a percentage of search terms present in the located content. Yet, in many cases, the search terms provided are not intended to be located in desired content. Rather, search engines frequently permit one to require the omission of any content containing a search term. In boolean terms, the NOT operation is permitted as well as the AND and OR operations. In any event, limiting the end user to providing search terms on an absolute basis of either “is present” or “is absent” reflects the reality that users in many cases lack a familiarity of particular search domain as most searches are ad hoc in nature.
Even still, from time to time users enjoy a substantial understanding of a search domain. In consequence, limiting such users to absolutes can be counterproductive. For example, when searching for content including terms A, B and C, but not D, one may recognize that in the search domain of interest, relevant content must include term A, and may include terms B and C, but should more often than not include term B in respect to term C. Further, relevant content should not include term D usually, but once in a while relevant content may include term D. Conventional search engines cannot support this type of search as a term is either included or excluded in relevant content from the perspective of the conventional search engine. Notwithstanding it would desirable to permit such users to emphasize the importance of one search term over another.
Embodiments of the present invention address deficiencies of the art in respect to content searching and provide a novel and non-obvious method, system and computer program product for specifying weighted search terms for a search engine. In an embodiment of the invention, a content searching data processing system can be configured for specifying weighted search terms for a search engine. The system can include a search engine executing in a host server. The search engine can be coupled to a search index and can be configured for communicative coupling to different content browsers executing in respective clients over a computer communications network. Finally, the system can include weighted search term logic coupled to the search engine. The logic can include program code enabled to assign variable weights to corresponding search terms and to issue a search to the search engine with the search terms and variable weights and to return results of the search to a requesting one of the content browsers.
In one aspect of the embodiment, the program code of the weighted search term logic further can be enabled to render a search term entry user interface in which the variable weights are specified for corresponding ones of the search terms. For instance, the search term user interface can include an iconic representation of each of the search terms, and a target comprising different portions for different variable weights, the target being configured for dragging-and-dropping of each iconic representation to assign a variable weight to a search term for each iconic representation. For instance, the different portions for different variable weights, can include different portions for different variable weights ranging from strongly exclusive (−1.0) to strongly inclusive (1.0).
As another example, the search term user interface can include a table listing each search term and in which a variable weight is specified manually in the table for each search term. As yet another example, the search term user interface can include a table in which a search term with a corresponding variable weight is specified manually in the table for each search term. As even yet another example, the search term user interface can include a table listing each search term and in which a variable weight is specified by way of a slider control in the table for each search term. As an additional example, the search term user interface can include a table listing each search term and in which a variable weight is specified by way of a slider control in the table responsive to a context menu selection for each search term. As yet a further example, the search term user interface can provide a text field into which each search term can be specified, and further into which a suffix can be provided to each search term to indicate a degree of relevance of the search term.
In another embodiment of the invention, a method for specifying weighted search terms for a search engine can be provided. The method can include assigning variable weights to corresponding search terms for a content search to be conducted by a search engine. The method further can include issuing a search to a search engine with the search terms and assigned variable weights. Even yet further, the method can include returning results of the search to a requesting content browser. In one aspect of the embodiment, assigning variable weights to corresponding search terms for a content search to be conducted by a search engine, can include rendering a search term entry user interface to receive the assignment of the variable weights.
Additional aspects of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The aspects of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
The accompanying drawings, which are incorporated in and constitute part of this specification, illustrate embodiments of the invention and together with the description, serve to explain the principles of the invention. The embodiments illustrated herein are presently preferred, it being understood, however, that the invention is not limited to the precise arrangements and instrumentalities shown, wherein:
Embodiments of the present invention provide a method, system and computer program product for specifying weighted search terms for a search engine. In accordance with an embodiment of the present invention, different search terms can be specified for a search of content. The search terms can be both inclusive and exclusive in that some terms are to be present in located content and other terms are to be absent from located content in order to satisfy the search. The presence of the search terms, however, need not be absolute and can be variable in nature. In this regard, a variable weight can be applied to one or more of the search terms such that while it may be preferred that the search term is present or absent from located content, as the case may be, it is not required and depending upon the presence or absence of other search terms of more significant weight, located content containing or excluding specified search terms can be returned as part of search results for the search.
The variable weights for the search terms can be specified through a search term entry interface. By way of example,
The relative weights shown in the target 120 can range from not relevant (0.0 value) to most relevant (1.0) value. Alternatively, the relative weights shown in the target 120 can range from strongly exclusive (−1.0 value) to not relevant (0.0) to strongly inclusive (1.0) value. In this regard, a value of +1 can represent extreme inclusion, meaning that, on the basis of this search term, the content associated with this search term has the highest likelihood of being returned as part of the search results. Conversely, a value of −1 represent extreme exclusion, meaning that, on the basis of this search term, the content associated with this search term has the lowest likelihood of being returned as part of the search results. In practice, the determination of whether a particular content appears in the search results depends on the combination of search terms and their weights. In any event, each iconic representation 130 of a search term can be dragged and dropped onto a portion of the target 120. The placement of the iconic representation 130 can result in an assignment of an associated variable weight to an underlying search term. In this way, the variable weights for each search term can be applied intuitively in a familiar graphical user interface.
The search term user interface 110A, 110B, 110C, 110D, 110E shown in
Notably, weighted search term logic 300 can be coupled to the search engine 240 by way of host server 210. The search term logic 300 can include program code enabled to prompt an end user through a respective content browsers 250 for variable weights specified in a search request 270. The prompt can be provided within a search term entry user interface. Once received, the search index 260 can be searched for results in accordance with the provided search terms and the corresponding weights. Subsequently, the search results 280 can be returned to the end user for rendering in the respective content browser 250 from whence the request 270 originated.
In further illustration of the operation of the weighted search term logic 300,
Embodiments of the invention can take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment containing both hardware and software elements. In a preferred embodiment, the invention is implemented in software, which includes but is not limited to firmware, resident software, microcode, and the like. Furthermore, the invention can take the form of a computer program product accessible from a computer-usable or computer-readable medium providing program code for use by or in connection with a computer or any instruction execution system.
For the purposes of this description, a computer-usable or computer readable medium can be any apparatus that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. The medium can be an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system (or apparatus or device) or a propagation medium. Examples of a computer-readable medium include a semiconductor or solid state memory, magnetic tape, a removable computer diskette, a random access memory (RAM), a read-only memory (ROM), a rigid magnetic disk and an optical disk. Current examples of optical disks include compact disk-read only memory (CD-ROM), compact disk-read/write (CD-R/W) and DVD.
A data processing system suitable for storing and/or executing program code will include at least one processor coupled directly or indirectly to memory elements through a system bus. The memory elements can include local memory employed during actual execution of the program code, bulk storage, and cache memories which provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution. Input/output or I/O devices (including but not limited to keyboards, displays, pointing devices, etc.) can be coupled to the system either directly or through intervening I/O controllers. Network adapters may also be coupled to the system to enable the data processing system to become coupled to other data processing systems or remote printers or storage devices through intervening private or public networks. Modems, cable modem and Ethernet cards are just a few of the currently available types of network adapters.