None.
None.
Embodiments of the present invention relate to a system and method for caching information in order to improve the efficiency of a search engine.
In recent years, computer search systems have become heavily utilized and various search systems compete to provide relevant and rapid results. Since user satisfaction depends upon these factors, search system developers strive to improve search system speed and performance.
Currently, when a user submits a query to a search engine, the query is sent through a front-end server for processing. The front-end server checks the query against information stored in a cache and subsequently processes the request. The cache may either be filled by results of incoming queries or by a pre-fetching mechanism. The pre-fetching mechanism typically processes a list of queries and caches the results. The search system often determines the contents of the cache based upon a least recently used (LRU) strategy. Using the LRU strategy, the search system will maintain its cache size constraints by deleting items from the cache that have not been fetched recently.
If query results cannot be served from the cache, the search system sends the query to an index server or multiple index servers to retrieve search results. The more queries that can be served from the cache as opposed to the index server increases the capacity of a search system and decreases the amount of hardware needed to run the search system.
Thus, a solution is needed that overcomes the shortcomings and drawbacks of the prior art by providing a method that serves more queries from a cache as opposed to the index server in order to increase the capacity of a search system and decrease the amount of hardware needed to run that search system. Thus, a solution is also needed for predicting and caching results based on keywords and phrases that are most likely to be queried by a user of the search engine.
Embodiments of the present invention include a method for implementing information from an advertising system within a search system including a search system cache. The method includes accessing information contained within an advertising database of the advertising system. The method additionally includes generating search results based on the accessed information and storing the accessed information and the generated search results based on the accessed information in the search system cache.
Additional embodiments include a method for implementing information from an advertising system within a search system that includes a search system cache. The search system is linked to an advertising system. The method includes comparing at least one input user query term with information in or extracted from an advertising database. The method additionally includes determining if the information matches the at least one input user query term and storing search results for the input user query term in the search system, possibly with attributes indicating the entry matches information in the advertising database.
In additional embodiments, a system is provided for implementing information from an advertising system within a search system. The system includes an advertising database within the advertising system for storing advertising information. The system additionally includes a caching system within the search system for extracting the information from the advertising database and storing the extracted information in a cache within the caching system.
The present invention is described in detail below with reference to the attached drawings figures, wherein:
I. System Overview
Embodiments of the invention provide a method and system for using data from an advertising system to improve the functionality of a cache utilized within a search system. More specifically, the method and system in accordance with embodiments of the invention operate in an environment including both a search system and an advertising system associated with the search system.
In the advertising system, advertisers purchase keywords and/or phrases that may be used to trigger display of their advertisement. Typically, advertisers research their selections such that their purchased sets of keywords and phrases are a strong predictor of input user queries and are thus far more likely to be queried by a user of the search engine than randomly selected keywords and phrases. Thus, the disclosed method and system extract the list of keywords and phrases from the advertising system and feed that data into the caching system of a search system in order to improve functionality of the search system.
II. Exemplary Operating Environment
The invention is described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Moreover, those skilled in the art will appreciate that the invention may be practiced with other computer system configurations, including hand-held devices, multiprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, and the like. The invention may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote computer storage media including memory storage devices.
With reference to
Computer 110 typically includes a variety of computer readable media. By way of example, and not limitation, computer readable media may comprise computer storage media and communication media. The system memory 130 includes computer storage media in the form of volatile and/or nonvolatile memory such as read only memory (ROM) 131 and random access memory (RAM) 132. A basic input/output system 133 (BIOS), containing the basic routines that help to transfer information between elements within computer 110, such as during start-up, is typically stored in ROM 131. RAM 132 typically contains data and/or program modules that are immediately accessible to and/or presently being operated on by processing unit 120. By way of example, and not limitation,
The computer 110 may also include other removable/nonremovable, volatile/nonvolatile computer storage media. By way of example only,
The drives and their associated computer storage media discussed above and illustrated in
The computer 110 in the present invention will operate in a networked environment using logical connections to one or more remote computers, such as a remote computer 180. The remote computer 180 may be a personal computer, and typically includes many or all of the elements described above relative to the computer 110, although only a memory storage device 181 has been illustrated in
When used in a LAN networking environment, the computer 110 is connected to the LAN 171 through a network interface or adapter 170. When used in a WAN networking environment, the computer 110 typically includes a modem 172 or other means for establishing communications over the WAN 173, such as the Internet. The modem 172, which may be internal or external, may be connected to the system bus 121 via the user input interface 160, or other appropriate mechanism. In a networked environment, program modules depicted relative to the computer 110, or portions thereof, may be stored in the remote memory storage device. By way of example, and not limitation,
Although many other internal components of the computer 110 are not shown, those of ordinary skill in the art will appreciate that such components and the interconnection are well known. Accordingly, additional details concerning the internal construction of the computer 110 need not be disclosed in connection with the present invention.
III. System and Method of the Invention
As set forth above,
Since some search systems have begun to generate revenue by allowing advertisers to bid on specified keywords or phrases, these search systems have already incorporated advertising logic. Each time a phrase that has been bid upon is entered as a user query, an advertiser might pay a pre-set amount to have an advertisement displayed. By incorporating the information contained within the advertising system 50 into the caching system 30, the method and system are capable of efficiently and effectively increasing cache size and cache-hit ratio. Typically, the research that has already been conducted by advertisers 4 shows that the set of keywords and phrases purchased by the advertisers 4 is a strong predictor of the keywords and phrases likely to be queried by a user of the search system 10 than randomly selected keywords and phrases.
The advertising system 50 may be composed of the advertising management component 60 and the advertising database 70. The keywords and phrases purchased by the advertisers 4 may be processed by the advertising management component 60 and stored in the advertising database 70 along with specified constraints on such factors as the language or market of the user query. The advertising database 70 may additionally store advertisements for display upon that are triggered by entry of the purchased keywords and phrases within the specified constraints. The list of keywords and phrases contained within the advertising database 70 may either be updated in real time as keyword advertisements are accepted from the advertiser or asynchronously such that batches of new keywords are added or expired from the system. The advertising database 70 may additionally store tracking information for billing or other purposes, as frequently the advertiser 4 agrees to pay a set amount each time the purchased keywords and phrases generate display of the selected advertisement.
With further reference to the search system 10, upon receiving a user query, the search system 10 will send the query or a command containing the query to the advertising system 50. The search system 10 processes the query through the front-end server 20 and its caching system 30. The caching system 30 communicates with the advertising system 50 and the advertising system 50 compares the query to keywords and phrases purchased by advertisers. If the advertising management component 60 finds any matching advertisements in the advertising database 70, it sends the matching advertisements that are found back to the search system 10 for display with search results obtained through the front-end server 20.
The index server 40, which may actually include multiple servers, is shown as a portion of the search system 10, and processes the query input by the user computer 2 if the caching system 30 is unable to process the query. This would occur in instances when the results for the particular input query are not cached. The use of the index server 40 can slow down query processing given the additional work required to fetch, assemble, and return query results. Thus, the search system 10 strives to maintain an efficient caching system 30 that will have the capability to process as many input queries as possible.
Embodiments of a caching system 30, 80 are illustrated in
In either of the disclosed caching system embodiments, the keywords and phrases from the advertising database are exported and fed into the caching system. After acquisition, those purchased keywords and phrases may be used for pre-fetching of results, extending cache entry expiration, or other type of incorporation in to the caching system. The disclosed method and system may additionally be combined with standard LRU caching and query log analysis for pre-fetching repeated queries.
Thus embodiments of the system and method allow for using data from an existing component, the advertising system, to improve the functionality of another existing component, the caching system. In either of the disclosed caching system embodiments, the keywords and phrases from the advertising database are exported and fed into the caching system. After acquisition, those purchased keywords and phrases may be used for pre-fetching of results, extending cache entry expiration, or other type of incorporation in to the caching system.
While particular embodiments of the invention have been illustrated and described in detail herein, it should be understood that various changes and modifications might be made to the invention without departing from the scope and intent of the invention. The embodiments described herein are intended in all respects to be illustrative rather than restrictive. Alternate embodiments will become apparent to those skilled in the art to which the present invention pertains without departing from its scope.
From the foregoing it will be seen that this invention is one well adapted to attain all the ends and objects set forth above, together with other advantages, which are obvious and inherent to the system and method. It will be understood that certain features and sub-combinations are of utility and may be employed without reference to other features and sub-combinations. This is contemplated and within the scope of the appended claims.