With the widespread availability of information over networks, such as the Internet, search engines have come into widespread use. Search engines receive user queries and find content matching the query to return to the user. A common approach to implementing a search engine is through a page index. The page index relates terms that may appear in a search query to units of content on the network, frequently called web pages.
Various approaches exist for constructing and applying the page index. Constructing the index frequently entails “crawling” a network, such as the Internet, containing the body of data that will eventually be searched. Crawling entails following links from one web page to the next and analyzing each page. As part of the analysis, terms characterizing the web page may be identified and added to the page index in a way that associates that web page with those terms. Additionally, information, such as the number of links to a web page, may be captured and used to prioritize the web pages.
The page index is applied as part of a search stack. When a user submits a search query, a search engine matches terms in the query to web pages based on the search index. The search stack may include components that modify the search query before the index is consulted, such as to correct misspelling of search terms or attach terms that can be inferred based on a user profile. The search stack may also include components to filter search results. For example, web pages identified using the page index may be filtered, such as by ranking the web pages based on a metric indicating relevance to a query.
In some scenarios, information that may be useful as part of processing search queries may be pre-computed. An entry may be made in the page index, pointing to the pre-computed information rather than a web page.
The usefulness of a search system may be improved by incorporating into a search stack of the system components that may select and execute a model characterizing information to be provided in response to a search query. The model may be selected from a set of models based on user context information, such as a search query from a user and/or data identified by a search engine in response to the search query.
Execution of the model may result in generation of new information, in addition to formatting, filtering or processing data returned by a search engine executing the query. The new information may be generated based on an expression contained in the model and may include expressions that may be evaluated based on dynamic data, including data based on a search query.
The foregoing is a non-limiting summary of the invention, which is defined by the attached claims.
The accompanying drawings are not intended to be drawn to scale. In the drawings, each identical or nearly identical component that is illustrated in various figures is represented by a like numeral. For purposes of clarity, not every component may be labeled in every drawing. In the drawings:
The inventors have recognized and appreciated that functionality and utility of search systems may be expanded by incorporating into a search stack of a search system components that can select and execute one or more models characterizing data to be provided to a user in the user's context, and also characterizing the interactions between the data and user. In response to a search query, or other input identifying user context, the system may identify a model to apply in generating information for the user and to handle further interactions between the user and the information (as the user seeks to better understand the information or subsets of the information, including by providing more data). The model may be selected based on user context information, that may include the search query itself or data generated by a search engine applying the search query to select web pages.
Once a model is selected, a model execution engine may apply the model to generate information to be provided to the user. A model may include one or more elements, at least some of which define a computation to be performed based on data dynamically identified for the user's context. For example, execution of the model may result in a computation that has inputs relating to terms in the search query or data generated by a search engine applying the search query. The output of the computation may be provided to the user as a result of the search query, either alone or in conjunction with data located by a search engine or other dynamically generated data.
In some embodiments, the computation may be based on an equation represented in the model. The equation may specify a mathematical operation to be performed on data that is dynamically identified by the search engine. Such a mathematical operation may include other data, such as user data obtained from a user profile or based on context information. Executing models in a search stack may greatly expand the type of information that may be returned to a user and may be applied in many different contexts.
For example, a model may include an equation, defining computation of calorie content from a recipe. Such a model may be applied in response to a query requesting recipes such that, in addition to receiving content representing recipes found on web pages, a user may receive calorie content, even though that information was not included on the web pages. Such a model may allow the user to interact with the data returned by the query without going back to the search engine. For example, if a facility is provided to edit a model, a user may specify or change his personal profile and nutrition goals by editing the model and see alternative meal plans for the day.
As another example, a model may include a formula for computing commuting distance or time from a location. Such a model may be applied in response to a query requesting information on houses for sale such that, in addition to receiving a listing of houses for sale identified as a result of a search, a user may receive commuting information with each house. A system may support user edits to the commuting model to let the user interact with and do “what-ifs” with the information, e.g. the commuting impact of particular school/gym/workplace/transportation mode choices.
As yet another example, a model may include a formula for generating a metric comparing a patient's lab results to a population norm. Such a model may be applied in a search system coupled to an intranet in a hospital such that, when a search is conducted for lab results of a patient, a clinician may receive, in addition to lab results for the patient, comparative data characterizing the results based on an analysis of lab results in medical records for other users returned as a result of the search. The model may enable the clinician to do what-ifs like change some assumptions about the patient or the relationship between lab data and underlying disease.
A search system may contain multiple models, applicable in different contexts. Accordingly, a search system may contain a component that selects a model for a specific context. A component to perform this function may access a model index. In some embodiments, the model index may have a form analogous to a page index used by a search engine. In this way, existing techniques optimized for high speed search may also be used to quickly select and apply a model. Such an approach may be useful in scenarios in which a model is identified based on terms or other aspects of a search query. Further, in some embodiments, the models may be treated as web pages and may include meta tags to aid in indexing them in a search engine.
To facilitate the use of models, each model may be represented as a collection of declarative statements (“expressions”). An expression may be a symbolic representation of a computation to be performed, which may consist of operators and operands. The operators of an expression may include any operators known to one of skill in the art (such as the common mathematical operators of addition, subtraction, multiplication, and division), any functions known to one of skill in the art, and functions defined by a user. The operands of an expression may include data (such as numbers or strings), symbols that represent data, and other expressions. An expression may thus be recursive in that an expression may be defined by other expressions.
A symbol may represent any type of data used in common programming languages or known to one of skill in the art. For example, a symbol may represent an integer, a rational number, a string, a Boolean, a sequence of data (potentially infinite), a tuple, or a record. In some embodiments, a symbol may also represent irrational numbers, while in other embodiments, symbols may not be able to represent irrational numbers.
For example, an expression may take the form of a symbolic representation of an algebraic expression, such as x2+2xy+y2, where x and y may be symbols that represent data or other expressions. An expression may take the form of an equation, such as E=mc2, where E, m, and c may by symbols representing data or other expressions. An expression may take the form of a function definition, such as ƒ(x)=x2−1, where ƒ is a symbol representing the function, x is a symbol representing an operand or argument of the function, and x2−1 is an expression that defines the function. An expression may also take the form of a function invocation, such as ƒ(3), which indicates that the function ƒ is to be invoked with an argument of 3.
Expressions may be solved by an expression engine to produce a result. For example, where the symbol x (itself an expression) represents the number 3 and the symbol y (also an expression) represents the number 2, the expression x2+2xy+y2 may be solved by replacing the symbols with the values the represent, e.g., 22+2×2×3+32, and then applying the operators to the operands to solve the entire expression as 25. In another example, where m is a symbol representing the number 2 and c is a symbol representing the number 3, the expression E, defined above, may be solved by replacing E with its definition, e.g., mc2, replacing the symbols m and c with the values they represent, e.g., 2×32, and applying the operators to the operands to solve the expression as 18.
In evaluating an expression, the expression engine may apply the operators to the operands to the extent that the operators and operands are defined and to the extent that expression engine knows how to apply the operators to the operands. For example, where the symbol x represents the number 3 and the symbol y is not defined, the expression x2+2xy+y2 may be solved by replacing the known symbols with the values the represent, e.g., 22+2×2×y+y2, and then applying the operators to the operands to solve the entire expression as 4+4y+y2. Where the symbol x represents the number 3 and the symbol y represents the string “hello”, the expression x2+2xy+y2 may be solved as 4+4×hello+hello2, since the expression engine may not know how to perform arithmetic operations on the string “hello.”
In some embodiments, expressions may be declarative. A declarative expression may indicate a computation to be performed without specifying how to compute it. A declarative expression may be contrasted with an imperative expression, which may provide an algorithm for a desired result.
In some embodiments, expressions may be immutable. An expression is immutable if it cannot be changed. For example, once a definition is given, such as E=mc2, the expression E cannot later be given a different definition. One advantage of immutability is that applications defined by immutable expressions may be side-effect free in that the functionality of the application may not be able to be altered by users of the application.
An application may be defined by a set of expressions, aka a model. An application defined by expressions may have input variables and output variables and the relationship between the input variables and the output variables may be defined by the set of expressions that defines the application. The determination of which variables are input variables and which variables are output variables may be determined by the user. In solving for the output variables, the expression engine may produce data (e.g., a number or a string) or may produce an expression of the input variables.
In this way, the models may be relatively easy to produce and apply. Further, by having the models executed in a model engine, an entity providing search services may receive models from third parties and execute them in an environment in which execution of the model cannot interfere with operation of the computer equipment that implements the search system.
As a result, knowledge useful in generating search results may be captured in models and shared across search systems. Further, by allowing the models to be executed in the search stack of a search system, models may perform computations or other operations based on data that is only available within the search system.
Computing device 105 may have the capability to communicate over any suitable wired or wireless communications medium to a server 106. The communication between computing device 105 and server 106 may be over computer network(s) 108, which may be any suitable number or type of telecommunications networks, such as the Internet, a corporate intranet, or cellular networks. Server 106 may be implemented using any suitable computing architecture, and may configured with any suitable operating system, such as variants of the WINDOWS® Operating System developed by MICROSOFT® Corporation. Moreover, while server 106 is illustrated in
In the example of
Regardless of the type of input provided by user 102 that triggers generation of a query, computing device 105 may send the query to server 106 to obtain information relevant to the query. After retrieving data relevant to the search query, such as, for example, web pages, server 106 may apply one or more declarative models to the data to generate higher level information to be returned to user 102. The information generated by server 106 may be sent over computer network(s) 108 and be displayed on display 104 of computing device 105. Display 104 may be any suitable display, including an LCD or CRT display, and may be either internal or external to computing device 105.
Regardless of the specific configuration of search stack 200, a user query 202 may be provided as input to search stack 200 over a computer networking communications medium, and may be either implicit or explicit, as discussed in connection with
Search engine 204 may consult data index 206 to retrieve data 208 related to the user query 202. The retrieved data 208 may be a data portion of search results that are retrieved based on user query 202 and/or other factors relevant to the search, such as a user profile or user context. That is, data index 206 may comprise a mapping between one or more factors relevant to a search query (e.g., user query terms, user profile, user context) and data, such as data pages, that match and/or relate to that query. The mapping in data index 206 may be implemented using conventional techniques or in any other suitable way.
Regardless of the type of mapping performed using data index 206 to retrieve data 208 relevant to the search, data 208 may comprise any suitable data retrieved by search engine 204 from a large body of data, such as, for example, web pages, medical records, lab test results, financial data, demographic data, video data (e.g., angiograms, ultrasounds), or image data (e.g., x-rays, EKGs, VQ scans, CT scans, or MRI scans). Data 208 may be retrieved or identified dynamically by search engine 204 or it may be cached as the result of a prior search performed by search engine 204 based on similar or identical query. Data 208 may be retrieved using conventional techniques or in any other suitable way.
The search stack 200 may also include a model selection component, such as model selector 210, which may select one or more appropriate declarative model(s) 214 from a set of models stored on one or more computer readable media accessible to the model selector 210. The model selector 210 may then apply the selected model(s) 214 to the results (i.e., data 208) of the search performed by search engine 204, Model selector 210 may be coupled to model index 212, which may be the same as data index 206 or may be a separate index. Model index 212 may be implemented on any suitable storage media, including those described in connection with data index 206, and may be in any suitable format, including those described in connection with data index 206. Model index 212 may comprise a mapping between one or more factors relevant to the user's search (e.g., terms in user query 202, user profile, user context, and/or the data 208 retrieved by the search engine 204) and appropriate model(s) 214 that may be applied to the data 208 retrieved by search engine 204.
Selected models 214 may be selected from a larger pool of models 250 stored on computer-readable media associated with server 106 (
In some embodiments, to facilitate easy addition of models to pool of models 250, the search system illustrated in
Search stack 200 may also include a model execution engine 216, which may apply the selected model(s) 214 to the data 208 retrieved by search engine 204. In the application of a model, data 208 may serve as a parameter over which the selected model(s) is executed by model execution engine 216. Additional parameters, such as portions of user query 202, may also be provided as input to the selected model(s) during model execution. Though, it should be appreciated that any data available within the search environment illustrated in
As a result of the application of the model to the search results performed by model execution engine 216, information 218 may be generated. Generated information 218 may be returned to the user by an output component (not shown) of search stack 200. Though, the generated information may be used in any suitable way, including as a query for further searching by search engine 204. Generated information 218 may include the results of model execution performed by model execution engine 216, may include data 208 retrieved by the search engine 204, or any suitable combination thereof. For example, based on the application of a model performed by the model execution engine 216, the ordering of the presentation to a user of data 208 may change, the content presented as part of data 208 may be modified so that it includes additional or alternative content that is the result of a computation performed by model execution engine 216, or any suitable combination of the two. Thus, when selected model(s) 214 are applied to raw data, such as data 208, retrieved by a search engine, the generated information 218 may be at a higher level of abstraction and therefore be more useful to a user than the raw data itself.
Model 300 may comprise one or more elements, which in the embodiment illustrated are statements in a declarative language. In some embodiments, the declarative language may be at a level that a human being who is not a computer programmer could understand and author. For example, it may contain statements of equations and the form of a result based on evaluation of the equation, such as equation 304 and result 305, and equation 306 and result 307. An equation may be a symbolic or mathematical computation over a set of input data.
Model 300 may also comprise statement(s) of one or more rules, such as rule 308 and the form of a result based on evaluation of the equation, such as rule result 309. The execution of some types of rules may trigger a search to be performed, thereby collecting new information. According to some embodiments, when a model such as model 300 containing a rule, such as rule 308, is executed, such as by model execution engine 216, the evaluation of the rule performed as part of the execution of the model may generate a search query and trigger a search to be performed by the data search engine, such as search engine 204. Thus, in such embodiments, an Internet search may be triggered based on a search query generated by the application of a model to the search data. Though a rule may specify any suitable result. For example, a rule may be a conditional statement and a result that applies, depending on whether the condition evaluated dynamically is true or false. Accordingly, the result portion of a rule may specify actions to be conditionally performed or information to be returned or any other type of information.
Model 300 may also comprise statement(s) of one or more constraints, such as constraint 310 and result 311. A constraint may define a restriction that is applied to one or more values produced on application of the model. An example of a constraint may be an inequality statement such as an indication that the result of applying a model to data 208 retrieved from a search be greater than a defined value.
Model 300 may also include statements of one or more calculations to be performed over input data, such as calculation 312. Each calculation may also have an associated result, such as result 313. In this example, the result may label the result of the specified calculation 312 such that it may be referenced in other statements within model 300 or otherwise specifying how the result of the computation may be further applied in generating information to a user. Calculation 312 may be an expression representing a numerical calculation with a numerical value as a result, or any other suitable type of calculation, such as symbolic calculations. In applying model 300 to data 208 retrieved by a search engine, model execution engine 216 may perform any calculations over data 208 that are specified in the model specification, including attempting to solve equations, inequalities and constraints over the data 208. In some embodiments, the statements representing equations, rules, constraints or calculations within a model may be interrelated, such that information generated as a result of one statement may be referenced in another statement within model 300. In such a scenario, applying model 300 may entail determining an order in which the statements are evaluated such that all statements may be consistently applied. In some embodiments, applying a model may entail multiple iterations during which only those statements for which values of all parameters in the statement are available are applied. As application of some statements generates values used to apply other statements, those other statements may be evaluated in successive iterations. If application of a statement in an iteration changes the value of a parameter used in applying another statement, the other statement will again be applied based on the changed values of the parameters on which it relies. Application of the statements in a model may continue iteratively in this fashion until a consistent result of applying all statements in the model occurs from one iteration to the next, achieving a stable and consistent result. Though, it should be recognized that any suitable technique may be used to apply a model 300.
Equation statement 404 is an example of equation 304 of
Result statement 405 is an example of result 305 of
The example of
In step 502, the search stack may receive the user's query. As discussed above, a user's query may be either implicit or explicit. For example, in some embodiments, a search stack may generate a search query on behalf of the user. The search stack, for example, may generate a search query based on context information associated with the user. This may be performed for example, by search engine 204 of
Regardless of how the query is generated, in step 504, the search engine may then retrieve data matching the search results query. The data returned may be based on a match (whether explicit or implicit) between the query (and/or other factors, such as user context and a user profile) and terms in an index accessible to the search engine, such as data index 206 of
The process then flows to step 506, in which the search stack may retrieve one or more models appropriate to the user's search. In the exemplary implementation of
At step 508, the search stack may then apply the retrieved model(s) to the retrieved data. In the exemplary implementation of
Turning to step 510, the search stack may then output results generated as a result of the application of the selected model(s) to the retrieved data. In this example the output may entail returning information to a user computer which can then render the information on a display for a user. In some embodiments, the generated information may include some combination of the result of executing the model on the data returned from the search engine and the data itself. For example, the generated information may filter or reorder the search data based on the application of the model, or may provide additional information or information in a different format than the data returned by the search results. In some embodiments, the reordering of the search data may incorporate a time element. For example, a model may identify a time order of a set of multiple events. Application of such a model may then entail identifying search data related to those events, and generating the information returned to the user in an order in accordance to the time order of the model. Though, it should be recognized that the nature of the information generated may be in any suitable form that can be specified as a result of application of a model, which may contain a combination of elements, such as calculations, equations, constraints and/or rules.
After the data is returned to the user (via the user's computing device), the process of
The user has entered a text query 604, “houses for sale near my office,” in a query input field 602 in the user interface, and sent that query via web browser 600 to a search engine that is part of a search stack according to some embodiments. In response, the search stack returned generated information to the user via the web browser, illustrated in
After receiving the user's query, the search engine may retrieve a set of data (e.g., web pages) including results of houses for sale near the user's office. The set of data returned from the search engine may be based on matches between the query terms and terms in an index relating to the web pages, as discussed above. Though, as illustrated, other sources of data may be used in evaluating the search query. In this example, the search query includes the phrase “my office.” That phrase may be associated with information in a user profile accessible to the search and retrieval system processing the query. Accordingly, on execution of the query, the search and retrieval system may filter results based on geographic location in accordance with the information specified in the user profile. Though, it should be recognized that any suitable technique may be used to process a search query and retrieve data.
Based on the query and/or the retrieved data, an appropriate model may then be selected by the search stack, such as by model selector 210 of
The selected model is then retrieved and applied to the data (i.e., the web pages of houses for sale) resulting from the search. The application of the model to the data may be performed, for example, by model execution engine 216. In the example of
Thus, in the example of
Accordingly, as the result of the application of the model specified by the example of
The search stack may also be configured to modify the index of models, such as model index 212 of
Having thus described several aspects of at least one embodiment of this invention, it is to be appreciated that various alterations, modifications, and improvements will readily occur to those skilled in the art.
Such alterations, modifications, and improvements are intended to be part of this disclosure, and are intended to be within the spirit and scope of the invention. Accordingly, the foregoing description and drawings are by way of example only.
The above-described embodiments of the present invention can be implemented in any of numerous ways. For example, the embodiments may be implemented using hardware, software or a combination thereof. When implemented in software, the software code can be executed on any suitable processor or collection of processors, whether provided in a single computer or distributed among multiple computers.
Further, it should be appreciated that a computer may be embodied in any of a number of forms, such as a rack-mounted computer, a desktop computer, a laptop computer, or a tablet computer. Additionally, a computer may be embedded in a device not generally regarded as a computer but with suitable processing capabilities, including a Personal Digital Assistant (PDA), a smart phone or any other suitable portable or fixed electronic device.
Also, a computer may have one or more input and output devices. These devices can be used, among other things, to present a user interface. Examples of output devices that can be used to provide a user interface include printers or display screens for visual presentation of output and speakers or other sound generating devices for audible presentation of output. Examples of input devices that can be used for a user interface include keyboards, and pointing devices, such as mice, touch pads, and digitizing tablets. As another example, a computer may receive input information through speech recognition or in other audible format.
Such computers may be interconnected by one or more networks in any suitable form, including as a local area network or a wide area network, such as an enterprise network or the Internet. Such networks may be based on any suitable technology and may operate according to any suitable protocol and may include wireless networks, wired networks or fiber optic networks.
Also, the various methods or processes outlined herein may be coded as software that is executable on one or more processors that employ any one of a variety of operating systems or platforms. Additionally, such software may be written using any of a number of suitable programming languages and/or programming or scripting tools, and also may be compiled as executable machine language code or intermediate code that is executed on a framework or virtual machine.
In this respect, the invention may be embodied as a computer readable medium (or multiple computer readable media) (e.g., a computer memory, one or more floppy discs, compact discs (CD), optical discs, digital video disks (DVD), magnetic tapes, flash memories, circuit configurations in Field Programmable Gate Arrays or other semiconductor devices, or other non-transitory, tangible computer storage medium) encoded with one or more programs that, when executed on one or more computers or other processors, perform methods that implement the various embodiments of the invention discussed above. The computer readable medium or media can be transportable, such that the program or programs stored thereon can be loaded onto one or more different computers or other processors to implement various aspects of the present invention as discussed above.
The terms “program” or “software” are used herein in a generic sense to refer to any type of computer code or set of computer-executable instructions that can be employed to program a computer or other processor to implement various aspects of the present invention as discussed above. Additionally, it should be appreciated that according to one aspect of this embodiment, one or more computer programs that when executed perform methods of the present invention need not reside on a single computer or processor, but may be distributed in a modular fashion amongst a number of different computers or processors to implement various aspects of the present invention.
Computer-executable instructions may be in many forms, such as program modules, executed by one or more computers or other devices. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Typically the functionality of the program modules may be combined or distributed as desired in various embodiments.
Also, data structures may be stored in computer-readable media in any suitable form. For simplicity of illustration, data structures may be shown to have fields that are related through location in the data structure. Such relationships may likewise be achieved by assigning storage for the fields with locations in a computer-readable medium that conveys relationship between the fields. However, any suitable mechanism may be used to establish a relationship between information in fields of a data structure, including through the use of pointers, tags or other mechanisms that establish relationship between data elements.
Various aspects of the present invention may be used alone, in combination, or in a variety of arrangements not specifically discussed in the embodiments described in the foregoing and is therefore not limited in its application to the details and arrangement of components set forth in the foregoing description or illustrated in the drawings. For example, aspects described in one embodiment may be combined in any manner with aspects described in other embodiments.
Also, the invention may be embodied as a method, of which an example has been provided. The acts performed as part of the method may be ordered in any suitable way. Accordingly, embodiments may be constructed in which acts are performed in an order different than illustrated, which may include performing some acts simultaneously, even though shown as sequential acts in illustrative embodiments.
Use of ordinal terms such as “first,” “second,” “third,” etc., in the claims to modify a claim element does not by itself connote any priority, precedence, or order of one claim element over another or the temporal order in which acts of a method are performed, but are used merely as labels to distinguish one claim element having a certain name from another element having a same name (but for use of the ordinal term) to distinguish the claim elements.
Also, the phraseology and terminology used herein is for the purpose of description and should not be regarded as limiting. The use of “including,” “comprising,” or “having,” “containing,” “involving,” and variations thereof herein, is meant to encompass the items listed thereafter and equivalents thereof as well as additional items.