Search queries are often issued by a user to a search engine. The search query may contain search terms that define information or other web sites that the user is seeking. Search queries may generally be classified as falling into one of two broad categories: discovery queries and navigational queries.
Discovery queries are issued by a user seeking information about a topic. Search results are then generated by the search engine according to one or more algorithms. Discovery queries are further characterized by the user seeking information from any source, and not any particular web site. For instance, a search query containing the element “Seattle” would be recognized as a discovery query. Multiple links to URLs may be generated by the search engine as a set of search results. Often a user may browse the search results and select results presented throughout the set of search results. Users may also select one link in the search result set and view the corresponding web site, only to discover that site did not provide the information they were seeking. The users may then return to the set of search results and select a link presented farther down in the list of search results.
Navigational queries are those issued by a user seeking to navigate to a specific web site. A search query of “Microsoft” would be recognized as a navigational query, as the user is likely seeking to navigate to the web site of the Microsoft Corporation, based in Redmond, Wash. Under most algorithms employed by search engines, a link to the URL of http://www.microsoft.com would be displayed as the first search result. While multiple search results may be generated by the search engine, often the user will click on the first search result presented and will not click on a link presented farther down the list. Additionally, it is unlikely that a user will return to the list of search results to select a link presented farther down in the list of search results.
Related URLs and corresponding web sites are defined to have similar content to that of a navigational search query and corresponding URL. For instance, if a navigational search query was directed to a major electronics retailer, a related URL and corresponding web site would be displayed for that of a competitor to the major electronics retailer.
Embodiments of the present invention relate to methods and computer-storage media for determining whether search queries are navigational search queries and presenting related search queries and corresponding URLs in association with search results for the navigational search queries. Query logs are analyzed to determine if a search query is a navigational search query. The query logs contain search queries issued by users and corresponding URLs selected in the search results. Navigational search queries and corresponding URL pairs are identified by applying one or more analytical methods upon the query logs.
Related search queries and corresponding URLs may be provided to a user issuing a navigational search query, in addition to a corresponding URL of the target of the navigational search query. The query log is consulted to determine which web sites are related to the navigational search query. By determining the identity of search queries issued within a predetermined time of the receipt of the navigational search query by other users, a set of related search queries and corresponding URLs may be determined. The query log may be consulted to determine which queries on the set are navigational queries, and queries determined to be non navigational queries may then be culled from the set. The remaining navigational search queries contained on the list may then be presented to the user issuing the navigational search query, along with the URL of the target of the navigational search query.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter nor is it intended to be used to limit the scope of the claimed subject matter.
The present invention is described in detail below with reference to the attached drawing figures, wherein:
The subject matter of the present invention is described with specificity herein to meet statutory requirements. However, the description itself is not intended to limit the scope of this patent. Rather, the inventors have contemplated that the claimed subject matter might also be embodied in other ways, to include different steps or combinations of steps similar to the ones described in this document, in conjunction with other present or future technologies. Moreover, although the terms “step” and/or “block” may be used herein to connote different elements of methods employed, the terms should not be interpreted as implying any particular order among or between various steps herein disclosed unless and except when the order of individual steps is explicitly described.
Embodiments of the present invention are directed to methods and computer-readable media with computer-executable instructions embodied thereon for determining whether one or more search queries are navigational search queries and displaying links to related web sites in conjunction with a Uniform Resource Locator (URL) for the navigational search query. One embodiment of the invention involves receiving a query log containing a plurality of search queries and corresponding URLs selected from search results generated in response thereto. Analytical methods are employed on the query log to determine if a particular search query is a navigational search query. A URL corresponding to the navigational search query is also determined. Other embodiments of the invention are directed to determining related queries and corresponding URLs for navigational search queries received from a user. The query log is analyzed to determine related queries and corresponding URLs for the navigational search query. At least one related query and corresponding URL is then presented to the user in conjunction with the corresponding URL for the navigational search query.
Having briefly described an overview of embodiments of the present invention, an exemplary operating environment suitable for use in implementing embodiments of the present invention is described below.
Referring to the drawings in general, and initially to
The invention may be described in the general context of computer code or machine-useable instructions, including computer-executable instructions such as program components, being executed by a computer or other machine, such as a personal data assistant or other hand-held device. Generally, program components including routines, programs, objects, components, data structures, and the like, refer to code that performs particular tasks, or implements particular abstract data types. Embodiments of the present invention may be practiced in a variety of system configurations, including hand-held devices, consumer electronics, general-purpose computers, specialty-computing devices, and the like. Embodiments of the present invention may also be practiced in distributed computing environments where tasks are performed by remote-processing devices that are linked through a communications network.
With continued reference to
Computing device 100 typically includes a variety of computer-readable media. By way of example, and not limitation, computer-readable media may comprise Random Access Memory (RAM); Read Only Memory (ROM); Electronically Erasable Programmable Read Only Memory (EEPROM); flash memory or other memory technologies; CDROM, digital versatile disks (DVD) or other optical or holographic media; magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to encode desired information and be accessed by computing device 100.
Memory 112 includes computer-storage media in the form of volatile and/or nonvolatile memory. The memory may be removable, non-removable, or a combination thereof. Exemplary hardware devices include solid-state memory, hard drives, optical-disk drives, and the like. Computing device 100 includes one or more processors that read data from various entities such as memory 112 or I/O components 120. Presentation component(s) 116 present data indications to a user or other device. Exemplary presentation components include a display device, speaker, printing component, vibrating component, etc. I/O ports 118 allow computing device 100 to be logically coupled to other devices including I/O components 120, some of which may be built in. Illustrative components include a microphone, joystick, game advertisement, satellite dish, scanner, printer, wireless device, and the like.
Turning now to
Computing network system 200 includes user devices 206, 208, and 210, and search engine server 204 all in communication with one another via a network 202. The network 202 may include, without limitation, one or more local area networks (LANs) and/or wide area networks (WANs). Such networking environments are commonplace in offices, enterprise-wide computer networks, intranets, and the Internet. Accordingly, the network 202 is not further described herein.
User devices 206, 208, and 210 may be any type of computing device used by an end user. By way of example, and not limitation, user devices may be personal computers, PDAs, cell phones, smart phones, or any other computing device used by an end user. Common to the exemplars presented is the ability to receive a search query and communicate that query to the search engine server 204 through network 202.
While an end user may be an ordinary human user, other types of end users are contemplated as well. For instance, an end user may also be another computing application. Accordingly, an end user should not be construed as being limited to a human user.
Search engine server 204 receives search queries entered on user devices 206, 208, and 210 through network 202. The search queries may be entered into a depiction of a web site generated on the user devices 206, 208, and 210. Search queries may generally be grouped into one of two classifications: discovery search queries and navigational search queries.
Discovery search queries are oftentimes issued by a user seeking information about a topic. Search results are then generated by the search engine server 204 according to one or more search algorithms. Discovery search queries are further characterized by the user seeking information from any source, and not any particular web site. For instance, a search query containing the element “Seattle” would be recognized as a discovery query. Multiple links to URLs may be generated by the search engine server 204 as a list of search results. Often a user may browse the search results and select results presented throughout the list of search results. Users may also select one link in the search result and view the corresponding web site, only to discover that web site did not provide the information they were seeking. The users may then return to the list of search results and select a link presented farther down in the list of search results.
Navigational search queries are those issued by a user seeking to navigate to a specific web site. For instance, a user may enter the name of an entity or organization as a search query when they seek to be presented with a link to the entity or organizations web site. As such, a search query of “Microsoft” would be recognized as a navigational search query, as the user is likely seeking to navigate to the web site of the Microsoft Corporation, based in Redmond, Wash. Under most algorithms employed by search engines such as search engine server 204, a link to the URL of http://www.microsoft.com would be displayed as the first search result. While multiple search results may be generated by a search engine in response to a navigational search query, often the user will select the first search result presented and will not select a link presented farther down the list. Additionally, it is unlikely that a user will return to the list of search results to select a link presented farther down in the list of search results.
Returning to the discussion of the search engine server 204 of
Turning now to
Turning now to
In block 420, the number of instances that, in response to a particular search query of the plurality of search queries, a particular corresponding URL was selected without a subsequent selection of a different URL from the search results. A particular search query within the meaning of
Simply put, although not explicitly depicted in
Proceeding on with
In block 440, a comparison is made to the number of instances that the particular URL was selected without a subsequent selection of a different URL to the number of instances that any URL was selected in response to the particular search to generate a comparison value. The comparison value can take the form of a ratio. Based upon the comparison value, the particular search query may be determined to be a navigational search query in block 450. The predefined percentage may vary considerably in practice and be regularly adjusted to provide optimal results. In some embodiments of the invention the percentage may be in the 40-50% range. Although it should be understood that in other embodiments of the invention, the percentage may vary significantly based on any number of factors. Those factors may include, but are not limited to: the total number of entries of the query log, the number of URLs selected in response to the particular search query, or the number of instances that the particular URL was selected in response to the particular search query. Accordingly, the range of percentages may vary considerably based on any number of factors.
The navigational search query can then be stored on computer-readable media in association with the particular corresponding URL. When search results are presented in response to the issuance of the navigational search query, at least one of a related query and a URL corresponding to a related query can be presented in association with the navigational search query and the particular corresponding URL.
Turning now to
The method begins at block 510 with the receiving of at least one search query. The search query may be entered into one of the user devices 206, 208, and 210 and communicated to the search engine server 204 through the network 202. Further discussion of
At block 520, the search query received in block 510 is analyzed to determine if it is a navigational search query. Methods described in relation to
Simply put, although not explicitly depicted in
A determination is then made of the number of instances that any URL was selected in response to the search query received at block 510. This determination is made by analyzing the entries of the query log in which the particular search query was issued. The number of distinct URLs selected in response to the particular search query can then be determined.
Further, the numbers identified and determined in blocks 420 and 430 are compared and a comparison value is generated. The comparison value can take the form of a ratio. If the ratio of the number of instances identified in block 420 to the number determined in block 430 exceed a predefined percentage, the particular search query is determined to be a navigational search query in block 520. The predefined percentage may vary considerably in practice and be regularly adjusted to provide optimal results. In some embodiments of the invention the percentage may be in the 40-50% range. Although it should be understood that in other embodiments of the invention the percentage may vary significantly based on any number of factors. Those factors may include, but are not limited to: the total number of entries of the query log, the number of URLs selected in response to the particular search query, or the number of instances that the particular URL was selected in response to the particular search query. Accordingly, the range of percentages may vary considerably based on any number of factors.
In block 530, a query log containing at least one such query is accessed. For purposes of discussion, it will be assumed that one query log is accessed, although it should be recognized that multiple query logs may be accessed in accordance with embodiments of the invention. As discussed above in regard to
In block 540, at least one related search query and a corresponding URL is identified. Related queries and corresponding URLs may be identified by analyzing the query log to determine queries issued by a user within a predetermined period of time from the same user issuing the navigational query. A set may then be generated of prospective related search queries and corresponding URLs. The search queries and corresponding URLs in the set may then be analyzed to determine if they are navigational search queries and related URLs. In some embodiments of the invention, queries and URLs may be filtered from the set if they are not navigational search queries and corresponding URLs. A filtered set is thereby generated that contains related search queries and corresponding URLs that are also navigational search queries. This filtered set, or some portion thereof, can then be presented in association with the navigational search and corresponding URL.
Alternative embodiments of the invention may retain the queries and URLs in the set, regardless of the status of the query as a navigational search query. In these embodiments, a determination may not even be made as to whether or not the search queries and URLs are navigational search queries and corresponding URLs.
At block 550, at least one of the related search query and the corresponding URL link is presented. The at least one of the related search query and corresponding URL may be presented in association with the corresponding URL for the navigational search query in the search results. The at least one of the related search query and corresponding URL may be presented directly beneath the link to the corresponding URL for the navigational search query. In addition, multiple links to related search queries and/or navigational search queries may be displayed in conjunction to the corresponding URL for the navigational search query. A visual depiction of the arrangement of the link to related web sites is presented in
Turning now to
The method begins in block 610 with the receiving of a search query. The search query may be received from any of user devices 206, 208, and 210 after being communicated through network 202 to search engine server 204.
A query log is then received in block 620, in a similar fashion related in conjunction with block 410 of
In block 630, the number of instances that, in response to a particular search query of the plurality of search queries, a particular corresponding URL was selected without a subsequent selection of a different URL from the search results.
In accordance with block 640, the query log is analyzed to determine the number of instances that any URL was selected in response to the particular search query. This determination is made by analyzing the entries of the query log in which the particular search query was issued. The number of distinct URLs selected in response to the particular search query is then determined. Although not depicted in
Simply put, although not explicitly depicted in
In block 650, a comparison value is generated by comparing the number of instances that the particular corresponding URL was selected without a subsequent selection of a different URL to the number of instances that any URL was selected in response to the particular search query to generate a comparison value.
In block 660, the particular search query is determined to be a navigational search query based upon the comparison value. This comparison value can take the form of a ratio. If the ratio exceeds a predefined percentage, the search query can be determined to be a navigational search query. The predefined percentage may vary considerably in practice and be regularly adjusted to provide optimal results. In some embodiments of the invention the percentage may be in the 40-50% range. Although it should be understood that in other embodiments of the invention, the percentage may vary significantly based on any number of factors. Those factors may include, but are not limited to: the total number of entries of the query log, the number of URLs selected in response to the particular search query, or the number of instances that the particular URL was selected in response to the particular search query. Accordingly, the range of percentages may vary considerably based on any number of factors.
In block 670, a set of related search queries and corresponding URLs is identified within the at least one query log. Related queries and URLs may be identified by analyzing the query log to determine queries issued by other users within a predetermined period of time from when they issued the same user issuing the navigational query. A set may then be generated of prospective related queries and URLs.
At block 680, a determination is made as to which queries and URLs in the set are navigational search queries and corresponding related URLs. Although not depicted in
At least a portion of the set of related queries and URLs are presented to the user in association with the navigational search query and corresponding URL in block 690. The set may be presented directly beneath the corresponding URL for the navigational search query. A visual depiction of the arrangement of the link to related web sites is presented in
Turning now to
The present invention has been described in relation to particular embodiments, which are intended in all respects to be illustrative rather than restrictive. Alternative embodiments will become apparent to those of ordinary skill in the art to which the present invention pertains without departing from its scope.
From the foregoing, it will be seen that this invention is one well adapted to attain all the ends and objects set forth above, together with other advantages which are obvious and inherent to the system and method. It will be understood that certain features and subcombinations are of utility and may be employed without reference to other features and subcombinations. This is contemplated by and is within the scope of the claims.