Search Tool Using Multiple Different Search Engine Types Across Different Data Sets

Information

  • Patent Application
  • 20080033926
  • Publication Number
    20080033926
  • Date Filed
    August 03, 2006
    18 years ago
  • Date Published
    February 07, 2008
    16 years ago
Abstract
Various embodiments provide a search tool that utilizes multiple different search engines. The individual search engines are configured to conduct searches in different ways across a search space that includes different types of data sets. In at least some embodiments, the type of search engine that is utilized is a function of characteristics of the data set(s) that is (are) to be searched. In search spaces that include different types of data sets, combining and mixing different search engines to collectively search the search space can provide a desirably fast and robust user experience.
Description

BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 illustrates an exemplary system in accordance with one embodiment.



FIG. 2 is a flow diagram that describes steps in a method in accordance with one embodiment.



FIG. 3 is an exemplary user interface in accordance with one embodiment.



FIG. 4 is an exemplary user interface in accordance with one embodiment.



FIG. 5 is an exemplary user interface in accordance with one embodiment.



FIG. 6 is an exemplary user interface in accordance with one embodiment.



FIG. 7 is an exemplary user interface in accordance with one embodiment.



FIG. 8 is a flow diagram that describes steps in a method in accordance with one embodiment.





DETAILED DESCRIPTION

Overview


Various embodiments provide a search tool that utilizes multiple different search engines. The individual search engines are configured to conduct searches in different ways across a search space that includes different types of data sets. In at least some embodiments, the type of search engine that is utilized is a function of characteristics of the data set(s) that is (are) to be searched. In search spaces that include different types of data sets, combining and mixing different search engines to collectively search the search space can provide a desirably fast and robust user experience.


The search tool about to be described can be utilized in accordance with any suitable type of computing device and can be used in any suitable searching scenario in which is it desirable to allow a user to search across a search space. The search space can include, by way of example and not limitation, all or several parts of the user's own computing device, one or more other computing devices, one or more servers or other networked data repositories and the like.



FIG. 1 illustrates an exemplary system generally at 100 in accordance with one embodiment. Here, system 100 includes a search space 102 and search tool 104. Search space 100 includes multiple different types of data sets and search tool 104 includes multiple different types of search engines.


The various data sets that make up the search space can have varying characteristics or properties. For example, one property of a data set can be its size. Specifically, some data sets in the search space may be relatively small, while other data sets may be relatively large. Another property of a data set can be the size that it is expected to grow to over time. For example, some data sets will be unlikely, in expected usage, to grow past some small size, while others are likely, in expected usage, to grow to a very large size.


As noted above, the search tool 104 includes multiple different types of search engines. These search engines are typically embodied in the form of computer-readable instructions or software that resides on some type of computer-readable medium. In practice and as described below in more detail, the principles of operation of the individual search engines are different. For example, a first of the search engines can be configured to conduct its searching in a manner that is different from the manner in which a second of the search engines is configured to search. For example, one search engine can be configured to conduct linear searches, while another search engine can be configured to conduct index searches. Linear searches and index searches are generally well known by the skilled artisan. A linear search involves, as the name implies, linearly or serially searching a collection of items in a data set. An index search involves searching an index which indexes content that may reside, for example, on a user's Computer. Indexes can vary in terms of how they are set up and maintained. Typically, however, an index contains an index entry, such as a keyword, and then a number of properties associated with that keyword. For example, an index may contain the word “note” as a keyword, and then include a property of the files in which that word appears.


When using the search tool 104 to search the search space 102, by appreciating the various different characteristics and properties as between the different data sets, individual search engines can be selected to conduct searches that are appropriate for the data set that is being searched. Hence, when searching across a search space that includes different types of data sets, a combination of search engines can be selected and selectively employed with an appropriate data set to collectively provide a very fast search and, in turn, enhance the user's searching experience. For example, on data sets that are relatively small, a search engine that conducts a linear search can be used. Similarly, on data sets that are relatively large, a search engine that conducts an index search can be used.



FIG. 2 is a flow diagram that describes steps in a method in accordance with one embodiment. The method can be implemented in connection with any suitable hardware, software, firmware or combination thereof. In at least one embodiment, the method can be implemented by a suitably configured search tool.


Step 200 receives user search input. This step can be performed in any suitable way. For example, a user interface component can be presented to the user and the user can type in a particular search term that is of interest. Step 202 selects a first search engine to conduct a search across a data set that is part of a relevant search space. Step 204 selects at least one other search engine to conduct a search across another data set. It is to be appreciated and understood that steps 202 and 204 can be performed simultaneously. Specifically, different search engines can be called at the same time to perform their respective searches. Step 206 presents the search results to the user.


By selecting different search engines depending on the characteristics of the data sets being searched, efficient searches across diverse data sets can be conducted and search results can be very quickly returned to the user.


In the example described just below, a search scenario in the form of a desk top search conducted from a start menu is described. It is to be appreciated and understood that this scenario is described, among other reasons, to give the reader an appreciation of one particular specific context in which the inventive search tool can be used. As such, other search scenarios can be utilized without departing from the spirit and scope of the claimed subject matter.


Implementation Example


Preliminarily, before describing the exemplary start menu implementation, consider the following.


A start menu is typically used by a computer user when they are either initiating their computing activities and/or performing a limited number of typically well understood actions, such as looking for a program to launch, looking for documents, pictures or music, accessing a control panel and the like. That is, there is a common expectation that a start menu will be used for certain definable actions and activities.


In accordance with one embodiment, a search box is provided as part of the start menu user interface and enables the user to quickly search for items on their computer. A good assumption about a user who uses a start menu search box is that they are more likely to search for some types of data (data sets) than others. For example, one of the primary uses of the start menu is to launch programs. Hence, if a user decides to use the start menu's search box, a good assumption is that they might be looking for a particular program. It is, of course, possible that they are looking for something else—but generally, the assumption that the user might be looking for a program is a good one.


Thus, when one looks at the possible uses of a start menu and juxtaposes the types of data that a user might search using a start menu search box, groups of data sets begin to emerge. As an example of one collection of data set groups, consider the following.


A first data set or group that a user might be interested in can be considered as “programs”. Programs can include the programs that are loaded on the user's computing device, application in their path, and control panels. One characteristic of the programs data set is that it is relatively small and does not grow very large with normal usage.


A second data set or group that a user might be interested in can be considered as “web-related items”. Web related items can include web pages from the user's favorites folder and/or web pages from the user's internet browsing history. One characteristic of the web-related items data set is that it is relatively small and does not grow very large with normal usage.


A third data set or group that a user might be interested in can be considered as “files”. Files can include any files that the user has on their device such as document files, music files and the like. One characteristic of the files data set is that it tends to be relatively larger and tends to grow to a larger size than those data sets mentioned above.


A fourth data set or group that a user might be interested in can be considered as “communications”. Communications can include email messages, instant messaging messages, appointments, contacts and the like. One characteristic of the communications data set is that it is relatively larger and can tend to grow to a larger size than the first two data sets mentioned above.


In accordance with one embodiment, when a user accesses the start menu's search box and begins typing in letters, search results that match their query appear in the start menu. To enhance the user experience, this can be done on a letter by letter basis. Accordingly, as the user types in the first letter, they can see a set of search results that match the first letter. As they type in the second letter, the search results can change, and so on.


In practice and in view of the different types of data sets that make up the searchable search space, different search engines are selected to search individual data sets. For example, in the implementation example just above, for the first two groups, i.e. programs and web-related items, because of the relative size of such data sets, a search engine that conducts a linear search can be used, as will be appreciated by the skilled artisan. However, for the last two groups, i.e. files and communications, because of the relative size of such data sets, a different search engine and one that conducts an index search can be used.


It should be appreciated and understood that for queries that will return Many results, items from the smaller data sets can be returned faster if they are linearly searched separately, rather than if they are included in the index of the larger data set and only one search is performed. In addition, the complexity and overhead of an index can be avoided if the search is known to cover only a small data set.


As an example of a user interface that can be used to enable a user to search in accordance with one embodiment, consider FIG. 3. There, a start menu 300 is shown with a display of programs and other items that can be selected by a user. In addition, a search box 302 is shown. Assume now that a user wishes to search their computing device using search box 302. Assume also that they are looking for something having to do with “mail”.



FIG. 4 shows search box 302 after the user has typed in the letter “m”. Notice that the display in the start menu now changes. In this example, a programs portion 304 displays programs that include a word that starts with the letter “m”. Similarly, a favorites and history portion 306 displays web related items that have a word that starts with the letter “m”.



FIG. 5 shows search box 302 after the user has typed in the letters “ma”. Notice that the display in the start menu now changes from that which is shown in FIG. 4. In this example, programs portion 304 displays programs that include a word that starts with the letters “ma”. Similarly, a favorites and history portion 306 displays web related items that have a word that starts with the letters “ma”. Additionally, a files portion 308 displays files that either have a word in their name that starts with “ma”, or have a word in them that starts with “ma”. In this particular example, the files portion was not shown when only one letter was typed in. The reason for this is that the result set of files that include a particular letter is potentially very large so as to present information to the user that is of questionable value. Accordingly, by waiting to display the file portions results until multiple letters have been typed in, certain efficiencies may be gained.



FIG. 6 shows search box 302 after the user has typed in the letters “mai”. Notice that the display in the start menu now changes from that which is shown in FIG. 5. In this example, programs portion 304 displays a single program that include a word that starts with the letters “mai”. Files portion 308 displays a file that contains a word that starts with “mai”. Notice here that the favorites and history portion has been removed from the user interface because it contains no items that have a word that starts with “mai”.



FIG. 7 shows search box 302 after the user has rounded out their search term by typing in the term “mail”. Here, the display has not changed from that which is shown in FIG. 6.


By presenting search results in a letter-by-letter fashion, the user can instantly see their search results as they develop. In addition, by combining different types of search engines for different types of data sets, the collective search space can be quickly and efficiently searched.


Adapting the Search Engine Type Based on Data Set Characteristics


In at least some embodiments, the search engine type that is used to search a particular data set can be changed when the characteristics associated with that data set change in a manner which indicates that a different search engine would be more efficient. For example, as noted above, a linear search can be used for data sets that are relatively small. If, however, the data grows over time and assumes a size that lends itself more readily to an index search, then a different search engine can be selected for searching that particular data set. In this case, a size threshold can be set and if the data set exceeds the defined size threshold, then a different search engine can be used.


Changing Search Engines Based on the Length of the Query String


In at least some embodiments, the search engine that is used can be changed based on the length of the query string that is entered by the user. For example, the search tool might use a search engine to conduct a linear search of programs for query strings that are two letters or less, and then switch to a search engine that uses an index search for three letters or more.


Exemplary Method



FIG. 8 is a flow diagram that describes steps in a method in accordance with one embodiment. The method can be implemented in connection with any suitable hardware, software, firmware or combination thereof. In at least one embodiment, the method can be implemented by a suitably configured search tool that comprises part of a start menu.


Step 800 presents a start menu user interface having a search tool that is includes a search box. Step 802 receives a letter that is entered by a user. Step 804 selects a first search engine to conduct a search across one or more data sets. Examples of data sets are given above.


Step 806 displays search results, associated with the letter, to the user. But one way of displaying the search results is to display the results in accordance with pre-defined categories or groups that make up subject matter that a user is likely to want to see. But one example of such groups is given above.


Step 808 determines whether the user has typed any additional letters. This step can be implemented by defining a short period of time and then ascertaining whether, during this period of time, the user types any additional letters. If the user types additional letters, then the method returns to step 802 and repeats the process described above. If, on the other hand, the user does not type any additional letters, step 810 selects a second or additional search engine(s) to conduct a search across one or more data sets. The data sets across which the second or additional search engine(s) search can be the same as or different from those searched using the first search engine.


Step 812 displays additional search results to the user. This step can be performed in any suitable way. Step 814 ascertains whether there are any additional letters entered by the user. If so, the method returns to step 802 and continues the search. If not, the method can return to step 812 which simply displays or continues to display the search results developed for the user.


In practice, in at least this embodiment, the first search that is conducted is a fast search, e.g. a linear search on small data set, whose results are returned to the user. If the user does not type additional letters, then a slower search, e.g. an index search on a large data set, is performed. One advantage of this approach is that at least some results are returned to the user very quickly. Additionally, if the user has typed more letters, then time and resources are not wasted performing the slower search on an inappropriate string.


CONCLUSION

Various embodiments provide a search tool that utilizes multiple different search engines. The individual search engines are configured to conduct searches in different ways across a search space that includes different types of data sets. In at least some embodiments, the type of search engine that is utilized is a function of characteristics of the data set(s) that is (are) to be searched. In search spaces that include different types of data sets, combining and mixing different search engines to collectively search the search space can provide a desirably fast and robust user experience.


Although the invention has been described in language specific to structural features and/or methodological steps, it is to be understood that the invention defined in the appended claims is not necessarily limited to the specific features or steps described. Rather, the specific features and steps are disclosed as preferred forms of implementing the claimed invention.

Claims
  • 1. A computer-implemented method comprising: receiving user search input;responsive to said receiving, selecting a first search engine to conduct a search across a data set comprising part of a search space;responsive to said receiving, selecting at least one other search engine to conduct a search across another data set comprising part of the search space;using said search engines to conduct a search of the search space; andpresenting results of the search to a user.
  • 2. The method of claim 1, wherein said acts of selecting are performed as a function of a data set's size.
  • 3. The method of claim 1, wherein said acts of searching are conducted on a user's computing device.
  • 4. The method of claim 1, wherein one of the search engines comprises a search engine configured to conduct a linear search.
  • 5. The method of claim 1, wherein one of the search engines comprises a search engine configured to conduct an index search.
  • 6. The method of claim 1, wherein one of the search engines comprises a search engine configured to conduct a linear search and another of the search engines comprises a search engine configured to conduct an index search.
  • 7. The method of claim 1, wherein said acts of using and presenting can be performed on a letter-by-letter basis.
  • 8. A computer-implemented method comprising: presenting a start menu user interface having a search tool being configured to utilize multiple different search engines individual ones of which being configured to conduct different types of searches;receiving, via the search tool, user search input;responsive to said receiving, selecting a first search engine to conduct a search across one or more data sets;responsive to said receiving, selecting at least a second search engine to conduct a search across one or more data sets;using said search engines to conduct a search; anddisplaying search results to the user.
  • 9. The method of claim 8, wherein said one or more data sets searched by the first search engine are different from said one or more data sets searched by the second search engine.
  • 10. The method of claim 8, wherein said first search engine comprises a search engine configured to conduct a linear search.
  • 11. The method of claim 8, wherein said second search engine comprises a search engine configured to conduct an index search.
  • 12. The method of claim 8, wherein said acts of selecting are performed as a function of the size of a data set to be searched.
  • 13. The method of claim 8, wherein said acts of using and displaying can be performed on a letter-by-letter basis.
  • 14. The method of claim 8 further comprising changing a search engine type associated with a data set when a characteristic of that data set changes.
  • 15. The method of claim 8 further comprising changing a search engine for conducting a search based on a query string length entered by the user.
  • 16. The method of claim 8, wherein the act of displaying comprises displaying search results in defined groups.
  • 17. The method of claim 16, wherein said defined groups include programs, web-related items, files, and communications.
  • 18. A computer-readable medium comprising computer-readable instructions which, when executed, implement a search tool comprising: multiple different search engines individual ones of which being configured to be selected and used to conduct different types of searches across different types of data sets,wherein the search tool is configured to select a search engine as a function of a data set size.
  • 19. The computer-readable medium of claim 18, wherein the search tool is configured to display search results to a user on a letter-by-letter basis.
  • 20. The computer-readable medium of claim 18, wherein the search tool comprises part of a start menu.