The present invention relates generally to a search and retrieval system, and more particularly, to an intelligent query system and method used in a search and retrieval system.
Existing search query systems have been designed to help provide comprehensive search and retrieval services. However, terms or phrases used by writers may extend to different meanings that belong to different categories. For example, many documents contain phrases “strike outs” or “home run.” These terms are generally related to baseball. Occasionally, these terms are also used when evaluating the performance of financial equities analysts, such as “Those Internet picks were major strike outs”, or “Choosing MSFT back in '86 was a real home run.”
In the existing search and retrieval systems, the documents that contain “strike outs” or “home run” in the above example, whether they are baseball documents or financial documents, are searched and retrieved. Readers can be very frustrated by wasting a lot of time in reading the irrelevant documents.
Therefore, there is a need for an intelligent query system and method that is used in a search and retrieval system capable of providing an intelligent and efficient search and retrieval.
The present invention provides an intelligent query system and method used in a search and retrieval system with a document feed and a categorization engine.
In one embodiment of the present invention, documents about baseball are marked with a taxonomy element “BASE”, and those about equities are marked with “EQUITIES”. Accordingly, the intelligent query system of the present invention recognizes that the phrases “strike outs” and “home run” are much more strongly correlated with “BASE” as opposed to “EQUITIES.” Therefore, when a search is conducted or a lookup is done in a map, the system recommends the strongest correlation as “BASE.”
In one embodiment of the present invention, an intelligent query (“IQ”) method comprises the steps of:
With the data collected from the above process, an IQ map can be generated by the following steps:
One exemplary PCF-IPCDF scoring system or model is described in the patent application, U.S. Utility application Ser. No. 11/060,928, U.S. Publication No. 2005/0187923, filed on Feb. 18, 2005, the subject matter of which is hereby incorporated by reference.
The map structure can be loaded into applications which benefit from being able to deduce relevant taxonomy elements from terms. Such applications include, but not limited to, search engines and tracking engines.
Some exemplary uses of the map (or IQ map) include guiding a user toward relevant search topics, presenting a user with a list of related taxonomy terms, and/or transparently focusing a search for a user.
Therefore, in the above baseball example, the intelligent query system of the present invention recognizes that the phrases “strike outs” and “home run” are much more strongly correlated with “BASE” as opposed to “EQUITIES.” Therefore, when a lookup is done in the map, the system recommends the strongest correlation as “BASE.”
These and other features and advantages of the present invention will become apparent to those skilled in the art from the attached detailed descriptions, wherein it is shown, and described illustrative embodiments of the present invention, including best modes contemplated for carrying out the invention. As it will be realized, the invention is capable of modifications in various obvious aspects, all without departing from the spirit and scope of the present invention. Accordingly, the descriptions are to be regarded as illustrative in nature and not restrictive.
The present invention provides an intelligent query system and method used in a search and retrieval system with a document feed and a categorization engine.
It is noted that an exemplary PCF-IPCDF scoring system or model has been described in the co-pending patent application, U.S. Utility application Ser. No. 11/060,928, U.S. Publication No. 2005/0187923, filed on Feb. 18, 2005, the subject matter of which is hereby incorporated by reference.
In particular, a Phrase-Code Frequency-Inverse Phrase-Code Document Frequency (PCF-IPCDF) module in accordance with the present invention selects the codes for improving user searches. The system outputs the codes or restricts sources of the query and thereby improve very simply specified searches.
Definitions of certain terms are as follows:
The map structure can be loaded into applications which benefit from being able to deduce relevant taxonomy elements from terms. Such applications include, but not limited to, search engines and tracking engines.
As a result, documents about baseball are marked with a taxonomy element “BASE”, and those about equities are marked with “EQUITIES”. The intelligent query system of the present invention recognizes that the phrases “strike outs” and “home run” are much more strongly correlated with “BASE” as opposed to “EQUITIES.” Therefore, when a search is conducted or a lookup is done in a map, the system recommends the strongest correlation as “BASE.”
One of the advantages of the present invention is that it provides end-users the most relevant, meaningful, up-to-date, and precise search results.
Another advantage of the present invention is that an end-user is able to benefit from an experienced recommendation that is tailored to a specific industry.
These and other features and advantages of the present invention will become apparent to those skilled in the art from the attached detailed descriptions, wherein it is shown, and described illustrative embodiments of the present invention, including best modes contemplated for carrying out the invention. As it will be realized, the invention is capable of modifications in various obvious aspects, all without departing from the spirit and scope of the present invention. Accordingly, the above detailed descriptions are to be regarded as illustrative in nature and not restrictive.
This application claims the benefit of U.S. Provisional Application No. 60/590,247, entitled “INTELLIGENT QUERY SYSTEM AND METHOD USING PHRASE-CODE FREQUENCY-INVERSE PHRASE-CODE DOCUMENT FREQUENCY MODULE”, filed on Jul. 22, 2004, the subject matter of which is hereby incorporated by reference; and this application is also related to a co-pending patent application, U.S. Utility application Ser. No. 11/060,928, U.S. Publication No. 2005/0187923, filed on Feb. 18, 2005, the subject matter of which is hereby incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
5542090 | Henderson et al. | Jul 1996 | A |
5754939 | Herz et al. | May 1998 | A |
5924090 | Krellenstein | Jul 1999 | A |
5960422 | Prasad | Sep 1999 | A |
6038561 | Snyder et al. | Mar 2000 | A |
6067552 | Yu | May 2000 | A |
6233575 | Agrawal et al. | May 2001 | B1 |
6260041 | Gonzalez et al. | Jul 2001 | B1 |
6292830 | Taylor et al. | Sep 2001 | B1 |
6332141 | Gonzalez et al. | Dec 2001 | B2 |
6418433 | Chakrabarti et al. | Jul 2002 | B1 |
6711585 | Copperman et al. | Mar 2004 | B1 |
6735583 | Bjarnestam et al. | May 2004 | B1 |
6868525 | Szabo | Mar 2005 | B1 |
6873990 | Oblinger | Mar 2005 | B2 |
6961737 | Ritchie et al. | Nov 2005 | B2 |
7035864 | Ferrari et al. | Apr 2006 | B1 |
7146361 | Broder et al. | Dec 2006 | B2 |
7266548 | Weare | Sep 2007 | B2 |
20010000356 | Woods | Apr 2001 | A1 |
20020087565 | Hoekman et al. | Jul 2002 | A1 |
20030014405 | Shapiro et al. | Jan 2003 | A1 |
20030154196 | Goodwin et al. | Aug 2003 | A1 |
20030172059 | Andrei | Sep 2003 | A1 |
20030212666 | Basu et al. | Nov 2003 | A1 |
20030217052 | Rubenczyk et al. | Nov 2003 | A1 |
20040024790 | Everett | Feb 2004 | A1 |
20040060426 | Weare et al. | Apr 2004 | A1 |
20040267718 | Milligan et al. | Dec 2004 | A1 |
20050060312 | Curtiss et al. | Mar 2005 | A1 |
20050097075 | Hoekman et al. | May 2005 | A1 |
20050187923 | Cipollone | Aug 2005 | A1 |
Number | Date | Country | |
---|---|---|---|
20060031218 A1 | Feb 2006 | US |
Number | Date | Country | |
---|---|---|---|
60590247 | Jul 2004 | US |