Claims
- 1. A computer-implemented method for selectively retrieving text strings from a plurality of stored text strings contained in a document set stored on a data storage medium accessible by a computer processor, said method comprising the steps of:
- A. transmitting a text string query defined by a user to the processor;
- B. generating a dissimilarity value D.sub.i associated with the user-defined text string query and each of selected ones of the text strings in the stored document set in accordance with a first set of rules, wherein each dissimilarity value D.sub.i is a function of at least one rule R.sub.n from the first set of rules and a weight value w.sub.n associated with each R.sub.n, wherein each weight value indicates the relative importance of the associated rule in predicting a desired outcome of the retrieval;
- C. retrieving a text string set, including at least one retrieved text string, from the selected text strings in accordance with a second set of rules, based upon the dissimilarity values D.sub.i generated in step B;
- D. generating a display of the retrieved text strings in a display order;
- E. said user choosing at least one of the retrieved text strings from the displayed text string set for further processing; and
- F. automatically adjusting the weight value w.sub.n, defining an adjusted weight value w.sub.n', associated with at least one rule of the first set of rules, based upon the results of the choosing step.
- 2. The method of claim 1, wherein, for subsequent retrievals, the dissimilarity value D.sub.i assigned to each selected text string uses the adjusted weight value w.sub.n'.
- 3. The method of claim 1, wherein $D_i = \sum_{n=1}^{k} r_n w_n$, where r.sub.n is the number of times R.sub.n must be applied to the selected text string to create a test string that matches the text string query, and k is the total number of rules in the first set of rules.
- 4. The method of claim 3, wherein the step of generating a display comprises generating a display of the retrieved text strings in an increasing order based on the dissimilarity value D.sub.i associated with each retrieved text string.
- 5. The method of claim 3, wherein the step of adjusting the weight value comprises:
- determining a measure of correspondence C between a mean dissimilarity value D.sub.i associated with chosen text strings and a mean dissimilarity value D.sub.m associated with non-chosen text strings.
- 6. The method of claim 5, further comprising increasing the measure C by selectively adjusting the weight values w.sub.n associated with selected ones of the rules in the first set of rules.
- 7. The method of claim 6, further comprising normalizing at least one of the weight values w.sub.n.
- 8. The method of claim 1, wherein the step of adjusting the weight value w.sub.n comprises:
- (a) decreasing the dissimilarity value associated with each of the chosen text strings; and
- (b) increasing the dissimilarity value associated with each of the non-chosen text strings.
- 9. A computer system for selectively retrieving text strings from a plurality of stored text strings contained in a document set stored on a data storage medium accessible by a computer processor, said system comprising:
- A. data input means for transmitting a text string query defined by a user to the processor;
- B. a first storage medium for storing a first set of rules, each rule having an associated weight value w.sub.n, wherein each weight value indicates the relative importance of the associated rule in predicting a desired outcome of the retrieval, and a second storage medium for storing a second set of rules;
- C. means for associating a dissimilarity value D.sub.i with each of selected ones of text strings from the data storage medium, wherein each dissimilarity value D.sub.i is a function of at least one rule R.sub.n from the first set of rules and the weight value w.sub.n associated with said R.sub.n;
- D. means for retrieving a text string set, including at least one text string, from the selected text strings, in accordance with the second set of rules, based upon the dissimilarity values D.sub.i;
- E. a display device for generating a display of the retrieved text strings in a display order;
- F. choosing means for allowing said user to select at least one of the retrieved text strings for further processing; and
- G. coupled to the choosing means and responsive to said user's selection of said at least one retrieved text string for further processing, means for automatically adjusting at least one weight value w.sub.n, defining an adjusted weight value w.sub.n'.
- 10. The system of claim 9, wherein the dissimilarity value D.sub.i associated with each chosen text string includes the adjusted weight value w.sub.n'.
- 11. The system of claim 9, wherein $D_i = \sum_{n=1}^{k} r_n w_n$, where r.sub.n is the number of times R.sub.n must be applied to the selected text strings to create a text string that matches the text string query, and k is the total number of rules in the first set of rules.
- 12. The system of claim 11, wherein the display device comprises means for displaying the retrieved text strings in an increasing order based on the dissimilarity value D.sub.i associated with each retrieved text string.
- 13. The system of claim 11, further comprising: means, associated with the first storage medium, for generating and storing a measure of correspondence C between a mean dissimilarity value D.sub.i associated with chosen text strings and a mean dissimilarity value D.sub.m associated with non-chosen text strings.
- 14. The system of claim 13, further comprising means for increasing the measure C by selectively adjusting at least one weight value w.sub.n.
- 15. The system of claim 14, further comprising normalizing means for normalizing at least one weight value w.sub.n.
- 16. The system of claim 9, wherein the adjusting means comprises:
- (a) means for decreasing the dissimilarity value associated with each of the chosen text strings; and
- (b) means for increasing the dissimilarity value associated with each of the non-chosen text strings.
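The retrieval and weight-adjustment loop recited in the claims above can be illustrated with a minimal Python sketch. It is not the claimed implementation. Several parts are assumptions made only for illustration: the rule names (`insert`, `delete`, `substitute`), the weighted-sum form of D.sub.i, a simple threshold standing in for the second set of rules, the measure of correspondence C taken as the difference between the mean dissimilarity of non-chosen and chosen strings, and the step-and-rescale update in `adjust_weights`. None of these specifics are fixed by the claim text.

```python
from typing import Dict, List, Set

RuleCounts = Dict[str, int]   # r_n: times rule R_n is applied to one stored string
Weights = Dict[str, float]    # w_n: weight associated with rule R_n


def dissimilarity(counts: RuleCounts, weights: Weights) -> float:
    """D_i as a weighted sum of rule applications (assumed form)."""
    return sum(weights[rule] * r for rule, r in counts.items())


def retrieve(candidates: Dict[str, RuleCounts], weights: Weights,
             threshold: float) -> List[str]:
    """Keep strings with D_i <= threshold (a stand-in for the second set
    of rules) and return them in increasing order of D_i."""
    scored = sorted((dissimilarity(c, weights), s) for s, c in candidates.items())
    return [s for d, s in scored if d <= threshold]


def correspondence(candidates: Dict[str, RuleCounts], weights: Weights,
                   chosen: Set[str]) -> float:
    """Assumed measure C: mean D of non-chosen minus mean D of chosen strings."""
    others = set(candidates) - chosen
    if not chosen or not others:
        return 0.0

    def mean_d(names: Set[str]) -> float:
        return sum(dissimilarity(candidates[s], weights) for s in names) / len(names)

    return mean_d(others) - mean_d(chosen)


def adjust_weights(candidates: Dict[str, RuleCounts], weights: Weights,
                   chosen: Set[str], step: float = 0.2) -> Weights:
    """Move each w_n in the direction that increases C, then rescale so the
    weights keep their original total (a simple normalization)."""
    others = set(candidates) - chosen
    if not chosen or not others:
        return dict(weights)  # no feedback signal, leave weights unchanged

    def mean_r(names: Set[str], rule: str) -> float:
        return sum(candidates[s].get(rule, 0) for s in names) / len(names)

    adjusted = {}
    for rule, w in weights.items():
        # Under the weighted-sum assumption, dC/dw_n is the difference in
        # mean rule usage between non-chosen and chosen strings.
        grad = mean_r(others, rule) - mean_r(chosen, rule)
        adjusted[rule] = max(w + step * grad, 1e-9)  # keep weights positive
    scale = sum(weights.values()) / sum(adjusted.values())
    return {rule: w * scale for rule, w in adjusted.items()}


# Hypothetical rule counts for three stored strings against the query "color".
candidates = {
    "colour": {"insert": 0, "delete": 1, "substitute": 0},
    "dolor": {"insert": 0, "delete": 0, "substitute": 1},
    "collar": {"insert": 0, "delete": 1, "substitute": 1},
}
weights = {"insert": 1.0, "delete": 1.0, "substitute": 1.0}
shown = retrieve(candidates, weights, threshold=2.0)
new_weights = adjust_weights(candidates, weights, chosen={"colour"})
```

In this toy data the user picks "colour", so the rule used by the chosen string loses weight and the rule used mostly by non-chosen strings gains weight. The update is a heuristic: it increases C here, but nothing in the sketch guarantees that every chosen string's dissimilarity falls for arbitrary inputs.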
Parent Case Info
This is a continuation of U.S. patent application Ser. No. 08/110,093, filed on Aug. 20, 1993, and now abandoned. Another related application is U.S. patent application Ser. No. 08/330,968, filed on Oct. 28, 1994, which application is a continuation-in-part of U.S. patent application Ser. No. 08/110,093, now abandoned.
US Referenced Citations (3)
Number | Name | Date | Kind
4453217 | Boivie | Jun 1984 |
5008818 | Bocast | Apr 1991 |
5099425 | Kanno: Yuji et al. | Mar 1992 |
Foreign Referenced Citations (1)
Number | Date | Country
0271664A2 | Jun 1988 | EPX
Continuations (1)
| Number | Date | Country
Parent | 110093 | Aug 1993 |