Claims
- 1. A method of creating and searching a database of documents in a plurality of supported languages, the method comprising the steps of:
using a keyword dictionary to extract keywords from each of the documents; translating the keywords into the supported languages; creating a listing of keywords for each of the documents in each supported language; using the keywords in each supported language to create an inverted index of the documents including the keywords, and ranking factor information about the documents; retrieving documents in any supported language by interrogating the inverted index with search terms in one supported language; and ranking the documents using the ranking factor information of the documents contained in the inverted index.
- 2. A method including the steps of claim 1, wherein the step of using the keywords to create the inverted index includes the step of using the Unicode system to manage supported languages.
- 3. A method of searching a database including the steps of claim 1, wherein the step of generating an inquiry includes identifying the language in which it is presented.
- 4. A method of searching a database including documents written in more than one language, the steps of the method comprising:
creating a searching tool identifying keyword search terms on each document aligned with their keywords with a synonymous meaning in other supported languages; identifying search documents using keyword search terms in any supported language to interrogate the searching tool using a search engine responsive to keyword search terms in both the search language and other supported languages; and providing a list of retrieved documents in multiple languages based on the search terms and their aligned synonyms.
- 5. The method of claim 4 including the step of converting the search results into another language.
- 6. The method of claim 5, wherein the method further includes the step of providing an ordered list of documents based on ranking information in the search tool.
- 7. The method of claim 4, wherein the step of providing the list of retrieved documents includes providing each document in a supported language selected by the searcher when the document exists in that language.
- 8. A system for searching a database of documents based on an input search, the system comprising:
a list of keywords which includes, for each of a plurality of documents in the database, keywords with synonymous meanings in at least a first and a second language; an inverted index for the plurality of documents of the database including the keywords with synonymous meaning in the list; and a search engine responsive to keywords in both the first and second languages, said search engine using the inverted index to obtain a set of results which include the documents which correspond to input keyword search terms in the first or input language and which includes documents which are in the second language.
- 9. A system of the type described in claim 8, wherein the list is a two-way list which is usable in converting a search term from a first language to a second language and from a second language to a first language.
- 10. A searching system of the type described in claim 8, wherein the converting of search terms uses the Unicode system for managing multiple languages.
- 11. A computer program on a computer usable medium for searching a database of documents based on an input search, the software comprising:
software for a list of keywords which includes, for each of a plurality of documents in the database, keywords with synonymous meanings in at least a first and a second language; software for an inverted index for the plurality of documents of the database including the keywords with synonymous meaning in the list; and software for a search engine responsive to keywords in both the first and second languages, said search engine using the inverted index to obtain a set of results which include the documents which correspond to input keyword search terms in the first or input language and which includes documents which are in the second language.
- 12. The computer program of the type described in claim 11, wherein the list is a two-way list which is usable in converting a search term from a first language to a second language and from a second language to a first language.
- 13. A computer program of the type described in claim 11, wherein the converting of search terms uses the Unicode system for managing multiple languages.
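The method of claim 1 and the two-way list of claims 9 and 12 can be illustrated with a minimal sketch. It is not the implementation described in the application. The two-language dictionary, the sample documents, the function names, and the use of keyword occurrence counts as the "ranking factor information" are all assumptions made for illustration; the claims do not specify what the ranking factor is. Python strings are Unicode, which stands in here for the Unicode handling of claims 2, 10 and 13.

```python
from collections import defaultdict

# Hypothetical keyword dictionary: concept id -> term in each supported language.
KEYWORD_DICTIONARY = {
    "printer": {"en": "printer", "fr": "imprimante"},
    "driver": {"en": "driver", "fr": "pilote"},
}

# Hypothetical documents: document id -> (language, text).
DOCUMENTS = {
    "doc-en": ("en", "Install the printer driver before connecting the printer."),
    "doc-fr": ("fr", "Le pilote doit être installé."),
}


def build_term_lookup(dictionary):
    """Map (language, term) -> concept id, so the list is usable in both directions."""
    lookup = {}
    for concept, terms in dictionary.items():
        for lang, term in terms.items():
            lookup[(lang, term)] = concept
    return lookup


def extract_keywords(text, lang, lookup):
    """Return concept id -> occurrence count for dictionary terms found in the text."""
    counts = defaultdict(int)
    for token in text.lower().split():
        concept = lookup.get((lang, token.strip(".,;:!?")))
        if concept is not None:
            counts[concept] += 1
    return counts


def build_inverted_index(documents, dictionary):
    """Index every document under its keywords in every supported language.

    Each posting stores the occurrence count, used here as an assumed ranking factor.
    """
    lookup = build_term_lookup(dictionary)
    index = defaultdict(dict)
    for doc_id, (lang, text) in documents.items():
        for concept, count in extract_keywords(text, lang, lookup).items():
            for term_lang, term in dictionary[concept].items():
                index[(term_lang, term)][doc_id] = count
    return index


def search(index, query, lang):
    """Return document ids in any language, ordered by the stored ranking factor."""
    scores = defaultdict(int)
    for token in query.lower().split():
        for doc_id, count in index.get((lang, token), {}).items():
            scores[doc_id] += count
    # Highest score first; ties are broken by document id for a stable order.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


index = build_inverted_index(DOCUMENTS, KEYWORD_DICTIONARY)
print(search(index, "imprimante", "fr"))
print(search(index, "printer driver", "en"))
```

In this sketch a French query for "imprimante" retrieves the English document, because the document's keywords were indexed under their translations when the index was built. Translating at index time rather than at query time is one possible reading of the claimed steps, chosen here because claim 1 lists translation before index creation.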
RELATED APPLICATIONS
[0001] This application is a continuation-in-part of U.S. patent application, Ser. No. 10/066,346 (CHA920010230US1) filed on Feb. 1, 2002 and entitled “Method and System for Searching a Multi-lingual Database”.
Continuation in Parts (1)
| | Number | Date | Country |
|---|---|---|---|
| Parent | 10066346 | Feb 2002 | US |
| Child | 10180195 | Jun 2002 | US |