Claims
- 1. A method for creating a cross-idea association database comprising:
providing a first document in a first language and a second document in a second language, wherein said documents include parallel or comparable text with respect to each other;
locating in the first document all occurrences of a recurring word string;
translating the recurring word string into the second language to produce a recurring word string translation;
defining initial testing ranges in the second document corresponding to occurrences of the recurring word string in the first document, wherein the initial testing ranges include a desired number of words;
comparing words in the recurring word string translation with words in the initial testing ranges to identify matching words;
increasing the number of words in the initial testing ranges to form expanded testing ranges and comparing words in the recurring word string translation with words in the expanded testing ranges to identify matching words; and
identifying the expanded testing range as the final range if the number of matching words in the expanded testing ranges is not greater than the number of matching words in the initial testing ranges.
- 2. A computer device including a processor, a memory coupled to the processor, and a program stored in the memory, wherein the computer device is configured to execute the program and perform the steps of:
locating in a first document all occurrences of a recurring word string, wherein said first document is in a first language;
translating the recurring word string into a second language to produce a recurring word string translation;
defining initial testing ranges in a second document corresponding to occurrences of the recurring word string in the first document, wherein the second document is in the second language and includes parallel text or comparable text with respect to the first document, and wherein the initial testing ranges include a desired number of words;
comparing words in the recurring word string translation with words in the initial testing ranges to identify matching words;
increasing the number of words in the initial testing ranges to form expanded testing ranges and comparing words in the recurring word string translation with words in the expanded testing ranges to identify matching words; and
identifying the expanded testing range as the final range if the number of matching words in the expanded testing ranges is not greater than the number of matching words in the initial testing ranges.
- 3. A computer readable data storage medium having stored thereon a computer executable program for:
locating in a first document all occurrences of a recurring word string, wherein said first document is in a first language;
translating the recurring word string into a second language to produce a recurring word string translation;
defining initial testing ranges in a second document corresponding to occurrences of the recurring word string in the first document, wherein the second document is in the second language and includes parallel text or comparable text with respect to the first document, and wherein the initial testing ranges include a desired number of words;
comparing words in the recurring word string translation with words in the initial testing ranges to identify matching words;
increasing the number of words in the initial testing ranges to form expanded testing ranges and comparing words in the recurring word string translation with words in the expanded testing ranges to identify matching words; and
identifying the expanded testing range as the final range if the number of matching words in the expanded testing ranges is not greater than the number of matching words in the initial testing ranges.
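By way of illustration only, the steps recited in the claims above (locating occurrences, translating, defining initial testing ranges, comparing, expanding, and identifying a final range) might be sketched in Python as follows. This is a minimal sketch and not the claimed implementation. Several details are assumptions that the claims do not specify: the function names, the `translate` callable (here a toy lookup table), the placement of each testing range at the proportionally corresponding position in the second document, the counting of distinct matching words, and the `initial_size` and `step` parameters.

```python
from typing import Callable, List, Tuple

Words = List[str]


def find_occurrences(words: Words, phrase: Words) -> List[int]:
    """Return the start index of every occurrence of `phrase` in `words`."""
    n = len(phrase)
    return [i for i in range(len(words) - n + 1) if words[i:i + n] == phrase]


def count_matches(translation: Words, window: Words) -> int:
    """Count distinct translation words that also appear in the window."""
    return len(set(translation) & set(window))


def final_range(second: Words, center: int, translation: Words,
                initial_size: int = 2, step: int = 2) -> Tuple[int, int, int]:
    """Expand a testing range around `center` until matches stop increasing.

    Returns (start, end, matches) with `end` exclusive. As in the claims,
    the expanded range is taken as final once its match count is not
    greater than that of the range it was expanded from.
    """
    def window(size: int) -> Tuple[int, int]:
        lo = max(0, center - size // 2)
        return lo, min(len(second), lo + size)

    size = initial_size
    lo, hi = window(size)
    best = count_matches(translation, second[lo:hi])
    while hi - lo < len(second):
        size += step
        lo, hi = window(size)
        matches = count_matches(translation, second[lo:hi])
        if matches <= best:  # no improvement: this expanded range is final
            break
        best = matches
    return lo, hi, best


def build_final_ranges(first: Words, second: Words, phrase: Words,
                       translate: Callable[[Words], Words],
                       initial_size: int = 2,
                       step: int = 2) -> List[Tuple[int, int, int]]:
    """Run the claimed steps for every occurrence of `phrase` in `first`."""
    translation = translate(phrase)
    ranges = []
    for pos in find_occurrences(first, phrase):
        # Assumption: parallel text keeps roughly proportional positions.
        center = pos * len(second) // len(first)
        ranges.append(final_range(second, center, translation,
                                  initial_size, step))
    return ranges


# Toy data; the lookup table stands in for a real translation step.
TOY_TABLE = {("red", "car"): ["coche", "rojo"]}
first_doc = "the red car arrives . the red car leaves".split()
second_doc = "el coche rojo llega . el coche rojo sale".split()
ranges = build_final_ranges(first_doc, second_doc, ["red", "car"],
                            lambda p: TOY_TABLE[tuple(p)])
```

With the toy documents above, each occurrence of "red car" begins with a two-word range containing one matching word, grows to a range containing both "coche" and "rojo", and stops at the next expansion because the match count no longer increases.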
RELATED APPLICATIONS
[0001] This application is a continuation-in-part of U.S. application Ser. No. 10/116,047, filed Apr. 5, 2002, which is a continuation-in-part of U.S. application Ser. No. 10/024,473, filed Dec. 21, 2001, and claims the benefit of U.S. Provisional Application No. 60/276,107, filed Mar. 16, 2001, and U.S. Provisional Application No. 60/299,472, filed Jun. 21, 2001, all of which are hereby incorporated by reference.
Provisional Applications (2)
Number | Date | Country
60276107 | Mar 2001 | US
60299472 | Jun 2001 | US
Continuation in Parts (2)
Relation | Number | Date | Country
Parent | 10116047 | Apr 2002 | US
Child | 10146441 | May 2002 | US
Parent | 10024473 | Dec 2001 | US
Child | 10116047 | Apr 2002 | US