Claims
- 1. A method comprising:
receiving an input text segment in a source language; and translating the input text segment to a translated text segment in a target language using both a statistical decoding algorithm and text strings in a translation memory.
- 2. The method of claim 1, further comprising:
generating a cover phrase for the input text segment from one or more text segments in the translation memory.
- 3. The method of claim 1, further comprising:
using said cover as an initial translation for the statistical decoding algorithm.
- 4. The method of claim 1, wherein said translating comprises:
matching at least a portion of the input text segment with a first text segment in a translation memory; replacing the at least a portion of the input text segment with a text segment in a target language in the translation memory corresponding to the first text segment; generating an initial target language translation of the input text segment; and generating a target language translation by performing a statistical machine translation decoding operation on the initial translation.
- 5. The method of claim 4, wherein said generating the initial target language translation comprises replacing one or more words not in said at least a portion of the input text segment with a with a word-for-word translation.
- 6. The method of claim 1, wherein the translation memory comprises a statistical translation memory.
- 7. The method of claim 1, further returning a final target language translation when the entire input text sentence matches a text segment in the translation memory.
- 8. The method of claim 4, wherein said performing the statistical machine translation decoding operation comprises:
performing one or more sentence modification operations to the current target language translation to generate one or more modified target language translations, said current target language translation comprising the initial target language translation in a first instance; determining whether one or more of the modified target language translations represents an improved translation in comparison with the current target language translation; setting a modified target language translation as the current target language translation; and repeating said applying, said determining and said setting until occurrence of a termination condition.
- 9. The method of claim 8, wherein said performing one or more sentence modification operations comprises changing in the current target language translation the translation of one or two words.
- 10. The method of claim 8, wherein said performing one or more sentence modification operations comprises changing in the current target language translation a translation of a word and concurrently inserting another word at a position that yields an alignment of highest probability between the source language text segment and the current target language translation, the inserted other word having a high probability of having a zero-value fertility.
- 11. The method of claim 8, wherein said performing one or more sentence modification operations comprises deleting from the current target language translation a word having a zero-value fertility.
- 12. The method of claim 8, wherein said performing one or more sentence modification operations comprises modifying an alignment between the source language text segment and the current target language translation by swapping non-overlapping target language word segments in the current target language translation.
- 13. The method of claim 8, wherein said performing one or more sentence modification operations comprises modifying an alignment between the source language text segment and the current target language translation by eliminating a target language word from the current target language translation and linking words in the source language text segment.
- 14. The method of claim 8, wherein the termination condition comprises a determination that a probability of correctness of a modified target language translation is no greater than a probability of correctness of the current target language translation.
- 15. The method of claim 8, wherein the termination condition comprises a completion of a predetermined number of iterations.
- 16. The method of claim 8, wherein the termination condition comprises a lapse of a predetermined amount of time.
- 17. Apparatus comprising:
a translation memory including a plurality of translation pairs, each translation pair including a text segment in a source language and a corresponding text segment in a target language; and a decoder operative to generate a cover of an input text segment in the source language from one or more text segments in the translation memory and to generate a translation in the target language from said cover using a statistical decoding algorithm.
- 18. The apparatus of claim 17, wherein the translation memory comprises a statistical translation memory.
- 19. The apparatus of claim 17, wherein the statistical decoding algorithm comprises a greedy decoding algorithm.
- 20. The apparatus of claim 17, wherein the generating the cover comprises replacing the at least a portion of the input text segment with a text segment in a target language in the translation memory corresponding to the first text segment and replacing one or more words not in said at least a portion of the input text segment with a gloss.
- 21. The apparatus of claim 17, wherein the statistical decoding algorithm comprises:
performing one or more sentence modification operations to the current target language translation to generate one or more modified target language translations, said current target language translation comprising the initial target language translation in a first instance; determining whether one or more of the modified target language translations represents an improved translation in comparison with the current target language translation; setting a modified target language translation as the current target language translation; and repeating said applying, said determining and said setting until occurrence of a termination condition.
- 22. The apparatus of claim 21, wherein the termination condition comprises a determination that a probability of correctness of a modified target language translation is no greater than a probability of correctness of the current target language translation.
- 23. The apparatus of claim 21, wherein the termination condition comprises a completion of a predetermined number of iterations.
- 24. The apparatus of claim 21, wherein the termination condition comprises a lapse of a predetermined amount of time.
- 25. An article comprising a machine-readable medium including machine-executable instructions, the instructions operative to cause a machine to:
receive an input text segment in a source language; and translate the input text segment to a translated text segment in a target language using both a statistical decoding algorithm and text strings in a translation memory.
- 26. The article of claim 25, further comprising instructions operative to cause the machine to:
generate a cover phrase for the input text segment from one or more text segments in the translation memory.
- 27. The article of claim 26, further comprising instructions operative to cause the machine to:
use said cover phrase as an initial translation for the statistical decoding algorithm.
- 28. The article of claim 25, wherein the instructions operative to cause the machine to translate include instructions operative to cause the machine to:
match at least a portion of the input sentence with a first text segment in a translation memory; replace the at least a portion of the input text segment with a text segment in a target language in the translation memory corresponding to the first text segment; generate an initial target language translation of the input text segment; and generate a target language translation by performing a statistical machine translation decoding operation on the initial translation.
- 29. The article of claim 28, wherein the instructions for generating the initial target language translation include instructions operative to cause the machine to replace one or more words not in said at least a portion of the input text segment with a word-for-word translation.
- 30. The article of claim 25, wherein the translation memory comprises a statistical translation memory.
- 31. The article of claim 25, further comprising instructions operative to cause the machine to return a final target language translation when the entire input text sentence matches a text segment in the translation memory.
- 32. The article of claim 25, wherein the instructions for performing the statistical machine translation decoding operation include instructions operative to cause the machine to:
perform one or more sentence modification operations to the current target language translation to generate one or more modified target language translations, said current target language translation comprising the initial target language translation in a first instance; determine whether one or more of the modified target language translations represents an improved translation in comparison with the current target language translation; set a modified target language translation as the current target language translation; and repeat said applying, said determining and said setting until occurrence of a termination condition.
- 33. The article of claim 32, wherein the instructions for performing one or more sentence modification operations include instructions operative to cause the machine to changing in the current target language translation the translation of one or two words.
- 34. The article of claim 32, wherein the instructions for performing one or more sentence modification operations include instructions operative to cause the machine to change in the current target language translation a translation of a word and concurrently inserting another word at a position that yields an alignment of highest probability between the source language text segment and the current target language translation, the inserted other word having a high probability of having a zero-value fertility.
- 35. The article of claim 32, wherein the instructions for performing one or more sentence modification operations include instructions operative to cause the machine to delete from the current target language translation a word having a zero-value fertility.
- 36. The article of claim 32, wherein the instructions for performing one or more sentence modification operations include instructions operative to cause the machine to modify an alignment between the source language text segment and the current target language translation by swapping non-overlapping target language word segments in the current target language translation.
- 37. The article of claim 32, wherein the instructions for performing one or more sentence modification operations include instructions operative to cause the machine to modify an alignment between the source language text segment and the current target language translation by eliminating a target language word from the current target language translation and linking words in the source language text segment.
- 38. The article of claim 32, wherein the termination condition comprises a determination that a probability of correctness of a modified target language translation is no greater than a probability of correctness of the current target language translation.
- 39. The article of claim 32, wherein the termination condition comprises a completion of a predetermined number of iterations.
- 40. The article of claim 32, wherein the termination condition comprises a lapse of a predetermined amount of time.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of, and incorporates herein, U.S. Provisional Patent Application No. 60/291,853, filed May 17, 2001, and U.S. patent application Ser. No. 09/854,327, filed May 11, 2001.
ORIGIN OF INVENTION
[0002] The research and development described in this application were supported by DARPA-ITO under grant number N66001-00-1-9814. The U.S. Government may have certain rights in the claimed inventions.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60291853 |
May 2001 |
US |