Claims
- 1. A method comprising:
training a phrase-based joint probability model with a parallel corpus comprising a plurality of parallel text segments in two languages.
- 2. The method of claim 1, further comprising:
determining high frequency n-grams in a sentence pair comprising E and F; initializing a t-distribution table; performing a Viterbi-based Expectation Maximum training procedure; and deriving a conditional probability model.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority to U.S. Provisional Application Serial No. 60/368,450, filed on Mar. 27, 2002, the disclosure of which is incorporated by reference.
ORIGIN OF INVENTION
[0002] The research and development described in this application were supported by DARPA-ITO under grant number N66001-00-1-9814 and by NSF-STTR grant 0128379. The U.S. Government may have certain rights in the claimed inventions.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60368450 |
Mar 2002 |
US |