David T. Brown, “A Note on Approximations to Discrete Probability Distributions”, Information and Control, vol. 2, pp. 386-392 (1959). |
Brown et al., “The Mathematics of Statistical Machine Translation: Parameter Estimation”, Computational Linguistics, vol. 19, No. 2, pp. 263-311. |
I. Csiszar, “I-Divergence Geometry of Probability Distributions and Minimization Problems”, The Annals of Probability, 1975, vol. 3, No. 1, pp. 146-158. |
J.N. Darroch et al., “Generalized Iterative Scaling for Log-Linear Models”, The Annals of Mathematical Statistics, 1972, vol. 43, No. 5, pp. 1470-1480. |
I. Csiszar, “A Geometric Interpretation of Darroch and Ratcliff's Generalized Iterative Scaling”, The Annals of Statistics, 1989, vol. 17, No. 3, pp. 1409-1412. |
Eurospeech 89, European Conference on Speech Communication and Technology, “A Massively Parallel Model of Speech-To-Speech Dialog Translation: A Step Toward Interpreting Telephony” by H. Kitano et al., Center for Machine Translation, Carnegie Mellon University, Pennsylvania, vol. 1, pp. 198-201. Editors: J.P. Tubach, et al. (Sep. 1989). |
Research Report, A Maximum Entropy Approach to Natural Language Processing, Adam L. Berger, et al., IBM Research Division, Yorktown Heights, NY, Aug. 5, 1994. |