D. Roy and C. Malamud, Speaker identification based text to audio alignment for an audio visual retrieval system, Proc. ICASSP 97, IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Munich, 1099-1102, 1997. |
M-H. Siu, G. Yu, and H. Gish, An unsupervised, sequential learning algorithm for the segmentation of speech waveforms with multiple speakers, Proc. ICASSP 92, IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, San Francisco, vol. II, 189-192. |
Speech segmentation and clustering based on speaker features, Proc. ICASSP 93, IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Minneapolis, 395-398, 1993. |
C. Montacie and Marie-Jose Caraty, Sound Channel Video Indexing, ESCA, Eurospeech97, Rhodes, Greece ISSN 1018-4074, pp. 2359-2362. |
L. Wilcox, F. Chen, D. Kimber, and V. Balasubramanian, Segmentation of speech using speaker identification, Proc. ICASSP 94, IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Adelaide, 161-164, 1994. |
D.A. Reynolds & R.C. Rose, “Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models,” IEEE Trans. on Speech and Audio Processing, vol. 3, 1995, pp. 72-83. |
Foote et al. (“Finding Presentations in Recorded Meetings using Audio and Video Features,” 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar. 1999). |