Claims
- 1. A method for electronic document or word searching, comprising the steps of:
given an input, expanding the input as a function of at least one of (i) acoustic similarity and/or (ii) frequency of word sequence occurrence, said expanding resulting in alternative input words or phrases; and returning the alternative input words or phrases for further processing.
- 2. A method as claimed in claim 1, wherein the electronic documents are transcriptions of spoken input.
- 3. A method as claimed in claim 1, wherein the word searching is a spell checker.
- 4. A method as claimed in claim 1, wherein the input is a query.
- 5. A method as claimed in claim 1, wherein the step of expanding includes:
translating words in the given input to one or more phoneme strings; determining word boundaries in each of the phoneme strings to produce respective phoneme subsequences; and for each produced phoneme subsequence, generating at least one confusable word phrase having a pronunciation which is acoustically similar to the phoneme subsequence, said confusable word phrase forming an alternative input word or phrase.
- 6. A method as claimed in claim 5, wherein the step of translating words in the given input to one or more phoneme strings further includes: converting the words to phonemes by at least (i) looking up the words in a dictionary or (ii) using letter to phoneme rules to automatically generate a pronunciation.
- 7. A method as claimed in claim 6, wherein the step of translating words in the given input to one or more phoneme strings further includes:
generating one or more phoneme strings of the converted word phonemes based on a confusion matrix; and scoring the one or more phoneme strings based on the confusion matrix.
- 8. A method as claimed in claim 7, wherein the confusion matrix stores indications of the likelihood of confusion between pairs of phonemes, the likelihood of deleting each phoneme in a pronunciation of a word, and the likelihood of inserting each phoneme in a pronunciation of a word.
- 9. A method as claimed in claim 5, wherein the step of determining word boundaries in each of the phoneme strings to produce respective phoneme subsequences uses an example-based word boundary.
- 10. A method as claimed in claim 9, wherein the example-based word boundary includes:
breaking up each of the phoneme strings into a list of segmented phoneme strings; looking up each segmented phoneme string in a dictionary; and producing a list of segmented phoneme strings found in the dictionary.
- 11. A method as claimed in claim 5, wherein the step of determining word boundaries in each of the phoneme strings to produce respective phoneme subsequences uses a syllable-based word boundary.
- 12. A method as claimed in claim 11, wherein the syllable-based word boundary includes:
breaking up each of the phoneme strings into a list of segmented phoneme strings; determining if each segmented phoneme string contains at least a vowel; and producing a list of segmented phoneme strings containing at least one vowel.
- 13. A method as claimed in claim 5, wherein the step of generating at least one confusable word phrase includes:
comparing each phoneme subsequence to word pronunciations from a dictionary, said comparing resulting in a list of words from the dictionary; and ordering the list of words from the dictionary, the most likely word being at the top of the list.
- 14. A method as claimed in claim 13, wherein said ordering of the list includes scoring the word pronunciation of each word in the dictionary against the phoneme subsequence using a distance metric, the distance metric defined as follows:
- 15. A method as claimed in claim 14, wherein scoring the word pronunciation of each word in the dictionary against the phoneme subsequence using a distance metric further includes, scoring the list of words using a language model.
- 16. A method as claimed in claim 1, wherein the step of further processing includes using the alternative input words or phrases resulting from said step of expanding, searching an index of electronic documents for the alternative input words or phrases in response to the given input.
- 17. A method as claimed in claim 16, wherein the step of searching the index includes:
searching for an exact match of each alternative input word or phrase; searching for a “proximate” match of each alternative input phrase; augmenting an indexer's score with a score associated with each alternative input word or phrase, including a probability of the alternative input word or phrase and/or probability of a proximate match; and re-ranking the electronic documents according to the augmented score.
- 18. A method as claimed in claim 1, wherein the step of returning to a user the alternative input words or phrases further includes the user choosing correct alternative input words or phrases.
- 19. A computer system for electronic document or word searching, comprising:
an expansion module for expanding a input as a function of at least one of (i) acoustic similarity and/or (ii) frequency of word sequence occurrence, said expanding resulting in alternative input words or phrases and returning the alternative input words or phrases for further processing.
- 20. A computer system as claimed in claim 19, wherein the electronic documents are transcriptions of spoken input.
- 21. A computer system as claimed in claim 19, wherein word searching is a spell checking module.
- 22. A computer system as claimed in claim 19, wherein the input is a query.
- 23. A computer system as claimed in claim 19, wherein the expansion module comprises:
a translation module for translating words in the given input to one or more phoneme strings; a determination module for determining word boundaries in each of the phoneme strings to produce respective phoneme subsequences; and a production module for each produced phoneme subsequence, generating at least one confusable word phrase having a pronunciation which is acoustically similar to the phoneme subsequence, said confusable word phrase forming an alternative input word or phrase.
- 24. A computer system as claimed in claim 23, wherein the translation module comprises: a conversion module for converting the words to phonemes by at least (i) looking up the words in a dictionary or (ii) using letter to phoneme rules to automatically generate a pronunciation.
- 25. A computer system as claimed in claim 24, wherein the translation module further comprises:
a generation module for generating one or more phoneme strings of the converted word phonemes based on a confusion matrix; and a scoring module for scoring the one or more phoneme strings based on the confusion matrix.
- 26. A computer system as claimed in claim 25, wherein the confusion matrix stores indications of the likelihood of confusion between pairs of phonemes, the likelihood of deleting each phoneme in a pronunciation of a word, and the likelihood of inserting each phoneme in a pronunciation of a word.
- 27. A computer system as claimed in claim 23, wherein the determination module further comprises an example-based word boundary module for determining word boundaries in each of the phoneme strings to produce respective phoneme subsequences.
- 28. A computer system as claimed in claim 27, wherein the example-based word boundary module comprises:
a decipher module for breaking up each of the phoneme strings into a list of segmented phoneme strings; a lookup module for looking up each segmented phoneme string in a dictionary; and an assembler module for producing a list of segmented phoneme strings found in the dictionary.
- 29. A computer system as claimed in claim 23, wherein the determination module further comprises a syllable-based word boundary module for determining word boundaries in each of the phoneme strings to produce respective phoneme subsequences.
- 30. A computer system as claimed in claim 29, wherein the syllable-based word boundary module comprises:
a decipher module for breaking up each of the phoneme strings into a list of segmented phoneme strings; a determination module for determining if each segmented phoneme string contains at least a vowel; and a assembler module for producing a list of segmented phoneme strings containing at least one vowel.
- 31. A computer system as claimed in claim 23, wherein the production module comprises:
a comparison module for comparing each phoneme subsequence to word pronunciations from a dictionary, said comparing resulting in a list of words from the dictionary; and an ordering module for ordering the list of words from the dictionary, the most likely word being at the top of the list.
- 32. A computer system as claimed in claim 31, wherein the ordering module scores the word pronunciation of each word in the dictionary against the phoneme subsequence using a distance metric, the distance metric defined as follows:
- 33. A computer system as claimed in claim 32, wherein the ordering module further scores the list using a language model.
- 34. A computer system as claimed in claim 19, wherein the step of further processing includes a search module for using the alternative input words or phrases resulting from said step of expanding, searching an index of electronic documents for the alternative input words or phrases in response to the given input.
- 35. A computer system as claimed in claim 34, wherein the search module comprises:
a matching module for searching for an exact match of each alternative input word or phrase; a proximity module for searching for a “proximate” match of each alternative input phrase; an augmentation module for augmenting an indexer's score with a score associated with each alternative input word or phrase including a probability of the alternative input word or phrase and/or probability of a proximate match; and a re-ranking module for re-ranking the electronic documents according to the augmented score.
- 36. A computer system comprising:
means for expanding an input, said expanding resulting in alternative input words or phrases; and means for searching an index of electronic documents for the alternative input words or phrases in response to the given input.
RELATED APPLICATION
[0001] This application is related to U.S. application Ser. No. 10/017,797, filed Dec. 12, 2001. The entire teachings of the above application are is incorporated herein by reference.