Claims
- 1. A method of creating a named entity language model, comprising:
recognizing input communications from a training corpus; parsing the training corpus; tagging the parsed training corpus; aligning the recognized training corpus with the tagged training corpus; and creating a named entity language model from the aligned corpus.
- 2. The method of claim 1, further comprising:
transcribing the training corpus.
- 3. The method of claim 2, wherein the transcribing step is performed automatically.
- 4. The method of claim 1, wherein the training corpus includes at least one of untranscribed and transcribed speech.
- 5. The method of claim 1, wherein the training corpus includes communications from one or more languages.
- 6. The method of claim 1, further comprising:
labeling the training corpus.
- 7. The method of claim 1, further comprising:
storing the named entity language model in a database.
- 8. The method of claim 1, wherein the training corpus includes at least one of verbal and non-verbal speech.
- 9. The method of claim 8, wherein the non-verbal speech includes the use of at least one of gestures, body movements, head movements, non-responses, text, keyboard entries, keypad entries, mouse clicks, DTMF codes, pointers, stylus, cable set-top box entries, graphical user interface entries and touchscreen entries.
- 10. The method of claim 1, wherein the training corpus includes multimodal speech.
- 11. The method of claim 1, wherein the tagging step tags the training corpus with named entity tags.
- 12. The method of claim 1, wherein the named entity tags are at least one of context-dependent named entity tags and context-independent named entity tags.
- 13. The method of claim 1, wherein named entities are represented by at least one of a tag, a context and a value.
- 14. The method of claim 1, wherein recognizing step recognizes a lattice from the training corpus.
- 15. A system that creates a named entity language model, comprising:
a recognizer that recognizes input communications from a training corpus; a parser that parses the training corpus; a tagger that tags the parsed training corpus; an aligner that aligns the recognized training corpus with the tagged training corpus, and creates a named entity language model from the aligned corpus.
- 16. The system of claim 15, further comprising:
a transcriber that transcribes the training corpus.
- 17. The system of claim 16, wherein the transcriber transcribes automatically.
- 18. The system of claim 15, wherein the training corpus includes at least one of untranscribed and transcribed speech.
- 19. The system of claim 15, wherein the training corpus includes communications from one or more languages.
- 20. The system of claim 15, further comprising:
a labeler that labels the training corpus.
- 21. The system of claim 15, further comprising:
storing the named entity language model in a database.
- 22. The system of claim 15, wherein the training corpus includes at least one of verbal and non-verbal speech.
- 23. The system of claim 22, wherein the non-verbal speech includes the use of at least one of gestures, body movements, head movements, non-responses, text, keyboard entries, keypad entries, mouse clicks, DTMF codes, pointers, stylus, cable set-top box entries, graphical user interface entries and touchscreen entries.
- 24. The system of claim 15, wherein the training corpus includes multimodal speech.
- 25. The system of claim 15, wherein the tagging step tags the training corpus with named entity tags.
- 26. The system of claim 15, wherein the named entity tags are at least one of context-dependent named entity tags and context-independent named entity tags.
- 27. The system of claim 15, wherein named entities are represented by at least one of a tag, a context and a value.
- 28. The system of claim 15, wherein the recognizer recognizes a lattice.
- 29. A method of creating a named entity language model, comprising:
recognizing input communications from a training corpus; parsing the training corpus; tagging the parsed training corpus; aligning the recognized training corpus with the tagged training corpus; creating a named entity language model from the aligned corpus; and detecting named entities in input communications using the named entity language model.
Parent Case Info
[0001] This non-provisional application claims the benefit of U.S. Provisional Patent Application No. 60/307,624, filed Apr. 5, 2002, and U.S. Provisional Patent Application No. 60/443,642, filed Jan. 29, 2003, which are both incorporated herein by reference in their entireties. This application is also a continuation-in-part of 1) U.S. Patent Application No. 10/158,082 which claims priority from U.S. Provisional Patent Application No. 60/322,447, filed Sep. 17, 2001, 2) U.S. Patent Application No. 09/690,721 and 3) No. 09/690,903 both filed Oct. 18, 2000, which claim priority from U.S. Provisional Application No. 60/163,838, filed Nov. 5, 1999. U.S. patent application Ser. Nos. 09/690,721, 09/690,903, 10/158,082 and U.S. Provisional Application Nos. 60/163,838 and 60/322,447 are incorporated herein by reference in their entireties.
Provisional Applications (3)
|
Number |
Date |
Country |
|
60443642 |
Jan 2003 |
US |
|
60322447 |
Sep 2001 |
US |
|
60163838 |
Nov 1999 |
US |
Continuation in Parts (3)
|
Number |
Date |
Country |
Parent |
10158082 |
May 2002 |
US |
Child |
10402976 |
Apr 2003 |
US |
Parent |
09690721 |
Oct 2000 |
US |
Child |
10402976 |
Apr 2003 |
US |
Parent |
09690903 |
Oct 2000 |
US |
Child |
10402976 |
Apr 2003 |
US |