Claims
- 1. A system for measuring a degree of independence of semantic classes in separate domains, comprising:
a cross-domain distance calculator that estimates a similarity between n-gram contexts for said semantic classes in each of said separate domains to determine domain-dependent relative entropies associated with said semantic classes; and
a distance summer, associated with said cross-domain distance calculator, that adds said domain-dependent distances over a domain vocabulary to yield said degree of independence of said semantic classes.
- 2. The system as recited in claim 1 wherein said cross-domain distance calculator estimates said similarity between said n-gram contexts for each of said semantic classes in a lexical environment of an associated domain.
- 3. The system as recited in claim 1 wherein said cross-domain distance calculator estimates said similarity between said n-gram contexts for one of said semantic classes in a lexical environment of a domain other than an associated domain.
- 4. The system as recited in claim 1 wherein said cross-domain distance calculator employs a Kullback-Leibler distance to determine said domain-dependent relative entropies.
- 5. The system as recited in claim 1 wherein said n-gram contexts are generated manually or automatically.
- 6. The system as recited in claim 1 wherein each of said separate domains contains multiple semantic classes, said cross-domain distance calculator and said distance summer operating with respect to each permutation of said semantic classes.
- 7. The system as recited in claim 1 wherein said distance summer adds left and right context-dependent distances to yield said degree of independence.
- 8. A method of measuring a degree of independence of semantic classes in separate domains, comprising:
estimating a similarity between n-gram contexts for said semantic classes in each of said separate domains to determine domain-dependent relative entropies associated with said semantic classes; and
adding said domain-dependent distances over a domain vocabulary to yield said degree of independence of said semantic classes.
- 9. The method as recited in claim 8 wherein said estimating comprises estimating said similarity between said n-gram contexts for each of said semantic classes in a lexical environment of an associated domain.
- 10. The method as recited in claim 8 wherein said estimating comprises estimating said similarity between said n-gram contexts for one of said semantic classes in a lexical environment of a domain other than an associated domain.
- 11. The method as recited in claim 8 wherein said estimating comprises employing a Kullback-Leibler distance to determine said domain-dependent relative entropies.
- 12. The method as recited in claim 8 wherein said n-gram contexts are generated manually or automatically.
- 13. The method as recited in claim 8 wherein each of said separate domains contains multiple semantic classes, said estimating and said adding carried out with respect to each permutation of said semantic classes.
- 14. The method as recited in claim 8 wherein said adding comprises adding left and right context-dependent distances to yield said degree of independence.
- 15. A method of porting a semantic class from a first domain into a second domain, comprising:
measuring a degree of independence of said semantic class, said measuring including:
estimating a similarity between n-gram contexts for said semantic class in said first domain and said second domain to determine a domain-dependent relative entropy associated with said semantic class, and
adding said domain-dependent distances over a domain vocabulary to yield said degree of independence of said semantic classes; and
employing said degree of independence to determine whether said semantic class is properly portable into said second domain.
- 16. The method as recited in claim 15 wherein said estimating comprises estimating said similarity between said n-gram contexts for said semantic class in a lexical environment of said first domain.
- 17. The method as recited in claim 15 wherein said estimating comprises estimating said similarity between said n-gram contexts for said semantic class in a lexical environment of said second domain.
- 18. The method as recited in claim 15 wherein said estimating comprises employing a Kullback-Leibler distance to determine said domain-dependent relative entropies.
- 19. The method as recited in claim 15 wherein said n-gram contexts are generated manually or automatically.
- 20. The method as recited in claim 15 wherein said first and second domains each contain multiple semantic classes, said estimating and said adding carried out with respect to each permutation of said semantic class.
- 21. The method as recited in claim 15 wherein said adding comprises adding left and right context-dependent distances to yield said degree of independence.
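For illustration only, the measurement recited in the claims above might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the function names (`context_distribution`, `kl`, `independence`), the use of bigram (immediately adjacent word) contexts, the additive smoothing constant `eps`, and the symmetrized form of the Kullback-Leibler distance are all hypothetical choices made for the example. The claims do not fix any of them.

```python
import math
from collections import Counter

def context_distribution(sentences, class_members, vocab, side, eps=1e-6):
    """Smoothed distribution, over vocab, of the word immediately to the
    left or right of any member of a semantic class (bigram context).
    The additive smoothing with eps is an assumption of this sketch."""
    counts = Counter()
    for tokens in sentences:
        for i, tok in enumerate(tokens):
            if tok not in class_members:
                continue
            j = i - 1 if side == "left" else i + 1
            if 0 <= j < len(tokens) and tokens[j] in vocab:
                counts[tokens[j]] += 1
    total = sum(counts.values()) + eps * len(vocab)
    return {v: (counts[v] + eps) / total for v in vocab}

def kl(p, q):
    """Relative entropy D(p || q), summed over the shared vocabulary."""
    return sum(p[v] * math.log(p[v] / q[v]) for v in p)

def independence(corpus_a, corpus_b, class_a, class_b, vocab):
    """Sum of left- and right-context distances between two classes,
    each drawn from its own domain corpus. Symmetrizing the distance
    (D(p||q) + D(q||p)) is an assumption of this sketch."""
    total = 0.0
    for side in ("left", "right"):
        p = context_distribution(corpus_a, class_a, vocab, side)
        q = context_distribution(corpus_b, class_b, vocab, side)
        total += kl(p, q) + kl(q, p)
    return total
```

Under these assumptions, a small value suggests that the class appears in similar lexical contexts in both domains and is a candidate for porting, while a large value suggests domain dependence. Any threshold separating the two cases would have to be chosen empirically.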
CROSS-REFERENCE TO RELATED APPLICATION
[0001] The present application is related to U.S. patent application Ser. No. ______, [ATTORNEY DOCKET NO. AMMICHT 6-1-3], entitled “System and Method for Representing and Resolving Ambiguity in Spoken Dialogue Systems,” commonly assigned with the present application and filed concurrently herewith.