Claims
- 1-16. (cancelled).
- 17. A method of determining a probability distribution present in prespecified data, comprising:
initially calculating association probabilities for all classes that have an association probability less than or equal to a specifiable value, the initial calculation of association probabilities being performed using an iterative procedure; and subsequently using the iterative procedure to calculate association probabilities for classes only if the resulting association probabilities are below a selectable value.
- 18. The method in accordance with claim 17, wherein the specifiable value is zero.
- 19. The method in accordance with claim 17, wherein the prespecified data forms clusters.
- 20. The method in accordance with claim 17, wherein the iterative procedure includes an expectation maximization algorithm.
- 21. The method in accordance with claim 20, wherein association probabilities are calculated by calculating a product of probability factors.
- 22. The method in accordance with claim 21, further comprising ceasing calculation of the product of probability factors when one of the probability factors shows a valve approaching zero.
- 23. The method in accordance with 20, wherein the calculation of the product of probability factors is performed so that a probability factor associated with a variable which seldom occurs is processed before a probability factor associated with a variable which often occurs.
- 24. The method in accordance with claim 23, wherein
an ordered list is used in the calculation of the product of probability factors, the ordered list contains probability factors and products, probability factors associated with a variable which seldom occurs are stored before the beginning of the products in the ordered list, the probability factors being arranged in the ordered list in accordance with the frequency of their occurrence.
- 25. The method in accordance with claim 17, wherein a logarithmic representation of probability tables is used in calculating association probabilities.
- 26. The method in accordance with claim 17, wherein the representation of the probability tables only employs a list only containing elements that differ from zero.
- 27. The method in accordance with claim 17, wherein sufficient statistics are calculated.
- 28. The method in accordance with claim 27, wherein
the prespecified data forms clusters, and for the calculation of sufficient statistics, only those clusters are taken into account which have a weight other than zero.
- 29. The method in accordance with claim 17, wherein
the prespecified data forms clusters, and the clusters which have a weight other than zero are stored in a list.
- 30. The method in accordance with claim 17, wherein
the association probabilities are calculated in an expectation maximization learning process, the prespecified data has data points that form clusters, when a cluster is given an a posteriori weight of zero for a data point, the cluster is given a weight of zero in all further steps for the data point, when a cluster is given an a posteriori weight of zero, the cluster is not considered in subsequent expectation maximization process steps.
- 31. The method in accordance with claim 29, wherein,
wherein the prespecified data has data points that form clusters, and for each data point, a list of all references to clusters which have a weight other than zero is stored.
- 32. The method in accordance with claim 26, wherein the iterative process is performed only for clusters which have a weight other than zero.
- 33. The method in accordance with claim 18, wherein the prespecified data forms clusters.
- 34. The method in accordance with claim 33, wherein the iterative procedure includes an expectation maximization algorithm.
- 35. The method in accordance with claim 34, wherein association probabilities are calculated by calculating a product of probability factors.
- 36. The method in accordance with claim 35, further comprising ceasing calculation of the product of probability factors when one of the probability factors shows a valve approaching zero.
- 37. The method in accordance with 35, wherein the calculation of the product of probability factors is performed so that a probability factor associated with a variable which seldom occurs is processed before a probability factor associated with a variable which often occurs.
- 38. The method in accordance with claim 37, wherein
an ordered list is used in the calculation of the product of probability factors, the ordered list contains probability factors and products, probability factors associated with a variable which seldom occurs are stored before the beginning of the products in the ordered list, the probability factors being arranged in the ordered list in accordance with the frequency of their occurrence.
- 39. The method in accordance with claim 38, wherein a logarithmic representation of probability tables is used in calculating association probabilities.
- 40. The method in accordance with claim 39, wherein the representation of the probability tables only employs a list only containing elements that differ from zero.
- 41. The method in accordance with claim 40, wherein sufficient statistics are calculated.
- 42. The method in accordance with claim 41, wherein
the prespecified data forms clusters, and for the calculation of sufficient statistics, only those clusters are taken into account which have a weight other than zero.
- 43. The method in accordance with claim 38, wherein
the prespecified data forms clusters, and the clusters which have a weight other than zero are stored in a list.
- 44. The method in accordance with claim 39, wherein
the association probabilities are calculated in an expectation maximization learning process, the prespecified data has data points that form clusters, when a cluster is given an a posteriori weight of zero for a data point, the cluster is given a weight of zero in all further steps for the data point, when a cluster is given an a posteriori weight of zero, the cluster is not considered in subsequent expectation maximization process steps.
- 45. The method in accordance with claim 43, wherein,
wherein the prespecified data has data points that form clusters, and for each data point, a list of all references to clusters which have a weight other than zero is stored.
- 46. The method in accordance with claim 41, wherein the iterative process is performed only for clusters which have a weight other than zero.
- 47. A system to determine a probability distribution present in prespecified data, comprising:
a first calculation unit to calculate association probabilities for all classes that have an association probability less than or equal to a specifiable value, the initial calculation of association probabilities being performed using an iterative procedure; and a second calculation unit to subsequently use the iterative procedure to calculate association probabilities for classes only if the resulting association probabilities are below a selectable value.
- 48. The system in accordance with claim 47, wherein the specifiable value is zero.
- 49. The system in accordance with claim 47, wherein the prespecified data forms clusters.
- 50. The system in accordance with claim 47, wherein the iterative procedure includes an expectation maximization algorithm.
- 51. The system in accordance with claim 50, wherein association probabilities are calculated by calculating a product of probability factors.
- 52. The system in accordance with claim 51, further comprising ceasing calculation of the product of probability factors when one of the probability factors shows a valve approaching zero.
- 53. The system in accordance with 50, wherein the calculation of the product of probability factors is performed so that a probability factor associated with a variable which seldom occurs is processed before a probability factor associated with a variable which often occurs.
- 54. The system in accordance with claim 53, wherein
an ordered list is used in the calculation of the product of probability factors, the ordered list contains probability factors and products, probability factors associated with a variable which seldom occurs are stored before the beginning of the products in the ordered list, the probability factors being arranged in the ordered list in accordance with the frequency of their occurrence.
Priority Claims (1)
Number |
Date |
Country |
Kind |
102-33-609.1 |
Jul 2002 |
DE |
|
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application is based on and hereby claims priority to PCT Application No. PCT/DE03/02484 filed on Jul. 23, 2003 and German Application No. 10233609.1 filed on Jul. 24, 2002, the contents of which are hereby incorporated by reference.
PCT Information
Filing Document |
Filing Date |
Country |
Kind |
PCT/DE03/02484 |
7/23/2003 |
WO |
|