Claims
- 1. A method for estimating a rank order for records with respect to a selected characteristic of subjects of said records, said records including attributes of said subjects, and said attributes including a categorical attribute said method comprising the steps of:
a) estimating a statistic relating values of said categorical attribute to said characteristic; b) for one of said records, applying a feature vector having elements derived from said attributes in said one record, said elements including a value of said statistic corresponding to a value of said categorical attribute in said one record, to an artificial intelligence algorithm to obtain an output value; c) repeating step b for each remaining one of said records; and d) rank ordering said records in accordance with said output values.
- 2. A method as described in claim 1 wherein said subjects are customers of a provider of goods or services and said characteristic is bad debt risk.
- 3. A method as described in claim 2 wherein said provider is a provider of long distance services, and said attributes include attributes derived from current traffic data.
- 4. A method as described in claim 3 wherein said attributes include attributes derived from customer data.
- 5. A method as described in claim 3 wherein said statistic is derived from a statistic data set comprising a time line of archived records for said subjects and corresponding data related to said characteristic for said subject.
- 6. A method as described in claim 5 wherein said corresponding data is data relating to customer service deactivations.
- 7. A method as described in claim 3 wherein said elements of said feature vector are normalized.
- 8. A method as described in claim 7 wherein said elements are normalized with respect to a training data set.
- 9. A method as described in claim 1 wherein said elements of said feature vector are normalized.
- 10. A method as described in claim 1 wherein said artificial intelligence algorithm is a trained neural network.
- 11. A method as described in claim 1 wherein said feature vector includes derived fields calculated from said attributes.
- 12. A method as described in claim 1 further comprising the steps of:
a) estimating a probability distribution, Prob (AI), for said characteristic as a function of said output value; b) estimating a probability distribution, Prob (balance), for said characteristic as a function of a variable of interest; c) determining a maximum value for Prob (AI), MaxProb (AI) and a maximum value for Prob(balance), MaxProb(balance); d) determining a prioritization value, priority, for each of said records as: Priority=[Prob(AI)+Prob(balance)]/[MaxProb(AI)+MaxProb(balance)]; and e) reviewing said records in order of said prioritization value.
- 13. A method as described in claim 12 where said characteristic is bad debt risk for said record and said variable of interest is a balance due.
- 14. A method for estimating a rank order for records with respect to a selected characteristic of subjects of said records, said records including attributes of said subjects, and said attributes including a categorical attribute said method comprising:
a) a step for estimating a statistic having values relating values of said categorical attribute to said characteristic; b) a step for, for one of said records, applying a feature vector having elements derived from said attributes in said one record, said elements including a value of said statistic corresponding to a value of said categorical attribute in said one record, to an artificial intelligence algorithm to obtain an output value; c) a plurality of steps for repeating step b for each remaining one of said records; and d) a step for rank ordering said records in accordance with said output values.
- 15. A method as described in claim 14 wherein said artificial intelligence algorithm is a trained neural network.
- 16. A method as described in claim 12 further comprising:
a) a step for estimating a probability distribution, Prob (AI) , for said characteristic as a function of said output value; b) a step for estimating a probability distribution, Prob (balance), for said characteristic as a function of a variable of interest; c) a step for determining a maximum value for Prob (AI), MaxProb (AI) and a maximum value for Prob(balance), MaxProb(balance); d) a step for determining a prioritization value, priority, for each of said records as: Priority [Prob(AI)+Prob(balance)]/[MaxProb(AI)+MaxProb(balance)]; and e) a step for reviewing said records in order of said prioritization value.
- 17. A method for developing a neural network whose outputs can be used to rank order records with respect to a selected characteristic of subjects of said records, said records including attributes of said subjects, said method comprising:
a) selecting a group of said records as training records, values of said characteristic being known for subjects of said training records; b) selecting a current topology and learning algorithm to configure said network; c) applying attributes from said training records and said known characteristic values for said subjects of said training records to said configured network to train said configured network to generate current weights; d) selecting a group of said records as evaluation records, values of said characteristic being known for subjects of said evaluation records; e) applying attributes from said evaluation records to said trained configured network to generate said outputs for said evaluation records; f) ordering said evaluation records in rank order in accordance with said outputs for said evaluation records; h) evaluating said rank order of said evaluation records in accordance with predetermined criteria; and j) modifying said current topology or learning algorithm or both to configure said network; and k) repeating steps c) through j) a plurality of times to generate a plurality of neural networks; and l) selecting one of said plurality of neural networks which best meets said criteria.
- 18. A method for estimating the relative likelihood that, or extent to which, subjects will have a characteristic, said subjects having a plurality of attributes, said attributes including a categorical attribute, said method comprising the steps of:
a) estimating a statistic relating values of said categorical attribute to said characteristic; b) for each of said subjects, processing said attributes to generate an input vector descriptive of said each subject, said processing including substituting a value of said statistic for corresponding values of said categorical attribute; c) for each of said subjects, generating an output value as a function of said input vector; and d) using said output values as a measure of said relative likelihood or extent.
- 19. A method as described in claim 18 wherein said statistic is an estimate of the probability of said characteristic as a function of values of said categorical attribute, said estimate being determined from a statistics file of records where occurrence of said characteristic is known and having values as a function of said categorical attribute of:
a) StatProb=the number of records in said statistics file having a particular value of said categorical attribute and where said characteristic occurs, divided by the total number of records having said particular value of said categorical attribute, Nstat, if said Nstat is greater than Nmax; b) DefaultProb=the number of records in said statistics file where said characteristic occurs, divided by Nstat, if Nstat is less than Nmin; and c) WeightedProb=DefaultProb*(Nmax−Nstat)/(Nmax−Nmin)+StatProb*(Nstat−Nmin)/(Nmax−Nmin), if Nstat is between Nmax and Nmin; where d) Nmax defines a statistically significant sample size and Nmin defines a statistically sparse sample size.
- 20. A method as described in claim 18 where said attributes are selected on the basis of a level of significance as determined by a relevance analysis.
- 21. A method as described in claim 20 where correlation coefficients between said attributes are determined and less significant ones of highly correlated attributes are eliminated from said input vector.
- 22. A method as described in claim 18 wherein said function is defined by a trained neural network and an associated set of weights.
- 23. A method as described in claim 18 wherein said feature vector includes derived fields calculated from said attributes.
- 24. A system for estimating a rank order for records with respect to a selected characteristic of subjects of said records, said records including attributes of said subjects, and said attributes including a categorical attribute, said system comprising:
a) means for estimating a statistic relating values of said categorical attribute to said characteristic; b) means for, successively applying, for each of said records, a feature vector having elements derived from said attributes in said each record, said elements including a value of said statistic corresponding to a value of said categorical attribute in said each record, to an artificial intelligence algorithm to obtain output values; and c) rank ordering said records in accordance with said output values.
- 25. A system as described in claim 24 wherein said subjects are customers of a provider of goods or services and said characteristic is bad debt risk.
- 26. A system as described in claim 25 wherein said provider is a provider of long distance services, said system further comprising means for providing said attributes, said attributes including attributes derived from current traffic data.
- 27. A system as described in claim 26 wherein said attributes further include attributes derived from customer data.
- 28. A system as described in claim 26 wherein said statistic is derived from a statistic data set comprising a time line of archived records for said subjects and corresponding data related to said characteristic for said subject, said system further comprising means for providing said statistic data set.
- 29. A system as described in claim 28 wherein said corresponding data is data relating to customer service deactivations.
- 30. A system as described in claim, 26 further comprising means for normalizing said elements of said feature vector.
- 31. A system as described in claim 30 wherein said elements are normalized with respect to said training data set.
- 32. A system as described in claim 24 further comprising means for normalizing said elements of said feature vector.
- 33. A system as described in claim 24 where said artificial intelligence algorithm is a trained neural network.
- 34. A system as described in claim 24 where said feature vector includes derived fields calculated from said attributes.
- 35. A computer-readable medium carrying one or more sequences of one or more instructions for controlling a data processing system to estimate a rank order for records with respect to a selected characteristic of subjects of said records, said records including attributes of said subjects, and said attributes including a categorical attribute, the one or more sequences of one or more instructions including instructions which, when executed by one or more processors, cause the one or more processors to control said system to perform the steps of:
a) estimating a statistic relating values of said categorical attribute to said characteristic; b) for one of said records, applying a feature vector having elements derived from said attributes in said one record, said elements including a value of said statistic corresponding to a value of said categorical attribute in said one record, to a artificial intelligence algorithm to obtain an output value; c) repeating step b for each remaining one of said records; and d) rank ordering said records in accordance with said output values.
- 36. A computer-readable medium as described in claim 35 wherein said subjects are customers of a provider of goods or services and said instructions cause the one or more processors to control said system to perform said estimating step to estimate said statistic to relate values of said categorical attribute to said subject's potential to be a bad debt risk.
- 37. A computer-readable medium as described in claim 36 wherein said provider is a provider of long distance services, and said instructions cause the one or more processors to control said system to perform said applying step to apply attributes including attributes derived from current traffic data.
- 38. A computer-readable medium as described in claim 37 wherein said instructions cause the one or more processors to control said system to perform said applying step to apply attributes including attributes from customer data.
- 39. A computer-readable medium as described in claim 37 wherein said instructions cause the one or more processors to control said system to perform said estimating step to estimate said statistic from a statistic data set comprising a time line of archived records for said subjects and corresponding data related to said characteristic for said subject.
- 40. A computer-readable medium as described in claim 39 wherein said instructions cause the one or more processors to control said system to perform said estimating step to estimate said statistic from a statistic data set comprising a time line of archived records for said subjects and data relating to customer service deactivations.
- 41. A computer-readable medium as described in claim 37 wherein said instructions cause the one or more processors to control said system to perform the further step of normalizing said elements prior to said estimating step.
- 42. A computer-readable medium as described in claim 41 wherein said instructions cause the one or more processors to control said system to normalize said elements with respect to said training data set.
- 43. A computer-readable medium as described in claim 35 wherein said instructions cause the one or more processors to control said system to perform the further step of normalizing said elements prior to said estimating step.
- 44. A computer-readable medium as described in claim 35 wherein said artificial intelligence algorithm is a trained neural network.
- 45. A computer-readable medium as described in claim 35 wherein said feature vector includes derived fields calculated from said attributes.
RELATED APPLICATIONS
[0001] This application is related to, and claims the benefit of the earlier filing date under 35 U.S.C. § 119(e) of, U.S. Provisional Patent Application (Serial No. 60/266,864: Attorney Docket COS-01-002), filed Feb. 7, 2001, entitled “AN ARTIFICIAL INTELLIGENCE TRENDING SYSTEM,” the entirety of which is incorporated herein by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60266864 |
Feb 2001 |
US |