Algorithm for explaining credit scores

Information

  • Patent Grant
  • 8001041
  • Patent Number
    8,001,041
  • Date Filed
    Friday, July 27, 2007
    18 years ago
  • Date Issued
    Tuesday, August 16, 2011
    14 years ago
Abstract
An exemplary Web-based score explanation service typically requires only the credit bureau identifier, credit score, and up to four reason codes as input. The invention herein discloses an algorithm that is used to provide an explanation of the primary factors influencing the score, where a rich data feed is provided to the facility implementing the algorithm
Description
BACKGROUND OF THE INVENTION

1. Technical Field


The invention relates to credit scoring. More particularly, the invention relates to an algorithm for explaining credit scores.


2. Description of the Prior Art


Recent events have made it desirable for developers of credit scoring algorithms, such as Fair, Isaac and Company, Inc. of San Rafael, Calif. (FICO) to move toward offering a service to deliver credit bureau risk scores and explanations directly to consumers and lenders. Consumer advocacy groups and credit counseling organizations have provided positive feedback on these announced intentions. Additionally, credit scoring developers clients, i.e. the credit grantors themselves, have expressed their understanding of the need to pursue this undertaking. Most organizations are comfortable that each credit scoring developer, such as Fair, Isaac, is the only entity in the market that can actively take on the role of credit score delivery and explanation.


A comprehensive score delivery and explanation service should include all of the following pieces:

  • 1. Credit scores delivered to consumers.
  • 2. The primary reason codes that describe why the score was not higher.
  • 3. The consumer's credit bureau report from which the score was calculated to allow them to cross-reference the information with his/her actual credit report.
  • 4. A personalized score explanation that describes to that consumer, in plain language, how their individual score was derived. This explanation service can be further enhanced using data elements present in the consumer's credit report.


Given the desirability of providing such information to consumers, it would be advantageous to provide a method and apparatus for explaining credit scores.


A. Flint, D. Lear, C. St. John, Method and Apparatus for Explaining Credit Scores, U.S. patent application Ser. No. 09/790,453 (Feb. 22, 2001) describe a Web site containing an array of informative resources including for-pay services and extranet functions to serve consumers and traditional players in the financial services industry, including financial counselors, mortgage brokers, direct lenders, large national credit issuers, and third-party credit report re-sellers, plus information seekers such as the press, consumer groups, and government agencies. A primary focus of the Flint et al. invention is to educate consumers, consumer groups, and the consumer press by offering them access to the exceptionally high-quality information, both general and personal, about the practices of collection, storing, reporting, and evaluating consumer credit data.


It would be advantageous to provide an algorithm for explaining credit scores, for example in connection with a credit score explanation service.


SUMMARY OF THE INVENTION

An exemplary Web-based score explanation service typically requires only the credit bureau identifier, credit score, and up to four reason codes as input. The invention herein discloses an algorithm that is used to provide an explanation of the primary factors influencing the score based upon a rich data feed.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a block schematic diagram showing targeted users, access and entry points, and services provided by myFICO.com; and



FIG. 2 is a flow diagram showing an algorithm for explaining credit scores according to the invention.





DETAILED DESCRIPTION OF THE INVENTION

The presently preferred embodiment of the herein described algorithm for explaining credit scores is provided for use in conjunction with a credit score explanation service that may be implemented in any of several embodiments. The preferred embodiment of the invention operates in conjunction with a Web site containing an array of informative resources including for-pay services and extranet functions to serve consumers and traditional players in the financial services industry, including financial counselors, mortgage brokers, direct lenders, large national credit issuers, and third-party credit report re-sellers, plus information seekers such as the press, consumer groups, and government agencies. A primary focus of such Web site is to educate consumers, consumer groups, and the consumer press by offering them access to the exceptionally high-quality information, both general and personal, about the practices of collection, storing, reporting, and evaluating consumer credit data.


The working title of the Fair, Isaac and Company (FICO) Web presence is myFICO (myfico.com) because the most visible elements of the service are aimed directly at consumers who want to learn about their own FICO score.


Although the on-demand receipt of FICO scores is thought to be the primary draw to the site (based on consumer interest and press coverage), the invention also offers access to additional valuable services, such as registration in an opt-in/opt-out database, the ability to initiate requests for credit investigations, the ability to link to consumer credit counseling services should scores be low and represent high risk, and the ability to access multiple reports from different repositories upon request. These services heighten the level of consumer education, and also offer individuals access to information, actions, and preferences they have not had previously.


An additional benefit is to use myFICO.com to supply the consumer with their score and if that score is sufficient to pass the cutoff scores of specific brokers or lenders, the credit scoring developer can pass the consumer's name, application, and credit score on to the lender for consideration. The invention allows the credit scoring developer to build broker networks to refer these applicants to lenders who would approve them. The credit scoring developer can also link the applicants' email address to credit companies who wish to pre-approve and solicit these consumers based on score. This is a much more cost effective origination process (via email) than direct mail today.



FIG. 1 is a block schematic diagram showing targeted users, access and entry points, and services provided by myFICO.com 10. Access by consumers 12 may be through a credit reporting agency 13, using an identification verification process to access credit score reports 17, for opt-in/opt-out requests 16, to access a report service 18, and to initiated on line investigations 19; through a secure, one time connection 15 for one-off credit score reports 20; or through an entirely anonymous access method 14 (the latter also allows access by government agencies 22) for consumer oriented information 21. myFICO.com also provides an extranet logon facility 25 to credit score reports 37 for such users as financial counselors 24, mortgage brokers 26, and direct lenders 27; an automated application service provider entry 29 to credit score reports 30 and other reports 31 for such users as large credit issuers 28, on line financial service providers 32, and credit report resellers 33; and repository access 35 to credit score reports and other reports 36 for repository consumer representatives 34 and credit report resellers 33. See A. Flint, D. Lear, C. St. John, Method and Apparatus for Explaining Credit Scores, U.S. patent application Ser. No. 09/790,453 (Feb. 22, 2001).


Score Explanation Service


An exemplary Web-based score explanation service requires, for example, only the credit bureau identifier, credit score, and up to four reason codes as input. In contrast, the invention herein discloses an algorithm that is used to provide an explanation of the primary factors influencing the score based upon a rich data feed. This algorithm can be enhanced depending upon the amount of input data available, although the use of an enhanced algorithm is optional and not considered to be a key element of the subject invention. The actual explanations may be selected as appropriate for the application to which the invention is put. Typical explanations are those described in A. Flint, D. Lear, C. St. John, Method and Apparatus for Explaining Credit Scores, U.S. patent application Ser. No. 09/790,453 (Feb. 22, 2001).



FIG. 2 is a flow diagram showing an algorithm for explaining credit scores according to the invention. The invention comprises a score explanation algorithm, which could be applied to any score. The presently preferred embodiment of the invention provides the basis for a general web-based score explanation service (see FIG. 1).


Score Explainer


Consider a score, which can be written as some function of a set of prediction characteristics (100)

Score=ζ(χ12, . . . ,χc),

    • where
    • χj=Prediction Characteristic j.


For a credit bureau risk score, there are about 80 prediction characteristics, which can include, for example, such characteristics as the number of trade lines with a current delinquency, although the actual number chosen when practicing the herein disclosed invention can vary depending upon the particular application to which the invention is put. If the score is a single scorecard, i.e. a scoring model where the score for an individual is the sum of their characteristic scores, and where the number of terms in this sum is the number of characteristics, so that, for each characteristic, the individual is assigned a score weight and then their final score is the sum these score weights, then it can be written in the form






Score
=




j
=
1

c





ς
j



(

χ
j

)


.






If the score is a segmented scorecard, i.e. where the population is segmented into mutually exclusive segments, and where a separate scorecard model is developed for each segment, then the formula is more complicated. For example the score weight associated with a particular value of a particular characteristic depends on what segment the individual is in. In such cases, the formula can be written down and analyzed. In fact, the invention disclosed herein works for any score, which can then be computed in a reasonably fast manner by application of the invention thereto.


To explain a score in detail, we define a set of surrogate characteristics, z1, z2, . . . , zp, which are labeled “Areas for improvement” (110). Areas for improvement include, for example, such factors as too much delinquency, too much debt, short credit history, etc. There are p Areas for Improvement, each area represented mathematically by a surrogate characteristic. Typically, p<c. For example, see below.


Use these Areas for Improvement surrogate characteristics to develop a surrogate score of the form

ψ(z1,z2, . . . ,zp),

which is developed using z1, z2, . . . , zp as the prediction characteristics and

y=ζ(χ12, . . . ,χc),

as the performance (dependent) variable (120). Now consider the customer, who wants their score explained. Their actual score is

Score*=ζ(χ1*,χ2*, . . . ,χc*),

and their values of z1, z2, . . . , zp are

z1*,z2*, . . . ,zp*),


A rich data feed with the values of the z's is needed, but not necessarily the values of the x's.


Associated with each Area for Improvement, define the potential improvement metric (130)










I
k

=



100



[


max

z
k




{


ψ


(



z
1
*



(

z
k

)


,





,

z
k

,





,


z
p
*



(

z
k

)



)


-

ψ


(


z
1
*

,








z
k
*


,





,

z
p
*


)



}


]


Score
*









=



Maximum





possible





percent





improvement











for





Area





of





Improvement





k

,









where, e.g. z1*(zk)=z1* unless z1* cannot coexist with zk. In that case, z1*(zk)=E[z1|zk], or some other value of z1 that can coexist with zk.


Suppose that the values of the I's are ordered as follows:

I4>I7>I2>I11> . . . .


Then one could optionally create an ordered Area for Improvement table of the form shown in Table “A” below (140).









TABLE A







Areas for Improvement








Areas for Improvement
Potential Percent Improvement in Score





4
I4


7
I7


2
I2


11 

I11



.
.


.
.


.
.









The number of Areas for Improvement listed would be small relative to the value of p.


Definitions of the Areas for Improvement


In the simple case, one could use the original prediction characteristics as the Areas for Improvement (152). However, in cases such as the credit bureau risk score, this would yield too many Areas for Improvement.


Another possibility is to use the set of prediction characteristics, which are computed, for example, as part of Fair Isaac's Search™ Software product as the Areas for Improvement (154). When Search™ goes to the credit bureau to get information on a person, these Search™ prediction characteristics are returned. The Search™ Software product improves origination decision making by automatically obtaining credit bureau reports and scores from the major North American consumer credit bureaus, as well as providing comprehensive and sophisticated analysis. In Search™ there are about 40 characteristics.


Another approach is to define a set of surrogate characteristics from some standard categorization of the credit bureau characteristics that go into scores (150). The categories listed on a Fair Isaac Web page (see http://www.myfico.com/filfsf.html) are (i) Payment history, (ii) Amounts Owed, (iii) Length of credit history, (iv) New credit, and (v) Types of credit use. Suppose that the 80 original characteristics are categorized into the five categories above as follows:

    • Category 1: x1, x2, . . . , x15
    • Category 2: x16, x2, . . . , x35
    • Category 3: x36, x2, . . . , x50
    • Category 4: x51, x2, . . . , x65
    • Category 5: x66, x2, . . . , x80


Then one could develop a surrogate characteristic of the form

z11(x1,x2, . . . ,x15),
where
φ1(x1,x2, . . . ,x15)

is a score developed using x1, x2, . . . , x15 as the prediction characteristics and the real credit score as the performance (dependent) variable (160). The variables z2, z3, . . . , z5 could be developed in a similar way.


This creates five possible Areas for Improvement (170), which might be too small. Each of the above categories could be broken into several sub-categories to create more Areas for Improvement.


The three approaches described above are shown on FIG. 2 as alternatives.


Discussion


Reason Codes


The following discussion of the invention is a generalization of the technique used today for computing reason codes. Associated with a person's score is a set of about four reason codes, which can be easily returned from a credit bureau along with the score value. In the background, associated with each reason code, is a score difference. These score differences are defined slightly differently than the score differences in the formula for lk but they could be converted to percentages in the same way that the score differences are converted to percentages. This idea is readily implemented with a data feed, which includes the score differences associated with the reason codes.


Segment Split Variables


The above formula for lk takes into account automatically and precisely the split variables used to define the segments in a segmented score. The maximization over zk for a split variable involves the computation of the customer's score in several segments, because, as zk varies over its range, one moves from segment to segment.


Rich Data Feed


The presently preferred embodiment of the herein disclosed invention requires a rich data feed, i.e. the values of the z*'s. However, the values of the x*'s are not needed, unless they are the same as the z*'s.


Number of Areas for Improvement


In the preferred embodiment, the total number of Areas for Improvement could be just about any number between five and 80. However, showing the customer the typical top four is presently preferred. The total number is preferably significantly more than four, i.e. five is too small. Twenty is presently preferred. This means that the five categories mentioned above in connection with the Fair Isaac site should be expanded to about twenty. The original 80 prediction characteristics should be put into about twenty categories.


Generality


The algorithm described herein would work for any score, which can then be computed in a reasonably fast manner by application of the invention herein thereto. This includes neural networks and regression trees.


Although the invention is described herein with reference to the preferred embodiment, one skilled in the art will readily appreciate that other applications may be substituted for those set forth herein without departing from the spirit and scope of the present invention. Accordingly, the invention should only be limited by the Claims included below.

Claims
  • 1. A computer-implemented method for explaining credit scores, the method comprising: providing a Web site on a web server that contains informative resources, the Web site:providing consumers access to the informative resources via a network connected to the web server, the informative resources comprising information representing practices of the collecting, storing, reporting, and evaluating of consumer credit data;receiving consumer credit scores and reason codes from individual consumers or third parties, in interactive or batch modes;providing an explanation report to the individual consumers based upon the consumer credit scores associated with each individual consumer;defining a credit score as a function of prediction characteristics;defining a surrogate set of characteristics as representing areas for improvement;developing, by at lest one data processor, by a server, a surrogate score to approximate a real credit score, using said real credit score as a performance dependent variable and using said surrogate characteristics as predictors; anddefining a potential improvement metric for each area for improvement.
  • 2. The method of claim 1, wherein the Web site further creates an ordered areas for improvement table.
  • 3. The method of claim 1, wherein the Web site further uses said surrogate characteristics as said areas for improvement.
  • 4. The method of claim 1, wherein the Web site further defines a set of surrogate characteristics from a standard categorization of original prediction characteristics.
  • 5. The method of claim 4, wherein the Web site further develops surrogate characteristics as models developed using said categorized prediction characteristics and said credit score as said performance variable.
  • 6. The method of claim 5, wherein the Web site further creates said areas for improvement.
  • 7. The method of claim 1, wherein said defining a credit score step is defined as: Score=ζ(x1,x2, . . . ,xc),wherexj=Prediction Characteristic j.
  • 8. The method of claim 1, wherein said developing a surrogate score step further comprises the step of: using said areas for improvement prediction variables to develop a surrogate score of the form Ψ(z1,z2, . . . ,zp)
  • 9. The method of claim 1, wherein the Web site further: associates a set of reason codes with a score returned from a credit bureau;associates a score difference with each reason code;provides a data feed which includes score differences associated with said reason codes; andconverts score differences to percentages.
  • 10. The method of claim 1, wherein the Web site further compares a current score to a maximum score that can be obtained by varying said prediction characteristic.
  • 11. A credit score explanation system comprising: a Web site running on a server that contains informative resources, said Web site comprising any of for-pay services and extranet/internet functions;said Web site offering any of consumers and said third parties access to information contained in said informative resources, the resources being both general and personal, about practices of any of collection, storing, reporting, and evaluating consumer credit data; andsaid Web site for accepting consumer credit scores and reason codes from any of individual consumers or third parties, in interactive or batch mode, for providing an explanation report to said individual consumers based upon the individual consumers' credit scores, for defining a credit score as a function of prediction characteristics, for defining a surrogate set of characteristics as representing areas for improvement, and for developing a surrogate score to approximate a real credit score, using said real credit score as a performance dependent variable and using said surrogate characteristics as predictors, and for defining a potential improvement metric for each area for improvement.
  • 12. The system of claim 11, wherein the Web site further creates an ordered areas for improvement table.
  • 13. The system of claim, 11, wherein the Web site further uses said surrogate characteristics as said areas for improvement.
  • 14. The system of claim 11, wherein the Web site further defines a set of surrogate characteristics from a standard categorization of original c prediction characteristics.
  • 15. The system apparatus of claim 14, wherein the Web site further develops surrogate characteristics as models developed using said categorized prediction characteristics and said credit score as said performance variable.
  • 16. The system of claim 15, wherein the Web site further creates said areas for improvement.
  • 17. The system of claim 11, wherein defining a credit score comprises: Score=ζ(x1,x2, . . . ,xc),wherexj=Prediction Characteristic j.
  • 18. The system of claim 11, wherein said developing a surrogate score step further comprises: using said areas for improvement prediction variables to develop a surrogate score of the form Ψ(z1,z2, . . . ,zp),
  • 19. The system of claim 11, wherein defining a potential improvement metric step is defined as:
  • 20. The system of claim 11, wherein the Web site is further adapted for: associating a set of reason codes with a score returned from a credit bureau;associating a score difference with each reason code;providing a data feed which includes score differences associated with said reason codes; andconverting score differences to percentages.
  • 21. The system of claim 11, wherein the Web site further compares a current score to a maximum score that can be obtained by varying said prediction characteristic.
  • 22. An article of manufacture comprising: computer executable instructions stored on non-transitory computer readable media, which, when executed by a computer, causes the computer to perform operations comprising: providing a Web site on a web server that contains informative resources, the Web site:providing consumers access to the informative resources via a network connected to the web server, the informative resources comprising information representing practices of the collecting, storing, reporting, and evaluating of consumer credit data;receiving consumer credit scores and reason codes from individual consumers or third parties, in interactive or batch modes;providing an explanation report to the individual consumers based upon the consumer credit scores associated with each individual consumer;defining a credit score as a function of prediction characteristics;defining a surrogate set of characteristics as representing areas for improvement;developing a surrogate score to approximate a real credit score, using said real credit score as a performance dependent variable and using said surrogate characteristics as predictors; anddefining a potential improvement metric for each area for improvement.
  • 23. The article of claim 22, wherein developing a surrogate score step further comprises: using said areas for improvement prediction variables to develop a surrogate score of the form Ψ(z1,z2, . . . ,zp),
  • 24. The article of claim 22, wherein the Web site further: associates a set of reason codes with a score returned from a credit bureau;associates a score difference with each reason code;provides a data feed which includes score differences associated with said reason codes; andconverts score differences to percentages.
RELATION TO OTHER PATENT APPLICATIONS

This application is a Continuation of U.S. patent application Ser. No. 09/919,074 filed 30 Jul. 2001 now U.S. Pat. No. 7,280,980 which is a continuation in part of U.S. patent application Ser. No. 09/790,453 filed 22 Feb. 2001, and claims priority to U.S. Patent Application Ser. No. 60/222,231 filed 1 Aug. 2000, and U.S. Patent Application Ser. No. 60/222,205 filed 1 Aug. 2000.

US Referenced Citations (35)
Number Name Date Kind
4736294 Gill Apr 1988 A
4876592 Von Kohorn Oct 1989 A
4895518 Arnold Jan 1990 A
5034807 Von Kohorn Jul 1991 A
5259766 Sack Nov 1993 A
5274547 Zoffel et al. Dec 1993 A
5611052 Dykstra Mar 1997 A
5615408 Johnson Mar 1997 A
5704029 Wright Dec 1997 A
5732400 Mandler Mar 1998 A
5774883 Andersen Jun 1998 A
5793972 Shane Aug 1998 A
5875236 Jankowitz Feb 1999 A
5878403 DeFrancesco Mar 1999 A
5930764 Melchione Jul 1999 A
5930776 Dykstra Jul 1999 A
5940812 Tengel Aug 1999 A
5950172 Klingman Sep 1999 A
5966695 Melchione Oct 1999 A
5995947 Fraser Nov 1999 A
6029149 Dykstra Feb 2000 A
6064987 Walker May 2000 A
6070141 Houvener May 2000 A
6088686 Walker Jul 2000 A
6094643 Anderson Jul 2000 A
6105007 Norris Aug 2000 A
6115690 Wong Sep 2000 A
6119103 Basch Sep 2000 A
6128599 Walker Oct 2000 A
6128603 Dent Oct 2000 A
6324524 Lent et al. Nov 2001 B1
6405181 Lent Jun 2002 B2
6567791 Lent May 2003 B2
7143063 Lent Nov 2006 B2
20020077964 Brody et al. Jun 2002 A1
Foreign Referenced Citations (4)
Number Date Country
0869652 Oct 1998 EP
0913789 May 1999 EP
WO-0026833 May 2000 WO
WO0157720 Aug 2001 WO
Related Publications (1)
Number Date Country
20070288338 A1 Dec 2007 US
Provisional Applications (2)
Number Date Country
60222231 Aug 2000 US
60222205 Aug 2000 US
Continuations (1)
Number Date Country
Parent 09919074 Jul 2001 US
Child 11829606 US
Continuation in Parts (1)
Number Date Country
Parent 09790453 Feb 2001 US
Child 09919074 US