The present invention is related to the field of loan risk assessment and computer-based loan risk tools. The invention is directed towards a method, system, and apparatus for semi-automatic and automatic loan risk analysis and automatic loan analysis capabilities in loan monitoring applications. In an off-line mode associated with a computing device, a plurality of loan account histories are utilized to train a predictive multi-output risk model. In an online mode, the multi-window computer-based tool presents options for both automatic loan analysis and semi-automatic loan analysis via a graphic user interface, or multiple graphic user interfaces, allowing a user or users to monitor a plurality of loan accounts for risk of default.
The personal lending industry, including the lending of student loans, auto loans, commercial loans, and mortgages, as well as other types of personal loans is valued at trillions of dollars in the United States in the twenty-first century. The total value of mortgages outstanding alone in the United States is $10 trillion dollars. The total value of all student loans outstanding in the United States in 2013 is currently between $902 billion and $1 trillion. The sheer volume of this debt leads to a large amount of competition among lenders, trying to extend the greatest number of loans which have a reasonable chance of being repaid with interest. The tendency to over-purchase existing personal loan accounts from other lenders as well as over-lend leads to situations such as presented in the 2009 Financial Crisis in which defaults of large amounts of mortgages and mortgage-backed securities consisting of individual homeowners' mortgages led to the failure of the entire banking industry, and the need for government bailouts to prevent another Great Depression.
Personal loan accounts consist of accounts such as auto loans, home mortgages, personal lines of credit, credit cards, student loans, and similar types of lending arrangements made to individuals. Whether a lender or loan servicer obtains management of personal loan accounts through direct lending, via assignment of an existing personal loan account, or any other means, the need to obtain information on loan risks remains. In any event once management of a personal loan account has been obtained it is necessary to continuously monitor the potential for default for the personal loan account itself. Collection services as well require information on the status of loans, and whether collection should be pursued or not. Monitoring is required to determine whether the personal loan remains an asset valuable enough to remain “on the books,” whether to file a lawsuit against the personal loan holder to collect on the debt, sell the personal loan to another owner loan servicer, or any other recourse.
With the specific example of the loan service provider industry, the number of loan accounts managed by a loan account service provider may number in the millions, even if only 6-10% of a population of loan accounts may be at risk of default at any one time, this may still amount to hundreds of thousands of loan accounts at risk of default at any one time, all requiring close surveillance. Accordingly, a need exists for a system, method, and apparatus for loan risk targeting and action prioritization which facilitates semi-automatic and automatic loan analysis capabilities and determination of future risk.
The present invention is directed towards a method, system, and apparatus for loan risk targeting and action prioritization using a multi-window computer-based tool associated with a computing device offering semi-automatic and automatic loan analysis capabilities for a user or multiple users.
In an embodiment of the invention, the invention comprises an off-line mode associated with the computing device. Some or all of a number of steps are performed during the off-line mode as is discussed herein. A definition is received from a single user of a predetermined maximum look-ahead timeframe p. A computing device associated with the multi-window computer-based tool then receives a plurality of loan account histories describing a plurality of loan accounts transmitted from a first computer database for loan risk analysis. A predictive multi-output risk model is trained with the plurality of loan account histories. The predictive multi-output risk model indicates a loan risk level associated with each of the received plurality of loan account histories and loan accounts according to a periodic basis up to the predetermined maximum look-ahead timeframe p. The “periodic basis” is discussed more extensively herein but the periodic basis may be a daily, weekly, bi-weekly, monthly, bi-monthly, or annual basis. The periodic basis may be selected by a single user during the off-line mode, such as via a periodic basis selection window in the multi-window computer-based tool. The predictive multi-output risk model is then stored in a second computer database.
The invention further comprises in some embodiments an “online mode.” The online mode is associated with the computing device (and may also be associated with other operatively connected computing devices). During the online mode, the presently disclosed invention may be accessed by one or more users. The online mode comprises one, all, or some of the steps as described below. The online mode may begin with the computing device presenting to the user or multiple users the output of the predictive multi-output risk model trained during the off-line mode indicating the loan risk level associated with each of the plurality of loan accounts according to the periodic basis up to the adjusted look-ahead timeframe p.
During the online mode an option for semi-automatic loan analysis is presented to the user or multiple users via a first graphic user interface in the multi-window computer-based tool, allowing the user or multiple users to be presented the output of the predictive multi-output risk model indicating the loan risk level associated with each of the plurality of loan accounts via the multi-window computer-based tool within the predetermined maximum look-ahead timeframe p according to the periodic basis. An option for automatic loan analysis is also presented to the user or multiple users via a second graphic user interface in the multi-window computer-based tool allowing the user or multiple users to be automatically presented with one or a plurality of loan accounts at a greatest level of risk of all loan accounts within the predetermined maximum look-ahead timeframe p according to the periodic basis via display in the multi-window computer-based tool. Before presenting these options, the computing device may determine a number of users using the multi-window computer-based tool.
The first graphic user interface window may also present further options such as allowing the user or multiple users to select, drag, and rank loan accounts in the multi-window computer-based tool or present the option of selecting two or more loan accounts for comparison at two or more points in time.
The first graphic user interface window may allow the user or multiple users to adjust the predetermined look-ahead timeframe p and, upon adjustment, trains a new predictive multi-output risk model with the received plurality of loan account histories based on the adjusted maximum look-ahead timeframe p. If a user chooses to adjust the predetermined look-ahead timeframe p, the option for semi-automatic loan analysis presents the output of the new predictive multi-output risk model indicating the loan risk level associated with each of the plurality of loan accounts via the multi-window computer-based tool according to the periodic basis up to the adjusted look-ahead timeframe p.
Finally, the first graphic user interface window may inform the user or multiple users of the loan risk level associated with each of the plurality of loan accounts via one of the following: (a.) display of the loan risk level associated with each of the plurality of loan accounts at a beginning of the maximum adjusted look-ahead timeframe p according to the periodic basis; (b.) display of the loan risk level associated with each of the plurality of loan accounts at an end of the adjusted maximum look-ahead timeframe p according to the periodic basis, and (c.) display of a determination of whether the loan risk level associated with each of the plurality of loan accounts is anticipated to be at a level of risk higher than a level of risk threshold for each intervening period within the periodic basis up to the end of the adjusted maximum look-ahead timeframe p.
The second graphic user interface associated with the option for automatic loan analysis may also present further options. The second graphic user interface may only present loan accounts which are not actively monitored by the user or multiple users. The second graphic user interface window may further present the one or plurality of loan accounts at the greatest level of risk of all loan accounts by presenting the one or plurality of loan accounts at the greatest level of risk in batches of a given size. The batches may consist of 1, 2, 3, 4, 10, 20, 25, or 50 loan accounts, may be in the range of 1 through 50, or any other number. In such a case, the computing device may consider that each user processes a certain number of loan accounts simultaneously and the computing device may assign loan accounts to users according to a metric that indicates a length of time a loan account has been unattended by a user. The metric may be calculated by an equation such as:
or any other.
The computing device may assign loan accounts to users according to a half-daily basis, a daily basis, a weekly basis, a monthly basis, and a bi-monthly basis.
The computing device may assign loan accounts unattended for a length of time in any embodiment. The length of time a loan account must be unattended before being assigned may be any time period which has passed during which a user has not reviewed a loan account, such as one hour, one day, two days, one week, one month, or six months.
The second graphic user interface may prioritize display of the one or the plurality of loan accounts at the greatest level of risk of all loan accounts via a further criterion selected by the user or multiple users. The criterion may be (a.) a level of variance in a predicted level of risk of one or a plurality of loan accounts at the greatest level of risk from a baseline value during the maximum look-ahead timeframe p according to the periodic basis; (b.) monotonic increasing of the predicted level of risk of one or the plurality of loan accounts at the greatest level of risk during the maximum look-ahead timeframe p according to the periodic basis; and/or (c.) monotonic decreasing of the predicted level of risk of the one or plurality of loan accounts at the greatest level of risk during the maximum look-ahead timeframe p according to the periodic basis.
These and other aspects, objectives, features, and advantages of the disclosed technologies will become apparent from the following detailed description of illustrative embodiments thereof, which is to be read in connection with the accompanying drawings.
Describing now in further detail these exemplary embodiments with reference to the figures as described herein, the method, system, and apparatus for semi-automatic and automatic risk targeting and action prioritization in loan monitoring applications is described below. It should be noted that the drawings are not to scale.
A “loan account” (within the context of this and associated patent applications) and the associated “loan account history” describing the loan account is a record of debt for the lending of money (typically, for a specific purpose such as a payment for school tuition, refinancing a house, purchasing an automobile, etc.). A loan account contains one or more of the following: principal amount, interest rate, terms of repayment, date(s) of repayment, etc. As discussed within this patent application and associated patent applications, a loan account and an associated loan account history exist in a format accessible to a computing device for processing as a spreadsheet, .csv value, a matrix (as defined by certain programming languages), an array, a database entry, a linked-list, a free-structure, other types of computer files or variables (or any other presently existing or after-arising equivalent). Variables tracked include the origination date of the loan, the original amount of the loan, the remaining principle balance to be paid, the date of the monthly payment, the current interest rate, the terms of repayment, number of original monthly payments, number of remaining monthly payments, whether each monthly payment was timely (true/false), number days delinquent of every monthly payment (from 0 to a positive integer), credit score of loan account holder at various points in time, etc. In a further embodiment of the invention, variables further include loan status (is) (current or not), delinquency days (dd), and forbearance months (fm).
A “computing device,” as discussed in the context of this patent application and related patent applications, refers to one or multiple computer processors acting together, a logic device or devices, an embedded system or systems, or any other device or devices allowing for programming and decision making. Multiple computer systems may also be networked together in a local-area network or via the internet to perform the same function. In one embodiment, a computing device may be multiple processors or circuitry performing discrete tasks in communication with each other. The system, method, and apparatus described herein are implemented in various embodiments to execute on a “computing device[s],” or, as is commonly known in the art, such a device specially programmed in order to perform a task at hand. A computing device is a necessary element to process the large amount of data (i.e., thousands, tens of thousands, hundreds of thousands, or more of loan accounts and loan account histories). Furthermore, the present invention may take the form of a computer program product embodied in any tangible medium of expression having computer usable program code embodied in the medium. Computer program code for carrying out operations of the present invention may operate on any or all of the “server,” “computing device,” “computer device,” or “system” discussed herein. Computer program code for carrying out operations of the present invention may be written in any combination of one or more programming languages, including an object-oriented programming language such as Java, Smalltalk, C++, or the like, conventional procedural programming languages, such as Visual Basic, “C,” or similar programming languages. After arising programming languages are contemplated as well.
Referring to
A single user may initiate all set-up for the multi-window computer-based tool during the “Off-Line Mode” 105 associated with a computing device. “Users,” as discussed herein, refer to any person, persons, or entities who express an interest in setting up the multi-window computer-based tool or utilizing the multi-window computer-based tool seeking data regarding one or more loan accounts. Users may be but are not limited to customer service representatives, agents communicating on behalf of lenders, persons representing the loan issuer or servicer, or any other person, persons, or entities evaluating risk to a loan account or a plurality of loan accounts. A “user” may even be other computer software or computer “daemons” established to automatically access the presently disclosed invention.
Off-Line Mode does not refer to the lack of connectivity by the computing device to a network or the internet, but rather the unavailability of certain functionality to a user or users during this stage as during a “set-up” mode. At step 107, a definition is received from a single user of a predetermined maximum look-ahead timeframe p. In an embodiment of the invention, the predetermined maximum look-ahead timeframe p is the furthest length of time into the future for which loan prediction is performed, as provided by the multi-window computer-based tool. The user may be presented with a window in the multi-window computer-based tool to enter a value for p. Any value may be utilized for p, but a range of six months to several years is typical. An initial value for p may also be pre-established. In general, as the value of the maximum look-ahead timeframe p increases the accuracy of risk predictions decreases (for these further away points in time).
At step 110, a computing device associated with the multi-window computer-based tool receives a plurality of loan account histories describing a plurality of loan accounts transmitted from a first computer database for loan risk analysis. The loan account histories are transmitted for the purpose of determination of risk associated with them and the associated loan accounts at various points in time, and thereafter presenting such data to a user. Determination of risk associated with the loan accounts is associated with automated planning and control of actions to be taken with regard to the loan accounts in certain embodiments of the invention. In order to gain the advantage of using the presently disclosed invention over a simple by-hand analysis of such loan accounts and loan account histories, in the presently disclosed invention the number of loan accounts processed is substantial. The number of loan account histories in question most likely numbers at least in the tens of thousands or even millions. A computing device is a necessary element to process this number of loan account histories in a realistic fashion.
Optionally, at step 120, in an embodiment of the invention, the user selects a periodic basis via a periodic basis selection window in the multi-window computer based tool (such as via a slider-bar, radio button, or other menu option selectors). Typically the periodic basis is a daily, weekly, bi-weekly, monthly, bi-monthly, or annual basis, although any periodic basis may be utilized. The periodic basis is a timeframe upon which loan risk will be analyzed. The periodic basis is utilized by the multi-window computer-based tool in generation of a multi-output risk model. In an embodiment of the invention, the periodic basis selection window presents only options for the user to decide upon of several available periodic bases. The periodic basis is used in several ways, as further detailed below. In further embodiments of the invention the periodic basis is pre-established.
At step 125 a predictive multi-output risk model is trained with the received plurality of loan account histories, the predictive multi-output risk model indicating a loan risk level associated with each of said received plurality of loan account histories and loan accounts according to a periodic basis up to the predetermined maximum look-head timeframe p. The multi-output risk model uses the computing device to analyze the received plurality of loan account histories. In an embodiment of the invention, the multi-output risk model has one output per period as defined by the periodic basis, up to the maximum look-ahead timeframe p. Again, a computing device is a necessary element to handle the large amount of data being processed in a realistic timeframe. Furthermore, complex mathematical models as utilized in the presently disclosed method, system, and apparatus require a computing device to be calculated within a reasonable time.
In an embodiment of the invention, the predictive multi-output risk model is trained (as in step 125) as follows: Given the training input data (from the plurality of m-dimensional loan account histories for n different loan accounts) x Rn×m and the output data (containing risk metrics for n different accounts across look-ahead times 1 through p) y Rn×p, proceed to derive a mapping β ∈ Rm×p. Thus, in one embodiment, the predicted risk values may be obtained by computing ŷ=x*β, where ŷ∈ Rn×p Note that the x may contain linear and nonlinear terms along its second dimension, so either linear or nonlinear models may be constructed. Also, it is understood that the second dimension of x may also contain temporal data. In one embodiment, l-dimensional data for k months is available in the loan account histories, so m=l×k. In another embodiment, the l-dimensional data comprises linear terms and non-linear combinations of the linear terms (e.g., quadratic terms, first-order interactions, cubic terms, second-order interactions, and so on).
For example, consider application of an embodiment of the invention proposed here to real data collected from n=197,125 accounts that have m=332 variables. Ten-fold cross-validation is used to specify the training and test data and select without loss of generality of the partial least squares regression (PLS) linear method to predict the risk values. Other regression methods may also be applied. The input data sets from the plurality of loan account histories contain information from 12 months and outputs (risk factors) are computed for each of the p=6 months ahead from now. Thus, the model will estimate the risk from 1 month (M1) to 6 months (M6 below) ahead of defaulting. Table 1 (as below) shows the MSE for the training and testing.
Original input data: Contains a set of variables from the plurality of loan account histories x Rn×m from current month (mc) up to i months back (mc−i), where i Z (integer numbers). In this case, k=i+1.
Output data: Contains the risk factors y Rn×p assigned to all loan accounts from one month ahead up top months ahead (mc+p) from the current month (i.e., from 1 through mc+p). Let y(k) Rp denote the risk factors assigned to loan account k. In various embodiments the computation of risk values or risk intervals associated with each bank account is performed by inspection of the set y. Rules to assign risk values or risk intervals may be applied via standard logic, fuzzy logic, or even via an expert carrying out an inspection of the accounts themselves.
Performance metric: The performance is evaluated using the mean squared error between the actual and estimated risk values, available in the original input data. Other performance metrics can be used, including mean absolute error, mean absolute scaled error, etc.
Returning to
Execution continues onto
Steps 140 through 180 also take place during “On-Line Mode,” during which the multi-window computer-based tool is available to one user or multiple users to analyze risk associated with loan accounts and automatically make or assist in making loan planning decisions. As previously noted, in various embodiments of the invention, steps described 140 through 180 take place in any order, or individual steps may not take place at all. Additional steps not discussed may take place as well. In further embodiments of the invention, after completion of execution (as after steps 149 or 180), execution terminates or returns to steps 135 or 105 (of
In an embodiment of the invention, at step 140 the output of the predictive multi-output risk model previously trained is displayed to the logged-in user or users. The output of the predictive multi-output risk model indicates the loan risk level associated with each of the plurality of loan accounts according to the periodic basis up to the adjusted look-ahead timeframe p. This occurs during the online mode and previous to the user or multiple users being presented the option for semi-automatic loan analysis 145 or the option for automatic loan analysis 155. This display takes place in the embodiment of the invention via a computer monitor or printer associated with the computing device and that a user or users have access to.
At step 145, an option for a semi-automatic loan analysis is presented to the user or multiple users via a first graphic user interface window in the multi-window computer-based tool. This option allows the user or multiple users to be presented the output of the predictive multi-output risk model indicating the loan risk level associated with each of the plurality of loan accounts via the multi-window computer-based tool within the predetermined maximum look-ahead timeframe p according to the periodic basis. In an embodiment of the invention, the first graphic user interface window may also allow the user or multiple users to select, drag, and rank loan accounts in the multi-window computer-based tool. This is to facilitate a user or users viewing the loan accounts in an accessible fashion. The option to select, drag, and rank loan accounts in the multi-window computer-based tool allows the user or users to move and place loan accounts displayed in the first graphic user interface in more visible locations, allowing the user to best visualize, analyze, and manipulate the various loan accounts he or she is responsible for. Clicking and dragging, menus, and hot keys are convenient ways for a user or users to access this information.
At step 146 the first graphic user interface window additionally allows the user or multiple users to adjust the predetermined maximum look-ahead timeframe p, and, upon adjustment, trains a new predictive multi-output risk model with the received plurality of loan account histories based upon the adjusted maximum look-ahead timeframe p.
At step 147 the output of the new predictive multi-output risk model indicating the loan risk level associated with each of the plurality of loan accounts according to the periodic basis up to the adjusted look-ahead timeframe p is presented to the user or users. In an embodiment of the invention, the first graphic user interface further informs the user or multiple users of the loan risk level associated with each of the plurality of loan accounts via display of the following (a.)-(c.). (a.) A loan risk level associated with each of the plurality of loan accounts at the beginning of the adjusted maximum look-ahead timeframe p, according to the periodic basis; (b.) A loan risk level associated with each of the plurality of loan accounts at the end of the adjusted maximum look-ahead timeframe p according to the periodic basis; and (c.) A determination of whether the loan risk level associated with each of the plurality of loan accounts is anticipated to be at a level of risk higher than a level of risk threshold for each intervening period within the periodic basis up to the adjusted maximum look-ahead timeframe p.
At step 149, in an embodiment of the invention, the first graphic user interface window presents the option to the user or users selecting two or more loan accounts for comparison at two or more points in time.
As an alternate to selection of the option for semi-automatic loan analysis (as discussed above, with regard to steps 145 et seq.), at step 155, an option is presented for automatic loan analysis to users or multiple users via a second graphic interface window in the multi-window computer-based tool. The option for automatic loan analysis allows the user or multiple users to be automatically presented with one or a plurality of loan accounts at a greatest level of risk of all loan accounts within a maximum look-ahead timeframe p (which may be in various embodiments of the invention, either a predetermined maximum look-ahead timeframe or an adjusted maximum look-ahead timeframe p). If selected, the user(s) are then presented with the one or plurality of loan accounts at the greatest level of risk via display in the multi-window computer-based tool. In an embodiment of the invention the computing device assigns loan accounts to users according to a metric that indicates a length of time a loan account has been unattended by a user: the longer the length of time unattended relative to the length of time other loan accounts are unattended, the more likely a loan account is to be assigned. In a further embodiment of the invention, loan accounts are assigned to users on a half-daily, daily, weekly, monthly, or bi-monthly basis. In yet a further embodiment of the invention, loan accounts unattended for any length of time are assigned by the computing device.
In an embodiment of the invention, the first graphic user interface window (as discussed in step 145) and second graphic user interface window (as discussed in step 155) are actually contained in the same graphic user interface or window. As a user or users continue to access the multi-window computer-based tool in the course of performing his or her duties regarding loan analysis, her or she might switch back and forth between the first graphic user interface and second graphic user interface or use one graphic user interface to completion and then the next to completion.
In an embodiment of the invention, previous to presenting the option for automatic loan analysis as discussed in connection with step 155, at step 142 the computing device determines a number of users using the multi-window computer-based tool. In various embodiments of the invention, the determination of the number of users using the multi-window computer-based tool occurs previous to, during, or after the first or second graphic user interface is presented to the user or users.
At step 160 the second graphic user interface prioritizes display of the one or plurality of all loan accounts via a further criterion. The second graphic user interface may, for example, only present loan accounts which are not actively monitored by the user or multiple users. With multiple users accessing the multi-window computer-based tool, the number of users as well as which loan account or accounts each user is actively tracking, processing, or otherwise pursuing (such as by a user sending an email, calling, mailing a letter, or any other action directed towards a loan account by a user) is monitored, and the option for automatic loan analysis will thus only present loan accounts which are not being currently actively tracked by a user or multiple users of the multi-window computer-based tool. In further embodiments of the invention, the second graphic user interface prioritizes display of the one or plurality of loan accounts at the greatest level of risk of all loan accounts via another criterion such as: (a.) a level of variance in a predicted level of risk of one or the plurality of loan accounts at the greatest level of risk from a baseline variance value during the maximum look-ahead timeframe p according to the periodic basis; (b.) monotonic increasing in the predicted level of risk of the one or the plurality of loan accounts at the greatest level of risk during the maximum look-ahead timeframe p according to the periodic basis; and (c.) monotonic decreasing of the predicted level of risk of the one or the plurality of loan accounts at the greatest level of risk during the maximum look-ahead timeframe p according to the periodic basis. In a further embodiment of the invention, the display of the plurality of loan accounts at the greatest level of risk is updated when new data is available regarding the plurality of loan account histories. The second graphic user interface may only present the one or plurality of loan accounts in batches of a given size. In yet a further embodiment of the invention, the second graphic user interface only presents the one or plurality of loan accounts in batches of 1, 2, 3, 5, 10, 20, 25, or 50.
In a further embodiment of the invention, after step 160 at step 180 the computing device assigns loan accounts according to an assignment algorithm. In various embodiments of the invention, the assignment algorithm may be:
or any other.
Referring to
Referring to
Referring to
Referring to
Examples are provided as follows: In an embodiment of the invention, at time t′, each user j ∈ Q is given an account k*j ∈ P such that
If Tk
Referring to
Referring to
At 700, compute the standard deviation of risk values for each loan account. This calculation takes place via methodology as commonly known to one of skill in the art, and considering the number of loan accounts at issue (at least tens of thousands), a computing device is necessary for these calculations.
At 705, denote variable δk as the standard deviation of loan account k. At 710, denote priority
In an embodiment of the invention assume that priority is constant all the time, but in other embodiments it may be updated. At 715, the calculated priority value is used to assign loan accounts to users. Loan accounts may be assigned to users on a monthly basis or otherwise.
In other embodiments of the invention other values are used to assign loan accounts to users. ek is calculated for loan account k (with ek=[y(k)[i+2]−y(k)[i+1] . . . y(k)[i+p]−y(k)[i+p−1]], where y(k)[i+2] denotes the risk factor of loan account k two months from the current month. Now δk for account k may be computed as the standard deviation of the vector ek. If users are only interested in loan accounts with monotonically increasing risk values, loan accounts may be selected that have elements ek with values greater than or equal to zero. If users are interested in loan accounts with monotonically decreasing risk values, loan accounts may be selected that have values less than or equal to zero.
The preceding description has been presented only to illustrate and describe the invention. It is not intended to be exhaustive or to limit the invention to any precise form disclosed. Many modifications and variations are possible in light of the above teachings.
The preferred embodiments were chosen and described in order to best explain the principles of the invention and its practical application. The preceding description is intended to enable others skilled in the art to best utilize the invention in its various embodiments and with various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims.
This application is related to co-filed U.S. patent application Ser. No. 14/221,723 and U.S. patent application Ser. No. 14/221,944. These patent applications are incorporated in their entirety here.