Computers are being exploited increasingly to enable commerce between firms (e.g., businesses) and their customers. For example, many customer transactions are performed via communication with one or more websites of a firm. In any event, since customers often are identified uniquely in computer-logged activities, customer transactions with a firm can be stored as data for analysis. The activities of individual customers can be mined to provide information about customer behavior.
Customers can engage in commerce with a firm in a contractual or non-contractual setting. In a contractual setting, the firm may provide goods/services under an agreement that is maintained and/or renewed explicitly or implicitly over time and that is terminated expressly. For example, the firm may provide cable television service to customers via a monthly contract that can be terminated by each customer at the end of any month. As another example, the firm may be a bank that provides banking services to account holders that entrust the bank with their money and that remain customers as long as some of the money remains with the bank. Accordingly, commerce performed in a contractual setting allows a firm to observe when customers become inactive, which is referred to as customer “churning.” Thus, a firm in a contractual setting can identify its active customer base with accuracy. In contrast, in a non-contractual setting, a firm may provide goods/services on demand, without any agreement about whether or not a customer will remain active with the firm.
Distinguishing active customers from inactive ones in a non-contractual setting can be problematic. Customers that are still active, but have not exhibited recent activity, cannot be distinguished unambiguously from those that have churned. Thus, in a non-contractual setting, customers often are deemed as active or inactive based on an approach using an arbitrary measure of activity, such as whether or not a customer has performed a transaction with the firm within a given period of time, such as one year. However, this approach is inaccurate and reactive, instead of proactive.
The systems and methods disclosed herein may permit a film to identify customer current and future status for developing retention and loyalty stratagies at the individual customer level in a non-contractual setting. A customer's status relates to whether the customer is active (“alive”) or inactive (“dead” or “churned”). The customers that are active may still be using the firm's products and/or services and thus have potential future value to the firm. In contrast, churned customers interacted with the firm in the past but may have chosen to use the goods and/or services of a competitor of the firm, or may have left the industry altogether, among others. In some cases, a churned customer may bring negative value to the firm through negative comments or flagging the firm's communications as spam. In any event, by assessing the status of its customers, the firm may work more effectively to improve customer retention. The ability to retain a customer adds tremendous value to the firm. For example, reducing churn rate by one percent may add, on average, about five percent to the firm's value. The ability to predict the status of customers now (current status) and also predict their status in any given time window into the future (future status) may be of tremendous value to the firm as it enables the firm to implement retention and loyalty strategies that are proactive instead of reactive. Furthermore, estimating weights of different customer attributes as drivers of customer action and customer churn provide additional insights into which attributes are the key drivers of customer experience and which of the firm's processes and systems need to be improved to ensure an enhanced customer experience.
A customer “action,” as used herein, is any type of session, such as a transaction and/or interaction, involving both a customer 102 and firm 106. An action also may be termed an “action session.” The actions available to a customer may be determined by the type of business conducted by the firm. For example, the firm may conduct business over a computer network (e.g., the Internet), such as via one or more websites. Exemplary types of customer actions that may be performed over a computer network include registration, a visit (e.g., to a firm website), a download of one or more files, an upload of one or more files, an order and/or purchase of one or more goods and/or services, file viewing, sharing a file(s) (e.g., with another customer), or the like. Exemplary types of customer actions that may be performed by a customer physically present at the firm include registration, a purchase of one or more goods/services, a visit, a consultation, a trade, a return of one or more purchased goods/services, or the like.
A “non-contractual setting,” as used herein, is any business arrangement between a firm and customers in which each customer can become inactive at any selected time without notifying the firm and thus without observation by the firm. The term “churn” is used herein to denote silent attrition, namely, the unobservable event of a customer becoming inactive. In a non-contractual setting, the firm cannot know with certainty whether any given customer that has not performed an action for an extended period of time has actually churned or is just taking a long hiatus from doing business with the firm. The present disclosure provides a measure of the likelihood of customer action at a selected time point after a customer's most recent action and thus offers an indication of customer churn.
A “firm,” as used herein, is any person or organized group of people that offers goods and/or services to customers, generally for commercial purposes.
A “customer,” as used herein, is any person or organized group of people that performs actions, such as transactions and/or interactions, with a firm, generally for commercial purposes.
A customer “attribute,” as used herein, is any characteristic of customers. An attribute for an individual customer may be constant or may vary with respect to time and/or. customer action number. Exemplary attributes have values and/or may be assigned values for each customer, and may include age, gender, income, occupation, total number of actions, average time interval between actions, a time interval elapsed since the customer's most recent action, number of a particular type of action taken, etc. If the attribute varies over time, a value for the attribute for an individual customer may be determined, such as a value determined after an action has been taken by the customer. An attribute also may be termed a “covariate.”
The customer data may also be processed in step 301. The action sessions may be organized as intervals with a beginning (from-date, from-action) and an ending (to-date, to-action). Action sessions for a customer where actions of the same type (e.g., uploading, ordering, sharing, etc.) occurred within a predefined length of time (e.g., 5, 10, or 30 minutes, among others) may be grouped as only one action session. For example, a customer may upload a set of files, one file at a time, and intuitively the uploading of these files should be grouped as the same action, that is, the same “upload session.”
In step 301, each action (or action session) for a customer may be assigned an action-session number, or “action number.” The action number may be an ordinal number that describes the relative temporal position of a particular customer action relative to the entire sequence of actions taken by the client. For example, as shown in
In step 302, the customer data is split into training and test data sets. The trailing data set is a collection of all the actions taken by the customers before the split date with the action dates as they were observed. After processing, a training data table may list, for each customer any combination of the following: customer identification number, from-action type (e.g., registration), from-action date/time, to-action type (e.g., upload), to-action date/time, duration time (time interval between from-action and to-action), values of attributes for the customer on or at the from-action date/time, and so on. For example, an attribute value may be the cumulative number of uploads from registration to the from-action (including the from-action).
The test data set is created from the same customer data set and can be composed of N artificial monthly intervals from the split date. After processing, the test data table may list any combination of the following: customer identification number, from-date, to-date, from-action, to-action, values of attributes assuming no additional observed customer actions after the split date, date of last action before the split date. The test data for computing the churn-risk-scores described below in step 203 of the flowchart 200 assumes prediction into the future from the split date or date of analysis, providing knowledge of any actions the customer may take in the future. The test data set for model performance testing described below in step 204 of the flowchart 200 includes the counts of the number of actions in the test data time horizon and intervals.
Returning to the flowchart 200 shown in
In step 403, hazard functions may be estimated from the strata. A “hazard function,” as used herein, is a probability measure that a customer will have a next action at a given time after a previous action, conditional on the occurrence of the previous action. The hazard function may be based on a Cox conditional proportional hazard model for multiple events (also termed “the Model”). The Model provides a statistical model in which a baseline hazard probability is resealed by one or more covariates. The hazard probability may respond exponentially to changes in the value of each covariate. The Model may be semi-parametric, with the hazard baseline function determined as an empirical probability distribution. Alternatively, the Model may be parametric, with the hazard baseline function specified by a theoretical probability distribution.
The Model may be exploited to incorporate the impact of time from the latest action (the “previous action”) and the values of customer attributes along with their weights. The Model may estimate the conditional probability of a next action at a time t after the previous action. The outcome of the Model for each stratum may be a set of one or more weights for respective corresponding customer attributes and baseline hazard rates at different time points from the previous action (i.e., the time at which the previous action was taken by a customer is time zero). Generally, a distinct hazard function may be estimated for each stratum j with the form
h
j(t)=h0j(t)exp (βjxj)
where h0j(t) is the baseline hazard function with respect to time t from the latest customer action (if represented by the stratum), and where βj represents an attribute weight for the stratum j and is multiplied by a corresponding customer attribute xj for the attribute weight. The attribute may have a customer-specific value defined at the time the stratum data occurred, such as when a customer performed the previous action. Any number of attribute weights and corresponding attributes may be included in the hazard function. Here, is shown explicitly, but in other examples, two, three, or more weights and corresponding attributes may be utilized. In other words, the baseline hazard function may be scaled using one or more weight values multiplied by their corresponding customer attribute values. Weights may be estimated separately for each stratum. The weights may be estimated using a maximum likelihood method, which may utilize both the uncensored data (from-action and to-action in stratum) and the censored data (EOD; from-action but not to-action in stratum).
In step 404, a likelihood of a next action may be calculated from a hazard function for a stratum. The likelihood may be calculated with a computer and may provide a likelihood of next action at one or more different time points from the latest action taken by individual customers whose latest action has an action number represented by the stratum. The calculation for an individual customer may be performed by selecting a time value (for a time interval beginning at the customer's latest action), obtaining values for weights of the stratum, determining values for attributes of the customer corresponding to each of the weights at the time the customer completed the latest action), and placing the values into the hazard function to compute a likelihood of next action. In some embodiments, the likelihood of next action may be operated on to provide a probability that expresses a churn-risk score described below in step 203 of the flowchart 200.
In step 405, the method of predicting the likelihood of next action returns to the method of flowchart 200 shown in
Returning to
The churn-risk scores enable the selection of particular customer retention and loyalty strategies to be applied at different levels of customer aggregation. For example, customers with churn-risk scores that fall within a certain range, such as 0.80 to 0.99, may indicate that specifically targeted customer retention and loyalty strategies have to be applied to prevent these customers from churning. Different strategies, such as less aggressive stategies, can be applied to groups of customers with churn-risk scores that fall within a different range, and other strategies can be applied to the whole customer base. The ability to compute churn likelihood at the individual customer level allows for one-on-one targeting of messages, such as sending e-mail messages or letters that make specific offers directed to catching the attention of individual customers as part of the retention and loyalty strategy.
A customer churn-risk score is the probability of no action for a period of time, such as 6 months or 1 year, from the last observed action date of the customer, conditional on the last action number of the customer. In particular, the churn-risk score can be computed using a computing device for each customer based on the customer's last action date, last action number for assigning the customer to a particular stratum, s, weights of the attributes from the stratum s, and values of the attributes on the last action date. The method of computing the churn-risk score allows for computing the churn-risk scores for different suitable churn time horizons, which can be defined by the user. For example, a churn time horizon can be 6 months, 1 year, 2 years, or even 3 or more years as defined by the user. The method also allows for computing the churn-risk scores for different time periods into the future from the date of analysis, such as 6 months or 1 year.
The method for computing a customer churn-risk score allows for time varying attributes. Customer attributes may include gender, age, income, and health just to name a few. Over time, certain attributes, such as age, income, and health can change. For example, a customer's age continues to increase with tune, income may increase or decrease, and a customer's health may deteriorate with time. Thus, methods of the present invention account for time varying attributes, which can be accomplished by breaking the time period into smaller time intervals and computing a conditional survival probability of surviving until the end of the interval conditional on having survived until the beginning of the interval. In certain embodiments, the churn-risk of a customer is given by the survival probability:
P(S>t|x(t1))
where x(t1) represents the time dependent attributes of the customer at time t1. Assuming that additional attributes are know at later times t1, 12, . . . , tJ, where t1<t2< . . . <tJ, and predicting the risks at all of these times to be r1, r2, . . . , rJ, respectively, the survival probability, or churn-risk score, becomes:
where tj is the largest time for which time varying attributes are known and tj≦t.
The method presented in flowchart 200 also includes an optional step 204. In optional step 204, the test data set generated in step 302 of the flowchart 300 is used to test the Model's performance. In other words, in step 204, the performance of the methods carried out in steps 202 and 203 are tested. For example, for a given customer on the date of analysis, such as the last observation date, the probability that the customer will have no action for a period of time t can be computed using the conditional probability:
P(S>y0+t|S>y0)
where y0 is the time from the date of the last action to the date of analysis. The conditional probability can be calculated for different values of t. For example, time t can be 1 day to 1 month to 3 months, or 1 year to 3 years. The actual time from the date of analysis to the next action can also be computed, if the actual time has been observed. In step 204, actual and predicted rates of inactivity can be computed in the period of time t from the date of analysis and can be computed at different levels of aggregation for individual customers, customer segments or groups, and the whole customer base level.
Step 204 also allows for the creation of a feedback loop, in step 205, in order to tune and improve the methods of steps 202 and 203. In step 205, when the Model is deemed acceptable, the method proceeds to step 207. Otherwise, when the output of the Model is deemed unacceptable, the method proceeds to step 206. in step 206, parameters, such as weights and attributes can be adjusted by the user and steps 202-204 can be repeated.
In step 207, what-if-scenarios can be created to identify the action, called the “next-best-action,” taken by the firm to reduce a customer's churn-risk score, or probability of churning. What-if-scenarios are created by comparing a customer's churn-risk score with the same customer's churn-risk score under different hypothetical scenarios, which can be accomplished as follows. For each customer, a hypothetical churn-risk score is computed as if the customer had performed an action of a certain type on a particular day, such as the date of analysis. Next, what if scenarios are created and performed at different times in the future for combinations of actions and can incorporate the likelihood, or probability, of the customer taking a certain type of action based on the results obtained in step 202. What-if-scenarios can be performed at different levels of aggregation ranging from individual customers, to groups of customers, to the whole customer data base. For example, a churn-risk score computed for an action taken on the date of analysis is compared with the customer churn-risk scores for different hypothetical actions taken at later dates.
In determining what-if-scenarios, the data preparation described above in step 201 is modified to create a data set where a hypothetical action of a certain type performed on a certain date is added to the customer data base and attribute values that depend on the hypothetical action are updated to reflect the change. Steps 201-203 are repeated to create a new churn-risk score for the same customer as if the customer had performed the hypothetical action on the date of analysis. A user can define the date of analysis as well as other aspects of steps 201-203. For example, a hypothetical upload action taken on the date of analysis by a customer changes the customer's data in the customer data base because the number of uploads since registration has increased by 1 and the time since last upload decreases to 0.
In step 208, when-to-act time thresholds are computed in order to identify when, from the date of analysis, a non-high risk customer of churning will become a high-risk customer of churning at sonic later time. The when-to-act time threshold is the time from the date of analysis when the customer's churn-risk score is greater than a churn-risk threshold defined by the user. The time threshold can be used to trigger active retention and loyalty strategies for the customer. A churn-horizon time, t*, defined as the length of time over which an inactive customer is considered to have churned is defined by the use is used in the calculation of the time thresholds. For example, the churn-horizon time can be 6 months or 1 year. Suppose a user sets the churn-risk threshold at 0.95. The when-to-act time threshold, t(s), for a particular customer segment s can be defined as:
P(S>t*|S>t(s),s)=0.95
or equivalently by:
In other words, the probability of the customer segment s churning after the when-to-act time threshold has passed is equal to the probability of the customer segment s churning after the churn-horizon time has passed.
In step 209, retention and loyalty strategies are selected. What-if-scenarios help define and execute one or more, called “call-to-action strategies” as part of the customer retention and loyalty strategy. What-if-scenarios can be used to decide a retention strategy and communication strategy that places greater incentives for a customer to take the next-best-action from a set of several possible actions. The firm can select one or more retention and loyalty strategies at an individual customer level, group level, or whole customer base level on what the next-best-action to use in carrying out the retention and loyalty strategy.
In general, methods of the present invention can be implemented on a computing device, such as a desktop computer, a laptop, or any other suitable device configured to carrying out the processing steps of a computer program.
The computer readable medium 508 can be any suitable article, or medium, that participates in providing instructions to the processor 502 for execution. For example, the computer readable medium 508 can be non-volatile media, such as firmware, an optical disk, a magnetic disk, or a magnetic disk drive; volatile media, such as memory; and transmission media, such as coaxial cables, copper wire, and fiber optics. The computer readable medium 508 can also store other software applications, including word processors, browsers, email, Instant Messaging, media players, and telephony software.
The computer-readable medium 508 may also store an operating system 512, such as Mac OS, MS Windows, Unix, or Linux; network applications 514; and a document layout application 518 that performs the methods of the present invention. The operating system 512 can be multi-user, multiprocessing, multitasking, multithreading, real-time and the like. The operating system 512 can also perform basic tasks such as recognizing input from input devices, such as a keyboard, a keypad, or a mouse; sending output to the display 504; keeping track of files and directories on medium 510; controlling peripheral devices, such as disk drives, printers, image capture device; and managing traffic on the one or more buses 510. The network applications 514 include various components for establishing and maintaining network connections, such as software for implementing communication protocols including TCP/IP, HTTP, Ethernet, USB, and FireWire.
The computer readable medium 508 can also store a customer status application 516 that provides various software components for operating on the data 518 and automatically carrying out the methods described above with reference to
The foregoing description, for purposes of explanation, used specific nomenclature to provide a thorough understanding of the invention. However, it will be apparent to one skilled in the art that the specific details are not required in order to practice the invention. The foregoing descriptions of specific embodiments of the present invention are presented for purposes of illustration and description. They are not intended to be exhaustive of or to limit the invention to the precise foams disclosed. Obviously, many modifications and variations are possible in view of the above teachings. The embodiments are shown and described in order to best explain the principles of the invention and its practical applications, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalents:
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US10/26488 | 3/8/2010 | WO | 00 | 9/20/2012 |