The embodiments discussed herein are related to a hierarchical based sequencing (HBS) machine learning model.
Machine learning is a form of artificial intelligence that is employed to allow computers to evolve behaviors based on empirical data. Machine learning may take advantage of training examples to capture characteristics of interest of their unknown underlying probability distribution. Training data may be seen as examples that illustrate relations between observed variables. A major focus of machine learning research is to automatically learn to recognize complex patterns and make intelligent decisions based on data.
One main difficulty in machine learning lies in the fact that the set of all possible behaviors, given all possible inputs, is too large to be covered by a set of training data. Hence, a machine learning model must generalize from the training data so as to be able to produce a useful output in new cases.
One example of machine learning is traditional structured prediction (SP). Traditional SP is a single model approach to dependent output. With SP, once an input feature vector x is specified, a single correct output vector z can be fully specified. Thus the output vector z is fully conditioned on the input feature vector x and the different output components of output vector z (z1, z2, . . . ) are conditionally independent of each other given the input feature vector x. Thus, the probability of z1 given x is equal to the probability of z1 given x and z2, or p(z1|x)=p(z1|x, z2). However, traditional SP cannot handle an interdependent relationship between different output components. In addition, traditional SP cannot handle a problem having multiple correct output decisions for a given input.
The subject matter claimed herein is not limited to embodiments that solve any disadvantages or that operate only in environments such as those described above. Rather, this background is only provided to illustrate one example technology area where some embodiments described herein may be practiced.
In general, example embodiments described herein relate to methods of employing a hierarchical based sequencing (HBS) machine learning model to predict multiple interdependent output components of a multiple output dependency (MOD) output decision. The example methods disclosed herein may be employed to solve MOD problems.
In one example embodiment, a method includes employing a machine learning model to predict multiple interdependent output components of an MOD output decision.
In another example embodiment, a method of employing an HBS machine learning model to predict multiple interdependent output components of an MOD output decision may include determining an order for multiple interdependent output components of an MOD output decision. The method may also include sequentially training a classifier for each component in the selected order to predict the component based on an input and based on any previous predicted component(s).
In yet another example embodiment, a method of employing an HBS machine learning model to predict multiple interdependent output components of an MOD output decision may include selecting an order for multiple interdependent output components of an MOD output decision. The method may also include training a first classifier to predict the first component in the selected order based on an input. The method may further include training a second classifier to predict the second component in the selected order based on the input and based on the first predicted component.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention, as claimed.
Example embodiments will be described and explained with additional specificity and detail through the use of the accompanying drawings in which:
Some embodiments described herein include methods of employing a hierarchical based sequencing (HBS) machine learning model to predict multiple interdependent output components of a multiple output dependency (MOD) output decision. The example methods disclosed herein may be employed to solve MOD problems.
As used herein, the term “multiple output dependency” or “MOD” refers to an output decision, or a problem having an output decision, that includes multiple output components which are interdependent in that each component is dependent not only on an input but also on the other components. Some example MOD problems include, but are not limited to: 1) which combination of stocks to purchase to balance a mutual fund given current stock market conditions, 2) which combination of players to substitute into a lineup of a sports team given the current lineup of the opposing team, and 3) which combination of shirt, pants, belt, and shoes to wear given the current weather conditions. In each of these examples, each component of the output decision depends on both the input (current stock market conditions, an opposing team lineup, or current weather conditions) and the other components (the other stocks purchased, the other substituted player, or the other clothing selected). Other examples of MOD problems may relate to hostage negotiations, retail sales, online shopping carts, web content management systems, customer service, contract negotiations, or crisis management, or any other situation that requires an output decision with multiple interdependent output components.
Another example MOD problem is lead response management (LRM). LRM is the process of responding to leads in a manner that optimizes contact or qualification rates. Leads may come from a variety of sources including, but not limited to, a web form, a referral, and a list purchased from a lead vendor. When a lead comes into an organization, the output decision of how to respond to the lead may include multiple interdependent components such as, but not limited to, who should respond to the lead, what method should be employed to respond to the lead, what content should be included in the response message, and when should the response take place. Each of these components of the output decision depends on both the input (the lead information) and the other components. For example, the timing of the response may depend on the availability of the person selected to respond. Also, the content of the message may depend on the method of response (e.g. since the length of an email message is not limited like the length of a text message). Although the example methods disclosed herein are generally explained in the context of LRM, it is understood that the example methods disclosed herein may be employed to solve any MOD problem.
Example embodiments will be explained with reference to the accompanying drawings.
As disclosed in
In one example embodiment, the local area network 160 resides within a call center 140 that uses VoIP and other messaging services to contact users connected to the PSTN 110 and/or the internet 130. The various servers in the call center 140 function cooperatively to acquire leads, store lead information, analyze lead information to decide how best to respond to each lead, distribute leads to agents via agent terminals such as the local agent workstations 192 and the remote agent stations 121 for example, facilitate communication between agents and leads via the PSTN 110 or the internet 130 for example, track attempted and successful agent interaction with leads, and store updated lead information.
The web server 170 may provide one or more web forms 172 to users via browser displayable web pages. The web forms may be displayed to the users via a variety of communication and/or computing devices 112 including phones, smart phones, tablet computers, laptop computers, desktop computers, media players, and the like that are equipped with a browser. The web forms 172 may prompt the user for contact data such as name, title, industry, company information, address, phone number, fax number, email address, instant messaging address, referral information, availability information, and interest information. The web server 170 may receive the lead information associated with the user in response to the user submitting the web form and provide the lead information to contact server 200 and the lead data server 190, for example.
The contact server 200 and the lead data server 190 may receive the lead information and retrieve additional data associated with the associated user such as web analytics data, reverse lookup data, credit check data, web site data, web site rank information, do-not-call registry data, data from a customer relationship management (CRM) database, and background check information. The lead data server 190 may store the collected data in a lead profile (not shown) and associate the user with an LRM plan (not shown).
The contact server 200 may contact a lead in accordance with an associated LRM plan and deliver lead information to an agent to enable the agent to respond to the lead in a manner that optimizes contact or qualification rates. The particular purpose of such contact or qualification may include, for example, establishing a relationship with the lead, thanking the lead for their interest in a product, answering questions from the lead, informing the lead of a product or service offering, selling a product or service, surveying the lead on their needs and preferences, and providing support to the lead. The contact server 200 may deliver the information to the agent using a variety of delivery services such as email services, instant messaging services, short message services, enhanced messaging services, text messaging services, telephony-based text-to-speech services, and multimedia delivery services. The agent terminals 121 or 192 may present the lead information to the agent and enable the agent to respond to the lead by communicating with the lead.
The contact manager 210 establishes contact with users and agents and manages contact sessions where needed. The contact manager 210 may initiate contact via the dialing module 220 and/or the messaging module 230.
The HBS machine learning module 212 employs an HBS machine learning model to predict multiple interdependent output components of an MOD output decision, according to the example methods disclosed herein. In at least some example embodiments, the HBS machine learning module 212 utilizes the lead data server access module 208 to access and analyze lead information stored on the lead data server 190 of
The LRM plan selection module 214 presents and or selects one or more LRM plans for a particular lead and/or offering. Similarly, the agent selection module 216 selects an agent, class of agent, or agent skill set that is designated in each LRM plan.
The lead data server access module 218 enables the contact manager 210 to access lead information that is useful for contacting a lead. In one embodiment, the data storage access module 218 enables the contact manager 210 to access the lead data server 190.
The dialing module 220 establishes telephone calls including VOIP telephone calls and PSTN calls. In one embodiment, the dialing module 220 receives a unique call identifier, establishes a telephone call, and notifies the contact manager 210 that the call has been established. Various embodiments of the dialing module 220 incorporate auxiliary functions such as retrieving telephone numbers from a database, comparing telephone numbers against a restricted calling list, transferring a call, conferencing a call, monitoring a call, playing recorded messages, detecting answering machines, recording voice messages, and providing interactive voice response (IVR) capabilities. In some instances, the dialing module 220 directs the PBX module 240 to perform the auxiliary functions.
The messaging module 230 sends and receives messages to agents and leads. To send and receive messages, the messaging module 230 may leverage one or more delivery or messaging services such as email services, instant messaging services, short message services, text message services, and enhanced messaging services.
The PBX module 240 connects a private phone network to the PSTN 110. The contact manager 210 or dialing module 220 may direct the PBX module 240 to connect a line on the private phone network with a number on the PSTN 110 or internet 130. In some embodiments, the PBX module 240 provides some of the auxiliary functions invoked by the dialing module 220.
The termination hardware 250 routes calls from a local network to the PSTN 110. In one embodiment, the termination hardware 250 interfaces to conventional phone terminals. In some embodiments and instances, the termination hardware 250 provides some of the auxiliary functions invoked by the dialing module 220.
Having described a specific environment (an LRM system) and specific application (LRM) with respect to
Although the model 300 may be employed in any number of applications to produce MOD output decisions, the model 300 is employed in
For example, the model 300 may be employed to produce an LRM MOD output decision z=(z1, z2, z3, z4), where z1, z2, z3, and z4 are four components of the output decision z, based on an input x. In this example, z1=response agent title, z2=response method, z3=response message type, and z4=response timing. The input x may be an input feature vector that includes information about a particular lead.
It is understood that the components of response agent title, response method, response message type, and response timing are only example components of an LRM MOD output decision. Other example components may include, but are not limited to, agent or lead demographic profile, agent or lead histographic profile (i.e. a profile of events in the life of the agent or the lead which could include past interactions between the agent and the lead), lead contact title (i.e. the title of a particular contact person within a lead organization), agent or lead psychographic profile (i.e. a profile of the psychological characteristics of the agent or the lead), agent or lead social network profile (i.e. the proximity of the agent to the lead in an online social network such as LinkedIn® or FaceBook® or in an offline social network such as the Entrepreneurs Organization®, civic clubs, fraternities, or religions), agent or lead geographic profile (i.e. cities, states, or other geographic designations that define current and/or past locations of the agent or the lead), response frequency (i.e. how often an agent contacts a lead), and response persistence (i.e. how long an agent persists in contacting a lead).
It is understood that the input features of lead source, lead title, lead industry, lead state, lead created date, lead company size, lead status, number of previous dials, number of previous emails, previous action, and hours since last action are only example input features to an LRM MOD output decision. Other example input features may include, but are not limited to, response agent title, response method, response message type, response timing, agent or lead demographic profile, agent or lead histographic profile, agent or lead psychographic profile, agent or lead social network profile, agent or lead geographic profile, response frequency, and response persistence. Additionally, input features could include data on current events, such as current events related to politics, economics, natural phenomena, society, and culture. It is further understood that where a particular input feature is employed as an input to a particular LRM MOD output decision, the particular input feature will not be included among the output components of the particular LRM MOD output decision.
As disclosed in
Therefore, in the example application of
The model 300 of
The method 400 may begin at block 402, in which an order is selected for multiple interdependent output components of an MOD output decision. For example, the HBS machine learning module 212 may select an order for the multiple interdependent output components z1, z2, z3, and z4 of the MOD output decision z. The MOD output decision z has four components including response agent title, response method, response message type, and response timing. One possible order that could be selected is response agent title, response method, response message type, and response timing. Another possible order may be response method, response agent title, response message type, and response timing.
Various methods may be employed to determine the order of the output components. For example, one method to determine the order may include trying all possible orders on testing data and then selecting the one with the best overall performance. In the example embodiment disclosed in
In block 404, a classifier for each component in the selected order is sequentially trained to predict the component based on an input and based on any previous predicted component(s). For example, the HBS machine learning module 212 may sequentially train the MLP neural networks MLP1, MLP2, MLP3, and MLP4 to predict the components z1, z2, z3, and z4 in the selected order based on the input feature vector x of
In one example, assume that each component has three possible values as follows: z1ε{z11, z12, z13}={sales vice president, sales manager, sales representative}; z2δ{z21, z22, z23}={call, email, fax}; z3ε{z31, z32, z33}={MT1, MT2, MT3}; and z4ε{z41, z42, z43}={short, medium, long}. As disclosed in
In at least some example embodiments, the eighty-one (81) output values may be scaled in order to more easily handle multiplication of relatively small output values. For example, logarithmic output values for all eighty-one (81) possible output decisions may be calculated as follows: log(p(x; z1i)·p(x, z1i; z2j)·p(x, z1i, z2j; z3k)·p(x, z1i, Z2j, z3k; z4l)); where iε{1, 2, 3}; jε{1, 2, 3}; kε{1, 2, 3}; and lε{1, 2, 3}. It is understood that calculating logarithmic output values is just one example of scaling output values, and other scaling techniques may be employed. It is further understood that the scaling of the (81) output values may be omitted in at least some example embodiments.
It is understood that this is but one example of sequentially training a classifier for each component in the selected order to predict the component based on an input and based on any previous predicted component(s), and the block 404 is not limited to the particular application of this example or to the LRM MOD problem solved in this example.
Where multiple output decisions are simultaneously considered to be correct, the term “correct” may refer to multiple output decisions each having a substantially similar output value. For example, each of the output decisions 502 and 504 of
Having described example methods of employing an HBS machine learning model to predict multiple interdependent output components of an MOD output decision with respect to
Upon selection of the “more info” link 812 by the agent, by clicking on the more info link 812 with a mouse pointer for example, the agent may be presented with a pop-out display 814 as disclosed in
Therefore, the embodiments disclosed herein include methods of employing an HBS machine learning model to predict multiple interdependent output components of an MOD output decision. The example methods disclosed herein enable the prediction of each output component based on an input and based on any previous predicted output component(s). Therefore, the example methods disclosed herein may be employed to solve MOD problems such as LRM problems.
The embodiments described herein may include the use of a special purpose or general-purpose computer including various computer hardware or software modules, as discussed in greater detail below.
Embodiments described herein may be implemented using computer-readable media for carrying or having computer-executable instructions or data structures stored thereon. Such computer-readable media may be any available media that may be accessed by a general purpose or special purpose computer. By way of example, and not limitation, such computer-readable media may include non-transitory computer-readable storage media including RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other storage medium which may be used to carry or store desired program code in the form of computer-executable instructions or data structures and which may be accessed by a general purpose or special purpose computer. Combinations of the above may also be included within the scope of computer-readable media.
Computer-executable instructions comprise, for example, instructions and data which cause a general purpose computer, special purpose computer, or special purpose processing device to perform a certain function or group of functions. Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.
As used herein, the term “module” may refer to software objects or routines that execute on the computing system. The different modules described herein may be implemented as objects or processes that execute on the computing system (e.g., as separate threads). While the system and methods described herein are preferably implemented in software, implementations in hardware or a combination of software and hardware are also possible and contemplated.
All examples and conditional language recited herein are intended for pedagogical objects to aid the reader in understanding the example embodiments and the concepts contributed by the inventor to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions.
Number | Name | Date | Kind |
---|---|---|---|
7152051 | Commons et al. | Dec 2006 | B1 |
8285667 | Jaros et al. | Oct 2012 | B2 |
20030140023 | Ferguson et al. | Jul 2003 | A1 |
20050265607 | Chang | Dec 2005 | A1 |
20070005539 | Bergman et al. | Jan 2007 | A1 |
20080249844 | Abe et al. | Oct 2008 | A1 |
20090157571 | Smith et al. | Jun 2009 | A1 |
20090176580 | Herrmann et al. | Jul 2009 | A1 |
20100145678 | Csomai et al. | Jun 2010 | A1 |
20100280827 | Mukerjee et al. | Nov 2010 | A1 |
20110046970 | Fontenot | Feb 2011 | A1 |
20110106735 | Weston et al. | May 2011 | A1 |
20110106743 | Duchon | May 2011 | A1 |
20110119213 | Elisseeff et al. | May 2011 | A1 |
20110153419 | Hall | Jun 2011 | A1 |
20110213741 | Shama et al. | Sep 2011 | A1 |
20110270779 | Showalter | Nov 2011 | A1 |
20120041727 | Mishra et al. | Feb 2012 | A1 |
20120203720 | Baker | Aug 2012 | A1 |
Entry |
---|
Seok-Beom Roh et al., “A fussy ensemble of parallel polynomial neural networks with information granules formed by fuzzy clustering” in Knowledge Based Systems vol. 23 pp. 202-219 (2010). |
Notice of Allowance mailed Dec. 13, 2012 in related U.S. Appl. No. 13/590,028. |
Notice of Allowance mailed Nov. 9, 2012 in related U.S. Appl. No. 13/590,028. |
Leclercq et al. “Autonomous learning algorithm for fully connected recurrent networks”, ESANN, 2003, pp. 379-384. |
Bilmes et al. “Generalized rules for combination and joint training of classifiers”, FAA 2003, pp. 201-211. |
International Search Report and Written Opinion mailed Feb. 28, 2014 in related PCT Application No. PCT/US13/55856. |
International Search Report and Written Opinion dated Mar. 6, 2014 in related PCT Application No. PCT/US2013/055859. |
International Search Report and Written Opinion dated Apr. 14, 2014 in related PCT Application No. PCT/US2013/077260. |
Number | Date | Country | |
---|---|---|---|
20140052678 A1 | Feb 2014 | US |