This disclosure relates generally to evaluation tools, and more particularly to a system and method for artificial intelligence based automatic form filling in an evaluation system.
Large organizations often use call centers staffed by a number of call center agents to provide services to customers or other individuals calling the call center. A call center agent must respond to an incoming call courteously and efficiently to satisfy the calling customer's need and the goals of the organization implementing the call center. To promote effective handling of calls by agents, call centers typically include telecommunications equipment programmed to route incoming calls to call center agents having particular skills or expertise. While helping to ensure that calls are handled by agents with the proper skillsets, such mechanisms do not evaluate the interactions between the agents and customers.
An agent interacting with a customer represents the organization to that customer, and is responsible for a significant part of the customer experience. Thus, there is great importance in evaluating the agents' performance on a regular basis. Call centers may therefore employ computer-implemented evaluation tools to facilitate evaluating interactions between agents and customers. In a typical call center evaluation scenario, an evaluator listens to a randomly selected call of a specific agent and fills in an evaluation form via the user interface to attribute to the agent or to the call a quality score or other scores and indications. More particularly, when an evaluator selects a call to evaluate, the evaluator also selects an evaluation form to use. The evaluation tool presents an instance of the evaluation form to the evaluator (e.g., in a web browser-based interface). The evaluator listens to the transaction and answers the questions in the evaluation. When completed, the evaluator or supervisor usually reviews the evaluation with the call center agent.
However, some computer-implemented evaluation schemes have a number of deficiencies. First, they are a time-consuming manual process allowing the call center to evaluate only a small sample of interactions. Second, they are prone to human error, bias or carelessness, such as evaluators not understanding performance standards, failing to enforce them consistently, or failing to fully listen to parts of an interaction due to fatigue or other conditions. Third, they do not provide a mechanism to at least partially evaluate transactions in the absence of an evaluator.
One embodiment comprises a data processing system for populating selections in an evaluation operator interface. The data processing system comprises a data store that stores a plurality of transactions, where each of the plurality of transactions comprises a voice session recording of an inbound call recorded by a call center recording system and a transcript of the voice session. The data store further stores a first set of auto answer parameters used by an evaluation system to automatically preset answer controls in an evaluation operator interface; that is, to automatically preselect an answer to a question presented in an evaluation operator interface. According to one embodiment, the auto answer parameters are defined by one or more of a lexicon, an autoscore template, a question or an answer template. The data store may further include a plurality of completed evaluations, where each completed evaluation of the plurality of completed evaluations corresponds to a transaction of the plurality of transactions and includes an associated evaluation answer to a question and an auto answer to the question that was determined using the first set of auto answer parameters. By way of example, but not limitation, each completed evaluation in the plurality of evaluations may include an evaluation answer and an autoscore auto answer.
The data processing system is configured to automatically adjust the parameters used to automatically preset the answer controls in the evaluation operator interface to provide increased automated answering accuracy over time. According to one embodiment, the data processing system includes an artificial intelligence (AI) engine configured to automatically adjust the lexicon applied when auto answering a question. The AI engine determines a word or phrase common to transcripts of a first subset of transactions from the plurality of transactions and creates a revised set of auto answer parameters that includes the word or phrase. The determined word or phrase may be a word or phrase that is not in a lexicon of the first set of auto answer parameters. Further, the determined word or phrase may be selected such that the word or phrase does not appear in the transcripts of a second subset of transactions.
The AI engine can be further configured to auto answer the question for a set of test transactions from the plurality of transactions to generate a revised auto answer for each test transaction of the set of test transactions. Based on a determination that the revised set of auto answer parameters more accurately auto answer the question than the first set of auto answer parameters, the AI engine automatically reconfigures the evaluation system to use the revised set of auto answer parameters to preset the answer control in the evaluation operator interface.
According to one embodiment, the AI engine may, for each of the test transactions, compare the revised auto answer to the question determined for the test transaction to the evaluation answer to the question from the completed evaluation corresponding to the test transaction. Based on the comparing, the data processing system can determine a confidence for the revised set of auto answer parameters. The data processing system, may be configured to determine that the revised set of auto answer parameters are more accurate than the first set of auto answer parameters based on comparing the confidence for the revised set of auto answer parameters to a confidence for the first set of auto answer parameters. In another embodiment, the data processing system can be configured to determine that the revised set of auto answer parameters are more accurate than the first set of auto answer parameters based on the confidence for the revised set of auto answers meeting a confidence threshold.
According to one embodiment, the AI engine can determine the first subset of evaluations as the completed evaluations of the plurality of completed evaluations that have a first evaluation answer to the question, where the first subset of evaluations correspond to the first subset of transactions from the plurality of transactions. The AI engine can further determine a second subset of evaluations as the completed evaluations of the plurality of completed evaluations that have a second evaluation answer to the question, where the second subset of evaluations correspond to the second subset of transactions from the plurality of transactions. Determining the word or phrase common to transcripts of the first subset of transactions can comprise determining a word or phrase that also is not in the transcripts of the second subset of transactions. Put another way, the system can be configured such that the determined word or phrase appears in transcripts of transactions for which the corresponding evaluations have a first evaluation answer to a question but not transcripts of transactions for which the corresponding evaluations have the second evaluation answer to the question.
Determining the word or phrase common to transcripts of the first subset of transactions can include comparing word vectors that represent the transcripts of the first subset of transactions. Determining that the word or phrase is not in the transcripts of the second subset of transactions can include comparing the word or phrase to word vectors that represent the second subset of transactions.
The data processing system may further comprise a search engine comprising an index of the transcripts of the plurality of transactions. The AI engine can be configured to query the search index for term frequencies of terms in the transcripts of the plurality of transactions determine the word or phrase common to the transcripts of the first subset of transactions based on the word frequencies. The AI engine may also use word frequencies associated with the second subset of transactions to determine that a word or phrase that is common to the first subset of transcripts does not appear in the second subset of transcripts.
Embodiments described herein provide systems and methods that automatically set answer controls in an evaluator operator interface and automatically tune the parameters for setting the controls so that auto answering becomes increasingly accurate. Thus, embodiments described herein increase the efficiency of using an evaluation operator interface as the need to manually change answers decreases over time.
Furthermore, embodiments herein provide an advantage by providing systems and methods that can automatically and accurately evaluate a large number of transactions based on evaluations of a relatively small number of transactions. Further, the accuracy of automated evaluation can increase over time.
These, and other, aspects of the invention will be better appreciated and understood when considered in conjunction with the following description and the accompanying drawings. The following description, while indicating various embodiments of the invention and numerous specific details thereof, is given by way of illustration and not of limitation. Many substitutions, modifications, additions or rearrangements may be made within the scope of the invention, and the invention includes all such substitutions, modifications, additions or rearrangements.
The drawings accompanying and forming part of this specification are included to depict certain aspects of the invention. A clearer impression of the invention, and of the components and operation of systems provided with the invention, will become more readily apparent by referring to the exemplary, and therefore non-limiting, embodiments illustrated in the drawings, wherein identical reference numerals designate the same components. Note that the features illustrated in the drawings are not necessarily drawn to scale.
The disclosure and various features and advantageous details thereof are explained more fully with reference to the exemplary, and therefore non-limiting, embodiments illustrated in the accompanying drawings and detailed in the following description. It should be understood, however, that the detailed description and the specific examples, while indicating the preferred embodiments, are given by way of illustration only and not by way of limitation. Descriptions of known programming techniques, computer software, hardware, operating platforms and protocols may be omitted so as not to unnecessarily obscure the disclosure in detail. Various substitutions, modifications, additions and/or rearrangements within the spirit and/or scope of the underlying inventive concept will become apparent to those skilled in the art from this disclosure.
Software implementing embodiments disclosed herein may be implemented in suitable computer-executable instructions that may reside on a computer-readable storage medium. Within this disclosure, the term “computer-readable storage medium” encompasses all types of data storage medium that can be read by a processor. Examples of computer-readable storage media can include, but are not limited to, volatile and non-volatile computer memories and storage devices such as random access memories, read-only memories, hard drives, data cartridges, direct access storage device arrays, magnetic tapes, floppy diskettes, flash memory drives, optical data storage devices, compact-disc read-only memories, hosted or cloud-based storage, and other appropriate computer memories and data storage devices.
Call center system 102 comprises a platform 106 coupled to a voice network 108 and a data network 110. Voice network 108 comprises a call routing system 150 to connect incoming calls to terminals in call center system 102 and outgoing calls to telephony network 104. Call routing system 150 may comprise any combination of hardware and/or software operable to route calls. According to one embodiment, call routing system 150 comprises an automatic call distributor (ACD) with interactive voice response (IVR) menus. In addition or in the alternative, call routing system 150 may include a private branch exchange switch or other call routing hardware or software.
The ACD or other call routing component may perform one or more various functions, such as recognizing and answering incoming calls, determining how to handle a particular call, identifying an appropriate agent and queuing the call, and/or routing the call to an agent when the agent is available. The call routing system 150 may use information about the call, caller or call center or other information gathered by system 102 to determine how to route a call. For example, the call routing system may use the caller's telephone number, automatic number identification (ANI), dialed number identification service (DNIS) information, the caller's responses to voice menus, the time of day, or other information to route a call. The call routing system 150 may communicate with data network 110, a private branch exchange or other network either directly or indirectly, to facilitate handling of incoming calls. The call routing system 150 may also be operable to support computer-telephony integration (CTI).
Call routing system 150 may be coupled to recording server 114 and a survey server 120 of platform 106 by communications lines 152. Lines 152 support a variety of voice channels that allow platform 106 to monitor and record voice sessions conducted over a voice network 108. Call routing system 150 may also be coupled to a voice instrument 162 at agent workstation 160 and a voice instrument 172 at supervisor workstation 170 via a private branch exchange link, VoIP link or other call link. Platform 106 may receive information over lines 152 regarding the operation of call routing system 150 and the handling of calls received by system 102. This information may include call set-up information, traffic statistics, data on individual calls and call types, ANI information, DNIS information, CTI information, or other information that may be used by platform 106.
Voice network 108 can further include adjunct services system 154 coupled to call routing system 150, call recording server 114 and survey server 120 by data links 156, 158. Adjunct services system 154 may comprise a CTI application or platform, contact control server, or other adjunct device accessible by platform 106 to perform call center functions. Adjunct services system 154 may include a link to other components of the call center's management information system (MIS) host for obtaining agent and supervisor names, identification numbers, expected agent schedules, customer information, or any other information relating to the operation of the call center.
Data network 110 may comprise the Internet or other wide area network (WAN), an enterprise intranet or other a local area network (LAN), or other suitable type of link capable of communicating data between platform 106 and computers 164 at agent workstations 160, computers 174 at supervisor workstations 170 and client computers 180 of other types of users. Data network 110 may also facilitate communications between components of platform 106. Although
Platform 106 includes a recording system 112 to record voice sessions and data sessions. In the embodiment illustrated, recording system 112 includes a recording server 114 and an ingestion server 116. Recording server 114 comprises a combination of hardware and/or software (e.g., recording server component 115) operable to implement recording services to acquire voice interactions on VoIP, TDM or other networks, record the voice sessions. Recording server 114 may also be operable to record data sessions for calls. A data session may comprise keyboard entries, screen display and/or draw commands, video processes, web/HTTP activity, e-mail activity, fax activity, applications or any other suitable information or process associated with a client computer. To facilitate recording of data sessions, agent computers 164 or supervisor computers 174 may include software to capture screen interactions related to calls and send the screen interactions to recording server 114. Recording server 114 stores session data for voice and data sessions in transaction data store 118.
Ingestion server 116 comprises a combination of hardware and software (e.g., ingestion server component 117) operable to process voice session recordings recorded by recording server 114 or live calls and perform speech-to-text transcription to convert live or recorded calls to text. Ingestion server 116 stores the transcription of a voice session in association with the voice session in data store 118.
Platform 106 further comprises survey server 120. Survey server 120 comprises a combination of hardware and software (e.g., survey component 121) operable to provide post-call surveys to callers calling into call center 102. For example, survey server 120 can be configured to provide automated interactive voice response (IVR) surveys. Call routing system 150 can route calls directly to survey server 120, transfer calls from agents to survey server 120 or transfer calls from survey server 120 to agents. Survey data for completed surveys can be stored in data store 118.
Data store 118 may also store completed evaluations. Platform 106 may include an evaluation feature that allows an evaluator to evaluate an agent's performance or the agent to evaluate to evaluate his or her own performance. An evaluation may be performed based on a review of a recording. Thus, an evaluation score may be linked to a recording in data store 118.
In operation, call routing system 150 initiates a session at call center system 102 in response to receiving a call from telephony network 104. Call routing system 150 implements rules to route calls to agent voice instruments 162, supervisor voice instruments 172, recording server 114 or survey server 120. Depending on the satisfaction of a variety of criteria (e.g., scheduling criteria or other rules), routing system 150 may establish a connection using lines 152 to route a call to a voice instrument 162 of an agent workstation 160 and recording server 114. Routing system 150 may also establish a connection for the call to the voice instrument 172 of a supervisor.
Recording server 114 stores data received for a call from adjunct system 154 and routing system 150, such as call set-up information, traffic statistics, call type, ANI information, DNIS information, CTI information, agent information, MIS data. In some cases, recording server 114 may also store a recording of the voice session for a call. Additionally, recording sever may record information received from agent computer 164 or supervisor computer 174 with respect to the call such as screen shots of the screen interactions at the agent computer 164 and field data entered by the agent. For example, platform 106 may allow an agent to tag a call with predefined classifications or enter ad hoc classifications and recording server may store the classifications entered by the agent for a call.
Recording server 114 stores data and voice sessions in data store 118, which may comprise one or more databases, file systems or other data stores, including distributed data stores. Recording server 114 stores a voice session recording as a transaction in data store 118. A transaction may comprise transaction metadata and associated session data. For example, when recording server 114 records a voice session, recording server 114 can associate the recording with a unique transaction id and store a transaction having the transaction id in data store 118. A data session may also be linked to the transaction id. Thus, the transaction may further include a recording of a data session associated with the call, such as a series of screen shots captured from the agent computer 164 during a voice session. The transaction may also include a transcript of the voice session recording created by ingestion server 116. In some embodiments, the voice session may be recorded as separate recordings of the agent and caller and thus, a transaction may include an agent recording, a customer recording, a transcript of the recording of the agent (agent transcript) and a transcript of the recording of the customer (inbound caller transcript). According to one embodiment, the voice session recording, transcript of the voice session or data session recording for a call may be stored in a file system and the transaction metadata stored in a database with pointers to the associated files for the transaction.
Transaction metadata can include a wide variety of metadata stored by recording server 114 or other components. Transaction metadata may include, for example metadata provided to recording server 114 by routing system 150 or adjunct system 154, such as call set-up information, traffic statistics, call type, ANI information, DNIS information, CTI information, agent information, MIS data or other data. For example, the transaction metadata for a call may include call direction, line on which the call was recorded, ANI digits associated with the call, DNSI digits associated with the call, extension of the agent who handled the call, team that handled the call (e.g., product support, finance), whether the call had linked calls, name of agent who handled the call, agent team or other data. The transaction metadata may further include data received from agent computers 164, supervisor computers 174, or other components, such as classifications (pre-defined or ad hoc tag names) assigned to the call by a member, classification descriptions (descriptions of predefined or ad hoc tags assigned by a call center member to a call) other transaction metadata. The transaction metadata may further include call statistics collected by recording server 114, such as the duration of a voice session recording, time voice session was recorded and other call statistics. Furthermore, other components may add to the transaction metadata as transactions are processed. For example, transaction metadata may include scores assigned by intelligent data processing system 130. Transaction metadata may be collected when a call is recorded, as part of an evaluation process, during a survey campaign or at another time. As one of skill in the art will appreciate, the foregoing transaction metadata is provided by way of example and a call center system may store a large variety of transaction metadata.
Intelligent data processing system 130 provides a variety of services such as support for call recording, performance management, real-time agent support, and multichannel interaction analysis. Intelligent data processing system 130 can comprise one or more computer systems with central processing units executing instructions embodied on one or more computer readable media where the instructions are configured to perform at least some of the functionality associated with embodiments of the present invention. These applications may include a data application 131 comprising one or more applications (instructions embodied on a computer readable media) configured to implement one or more interfaces 132 utilized by the data processing system 130 to gather data from or provide data to client computing devices, data stores (e.g., databases or other data stores) or other components. Interfaces 132 may include interfaces to connect to various sources of unstructured information in an enterprise in any format, including audio, video, and text. It will be understood that the particular interface 132 utilized in a given context may depend on the functionality being implemented by data processing system 130, the type of network utilized to communicate with any particular system, the type of data to be obtained or presented, the time interval at which data is obtained from the entities, the types of systems utilized. Thus, these interfaces may include, for example web pages, web services, a data entry or database application to which data can be entered or otherwise accessed by an operator, APIs, libraries or other type of interface which it is desired to utilize in a particular context.
Data application 131 can comprise a set of processing modules to process data obtained by intelligent data processing system 130 (obtained data) or processed data to generate further processed data. Different combinations of hardware, software, and/or firmware may be provided to enable interconnection between different modules of the system to provide for the obtaining of input information, processing of information and generating outputs. In the embodiment of
Intelligent data processing system 130 can include a data store 140 that stores various templates, files, tables and any other suitable information to support the services provided by data processing system 130. Data store 140 may include one or more databases, file systems or other data stores, including distributed data stores.
Evaluation system 200 may automatically answer questions in evaluations provided to evaluators. In particular, evaluation system 200 may auto answer questions based on lexicons 249. To this end, evaluation system 200 may be associated with a set of auto answer parameters. The auto answer parameters may include a lexicon of words or phrases and parameters that are used to control how evaluation system 200 automatically determines an answer to the question based on the lexicon. In one embodiment, auto answer parameters associated with a question 248 are defined in a lexicon 244, an auto-score template 246, the question 248 or an auto-answer template 249. In any event, as discussed further below, evaluation system 200 may include an AI engine 275 that retunes the auto answer parameters used to auto answer questions.
Evaluation system 200 comprises a server tier 202, a client tier 203, a data store 206 and a data store 208. The client tier 203 and server tier 202 can connected by a network 111. The network 111 may comprise the Internet or other WAN, an enterprise intranet or other LAN, or other suitable type of link capable of communicating data between the client and server platforms. According to one embodiment, server tier 202 and data store 206 may be implemented by intelligent data processing system 130 and data store 140, client tier 203 may be implemented on one or more client computers, such as computers 180, and data store 208 may be an example of data store 118 that stores a set of transactions, survey results and evaluation results. Each transaction may comprise transaction metadata, a voice session recording of an inbound call recorded by a call center recording system and a transcript of the voice session. The transaction may further include a data session recording. The transaction metadata for each transaction can comprise an identifier for that transaction and other metadata. Search component 218 provides a search engine that indexes data in data store 206 or data store 208 to create a search index 219, such as an inverted index storing term vectors. More particularly in one embodiment, search component 218 can be a search tool configured to index transcripts of transactions in data store 208.
Server tier 202 comprises a combination of hardware and software to implement platform services components comprising search component 218, sever engine 220, server-side lexicon component 224, server-side autoscore template component 228, server-side autoscore processor (“auto scorer”) 232, server-side question component 236, server-side answer template component 237, server-side evaluation component 238, evaluation manager 240, server-side analytics component 242 and an artificial intelligence (AI) engine 275. According to one embodiment, lexicons, autoscore templates, questions, answer templates, and evaluation forms may be implemented as objects (e.g., lexicon objects, template objects, question objects, answer template objects, evaluation form objects) that contain data and implement stored procedures. Thus, lexicon component 224 may comprise lexicon objects, server-side autoscore template component 228 may comprise autoscore template objects, server-side question component 236 may comprise question objects, answer template component 237 may comprise answer template objects and server-side evaluation component 238 may comprise evaluation form objects.
Data store 206 may comprise one or more databases, file systems or other data stores, including distributed data stores. Data store 206 may include user data 207 regarding users of a call center platform, such as user names, roles, teams, permissions and other data about users (e.g., agents, supervisors, designers). Data store 206 may further include data to support services provided by server tier 202, such as lexicons 244, autoscore templates 246, questions 248 and evaluation forms 250. According to one embodiment, lexicons 244 comprise attributes of lexicon objects, autoscore templates 246 comprise attributes of autoscore template objects, questions 248 comprise attributes of question objects, answer templates 249 comprise attributes of answer template objects, and evaluation forms 250 comprise attributes of evaluation objects.
Client tier 203 comprises a combination of hardware and software to implement designer operator interfaces 210 for configuring lexicons, autoscore templates, questions, answer templates, evaluation forms and autoscore tuning parameters and evaluator operator interfaces 212 for evaluating transactions. Designer operator interfaces 210 include controls 214 that allow designers to define lexicons, autoscore templates, questions, answer templates or evaluation forms and configure autoscore tuning. Evaluation operator interfaces 212 comprise controls 216 that allow users to evaluate recordings of interactions. Designer operator interfaces 210 and evaluation operator interfaces 212 can comprise one or more web pages that include scripts to provide controls 214 and controls 216. To this end server tier 202 can comprise a sever engine 220 configured with server pages 222 that include server-side scripting and components. The server-side pages 222 are executable to deliver application pages to client tier 203 and process data received from client tier 203. The server-side pages 222 may interface with server-side lexicon component 224, autoscore template component 228, question component 236, answer template component 237, evaluation component 238, evaluation manger 240 and analytics component 242.
Server engine 220 and lexicon component 224 cooperate to provide a designer operator interface 210 that allows a user to create new lexicons 226 or perform operations on existing lexicons 226.
For a lexicon, the designer can specify lexicon parameters, such as a name, description and a set of lexicon entries. Each lexicon entry includes words or phrases 308 that evaluation system 200 applies to a recording of an agent interaction to determine if the agent interaction contains the word or phrases. A lexicon entry may include search operators, such as “welcome DNEAR company1” to indicate that the evaluation system should search for any combination of “welcome” and “company1” within a pre-defined word proximity to each other. Thus, a lexicon may include a word or phrase or a search expression. Each entry may include a lexicon entry weight 310, such as a weight of 0-1.
Returning briefly to
Lexicons may be used by autoscore templates to score transactions. Server engine 220 and autoscore template component 228 cooperate to provide a designer operator interface 210 that allows a user to create new autoscore templates 246 or perform operations on existing autoscore templates 246 (e.g., edit or delete an existing autoscore template). If a user selects to create a new autoscore template, autoscore template component 228 can assign the new autoscore template a unique identifier to uniquely identify the template in data store 206. Each autoscore template 246 can comprise a variety of autoscore template data.
An autoscore template can include a lexicon and scoring parameters. To this end, operator interface page 400 includes controls that allow a user to associate lexicons with the autoscore template, specify gross scoring parameters and lexicon specific scoring parameters. Operator interface page 400 includes gross scoring controls that allow the user to specify scoring parameters that are not lexicon specific. In
Highest possible: The score for the associated lexicon with the highest score is added to the base score. Results for other lexicons are ignored.
Lowest possible. The score for the associated lexicon with the lowest score is added to the base. Results for other lexicons are ignored.
Cumulative scoring. The scores for all associated lexicons are added to the base score.
The target score control 412 can allow the user to select which method should be applied. It can be noted that, in this embodiment, when only one associated lexicon is used in the template, the target scoring setting may have no effect on the final score.
The designer operator interface page 400 further includes controls 414 and 418 to allow the user to associate one or more lexicons of lexicons 244 with the autoscore template. For example, by clicking on “Add Lexicon” or “Add Auto-Fail Lexicon,” the user can be presented with an operation interface 210 that provides controls to allow the user to select lexicons from the library of lexicons 244 to add to the autoscore template.
Returning to
When the template is applied by auto scorer 232, point values for the template begin with the specified base score 410 and then points are added or deducted based on behavior that matches each specified lexicon and the lexicon weight value specified for the lexicon in the autoscore template. The autoscore template can be configured so that different point values are added to or subtracted from the base score by selecting a positive or negative lexicon weight. Multiplier control 424 allows the user to specify how points for the specific lexicon are applied when the auto-template is used to evaluate a transaction. If the multiplier is enabled the designated number of points defined by the lexicon weight value are added to or subtracted from the base score 410 each time behavior, defined by the lexicon, is exhibited by the specified speaker. If the multiplier is not enabled, the number of points defined by the lexicon weight value is added to or subtracted from the base score 410 only the first time the speaker is specified by control 419 for the lexicon matches the lexicon.
In the illustrated example, a transaction to which the autoscore template is applied will be awarded fifty points if either the agent or incoming caller transcript matches any of the entries in the “Generic Lexicon” regardless of how many entries in Generic Lexicon the transaction transcripts match. On the other hand, the transaction is awarded 10 points for every entry in “Standard Company Greeting” that the agent transcript matches. As there are seven entries in the “Standard Company Greeting” lexicon according to the example of
Further, in some embodiments, the points awarded for matching an entry in a lexicon may be further weighted by the entry weight for that entry (e.g., as specified by weights 310). In any event, the evaluation system may limit the final score that a transaction can be awarded to a particular range, such as 0-100.
Using control 418, the user may designate an auto-fail lexicon for the template. If, in a transaction to which the autoscore template is applied, the transcript for the channel specified for the auto-fail template uses words that match the specified auto-fail lexicon, the final score for the transaction for the template can be zero, regardless of other lexicon scores awarded by the template.
Target control 430 allows the user to specify the transactions to which the template will be applied. In this case, when the user clicks on the text of target control 430, the evaluation system 200 presents an operator interface page that displays a search options interface, one example of which is illustrated in
Turning briefly to
Returning to
Returning briefly to
In general, evaluation instances (evaluations) are created from evaluation forms that define the questions and other content in the evaluations. A page of the operator interface may allow a user to select to create new questions, edit an existing question 248 or delete an existing question 248. If the user selects to create or edit the question the user may be presented with an operator interface page that allows the user to question parameters for the question.
Server engine 220, question component 236 and answer template component 237 cooperate to provide an operator interface 210 that allows a user to define questions 248 and answer templates 249.
Turning to
Operator interface page 750 allows the user to provide answer parameters for the question (question answer parameters) specified in operator interface page 700. Operator interface page 750 includes an answer type control 752 that allows the user to specify what type of control will appear in the evaluation presented to an evaluator. Answer controls are included in the evaluation page based on the answer type selected. Examples of answer controls include, but are not limited to radio buttons, drop down list, edit box and multi-select.
A question may include an answer template. Answer template control 754, for example, allows the user to associate an existing answer template or a new answer template with the question. The user can add, edit or delete a selected answer template (e.g., an answer template selected from answer templates 249). The answer template control 754 may limit the answer templates from which the user may select based on the type selected in control 752.
Operator interface page 750 further includes an autoscore range portion 755 from the selected answer template. Autoscore ranges are discussed further below.
Control 756 allows the user to select whether autoscoring will apply to the question. Control 758 further allows the user to select the autoscore template that the question applies. The user can, for example, select an autoscore template from autoscore templates 246. If the user selects to enable autoscore, then the evaluation system can autoscore transactions for the question based on the specified autoscore template and, for an evaluation that incorporates the question, evaluation system 200 can automatically populate the evaluation with an autoscore auto answer. If the user selects not to use autoscoring, then the acceptable answers and question scores of a selected answer template will apply, but evaluation system 200 will not automatically assign question scores to transactions for the question and will not generate an autoscore auto answer for evaluations that incorporate the question.
Further the user can set a target answer using control 776. The target answer is the preferred answer to a question and can be used for further evaluation and analysis purposes. The target answer may be selected from the acceptable answers for the question. For example, if a question incorporates the auto-ranges of
As discussed above, a question may include an answer template by, for example, referencing the answer template.
Operator interface page 800 further includes controls to allow a user to add autoscore ranges. If the user selects to add an autoscore range, the user can be presented with an operator interface page that allows the user to input an autoscore range.
Referring briefly to
A question 248, associated autoscore template 246, associated answer template 249 and lexicon 244 define auto answer parameters that control how evaluation system automatically answers an instance of question 248 based on lexicon 244. By way of example, but not limitation, the auto answer parameters include words or phrases, scoring parameters (e.g., lexicon entry weights, gross scoring and lexicon specific scoring parameters), answer parameters for a question, autoscore ranges or other parameters.
Instances of questions 248 may be incorporated in evaluations created from evaluation forms 250. Server engine 220 and evaluation component 238 cooperate to provide an operator interface 210 that allows a user to define evaluation forms 250.
Responsive to the user selecting to add questions to the evaluation form (e.g., responsive to user interaction with menu item 1206), the user can be presented with a question library interface page 1250 (
Returning briefly to
Server tier 202 further comprises auto scorer 232 which scores transactions according to autoscore templates 246. According to one embodiment, auto scorer 232 may run as a background process that accesses transactions in data store 208 and autoscores the transactions. The autoscores generated by auto scorer 232 may be stored in transaction metadata in data store 208 or elsewhere. In any event, the autoscores can be stored in a manner that links an autoscore generated for transaction to the autoscore template that was used to generate the autoscore.
Server engine and evaluation manager 240 cooperate to provide evaluations to evaluators based on evaluation forms 250. The evaluations may be displayed, for example, in operator interface 212. Evaluation manager 240 can user pre-generated autoscores (that is, autoscores determined before the evaluation was requested) or autoscores generated in real time when the evaluation is requested to prepopulate answers in the evaluations. One advantage of using pre-generated autoscores is that transactions can be autoscored in batch by auto scorer 232.
More particularly, based on a request for an evaluation to evaluate a transaction, the evaluation manager 240 accesses the evaluation form 250, a question 248 included in the evaluation form and an answer template 249 included in the question 248 and determines an autoscore template 246 associated with the question 248. Evaluation manager 240 further accesses the transaction or other record to determine the autoscore assigned to the transaction based on the autoscore template 246. Using the autoscore that was assigned to the transaction based on the autoscore template and the correspondences of autoscores to acceptable answers in the answer template 249, the evaluation system can preselect an acceptable answer to the question. Evaluation manager sets an answer control in the evaluation to the preselected answer and provides the evaluation to the evaluator with the preselected answer.
The system identifies active autoscore templates from a set of autoscore templates (e.g., autoscore templates 246) (step 1302). For example, the evaluation system may query a data store for templates having an execution start data that is less than or equal to the current date and an execution end date that is greater than equal to the current date. The evaluation system can select an active template as the “current autoscore template”, load the current autoscore template, including any lexicons included in the autoscore template, and execute the current autoscore template (step 1304). For the current autoscore template, evaluation system formulates search queries based on the search criteria in the current autoscore template and searches a data store (e.g., data store 208) for candidate transactions that meet the search criteria of the active autoscore template (step 1306). The evaluation system may include, as an implicit search criteria for the autoscore template, that the candidate transactions are transactions that have not previously been autoscored based on the current autoscore template. The evaluation system can search the transactions to determine candidate transaction based on transaction metadata that meets the search criteria. In addition or in the alternative, the evaluation system searches the transcripts of the transactions to determine candidate transactions that meet the search criteria of the current autoscore template. If there are no candidate transactions that meet the search criteria of the current autoscore template, the evaluation system can move to the next active autoscore template. If there are transactions that meet the search criteria for the current autoscore template, processing can proceed to step 1308 and the evaluation system selects a candidate transaction as the current transaction.
The evaluation system applies the current autoscore template to the current transaction to determine an autoscore associated with the transaction for the autoscore template (step 1310) and stores the autoscore for the autoscore template in association with the current transaction (step 1312). For example, the identity of the autoscore template and the score generated according to the autoscore template for the transaction may be stored as part of the transaction's metadata in a data store 118, 208. In another embodiment, the autoscore generated for the transaction is stored elsewhere in a manner such that the score is linked to both the transaction and the autoscore template that was used to generate the transaction. One embodiment of scoring a transaction according to an autoscore template is described in conjunction with
The current autoscore template can be applied to each candidate transaction. Furthermore, each active autoscore template may be executed. The steps of
If a transaction transcript matches the lexicon entry and the current lexicon is designated as an auto fail lexicon for the autoscore template, the evaluation system can output an autoscore of 0 for the transaction for the current autoscore template (step 1410) and move to the next candidate transaction. If the lexicon is not designated as an auto fail lexicon for the autoscore template, the evaluation system can add the lexicon weight value (e.g., as specified via control 422) to the current lexicon score to update the current lexicon score (step 1412). The lexicon weight value may be reduced for the entry if the entry has an entry weight that is less than 1.
If the multiplier is not enabled (e.g., as specified via control 424) (step 1414), the evaluation system stores the lexicon score for the current lexicon, which will equal the lexicon weight value for the current lexicon at this point (step 1416), and moves to the next lexicon in the current autoscore template.
If the multiplier is enabled, the evaluation system can move to the next entry in the lexicon; that is, return to step 1406 and select the next lexicon entry from the current lexicon as the current lexicon entry. Thus, the lexicon score for the current lexicon can increase for each lexicon entry that the transaction transcript matches. When all the lexicon entries in the lexicon have been processed, the evaluation system can store the lexicon score for the current lexicon (step 1416). The evaluation system can apply each lexicon incorporated in the autoscore template and determine a lexicon score for each lexicon.
At step 1418, the evaluation system determines an autoscore for the current transaction and current autoscore template based on the lexicon scores for the lexicons in the current autoscore template, a base score specified in the autoscore template (e.g., as specified via control 410) and a target score algorithm selected for the autoscore template. For example, the evaluation system can add the highest lexicon score of the lexicons associated with the autoscore template to the base score, add the lowest lexicon score of the lexicons associated with the autoscore template to the base score, or add the lexicon scores for all the lexicons associated with the autoscore template to the base score. According to one embodiment, the evaluation system may limit the minimum and maximum for the autoscore for the current transaction to a range, such as e.g., 0-100.
In flow 1400 with the multiplier enabled, a transaction may be scored once for each lexicon entry in a lexicon that the appropriate transcript of the transaction matches. In another embodiment, the transaction may be scored for every hit over every lexicon entry in the transaction transcript to which the lexicon applies. Using the examples of
The steps of
According to one embodiment, evaluation system receives a request from an evaluator for an evaluation to evaluate a transaction. As discussed above, the transaction may be assigned an automated score according to an automated scoring template based on a transcript of the transaction having matched a lexicon associate with the automated scoring template (step 1502). Responsive to the request, the evaluation system creates the requested evaluation from an evaluation form.
The evaluation system accesses the appropriate evaluation form (step 1504) and selects a question from the evaluation form (step 1506). The evaluation system further determines the acceptable answers to the question (step 1508). For example, the evaluation system may access an answer template included in the selected question to determine the acceptable answers for the selected question.
The evaluation system if autoscore is enabled for the question (step 1510) (e.g., as was specified for the question using control 756). If autoscore is not enabled, the evaluation system may generate the page code for the question where the page code includes the question text, scoring tip and answer controls (e.g., drop down list, edit box or multi-select controls) (step 1511). In some cases, an answer may be prepopulated using predefined values that are not based on the autoscores.
If autoscore is enabled for the question, the evaluation system determines the autoscore ranges associated with the acceptable answers. For example, the evaluation system may access an answer template referenced by the question, where the answer template holds the associations between autoscore ranges and acceptable answers for the question (step 1512). Further, the evaluation system determines the autoscore template associated with the question (for example, the autoscore template specified via control 750) (step 1514).
Based on the transaction identification of the transaction to be evaluated and the autoscore template associated with the question, the evaluation system determines the autoscore assigned to the transaction based on the autoscore template associated with the question (step 1516). For example, an autoscore assigned to a transaction and identity of autoscore template that generated the autoscore may be stored in the transaction metadata for the transaction. Thus, the evaluation system can determine the autoscore assigned to the transaction based on the autoscore template associated with the question from the metadata of the transaction to be evaluated. In another embodiment, a data store that holds autoscore records that specify autoscores assigned to transactions and the identities of the autoscore templates that generated the autoscores. The evaluation system can determine the autoscore assigned to the transaction based on the autoscore template by searching the autoscore records.
The evaluation system determines if the autoscore assigned to the transaction based on the autoscore template associated with the question is associated with an acceptable answer to the question. For example, the evaluation system compares the autoscore with the autoscore ranges corresponding to the acceptable answers for the question (step 1518). If the autoscore is not in an autoscore range for an acceptable answer to the question, the evaluation system may generate the page code for the question where the page code includes the question text and answer controls (e.g., drop down list, edit box or multi-select controls) to allow the evaluator to submit an answer (step 1519). In some cases, an answer may be prepopulated using predefined values that are not based on the autoscores (e.g., based on inputs via controls 772, 774).
If the autoscore assigned to the transaction based on the autoscore template associated with the question is associated with an acceptable answer to the question, the evaluation system selects the acceptable answer to which the autoscore corresponds. For example, if the autoscore is in an autoscore range corresponding to an acceptable answer for the question, the evaluation system selects that acceptable answer as the autoscore auto answer for the question (step 1520). The evaluation system generates the page code for the question where the page code includes the question text and answer controls (e.g., drop down list, edit box or multi-select controls) to allow the evaluator to submit an answer (step 1522). In this case, the evaluation system presets the answer controls in the page code for the question to the preselected autoscore auto answer (the answer selected in step 1520) (e.g., sets a radio button to “checked”, sets a selected list option value for a dropdown list as selected or otherwise sets the answer control to indicate the preselected answer).
Using the example of
The evaluation system assembles the evaluation page and serves the evaluation to the evaluator (e.g., for display in operator interface 212) (step 1526). The evaluation answers submitted by the evaluator can be received by the evaluation system and recorded as a completed evaluation (e.g., in data store 118, 208) (step 1528). The evaluation answer to a question that was auto answered based on an autoscore may be the autoscore auto answer or other answer selected by the evaluator. The completed evaluation can include for each question, the autoscore auto answer determined by the evaluation system (that is the answer to the question preselected based on the autoscore), if any, and the evaluation answer. If the evaluation answer is the same as the autoscore auto answer, the evaluation answer may simply be stored as a flag indicating that the autoscore auto answer is the evaluation answer—that is, that the evaluator did not select another answer to the question.
The steps of
Thus, as can be seen from the examples of
An evaluation system (e.g., evaluation system 200) can periodically determine the accuracy of an auto answer parameters and retune the auto answer parameters to increase accuracy. For example, the evaluation system may adjust an autoscore template (e.g., add a new lexicon, update an existing lexicon) or adjust other auto answer parameters.
According to one embodiment, the evaluation system determines if an evaluator changed the answer to a question from the prepopulated autoscore auto answer determined based on the autoscore. The evaluator submitted answers to a question provided by evaluators over a number of transactions can be compared to the auto answers provided to the evaluators for the question to determine a confidence score for the auto answer parameters that were used to auto answer questions for the transactions. If the evaluators changed the answers to the question frequently from auto answers, this can indicate that auto answer parameters require retuning. For example, the evaluator submitted answers to a question provided by evaluators over a number of transactions can be compared to the autoscore auto answers provided to the evaluators for the question to determine a confidence score for the auto score template that was used to autoscore the transactions for the questions. If the evaluators changed the answers to the question frequently from the autoscore auto answers, this could indicate and issue with the accuracy of the autoscore template. The evaluation system can be configured to retune auto answer parameters when the confidence level for a set of auto answer parameters drops below a threshold.
Returning briefly to
With reference to
At step 1602, the evaluation system selects an autoscore template for analysis and identifies and accesses the completed evaluations of transactions scored using that autoscore template. For each of the evaluations identified in step 1602, the evaluation system compares the autoscore auto answer to a question associated with the autoscore template—that is, the preselected answer to the question preselected based on the autoscore assigned according to the autoscore template—to the evaluation answer to the question to determine if the evaluator changed the answer from the preselected answer (step 1604).
Based on the comparisons of step 1604, the evaluation system can determine a confidence score for the selected autoscore template (step 1606). According to one embodiment, for example, if the evaluation system determines that evaluators changed the answer to the question from the preselected answer in twenty five percent of the evaluations, the evaluation system can assign a confidence score of 75 to the selected autoscore template.
At step 1608, the evaluation system compares the confidence score for the autoscore template to a threshold. If the confidence score for the autoscore template meets the threshold, the evaluation system can use the autoscore template to score non-evaluated transactions (step 1610). If the confidence score for the autoscore template does not meet the confidence threshold, the evaluation system may implement low confidence processing for the autoscore template (step 1612).
Low confidence processing may involve a wide variety of processing. According to one embodiment for example, the evaluation flags the autoscore template so that auto scorer 232 stops using the autoscore template. As another example, the evaluation system generates an alert to a user so that the user can retune the autoscore template. Other processing may also be implemented.
The steps of
Returning to
Returning to
AI engine 275 automatically tunes auto answer parameters. For example, AI engine 275 may adjust an autoscore template, an answer template, a question or a lexicon to tune the auto answer parameters. To this end, AI engine 275 periodically reads the confidence scores associated with auto answer parameters and compares the confidence scores to a threshold. If the confidence score for an auto answer parameter falls below a threshold, AI engine identifies an auto answer parameter as candidates for tuning. For example, if the confidence score for an autoscore template falls below a threshold, AI engine identifies the auto score template as a candidate for tuning.
AI engine 275 applies machine learning techniques to the results of evaluations and transcripts to automatically tune auto answer parameters. For example, AI engine 275 may adjust the lexicons incorporated in a selected autoscore template or parameters of an autoscore template, answer template or question to create revised autoscore templates, answer template or questions. AI engine 275 then applies the revised auto answer parameters to transactions to generate revised auto answers for the evaluations. AI engine 275 further compares the revised auto answers to a question to the answers submitted by evaluators to determine if a revised auto answer parameters exhibit a higher confidence score than the prior auto answer rule.
If the revise auto answer parameters result in a higher confidence level, the AI engine 275 can update the auto answer parameters for a question with the revised auto answer parameters. For example, if at least one revised autoscore template results in a higher confidence level than a prior autoscore template, the AI engine 275 can replace the selected autoscore template with the revised autoscore template. Revising a selected autoscore template may involve, for example, modifying or adding a lexicon 244, modifying or adding an autoscore template 246, modifying or adding an association between an autoscore template 246 and a lexicon 244, modifying or adding an association between a question 248 and autoscore template 246 or otherwise changing how transactions are autoscored or a question auto-answered based on an autoscore template.
AI engine 275 can iterate through multiple revisions of auto answer parameters (e.g., multiple revisions of a template) until a confidence score that meets the confidence threshold is reached. In the example of
With reference to
The AI engine identifies a set of completed evaluations having autoscore auto answers generated based on a selected autoscore template (step 1752) and the set of transactions evaluated by the completed evaluations (step 1754). In one embodiment, for example, the metadata of a completed evaluation includes the identity of the autoscore templates that were used to autoscore a transaction and the transaction that was evaluated. Thus, in one embodiment, step 1752 and 1754 may include querying a data store (e.g., data store 208) for evaluations that are associated with a selected autoscore template.
At step 1756, the AI engine identifies a first subset of completed evaluations corresponding to a first acceptable answer. According to one embodiment, the first subset of completed evaluations are the evaluations from the set of completed evaluations in which evaluation answers to a question associated with the selected autoscore template are the first acceptable answer. More particularly, according to one embodiment, the first subset of completed evaluations may be the evaluations from the set of completed evaluations in which evaluation answers to a question associated with the selected autoscore template are the first acceptable answer, where the evaluation answer was not changed from the autoscore auto answer. In one embodiment, the first acceptable answer may be the target answer set with control 776.
For example, for a yes/no question associated with the selected autoscore template, the AI engine may identify the first subset of completed evaluations as those completed evaluations in which the evaluation answers to the question are “yes.” Further, in some embodiments, the AI engine may identify the first subset of completed evaluations as those completed evaluations in which the evaluation answers to the question are “yes” and the autoscore auto answers are “yes” (e.g., evaluations for which the evaluator did not change the answer from “no” to “yes”). As another example, for a question having the acceptable answers illustrated in
The AI engine may also determine at least one additional subset of completed evaluations. The at least one additional subset of completed evaluations may be the evaluations from the set of completed evaluations in which the evaluation answers to the question associated with the selected autoscore template are not the first acceptable answer. The AI engine may determine, for example, a second subset of completed evaluations corresponding to a second acceptable answer, where the second subset of completed evaluations includes the evaluations from the set of completed evaluations in which the evaluation answers to the question associated with the selected autoscore template are a second acceptable answer.
Continuing with the previous example of a yes/no question associated with the selected autoscore template, the AI engine may identify the second subset of completed evaluations as those completed evaluations in which the evaluation answers to the question are “no.” As another example, for a question having the acceptable answers illustrated in
At step 1758, the AI engine determines a list of candidate words or phrases for the autoscore template. The candidate words and phrases are words or phrases common to a first subset of transcripts, where the first subset of transcripts are the transcripts of the identified transactions evaluated by the evaluations in the first subset of evaluations; that is, the candidate words or phrases include words or phrases common to a first subset of transcripts to which the autoscore template was applied.
For example, for a yes/no question associated with the selected autoscore template, the first subset of transcripts may be the transcripts of the transactions having corresponding evaluations in which the evaluation answers to the question are “yes” and the AI engine can thus identify words or phrases common to be the transcripts of the transactions having corresponding evaluations in which the evaluation answers to the question are “yes.” As another example, for a question having the acceptable answers illustrated in
According to one embodiment, the AI engine determines common words or phrases using term vectors that represent each transcript as a vector of terms or phrases, where each word or phrase is a dimension. Generally, if a term or phrase appears in a transcript, the term or phrase has a nonzero value in the term vector for the transcript. The value for a term or phrase in a term vector may represent the frequency of the term in the transcript. For example, the value of a term or phrase in a term vector may be a term frequency-inverse document frequency (tf-idf) measure that reflects the importance of a word to a transcript in a corpus, where the corpus comprise the transcripts that were autoscored by the autoscore template or other collection of transcripts of which the first subset of transcripts are part.
Some search tools, such as APACHE SOLR by the APACHE SOFTWARE FOUNDATION of Forest Hill, Md., United States, support queries for term vectors and can return the term vector, the term frequency, inverse document frequency, position, and offset information for terms in documents. Thus, the AI engine can query a search engine (e.g., search component 218) for the term vectors. The AI engine may determine the set of terms or phrases that are common to the transcripts in the first subset of transcripts as the set of terms or phrases that have nonzero values in all of the term vectors for the transcripts in the first subset of transcripts.
In one embodiment, for a yes/no question associated with the selected autoscore template, the AI engine may identify the words or phrases having a nonzero weight in the term vectors of the transcripts of transactions having corresponding evaluations in which the evaluation answers to the question are “yes.” As another example, for a question having the acceptable answers illustrated in
The AI engine reduces the set of terms or phrases that are common to the transcripts in the first subset of transcripts to a set of candidate terms. According to one embodiment, the AI engine identifies the words or phrases that have a greater than zero frequency in the transcripts of transactions evaluated by the evaluations in the second subset of evaluations and does not select those words or phrases as candidate words or phrases. In addition or in the alternative, the AI engine may also remove words or phrases that appear in a lexicon of the selected autoscore template.
For example, for a yes/no question associated with the selected autoscore template, the second subset of transcripts may be the transcripts of the identified transactions having corresponding evaluations in which the evaluation answers to the question are “no.” That is, the second subset of transcripts includes the transcripts of transactions evaluated by a completed evaluation in the second subset of completed evaluations. The AI engine can identify the words or phrases common to the first subset of transcripts and remove the words or phrases that also appear in the second subset of transcripts from the candidate words or phrases. In one embodiment, for example, the AI engine identifies the words or phrases common to the first subset of transcripts that also have a nonzero value in any term vector of a transcript of an identified transaction that has corresponding evaluation in which the evaluation answers to the question of “no” and removes the identified words or phrases from the candidate terms or phrases.
As another example, for a question having the acceptable answers illustrated in
In addition or in the alternative, the AI engine, in one embodiment, may select only the terms from the set of terms or phrases that are common to the transcripts in the first subset of transcripts that have greater than a threshold frequency in each of transcripts in the first subset of transcripts as the candidate terms. As one example, the AI engine only selects as candidate terms or phrases the terms or phrases that are common to the transcripts in the first subset of transcripts and have greater than a threshold term frequency or tf-idf for each of the transcripts in the first subset of transcripts.
At step 1760, the AI engine creates a revised autoscore template. More particularly, the AI engine can create a new lexicon containing a candidate word or phrase as a lexicon entry and associate the autoscore template with the new lexicon. The AI engine may use a configured default value for the lexicon weight value for the new lexicon and a default multiplier setting. In another embodiment, the AI engine adds a candidate word or phrase to an existing lexicon associated with the selected autoscore template to create a revised autoscore template. The existing lexicon weight and multiplier setting may be used. Each new lexicon entry added to a new lexicon or existing lexicon may have a weight of 1 or other weight (e.g. a weight based on frequency of the term or phrase).
Adding a candidate word or phrase to a new or existing lexicon can include adding a single candidate word or phrase as a lexicon entry. This process can be repeated iteratively, adding a candidate word or phrase, testing the revised lexicon and repeating until a desired confidence is reached or all the candidate words and phrases have been added. In another embodiment, all the candidate words and phrases may be added to a new or existing lexicon before the revised autoscore template is tested.
At step 1762, the AI engine applies the revised autoscore template to a set of test transactions evaluated by the set of completed evaluations to autoscore the transactions (the transactions identified in step 1754). The set of test transactions may include all the transactions identified in step 1754 or a subset thereof. The AI engine repeats method 1400 using the revised autoscore template on the set of test transactions to generate revised autoscores for the set of transactions. The AI engine further auto-answers the question associated with the selected autoscore template to generate revised autoscore auto answers based on the revised autoscores for the set of test transactions. For example, the AI engine may perform steps 1512, 1514, 1516, 1520 for a question previously auto answered based on the selected autoscore template using the revised autoscore template and revised autoscores to determine a revised autoscore auto answers for the test transactions. The revised autoscore auto answer for a test transaction may be the same as or different than the previous autoscore auto answer.
At step 1764, the AI engine determines a confidence score for the revised autoscore template. For example, the AI engine may perform steps 1602, 1604, 1606 using the evaluation answers to the question in the evaluations of the test transactions from the set of completed evaluations identified in step 1752 and the revised autoscore auto answers determined in step 1762. For each of the evaluation of a test transaction from the evaluations identified in step 1752, the evaluation system compares the revised autoscore auto answer to the question associated with the revised autoscore template to the evaluation answer to the question to determine if the evaluator would have changed the answer from the preselected answer had the revised autoscore template been used.
Based on the comparisons, the evaluation system can determine a confidence score for the revised autoscore template. According to one embodiment, for example, if the evaluation system determines that evaluators would have changed the answer to the question from the revised autoscore auto answer in twenty percent of the test transaction evaluations had the revised autoscore template been used, the evaluation system can assign a confidence score of 80 to the revised autoscore template.
The AI engine, at step 1766, compares the confidence score for the revised autoscore template to a threshold. If the confidence score for the autoscore template meets the threshold and is greater than the confidence score of the selected autoscore template, the evaluation system can replace the selected autoscore template with the revised autoscore template (step 1768) and update the autoscore auto answers in the set of completed evaluations (step 1770). Thus, subsequent autoscoring and auto answering is performed using a more accurate autoscore template. In another embodiment, the AI engine may store the revised autoscore template, revised lexicon or new lexicon and alert the user that a revised autoscore template is available. The user can be responsible for approving the use of the revised autoscore template.
If the confidence score for the revised autoscore template does not meet the confidence threshold, the evaluation system may implement further low confidence processing for the revised autoscore template (step 1772). For example, the AI engine may iterate through additional revised autoscore templates, such as by adding additional candidate terms. In another embodiment, the AI engine may simply discard the revised autoscore template. As another example, the AI engine may select the version of the autoscore template that has the highest confidence score as the active autoscore template to use going forward.
Low confidence processing may involve a wide variety of processing. According to one embodiment for example, the evaluation flags the autoscore template so that auto scorer 232 stops using the autoscore template. As another example, the evaluation system generates an alert to a user so that the user can retune the autoscore template. Other processing may also be implemented.
The steps of
The AI engine identifies a set of completed evaluations having autoscore auto answers generated based on a selected autoscore template (step 1802) and the set of transactions evaluated by the completed evaluations (step 1804). In one embodiment, for example, the metadata of a completed evaluation includes the identity of the autoscore templates that were used to autoscore a transaction and the transaction that was evaluated. Thus, in one embodiment, step 1802 and 1804 may include querying a data store (e.g., data store 208) for evaluations that are associated with a selected autoscore template.
At step 1806, the AI engine incrementally adjusts parameters of the selected autoscore template to create a revised autoscore template. Incrementally adjusting the parameters may include for example, adjusting the lexicon entry weights in a lexicon associated with the selected autoscore template, adjusting the lexicon weight assigned to a lexicon in the selected autoscore template, selecting or deselecting the multiplier for a lexicon, adjusting the base score.
At step 1810, the AI engine applies the revised autoscore template to a set of test transactions evaluated by the set of completed evaluations to autoscore the test transactions (the transactions identified in step 1804). For example, the AI engine may repeat method 1400 using the revised autoscore template on the set of test transactions to generate revised autoscores for the set of transactions. The AI engine further auto-answers the question associated with the selected autoscore template to generate revised autoscore auto answers based on the revised autoscores for the set of test transactions. For example, the AI engine may perform steps 1512, 1514, 1516, 1520 for a question previously auto answered based on the selected autoscore template using the revised autoscore template and revised autoscores to determine revised autoscore auto answers for the test transactions. The revised autoscore auto answer for a test transaction may be the same as or different than the previous autoscore auto answer.
At step 1812, the AI engine determines a confidence score for the revised autoscore template. For example, the AI engine may perform steps 1602, 1604, 1606 using the evaluation answers to the question from the set of completed evaluations of the test transactions from the evaluations identified in step 1804 and the revised autoscore auto answers determined in step 1812. For each of the evaluations of a test transaction, the evaluation system compares the revised autoscore auto answer to the question associated with the revised autoscore template to the evaluation answer to the question to determine if the evaluator would have changed the answer from the preselected answer had the revised autoscore template been used.
Based on the comparisons, the evaluation system can determine a confidence score for the revised autoscore template. According to one embodiment, for example, if the evaluation system determines that evaluators would have changed the answer to the question from the revised autoscore auto answer in twenty percent of the completed evaluations of the test transactions had the revised autoscore template been used, the evaluation system can assign a confidence score of 80 to the revised autoscore template.
The AI engine, at step 1814, compares the confidence score for the revised autoscore template to a threshold. If the confidence score for the autoscore template meets the threshold and is greater than the confidence score of the selected autoscore template, the evaluation system can replace the selected autoscore template with the revised autoscore template (step 1816) and update the autoscore auto answers in the set of completed evaluations (step 1818). Thus, subsequent autoscoring and auto answering is performed using a more accurate autoscore template. In another embodiment, the AI engine may store the revised autoscore template, revised lexicon or new lexicon and alert the user that a revised autoscore template is available. The user can be responsible for approving the use of the revised autoscore template.
If the confidence score for the revised autoscore template does not meet the confidence threshold, the evaluation system may implement further low confidence processing for the revised autoscore template (step 1820). For example, the AI engine may iterate through additional revised autoscore templates, such as by further adjusting parameters (e.g., returning to step 1806). In another embodiment, the AI engine may simply discard the revised autoscore template. As another example, the AI engine may select the version of the autoscore template that has the highest confidence score as the active autoscore template to use going forward.
Low confidence processing may involve a wide variety of processing. According to one embodiment for example, the evaluation flags the autoscore template so that auto scorer 232 stops using the autoscore template. As another example, the evaluation system generates an alert to a user so that the user can retune the autoscore template. Other processing may also be implemented.
As can be understood from the foregoing, as new transactions are recorded (e.g., in data store 208, auto answered and evaluated, the confidence score for a set of auto answer parameters may change. Thus, the system can detect that a set of auto answer parameters are no longer sufficiently accurate and retune the parameters. For example, as new transactions are recorded, autoscored, and evaluated, the confidence score for an autoscore template may change. Thus, the system can detect that an autoscore template is no longer sufficiently accurate if the confidence score for the template drops below a threshold and automatically retune the template.
Further, network computing environment includes a telephony network 2007 to connect server computer 2002 and server computer 2004 to call center voice instruments 2060 and external voice instruments 2062. Telephony network 2007 may utilize various types of voice communication known in the art. Telephony network may comprise, for example, a PTSN, PBX, VOIP network, cellular network or combination thereof.
For the purpose of illustration, a single system is shown for each of computer 2002, 2004, 2006, 2008 and 2009 and a single voice instrument is shown for each of voice instruments 2060, 2062. However, each computer 2002, 2004, 2006, 2008 and 2009 may comprise a plurality of computers (not shown) interconnected to each other over network 2005. For example, a plurality of computers 2002, a plurality of computers 2004, a plurality of computers 2006, a plurality of computers 2008 and a plurality of computer 2009 may be coupled to network 2005. Furthermore, a plurality of computers 2002, plurality of computers 2004, a plurality of voice instruments 2062 and a plurality of voice instruments 2060 may be coupled to telephony network 2007.
Server computer 2002 can include can include central processing unit (“CPU”) 2020, read-only memory (“ROM”) 2022, random access memory (“RAM”) 2024, hard drive (“HD”) or storage memory 2026, input/output device(s) (“I/O”) 2028 and communication interface 2029. I/O 2028 can include a keyboard, monitor, printer, electronic pointing device (e.g., mouse, trackball, stylus, etc.), or the like. Communications interface may include a communications interface, such as a network interface card, to interface with network 2005 and phone interface cards to interface with telephony network 2007.
According to one embodiment, server computer 2002 may include computer executable instructions stored on a non-transitory computer readable medium coupled to a processor. The computer executable instructions of server 2002 may be executable to provide a recording system. For example, the computer executable instructions may be executable to provide a recording server, such as recording server 114, or an ingestion server, such as ingestion server 116. Server computer 2002 may implement a recording system that records voice sessions between a voice instrument 2060 and a voice instrument 2062 (e.g., between a call center agent voice instrument and a customer voice instrument) and data sessions with client computer 2006. Server computer 2002 stores session data for voice and data sessions in transaction data store 2022.
Server computer 2004 can comprise CPU 2030, ROM 2032, RAM 2034, HD 2036, I/O 2038 and communications interface 2039. I/O 2038 can include a keyboard, monitor, printer, electronic pointing device (e.g., mouse, trackball, stylus, etc.), or the like. Communications interface 2039 may include a communications interface, such as a network interface card, to interface with network 2005 and telephony interface card to interface with telephony network 2007.
According to one embodiment, server computer 2004 may include a processor (e.g., CPU 2030) coupled to a data store configured to store transactions (e.g., transaction metadata and associated recorded sessions). For example, server computer 2004 may include CPU 2030 coupled to data store 2022 via network 2005. Server computer 2004 may further comprise computer executable instructions stored on a non-transitory computer readable medium coupled to the processor. The computer executable instructions of server 2004 may be executable to provide an evaluation system.
The computer executable instructions may be executable to provide a variety of services to client computer 2006, 2008, such as providing interfaces to allow a designer to design lexicons, autoscore templates, questions, answer templates and evaluation forms. The computer executable instructions of server computer 2004 may be further executable to execute to evaluations to evaluators. The computer executable instructions may further utilize data stored in a data store 2040. According to one embodiment, the computer executable instructions of server computer 2004 may be executable to implement server tier 202.
Computer 2006 can comprise CPU 2050, ROM 2052, RAM 2054, HD 2056, I/O 2058 and communications interface 2059. I/O 2058 can include a keyboard, monitor, printer, electronic pointing device (e.g., mouse, trackball, stylus, etc.), or the like. Communications interface 2059 may include a communications interface, such as a network interface card, to interface with network 2005. Computer 2006 may comprise call center agent software to allow a call center agent to participate in a data session that is recorded by server 2002. Computer 2006 may be an example of an agent computer 164 or a supervisor computer 174.
Computer 2008 can similarly comprise CPU 2070, ROM 2072, RAM 2074, HD 2076, I/O 2078 and communications interface 2079. I/O 2078 can include a keyboard, monitor, printer, electronic pointing device (e.g., mouse, trackball, stylus, etc.), or the like. Communications interface 2079 may include a communications interface, such as a network interface card, to interface with network 2005. Computer 2008 may comprise a web browser or other application that can cooperate with server computer 2004 to allow a user to define lexicons, autoscore templates, questions, answer templates and evaluation forms. Computer 2008 may be an example of a client computer 180. According to one embodiment, computer 2006 or computer 2008 may implement client tier 203.
Computer 2009 can similarly comprise CPU 2080, ROM 2082, RAM 2084, HD 2086, I/O 2088 and communications interface 2089. I/O 2088 can include a keyboard, monitor, printer, electronic pointing device (e.g., mouse, trackball, stylus, etc.), or the like. Communications interface 2089 may include a communications interface, such as a network interface card, to interface with network 2005. Computer 2009 may comprise a web browser that allows an evaluator to complete evaluations. Computer 2009 may be another example of a client computer 180.
Call center voice instrument 2060 and external voice instrument 2062 may operate according to any suitable telephony protocol. Call center voice instrument 2060 can be an example of agent voice instrument 162 or supervisor voice instrument 172 and external voice instrument 2062 may be an example of a customer voice instrument.
Each of the computers in
Portions of the methods described herein may be implemented in suitable software code that may reside within ROM 2012, 2032, 2052, 2072 and 2082; RAM 2014, 2034, 2054, 2074 and 2084; HD 2016, 2036, 2056, 2076 and 2086. In addition to those types of memories, the instructions in an embodiment disclosed herein may be contained on a data storage device with a different computer-readable storage medium. Alternatively, the instructions may be stored as software code elements on a data storage array, magnetic tape, floppy diskette, optical storage device, or other appropriate data processing system readable medium or storage device.
Although the invention has been described with respect to specific embodiments thereof, these embodiments are merely illustrative, and not restrictive of the invention. Rather, the description is intended to describe illustrative embodiments, features and functions in order to provide a person of ordinary skill in the art context to understand the invention without limiting the invention to any particularly described embodiment, feature or function, including any such embodiment feature or function described. While specific embodiments of, and examples for, the invention are described herein for illustrative purposes only, various equivalent modifications are possible within the spirit and scope of the invention, as those skilled in the relevant art will recognize and appreciate. As indicated, these modifications may be made to the invention in light of the foregoing description of illustrated embodiments of the invention and are to be included within the spirit and scope of the invention. Thus, while the invention has been described herein with reference to particular embodiments thereof, a latitude of modification, various changes and substitutions are intended in the foregoing disclosures, and it will be appreciated that in some instances some features of embodiments of the invention will be employed without a corresponding use of other features without departing from the scope and spirit of the invention as set forth. Therefore, many modifications may be made to adapt a particular situation or material to the essential scope and spirit of the invention.
Embodiments described herein can be implemented in the form of control logic in software or hardware or a combination of both. The control logic may be stored in an information storage medium, such as a computer-readable medium, as a plurality of instructions adapted to direct an information processing device to perform a set of steps disclosed in the various embodiments. Based on the disclosure and teachings provided herein, a person of ordinary skill in the art will appreciate other ways and/or methods to implement the invention. At least portions of the functionalities or processes described herein can be implemented in suitable computer-executable instructions. The computer-executable instructions may reside on a computer readable medium, hardware circuitry or the like, or any combination thereof.
Those skilled in the relevant art will appreciate that the invention can be implemented or practiced with other computer system configurations including, without limitation, multi-processor systems, network devices, mini-computers, mainframe computers, data processors, and the like. The invention can be employed in distributed computing environments, where tasks or modules are performed by remote processing devices, which are linked through a communications network such as a LAN, WAN, and/or the Internet. In a distributed computing environment, program modules or subroutines may be located in both local and remote memory storage devices. These program modules or subroutines may, for example, be stored or distributed on computer-readable media, including magnetic and optically readable and removable computer discs, stored as firmware in chips, as well as distributed electronically over the Internet or over other networks (including wireless networks).
Any suitable programming language can be used to implement the routines, methods or programs of embodiments of the invention described herein, including C, C++, Java, JavaScript, HTML, or any other programming or scripting code, etc. Different programming techniques can be employed such as procedural or object oriented. Other software/hardware/network architectures may be used. Communications between computers implementing embodiments can be accomplished using any electronic, optical, radio frequency signals, or other suitable methods and tools of communication in compliance with known network protocols.
As one skilled in the art can appreciate, a computer program product implementing an embodiment disclosed herein may comprise a non-transitory computer readable medium storing computer instructions executable by one or more processors in a computing environment. The computer readable medium can be, by way of example only but not by limitation, an electronic, magnetic, optical or other machine readable medium. Examples of non-transitory computer-readable media can include random access memories, read-only memories, hard drives, data cartridges, magnetic tapes, floppy diskettes, flash memory drives, optical data storage devices, compact-disc read-only memories, and other appropriate computer memories and data storage devices.
Particular routines can execute on a single processor or multiple processors. Although the steps, operations, or computations may be presented in a specific order, this order may be changed in different embodiments. In some embodiments, to the extent multiple steps are shown as sequential in this specification, some combination of such steps in alternative embodiments may be performed at the same time. The sequence of operations described herein can be interrupted, suspended, or otherwise controlled by another process, such as an operating system, kernel, etc. Functions, routines, methods, steps and operations described herein can be performed in hardware, software, firmware or any combination thereof.
It will also be appreciated that one or more of the elements depicted in the drawings/figures can be implemented in a more separated or integrated manner, or even removed or rendered as inoperable in certain cases, as is useful in accordance with a particular application. Additionally, any signal arrows in the drawings/figures should be considered only as exemplary, and not limiting, unless otherwise specifically noted.
Reference throughout this specification to “one embodiment”, “an embodiment”, or “a specific embodiment” or similar terminology means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment and may not necessarily be present in all embodiments. Thus, respective appearances of the phrases “in one embodiment”, “in an embodiment”, or “in a specific embodiment” or similar terminology in various places throughout this specification are not necessarily referring to the same embodiment. Furthermore, the particular features, structures, or characteristics of any particular embodiment may be combined in any suitable manner with one or more other embodiments. It is to be understood that other variations and modifications of the embodiments described and illustrated herein are possible in light of the teachings herein and are to be considered as part of the spirit and scope of the invention.
As used herein, the terms “comprises,” “comprising,” “includes,” “including,” “has,” “having,” or any other variation thereof, are intended to cover a non-exclusive inclusion. For example, a process, product, article, or apparatus that comprises a list of elements is not necessarily limited only those elements but may include other elements not expressly listed or inherent to such process, product, article, or apparatus.
Furthermore, the term “or” as used herein is generally intended to mean “and/or” unless otherwise indicated. For example, a condition A or B is satisfied by any one of the following: A is true (or present) and B is false (or not present), A is false (or not present) and B is true (or present), and both A and B are true (or present). As used herein, a term preceded by “a” or “an” (and “the” when antecedent basis is “a” or “an”) includes both singular and plural of such term, unless clearly indicated within the claim otherwise (i.e., that the reference “a” or “an” clearly indicates only the singular or only the plural). Also, as used in the description herein and throughout the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise.
Additionally, any examples or illustrations given herein are not to be regarded in any way as restrictions on, limits to, or express definitions of, any term or terms with which they are utilized. Instead, these examples or illustrations are to be regarded as being described with respect to one particular embodiment and as illustrative only. Those of ordinary skill in the art will appreciate that any term or terms with which these examples or illustrations are utilized will encompass other embodiments which may or may not be given therewith or elsewhere in the specification and all such embodiments are intended to be included within the scope of that term or terms. Language designating such nonlimiting examples and illustrations includes, but is not limited to: “for example,” “for instance,” “e.g.,” “in one embodiment.”
In the description herein, numerous specific details are provided, such as examples of components and/or methods, to provide a thorough understanding of embodiments of the invention. One skilled in the relevant art will recognize, however, that an embodiment may be able to be practiced without one or more of the specific details, or with other apparatus, systems, assemblies, methods, components, materials, parts, and/or the like. In other instances, well-known structures, components, systems, materials, or operations are not specifically shown or described in detail to avoid obscuring aspects of embodiments of the invention. While the invention may be illustrated by using a particular embodiment, this is not and does not limit the invention to any particular embodiment and a person of ordinary skill in the art will recognize that additional embodiments are readily understandable and are a part of this invention.
Generally then, although the invention has been described with respect to specific embodiments thereof, these embodiments are merely illustrative, and not restrictive of the invention. Rather, the description is intended to describe illustrative embodiments, features and functions in order to provide a person of ordinary skill in the art context to understand the invention without limiting the invention to any particularly described embodiment, feature or function, including any such embodiment feature or function described. While specific embodiments of, and examples for, the invention are described herein for illustrative purposes only, various equivalent modifications are possible within the spirit and scope of the invention, as those skilled in the relevant art will recognize and appreciate.
As indicated, these modifications may be made to the invention in light of the foregoing description of illustrated embodiments of the invention and are to be included within the spirit and scope of the invention. Thus, while the invention has been described herein with reference to particular embodiments thereof, a latitude of modification, various changes and substitutions are intended in the foregoing disclosures, and it will be appreciated that in some instances some features of embodiments of the invention will be employed without a corresponding use of other features without departing from the scope and spirit of the invention as set forth. Therefore, many modifications may be made to adapt a particular situation or material to the essential scope and spirit of the invention.