The development of the EDVAC computer system of 1948 is often cited as the beginning of the computer era. Since that time, computer systems have evolved into extremely complicated devices, and that complexity extends to computer memory. Methods and systems for the efficient use of computer memory continue to evolve.
Example methods, apparatus, and products for configuring computer memory are described with reference to the accompanying drawings, beginning with
The overall example computer system illustrated in
The word (509) of digitized speech (508) is speech for recognition (315) from a conversation (313). The speech for recognition can be the entire conversation, where, for example, both persons speaking are in the same room, and the entire conversation is picked up by a microphone on a speech-enabled device. The scope of speech for recognition can be reduced by providing to a speech-enabled device only one side of the conversation, for example, only the speech captured through a microphone on a headset (105). The scope of speech for recognition can be reduced even further by providing for recognition only speech that responds to a prompt from a VoiceXML dialog executing on a speech-enabled device. As the scope of speech for recognition is reduced, data processing burdens are reduced across the system as a whole, although it remains an option, in some embodiments at least, to recognize the entire conversation and stream across a display (110) a flow of all words in the conversation.
Speech from the conversation (313) is recognized into digitized speech by operation of a natural language processing speech recognition (“NLP-SR”) engine (153), shown here disposed upon a voice server (151), but also amenable to installation on speech-enabled devices. The NLP-SR engine also carries out the parsing of a word (509) of the speech so digitized (508) into a triple (752) of a description logic.
A triple is a three-part statement expressed in a form of logic. Depending on context, different terminologies are used to refer to effectively the same three parts of a statement in a logic. In first order logic, the parts are called constant, unary predicate, and binary predicate. In the Web Ontology Language (“OWL”) the parts are individual, class, and property. In some description logics the parts are called individual, concept, and role.
In this paper, the elements of a triple are referred to as subject, predicate, and object—and expressed like this: <subject> <predicate> <object>. There are many modes of expression for triples. Elements of triples can be represented as Uniform Resource Locators (“URLs”), Uniform Resource Identifiers (“URIs”), or Internationalized Resource Identifiers (“IRIs”). Triples can be expressed in N-Quads, Turtle syntax, TriG, JavaScript Object Notation (“JSON”), and so on. The expression used here, subject-predicate-object in angle brackets, is one form of abstract syntax, optimized for human readability rather than machine processing, although its substantive content is correct for expression of triples. Using this abstract syntax, here are examples of triples:
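    <Bob> <isA> <person>
    <Bob> <isAFriendOf> <Alice>
    <Bob> <isBornOn> <the 4th of July 1990>
    <Bob> <isInterestedIn> <the Mona Lisa>
    <the Mona Lisa> <wasCreatedBy> <Leonardo da Vinci>
    <the video 'La Joconde à Washington'> <isAbout> <the Mona Lisa>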
The same item can be referenced in multiple triples. In this example, Bob is the subject of four triples, and the Mona Lisa is the subject of one triple and the object of two. This ability to have the same item be the subject of one triple and the object of another makes it possible to effect connections among triples, and connected triples form graphs.
For further explanation of relations among triples and graphs,
In systems of knowledge representation, knowledge is represented in graphs of triples, including, for example, knowledge representations implemented in Prolog databases, Lisp data structures, or in RDF-oriented ontologies in RDFS, OWL, and other ontology languages. Search and inference are effected against such graphs by search engines configured to execute semantic queries in, for example, Prolog or SPARQL. Prolog is a general-purpose logic programming language. SPARQL is a recursive acronym for “SPARQL Protocol and RDF Query Language.” Prolog supports queries against connected triples expressed as statements and rules in a Prolog database. SPARQL supports queries against ontologies expressed in RDFS or OWL or other RDF-oriented ontologies. Prolog, SPARQL, RDF, and the like are mentioned here as technologies that explain example embodiments of the present invention; they are not limitations of the present invention. Knowledge representations useful according to embodiments can take many forms as may occur to those of skill in the art, now or in the future, and all such are now and will continue to be well within the scope of the present invention.
A description logic is a member of a family of formal knowledge representation languages. Some description logics are more expressive than propositional logic but less expressive than first-order logic. In contrast to first-order logics, reasoning problems for description logics are usually decidable. Efficient decision procedures therefore can be implemented for problems of search and inference in description logics. There are general, spatial, temporal, spatiotemporal, and fuzzy description logics, and each description logic features a different balance between expressivity and reasoning complexity by supporting different sets of mathematical constructors.
Search queries are disposed along a scale of semantics. A traditional web search, for example, is disposed at the zero point of that scale: no semantics, no structure. A traditional web search against the keyword “derivative” returns HTML documents discussing the literary concept of derivative works as well as calculus procedures. A traditional web search against the keyword “differential” returns HTML pages describing automobile parts as well as calculus functions.
Other queries are disposed along mid-points of the scale, some semantics, some structure, not entirely complete. This is actually a current trend in web search. Such systems may be termed executable rather than decidable. From some points of view, decidability is not a primary concern. In many Web applications, for example, data sets are huge, and they simply do not require a 100 percent correct model to analyze data that may have been spidered, scraped, and converted into structure by some heuristic program that itself is imperfect. People use Google because it can find good answers a lot of the time, even if it cannot find perfect answers all the time. In such rough-and-tumble search environments, provable correctness is not a key goal.
Other classes of queries are disposed where correctness of results is key, and decidability enters. A user who is a tele-agent in a data center speaking by phone with an automotive customer discussing a front differential is concerned not to be required to sort through calculus results to find correct terminology. Such a user needs correct definitions of automotive terms, and the user needs query results in conversational real time, that is, for example, within seconds.
In formal logic, a system is decidable if there exists a method such that, for every assertion that can be expressed in terms of the system, the method is capable of deciding whether or not the assertion is valid within the system. In practical terms, a query against a decidable description logic will not loop indefinitely, crash, fail to return an answer, or return a wrong answer. A decidable description logic supports data models or ontologies that are clear, unambiguous, and machine-processable. Undecidable systems do not. A decidable description logic supports algorithms by which a computer system can determine equivalence of classes defined in the logic. Undecidable systems do not. Decidable description logics can be implemented in C, C++, SQL, Lisp, RDF/RDFS/OWL, and so on. In the RDF space, subdivisions of OWL vary in decidability. Full OWL does not support decidability. OWL DL does.
For further explanation,
In the particular example of
The example apparatus of
VOIP stands for ‘Voice Over Internet Protocol,’ a generic term for routing speech over an IP-based data communications network. The speech data flows over a general-purpose packet-switched data communications network, instead of traditional dedicated, circuit-switched voice transmission lines. Protocols used to carry voice signals over the IP data communications network are commonly referred to as ‘Voice over IP’ or ‘VOIP’ protocols. VOIP traffic may be deployed on any IP data communications network, including data communications networks lacking a connection to the rest of the Internet, for instance on a private building-wide local area data communications network or ‘LAN.’
Many protocols may be used to effect VOIP, including, for example, types of VOIP effected with the IETF's Session Initiation Protocol (‘SIP’) and the ITU's protocol known as ‘H.323.’ SIP clients use TCP and UDP port 5060 to connect to SIP servers. SIP itself is used to set up and tear down calls for speech transmission. VOIP with SIP then uses the Real-time Transport Protocol (‘RTP’) for transmitting the actual encoded speech. Similarly, H.323 is an umbrella recommendation from the standards branch of the International Telecommunication Union that defines protocols to provide audio-visual communication sessions on any packet data communications network.
Speech-enabled application (195) is a user-level, speech-enabled, client-side computer program that presents a voice interface to user (128), provides audio prompts and responses (314) and accepts input speech for recognition (315). Speech-enabled application (195) provides a speech interface through which a user may provide oral speech for recognition through microphone (176) and have the speech digitized through an audio amplifier (185) and a coder/decoder (‘codec’) (183) of a sound card (174) and provide the digitized speech for recognition to a voice server (151). Speech-enabled application (195) packages digitized speech in recognition request messages according to a VOIP protocol, and transmits the speech to voice server (151) through the VOIP connection (216) on the network (100).
Voice server (151) provides voice recognition services for speech-enabled devices by accepting dialog instructions, VoiceXML segments, or the like, and returning speech recognition results, including text representing recognized speech, text for use as variable values in dialogs, and output from execution of semantic interpretation scripts as well as voice prompts. Voice server (151) includes computer programs that provide text-to-speech (‘TTS’) conversion for voice prompts and voice responses to user input in speech-enabled applications such as, for example, X+V applications, SALT applications, or Java Speech applications.
The speech-enabled device (152) in the example of
Semantic queries can be formulated in C, C++, Java, Prolog, Lisp, and so on. The semantic web technology stack of the W3C, for example, offers SPARQL to formulate semantic queries in a syntax similar to SQL. Semantic queries are used against data structured in triple stores, graph databases, semantic wikis, natural language processing systems, and artificial intelligence systems. As mentioned, semantic queries work on structured data, and in the particular examples of the present case, the structured data is words described and defined in semantic triples connected in ways that conform to a description logic. In many embodiments of the present invention, semantic queries are asserted against data structured according to a description logic that implements decidability.
In the example apparatus of
Configuring computer memory with triples of description logic according to embodiments of the present invention, particularly in a thin-client architecture, may be implemented with one or more voice servers. A voice server is a computer, that is, automated computing machinery, that provides speech recognition and speech synthesis. For further explanation, therefore,
Stored in RAM (168) is a voice server application (188), a module of computer program instructions capable of operating a voice server in a system that is configured for use in configuring memory according to embodiments of the present invention. Voice server application (188) provides voice recognition services for multimodal devices by accepting requests for speech recognition and returning speech recognition results, including text representing recognized speech, text for use as variable values in dialogs, and text as string representations of scripts for semantic interpretation. Voice server application (188) also includes computer program instructions that provide text-to-speech (‘TTS’) conversion for voice prompts and voice responses to user input in speech-enabled applications such as, for example, speech-enabled browsers, X+V applications, SALT applications, or Java Speech applications, and so on.
Voice server application (188) may be implemented as a web server, implemented in Java, C++, Python, Perl, or any language that supports X+V, SALT, VoiceXML, or other speech-enabled languages, by providing responses to HTTP requests from X+V clients, SALT clients, Java Speech clients, or other speech-enabled client devices. Voice server application (188) may, for a further example, be implemented as a Java server that runs on a Java Virtual Machine (102) and supports a Java voice framework by providing responses to HTTP requests from Java client applications running on speech-enabled devices. And voice server applications that support embodiments of the present invention may be implemented in other ways as may occur to those of skill in the art, and all such ways are well within the scope of the present invention.
The voice server (151) in this example includes a natural language processing speech recognition (“NLP-SR”) engine (153). An NLP-SR engine is sometimes referred to in this paper simply as a ‘speech engine.’ A speech engine is a functional module, typically a software module, although it may include specialized hardware as well, that does the work of recognizing and generating human speech. In this example, the speech engine (153) is a natural language processing speech engine that includes a natural language processing (“NLP”) engine (155). The NLP engine accepts recognized speech from an automated speech recognition (‘ASR’) engine, processes the recognized speech into parts of speech (subjects, predicates, objects, and so on), and then converts the recognized, processed parts of speech into semantic triples for inclusion in triple stores.
The speech engine (153) includes an automated speech recognition (‘ASR’) engine for speech recognition and a text-to-speech (‘TTS’) engine for generating speech. The speech engine also includes a grammar (104), a lexicon (106), and a language-specific acoustic model (108). The language-specific acoustic model (108) is a data structure, a table or database, for example, that associates speech feature vectors (‘SFVs’) with phonemes representing pronunciations of words in a human language. The lexicon (106) is an association of words in text form with phonemes representing pronunciations of each word; the lexicon effectively identifies words that are capable of recognition by an ASR engine. Also stored in RAM (168) is a Text-To-Speech (‘TTS’) Engine (194), a module of computer program instructions that accepts text as input and returns the same text in the form of digitally encoded speech, for use in providing speech as prompts for and responses to users of speech-enabled systems.
The grammar (104) communicates to the ASR engine (150) the words and sequences of words that currently may be recognized. For further explanation, distinguish the purpose of the grammar and the purpose of the lexicon. The lexicon associates with phonemes all the words that the ASR engine can recognize. The grammar communicates the words currently eligible for recognition. The two sets at any particular time may not be the same.
Grammars may be expressed in a number of formats supported by ASR engines, including, for example, the Java Speech Grammar Format (‘JSGF’), the format of the W3C Speech Recognition Grammar Specification (‘SRGS’), the Augmented Backus-Naur Form (‘ABNF’) from the IETF's RFC 2234, in the form of a stochastic grammar as described in the W3C's Stochastic Language Models (N-Gram) Specification, and in other grammar formats as may occur to those of skill in the art. Grammars typically operate as elements of dialogs, such as, for example, a VoiceXML <menu> or an X+V <form>. A grammar's definition may be expressed in-line in a dialog. Or the grammar may be implemented externally in a separate grammar document and referenced from within a dialog with a URI. Here is an example of a grammar expressed in JSGF:
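    #JSGF V1.0;
    grammar command;
    public <command> = (call | phone | telephone) <name> <when>;
    <name> = bob | martha | joe | pete | chris | john | artoush;
    <when> = today | this afternoon | tomorrow | next week;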
In this example, the elements named <command>, <name>, and <when> are rules of the grammar. Rules are a combination of a rulename and an expansion of a rule that advises an ASR engine or a voice interpreter which words presently can be recognized. In this example, expansion includes conjunction and disjunction, and the vertical bars ‘|’ mean ‘or.’ An ASR engine or a voice interpreter processes the rules in sequence, first <command>, then <name>, then <when>. The <command> rule accepts for recognition ‘call’ or ‘phone’ or ‘telephone’ plus, that is, in conjunction with, whatever is returned from the <name> rule and the <when> rule. The <name> rule accepts ‘bob’ or ‘martha’ or ‘joe’ or ‘pete’ or ‘chris’ or ‘john’ or ‘artoush’, and the <when> rule accepts ‘today’ or ‘this afternoon’ or ‘tomorrow’ or ‘next week.’ The command grammar as a whole matches utterances like these, for example:
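    “phone bob next week”
    “telephone martha this afternoon”
    “call chris tomorrow”
    “call pete today”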
The voice server application (188) in this example is configured to receive, from a speech-enabled client device located remotely across a network from the voice server, digitized speech for recognition from a user and pass the speech along to the ASR engine (150) for recognition. ASR engine (150) is a module of computer program instructions, also stored in RAM in this example. In carrying out automated speech recognition, the ASR engine receives speech for recognition in the form of at least one digitized word and uses frequency components of the digitized word to derive a speech feature vector or SFV. An SFV may be defined, for example, by the first twelve or thirteen Fourier or frequency domain components of a sample of digitized speech. The ASR engine can use the SFV to infer phonemes for the word from the language-specific acoustic model (108). The ASR engine then uses the phonemes to find the word in the lexicon (106).
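By way of illustration only, the following Python sketch mimics the flow just described with a toy acoustic model and lexicon. The synthetic frames, the 13-component SFV, and the phoneme strings are assumptions for the example; real ASR engines are far more elaborate.

    import numpy as np

    lexicon = {"HH AH L OW": "hello"}   # phoneme string -> word in text form

    def speech_feature_vector(samples, n_components=13):
        # Use the first frequency-domain components of a digitized sample as the SFV.
        spectrum = np.abs(np.fft.rfft(samples))
        return tuple(np.round(spectrum[:n_components], 1))

    # Toy acoustic model: associates SFVs with phonemes. Here it is built from
    # four synthetic frames standing in for the four phonemes of "hello".
    frames = [np.sin(2 * np.pi * f * np.linspace(0, 1, 160)) for f in (3, 5, 7, 9)]
    acoustic_model = dict(zip((speech_feature_vector(f) for f in frames),
                              "HH AH L OW".split()))

    def recognize(word_frames):
        # Infer a phoneme for each frame from the acoustic model, then look the
        # resulting phoneme string up in the lexicon.
        phonemes = " ".join(acoustic_model.get(speech_feature_vector(f), "?")
                            for f in word_frames)
        return lexicon.get(phonemes)

    print(recognize(frames))   # -> "hello"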
Also stored in RAM is a VoiceXML interpreter (192), a module of computer program instructions that processes VoiceXML grammars. VoiceXML input to VoiceXML interpreter (192) may originate, for example, from VoiceXML clients running remotely on speech-enabled devices, from X+V clients running remotely on speech-enabled devices, from SALT clients running on speech-enabled devices, from Java client applications running remotely on multimedia devices, and so on. In this example, VoiceXML interpreter (192) interprets and executes VoiceXML segments representing voice dialog instructions received from remote speech-enabled devices and provided to VoiceXML interpreter (192) through voice server application (188).
A speech-enabled application (195 on
As mentioned above, a Form Interpretation Algorithm (‘FIA’) drives the interaction between the user and a speech-enabled application. The FIA is generally responsible for selecting and playing one or more speech prompts, collecting a user input, either a response that fills in one or more input items, or a throwing of some event, and interpreting actions that pertained to the newly filled-in input items. The FIA also handles speech-enabled application initialization, grammar activation and deactivation, entering and leaving forms with matching utterances and many other tasks. The FIA also maintains an internal prompt counter that is increased with each attempt to provoke a response from a user. That is, with each failed attempt to prompt a matching speech response from a user an internal prompt counter is incremented.
Also stored in RAM (168) is an operating system (154). Operating systems useful in voice servers according to embodiments of the present invention include UNIX™, Linux™, Microsoft NT™, AIX™, IBM's i5/OS™, and others as will occur to those of skill in the art. Operating system (154), voice server application (188), VoiceXML interpreter (192), ASR engine (150), JVM (102), and TTS Engine (194) in the example of
Voice server (151) of
Voice server (151) of
The example voice server of
The example voice server (151) of
For further explanation,
Also disposed in RAM are a triple server application program (297), a semantic query engine (298), a general language triple store (323), a jargon triple store (325), a triple parser/serializer (294), a triple converter (292), and one or more triple files (290). The triple server application program (297) accepts, through network (100) from speech-enabled devices such as laptop (126), semantic queries that it passes to the semantic query engine (298) for execution against the triple stores (323, 325).
The triple parser/serializer (294) administers the transfer of triples between triple stores and various forms of disk storage. The triple parser/serializer (294) accepts as inputs the contents of triple stores and serializes them for output as triple files (290), tables, relational database records, spreadsheets, or the like, for long-term storage in non-volatile memory, such as, for example, a hard disk (170). The triple parser/serializer (294) accepts triple files (290) as inputs and outputs parsed triples into triple stores. In many embodiments, when the triple parser/serializer (294) accepts triple files (290) as inputs and outputs parsed triples into triple stores, the triple parser/serializer stores the output triple stores into contiguous segments of memory. Contiguous storage can be effected in the C programming language by a call to the malloc( ) function. Contiguous storage can be effected by the Python buffer protocol. Contiguous storage can be effected in other ways as will occur to those of skill in the art, and all such ways are within the scope of the present invention. In many embodiments, both triple stores (323, 325) would be stored in one or two segments of contiguous memory.
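By way of illustration only, here is a minimal Python sketch of holding a triple store in a single contiguous memory segment. The length-prefixed UTF-8 encoding and the function names are assumptions for the example; they are not the serialization used by the triple parser/serializer (294).

    import struct

    def pack_triples(triples):
        # Serialize (subject, predicate, object) strings into one contiguous
        # bytearray, which exposes the Python buffer protocol.
        buf = bytearray()
        for triple in triples:
            for element in triple:
                data = element.encode("utf-8")
                buf += struct.pack("<H", len(data))   # 2-byte length prefix
                buf += data
        return buf

    def unpack_triples(buf):
        # Parse the contiguous buffer back into a list of triples.
        elements, offset = [], 0
        view = memoryview(buf)                        # zero-copy view of the segment
        while offset < len(view):
            (length,) = struct.unpack_from("<H", view, offset)
            offset += 2
            elements.append(bytes(view[offset:offset + length]).decode("utf-8"))
            offset += length
        return [tuple(elements[i:i + 3]) for i in range(0, len(elements), 3)]

    jargon = [("door", "isA", "constructionMaterial")]
    segment = pack_triples(jargon)
    assert unpack_triples(segment) == jargon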
Contiguous memory is explained in more detail with reference to
Here is the challenge addressed by the use of contiguous memory. Cache access takes 10 nanoseconds. RAM access takes 100 nanoseconds. Disk access takes 10,000,000 nanoseconds. Those numbers are not intuitive, because people do not experience time in nanoseconds, so consider the same ratios in more familiar terms: if cache access is viewed as taking one minute, then RAM access takes 10 minutes, and disk access for the same data takes 1,000,000 minutes, roughly two years. Triples scattered across virtual memory addresses risk being stored in multiple page frames. Triples stored near one another in a contiguous memory segment are much more likely to be stored in a small number of page frames.
Suppose a jargon is composed of 500 word families, each of which includes three words, so that the entire jargon is expressed in 1,500 words, each of which on average requires 10 bytes of storage, for a total for the jargon of 15 kilobytes of storage. Some computer systems today support memory page sizes of a megabyte or more. Such a jargon can be stored in a single memory page, and, once that page is in RAM, operation of the jargon can proceed with no risk of page faults at all. Even if contiguous storage for such a jargon fell across a page boundary, the entire jargon can be loaded with only two page faults, and, after it is loaded into RAM, it can be operated with zero page faults going forward. Cache misses would still be required to load the contents into cache, but, except for the first one or two misses, none of the others would risk a page fault. The inventors estimate that after a short period of operation, the cache miss rate would be less than one percent. That is, when a jargon is disposed in contiguous memory according to embodiments of the present invention, memory access times generally will approximate cache access times, just a few nanoseconds, for more than 99% of memory accesses.
For further explanation,
The thick-client speech-enabled device in the example of
The triple parser/serializer (294) administers the transfer of triples between triple stores and disk storage (170), and disk storage is available directly on the thick-client (152) rather than across a network on a server. The triple parser/serializer (294) accepts as inputs the contents of triple stores and serializes them for output as triple files (290), tables, relational database records, spreadsheets, or the like, for long-term storage in non-volatile memory, such as, for example, a hard disk (170). The triple parser/serializer (294) accepts triple files (290) as inputs and outputs parsed triples into triple stores. In many embodiments, when the triple parser/serializer (294) accepts triple files (290) as inputs and outputs parsed triples into triple stores, the triple parser/serializer stores the output triple stores into contiguous segments of memory.
The speech engine (153) is a full-service NLP-SR engine that includes natural language processing (155), speech recognition (150), a grammar (104), a lexicon (106), a model (108), and text-to-speech processing (194), all as described in more detail above with regard to
For further explanation,
The method of
In typical embodiments, both recognizing (302) speech and parsing (304) words into triples are carried out by an NLP-SR engine (153). The NLP-SR engine inputs, optionally at least, the entire conversation (313) and presents to the parsing function (304) a stream of digitized speech (508). The parsing function processes a word (509), hands a triple off to a query engine (298) for further processing, and then loops (312) back to parse the next word in the stream of digitized speech.
The NLP-SR engine hands off a parsed triple (752) to a semantic query engine (298). The semantic query engine determines whether the parsed triple is recorded (308) in a general language triple store (323) in computer memory (168), and, if it is not (314), the query engine then determines whether the parsed triple is recorded (316) in a jargon triple store (325) in the computer memory.
If the parsed triple is recorded in neither the general language triple store nor the jargon triple store (314, 320), then a speech-enabled application (195) records (322) the parsed triple in the jargon triple store (325). After recording (322), the method loops (312) back to parse (304) the next word (509) in the stream of digitized speech (508). If the parsed triple is already stored in either a general language triple store (310) or a jargon triple store (318), then the method loops (312) directly back to parsing (304) a next word (509) without recording (322).
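A minimal Python sketch of this decision flow follows. The in-memory sets standing in for the general language triple store (323) and the jargon triple store (325), and the function name, are assumptions for illustration only.

    general_language_store = {("door", "isA", "barrier")}
    jargon_store = set()

    def record_if_new(parsed_triple):
        # Record a parsed triple in the jargon store only if it is recorded in
        # neither the general language store nor the jargon store.
        if parsed_triple in general_language_store:
            return "already in general language store"
        if parsed_triple in jargon_store:
            return "already in jargon store"
        jargon_store.add(parsed_triple)
        return "recorded in jargon store"

    # Each word parsed from the stream of digitized speech yields a triple that
    # passes through the same test before the loop returns for the next word.
    print(record_if_new(("door", "isA", "constructionMaterial")))  # recorded
    print(record_if_new(("door", "isA", "constructionMaterial")))  # already in jargon store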
The general language triple store (323) is composed of structured definitions of words not special to any particular knowledge domain, in which each structured definition of the general language store is composed of a triple of the description logic. The jargon triple store (325) is composed of structured definitions of words that are special to a particular knowledge domain, where each structured definition of the jargon triple store is composed of a triple of the description logic. Both triple stores (323, 325) in embodiments can be stored in contiguous memory, either in a same segment of contiguous memory or in separate segments of contiguous memory.
And triple stores can include more than one jargon. User (128), for example, can be a tele-agent in a call center who is expected to field calls from customers in health care, law offices, and residential construction. The user needs help on the phone in three specialized terminologies, not just one. At least some embodiments, therefore, include selecting (301), from among a plurality of jargon triple stores, the jargon triple store (325) for use in any particular conversation. The user can simply press or click a button in a dashboard on a GUI to select among jargons. The correct jargon can be inferred from the industry specification or SIC code of the company with which the user is dealing in the dashboard, that is, the company with whom the tele-agent is speaking. The correct jargon can also be inferred from usage, by asserting queries with parsed triples against multiple jargon triple stores and selecting the jargon triple store with the most hits. Inferring from usage may mean that the user need do nothing: the user might not even have the dashboard up on screen, or the user might have up a dashboard at the beginning of a new conversation that is unrelated to the subject area of the new conversation.
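A minimal Python sketch of inferring the correct jargon from usage follows; the store names, their contents, and the scoring function are assumptions for illustration only.

    def select_jargon_by_usage(parsed_triples, jargon_stores):
        # Return the name of the jargon store matching the most parsed triples.
        hits = {name: 0 for name in jargon_stores}
        for triple in parsed_triples:
            for name, store in jargon_stores.items():
                if triple in store:
                    hits[name] += 1
        return max(hits, key=hits.get)

    jargon_stores = {
        "healthCare": {("stent", "isA", "implant")},
        "residentialConstruction": {("door", "isA", "constructionMaterial"),
                                    ("joist", "isA", "framingMember")},
    }
    conversation = [("door", "isA", "constructionMaterial"),
                    ("joist", "isA", "framingMember")]
    print(select_jargon_by_usage(conversation, jargon_stores))  # residentialConstruction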
In some embodiments, the general language triple store (323) and the jargon triple store (325) are logically-connected segments of a language-specific triple store. For further explanation,
In other embodiments, the general language triple store and the jargon triple store are implemented in separate stores and separate graphs. For further explanation,
Queries against a jargon triple store in a graph like graph (201) require a clause specifying the particular jargon against which the query is issued:
Query:
Response: :definition(244)
Queries against jargon triple stores in separate graphs like those shown in
Query on graph (240):
Response: :definition(244)
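By way of illustration only, here is a toy, pure-Python analogue of the two layouts just described; it is not the SPARQL of the drawings, and the predicate names “hasDefinition” and “inJargon” and the store contents are assumptions.

    # Layout 1: one language-specific store; jargon membership is itself a
    # triple, so a query must include a clause naming the jargon.
    language_store = {
        ("door", "hasDefinition", "a hinged barrier at the entry of a building"),
        ("door", "inJargon", "constructionJargon"),
        ("tort", "hasDefinition", "a civil wrong giving rise to liability"),
        ("tort", "inJargon", "legalJargon"),
    }

    def define_in_jargon(store, word, jargon):
        # The jargon-naming clause: only return a definition if the word is
        # recorded as belonging to the requested jargon.
        if (word, "inJargon", jargon) in store:
            return {o for (s, p, o) in store if s == word and p == "hasDefinition"}
        return set()

    # Layout 2: separate stores, one per jargon; the query is issued against a
    # particular jargon's graph, so no jargon clause is needed inside the query.
    construction_jargon = {
        ("door", "hasDefinition", "a hinged barrier at the entry of a building"),
    }

    def define(store, word):
        return {o for (s, p, o) in store if s == word and p == "hasDefinition"}

    print(define_in_jargon(language_store, "door", "constructionJargon"))
    print(define(construction_jargon, "door"))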
For further explanation,
The method of
These queries (339, 340) ask only whether the parsed triple is presently recorded in a triple store. They do not ask for definitions or inferences. These queries (339, 340) are semantic queries that take a form similar to the data they query. Take for example the connected triples describing Bob and the Mona Lisa set forth above, expressed here in an abstract syntax with (subject predicate object) separated by spaces, as in (Bob isAFriendOf Alice).
In an example where the parsed triple asserts (Bob isAFriendOf Mike), then a query of the form ASK {(Bob isAFriendOf Mike)} returns “No.” The ASK syntax is fashioned after SPARQL, and it reports only success or failure in matching the query, nothing else. In an example where the parsed triple asserts (Bob isAFriendOf Alice), then the query ASK {(Bob isAFriendOf Alice)} returns “Yes.” Thus, the queries (339, 340) ask only whether the parsed triple is presently recorded in one of the triple stores (323, 325).
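A minimal Python sketch of such ASK-style queries follows, using the rdflib library; the example.org namespace and the graph contents are assumptions for illustration only.

    from rdflib import Graph, Namespace

    EX = Namespace("http://example.org/")
    store = Graph()
    store.add((EX.Bob, EX.isAFriendOf, EX.Alice))   # a recorded triple

    def is_recorded(query):
        # An ASK query reports only success or failure in matching, nothing else.
        return store.query(query, initNs={"ex": EX}).askAnswer

    print(is_recorded("ASK { ex:Bob ex:isAFriendOf ex:Mike }"))   # False -> "No"
    print(is_recorded("ASK { ex:Bob ex:isAFriendOf ex:Alice }"))  # True  -> "Yes"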
The method of
Query (341) is different from the queries (339, 340) discussed just above. Query (341) does not ask merely whether data is in a store; query (341) asks for the data itself to be returned. Thus, again with reference to Bob and the Mona Lisa, a query that requests the predicate and object of every triple in which Bob is the subject returns the content of all four of Bob's triples: that Bob is a person, is a friend of Alice, is born on the 4th of July 1990, and is interested in the Mona Lisa. Further queries of the same kind can request, for example, the subjects and predicates of triples in which the Mona Lisa is the object, or the object of the triple in which the Mona Lisa is the subject, and in each case the query engine returns the matching data itself rather than a mere confirmation that the data exists.
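A minimal Python sketch of such data-returning queries follows, again using rdflib and the illustrative Bob triples; the identifiers are assumptions for the example.

    from rdflib import Graph, Literal, Namespace

    EX = Namespace("http://example.org/")
    store = Graph()
    store.add((EX.Bob, EX.isA, EX.person))
    store.add((EX.Bob, EX.isAFriendOf, EX.Alice))
    store.add((EX.Bob, EX.isBornOn, Literal("the 4th of July 1990")))
    store.add((EX.Bob, EX.isInterestedIn, EX.theMonaLisa))
    store.add((EX.theMonaLisa, EX.wasCreatedBy, EX.LeonardoDaVinci))

    # Request the predicate and object of every triple in which Bob is the subject.
    for predicate, obj in store.query(
            "SELECT ?p ?o WHERE { ex:Bob ?p ?o }", initNs={"ex": EX}):
        print(predicate, obj)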
For further explanation,
The method of
Consider a use case in which a user who is a tele-agent in a call center is discussing with a customer a prospective sale of residential construction materials. The NLP-SR engine typically provides as digitized speech words organized in sentences and identified as parts of speech. The parsing process then can, for example, take the subject, the verb, and a direct object from a sentence and parse them into a triple with a subject, predicate, and object. From this sentence, for example, “Doors are construction materials,” the parser can produce (door isA constructionMaterial), and pass that triple along to the rest of the process as a candidate parsed triple (752) for inclusion in the jargon store (325). But these assertions regarding doors are more difficult:
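    A door is a gateway in time.
    The Doors were a rock band.
    The Doors Of Perception is a book.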
It is hard to tell why such results originate in a conversation about residential construction. Perhaps the speech engine made a mistake. Perhaps the users wandered off topic. All of which makes little difference. The point is that the parser will produce triples from these words that are not pertinent to residential construction. Such triples are likely not in the jargon triple store, but also such triples may not be in the general language triple store. In which case, such triples become candidates for inclusion in the jargon triple store despite the fact that such triples do not really belong in the jargon triple store.
The confirmation process (328) can query a technical dictionary (336) of construction terms. The technical dictionary is unstructured text, no semantics, no triples, but it does contain searchable definitions. If “gateway in time,” “rock band,” or “The Doors Of Perception” do not appear in definitions of doors, the confirmation process can conclude (330) that such triples do not belong in the jargon triple store (325) and loop (312) to continue processing. The confirmation process (328) can also query a user (334) through, for example, a GUI interface or a speech interface, presenting a candidate parsed triple and asking whether to add it to the jargon triple store. If the user confirms (332), then the process records (322) the triple and loops (312). If the user does not confirm (330), the process loops (312) without recording (322).
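A minimal Python sketch of this confirmation step follows; the dictionary text, the triple spellings, and the injected user prompt are assumptions for illustration only.

    technical_dictionary = {
        "door": "a hinged or sliding barrier used to close an entrance in "
                "residential construction",
    }

    def confirm(candidate_triple, ask_user):
        # Confirm a candidate triple against an unstructured technical
        # dictionary; failing that, ask the user through a GUI or speech
        # prompt, here injected as a plain callable.
        subject, _, obj = candidate_triple
        definition = technical_dictionary.get(subject, "")
        if obj.replace("_", " ").lower() in definition.lower():
            return True                    # confirmed from the dictionary text
        return ask_user(candidate_triple)  # fall back to asking the user

    # "barrier" appears in the definition of door, so this triple is confirmed.
    print(confirm(("door", "isA", "barrier"), ask_user=lambda t: False))      # True
    # "rock band" does not appear, so the user is asked and here declines.
    print(confirm(("door", "isA", "rock_band"), ask_user=lambda t: False))    # False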
It will be understood from the foregoing description that modifications and changes may be made in various embodiments of the present invention without departing from its true spirit. The descriptions in this specification are for purposes of illustration only and are not to be construed in a limiting sense. The scope of the present invention is limited only by the language of the following claims.