Natural language processing (NLP) is a subfield of artificial intelligence (AI) concerned with technology that interprets natural language inputs. Natural language understanding (NLU) is a subfield of NLP, where NLU is concerned with technology that interprets natural language inputs. There is an ever-growing need in the art for improved NLU technology.
One of the pressing challenges in NLP and NLU is how to machine-recognize higher level meanings that are present within a natural language input. In many instances, when an AI system is deciding on how to best respond to a given natural language input, it is helpful that the AI system be able to recognize the higher level meaning of that input before it can respond appropriately. For example, if the AI system includes a natural language generation (NLG) component that produces a natural language output in response to the natural language input, it will be helpful for the NLG component to know the higher level meanings associated with natural language inputs when deciding what information should be presented to a user. NLG is a subfield of artificial intelligence (AI) concerned with technology that produces language as output on the basis of some input information or structure (e.g., where the input constitutes data about a situation to be analyzed and expressed in natural language).
The inventors disclose a number of technical innovations in the NLU arts that provide adaptive mechanisms for learning concepts that are expressed by natural language sentences, and then apply this learning to appropriately classify new natural language sentences with the relevant concepts that they express.
In an example embodiment, a training process operates on concept-labeled sentences and employs new rules that discriminate between different concepts based on sentence composition structure. Different concepts can have their own associated set of rules that are used by a classifier to then classify new sentences as either expressing a known concept or being unclassified.
In an example embodiment, unclassified sentences can be analyzed based on defined criteria such as their root verbs to form clusters of unclassified sentences. These sentence clusters can then be reviewed manually by users to assess if there any commonalities that would facilitate further classification by users.
In another example embodiment, the terms used in sentences can be analyzed to assess their uniqueness relative to a training corpus for the system. Terms with high uniqueness scores can then be reviewed through user interfaces, and mechanisms can be provided for adding selected terms to an ontology for the system if a user deems such an addition to be appropriate.
Example embodiments also disclose various user interfaces for reviewing and adapting how the system classifies sentences and updates the ontology. For example, concept classifications can be added to unclassified sentences in response to user inputs through the user interfaces; and newly classified sentences can then be used to re-train the classifier to adapt the system to better determine appropriate concept classifications for documents.
Further still, by using NLU as described herein to learn how different concepts are expressed in natural language, an AI system can use this information to improve how an NLG system produces natural language outputs that express such concepts.
For example, the term discovery mechanisms described herein can be used to populate and teach the NLG system's ontology about new entity types and/or new expressions for existing entity types. In doing so, the NLU techniques described herein can be used to improve the language output of an NLG system.
As another example, the NLG system may use NLG templates to produce language output that expresses a given concept, and these NLG templates may have counterparts in the concepts recognized by the NLU system described herein (where these NLU concepts have corresponding discrimination rules used by the NLU system to recognize such concepts in documents). The NLG templates can then be linked to the discrimination rules described herein that that share the same concepts, and these linked NLG templates can then be prioritized by the NLG system to be selected more frequently when producing language outputs in order to structure any language output in a manner that conforms to the NLU-recognized concept expressions. An example of an approach to link NLG templates with discrimination rules would be to generate a set of all tokens in all discrimination rules for a concept (Set A), generate a set of all tokens in each NLG template for a concept (Set B), and then perform a set intersection operation (A∩B) for each NLG template. The largest set resulting from the intersection would then be selected to link the subject NLG template with the subject discrimination rules. Also, if desired, a weighting mechanism could also be employed whereby if the same token is present in multiple discrimination rules this would result in set intersections which contain that token would be weighted more heavily than others.
As yet another example, the NLU techniques described herein that recognize and develop rules for recognizing concepts expressed by natural language sentences can be linked with additional NLG training techniques whereby an NLG system is trained to produce language output that resembles training inputs. An example of such an NLG training system is described in U.S. patent application 62/691,197 (entitled “Applied Artificial Intelligence for Using Natural Language Processing to Train a Natural Language Generation System”, filed Jun. 28, 2018) (see also U.S. patent application Ser. No. 16/444,649 (entitled “Applied Artificial Intelligence Technology for Using Natural Language Processing and Concept Expression Templates to Train a Natural Language Generation System”, filed Jun. 18, 2019, now U.S. Pat. No. 10,706,236)), each referenced below. The NLU system described herein can be used to recognize and tag input sentences with given concepts, and a decision can then be made as to what concepts and which concept-tagged sentences should be used to train the NLG system. Thus, one or more of the concept-tagged sentences recognized by the NLU system can then be fed into the NLG system to train the NLG system on how to produce language output for a given concept that stylistically resembles the concept-tagged input sentence.
Through these and other features, example embodiments of the invention provide significant technical advances in the NLU arts by harnessing computer technology to improve how the expression of concepts within sentences are recognized via machine processing.
The AI platform 102 analyzes unstructured text and identifies new forms of expressions for known NLG concepts and ontological entity types. The AI platform 102 can also discover entirely new concepts and entity types. The AI platform 102 presents its findings to users via a UI 122 that allows users to refine the system's discovery mechanism, as well as expedite the addition of new ontological entity types to an underlying NLG platform.
The AI platform 102 can interoperate with an NLG computer system as discussed above and below to improve the operation of the NLG computer system. An example of NLG technology that can be used as the NLG system 108 is the QUILL™ narrative generation platform from Narrative Science Inc. of Chicago, Ill. Aspects of this technology are described in the following patents and patent applications: U.S. Pat. Nos. 8,374,848, 8,355,903, 8,630,844, 8,688,434, 8,775,161, 8,843,363, 8,886,520, 8,892,417, 9,208,147, 9,251,134, 9,396,168, 9,576,009, 9,697,178, 9,697,197, 9,697,492, 9,720,884, 9,720,899, 9,977,773, 9,990,337, and 10,185,477; and U.S. patent application Ser. No. 15/253,385 (entitled “Applied Artificial Intelligence Technology for Using Narrative Analytics to Automatically Generate Narratives from Visualization Data, filed Aug. 31, 2016), 62/382,063 (entitled “Applied Artificial Intelligence Technology for Interactively Using Narrative Analytics to Focus and Control Visualizations of Data”, filed Aug. 31, 2016), Ser. No. 15/666,151 (entitled “Applied Artificial Intelligence Technology for Interactively Using Narrative Analytics to Focus and Control Visualizations of Data”, filed Aug. 1, 2017), Ser. No. 15/666,168 (entitled “Applied Artificial Intelligence Technology for Evaluating Drivers of Data Presented in Visualizations”, filed Aug. 1, 2017), Ser. No. 15/666,192 (entitled “Applied Artificial Intelligence Technology for Selective Control over Narrative Generation from Visualizations of Data”, filed Aug. 1, 2017), 62/458,460 (entitled “Interactive and Conversational Data Exploration”, filed Feb. 13, 2017), Ser. No. 15/895,800 (entitled “Interactive and Conversational Data Exploration”, filed Feb. 13, 2018), 62/460,349 (entitled “Applied Artificial Intelligence Technology for Performing Natural Language Generation (NLG) Using Composable Communication Goals and Ontologies to Generate Narrative Stories”, filed Feb. 17, 2017), Ser. No. 15/897,331 (entitled “Applied Artificial Intelligence Technology for Performing Natural Language Generation (NLG) Using Composable Communication Goals and Ontologies to Generate Narrative Stories”, filed Feb. 15, 2018), Ser. No. 15/897,350 (entitled “Applied Artificial Intelligence Technology for Determining and Mapping Data Requirements for Narrative Stories to Support Natural Language Generation (NLG) Using Composable Communication Goals”, filed Feb. 15, 2018, now U.S. Pat. No. 10,585,983), Ser. No. 15/897,359 (entitled “Applied Artificial Intelligence Technology for Story Outline Formation Using Composable Communication Goals to Support Natural Language Generation (NLG)”, filed Feb. 15, 2018), Ser. No. 15/897,364 (entitled “Applied Artificial Intelligence Technology for Runtime Computation of Story Outlines to Support Natural Language Generation (NLG)”, filed Feb. 15, 2018), Ser. No. 15/897,373 (entitled “Applied Artificial Intelligence Technology for Ontology Building to Support Natural Language Generation (NLG) Using Composable Communication Goals”, filed Feb. 15, 2018), Ser. No. 15/897,381 (entitled “Applied Artificial Intelligence Technology for Interactive Story Editing to Support Natural Language Generation (NLG)”, filed Feb. 15, 2018), 62/539,832 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Based on Analysis Communication Goals”, filed Aug. 1, 2017), Ser. No. 16/047,800 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Based on Analysis Communication Goals”, filed Jul. 27, 2018), Ser. No. 16/047,837 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Based on a Conditional Outcome Framework”, filed Jul. 27, 2018), 62/585,809 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Based on Smart Attributes and Explanation Communication Goals”, filed Nov. 14, 2017), Ser. No. 16/183,230 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Based on Smart Attributes”, filed Nov. 7, 2018), Ser. No. 16/183,270 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Based on Explanation Communication Goals”, filed Nov. 7, 2018), 62/618,249 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Using an Invocable Analysis Service”, filed Jan. 17, 2018), Ser. No. 16/235,594 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Using an Invocable Analysis Service”, filed Dec. 28, 2018), Ser. No. 16/235,636 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Using an Invocable Analysis Service with Analysis Libraries”, filed Dec. 28, 2018), Ser. No. 16/235,662 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Using an Invocable Analysis Service and Data Re-Organization”, filed Dec. 28, 2018), Ser. No. 16/235,705 (entitled “Applied Artificial Intelligence Technology for Narrative Generation Using an Invocable Analysis Service and Configuration-Driven Analytics”, filed Dec. 28, 2018), 62/632,017 (entitled “Applied Artificial Intelligence Technology for Conversational Inferencing and Interactive Natural Language Generation”, filed Feb. 19, 2018), Ser. No. 16/277,000 (entitled “Applied Artificial Intelligence Technology for Conversational Inferencing”, filed Feb. 15, 2019), Ser. No. 16/277,003 (entitled “Applied Artificial Intelligence Technology for Conversational Inferencing and Interactive Natural Language Generation”, filed Feb. 15, 2019), Ser. No. 16/277,004 (entitled “Applied Artificial Intelligence Technology for Contextualizing Words to a Knowledge Base Using Natural Language Processing”, filed Feb. 15, 2019), Ser. No. 16/277,006 (entitled “Applied Artificial Intelligence Technology for Conversational Inferencing Using Named Entity Reduction”, filed Feb. 15, 2019), Ser. No. 16/277,008 (entitled “Applied Artificial Intelligence Technology for Building a Knowledge Base Using Natural Language Processing”, filed Feb. 15, 2019), 62/691,197 (entitled “Applied Artificial Intelligence for Using Natural Language Processing to Train a Natural Language Generation System”, filed Jun. 28, 2018), Ser. No. 16/444,649 (entitled “Applied Artificial Intelligence Technology for Using Natural Language Processing and Concept Expression Templates to Train a Natural Language Generation System”, filed Jun. 18, 2019, now U.S. Pat. No. 10,706,236), Ser. No. 16/444,689 (entitled “Applied Artificial Intelligence Technology for Using Natural Language Processing to Train a Natural Language Generation System With Respect to Numeric Style Features”, filed Jun. 18, 2019), Ser. No. 16/444,718 (entitled “Applied Artificial Intelligence Technology for Using Natural Language Processing to Train a Natural Language Generation System With Respect to Date and Number Textual Features”, filed Jun. 18, 2019), and Ser. No. 16/444,748 (entitled “Applied Artificial Intelligence Technology for Using Natural Language Processing to Train a Natural Language Generation System”, filed Jun. 18, 2019); the entire disclosures of each of which are incorporated herein by reference.
In an example embodiment, the AI platform 102 takes a text document 104 as input. The document 104 may comprise one or more sentences in a natural language. The AI platform 102 can perform any of the following forms of analysis via a concept classifier 106, a sentence clusterer 114, and/or a term analyzer 118:
The AI platform 102 can be adaptive via a training mechanism through which the classifier 106 learns how to recognize sentences that express known concepts. As used herein, “concept” can refer to a higher level meaning that is expressed by a sentence beyond the literal meaning of the words in the sentence. For example, a given sentence can include the literal words: “In 2018, the sales team improved their sales 10% over their benchmark.” This sentence can be characterized as expressing the concept of “Deviation from Target” because this concept encapsulates a higher level meaning expressed by the sentence. A concept can be explicitly represented in the AI system by a combination of (1) analytics and logic for recognizing the concept, and (2) language that is used to express the concept. For example, the “Deviation from Target” concept can be explicitly represented by analytics that determine how a metric is tracking to a goal (or determine how the spread from the metric to its goal changes over time). The “Deviation from Target” concept can also be explicitly represented by logic that determines what in a data set is most relevant to express when describing how the metric deviates from the target. For instance, if a salesperson was above his or her goal for 90% of the time, it may be desirable for the NLG system to product a sentence that describes, on average, how much higher than the target the salesperson was. Then the “Deviation from Target” can be explicitly represented by language that expresses that content. Additional examples of concepts that can expressed by sentences and recognized by the AI platform 102 can include, without limitation, (1) average value, (2) average over time frame”, (3) count contributors, (4) deviation drivers, (5) deviation from others, (6) deviation from self, (6) latest value, (7) maximum within time frame, (8) minimum within time frame, (9) outlier assessment, (10) project against target, (11) runs comparison, and/or (12) total across time frame. To support this training, a training corpus of concept-labeled sentences can be processed as described in
In an example embodiment, the classifier 106 operates using string match rules. These rules define matching operations that are targeted toward particular strings of text, and they can operate in a manner similar to regular expressions but with more restricted semantics. In example embodiments, the string match rules describe co-occurrence, adjacency, and order within a text string. String match rules may be comprised of the following types of token components:
As an example, we can consider the following 3 string match rules:
The following sentence will be matched by only the first rule above: “Higher costs contributed to the decrease in profit.” This sentence includes the root word “contribute” as a verb; but it does not includes the noun “decline” (or have a number), which causes the second and third rules to fail.
The following sentence will be matched by only the first two rules above: “Higher costs contributed to a decline in profit.” This sentence includes the root word “contribute” as a verb (causing a hit on Rule 1), and it also includes both the root word “contribute” as a verb and the root word “decline” as a noun (causing a hit on Rule 2); but it does not include a number), which causes the third rule to fail.
The following sentence will be matched by all three of the rules above: “Higher costs contributed to a decline in profit by 50%.”. With this sentence, the inclusion of the numeric value (50%) also caused the third rule to be a hit in addition to the first two rules.
A.1: Rule Induction:
As noted above,
At step 204, the classifier tokenizes the selected sentence string and tags each token in the sentence with its part-of-speech pair to thereby convert the sentence string into a list of components, including (token, part-of-speech) pairs. This step tags each token with its part-of-speech. This step also converts all numeric values to {NUM} tokens and converts all expressions of ontological entity types to {ENT} tokens. To perform step 204, the classifier 204 can use an NLP library technology such as the Explosion AI's Spacy tool.
As an example, with reference to
Next, at step 206, the classifier 106 creates a set of all permutations of the token components of the token string generated by step 204. As part of this operation, an index integer can be associated with each token to maintain a record of order, which can later be used to determine adjacency.
At step 208, the classifier 106 generates a set of string match rules from the permutation set. As part of this step, the different permutations of the permutation set are compared to a stoplist that seeks to remove rules that target non-salient components of a sentence. For example, the stoplist can be like the Natural Language Toolkit (NLTK) stoplist that filters out words such as “a”, “the”, etc., or isolated prepositions or prepositional phrases that are not anchored to a reference. The stoplist can also filter out rules that are comprised solely of a numeric token. Further still, at step 208, for any rules whose token components are non-sequential based on their indexing values, the classifier 106 can insert the span operator token, { . . . }.
At step 210, the classifier 106 discards conflicting rules after comparing the string match rules of the rule set produced by step 206 against all of the string match rules generated from other sentences in the classifier 106. If the same rule was generated for a sentence labeled with a different concept, then that rule is marked as invalid because it will not be helpful when distinguishing among concepts. Once marked as invalid, future occurrences of that invalid rule can also be discarded. To support step 210, the classifier 106 can interact with classifier database 110 to access the rules generated from other sentences and their associated concept labels. If step 210 results in all of the rules of the rule set for the subject sentence being discarded, this would result in the subject sentence being deemed unclassifiable, and the process flow could then jump to step 214.
At step 212, the classifier 106 sorts the valid rules according to defined sorting criteria and then removes the valid rules that are redundant in that they do not add to the classifier's ability to distinguish between concepts. For example, if the valid rule set from step 210 includes 2 rules linked to Concept X, and both of those rules operate to produce matches on the same set of sentences (there are no sentences linked to Concept X that match on Rule 1 but nor Rule 2 and vice versa), then the system can remove one of the rules as being redundant.
This sorting step 320 can produce a sorted rule set 314 as shown by
The process flow of
At step 326, the classifier selects the next sorted rule (Rule k+1), which in this example can be sorted Rule 2. At step 328, the classifier tests selected Rule 2 against all of the sentences in the training corpus 200 that are labeled with the subject concept. This testing produces a set of sentences that match against Rule 2, which can defined as Set 2.
At step 330, the classifier compares Set 1 with Set 2. If these two sets have the same composition of sentences, this means that Rule 2 is redundant to Rule 1, and Rule 2 can be discarded (step 332). However, it should be understood that a practitioner could also design the classifier to instead discard Rule 1 in this situation. However, if the two sets do not have the same compositions of sentences, this means that Rule 2 is not redundant to Rule 1, in which case Rule 2 can be retained as part of the rule set for the subject concept (step 334).
At step 336, the classifier checks for whether there are more rules in the sorted rule set 314 to be assessed for redundancy. If there are, the classifier can increment k (step 338) to go to the next rule (e.g., Rule 3) and return to step 326. In this fashion, Rule 3 can also be tested for redundancy against Rule 1 (and so on for the other rules of the sorted rule set 312). Once all rules have been redundancy tested, the classifier produces rule set 305 for the subject concept, where rule set 350 is an optimal rule set for testing sentences to determine whether they express the subject concept. In this example, optimal rule set 350 includes two rules as shown by
It should be understood that the
At step 214, the classifier checks whether there are more concept-labeled sentences in the training corpus 200 to be processed. If so, the process flow returns to step 202 so that a rule set can be induced from the next sentence in the training corpus 200. Once all of the sentences in the training corpus 200 have been processed through steps 202-212, the classifier will have rule sets for each of the concepts recognized within the training corpus, and the process flow can proceed to step 216.
At step 216, the classifier generates a classification structure based on the valid rules for each of the concepts that were used to label the training sentences. This classification structure can then be used to process new sentences and determine whether any of the new sentences are fits with any of the recognized concepts. The classification structure can take the form of a prefix tree data structure that are loaded with the optimal rule sets produced by step 212 for the different recognized concepts.
Accordingly, it should be understood that the
A.2: Custom Rules:
The classifier 106 may also support an ability to define custom, human-intuited string match rules. With a custom rule, a user can enter a specific string match rule as a sequence of tokens (as discussed above), and then pair that specific string match rule with a concept. The classifier 106 can give precedence to custom rules over the induced rules produced by the
A.3: Classifier Operation:
The classifier 106 can then operate to classify new documents using a process flow such as that shown by
If step 402 finds a match, then the process flow proceeds to step 404. At step 404, the classifier 404 labels the selected sentence with the concept corresponding to the matching hit within the classification structure. Thus, the selected sentence becomes associated with a concept that the classifier deems the sentence to express.
If step 402 does not find a match, then the process flow proceeds to step 406. At step 406, the selected sentence is labeled as unclassified. This means that the sentence is not recognized as matching a known concept. As described below, unclassified sentences can be applied to a sentence clusterer 114 to further extract information from them that may be helpful to a user.
From steps 404 and 406, the process flow progresses to step 408. At step 408, the classifier checks for another sentence in the input document 104. If another sentence is present, the process flow can return to step 400. Otherwise, the process flow can terminate.
Thus,
B. Sentence Clustering:
The AI platform 102 can also support the clustering of unclassified sentences. By grouping together unclassified sentences that are deemed similar according to defined criteria, the sentence clusterer 114 allows users to review the unclassified sentences in related clusters that allows users to make qualitative judgments as to any significance to the commonly-grouped unclassified sentences. For example, such clustering may allow the user to recognize a new concept that may be expressed by one or more of these sentence clusters. In an example embodiment, the sentence clusterer 114 uses the sentences' root verbs as the heuristic criteria for clustering. However, it should be understood that other criteria could be employed. For example, the system could use machine learning techniques to identify unclassified sentences with similar structures, and use that as the basis for sentence clustering. As another example, different words (or groups of words) in the sentence could be used for clustering, such as the subject noun.
At step 502, the clusterer creates a dependency parse tree of the selected sentence. This will produce a traversable tree structure for the sentence, where the tree structure includes nodes can take the form of (token, part-of-speech) pairs. As an example, a tool such as Explosion AI's open-source Spacy tool can be used at step 502 to create the dependency parse tree. However, other tools such as Stanford's CoreNLP and Google's cloud NLP tools could be used for dependency parsing if desired by a practitioner.
At step 504, the clusterer identifies the root verb of the selected sentence based on its dependency parse tree. To do so, the dependency parse tree can be traversed breadth-first until the first VERB node is encountered. The corresponding token for this verb can be identified as the root verb for the sentence. With respect to the example of
At step 506, the clusterer checks for more unclassified sentences in pool 112. If there is another unclassified sentence to be processed, the process flow returns to step 500 for a repeat of steps 502-504 on the next unclassified sentence. Once the clusterer has performed steps 502 and 504 on all of the unclustered sentences in the pool 112, the clusterer will have identified a root verb for each of those unclustered sentences, and the process flow can proceed to step 508.
At step 508, for each different root verb identified at step 504 for the various unclassified sentences, the clusterer groups the unclassified sentences that share the same root verb. This produces a set of sentence clusters 116, where each cluster 166 is linked to a particular root verb and includes all of the sentences that share that root verb. These sentence clusters can then be reviewed by a user through the UI 122 to assess whether any adjustments to the system are needed. If desired, a practitioner can set a minimum population requirement for a sentence cluster for a sentence cluster to be tagged as such in the system. Any unclassified sentences that are sorted into groups below the population count could then be discarded. For example, a minimum population requirement for a cluster 116 could be 3 sentences. However, it should be understood that if desired by a practitioner, a sentence cluster could include only a single sentence.
C. Term Discovery:
The AI platform 102 can also support the discovery of terms in the document 104 that are distinguishable from the terms found in the training corpus 200. This will allow users to audit the document's most unique terms and decide if the term can be used to express a new or existing ontological entity. Term analyzer 118 can thus process an input document 104 in combination with information learned by classifier 106 to generate a list of significant terms 120 for review through UI 122.
At step 700, the system operates on the training corpus 200. Step 700 can be performed by classifier 106 and/or term analyzer 118 depending on the desires of a practitioner. For the labeled sentences processed by the classifier 106, step 700 identifies the terms that appear in those training sentences. For each term, a count is maintained for the number of sentences in which each term appears. This count can be referred as a term's Document Frequency (DF). Thus, step 700 produces a DF value that is associated with each term in the training corpus 200. Step 700 can be performed as part of ingesting the document(s) 104 of the training corpus, where as part of this ingestion, the AI platform can split the document into sentences, and then for each term in the document, it can count the number of sentences that contain that term. The resulting total is then used to dynamically update the DF counts for the training corpus 200. After the DF counts are updated for a given document 104, the process flow can proceed to step 702.
At step 702, the term analyzer selects an input document 104. This document is then tokenized and part-of-speech tagged as described above in connection with steps 204 and 402.
At step 706, for each term in the input document 104, the term analyzer generates a count of that term's frequency in that document. This frequency count can be referred to as a term's Term Frequency (TF). Thus, step 706 produces a TF value that is associated with each term in document 104.
At step 708, for each term in the input document 104, the term analyzer computes a score that measures the uniqueness of that term relative to the training corpus 200, where this computation uses the term's associated DF and TF values. This uniqueness score can be referred as a TFIDF score. In an example embodiment, the TFIDF score for a given term can be computed according to the formula:
It can be seen that this scoring metric will produce larger scores for terms that have lower DF scores than for terms which have higher DF scores. For example, at the farthest extreme, if a given term has the maximum possible DF (where the DF score matches the number of documents in the training corpus), it can be seen that the log term of the formula will reduce to zero (log(1)), in which case the TFIDF score will be zero regardless of how high the TF score is. Thus, step 708 will produce a TFIDF score for each of the terms in the subject document 104.
At step 710, the term analyzer sorts the terms of document 104 by their TFIDF scores. Then, at step 712, the term analyzer can discard the terms whose TFIDF scores fall below a defined threshold. A practitioner can set this threshold to a value deemed useful for the system (e.g., a threshold of 0.1; although other values could be used). If desired, no threshold could be employed, and the system could report only a ranking of terms by their TFIDF scores so that a user can focus on the most unique terms if desired.
D. User Interfaces:
The AI platform 102 can support a wide variety of UIs 122 for interacting with the system. Through these UIs, users can upload documents for training and/or analysis by the platform 102. For example, a browser interface can be provided for uploading text documents 104 into the system. The AI platform 102 can then analyze the document 104 using the components shown by
GUI 800 can be interactive with users in any of a number of ways. For example, users can interact with the sidebar to explore analysis results. Section 802 can include a list of each known concept recognized by the classifier 106 in the document. Section 804 can include a list of each cluster identified by the clusterer 114 in the document. Any clusters that are found can be identified by the corresponding root verb (e.g., see
In the example of
Further interactivity can be provided to users through the presented sentences of the document. For example, a user can interact with the GUI 800 to select a sentence within the presented document (e.g., by hovering over or clicking on the sentence) to access additional information about the sentence and actively update the system. For example, users can interact with the platform 102 through GUI 800 to change, remove, and/or create a new concept to associate with a given sentence.
In another powerful example embodiment, users can interact with the system through a GUI 122 to teach the platform new concepts “on the fly” via user-entered sentences.
The window 1202 can include a field 1204 that is populated with the selected term. This can serve as the name for a new entity type ontological element to be added to the ontology. Through field 1206, the user can define a base type for the new entity type (e.g., person, place, thing, etc.) (see also step 1220 of
The UIs 122 can also permit users to review the rules used by classifier 106.
E. Example Applications of NLU for NLG Training:
As discussed above, the NLU techniques described herein for AI platform 102 can be used to improve how NLG systems are trained. For example, the above-referenced and incorporated U.S. patent application Ser. No. 16/444,649 (now U.S. Pat. No. 10,706,236) describes how an NLG system can be trained to produce natural language output that is stylistically similar to a training natural language sentence. As described with reference to
As another example,
For example, above-referenced and incorporated U.S. patent application Ser. No. 16/444,649 discloses a trainable NLG system 1408 that uses NLP to detect a plurality of linguistic features in training data, wherein the training data comprises a plurality of words arranged in a natural language. These detected linguistic features are then aggregated into a specification data structure that is arranged for training the NLG system to produce natural language output that stylistically resembles the training data. This specification data structure can comprise a machine-readable representation of the detected linguistic features. Parameters in the specification data structure can be linked to objects in an ontology used by the NLG system to facilitate the training of the NLG system based on the detected linguistic features. Additional details about example embodiments for specification data structures are provided by above-referenced and incorporated U.S. patent application Ser. No. 16/444,649.
In a particularly powerful example embodiment described by above-referenced and incorporated U.S. patent application Ser. No. 16/444,649, the detected linguistic features can include concept expression templates that model how a concept is expressed in the training data. Examples of concepts that can be modeled in this fashion from the training data include change concepts, compare concepts, driver concepts, and rank concepts. In an example embodiment, to detect and extract such concept expression templates from the training data, the training data can be scanned for the presence of one or more anchor words, where each anchor word is associated with a concept understood by the system. If an anchor word is present in the training data, the system can then process the training data to extract an expression template that models how the concept associated with the present anchor word is discussed in the training data. NLP parsing can be applied to the training data and linkages to NLG ontologies can be employed to facilitate this concept expression template extraction.
At step 1502, a processor extracts linguistic features from the ingested training data using a variety of pattern matchers and rule-based NLP heuristics, examples of which are discussed below and in above-referenced and incorporated U.S. patent application Ser. No. 16/444,649. Using these techniques, specific linguistic features can be detected in and extracted from each document, and each document can be converted into a data structure (e.g., a JSON data structure) that contains linguistic feature metadata.
At step 1504, a processor aggregates the extracted linguistic features produced from the documents at step 1502 by iterating over the document-specific data structures. This can include deriving totals, percentages, grouping, and sorting, which operates to produce a specification data structure (e.g., a JSON specification data structure, which is a machine-readable description of the linguistic features extracted from the ingested training data.
At step 1506, a user interface (e.g., a browser-based graphical user interface (GUI)) can process the specification data structure and present a user with the linguistic features discovered by steps 1502 and 1504. Through the user interface, the user can elect to discard any of the discovered linguistic features. In example embodiments, the user can also enter custom sentences into the user interface to add additional ontological vocabulary to the system and/or add concept expressions to the specification. However, as noted above, such user interaction can be omitted if desired by a practitioner.
At step 1508, a processor configures the NLG system 1408 based on the specification data structure to thereby train the NLG system 1408 to produce language that stylistically resembles the training data 1406. In an example embodiment, a platform-specific applicator can take the JSON specification data structure (and any user preferences) as inputs and update the appropriate configuration within the NLG system 1408.
The NLG system 1408 can then use the specification data structure to update its configuration information to control how it produces natural language output 1412. In an example embodiment, the NLG system 1408 can produce NLG output 1412 about a data set based on defined configurations such as parameterized communication goal statements, for example using the techniques described in one or more of the above-referenced and incorporated patents and patent applications.
The concept expressions class of linguistic features is concerned with the sequence of words or phrases used in the training data to express NLG concepts. Concept expressions pattern matchers 1630 can be used to infer the high level concepts that are expressed in the training data, and they thus represent a particularly powerful and innovative aspect that can be employed in example embodiments of trainable NLG system 1408. Examples of concepts that can be detected by pattern matchers 1630 include:
The concept expressions pattern matchers 1630 can use metadata derived from NLP tools and a series of rule-based heuristics to identify candidates for concept expressions, ultimately producing an annotated template that can be structurally compatible with the NLG system 1408.
The system can be configured to assume that all concept expressions contain an anchor word, a single or compound word that is globally unique to a particular concept. The system can then use occurrences of these anchor words to identify candidate phrases for template extraction. Examples of specific anchor words for several concepts are listed below.
For example, one or more change concept pattern matchers 1632 can be configured to detect the presence of any of the following anchor words in a training sentence. Upon detection of one of these anchor words, the subject training sentence can be categorized as a candidate for a change expression and get passed to template extraction logic 1650 (discussed below). Examples of anchor words for a change concept can include:
As another example, one or more compare concept pattern matchers 1634 can be configured to detect the presence of any of the following anchor words in a training sentence. Upon detection of one of these anchor words, the subject training sentence can be categorized as a candidate for a compare expression and get passed to template extraction logic 1650 (discussed below). Examples of anchor words for a compare concept can include:
As another example, one or more driver concept pattern matchers 1636 can be configured to detect the presence of any of the following anchor words in a training sentence. Upon detection of one of these anchor words, the subject training sentence can be categorized as a candidate for a driver expression and get passed to template extraction logic 1650 (discussed below). Examples of anchor words for a driver concept can include:
As another example, one or more rank concept pattern matchers 1638 can be configured to detect the presence of any of the following anchor words in a training sentence. Upon detection of one of these anchor words, the subject training sentence can be categorized as a candidate for a rank expression and get passed to template extraction logic 1650 (discussed below). Examples of anchor words for a rank concept can include:
Furthermore, while the examples discussed herein describe “change”, “compare”, “driver”, and “rank” concepts, it should be understood that a practitioner may choose to detect other concepts that could be present within training data. For example, any of “peaks and troughs” concepts, “volatility” concepts, “correlation” concepts, “prediction” concepts, “distribution” concepts, and others can also be detected using the techniques described herein. Following below are some additional examples of concepts that can be expressed in sentences and for which concept expression templates could be extracted using the techniques described herein:
Further still, while a single anchor word is used to assign a candidate concept classification to training sentences in the example embodiment discussed above, it should be understood that a practitioner could also use an anchor word in combination with additional metadata (such as part of speech tagging) or a combination of anchor words to infer concepts from training sentences. For example, a practitioner may conclude that the word “fewer” could be indicative of both a “change” concept and a “compare” concept, and additional words and/or rules could be used to further resolve which classification should be applied to the subject training sentence. As another example, the detection of a rank concept when the word “top” is present in the training data can be made dependent on whether “top” is being used in the subject sentence as an adjective (in which case the rank candidacy can get triggered) or as a noun (in which case the rank candidacy may not get triggered).
Once candidate phrases have been identified via the anchor word detection, the candidate phrases are then parsed and evaluated by template extraction logic 1650 before producing a concept expression template. The template creation process can employ a sequence of rule-based heuristics. For example,
1. Tokenizing a document into sentences
2. For each sentence:
The AI platform 102 can provide an API for programmatic interaction with the system and UIs. As an example, the API can be an HTTP REST API. As examples, the following frameworks can be used for a number of different programmatic interactions with the system and UIs.
While the invention has been described above in relation to its example embodiments, various modifications may be made thereto that still fall within the invention's scope. Such modifications to the invention will be recognizable upon review of the teachings herein.
This patent application claims priority to U.S. provisional patent application Ser. No. 62/797,787, filed Jan. 28, 2019, and entitled “Applied Artificial Intelligence Technology for Adaptive Natural Language Understanding”, the entire disclosure of which is incorporated herein by reference. This patent application is also related to (1) U.S. patent application Ser. No. 16/744,537, filed this same day, and entitled “Applied Artificial Intelligence Technology for Adaptively Classifying Sentences Based on the Concepts They Express to Improve Natural Language Understanding”, and (2) U.S. patent application Ser. No. 16/744,562, filed this same day, and entitled “Applied Artificial Intelligence Technology for Adaptive Natural Language Understanding with Term Discovery”, the entire disclosures of each of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
4992939 | Tyler | Feb 1991 | A |
5619631 | Schott | Apr 1997 | A |
5734916 | Greenfield et al. | Mar 1998 | A |
5802495 | Goltra | Sep 1998 | A |
6006175 | Holzrichter | Dec 1999 | A |
6144938 | Surace et al. | Nov 2000 | A |
6278967 | Akers et al. | Aug 2001 | B1 |
6289363 | Consolatti et al. | Sep 2001 | B1 |
6757362 | Cooper et al. | Jun 2004 | B1 |
6771290 | Hoyle | Aug 2004 | B1 |
6917936 | Cancedda | Jul 2005 | B2 |
6968316 | Hamilton | Nov 2005 | B1 |
6976031 | Toupal et al. | Dec 2005 | B1 |
7027974 | Busch et al. | Apr 2006 | B1 |
7246315 | Andrieu et al. | Jul 2007 | B1 |
7324936 | Saldanha et al. | Jan 2008 | B2 |
7333967 | Bringsjord et al. | Feb 2008 | B1 |
7496621 | Pan et al. | Feb 2009 | B2 |
7577634 | Ryan et al. | Aug 2009 | B2 |
7610279 | Budzik et al. | Oct 2009 | B2 |
7617199 | Budzik et al. | Nov 2009 | B2 |
7617200 | Budzik et al. | Nov 2009 | B2 |
7627565 | Budzik et al. | Dec 2009 | B2 |
7644072 | Budzik et al. | Jan 2010 | B2 |
7657518 | Budzik et al. | Feb 2010 | B2 |
7716116 | Schiller | May 2010 | B2 |
7778895 | Baxter et al. | Aug 2010 | B1 |
7836010 | Hammond et al. | Nov 2010 | B2 |
7840448 | Musgrove et al. | Nov 2010 | B2 |
7856390 | Schiller | Dec 2010 | B2 |
7865496 | Schiller | Jan 2011 | B1 |
7930169 | Billerey-Mosier | Apr 2011 | B2 |
8046226 | Soble et al. | Oct 2011 | B2 |
8311863 | Kemp | Nov 2012 | B1 |
8355903 | Birnbaum et al. | Jan 2013 | B1 |
8374848 | Birnbaum et al. | Feb 2013 | B1 |
8447604 | Chang | May 2013 | B1 |
8463695 | Schiller | Jun 2013 | B2 |
8494944 | Schiller | Jul 2013 | B2 |
8515737 | Allen | Aug 2013 | B2 |
8612208 | Cooper et al. | Dec 2013 | B2 |
8630844 | Nichols et al. | Jan 2014 | B1 |
8630912 | Seki et al. | Jan 2014 | B2 |
8630919 | Baran et al. | Jan 2014 | B2 |
8676691 | Schiller | Mar 2014 | B2 |
8688434 | Birnbaum et al. | Apr 2014 | B1 |
8762133 | Reiter | Jun 2014 | B2 |
8762134 | Reiter | Jun 2014 | B2 |
8775161 | Nichols et al. | Jul 2014 | B1 |
8812311 | Weber | Aug 2014 | B2 |
8843363 | Birnbaum et al. | Sep 2014 | B2 |
8886520 | Nichols et al. | Nov 2014 | B1 |
8892417 | Nichols et al. | Nov 2014 | B1 |
8892419 | Lundberg et al. | Nov 2014 | B2 |
8903711 | Lundberg et al. | Dec 2014 | B2 |
8977953 | Pierre | Mar 2015 | B1 |
9135244 | Reiter | Sep 2015 | B2 |
9208147 | Nichols et al. | Dec 2015 | B1 |
9244894 | Dale et al. | Jan 2016 | B1 |
9251134 | Birnbaum et al. | Feb 2016 | B2 |
9323743 | Reiter | Apr 2016 | B2 |
9336193 | Logan et al. | May 2016 | B2 |
9355093 | Reiter | May 2016 | B2 |
9396168 | Birnbaum et al. | Jul 2016 | B2 |
9396181 | Sripada et al. | Jul 2016 | B1 |
9396758 | Oz et al. | Jul 2016 | B2 |
9405448 | Reiter | Aug 2016 | B2 |
9424254 | Howald et al. | Aug 2016 | B2 |
9430557 | Bhat et al. | Aug 2016 | B2 |
9460075 | Mungi et al. | Oct 2016 | B2 |
9529795 | Kondadadi et al. | Dec 2016 | B2 |
9576009 | Hammond et al. | Feb 2017 | B1 |
9697178 | Nichols et al. | Jul 2017 | B1 |
9697197 | Bimbaum et al. | Jul 2017 | B1 |
9697492 | Bimbaum et al. | Jul 2017 | B1 |
9720884 | Birnbaum et al. | Aug 2017 | B2 |
9720899 | Birnbaum et al. | Aug 2017 | B1 |
9870629 | Cardno et al. | Jan 2018 | B2 |
9971967 | Bufe, III et al. | May 2018 | B2 |
9977773 | Birnbaum et al. | May 2018 | B1 |
9990337 | Birnbaum et al. | Jun 2018 | B2 |
10019512 | Boyle et al. | Jul 2018 | B2 |
10037377 | Boyle et al. | Jul 2018 | B2 |
10049152 | Ajmera et al. | Aug 2018 | B2 |
10115108 | Gendelev et al. | Oct 2018 | B1 |
10185477 | Paley et al. | Jan 2019 | B1 |
10387970 | Wang et al. | Aug 2019 | B1 |
10489488 | Birnbaum et al. | Nov 2019 | B2 |
10565308 | Reiter | Feb 2020 | B2 |
10599767 | Mattera | Mar 2020 | B1 |
10699079 | Paley et al. | Jun 2020 | B1 |
10706236 | Platt et al. | Jul 2020 | B1 |
20020046018 | Marcu et al. | Apr 2002 | A1 |
20020083025 | Robarts et al. | Jun 2002 | A1 |
20020107721 | Darwent et al. | Aug 2002 | A1 |
20030004706 | Yale | Jan 2003 | A1 |
20030061029 | Shaket | Mar 2003 | A1 |
20030216905 | Chelba et al. | Nov 2003 | A1 |
20040015342 | Garst | Jan 2004 | A1 |
20040034520 | Langkilde-Geary et al. | Feb 2004 | A1 |
20040138899 | Birnbaum et al. | Jul 2004 | A1 |
20040174397 | Cereghini et al. | Sep 2004 | A1 |
20040225651 | Musgrove et al. | Nov 2004 | A1 |
20040255232 | Hammond et al. | Dec 2004 | A1 |
20050027704 | Hammond et al. | Feb 2005 | A1 |
20050028156 | Hammond et al. | Feb 2005 | A1 |
20050033582 | Gadd et al. | Feb 2005 | A1 |
20050049852 | Chao | Mar 2005 | A1 |
20050125213 | Chen et al. | Jun 2005 | A1 |
20050137854 | Cancedda et al. | Jun 2005 | A1 |
20050273362 | Harris et al. | Dec 2005 | A1 |
20060031182 | Ryan et al. | Feb 2006 | A1 |
20060101335 | Pisciottano | May 2006 | A1 |
20060181531 | Goldschmidt | Aug 2006 | A1 |
20060212446 | Hammond et al. | Sep 2006 | A1 |
20060271535 | Hammond et al. | Nov 2006 | A1 |
20060277168 | Hammond et al. | Dec 2006 | A1 |
20070132767 | Wright et al. | Jun 2007 | A1 |
20070185846 | Budzik et al. | Aug 2007 | A1 |
20070185847 | Budzik et al. | Aug 2007 | A1 |
20070185861 | Budzik et al. | Aug 2007 | A1 |
20070185862 | Budzik et al. | Aug 2007 | A1 |
20070185863 | Budzik et al. | Aug 2007 | A1 |
20070185864 | Budzik et al. | Aug 2007 | A1 |
20070185865 | Budzik et al. | Aug 2007 | A1 |
20070250479 | Lunt et al. | Oct 2007 | A1 |
20070250826 | O'Brien | Oct 2007 | A1 |
20080005677 | Thompson | Jan 2008 | A1 |
20080198156 | Jou et al. | Aug 2008 | A1 |
20080250070 | Abdulla et al. | Oct 2008 | A1 |
20080256066 | Zuckerman et al. | Oct 2008 | A1 |
20080304808 | Newell et al. | Dec 2008 | A1 |
20080306882 | Schiller | Dec 2008 | A1 |
20080313130 | Hammond et al. | Dec 2008 | A1 |
20090019013 | Tareen et al. | Jan 2009 | A1 |
20090030899 | Tareen et al. | Jan 2009 | A1 |
20090049041 | Tareen et al. | Feb 2009 | A1 |
20090083288 | LeDain et al. | Mar 2009 | A1 |
20090119584 | Herbst | May 2009 | A1 |
20090144608 | Oisel et al. | Jun 2009 | A1 |
20090175545 | Cancedda et al. | Jul 2009 | A1 |
20090248399 | Au | Oct 2009 | A1 |
20100146393 | Land et al. | Jun 2010 | A1 |
20100161541 | Covannon et al. | Jun 2010 | A1 |
20100325107 | Kenton et al. | Dec 2010 | A1 |
20110022941 | Osborne et al. | Jan 2011 | A1 |
20110044447 | Morris et al. | Feb 2011 | A1 |
20110077958 | Breitenstein et al. | Mar 2011 | A1 |
20110078105 | Wallace | Mar 2011 | A1 |
20110087486 | Schiller | Apr 2011 | A1 |
20110099184 | Symington | Apr 2011 | A1 |
20110113315 | Datha et al. | May 2011 | A1 |
20110113334 | Joy et al. | May 2011 | A1 |
20110213642 | Makar et al. | Sep 2011 | A1 |
20110246182 | Allen | Oct 2011 | A1 |
20110249953 | Suri et al. | Oct 2011 | A1 |
20110288852 | Dymetman et al. | Nov 2011 | A1 |
20110295903 | Chen | Dec 2011 | A1 |
20110307435 | Overell et al. | Dec 2011 | A1 |
20110311144 | Tardif | Dec 2011 | A1 |
20110314381 | Fuller et al. | Dec 2011 | A1 |
20120011428 | Chisholm | Jan 2012 | A1 |
20120041903 | Beilby et al. | Feb 2012 | A1 |
20120069131 | Abelow | Mar 2012 | A1 |
20120109637 | Merugu et al. | May 2012 | A1 |
20120143849 | Wong et al. | Jun 2012 | A1 |
20120158850 | Harrison et al. | Jun 2012 | A1 |
20120166180 | Au | Jun 2012 | A1 |
20120265531 | Bennett | Oct 2012 | A1 |
20120310699 | McKenna et al. | Dec 2012 | A1 |
20130041677 | Nusimow et al. | Feb 2013 | A1 |
20130091031 | Baran et al. | Apr 2013 | A1 |
20130096947 | Shah et al. | Apr 2013 | A1 |
20130144605 | Brager et al. | Jun 2013 | A1 |
20130145242 | Birnbaum et al. | Jun 2013 | A1 |
20130173285 | Hyde et al. | Jul 2013 | A1 |
20130174026 | Locke | Jul 2013 | A1 |
20130211855 | Eberle et al. | Aug 2013 | A1 |
20130238330 | Casella dos Santos | Sep 2013 | A1 |
20130246934 | Wade et al. | Sep 2013 | A1 |
20130253910 | Turner et al. | Sep 2013 | A1 |
20130262092 | Wasick | Oct 2013 | A1 |
20130275121 | Tunstall-Pedoe | Oct 2013 | A1 |
20130304507 | Dail et al. | Nov 2013 | A1 |
20130316834 | Vogel et al. | Nov 2013 | A1 |
20140006012 | Zhou et al. | Jan 2014 | A1 |
20140040312 | Gorman et al. | Feb 2014 | A1 |
20140062712 | Reiter | Mar 2014 | A1 |
20140075004 | Van Dusen et al. | Mar 2014 | A1 |
20140134590 | Hiscock, Jr. | May 2014 | A1 |
20140163962 | Castelli et al. | Jun 2014 | A1 |
20140200878 | Mylonakis et al. | Jul 2014 | A1 |
20140201202 | Jones et al. | Jul 2014 | A1 |
20140208215 | Deshpande | Jul 2014 | A1 |
20140314225 | Riahi et al. | Oct 2014 | A1 |
20140351281 | Tunstall-Pedoe | Nov 2014 | A1 |
20140356833 | Sabczynski et al. | Dec 2014 | A1 |
20140372850 | Campbell et al. | Dec 2014 | A1 |
20140375466 | Reiter | Dec 2014 | A1 |
20150049951 | Chaturvedi et al. | Feb 2015 | A1 |
20150078232 | Djinki et al. | Mar 2015 | A1 |
20150142704 | London | May 2015 | A1 |
20150161997 | Wetsel et al. | Jun 2015 | A1 |
20150169548 | Reiter | Jun 2015 | A1 |
20150178386 | Oberkampf | Jun 2015 | A1 |
20150186504 | Gorman et al. | Jul 2015 | A1 |
20150199339 | Mirkin et al. | Jul 2015 | A1 |
20150227508 | Howald et al. | Aug 2015 | A1 |
20150227588 | Shapira et al. | Aug 2015 | A1 |
20150242384 | Reiter | Aug 2015 | A1 |
20150261745 | Song et al. | Sep 2015 | A1 |
20150324347 | Bradshaw et al. | Nov 2015 | A1 |
20150324351 | Sripada et al. | Nov 2015 | A1 |
20150324374 | Sripada et al. | Nov 2015 | A1 |
20150325000 | Sripada | Nov 2015 | A1 |
20150331846 | Guggilla et al. | Nov 2015 | A1 |
20150331850 | Ramish | Nov 2015 | A1 |
20150332665 | Mishra et al. | Nov 2015 | A1 |
20150347400 | Sripada | Dec 2015 | A1 |
20150347901 | Cama et al. | Dec 2015 | A1 |
20150356967 | Byron et al. | Dec 2015 | A1 |
20150363364 | Sripada | Dec 2015 | A1 |
20150370778 | Tremblay et al. | Dec 2015 | A1 |
20160019200 | Allen | Jan 2016 | A1 |
20160026253 | Bradski et al. | Jan 2016 | A1 |
20160027125 | Bryce | Jan 2016 | A1 |
20160054889 | Hadley et al. | Feb 2016 | A1 |
20160103559 | Maheshwari et al. | Apr 2016 | A1 |
20160132489 | Reiter | May 2016 | A1 |
20160140090 | Dale et al. | May 2016 | A1 |
20160196491 | Chandrasekaran et al. | Jul 2016 | A1 |
20160217133 | Reiter et al. | Jul 2016 | A1 |
20160232152 | Mahamood | Aug 2016 | A1 |
20160232221 | McCloskey et al. | Aug 2016 | A1 |
20160314121 | Arroyo et al. | Oct 2016 | A1 |
20170004415 | Moretti et al. | Jan 2017 | A1 |
20170017897 | Bugay et al. | Jan 2017 | A1 |
20170024465 | Yeh et al. | Jan 2017 | A1 |
20170026705 | Yeh et al. | Jan 2017 | A1 |
20170060857 | Imbruce et al. | Mar 2017 | A1 |
20170061093 | Amarasingham et al. | Mar 2017 | A1 |
20170068551 | Vadodaria | Mar 2017 | A1 |
20170116327 | Gorelick et al. | Apr 2017 | A1 |
20170140405 | Gottemukkala et al. | May 2017 | A1 |
20170199928 | Zhao et al. | Jul 2017 | A1 |
20170212671 | Sathish | Jul 2017 | A1 |
20170213157 | Bugay et al. | Jul 2017 | A1 |
20170242886 | Jolley et al. | Aug 2017 | A1 |
20170270105 | Ninan et al. | Sep 2017 | A1 |
20170293864 | Oh | Oct 2017 | A1 |
20170358295 | Roux et al. | Dec 2017 | A1 |
20170371856 | Can et al. | Dec 2017 | A1 |
20180025726 | Gatti de Bayser et al. | Jan 2018 | A1 |
20180082184 | Guo et al. | Mar 2018 | A1 |
20180114158 | Foubert et al. | Apr 2018 | A1 |
20180232443 | Delgo et al. | Aug 2018 | A1 |
20180260380 | Birnbaum et al. | Sep 2018 | A1 |
20180261203 | Zoller et al. | Sep 2018 | A1 |
20180285324 | Birnbaum et al. | Oct 2018 | A1 |
20180314689 | Wang et al. | Nov 2018 | A1 |
20190138615 | Huh | May 2019 | A1 |
20190236140 | Canim et al. | Aug 2019 | A1 |
20190370696 | Ezen Can | Dec 2019 | A1 |
20200074310 | Li et al. | Mar 2020 | A1 |
20200089735 | Birnbaum et al. | Mar 2020 | A1 |
Number | Date | Country |
---|---|---|
9630844 | Oct 1996 | WO |
2006122329 | Nov 2006 | WO |
2014035400 | Mar 2014 | WO |
2014035402 | Mar 2014 | WO |
2014035403 | Mar 2014 | WO |
2014035406 | Mar 2014 | WO |
2014035407 | Mar 2014 | WO |
2014035447 | Mar 2014 | WO |
2014070197 | May 2014 | WO |
2014076524 | May 2014 | WO |
2014076525 | May 2014 | WO |
2014102568 | Jul 2014 | WO |
2014102569 | Jul 2014 | WO |
2014111753 | Jul 2014 | WO |
2015028844 | Mar 2015 | WO |
2015159133 | Oct 2015 | WO |
Entry |
---|
Allen et al., “StatsMonkey: A Data-Driven Sports Narrative Writer”, Computational Models of Narrative: Papers from the AAAI Fall Symposium, Nov. 2010, 2 pages. |
Andersen, P., Hayes, P., Huettner, A., Schmandt, L., Nirenburg, I., and Weinstein, S. (1992). Automatic extraction of facts from press releases to generate news stories. In Proceedings of the third conference on Applied natural language processing. (Trento, Italy). ACM Press, New York, NY, 170-177. |
Andre, E., Herzog, G., & Rist, T. (1988). On the simultaneous interpretation of real world image sequences and their natural language description: the system Soccer. Paper presented at Proceedings of the 8th. European Conference on Artificial Intelligence (ECAI), Munich. |
Asset Economics, Inc. (Feb. 11, 2011). |
Bailey, P. (1999). Searching for Storiness: Story-Generation from a Reader's Perspective. AAAI Technical Report FS-99-01. |
Bethem, T., Burton, J., Caldwell, T., Evans, M., Kittredge, R., Lavoie, B., and Werner, J. (2005). Generation of Real-time Narrative Summaries for Real-time Water Levels and Meteorological Observations in PORTS®. In Proceedings of the Fourth Conference on Artificial Intelligence Applications to Environmental Sciences (AMS-2005), San Diego, California. |
Bourbeau, L., Carcagno, D., Goldberg, E., Kittredge, R., & Polguere, A. (1990). Bilingual generation of weather forecasts in an operations environment. Paper presented at Proceedings of the 13th International Conference on Computational Linguistics (COLING), Helsinki, Finland, pp. 318-320. |
Boyd, S. (1998). Trend: a system for generating intelligent descriptions of time series data. Paper presented at Proceedings of the IEEE international conference on intelligent processing systems (ICIPS-1998). |
Character Writer Version 3.1, Typing Chimp Software LLC, 2012, screenshots from working program, pp. 1-19. |
Cyganiak et al., “RDF 1.1 Concepts and Abstract Syntax”, W3C Recommendation, 2014, vol. 25, No. 2. |
Dehn, N. (1981). Story generation after TALE-SPIN. In Proceedings of the Seventh International Joint Conference on Artificial Intelligence. (Vancouver, Canada). |
Dramatica Pro version 4, Write Brothers, 1993-2006, user manual. |
Gatt, A., and Portet, F. (2009). Text content and task performance in the evaluation of a Natural Language Generation System. Proceedings of the Conference on Recent Advances in Natural Language Processing (RANLP-09). |
Gatt, A., Portet, F., Reiter, E., Hunter, J., Mahamood, S., Moncur, W., and Sripada, S. (2009). From data to text in the Neonatal Intensive Care Unit: Using NLG technology for decision support and information management. AI Communications 22, pp. 153-186. |
Glahn, H. (1970). Computer-produced worded forecasts. Bulletin of the American Meteorological Society, 51(12), 1126-1131. |
Goldberg, E., Driedger, N., & Kittredge, R. (1994). Using Natural -Language Processing to Produce Weather Forecasts. IEEE Expert, 9 (2), 45. |
Hargood, C., Millard, D. And Weal, M. (2009) Exploring the Importance of Themes in Narrative Systems. |
Hargood, C., Millard, D. and Weal, M. (2009). Investigating a Thematic Approach to Narrative Generation, 2009. |
Hunter, J., Freer, Y., Gall, A., Logie, R., McIntosh, N., van der Meulen, M., Portet, F., Reiter, E., Sripada, S., and Sykes, C. (2008). Summarising Complex ICU Data in Natural Language. AMIA 2008 Annual Symposium Proceedings, pp. 323-327. |
Hunter, J., Galt, a., Portet, E, Reiter, E, and Sripada, S. (2008). Using natural language generation technology to improve information flows in intensive care units. Proceedings of the 5th Conference on Prestigious Applications of Intelligent Systems, PAIS-08. |
Kittredge, R., and Lavoie, B. (1998). MeteoCogent: A Knowledge-Based Tool for Generating Weather Forecast Texts. In Proceedings of the American Meteorological Society AI Conference (AMS-98), Phoenix, Arizona. |
Kittredge, R., Polguere, A., & Goldberg, E. (1986). Synthesizing weather reports from formatted data. Paper presented at Proceedings of the 11th International Conference on Computational Linguistics, Bonn, Germany, pp. 563-565. |
Kukich, K. (1983). Design of a Knowledge-Based Report Generator. Proceedings of the 21st Conference of the Association for Computational Linguistics, Cambridge, MA, pp. 145-150. |
Kukich, K. (1983). Knowledge-Based Report Generation: A Technique for Automatically Generating Natural Language Reports from Databases. Paper presented at Proceedings of the Sixth International ACM SIGIR Conference, Washington, DC. |
McKeown, K., Kukich, K, & Shaw, J. (1994). Practical issues in automatic documentation generation. 4th Conference on Applied Natural Language Processing, Stuttgart, Germany, pp. 7-14. |
Meehan, James R., Tale-Spin. (1977). An Interactive Program that Writes Stories. In Proceedings of the Fifth International Joint Conference on Artificial Intelligence. |
Memorandum Opinion and Order for O2 Media, LLC v. Narrative Science Inc., Case 1:15-cv-05129 (N.D. IL), Feb. 25, 2016, 25 pages (invalidating claims of USPNs 7,856,390, 8,494,944, and 8,676,691 owned by O2 Media, LLC. |
Moncur, W., and Reiter, E. (2007). How Much to Tell? Disseminating Affective Information across a Social Network. Proceedings of Second International Workshop on Personalisation for e-Health. |
Moncur, W., Masthoff, J., Reiter, E. (2008) What Do You Want to Know? Investigating the Information Requirements of Patient Supporters. 21st IEEE International Symposium on Computer-Based Medical Systems (CBMS 2008), pp. 443-448. |
Movie Magic Screenwriter, Write Brothers, 2009, user manual. |
Portet, F., Reiter, E., Gatt, A., Hunter, J., Sripada, S., Freer, Y., and Sykes, C. (2009). Automatic Generation of Textual Summaries from Neonatal Intensive Care Data. Artificial Intelligence. |
Portet, F., Reiter, E., Hunter, J., and Sripada, S. (2007). Automatic Generation of Textual Summaries from Neonatal Intensive Care Data. In: Bellazzi, Riccardo, Ameen Abu-Hanna and Jim Hunter (Ed.), 11th Conference on Artificial Intelligence in Medicine (AIME 07), pp. 227-236. |
Reiter et al., “Building Applied Natural Generation Systems”, Cambridge University Press, 1995, pp. 1-32. |
Reiter, E. (2007). An architecture for Data-To-Text systems. In: Busemann, Stephan (Ed.), Proceedings of the 11th European Workshop on Natural Language Generation, pp. 97-104. |
Reiter, E., Gatt, A., Portet, F., and van der Meulen, M. (2008). The importance of narrative and other lessons from an evaluation of an NLG system that summarises clinical data. Proceedings of the 5th International Conference on Natural Language Generation. |
Reiter E., Sripada, S., Hunter, J., Yu, J., and Davy, I. (2005). Choosing words in computer-generated weather precasts. Artificial Intelligence, 167:137-169. |
Riedl et al., “Narrative Planning: Balancing Plot and Character”, Journal of Artificial Intelligence Research, 2010, pp. 217-268, vol. 39. |
Robin, J. (1996). Evaluating the portability of revision rules for incremental summary generation. Paper presented at Proceedings of the 34th. Annual Meeting of the Association for Computational Linguistics (ACL'96), Santa Cruz, CA. |
Rui, Y., Gupta, A., and Acero, A. 2000. Automatically extracting highlights for TV Baseball programs. In Proceedings of the eighth ACM international conference on Multimedia. (Marina del Rey, California, United States). ACM Press, New York, NY 105-115. |
Smith, “The Multivariable Method in Singular Perturbation Analysis”, SIAM Review, 1975, pp. 221-273, vol. 17, No. 2. |
Sripada, S., Reiter, E., and Davy, I. (2003). SumTime-Mousam: Configurable Marine Weather Forecast Generator. Expert Update 6(3):4-10. |
Storyview, Screenplay Systems, 2000, user manual. |
Theune, M., Klabbers, E., Odijk, J., dePijper, J., and Krahmer, E. (2001) “From Data to Speech: A General Approach”, Natural Language Engineering 7(1): 47-86. |
Thomas, K., and Sripada, S. (2007). Atlas.txt: Linking Geo-referenced Data to Text for NLG. Paper presented at Proceedings of the 2007 European Natural Language Generation Workshop (ENLGO7). |
Thomas, K., and Sripada, S. (2008). What's in a message? Interpreting Geo-referenced Data for the Visually-impaired. Proceedings of the Int. conference on NLG. |
Thomas, K., Sumegi, L., Ferres, L., and Sripada, S. (2008). Enabling Access to Geo-referenced Information: Atlas.txt. Proceedings of the Cross-disciplinary Conference on Web Accessibility. |
Van der Meulen, M., Logie, R., Freer, Y., Sykes, C., McIntosh, N., and Hunter, J. (2008). When a Graph is Poorer than 100 Words: A Comparison of Computerised Natural Language Generation, Human Generated Descriptions and Graphical Displays in Neonatal Intensive Care. Applied Cognitive Psychology. |
Yu, J., Reiter, E., Hunter, J., and Mellish, C. (2007). Choosing the content of textual summaries of large time-series data sets. Natural Language Engineering, 13:25-49. |
Yu, J., Reiter, E., Hunter, J., and Sripada, S. (2003). SUMTIME-TURBINE: A Knowledge-Based System to Communicate Time Series Data in the Gas Turbine Domain. In P Chung et al. (Eds) Developments in Applied Artificial Intelligence: Proceedings of IEA/AIE-2003, pp. 379-384. Springer (LNAI 2718). |
Prosecution History for U.S. Appl. No. 16/444,649, now U.S. Pat. No. 10,706,236, filed Jun. 18, 2019. |
Number | Date | Country | |
---|---|---|---|
62797787 | Jan 2019 | US |