The technical field relates generally to capturing domain knowledge. More particularly, the technical field relates to capturing domain knowledge for use in artificial intelligence.
Artificial intelligence (AI) is widely accepted as the most revolutionary achievement in computer science and is used in every technological field to aid the human experience. AI touches every aspect of life and business: from healthcare to agriculture and farming, to autonomous vehicles and flying, manufacturing, to retail fashion and shopping, and the list goes on.
Even more fascinating, AI is now performing tasks that simulate human intelligence, such as thinking autonomously and reasoning at the level of humans. For example, AI is key in the design, development, and testing of new computer programs and is used in inventing technology. To perform these higher-level intelligence tasks, humans (e.g., subject matter experts) must be able to transfer in-depth knowledge and expertise into AI computers in meaningful ways.
For example, a great deal of a subject matter expert's (SME's) scientific knowledge can be found in written documents, and perhaps most explicitly in computer program code encapsulating scientific calculations. To some extent, all of these repositories of scientific knowledge have, as their target audience, other SMEs or humans with some level of skill in a particular technical domain.
For documents, skill is required to correctly interpret the meaning of the text. For computer programs, the skill is in providing correct inputs inputs that are consistent in ways that may be left implicit to some degree, or that conform with documentation that must be read by the user. Unfortunately, implicit computer program inputs, that may be sufficient for human interpretation, represent ambiguities for AI computer and cannot be interpreted.
This becomes a critical problem when computational models are to be integrated into the semantic framework of an AI that is expected to make use of the models, so that the outputs of one model are the inputs to another model, etc. In other words, all of the implicit knowledge that an SME uses to properly invoke the computational models must now be explicitly captured in the knowledge base of an AI computer.
The need for explicit capture creates a significant deficiency in traditional AI computers. The deficiency is that traditional AI computers do not include an ability or mechanisms to explicitly capture the implicit knowledge of an SME in a semantic framework.
Given the aforementioned deficiencies, a need exists for systems and methods that provide an ability to explicitly capture the implicit domain knowledge of an SME in a semantic framework. What is also needed are methods and systems that enable an AI based computer to apply the knowledge represented in an equation in a manner similar to that of a human of ordinary skill in the art.
More specifically, methods and systems are needed that use graph patterns to capture the domain knowledge of the relationships between all inputs of a mathematical equation to each other, and the relationship of all of the inputs to an output of the equation.
Under certain circumstances, embodiments of the present invention include a computer system including at least one processor for modeling operations related to capturing domain knowledge. The operations comprise creating, via the processor, a graph model of inputs to an equation relevant to the domain knowledge. The graph model relates at least one of the inputs to another one of the inputs, wherein the model relates the inputs to an output. The operations also include deriving augmented-type information from the graphical model, and adding the derived augmented-type information to the equation, the adding facilitating use of the equation by artificial intelligence.
Pursuant to some embodiments, knowledge that is often implicit (and only provided by human experts, for example) is made explicit in the captured knowledge and made available for machine inference.
Embodiments permit using graph patterns to capture the domain knowledge of the relationships between the inputs and outputs, that is the relationship of all the inputs to each other, and the output to all of the inputs—for any type of mathematical and/or scientific equation.
The foregoing has broadly outlined some of the aspects and features of various embodiments, which should be construed to be merely illustrative of various potential applications of the disclosure. Other beneficial results can be obtained by applying the disclosed information in a different manner or by combining various aspects of the disclosed embodiments. Accordingly, other aspects and a more comprehensive understanding may be obtained by referring to the detailed description of the exemplary embodiments taken in conjunction with the accompanying drawings, in addition to the scope defined by the claims.
The drawings are only for purposes of illustrating preferred embodiments and are not to be construed as limiting the disclosure. Given the following enabling description of the drawings, the novel aspects of the present disclosure should become evident to a person of ordinary skill in the art. This detailed description uses numerical and letter designations to refer to features in the drawings. Like or similar designations in the drawings and description have been used to refer to like or similar parts of embodiments of the invention.
As required, detailed embodiments are disclosed herein. It must be understood that the disclosed embodiments are merely exemplary of various and alternative forms. As used herein, the word “exemplary” is used expansively to refer to embodiments that serve as illustrations, specimens, models, or patterns. The figures are not necessarily to scale and some features may be exaggerated or minimized to show details of particular components.
In other instances, well-known components, systems, materials, or methods that are known to those having ordinary skill in the art have not been described in detail in order to avoid obscuring the present disclosure. Therefore, specific structural and functional details disclosed herein are not to be interpreted as limiting, but merely as a basis for the claims and as a representative basis for teaching one skilled in the art.
In
Various aircraft (i.e., an object) 106 in
Implicit in the illustration of
That is, the answer to these questions depends on the knowledge in the SME's head to know that the object speed 103 must be divided by the speed of sound 104 at the location where the object 106 is moving through the air. Thus, if the object 106 is moving through the air at an altitude of 32,000 feet, then that is where the speed of sound 104 is measured: at 32,000 feet. An SME would understand this assumption when using the equation 101 in
In a more complex case (Example #2), consider that the maximum total thrust for an aircraft is as follows: [the number of aircraft engines]×[total net thrust/engine]. How would an AI computer use this equation? The relationship is: Total thrust of an aircraft=[the number of engines of the aircraft]×[the net thrust of an engine of the aircraft]. The assumption is that all of the engines on the aircraft have the same net thrust.
The challenge, when trying to solve an equation using AI, is to explicitly capture the relationships between all of the inputs to the equation (to each other), and the relationship of the inputs to the equation to the output, in domain terms. These relationships are captured by relating them all to the same object. In this more complex case (example #2), the object was the aircraft.
In
Thus, the speed of sound 104 (the denominator) in the equation 101 of
In conventional
gam=1+(gamma−1)/(1+(gamma−1)*[(theta/T){circumflex over ( )}*e{circumflex over ( )}(theta/T)/(e{circumflex over ( )}(theta/T)−1){circumflex over ( )}1){circumflex over ( )}2]) Equation 204:
where
However, static temperature can be computed from altitude for a “standard Earth day” as follows. If the altitude (h) is less than 36,152 feet (troposphere), then the temperature in degrees F. is given by:
T=59−0.00356 h
If the altitude h is between 36,152 and 82,345 feet, the temperature is:
T=−70
If the altitude h is greater than 82,345 feet, then the temperature is:
T=−205.05+0.00164 h
To properly use these equations, an AI computer must know how the inputs and outputs of each equation are related in domain terms, and must have explicit knowledge of the units of inputs and outputs of each equation and ensure that they are aligned when using multiple equations.
This approach requires the user choose the units. The required units implied by this choice are shown in the applet interface 300, as illustrated in
In the illustrious example of
In the exemplary embodiments, one approach for capturing and communicating knowledge of a domain includes understanding relationships between values of various inputs to equations, representing the domain, to each of the other inputs, and understanding the relationships of those values to the outputs of those equations. Computational modeling is one approach that can define these relationships.
There are multiple formal modeling methodologies that one might use to capture and communicate knowledge of a domain. By way of example only and not limitation, predicate logic is one such methodology, that is also a very powerful formalism, that is often used. If predicate logic is restricted to predicates of arity 2 or less, it corresponds exactly with the graph methods used by the Web Ontology Language (OWL). This is the case because a predicate of arity 2 is an edge in a directed graph. When using graph representations, predicates of arity greater than 2 must be represented by creating additional, mediating concepts that map the arguments of the higher-arity predicate together.
While predicate logic is one approach, other alternatives within the spirit and scope of the embodiments include, but are not limited to, conceptual graphs (CG), ISO standard Common Logic (CL), and Knowledge Interchange Format (KIF).
For the example of
For example, the Gas 406 has a MolarMass 614. This value comes into play because one can compute the gasConstant 616 from the universal gas constant and the molecular weight, but the molarMass 614 of that gas that is being computed must be known. By way of example only, and not limitation, the PhysicalObject 502 and the Gas 406 include the subclasses:
Physical Object (502) is in the domain of a number of important properties, including:
Gas (406) includes several important properties relevant to gases. By way of example, several properties include:
Graphing the domain model, is achieved in the example of
Most programming languages have some concept of built-in types, e.g., integer, float, string, Boolean, etc. These types are used to specify the type of a variable, the signature of a method call, the type of value or values returned. In an object-oriented language, classes may be defined that align with the classes in the domain model and these classes may then be used as types.
However, there are important differences between the expressivity of most object-oriented languages and the expressivity of a graph-based ontology language. Not least among these is that in most object-oriented languages, properties are represented as fields in a class and have no independent existence. By contrast, properties in graph ontology languages are first-class citizens and can have restrictions on value type, cardinality, etc., based on class of the thing having the property.
This means that more than one class can be in the domain of the same property, and that a property may have restrictions on cardinality, type of value, etc., which are different for different classes but is still identifiably the same property. So even in the case of object-oriented languages, the useful parts of the code may be methods whose types are the built-in or primitive types. Such is the case, for example, in Java applets examples, such as the applet 300 of
When integrating computational models whose inputs and outputs are typed according to the built-in data types of the language into a semantic framework, both the built-in data type and the semantic domain type are important. However, these two types are insufficient.
Consider some examples using the exemplary equations 101 and 102 above. By way of example, the equations 101 and 202 will be illustrated in the Semantic Application Design Language (SADL) as equations 1-8 below. Other languages, however, could be used. The information content: (1) the primitive data type of the inputs and outputs, (2) the domain concept of which the input or output is a value, and (3) how the inputs and outputs are related in domain terms, is the essential point.
Stated another way, the following is a declaration of the signatures for equations 1-4, speed of sound, gamma of a calorically imperfect gas, temperature of the atmosphere as a function of altitude, and Mach number for a flying object, in terms of the language built-in types of inputs and returned values.
What is determined by this equation (i.e., the equations 1-4), is the speed of sound of the (same) gas, in either meters/sec or ft/sec. The significance of discussing equations 1-4 is to emphasize that the equations 1-4 do not include any augmented-types information. Without augmented-type information, an AI computer would not be able to use the equations 101 and 202, depicted in
As stated above, the equations 1-4 do not include any augmented-types information and as a result, the AI computer (illustrated below in
Using the example equations from
A combination of
In Equation 5, the graph pattern need only relate properties whose values are being input and output to a particular instance of the class Gas 406. In this syntax, the use of “a Gas” in the first argument and “the Gas in the subsequent arguments and in the return type indicates explicitly that all refer to the same Gas. Thus, the use in this structured-English language is consistent with meaning in standard English usage. The same is true in Equation 6. In Equation 7, the computational model being described is actually specific to Air 408, not just any subclass of Gas 406. This is important to capture.
In Equation 8, which is illustrative of a more complex equation that includes properties of more than one concept, the inputs and outputs are related via a graph pattern going up to the node representing the instance of a PhysicalObject, defined as “a” PhysicalObject in the first argument and “the” PhysicalObject (502) in subsequent arguments and in the return value. Note that the first reference to “Air” is “some Air”. This is an allowed alternative to “an Air” as it is more natural-sounding for a substance. Note also that an equation might have made reference to more than one PhysicalObject, in which case the identity of each different object must be made explicitly clear.
This can be accomplished in the SADL language by using “a PhysicalObject”, “the Physical Object”, etc., for the first, and “a second PhysicalObject”, “the second PhysicalObject”, etc., for the second. This extends to a “third PhysicalObject, etc. (This usage of indefinite and definite articles is covered by invention Concept Level Rules (Crules), US Patent reference number 317709-US-1, and by the paper Concept-level Rules for Capturing Domain Knowledge, A. Moitra, A. Crapo, R. Palla. 12th IEEE International Conference on Semantic Computing, Jan. 31-Feb. 2, 2018. https://ieeexplore.ieee.org/abstract/document/8334469/ the contents of each of which are hereby incorporated by reference in their entirety for all purposes).
Also significant to note is importance of capturing the constraints on units of inputs and outputs. For example, Equation 7 illustrates the case of a computational model that only works for a single set of units: “ft” as the unit of input and “[degrees] Rankine” as the unit of output. The other equations are modified slightly from those shown in Equations 1, 2, and 4, to take an additional argument showing whether the unit system is Metric or Imperial.
For example, in Equation 5, if the UnitSystem is Metric, the unit of the first argument must be Kelvin, the unit of the 3rd argument must be “g/mole”, the unit of the 4th argument Kelvin, and the unit of the returned value is “m/sec”. Similarly, if the UnitSystem is Imperial, the units are respectively Ranking, “lbm/lbmole”, Rankine, and “ft/sec”. Note that the 2nd argument is unitless. The presence of the unit system argument implies that the encoding of the equation has a computations for both systems of units with appropriate flow control within.
Input device(s) 810 may comprise, for example, a keyboard, a keypad, a mouse or other pointing device, a microphone, knob or a switch, an infra-red (IR) port, a docking station, and/or a touch screen. Input device(s) 810 may be used, for example, to enter information into the computer system 800. Output device(s) 820 may comprise, for example, a display (e.g., a display screen), a speaker, and/or a printer.
Data storage device 830 may comprise any appropriate persistent storage device, including combinations of magnetic storage devices (e.g., magnetic tape, hard disk drives. and flash memory), optical storage devices, Read Only Memory (ROM) devices, etc., while memory 825 may comprise Random Access Memory (RAM), Storage Class Memory (SCM) or any other fast-access memory.
Services 835 and application 840 may comprise program code executed by processor 805 to cause the computer system 800 to perform any one or more of the processes described herein (e.g.,
Data 845 (either cached or a full database) may be stored in volatile memory such as memory 825. Data storage device 830 may also store data and other program code and instructions for providing additional functionality and/or which are necessary for operation of apparatus 800, such as device drivers, operating system files, etc.
Services 835 and application 840 including one or more process modules, for example, an augmented types module 840a performs specialized processing to augment integration of models into semantic framework. Process Modules #2 (840a) and #3 (840b) may comprise program code executed by processor 805 to cause the computer system 800 to perform any one or more of the processes described herein (e.g.,
This written description uses examples to disclose the invention, including the best mode, and also to enable any person skilled in the art to practice the invention, including making and using any devices or systems and performing any incorporated methods.
The patentable scope of the invention is defined by the claims, and may include other examples that occur to those skilled in the art. Such other examples are intended to be within the scope of the claims if they have structural elements that do not differ from the literal language of the claims, or if they include equivalent structural elements with insubstantial differences from the literal languages of the claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/023031 | 3/16/2020 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62818873 | Mar 2019 | US |