ARTIFICIAL INTELLIGENCE SYSTEM FOR A JOINT GROUP PREDICTION BY INDIVIDUAL PREDICTION AND EXPLANATION-BASED RE-RANKING

Description

FIELD

The present invention relates to artificial intelligence (AI) and machine learning (ML) and in particular to a method, system and computer-readable medium for providing a joint prediction for a group of individual entities.

SUMMARY

In an embodiment, the present invention provides a computer-implemented method of consolidating recommendations based on a plurality of individual recommendations. The method is implemented in one or more processors connected to a memory and includes receiving a knowledge graph including a plurality of source entities, a plurality of target entities and a plurality of attribute entities. Each source entity is linked to one or more of the plurality of attribute entities, and each target entity is linked to one or more of the plurality of attribute entities. Using a trained prediction learning model, a prediction is determined for each source entity based on the knowledge graph. The trained prediction model was trained using prediction training data including historical data. The prediction for each source includes recommendation data identifying one or multiple target entities. Using a trained consolidation learning model, a consolidated prediction is determined for the plurality of source entities based on the prediction for each source entity. The trained consolidation learning model was trained using consolidation training data including the historical data and the recommendation data. The consolidated prediction identifies a target entity that maximizes a joint probability of the plurality of source entities.

BRIEF DESCRIPTION OF THE DRAWINGS

Subject matter of the present disclosure will be described in even greater detail below based on the exemplary figures. All features described and/or illustrated herein can be used alone or combined in different combinations. The features and advantages of various embodiments will become apparent by reading the following detailed description with reference to the attached drawings, which illustrate the following:

FIG. 1 illustrates a method and system architecture for providing a consolidated recommendation for a group of entities according to an embodiment of the present invention;

FIG. 2 illustrates, before the domain alignment component, an example for the historical graph database where entities belong to one of three subgroups (source, target or attribute);

FIG. 3 illustrates, after the domain alignment component, graph fusion data as an example for the knowledge graph after the graph fusion step, wherein two new source entities are now connected to the historic graph;

FIG. 4 illustrates, after the prediction engine, graph fusion and recommendation data, wherein the prediction engine produces recommendations for the new source entities of interest;

FIG. 5 illustrates, after the preference collector component, graph fusion, recommendation and preference data as an input graph for the consolidation engine component;

FIG. 6 illustrates a neural network architecture of the consolidation engine component; and

FIG. 7 is a block diagram of a processing system according to an embodiment.

DETAILED DESCRIPTION

Often preferences of different individuals contradict each other, but common environments of individuals can require a joint decision. Embodiments of the present invention provide a mechanism where the predictions of a group of individual entities are consolidated into one joint prediction, which respects individual wishes as well as possible rankings of individuals within the group.

AI systems typically make predictions for one particular entity. However, often different entities are linked and the goal can be to arrive at one joint solution. For example, consider a group of people who would like to go together on vacation. Individually, the people might choose a different vacation destination than the one they would choose as a group. This should be respected when the AI system gives vacation recommendations to users. Embodiments of the present invention therefore have a technical aim and serve a specific technical purpose, in particular to improve the technical functionality and flexibility of non-human, computerized and automated or semi-automated AI systems. In particular, the accuracy of joint predictions, and the ability to make joint predictions generally, are improved.

Existing or naive solutions suffer from two major drawbacks: (1) if a group of entities (e.g., individuals) is represented as one entity, then it is not possible to respect the wishes of individual entities, nor is it possible to later leverage learned information if one entity becomes part of another group of entities; and (2) if a majority vote is taken across individual predictions, this does not respect the strength of individual wishes, nor the aspect that the prediction of one entity might have a stronger influence on the final prediction than the prediction of other entities.

In an embodiment, method is provided with which the predictions of a group of individuals can be consolidated to form one joint prediction. In an embodiment, the group is not modeled as a whole (as this would lead to a loss of information about the individuals of the group), which is advantageous and provides improvements over existing or naïve solutions for several reasons: (1) the individual entities are able to later be a part of other groups; (2) the individual entities are able to have varying strengths of preferences with regards to certain prediction that can be taken into account; and (3) the prediction of some individual entities are able to weigh higher than that of others. Another advantageous improvement is that the links between group members can be explicitly encoded in order to learn the relations between the group members.

Embodiments use a knowledge graph which includes triples t=(h, r, t), where h and t are entities and r is a relation. Entities of the knowledge graph are grouped in the following subgroups: (1) source entity; (2) target entity; and (3) attribute entity (see FIG. 2). The goal is to produce a consolidated prediction for a group of source entities. For example, the source entities could be different people who want to go together on vacation. Each individual receives their own individual recommendation and these recommendations are then consolidated to provide a final recommendation for the entire group. In this scenario, people will be source entities, vacation destinations will be target entities and additional attributes (such as swimming pool, beach, gender or age of a source entity, etc.), which are associated with either source and target entities or both types of entities, will be attribute entities. Depending on the use case, these groups can overlap.

In a first aspect, the present disclosure provides a computer-implemented method of consolidating recommendations based on a plurality of individual recommendations. The method is implemented in one or more processors connected to a memory and includes receiving a knowledge graph including a plurality of source entities, a plurality of target entities and a plurality of attribute entities. Each source entity is linked to one or more of the plurality of attribute entities, and each target entity is linked to one or more of the plurality of attribute entities. Using a trained prediction learning model, a prediction is determined for each source entity based on the knowledge graph. The trained prediction model was trained using prediction training data including historical data. The prediction for each source includes recommendation data identifying one or multiple target entities. Using a trained consolidation learning model, a consolidated prediction is determined for the plurality of source entities based on the prediction for each source entity. The trained consolidation learning model was trained using consolidation training data including the historical data and the recommendation data. The consolidated prediction identifies a target entity that maximizes a joint probability of the plurality of source entities.

In a second aspect, the present disclosure provides the method according to the first aspect, wherein one or more of the plurality of source entities are linked to one or more other source entities and/or one or more target entities.

In a third aspect, the present disclosure provides the method according to the first or second aspect, wherein the determining, using a trained prediction learning model, a prediction for each source entity includes learning vector representations of the knowledge graph.

In a fourth aspect, the present disclosure provides the method according to any of the aspects above, wherein the consolidated prediction identifies multiple target entities in a ranked order.

In a fifth aspect, the present disclosure provides the method according to any of the aspects above, further comprising applying source entity constraints of one or more of the plurality of source entities to the ranked order to create a filtered ranked order of the identified multiple target entities.

In a sixth aspect, the present disclosure provides the method according to any of the aspects above, wherein the recommendation data for each prediction includes a prediction explanation, and the consolidated prediction includes a consolidated prediction explanation.

In a seventh aspect, the present disclosure provides the method according to any of the aspects above, wherein the prediction for each source entity includes a weight, and wherein the determining a consolidated prediction is further based on the weights of the predictions for each source entity.

In an eighth aspect, the present disclosure provides the method according to any of the aspects above, further comprising: fusing a new source entity into the knowledge graph by linking the new source entity to one or more of the plurality of attribute entities to produce a fused knowledge graph, updating the step of determining a prediction for each source entity and the new source entity using the fused knowledge graph, and updating the step of determining a consolidated prediction for the plurality of source entities and the new source entity.

In a ninth aspect, the present disclosure provides a system configured for consolidating recommendations based on a plurality of individual recommendations, the system comprising one or more processors, which alone or in combination, are configured to provide for execution of a method comprising: receiving a knowledge graph including a plurality of source entities, a plurality of target entities and a plurality of attribute entities, wherein each source entity is linked to one or more of the plurality of attribute entities, and each target entity is linked to one or more of the plurality of attribute entities; determining, using a trained prediction learning model, a prediction for each source entity based on the knowledge graph, the trained prediction model having been trained using prediction training data including historical data, wherein the prediction for each source includes recommendation data identifying one or multiple target entities; and determining, using a trained consolidation learning model, a consolidated prediction for the plurality of source entities based on the prediction for each source entity, the trained consolidation learning model having been trained using consolidation training data including the historical data and the recommendation data, wherein the consolidated prediction identifies a target entity that maximizes a joint probability of the plurality of source entities.

In a tenth aspect, the present disclosure provides the system according to the ninth aspect, wherein the method further includes: fusing a new source entity into the knowledge graph by linking the new source entity to one or more of the plurality of attribute entities to produce a fused knowledge graph, updating the step of determining a prediction for each source entity and the new source entity using the fused knowledge graph, and updating the step of determining a consolidated prediction for the plurality of source entities and the new source entity.

In an eleventh aspect, the present disclosure provides the system according to the ninth or tenth aspect, wherein one or more of the plurality of source entities are linked to one or more other source entities and/or one or more target entities.

In a twelfth aspect, the present disclosure provides the system according to any of the ninth through eleventh aspects, wherein the consolidated prediction identifies multiple target entities in a ranked order, and wherein the method further includes applying source entity constraints of one or more of the plurality of source entities to the ranked order to create a filtered ranked order of the identified multiple target entities.

In an thirteenth aspect, the present disclosure provides the system according to any of the ninth through twelfth aspects, wherein the recommendation data for each prediction includes a prediction explanation, and the consolidated prediction includes a consolidated prediction explanation.

In a fourteenth aspect, the present disclosure provides the system according to any of the ninth through thirteenth aspects, wherein the prediction for each source entity includes a weight, and wherein the determining a consolidated prediction is further based on the weights of the predictions for each source entity.

In a fifteenth aspect, the present invention provides a tangible, non-transitory computer-readable medium having instructions thereon which, upon being executed by one or more processors, alone or in combination, provide for execution of the method according to any of the first through eighth aspects.

FIG. 1 illustrates a method and system architecture according to an embodiment comprising four components: (1) domain alignment; (2) prediction engine; (3) preference collector; and (4) consolidation engine. Input data is a knowledge graph that includes triples. The triples represent information, which may have been gathered by various sensors (e.g., microphones, cameras, acceleration sensors, etc.). Output is a series of recommendations and, preferably, explanations for the recommendations. The output is directed to provide recommendations and explanations to a computer-implemented system.

The domain alignment component (input: FIG. 2, output: FIG. 3) handles and preprocesses the input data for the system. In particular, this component includes a graph database to manage historical data (e.g., target and attribute entities, as well as historic source entities) and a graph fusion module to connect new entities or the entities of interest (source entities) with the database. This includes to establish links between the existing source and target entities and links with their attributes, such as preferences (e.g., “likes beach”), characteristics (e.g., “is x years old”), etc. which are stored in the database. The output of this component, also referred to herein as “graph fusion data”, is a knowledge graph including source, target and attribute entities (see FIG. 3).

The prediction engine component (input: FIG. 3, output: FIG. 4) learns vector representations of the knowledge graph and then predicts for each source entity (e.g., individuals) of the group a recommendation or a (preferably ordered) list of recommendations. Additionally, for this prediction, an explanation is also provided. The explanation can be provided in various ways, for example: (1) an explainer selects a subset of the knowledge graph to explain the prediction; (2) all neighbors of the source entity are shown; and/or (3) the entire knowledge graph is shown. For example, a person is recommended a vacation destination because the person likes the beach and the vacation destination has a beach. The output of this component is a personal (ordered) list of target entities (e.g., recommendations), also referred to herein as “recommendation data”, along with explanations for each target entity (e.g., in the form of attribute entities).

When a new source entity is added to the knowledge graph by the graph fusion module, the prediction is updated to learn a vector representation for this new source entity. Given a new source entity, the graph fusion module establishes links between the new source entity and existing entities in the graph in the form of triples. These triples are then used to update the prediction engine, such that only the vector of the new source entity is modified during the updates and the other vectors are left untouched. This is to ensure that the new source entity vector fits into the knowledge graph presentation and is only possible due to the definition of how source entities are added to the knowledge graph via the graph fusion module. The prediction engine component can be a knowledge base learner (KBlrn) and the explainer can be an explainable AI (XAI) engine that uses gradient rollback.

Once the prediction engine component has produced an output with an explanation (see FIG. 4 and FIG. 1, “Personalized Recommendations and Explanations”), this output is given to the preference collector component (input: FIG. 4/FIG. 1, “Personalized Recommendations and Explanations”, output: FIG. 5). This step adds, for each new source entity of interest, possible constraints (e.g., a certain entity or a relation that should or should not be used), personal weights and opinion weights (the weights being between 0 and 1). This can either be done by a person if, e.g., the source entity is a particular person and they would like to update constraints, personal preference or opinions. Alternatively, this can be done by an automated process, which automatically assigns constraints, personal preferences and opinions for the source entities and their predictions. For example, these might include the personal interest of the source entities (e.g., destination A might be the preference of person A, but person A actually does not have a strong opinion for a certain destination), and weights for the opinion for the each source entity (e.g., parents have more decision power than their children).

Taking the information collected so far into account (e.g., graph fusion data, recommendation data, preference data (see FIG. 5)), the consolidation engine component (input: FIG. 5, output: FIG. 2, “Consolidated Recommendation”) computes and outputs a consolidated and ordered list of one or multiple recommendations, preferably including explanations for the group (e.g., the source entities). For the edges of the input graph, it is provided to distinguish between weighted recommendation, weighted preference and arbitrary relation. A recommendation edge connects a source node with a target node and is the result from the prediction engine component. This edge type also reflects the initial ranking order. Having this graph allows to feed it to a machine learning model (e.g., a neural network, a perceptron, a support vector machine (SVM), etc.) which can handle and exploit the specific groups of the graph explicitly. This is advantageous as they cover different information (e.g., weighted vs. non-weighted edge). The consolidation engine can be NSP or a modified version of KBlrn in which predictions are consolidated and re-ranked.

In addition, constraints (e.g., if the target destination is at the ocean, then it should not be winter) are fed to the model in the form of rules (any soft or hard constraints, e.g., of the source entities), and are passed along the graph as training data. These constraints are applied as a post-processing step to the output list from the machine learning model.

FIG. 3 illustrates exemplary the layers of the model. Taking the input, in the form of a graph, recommendation and preference data, the model computes a joint representation. Based on this, the model computes for a series of source queries and respective relations one joint target entity recommendation, for example, the query has the form (s₁,r,?t), . . . , (s_n,r,?t). In particular, the model tries to find the entity that maximizes the joint probability of all source entities subject to the possible constraints. As used herein, the “joint probability” is a statistical measure that calculates the likelihood of two events occurring together at the same point in time. In contrast to a classical link prediction task, the model predicts a single node for a given set of triples with one unknown value. Ultimately, the model returns a ranked list of one or more recommendations for the set of source entities. The constraints can then be applied to this ranking to create a filtered ranking that is returned as the result. There can also be an explainable AI (XAI) engine, which can explain the prediction of the consolidation engine.

The output of the consolidation engine component is passed to the next third party system to process the consolidated recommendations and related explanations for the entities of interest (e.g., the source entities).

FIG. 7 is a block diagram of a processing system 700 according to an embodiment. The processing system 700 can be used to implement the protocols, devices, mechanisms, systems and methods described above and herein. For example, each functional node or component or module or device may include a processing system 700, or two or multiple nodes or components or modules or devices may be implemented by a processing system 700. A processing system 700 may include a processor 704, such as a central processing unit (CPU) of a computing device or a distributed processor system. The processor 704 executes processor-executable instructions for performing the functions and methods described above. In embodiments, the processor executable instructions are locally stored or remotely stored and accessed from a non-transitory computer readable medium, such as storage 710, which may be a hard drive, cloud storage, flash drive, etc. Read Only Memory (ROM) 706 includes processor-executable instructions for initializing the processor 704, while the random-access memory (RAM) 708 is the main memory for loading and processing instructions executed by the processor 704. The network interface 712 may connect to a wired network or cellular network and to a local area network or wide area network, such as the Internet, and may be used to receive and/or transmit data, keys, or other information, etc. as described herein. In certain embodiments, multiple processors perform the functions of processor 704.

Exemplary Embodiments and Use Cases

The present embodiments can be practically applied to improve the technical systems of a smart city for green resource control as follows:

- Use Case: The focus is on reducing the CO2 emissions. In a smart multi-residential building, warm water preparation should happen most efficiently (warm water should only be produced when it is necessary in order to save energy). An embodiment of the present invention allows to determine how often and when warm water needs to be produced, based on the individual needs of the residents.
- Data Source: People within a building (source entities) which need warm water are used. Information about the respective needs (target entities) are gathered by a sensor network or the individuals. Personal preferences are provided by the source entities, for example depending on working hours (attribute entities). Potential constraints might refer to the environmental conditions for the mobile cameras to operate (preference collector).
- Method: Application of the method according to an embodiment provides that the system takes all provided information and computes, based on all opinions and restrictions, the best possible time windows to produce warm water. If conditions change (e.g., people move in and out), the method can be re-run to re-compute the time windows.
- Output: An ordered list of recommendations when to produce warm water preferably along with an explanation why these time windows were selected are a result of the method.
- Physical Change (Technical Effect): The outcome of the system can be used to directly adapt the settings of the water and heating system in a smart building in an automated or semi-automated fashion.

The present embodiments can be practically applied to improve the technical systems of a smart city for maintenance control as follows:

- Use Case: Within a certain area (e.g., districts of a city or a larger region where a disaster has occurred), resources like mobile camera are limited. Hence, to use these resource most efficiently, it is necessary to send them to the location where they are most required to gather information (e.g., about a disaster) or to increase the safety.
- Data Source: People within a city/district or disaster coordinators (source entities) which want to improve/manage the situation. Information about the respective region (target entities) are gathered by a sensor network or public (governmental) data sources are used. Relevant information about the people and regions (attribute entities) are used. Personal preferences are provided by the source entities (e.g., prioritize children) and potential constraints (part of the preference collector) might refer to the environmental conditions for the mobile cameras to operate.
- Method: Application of the method according to an embodiment of the present invention provides that the system takes all provided information and computes, based on all opinions and restrictions, the best possible distribution of the resources (e.g., mobile cameras). If conditions change, the method can be re-run to re-allocate the resources.
- Output: An ordered list of recommendations where to use/position the resources, preferably along with an explanation why these positions were selected are a result of the method.
- Physical Change (Technical Effect): The outcome of the system can be used to position or adapt the position of devices (e.g., (civil) mobile cameras attached to drones) for maintenance control (e.g., to observe certain areas for the purpose of safety or rescue).

The present embodiments can be practically applied to improve the technical systems of a marketplace (e.g., a supermarket) for dynamic advertisements as follows:

- Use Case: Smart panels which show useful information or advertisements are limited since they cannot show everything at the same time. As such, they are most effective when they display information to people that they are most interested in (e.g., fighting disinformation or personalized advertisement).
- Data Source: People which are in the surrounding of the smart panel (source entities) are used. Information about the respective needs (target entities) are gathered by a sensor network, smart devices, or the individuals directly. Personal preferences are provided by the source entities, for example by explicitly providing it (attribute entities). Potential constraints (part of the preference collector) might refer to things which should not be displayed on the screen even if there is a high desire.
- Method: Application of the method according to an embodiment of the present invention provides that the system takes all provided information and computes, based on all opinions and restrictions, the best possible order for information (e.g., how long and when the different information should be displayed on the screen).
- Output: An ordered list of recommendations (e.g., personalized advertisements or information) preferably along with explanations why the respective information should be displayed on the smart screen are a result of the method.
- Physical Change (Technical Effect): The outcome of the system can be used to adapt the content of the screen dynamically and in an automated or semi-automated fashion to meet the needs of the surrounding people. The outcome of the system (e.g., recommendations) can be displayed in sequence (e.g., where the screen can only show one at a time).

The present embodiments can be practically applied to improve the technical systems of a travel recommendation system for providing group suggestions as follows:

- Use Case: A group of people (e.g., a family) wants to book a travel accommodation or travel activity together. Individuals of the group might have different preferences which are to be consolidate to offer one joint recommendation.
- Data Source: People who want to do something together (source entities) are used. The information about the potential options (target entities) may be gathered through common sense or the individual directly. Personal preferences are provided by the source entities (e.g., what they prioritize) and potential constraints (part of the preference collector) might refer to time or other restrictions (e.g., no activity after 10 pm). Attribute entities are attributes of the potential options or preferences of the individual (e.g., beach, to indicate that a location has a beach or that an individual likes a beach, or the age of a source entity)
- Method: Application of the method according to an embodiment provides that the system takes the provided information and produces a consolidated recommendation of what accommodation or activity the group of people most likely want to book.
- Output: An ordered list of recommendations for accommodations and activities preferably along with explanations why the group might be interested in booking these are a result of the method.
- Physical Change (Technical Effect): The outcome of the system could be used to make adaptations or changes in advance (e.g., the type and size of a boat or bus that will transport the people to their destination) in an automated or semi-automated fashion. For the recommended accommodations and activities, reservations or bookings can be made, or suggestions for reservations or bookings can be sent by electronic messaging, also in an automated or semi-automated fashion.

The present embodiments can be practically applied to improve the technical systems of a digital government system for project execution for positive environmental, social and corporate governance (ESG) as follows:

- Use Case: ESG is a way of evaluating how much an organization cares about these three values (environmental, social and corporate governance). Given a group of companies with different value preferences, they could jointly execute a project that respects these values.
- Data Source: Companies or other organizations (source entities) and their respective ESG value preferences (attribute entities) are used. Project execution options (target entities) and their respective ESG values (attribute entities) are also used.
- Method: Application of the method according to an embodiment provides that the system takes the provided information and produces a consolidated recommendation of how the project should be executed so that the values of the involved companies are respected as much as possible.
- Output: An ordered list of execution options for each execution step preferably along with explanations why the these options are a good consolidated choice are a result of the method.
- Physical Change (Technical Effect): The consolidated project execution steps can be automatically or semi-automatically executed. For example, materials can be ordered, products or sub-products can be manufactured and goods can be transported from one location to another (e.g., with a focus on CO2 reduction), and so on, in an automated or semi-automated fashion.

Embodiments may provide the following advantages and improvements over existing technology:

1. Provision of a graph fusion module that combines three subgroups (source, target, and attribute entities) into a single knowledge graph representation to derive, first, individual recommendations (which may be rated) and, subsequently, a consolidated recommendation. This enables, given a new source entity, to update the prediction engine (such that only the vector of the new source entity are modified), to be able to predict for this source entity by learning about the source entity from its connection to attribute, target and other source entities (supplied from the graph fusion module) while holding on to the historic data.

2. Provision of a mechanism that, given the attribute and target entities associated with each source entity of interest and the personal recommendation (from the prediction engine component), links source entities with target entities, and a mechanism that assigns these triples a weight (for the recommendation and personal interests) that indicates the personal importance of the link.

3. Enabling to combine entities, their preferences and personal opinions with domain related knowledge, and personal recommendations into a unified representation to derive a consolidated recommendation for all entities.

An embodiment provides a method for providing a joint recommendation comprising:

1 Providing that the following information is set:

- a. Definitions of a set of source, target and attribute entities.
- b. For all source entities s₁, . . . , s_n, choose one relation r. Each (si, r0) where i=1, . . . n pair, can then be used to predict targets t1, . . . , tm which are then consolidated into one prediction and represent what the group wants to achieve together.
- c. For each source entity, a series of one or more relevant attribute entities are linked (e.g., characteristics/preferences/interests). Source entities may also be linked to other source entities and target entities. It is also possible to define whether the opinion of a certain source entity has a higher weight or whether certain constraints need to be considered.
- d. For each target entity, a series of one or more relevant attribute entities are linked (e.g., characteristics). The target entities may also be linked to other target entities and source entities.
- e. A historical database, which describes a related situation (e.g., data about other source entities who were in the same or a similar situation), and general characteristics and descriptions about the domain.
2. For the initial start, the following steps may be executed in an embodiment:
- a. The prediction engine, and preferably a related XAI engine, are trained on the historical data.
- b. Subsequently, the consolidation engine is trained on the output of the prediction engine, the historical data, and the potential constraints, personal weights and the weighting of the individual opinions.
3. For applying the system, the following steps may be executed in an embodiment:
- a. The entities of interest (source entities) are provided with links to relevant attribute entities, other source entities and target entities. Similarly, target entities are provided with links to relevant attributes entities, other target entities and source entities. A new, previously unseen source entity is fused to the existing graph by connecting to attribute entities, and preferably also to source and target entities.
- b. Define for which set of source entities a joint prediction is to be found.
- c. The resulting graph is used as input to the prediction engine component to compute individual recommendations for each source entity.
- d. Each entity (or an automatic script based on rules) can weight the entries of their personal recommendations.
  - i. To compute a consolidated prediction for a set of source entities, the recommendations and corresponding potential weights are fed to the consolidation engine component. All information may be unified into a single knowledge graph. In an embodiment, the constraints are not included in the knowledge graph.
  - ii. The knowledge graph is fed to the consolidation recommendation AI engine to compute the consolidated recommendation for all given source entities. The consolidation recommendation AI engine returns one consolidated prediction for the defined set of source entities (see Step 3.c.).
- e. Combine input from Step 3.c. and Step 3.d. in a unified representation to compute the final consolidated recommendation.

Existing recommender systems typically use matrix factorization where entities are modeled only as source (often called user) and target (often called item) entities. In this setup, preferences and other concepts cannot be taken directly into account. Faisal M. Almutairi, Nicholas D. Sidiropoulos, and Bo Yang, “XPL-CF: Explainable Embeddings for Feature-based Collaborative Filtering,” In Proceedings of the 30th ACM International Conference on Information & Knowledge Management (CIKM '21), Association for Computing Machinery, New York, N.Y., USA, pp. 2847-2851 (2021), which is hereby incorporated by reference herein, use such a system and make it explainable. In this kind of system, no additional relations can be presented, nor can other entities be presented, such as the attribute entities. Further, explanations can only be other users and items. Therefore, such a system is technically less flexible, less expressive and provides less insights for users compared to the present embodiments. Additionally, an existing recommendation system such as this cannot explicitly model different relations between users, which also is less expressive. It is also not possible using the existing recommendation system to provide a recommendation for a group of users.

There are also other existing recommendation systems that do take other relations and entities into account and can only supply a recommendation for an individual (see, e.g., Lawrence, C., Sztyler, T., and Niepert, M., “Explaining Neural Matrix Factorization with Gradient Rollback,” Proceedings of the AAAI Conference on Artificial Intelligence, 35(6), pp. 4987-4995 (2021), and Pezeshkpour, P., Tian, Y., and Singh, S., “Investigating Robustness and Interpretability of Link Prediction via Adversarial Modifications,” Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), pp. 3336-3347 (2019), each of which is incorporated by reference herein). In contrast to embodiments herein, these recommendation systems cannot provide a joint prediction for a group of entities and consolidate the predictions into one suitable joint decision while considering individual preference strengths. These recommendation systems also do not explicitly model the graph as a set of three entity types. In addition, there is no mechanism to obtain preferences and take a preference weighting into account.

In contrast to these existing recommendation systems, embodiments herein advantageously provide the following technical improvements:

1. Enables to explicitly model the relationship between source entities, allowing to express how a group of source entities interconnects.
2. Enables to incorporate new source entities that were not seen during training and provide an explanation for the source entities' predictions. This is not possible in the previous approaches.
3. Enables to provide a consolidated recommendation for a group of entities. This is not directly possible with the previous approaches. The method and system according to embodiments herein are the first to produce such consolidated predictions, having both individual predictions and consolidated predictions alongside explanations as an output. Naive solutions would suffer from the drawbacks described above.

A recent survey by Tran, T., Felfernig, A. and Tintarev, N., “Humanized recommender systems: State-of-the-art and research issues,” ACM Transactions on Interactive Intelligent Systems (TiiS), 11(2), pp. 1-41 (2021), which is hereby incorporated by reference herein, highlights that psychological factors such as personality, emotions and social connections can significantly affect the outcome of a decision process. Further, the survey describes that the problem of decision bias in group recommender systems is a common problem as, in most cases, users do not have a clear picture of their preferences in mind before starting a decision process.

Embodiments herein are able to overcome these issues by providing a way to reflect personalities and emotions through attribute entities, where social connections are described by edges in a knowledge graph. In addition, by the system providing as a first step personal recommendations to each source entity, the decision bias problem is mitigated.

An embodiment herein can be applied to any technical system where there are individual predictions for a series of entities that would like to arrive at one joint prediction, to boost existing technology, or to improve activities on ESG or material science. A user interface is provided for interacting with the system and providing the input data, and receiving and viewing the output data (e.g., recommendations and explanations).

It is particular advantageous that, according to an embodiment, not only is the consolidated recommendation shown, but also the individual predictions and an explanation. If only the consolidated recommendation would be provided, then a user would not understand why the recommendation was made, therefore lowering the user's trust by not being able to see how individual predictions are combined into a consolidated prediction.

In contrast to an approach in which source entities are represented as one entity, embodiments herein provide: (1) a better recommendation because source entities' preferences can be taken into account; and (2) a more efficient prediction system that can take advantage of the connection(s) between source entities, leading to an overall improved prediction. Embodiments herein also provide improvements by being able to adapt the prediction engine to a new source entity and by collecting weights for the individual connections to the source entities in order to adapt and take preferences into account and thereby return an improved prediction result.

While subject matter of the present disclosure has been illustrated and described in detail in the drawings and foregoing description, such illustration and description are to be considered illustrative or exemplary and not restrictive. Any statement made herein characterizing the invention is also to be considered illustrative or exemplary and not restrictive as the invention is defined by the claims. It will be understood that changes and modifications may be made, by those of ordinary skill in the art, within the scope of the invention, which may include any combination of features from different embodiments described above.

Claims

1. A computer-implemented method of consolidating recommendations based on a plurality of individual recommendations, the method being implemented in one or more processors connected to a memory, the method comprising: receiving a knowledge graph including a plurality of source entities, a plurality of target entities and a plurality of attribute entities, wherein each source entity is linked to one or more of the plurality of attribute entities, and each target entity is linked to one or more of the plurality of attribute entities;determining, using a trained prediction learning model, a prediction for each source entity based on the knowledge graph, the trained prediction model having been trained using prediction training data including historical data, wherein the prediction for each source includes recommendation data identifying one or multiple target entities; anddetermining, using a trained consolidation learning model, a consolidated prediction for the plurality of source entities based on the prediction for each source entity, the trained consolidation learning model having been trained using consolidation training data including the historical data and the recommendation data, wherein the consolidated prediction identifies a target entity that maximizes a joint probability of the plurality of source entities.
2. The method according to claim 1, wherein one or more of the plurality of source entities are linked to one or more other source entities and/or one or more target entities.
3. The method according to claim 2, wherein the determining, using a trained prediction learning model, a prediction for each source entity includes learning vector representations of the knowledge graph.
4. The method according to claim 1, wherein the consolidated prediction identifies multiple target entities in a ranked order.
5. The method according to claim 4, further comprising applying source entity constraints of one or more of the plurality of source entities to the ranked order to create a filtered ranked order of the identified multiple target entities.
6. The method of claim 1, wherein the recommendation data for each prediction includes a prediction explanation, and the consolidated prediction includes a consolidated prediction explanation.
7. The method of claim 1, wherein the prediction for each source entity includes a weight, and wherein the determining a consolidated prediction is further based on the weights of the predictions for each source entity.
8. The method of claim 1, further comprising: fusing a new source entity into the knowledge graph by linking the new source entity to one or more of the plurality of attribute entities to produce a fused knowledge graph,updating the step of determining a prediction for each source entity and the new source entity using the fused knowledge graph, andupdating the step of determining a consolidated prediction for the plurality of source entities and the new source entity.
9. A system configured for consolidating recommendations based on a plurality of individual recommendations, the system comprising one or more processors, which alone or in combination, are configured to provide for execution of a method comprising: receiving a knowledge graph including a plurality of source entities, a plurality of target entities and a plurality of attribute entities, wherein each source entity is linked to one or more of the plurality of attribute entities, and each target entity is linked to one or more of the plurality of attribute entities;determining, using a trained prediction learning model, a prediction for each source entity based on the knowledge graph, the trained prediction model having been trained using prediction training data including historical data, wherein the prediction for each source includes recommendation data identifying one or multiple target entities; anddetermining, using a trained consolidation learning model, a consolidated prediction for the plurality of source entities based on the prediction for each source entity, the trained consolidation learning model having been trained using consolidation training data including the historical data and the recommendation data, wherein the consolidated prediction identifies a target entity that maximizes a joint probability of the plurality of source entities.
10. The system of claim 9, wherein the method further includes: fusing a new source entity into the knowledge graph by linking the new source entity to one or more of the plurality of attribute entities to produce a fused knowledge graph,updating the step of determining a prediction for each source entity and the new source entity using the fused knowledge graph, andupdating the step of determining a consolidated prediction for the plurality of source entities and the new source entity.
11. The system of claim 9, wherein one or more of the plurality of source entities are linked to one or more other source entities and/or one or more target entities.
12. The system of claim 9, wherein the consolidated prediction identifies multiple target entities in a ranked order, and wherein the method further includes applying source entity constraints of one or more of the plurality of source entities to the ranked order to create a filtered ranked order of the identified multiple target entities.
13. The system of claim 9, wherein the recommendation data for each prediction includes a prediction explanation, and the consolidated prediction includes a consolidated prediction explanation.
14. The system of claim 9, wherein the prediction for each source entity includes a weight, and wherein the determining a consolidated prediction is further based on the weights of the predictions for each source entity.
15. A tangible, non-transitory computer-readable medium having instructions thereon which, upon being executed by one or more processors, alone or in combination, provide for execution of the following steps: receiving a knowledge graph including a plurality of source entities, a plurality of target entities and a plurality of attribute entities, wherein each source entity is linked to one or more of the plurality of attribute entities, and each target entity is linked to one or more of the plurality of attribute entities;determining, using a trained prediction learning model, a prediction for each source entity based on the knowledge graph, the trained prediction model having been trained using prediction training data including historical data, wherein the prediction for each source includes recommendation data identifying one or multiple target entities; anddetermining, using a trained consolidation learning model, a consolidated prediction for the plurality of source entities based on the prediction for each source entity, the trained consolidation learning model having been trained using consolidation training data including the historical data and the recommendation data, wherein the consolidated prediction identifies a target entity that maximizes a joint probability of the plurality of source entities.

CROSS-REFERENCES TO RELATED APPLICATIONS

The present application claims priority to U.S. Provisional Patent Application No. 63/309,016, filed Feb. 11, 2022, which is hereby incorporated by reference in its entirety herein.

Provisional Applications (1)

	Number	Date	Country
	63309016	Feb 2022	US

ARTIFICIAL INTELLIGENCE SYSTEM FOR A JOINT GROUP PREDICTION BY INDIVIDUAL PREDICTION AND EXPLANATION-BASED RE-RANKING

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

CROSS-REFERENCES TO RELATED APPLICATIONS

Provisional Applications (1)